Speech, Music and Hearing (TMH)

Research at the Division of Speech, Music and Hearing (TMH) is multidisciplinary, spanning linguistics, phonetics, auditory perception, vision and experimental psychology. Rooted in an engineering modelling approach, it forms a solid base for developing multimodal human-computer interaction systems in which speech, music, sound and gestures combine to create human-like communication.

The division is part of the Department of Intelligent Systems at the School of Electrical Engineering and Computer Science.

Research Area

Conversational Systems

Human Speech and Communication

Music Informatics and Auditory Perception

Speech and Language Technologies

Social Robotics

Voice Science and Technical Vocology

Latest Publications

[1]

Best, P., Araya-Salas, M., Ekström, A. G., Freitas, B., Jensen, F. H., Kershenbaum, A. ... Marxer, R. (2025). Bioacoustic fundamental frequency estimation: a cross-species dataset and deep learning baseline. Bioacoustics, 34(4), 419-446.

[2]

Cros Vila, L., Sturm, B., Casini, L. & Dalmazzo, D. (2025). The AI Music Arms Race: On the Detection of AI-Generated Music. Transactions of the International Society for Music Information Retrieval, 8(1), 179-194.

[3]

Torubarova, E. (2025). Brain-Focused Multimodal Approach for Studying Conversational Engagement in HRI. In HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 1894-1896). Institute of Electrical and Electronics Engineers (IEEE).

[4]

Torubarova, E., Arvidsson, C., Berrebi, J., Uddén, J. & Abelho Pereira, A. T. (2025). NeuroEngage: A Multimodal Dataset Integrating fMRI for Analyzing Conversational Engagement in Human-Human and Human-Robot Interactions. In HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 849-858). Institute of Electrical and Electronics Engineers (IEEE).

[5]

Tuttösí, P., Mehta, S., Syvenky, Z., Burkanova, B., Hfsafsti, M., Wang, Y., Yeung, H. H., Henter, G. E., Aucouturier, J. J. & Lim, A. (2025). Take a Look, it's in a Book, a Reading Robot. In HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 1803-1805). Institute of Electrical and Electronics Engineers (IEEE).

Full list in the KTH publications portal

At TMH we regularly host research seminars. Check our calendar for upcoming seminars, or register as a speaker.

TMH Seminar Speaker Registration

Events

No up-to-date calendar events right now.

https://www.kth.se/is/tmh/calendar

News

How to predict a conversation
26 Sep 2022

The SIGDIAL best paper award went to Erik Ekstedt and Gabriel Skantze from Speech, Music and Hearing (TMH). Their model learns to predict what will happen in the next two seconds of the conversation. ...

Research on generating a faster iteration and a more personal voice for digital assistants
24 Jan 2022

Congratulations to Shivam Mehta, doctoral student at the Division of Speech, Music and Hearing, on winning the poster exhibition at the EECS Winter Conference.

ICMI 2021 Best Paper Award Nomination!
21 Oct 2021

TMH gets Jury award at IVA Gala 2021
7 Oct 2021

TMH gets Honourable Mention at IVA 2021!
7 Oct 2021