Speech, Music and Hearing (TMH)

Research at the Division of Speech, Music and Hearing (TMH) is truly multi-disciplinary including linguistics, phonetics, auditory perception, vision and experimental psychology. Rooted in an engineering modelling approach, our research forms a solid base for developing multimodal human-computer interaction systems in which speech, music, sound and gestures combine to create human-like communication.

Research Area

Latest Publications

[1]
Best, P., Araya-Salas, M., Ekström, A. G., Freitas, B., Jensen, F. H., Kershenbaum, A. ... Marxer, R. (2025). Bioacoustic fundamental frequency estimation : a cross-species dataset and deep learning baseline. Bioacoustics, 34(4), 419-446.
[2]
Cros Vila, L., Sturm, B., Casini, L. & Dalmazzo, D. (2025). The AI Music Arms Race : On the Detection of AI-Generated Music. Transactions of the International Society for Music Information Retrieval, 8(1), 179-194.
[3]
Torubarova, E. (2025). Brain-Focused Multimodal Approach for Studying Conversational Engagement in HRI. In HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 1894-1896). Institute of Electrical and Electronics Engineers (IEEE).
[4]
Torubarova, E., Arvidsson, C., Berrebi, J., Uddén, J., Abelho Pereira, A. T. (2025). NeuroEngage: A Multimodal Dataset Integrating fMRI for Analyzing Conversational Engagement in Human-Human and Human-Robot Interactions. In HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 849-858). Institute of Electrical and Electronics Engineers (IEEE).
[5]
Tuttösí, P., Mehta, S., Syvenky, Z., Burkanova, B., Hfsafsti, M., Wang, Y., Yeung, H. H., Henter, G. E., Aucouturier, J. J., Lim, A. (2025). Take a Look, it's in a Book, a Reading Robot. In HRI 2025 - Proceedings of the 2025 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 1803-1805). Institute of Electrical and Electronics Engineers (IEEE).
Full list in the KTH publications portal

News