Till KTH:s startsida Till KTH:s startsida

Speech, Music and Hearing (TMH)

Research at the Division of Speech, Music and Hearing (TMH) is truly multi-disciplinary including linguistics, phonetics, auditory perception, vision and experimental psychology. Rooted in an engineering modelling approach, our research forms a solid base for developing multimodal human-computer interaction systems in which speech, music, sound and gestures combine to create human-like communication.

Research Area

Latest Publications

[1]
Amerotti, M., Sturm, B., Benford, S., Maruri-Aguilar, H., Vear, C. (2024). Evaluation of an Interactive Music Performance System in the Context of Irish Traditional Dance Music. I Proceedings New Interfaces for Musical Expression NIME’24..
[2]
Jonason, N., Wang, X., Cooper, E., Juvela, L., Sturm, B., Yamagishi, J. (2024). DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input. I Proceedings of the 27th International Conference on Digital Audio Effects (DAFx24)..
[3]
Tånnander, C., O'Regan, J., House, D., Edlund, J., Beskow, J. (2024). Prosodic characteristics of English-accented Swedish neural TTS. I Proceedings of Speech Prosody 2024. (s. 1035-1039). Leiden, The Netherlands: International Speech Communication Association.
[4]
Misra, S., Boye, J. (2024). Nested Noun Phrase Identification using BERT. I 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings. (s. 12138-12143). European Language Resources Association (ELRA).
[5]
Malisz, Z., Foremski, J., Kul, M. (2024). PRODIS - a speech database and a phoneme-based language model for the study of predictability effects in Polish. I 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings. (s. 13068-13073). European Language Resources Association (ELRA).
Fullständig lista i KTH:s publikationsportal

News