To KTH's start page

Institution of Speech, Music and Hearing (TMH)

Research at the Institution of Speech, Music and Hearing (TMH) is truly multi-disciplinary including linguistics, phonetics, auditory perception, vision and experimental psychology. Rooted in an engineering modelling approach, our research forms a solid base for developing multimodal human-computer interaction systems in which speech, music, sound and gestures combine to create human-like communication.

Research Area

Latest Publications

[1]
Rugayan, J., Salvi, G., Svendsen, T. (2026). Optimizing ASR Models with Semantic Information. In TEXT, SPEECH, AND DIALOGUE, TSD 2025, PT I. (pp. 25-35). Springer Nature.
[2]
Axelsson, A., Vaddadi, B., Bogdan, C. M., Tobin, D. & Skantze, G. (2026). Robots as Hosts in Autonomous Buses : A Field Trial. ACM Transactions on Human-Robot Interaction, 15(1).
[3]
Pandey, A., Edlund, J., Le Maguer, S. & Harte, N. (2026). The use of variable length stimuli for assessing segmental distortion in TTS evaluation. Computer speech & language (Print), 97.
[4]
Bokkahalli Satish, S. H., Henter, G. E., Székely, É. (2026). When Voice Matters : Evidence of Gender Disparity in Positional Bias of SpeechLLMs. In Speech and Computer - 27th International Conference, SPECOM 2025, Proceedings. (pp. 25-38). Springer Nature.
[5]
Amerotti, M., Benford, S., Sturm, B. L.T., Vear, C. (2026). A Live Performance Rule System Informed by Irish Traditional Dance Music. In Music and Sound Generation in the AI Era - 16th International Symposium, CMMR 2023, Revised Selected Papers. (pp. 127-139). Springer Nature.
Full list in the KTH publications portal

News