
Speech, Music and Hearing (TMH)

Research at the Division of Speech, Music and Hearing (TMH) is multidisciplinary, spanning linguistics, phonetics, auditory perception, vision and experimental psychology. Rooted in an engineering modelling approach, our research forms a solid base for developing multimodal human-computer interaction systems in which speech, music, sound and gestures combine to create human-like communication.

Research Area

Latest Publications

[1]
Kejriwal, J., Mishra, C., Skantze, G., Offrede, T. & Beňuš, Š. (2024). Does a robot's gaze behavior affect entrainment in HRI? Computing and Informatics, 43(5), 1256-1284.
[3]
Green, O., Sturm, B., Born, G., Wald-Fuhrmann, M. (2024). A Critical Survey of Research in Music Genre Recognition. In Proc. International Society for Music Information Retrieval Conference. ISMIR.
[4]
Sturm, B., Déguernel, K., Huang, R. S., Kaila, A.-K., Jääskeläinen, P., Kanhov, E., Cros Vila, L., Dalmazzo, D., Casini, L., Bown, O., Collins, N., Drott, E., Sterne, J., Holzapfel, A., Ben-Tal, O. (2024). AI Music Studies: Preparing for the Coming Flood. In Proceedings of AI Music Creativity.
[5]
Thomé, C., Sturm, B., Pertoft, J., Jonason, N. (2024). Applying textual inversion to control and personalize text-to-music models. In Proc. 15th Int. Workshop on Machine Learning and Music.
Full list in the KTH publications portal

News