Speech, Music and Hearing (TMH)

Research at the Division of Speech, Music and Hearing (TMH) is truly multi-disciplinary including linguistics, phonetics, auditory perception, vision and experimental psychology. Rooted in an engineering modelling approach, our research forms a solid base for developing multimodal human-computer interaction systems in which speech, music, sound and gestures combine to create human-like communication.

Research Area

The division is part of the Department of Intelligenta System at the school of Elektroteknik och Datavetenskap .

Conversational Systems

Human Speech and Communication

Music Informatics and Auditory Perception

Speech and Language Technologies

Social Robotics

Voice Science and Technical Vocology

Latest Publications

[1]

Bokkahalli Satish, S. H., Henter, G. E., Székely, É. (2026). When Voice Matters : Evidence of Gender Disparity in Positional Bias of SpeechLLMs. I Speech and Computer - 27th International Conference, SPECOM 2025, Proceedings. (s. 25-38). Springer Nature.

[2]

Amerotti, M., Benford, S., Sturm, B. L.T., Vear, C. (2026). A Live Performance Rule System Informed by Irish Traditional Dance Music. I Music and Sound Generation in the AI Era - 16th International Symposium, CMMR 2023, Revised Selected Papers. (s. 127-139). Springer Nature.

[3]

Vaddadi, B., Axelsson, A., Skantze, G. (2026). The Role of Social Robots in Autonomous Public Transport. I Transport Transitions: Advancing Sustainable and Inclusive Mobility: Proceedings of the 10th TRA Conference, 2024, Dublin, Ireland - Volume 1: Safe and Equitable Transport. (s. 711-716). Springer Nature.

[4]

Qian, L., Figueroa, C., Skantze, G. (2025). Representation of perceived prosodic similarity of conversational feedback. I Interspeech 2025. (s. 374-378). International Speech Communication Association.

[5]

Bokkahalli Satish, S. H., Henter, G. E., Székely, É. (2025). Hear Me Out : Interactive evaluation and bias discovery platform for speech-to-speech conversational AI. I Interspeech 2025. (s. 2151-2152). International Speech Communication Association.

Fullständig lista i KTH:s publikationsportal

At TMH we regularly hold seminars from talented minds and bright researchers. You can check our calendar for the upcoming seminars or register as a speaker.

TMH Seminar Speaker Registration

Events

Perspectives on AI and Music

8 dec

Disputationer

måndag 2025-12-08, 15.00

Plats: Q2, Malvinas väg 10, Stockholm

Respondent: Laura Cros Vila , Tal, musik och hörsel, TMH

2025-12-08T15:00:00.000+01:00 2025-12-08T15:00:00.000+01:00 Perspectives on AI and Music (Disputationer) Q2, Malvinas väg 10, Stockholm (KTH, Stockholm, Sweden)Perspectives on AI and Music (Disputationer)
Evaluation of Artificial Intelligence in the Medical Domain

12 dec

Disputationer

fredag 2025-12-12, 13.00

Plats: Kollegiesalen, Brinellvägen 8, Stockholm

Videolänk: https://kth-se.zoom.us/j/69936124469

Respondent: Birger Moëll , Tal, musik och hörsel, TMH

2025-12-12T13:00:00.000+01:00 2025-12-12T13:00:00.000+01:00 Evaluation of Artificial Intelligence in the Medical Domain (Disputationer) Kollegiesalen, Brinellvägen 8, Stockholm (KTH, Stockholm, Sweden)Evaluation of Artificial Intelligence in the Medical Domain (Disputationer)

https://www.kth.se/is/tmh/calendar

News

Foto: E. WadeUnsplash

KTH bidrar till nytt uttalslexikon som förbättrar uppläsning med talsyntes
10 apr 2025

Språkbanken Tal på KTH och Myndigheten för tillgängliga medier (MTM) har utvecklat Braxen, ett lexikon som innehåller information om hur ord och namn uttalas. Braxen är fritt tillgängligt och består a...
Är det möjligt för robotar att engagera sig i meningsfulla samtal med människor?
9 apr 2025

Tre forskare vid Kungliga Tekniska Högskolan, bryter ny mark inom människa-robotinteraktion. Deras senaste studier tar sig an två svåra utmaningar: att göra AI-drivna sällskapsrobotar mer stödjande fö...
Erik Ekstedt och Gabriel Skantze från Avdelningen för tal, musik och hörsel

Hur man förutsäger en konversation
26 sep 2022

SIGIDALs best paper award gick till Erik Ekstedt och Gabriel Skantze från Tal, musik och hörsel (TMH). Deras modell kan förutsäga vad som kommer att hända under de kommande två sekunderna av ett samta...
Snabbare iteration och mer personlig röst för digitala assistenter
28 jan 2022

Shivam Mehta, doktorand på Avdelningen för tal musik pocherade hörsel, grattis till bästa poster på EECS vinterkonferens. Berätta lite om din forskning för oss.
ICMI 2021 Best Paper Award Nomination!
21 okt 2021

Utbildning

Forskning

Samverkan

Om KTH

Bibliotek

Speech, Music and Hearing (TMH)

Research Area

Latest Publications

Events

News

Kontakt