TMH Publications (latest 50)

Below are the 50 latest publications from the Department of Speech, Music and Hearing.

[1]
Saponaro, G., Jamone, L., Bernardino, A. & Salvi, G. (2020). Beyond the Self: Using Grounded Affordances to Interpret and Describe Others’ Actions. IEEE Transactions on Cognitive and Developmental Systems, 12(2), 209-221.
[2]
Ternström, S., D'Amario, S. & Selamtzis, A. (2020). Effects of the lung volume on the electroglottographic waveform in trained female singers. Journal of Voice, 34(3), 485.e1-485.e21.
[3]
Pabon, P. & Ternström, S. (2020). Feature maps of the acoustic spectrum of the voice. Journal of Voice, 34(1), 161.e1-161.e26.
[4]
Sundberg, J., Salomão, G. L. & Scherer, K. R. (2021). Analyzing Emotion Expression in Singing via Flow Glottograms, Long-Term-Average Spectra, and Expert Listener Evaluation. Journal of Voice, 35(1), 52-60.
[5]
Kontogiorgos, D., van Waveren, S., Wallberg, O., Abelho Pereira, A. T., Leite, I., Gustafson, J. (2020). Embodiment Effects in Interactions with Failing Robots. Presented at SIGCHI Conference on Human Factors in Computing Systems, CHI ’20, April 25–30, 2020, Honolulu, HI, USA. ACM Digital Library.
[6]
Abelho Pereira, A. T., Oertel, C., Fermoselle, L., Mendelson, J., Gustafson, J. (2020). Effects of Different Interaction Contexts when Evaluating Gaze Models in HRI. Presented at ACM/IEEE International Conference on Human-Robot Interaction (HRI), March 23-26, 2020, Cambridge, England. (pp. 131-138). Association for Computing Machinery (ACM).
[7]
Kontogiorgos, D., Abelho Pereira, A. T., Sahindal, B., van Waveren, S., Gustafson, J. (2020). Behavioural Responses to Robot Conversational Failures. In HRI '20: Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction. ACM Digital Library.
[8]
Engwall, O., David Lopes, J. & Åhlund, A. (2020). Robot interaction styles for conversation practice in second language learning. International Journal of Social Robotics.
[9]
Lã, F. M.B. & Ternström, S. (2020). Flow ball-assisted voice training : Immediate effects on vocal fold contacting. Biomedical Signal Processing and Control, 62.
[10]
Engwall, O. & David Lopes, J. (2020). Interaction and collaboration in robot-assisted language learning for adults. Computer Assisted Language Learning.
[11]
Nix, J., Jers, H. & Ternström, S. (2020). Acoustical, psychoacoustical, and pedagogical considerations for choral singing with covid-19 health measures. Choral Journal, 61(3), 32-40.
[12]
Ben-Tal, O., Harris, M. & Sturm, B. (2020). How music AI is useful : Engagements with composers, performers, and audiences. Leonardo Music Journal, 1-13.
[14]
Petrovska, D., Hennebert, J., Melin, H., Genoud, D. (2020). Polycost : A telephone-speech database for speaker recognition. In RLA2C 1998 - Speaker Recognition and its Commercial and Forensic Applications. (pp. 211-214). International Speech Communication Association.
[15]
Elowsson, A. (2020). Polyphonic pitch tracking with deep layered learning. Journal of the Acoustical Society of America, 148(1), 446-468.
[16]
Henter, G. E., Alexanderson, S. & Beskow, J. (2020). MoGlow: Probabilistic and controllable motion synthesis using normalising flows. ACM Transactions on Graphics, 39(4), 236:1-236:14.
[17]
Cumbal, R., David Lopes, J., Engwall, O. (2020). Detection of Listener Uncertainty in Robot-Led Second Language Conversation Practice. In Proceedings ICMI '20: International Conference on Multimodal Interaction. Association for Computing Machinery (ACM).
[18]
Dabbaghchian, S., Arnela, M., Engwall, O. & Oriol, G. (2021). Simulation of vowel-vowel utterances using a 3D biomechanical-acoustic model. International Journal for Numerical Methods in Biomedical Engineering, 37(1).
[19]
Székely, É., Edlund, J., Gustafsson, J. (2020). Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis. In Proceedings of The 12th Language Resources and Evaluation Conference. (pp. 6368-6374). European Language Resources Association.
[20]
Székely, É., Henter, G. E., Beskow, J., Gustafsson, J. (2020). Breathing and Speech Planning in Spontaneous Speech Synthesis. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (pp. 7649-7653). IEEE.
[21]
Strombergsson, S., Holm, K., Edlund, J., Lagerberg, T. & McAllister, A. (2020). Audience Response System-Based Evaluation of Intelligibility of Children's Connected Speech - Validity, Reliability and Listener Differences. Journal of Communication Disorders, 87.
[22]
Kucherenko, T., Jonell, P., van Waveren, S., Henter, G. E., Alexanderson, S., Leite, I., Kjellström, H. (2020). Gesticulator : A framework for semantically-aware speech-driven gesture generation. In ICMI '20: Proceedings of the 2020 International Conference on Multimodal Interaction. Association for Computing Machinery (ACM).
[23]
Leijon, A., Dillon, H., Hickson, L., Kinkel, M., Kramer, S. E. & Nordqvist, P. (2020). Analysis of data from the International Outcome Inventory for Hearing Aids (IOI-HA) using Bayesian Item Response Theory. International Journal of Audiology.
[24]
Patel, R. R., Sundberg, J., Gill, B. & Lã, F. M. B. (2020). Glottal Airflow and Glottal Area Waveform Characteristics of Flow Phonation in Untrained Vocally Healthy Adults. Journal of Voice.
[25]
Chettri, B., Benetos, E. & Sturm, B. (2020). Dataset Artefacts in Anti-Spoofing Systems : A Case Study on the ASVspoof 2017 Benchmark. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 3018-3028.
[26]
Sturm, B. (Ed.). (2020). Proceedings of The 2020 Joint Conference on AI Music Creativity. KTH Royal Institute of Technology.
[27]
Skantze, G. (2021). Turn-taking in Conversational Systems and Human-Robot Interaction : A Review. Computer Speech & Language, 67.
[28]
Jonell, P., Kucherenko, T., Henter, G. E., Beskow, J. (2020). Let’s face it : Probabilistic multi-modal interlocutor-aware generation of facial gestures in dyadic settings. In IVA '20: Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents. Association for Computing Machinery (ACM).
[29]
Jonell, P., Kucherenko, T., Torre, I., Beskow, J. (2020). Can we trust online crowdworkers? : Comparing online and offline participants in a preference test of virtual agents. In IVA '20: Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents. Association for Computing Machinery (ACM).
[30]
Kucherenko, T., Hasegawa, D., Kaneko, N., Henter, G. E. & Kjellström, H. (2021). Moving Fast and Slow : Analysis of Representations and Post-Processing in Speech-Driven Automatic Gesture Generation. International Journal of Human-Computer Interaction, 1-17.
[31]
Torre, I., Dogan, F. I., Kontogiorgos, D. (2021). Voice, Embodiment, and Autonomy as Identity Affordances. In HRI '21 Companion: Companion of the 2021 ACM/IEEE International Conference on Human-Robot Interaction.
[32]
Kontogiorgos, D., Sibirtseva, E., Gustafson, J. (2020). Chinese whispers : A multimodal dataset for embodied language grounding. In LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings. (pp. 743-749). European Language Resources Association (ELRA).
[33]
Fornhammar, L., Sundberg, J., Fuchs, M. & Pieper, L. (2020). Measuring Voice Effects of Vibrato-Free and Ingressive Singing : A Study of Phonation Threshold Pressures. Journal of Voice.
[34]
Domeij, R., Edlund, J., Eriksson, G., Fallgren, P., House, D., Lindström, E., Skog, S. N., Öqvist, J. (2020). Exploring the archives for textual entry points to speech - Experiences of interdisciplinary collaboration in making cultural heritage accessible for research. In CEUR Workshop Proceedings. (pp. 45-55). CEUR-WS.
[35]
Mishra, S., Benetos, E., Sturm, B., Dixon, S. (2020). Reliable Local Explanations for Machine Listening. In Proceedings of the International Joint Conference on Neural Networks. Institute of Electrical and Electronics Engineers Inc.
[36]
Ghosh, A., Honore, A., Liu, D., Henter, G. E., Chatterjee, S. (2020). Robust classification using hidden markov models and mixtures of normalizing flows. In IEEE International Workshop on Machine Learning for Signal Processing, MLSP. IEEE Computer Society.
[37]
Gillet, S., Cumbal, R., Abelho Pereira, A. T., Lopes, J., Engwall, O., Leite, I. (2021). Robot Gaze Can Mediate Participation Imbalance in Groups with Different Skill Levels. In Proceedings of the 2021 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 303-311). Association for Computing Machinery.
[38]
Alexanderson, S., Székely, É., Henter, G. E., Kucherenko, T., Beskow, J. (2020). Generating coherent spontaneous speech and gesture from text. In Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents, IVA 2020. Association for Computing Machinery, Inc.
[39]
Mishra, S., Benetos, E., Sturm, B., Dixon, S. (2020). Reliable Local Explanations for Machine Listening. In 2020 International joint conference on neural networks (IJCNN). IEEE.
[40]
Ghosh, A., Honore, A., Liu, D., Henter, G. E., Chatterjee, S. (2020). Robust classification using hidden markov models and mixtures of normalizing flows. In Proceedings of the 2020 IEEE 30th international workshop on machine learning for signal processing (MLSP). IEEE.
[41]
Nylén, H., Chatterjee, S., Ternström, S. (2021). Detecting Signal Corruptions in Voice Recordings For Speech Therapy. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (pp. 386-390). Institute of Electrical and Electronics Engineers (IEEE).
[42]
Kontogiorgos, D., Abelho Pereira, A. T. & Gustafsson, J. (2021). Grounding behaviours with conversational interfaces: effects of embodiment and failures. Journal on Multimodal User Interfaces.
[43]
Lee, M., Kontogiorgos, D., Torre, I., Luria, M., Tejwani, R., Dennis, M., Abelho Pereira, A. T. (2021). Robo-Identity: Exploring Artificial Identity and Multi-Embodiment. In ACM/IEEE International Conference on Human-Robot Interaction.
[44]
Patel, R., Ternström, S. (2020). Non-invasive evaluation of vibratory kinematics of phonation in children. In 12th International Conference on Voice Physiology and Biomechanics. (p. 28). Grenoble.
[45]
Kucherenko, T., Jonell, P., Yoon, Y., Wolfert, P., Henter, G. E. (2021). A large, crowdsourced evaluation of gesture generation systems on common data : The GENEA Challenge 2020. In Proceedings IUI '21: 26th International Conference on Intelligent User Interfaces. (pp. 11-21). Association for Computing Machinery (ACM).
[47]
Håkansson, K., Beskow, J., Kjellström, H., Gustafsson, J., Bonnard, A., Rydén, M., Stormoen, S., Hagman, G., Akenine, U., Peres, K. M., Henter, G. E., Kivipelto, M. (2020). Robot-assisted detection of subclinical dementia : progress report and preliminary findings. In 2020 Alzheimer's Association International Conference. ALZ...
[48]
Alexanderson, S., Henter, G. E. (2020). Robust model training and generalisation with Studentising flows. In Proceedings of the ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models. (pp. 25:1-25:9).
[49]
Strombergsson, S., Edlund, J., McAllister, A. & Lagerberg, T. (2021). Understanding acceptability of disordered speech through Audience Response Systems-based evaluation. Speech Communication, 131, 13-22.
[50]
Patel, R. R. & Ternström, S. (2021). Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. Journal of Speech, Language and Hearing Research, 1-19.

Full list in the KTH publications portal
Page responsible: Web editors at EECS
Belongs to: Speech, Music and Hearing
Last changed: Oct 17, 2018