Till innehåll på sidan

TMH Publications (latest 50)

Below are the 50 latest publications from the Department of Speech, Music and Hearing.

TMH Publications

[1]
Saponaro, G., Jamone, L., Bernardino, A. & Salvi, G. (2020). Beyond the Self: Using Grounded Affordances to Interpret and Describe Others’ Actions. IEEE Transactions on Cognitive and Developmental Systems, 12(2), 209-221.
[2]
Sundberg, J., Salomão, G. L. & Scherer, K. R. (2021). Analyzing Emotion Expression in Singing via Flow Glottograms, Long-Term-Average Spectra, and Expert Listener Evaluation. Journal of Voice, 35(1), 52-60.
[3]
Kontogiorgos, D., Abelho Pereira, A. T., Sahindal, B., van Waveren, S., Gustafson, J. (2020). Behavioural Responses to Robot Conversational Failures. I HRI '20: Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot Interaction. ACM Digital Library.
[4]
Nix, J., Jers, H. & Ternström, S. (2020). Acoustical, psychoacoustical, and pedagogical considerations for choral singing with covid-19 health measures. Choral Journal, 61(3), 32-40.
[5]
Ben-Tal, O., Harris, M. & Sturm, B. (2021). How music AI is useful : Engagements with composers, performers, and audiences. Leonardo music journal, 54(5), 510-516.
[6]
Saldías, M., Laukkanen, A. -., Guzmán, M., Miranda, G., Stoney, J., Alku, P. & Sundberg, J. (2021). The Vocal Tract in Loud Twang-Like Singing While Producing High and Low Pitches. Journal of Voice, 35(5), 807.e1-807.e23.
[7]
Dabbaghchian, S., Arnela, M., Engwall, O. & Oriol, G. (2021). Simulation of vowel-vowel utterances using a 3D biomechanical-acoustic model. International Journal for Numerical Methods in Biomedical Engineering, 37(1).
[8]
Székely, É., Edlund, J., Gustafsson, J. (2020). Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis. I Proceedings of The 12th Language Resources and Evaluation Conference. (s. 6368-6374). European Language Resources Association.
[9]
Székely, É., Henter, G. E., Beskow, J., Gustafsson, J. (2020). Breathing and Speech Planning in Spontaneous Speech Synthesis. I 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (s. 7649-7653). IEEE.
[10]
Strombergsson, S., Holm, K., Edlund, J., Lagerberg, T. & McAllister, A. (2020). Audience Response System-Based Evaluation of Intelligibility of Children's Connected Speech - Validity, Reliability and Listener Differences. Journal of Communication Disorders, 87.
[11]
Leijon, A., Dillon, H., Hickson, L., Kinkel, M., Kramer, S. E. & Nordqvist, P. (2020). Analysis of data from the International Outcome Inventory for Hearing Aids (IOI-HA) using Bayesian Item Response Theory. International Journal of Audiology.
[12]
Chettri, B., Benetos, E. & Sturm, B. (2020). Dataset Artefacts in Anti-Spoofing Systems : A Case Study on the ASVspoof 2017 Benchmark. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 28, 3018-3028.
[13]
Skantze, G. (2021). Turn-taking in Conversational Systems and Human-Robot Interaction : A Review. Computer speech & language (Print), 67.
[14]
Jonell, P., Kucherenko, T., Torre, I., Beskow, J. (2020). Can we trust online crowdworkers? : Comparing online and offline participants in a preference test of virtual agents. I IVA '20: Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents. Association for Computing Machinery (ACM).
[15]
Torre, I., Dogan, F. I., Kontogiorgos, D. (2021). Voice, Embodiment, and Autonomy as Identity Affordances. I HRI '21 Companion: Companion of the 2021 ACM/IEEE International Conference on Human-Robot Interaction..
[16]
Kontogiorgos, D., Sibirtseva, E., Gustafson, J. (2020). Chinese whispers : A multimodal dataset for embodied language grounding. I LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings. (s. 743-749). European Language Resources Association (ELRA).
[17]
Gillet, S., Cumbal, R., Abelho Pereira, A. T., Lopes, J., Engwall, O., Leite, I. (2021). Robot Gaze Can Mediate Participation Imbalance in Groups with Different Skill Levels. I Proceedings of the 2021 ACM/IEEE International Conference on Human-Robot Interaction. (s. 303-311). Association for Computing Machinery.
[18]
Nylén, H., Chatterjee, S., Ternström, S. (2021). Detecting Signal Corruptions in Voice Recordings For Speech Therapy. I ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (s. 386-390). Institute of Electrical and Electronics Engineers (IEEE).
[19]
Kontogiorgos, D., Abelho Pereira, A. T. & Gustafsson, J. (2021). Grounding behaviours with conversational interfaces: effects of embodiment and failures. Journal on Multimodal User Interfaces.
[20]
Lee, M., Kontogiorgos, D., Torre, I., Luria, M., Tejwani, R., Dennis, M., Abelho Pereira, A. T. (2021). Robo-Identity: Exploring Artificial Identity and Multi-Embodiment. I ACM/IEEE International Conference on Human-Robot Interaction..
[21]
Kucherenko, T., Jonell, P., Yoon, Y., Wolfert, P., Henter, G. E. (2021). A large, crowdsourced evaluation of gesture generation systems on common data : The GENEA Challenge 2020. I Proceedings IUI '21: 26th International Conference on Intelligent User Interfaces. (s. 11-21). Association for Computing Machinery (ACM).
[23]
Strombergsson, S., Edlund, J., McAllister, A. & Lagerberg, T. (2021). Understanding acceptability of disordered speech through Audience Response Systems-based evaluation. Speech Communication, 131, 13-22.
[24]
Patel, R. R. & Ternström, S. (2021). Quantitative and Qualitative Electroglottographic Wave Shape Differences in Children and Adults Using Voice Map-Based Analysis. Journal of Speech, Language and Hearing Research, 64(8), 2977-2995.
[26]
Oertel, C., Jonell, P., Kontogiorgos, D., Mora, K. F., Odobez, J.-M. & Gustafsson, J. (2021). Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions. Frontiers in Robotics and AI, 8.
[28]
Kontogiorgos, D., Tran, M., Gustafson, J., Soleymani, M. (2021). A Systematic Cross-Corpus Analysis of Human Reactions to Robot Conversational Failures. Presenterad vid International Conference on Multimodal Interaction (ICMI).
[29]
Oertel, C., Jonell, P., Kontogiorgos, D., Mora, K. F., Odobez, J.-M. & Gustafson, J. (2021). Towards an Engagement-Aware Attentive Artificial Listener for Multi-Party Interactions. Frontiers in Robotics and AI.
[30]
Kammerlander, R. K., Abelho Pereira, A. T., Alexanderson, S. (2021). Using Virtual Reality to Support Acting in Motion Capture with Differently Scaled Characters. I 2021 IEEE VIRTUAL REALITY AND 3D USER INTERFACES (VR). (s. 402-410). Institute of Electrical and Electronics Engineers (IEEE).
[31]
Huang, R., Sturm, B. L.T., Holzapfel, A. (2021). De-centering the west : East asian philosophies and the ethics of applying artificial intelligence to music. Presenterad vid International Society for Music Information Retrieval Conference, ISMIR.
[32]
Kalpakchi, D., Boye, J. (2021). BERT-based distractor generation for Swedish reading comprehension questions using a small-scale dataset. I Proceedings of the 14th International Conference on Natural Language Generation. (s. 387-403).
[33]
Kucherenko, T., Nagy, R., Jonell, P., Neff, M., Kjellström, H., Henter, G. E. (2021). Speech2Properties2Gestures : Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech. I IVA '21: Proceedings of the 21st ACM International Conference on Intelligent Virtual Agents. (s. 145-147). Association for Computing Machinery (ACM).
[34]
Wennberg, U., Henter, G. E. (2021). The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models. I ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2. (s. 130-140). ASSOC COMPUTATIONAL LINGUISTICS-ACL.
[35]
Jonell, P., Moell, B., Håkansson, K., Henter, G. E., Kucherenko, T., Mikheeva, O. ... Beskow, J. (2021). Multimodal Capture of Patient Behaviour for Improved Detection of Early Dementia : Clinical Feasibility and Preliminary Results. Frontiers in Computer Science, 3.
[36]
Tånnander, C., Edlund, J. (2021). Stress manipulation in text-to-speech synthesis using speaking rate categories. I Proceedings of Fonetik 2021, Centre for Languages and Literature, Lund University. (s. 17-22). Lund.
[37]
Tånnander, C., Edlund, J. (2021). Methods of slowing down speech. I Proceedings. 11th ISCA Speech Synthesis Workshop (SSW 11). (s. 43-47).
[38]
Tånnander, C., Edlund, J. (2021). Self-perceived preferences of voice and speaking style characteristics in spoken text. Presenterad vid Swedish Language Technology Conference (SLTC) 2021.
[39]
Nagy, R., Kucherenko, T., Moell, B., Abelho Pereira, A. T., Kjellström, H., Bernardet, U. (2021). A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents. Presenterad vid 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)..
[40]
Sturm, B. & Maruri-Aguilar, H. (2022). The Ai Music Generation Challenge 2020 : Double Jigs in the Style of O’Neill’s “1001”. Journal of Creative Music Systems.
[41]
Dogruoz, A. S., Skantze, G. (2021). How "open" are the conversations with open-domain chatbots? : A proposal for Speech Event based evaluation. I SIGDIAL 2021: 22Nd Annual Meeting Of The Special Interest Group On Discourse And Dialogue (Sigdial 2021). (s. 392-402). ASSOC COMPUTATIONAL LINGUISTICS.
[42]
Ekstedt, E., Skantze, G. (2021). Projection of Turn Completion in Incremental Spoken Dialogue Systems. I SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021). (s. 431-437). ASSOC COMPUTATIONAL LINGUISTICS.
[43]
Strömbergsson, S., Götze, J., Edlund, J. & Nilsson Björkenstam, K. (2021). Simulating Speech Error Patterns Across Languages and Different Datasets. Language and Speech, 1-38.
[44]
Valle-Perez, G., Henter, G. E., Beskow, J., Holzapfel, A., Oudeyer, P.-Y. & Alexanderson, S. (2021). Transflower : probabilistic autoregressive dance generation with multimodal attention. ACM Transactions on Graphics, 40(6).
[45]
Axelsson, A. & Skantze, G. (2022). Multimodal User Feedback During Adaptive Robot-Human Presentations. Frontiers in Computer Science, 3.
[46]
Gustafsson, J. K., Södersten, M., Ternström, S. & Schalling, E. (2021). Treatment of Hypophonia in Parkinson’s Disease Through Biofeedback in Daily Life Administered with A Portable Voice Accumulator. Journal of Voice.
[47]
Engwall, O., Águas Lopes, J. D. & Cumbal, R. (2022). Is a Wizard-of-Oz Required for Robot-Led Conversation Practice in a Second Language?. International Journal of Social Robotics.
[48]
Sturm, B. & Maruri-Aguilar, H. (2021). The Ai Music Generation Challenge 2020: Double Jigs in the Style of O'Neill's ``1001''. Journal of Creative Music Systems, 5(1).
[49]
Weldon, C. F., Gillet, S., Cumbal, R., Leite, I. (2021). Exploring non-verbal gaze behavior in groups mediated by an adaptive robot. I ACM/IEEE International Conference on Human-Robot Interaction. (s. 357-361). IEEE Computer Society.
[50]
Skantze, G. (2021). Conversational interaction with social robots. I ACM/IEEE International Conference on Human-Robot Interaction. IEEE Computer Society.
Fullständig lista i KTH:s publikationsportal