TMH Publications (latest 50)

TMH Publications

[1]
Selamtzis, A., Ternström, S., Richter, B., Burk, F., Köberlein, M., Echternach, M. (2018). A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. (Manuscript).
[2]
Hultén, M., Artman, H. & House, D. (2018). A model to analyse students’ cooperative ideageneration in conceptual design. International journal of technology and design education, 28(2), 451-470.
[3]
Kontogiorgos, D., Avramova, V., Alexanderson, S., Jonell, P., Oertel, C., Beskow, J., Skantze, G., Gustafson, J. (2018). A Multimodal Corpus for Mutual Gaze and Joint Attention in Multiparty Situated Interaction. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). (pp. 119-127). Paris.
[4]
Fallgren, P., Malisz, Z., Edlund, J. (2018). A tool for exploring large amounts of found audio data. In CEUR Workshop Proceedings. (pp. 499-503). CEUR-WS.
[5]
Jonell, P., Oertel, C., Kontogiorgos, D., Beskow, J., Gustafson, J. (2018). Crowdsourced Multimodal Corpora Collection Tool. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). (pp. 728-734). Paris.
[6]
Selamtzis, A., Castellana, A., Salvi, G., Carullo, A., Astolfi, A. (2018). Effect of vowel context in cepstral and entropy analysis of pathological voices. (Manuscript).
[7]
Jonell, P., Mattias, B., Per, F., Kontogiorgos, D., David Aguas Lopes, J., Malisz, Z., Samuel, M., Oertel, C., Eran, R., Shore, T. (2018). FARMI: A Framework for Recording Multi-Modal Interactions. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). (pp. 3969-3974). Paris: European Language Resources Association.
[8]
[9]
Ternström, S., Johansson, D. & Selamtzis, A. (2018). FonaDyn - A system for real-time analysis of the electroglottogram, over the voice range. Software Quality Professional, 7, 74-80.
[10]
Shore, T., Androulakaki, T., Skantze, G. (2018). KTH Tangrams: A Dataset for Research on Alignment and Conceptual Pacts in Task-Oriented Dialogue. Presented at Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Tokyo.
[11]
Körner Gustafsson, J., Södersten, M., Ternström, S. & Schalling, E. (2018). Long-term effects of Lee Silverman Voice Treatment on daily voice use in Parkinson’s disease as measured with a portable voice accumulator. Logopedics, Phoniatrics, Vocology, 1-10.
[12]
Elowsson, A. (2018). Modeling Music : Studies of Music Transcription, Music Perception and Music Production (Doctoral thesis , KTH Royal Institute of Technology, Stockholm, TRITA-EECS-AVL 2018-35). Retrieved from http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-226894.
[14]
Wistbacka, G., Andrade, P. A., Simberg, S., Hammarberg, B., Sodersten, M., Svec, J. G. & Granqvist, S. (2018). Resonance Tube Phonation in Water-the Effect of Tube Diameter and Water Depth on Back Pressure and Bubble Characteristics at Different Airflows. Journal of Voice, 32(1).
[15]
Szabo Portela, A., Granqvist, S., Ternström, S. & Södersten, M. (2018). Vocal Behavior in Environmental Noise : Comparisons Between Work and Leisure Conditions in Women With Work-related Voice Disorders and Matched Controls. Journal of Voice, 32(1), 126.e23-126.e38.
[16]
Lopes, J., Engwall, O., Skantze, G. (2017). A First Visit to the Robot Language Café. In Proceedings of the ISCA workshop on Speech and Language Technology in Education. Stockholm.
[17]
Johansson, R., Skantze, G., Jönsson, A. (2017). A psychotherapy training environment with virtual patients implemented using the furhat robot platform. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 184-187). Springer.
[18]
Stefanov, K., Beskow, J. (2017). A Real-time Gesture Recognition System for Isolated Swedish Sign Language Signs. In Proceedings of the 4th European and 7th Nordic Symposium on Multimodal Communication (MMSYM 2016). Linköping University Electronic Press.
[19]
Arnela, M., Dabbaghchian, S., Guasch, O., Engwall, O. (2017). A semi-polar grid strategy for the three-dimensional finite element simulation of vowel-vowel sequences. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 3477-3481). The International Speech Communication Association (ISCA).
[20]
Degirmenci, N. C., Jansson, J., Hoffman, J., Arnela, M., Sánchez-Martín, P., Guasch, O., Ternström, S. (2017). A Unified Numerical Simulation of Vowel Production That Comprises Phonation and the Emitted Sound. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 3492-3496). The International Speech Communication Association (ISCA).
[21]
Avramova, V., Yang, F., Li, C., Peters, C., Skantze, G. (2017). A virtual poster presenter using mixed reality. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 25-28). Springer.
[22]
Strömbergsson, S., Edlund, J., Götze, J., Björkenstam, K. N. (2017). Approximating phonotactic input in children's linguistic environments from orthographic transcripts. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 2213-2217). International Speech Communication Association.
[23]
Mendelson, J., Aylett, M. (2017). Beyond the listening test : An interactive approach to TTS Evaluation. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (pp. 249-253). International Speech Communication Association.
[24]
Castellana, A., Selamtzis, A., Salvi, G., Carullo, A., Astolfi, A. (2017). Cepstral and entropy analyses in vowels excerpted from continuous speech of dysphonic and control speakers. In Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech 2017. (pp. 1814-1818). International Speech Communication Association.
[25]
[26]
Friberg, A. (2017). Commentary on Polak How short is the shortest metric subdivision?. Empirical Musicology Review, 12(3-4), 227-228.
[27]
Karipidou, K., Ahnlund, J., Friberg, A., Alexanderson, S., Kjellström, H. (2017). Computer Analysis of Sentiment Interpretation in Musical Conducting. In Proceedings - 12th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2017. (pp. 400-405). IEEE.
[28]
Malisz, Z., Berthelsen, H., Beskow, J., Gustafson, J. (2017). Controlling prominence realisation in parametric DNN-based speech synthesis. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 1079-1083). International Speech Communication Association.
[30]
Friberg, A., Choi, K., Schön, R., Downie, J. S., Elowsson, A. (2017). Cross-cultural aspects of perceptual features in K-pop : A pilot study comparing Chinese and Swedish listeners. In 2017 ICMC/EMW - 43rd International Computer Music Conference and the 6th International Electronic Music Week. (pp. 291-296). Shanghai Conservatory of Music.
[31]
Jonell, P., Oertel, C., Kontogiorgos, D., Beskow, J., Gustafson, J. (2017). Crowd-powered design of virtual attentive listeners. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 188-191). Springer.
[32]
Oertel, C., Jonell, P., Kontogiorgos, D., Mendelson, J., Beskow, J., Gustafson, J. (2017). Crowdsourced design of artificial attentive listeners. Presented at INTERSPEECH: Situated Interaction, Augusti 20-24 Augusti, 2017.
[33]
Herrera, M., Schierbeck, G., Hansen, K. F. (2017). Disadvantages of using non-linear video in shallow learning situations – a critical perspective on current trends. In KTH Scholarship of Teaching and Learning. KTH ECE.
[34]
Han, Q. & Sundberg, J. (2017). Duration, Pitch, and Loudness in Kunqu Opera Stage Speech. Journal of Voice, 31(2).
[35]
Havel, M., Becker, S., Schuster, M., Johnson, T., Maier, A. & Sundberg, J. (2017). Effects of functional endoscopic sinus surgery on the acoustics of the sinonasal tract. Rhinology, 55(1), 81-89.
[36]
Shore, T., Skantze, G. (2017). Enhancing reference resolution in dialogue using participant feedback. In Proc. GLU 2017 International Workshop on Grounding Language Understanding. (pp. 78-82). Stockholm, Sweden: International Speech Communication Association.
[37]
Pabon, P., Howard, D. M., Ternström, S., Kob, M. & Eckel, G. (2017). Future Perspectives. In Welch, Graham; Howard, David M.; Nix, John (Ed.), Oxford Handbook of Singing. Oxford University Press.
[38]
Schenkman, B., Gidla, V. K. (2017). Human echolocation in different situations and rooms : Threshold values Architectural Acoustics: Paper 1aAAa1. In Proceedings of Meetings on Acoustics. Acoustical Society of America.
[40]
Saponaro, G., Jamone, L., Bernardino, A., Salvi, G. (2017). Interactive Robot Learning of Gestures, Language and Affordances. In Grounding Language Understanding..
[41]
Arlinger, S., Nordqvist, P. & Öberg, M. (2017). International Outcome Inventory for Hearing Aids : Data From a Large Swedish Quality Register Database. American Journal of Audiology, 26(3), 443-450.
[43]
Echternach, M., Burk, F., Koeberlein, M., Selamtzis, A., Doellinger, M., Burdumy, M. ... Herbst, C. T. (2017). Laryngeal evidence for the first and second passaggio in professionally trained sopranos. PLoS ONE, 12(5).
[44]
Elowsson, A., Friberg, A. (2017). Long-term Average Spectrum in Popular Music and its Relation to the Level of the Percussion. In AES 142nd Convention, Berlin, Germany..
[45]
Zhang, Y., Beskow, J., Kjellström, H. (2017). Look but Don’t Stare : Mutual Gaze Interaction in Social Robots. In 9th International Conference on Social Robotics, ICSR 2017. (pp. 556-566). Springer.
[46]
Alexanderson, S., O'Sullivan, C., Neff, M. & Beskow, J. (2017). Mimebot—Investigating the Expressibility of Non-Verbal Communication Across Agent Embodiments. ACM Transactions on Applied Perception, 14(4).
[47]
Khan, M. S. L., ur Réhman, S., Mi, Y., Naeem, U., Beskow, J., Li, H. (2017). Moveable facial features in a social mediator. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 205-208). Springer.
[48]
Kontogiorgos, D. (2017). Multimodal language grounding for improved human-robot collaboration : Exploring Spatial Semantic Representations in the Shared Space of Attention. In ICMI 2017 - Proceedings of the 19th ACM International Conference on Multimodal Interaction. (pp. 660-664). ACM Digital Library.
[50]
Mendelson, J., Oplustil, P., Watts, O., King, S. (2017). Nativization of foreign names in TTS for automatic reading of world news in Swahili. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 2188-2192). International Speech Communication Association.
Full list in the KTH publications portal
Top page top