TMH Publications (latest 50)

[1]
Selamtzis, A., Ternström, S., Richter, B., Burk, F., Köberlein, M., Echternach, M. (2018). A comparison of electroglottographic and glottal area waveforms for phonation type differentiation in male professional singers. (Manuscript).
[2]
Selamtzis, A., Castellana, A., Salvi, G., Carullo, A., Astolfi, A. (2018). Effect of vowel context in cepstral and entropy analysis of pathological voices. (Manuscript).
[3]
[4]
Ternström, S., Johansson, D. & Selamtzis, A. (2018). FonaDyn - A system for real-time analysis of the electroglottogram, over the voice range. SoftwareX, 7, 74-80.
[5]
Körner Gustafsson, J., Södersten, M., Ternström, S. & Schalling, E. (2018). Long-term effects of Lee Silverman Voice Treatment on daily voice use in Parkinson’s disease as measured with a portable voice accumulator. Logopedics, Phoniatrics, Vocology, 1-10.
[6]
Wistbacka, G., Andrade, P. A., Simberg, S., Hammarberg, B., Södersten, M., Švec, J. G. & Granqvist, S. (2018). Resonance Tube Phonation in Water - the Effect of Tube Diameter and Water Depth on Back Pressure and Bubble Characteristics at Different Airflows. Journal of Voice, 32(1).
[7]
Szabo Portela, A., Granqvist, S., Ternström, S. & Södersten, M. (2018). Vocal Behavior in Environmental Noise : Comparisons Between Work and Leisure Conditions in Women With Work-related Voice Disorders and Matched Controls. Journal of Voice, 32(1), 126.e23-126.e38.
[8]
Lopes, J., Engwall, O., Skantze, G. (2017). A First Visit to the Robot Language Café. In Proceedings of the ISCA workshop on Speech and Language Technology in Education. Stockholm.
[9]
Johansson, R., Skantze, G., Jönsson, A. (2017). A psychotherapy training environment with virtual patients implemented using the furhat robot platform. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 184-187). Springer.
[10]
Stefanov, K., Beskow, J. (2017). A Real-time Gesture Recognition System for Isolated Swedish Sign Language Signs. In Proceedings of the 4th European and 7th Nordic Symposium on Multimodal Communication (MMSYM 2016). Linköping University Electronic Press.
[11]
Arnela, M., Dabbaghchian, S., Guasch, O., Engwall, O. (2017). A semi-polar grid strategy for the three-dimensional finite element simulation of vowel-vowel sequences. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 3477-3481). The International Speech Communication Association (ISCA).
[12]
Degirmenci, N. C., Jansson, J., Hoffman, J., Arnela, M., Sánchez-Martín, P., Guasch, O., Ternström, S. (2017). A Unified Numerical Simulation of Vowel Production That Comprises Phonation and the Emitted Sound. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 3492-3496). The International Speech Communication Association (ISCA).
[13]
Avramova, V., Yang, F., Li, C., Peters, C., Skantze, G. (2017). A virtual poster presenter using mixed reality. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 25-28). Springer.
[14]
Strömbergsson, S., Edlund, J., Götze, J., Björkenstam, K. N. (2017). Approximating phonotactic input in children's linguistic environments from orthographic transcripts. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 2213-2217). International Speech Communication Association.
[15]
Mendelson, J., Aylett, M. (2017). Beyond the listening test : An interactive approach to TTS Evaluation. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. (pp. 249-253). International Speech Communication Association.
[16]
Castellana, A., Selamtzis, A., Salvi, G., Carullo, A., Astolfi, A. (2017). Cepstral and entropy analyses in vowels excerpted from continuous speech of dysphonic and control speakers. In Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech 2017. (pp. 1814-1818). International Speech Communication Association.
[17]
[18]
Karipidou, K., Ahnlund, J., Friberg, A., Alexanderson, S., Kjellström, H. (2017). Computer Analysis of Sentiment Interpretation in Musical Conducting. In Proceedings - 12th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2017. (pp. 400-405). IEEE.
[19]
Malisz, Z., Berthelsen, H., Beskow, J., Gustafson, J. (2017). Controlling prominence realisation in parametric DNN-based speech synthesis. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 1079-1083). International Speech Communication Association.
[21]
Friberg, A., Choi, K., Schön, R., Downie, J. S., Elowsson, A. (2017). Cross-cultural aspects of perceptual features in K-pop : A pilot study comparing Chinese and Swedish listeners. In 2017 ICMC/EMW - 43rd International Computer Music Conference and the 6th International Electronic Music Week. (pp. 291-296). Shanghai Conservatory of Music.
[22]
Jonell, P., Oertel, C., Kontogiorgos, D., Beskow, J., Gustafson, J. (2017). Crowd-powered design of virtual attentive listeners. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 188-191). Springer.
[23]
Oertel, C., Jonell, P., Kontogiorgos, D., Mendelson, J., Beskow, J., Gustafson, J. (2017). Crowdsourced design of artificial attentive listeners. Presented at INTERSPEECH: Situated Interaction, August 20-24, 2017.
[24]
Herrera, M., Schierbeck, G., Hansen, K. F. (2017). Disadvantages of using non-linear video in shallow learning situations – a critical perspective on current trends. In KTH Scholarship of Teaching and Learning. KTH ECE.
[25]
Han, Q. & Sundberg, J. (2017). Duration, Pitch, and Loudness in Kunqu Opera Stage Speech. Journal of Voice, 31(2).
[26]
Havel, M., Becker, S., Schuster, M., Johnson, T., Maier, A. & Sundberg, J. (2017). Effects of functional endoscopic sinus surgery on the acoustics of the sinonasal tract. Rhinology, 55(1), 81-89.
[27]
Shore, T., Skantze, G. (2017). Enhancing reference resolution in dialogue using participant feedback. In Proc. GLU 2017 International Workshop on Grounding Language Understanding. (pp. 78-82). Stockholm, Sweden: International Speech Communication Association.
[28]
Pabon, P., Howard, D. M., Ternström, S., Kob, M. & Eckel, G. (2017). Future Perspectives. In Welch, Graham; Howard, David M.; Nix, John (Eds.), Oxford Handbook of Singing. Oxford University Press.
[29]
Schenkman, B., Gidla, V. K. (2017). Human echolocation in different situations and rooms : Threshold values. Architectural Acoustics: Paper 1aAAa1. In Proceedings of Meetings on Acoustics. Acoustical Society of America.
[31]
Saponaro, G., Jamone, L., Bernardino, A., Salvi, G. (2017). Interactive Robot Learning of Gestures, Language and Affordances. In Grounding Language Understanding.
[32]
Arlinger, S., Nordqvist, P. & Öberg, M. (2017). International Outcome Inventory for Hearing Aids : Data From a Large Swedish Quality Register Database. American Journal of Audiology, 26(3), 443-450.
[34]
Echternach, M., Burk, F., Koeberlein, M., Selamtzis, A., Doellinger, M., Burdumy, M. ... Herbst, C. T. (2017). Laryngeal evidence for the first and second passaggio in professionally trained sopranos. PLoS ONE, 12(5).
[35]
Elowsson, A., Friberg, A. (2017). Long-term Average Spectrum in Popular Music and its Relation to the Level of the Percussion. In AES 142nd Convention, Berlin, Germany.
[36]
Zhang, Y., Beskow, J., Kjellström, H. (2017). Look but Don’t Stare : Mutual Gaze Interaction in Social Robots. In 9th International Conference on Social Robotics, ICSR 2017. (pp. 556-566). Springer.
[37]
Alexanderson, S., O'Sullivan, C., Neff, M. & Beskow, J. (2017). Mimebot—Investigating the Expressibility of Non-Verbal Communication Across Agent Embodiments. ACM Transactions on Applied Perception, 14(4).
[38]
Khan, M. S. L., ur Réhman, S., Mi, Y., Naeem, U., Beskow, J., Li, H. (2017). Moveable facial features in a social mediator. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. 205-208). Springer.
[40]
Mendelson, J., Oplustil, P., Watts, O., King, S. (2017). Nativization of foreign names in TTS for automatic reading of world news in Swahili. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017. (pp. 2188-2192). International Speech Communication Association.
[41]
Efraimsson, N. (2017). Onset detection in polyphonic music.
[42]
Fahlström Myrman, A., Salvi, G. (2017). Partitioning of Posteriorgrams using Siamese Models for Unsupervised Acoustic Modelling. In Grounding Language Understanding.
[43]
Warhurst, S., Madill, C., McCabe, P., Ternström, S., Yiu, E. & Heard, R. (2017). Perceptual and Acoustic Analyses of Good Voice Quality in Male Radio Performers. Journal of Voice, 31(2), 259.e1-259.e12.
[44]
Alexanderson, S. (2017). Performance, Processing and Perception of Communicative Motion for Avatars and Agents (Doctoral thesis, KTH Royal Institute of Technology, Stockholm, TRITA-CSC-A 24). Retrieved from http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-218272.
[45]
Zimmerer, F., Andreeva, B., Möbius, B., Malisz, Z., Ferragne, E., Pellegrino, F., Brandt, E. (2017). Perzeption von Sprechgeschwindigkeit und der (nicht nachgewiesene) Einfluss von Surprisal. In ESSV - 28. Konferenz Elektronische Sprachsignalverarbeitung 2017. Saarbrücken.
[46]
Skantze, G. (2017). Predicting and Regulating Participation Equality in Human-robot Conversations : Effects of Age and Gender. In Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction. (pp. 196-204). IEEE Computer Society.
[47]
Elowsson, A. & Friberg, A. (2017). Predicting the perception of performed dynamics in music audio with ensemble learning. Journal of the Acoustical Society of America, 141(3), 2224-2242.
[48]
Beskow, J., Peters, C., Castellano, G., O'Sullivan, C., Leite, I., Kopp, S. (2017). Preface. In 17th International Conference on Intelligent Virtual Agents, IVA 2017. (pp. V-VI). Springer.
[49]
[50]
Alexanderson, S., O'Sullivan, C. & Beskow, J. (2017). Real-time labeling of non-rigid motion capture marker sets. Computers & Graphics, 69(Supplement C), 59-67.
Full list in the KTH publications portal