Skip to main content

Methods for utilizing found speech data

Time: Tue 2021-04-13 15.15

Location: Zoom

Participating: Per Fallgren

My research is about investigating, producing and evaluating methods that utilize found speech data - speech that was not recorded with the purpose of being used in research (e.g. radio segments, public speeches, podcasts, archival recordings). Such data hold great worth in several regards but is typically challenging to work with. Contemporary machine learning approaches will only get you so far given the noisy and varied nature of these types of data collections, not to mention the scarcity of high-quality labels. Because of this, my research is mainly revolving around using a combination of methods such as unsupervised learning, visualization techniques, perceptual tricks with regard to human hearing and having an overall focus on including the human in the loop. Among other things, this has resulted in Edyson, a sound browsing tool that have shown promise not only in an exploratory fashion, but also as a viable scientific method used to utilize found speech. In my 80% seminar I will elaborate on these results and give a general report on my progress so far. Welcome!

Belongs to: Speech, Music and Hearing
Last changed: Apr 07, 2021