Gonzalo Aponte Navarro: Speech Recognition with Hidden Markov Models - a study about isolated word recognition

Masters Thesis in Mathematical Statistics

Tid: On 2020-01-29 kl 11.15

Föreläsare: Gonzalo Aponte Navarro

Plats: Kräftriket, house 5, room 22

Handledare: Chun-Biu Li

Abstract

Hidden Markov Models (HMM’s) are a versatile class of statistical models which have been used in various contemporary speech recognition systems. In this study it was explored how these models could be applied for recognizing words in a data set with recordings of 11 dif- ferent words. Feature vectors based on the Discrete Fourier Transform (DFT) were used for word modelling and recognition. Variations of the HMM’s were fitted and tested on the data set. The out-of-sample error for each model was estimated by performing a 10-fold cross-validation. Moreover, the full data set was used for both training and testing. In the latter experiment, the smallest total error among the constructed word recognizers was 13.3 %. Furthermore, the study explored how HMM’s could be used for detecting the presence of words in a recording. The word detector failed to detect certain segments of different words but performed well in other cases.

Innehållsansvarig:webmaster@math.kth.se
Tillhör: Institutionen för matematik
Senast ändrad: 2020-01-22