Speech and Speaker Recognition
NEW VT2016: the course has been redesigned with:
NEW VT2015: the course has been redesigned with:
Automatic Speech Recognition (ASR) is the problem of transcribing spoken words and phrases into text. ASR functionality is usually integrated into a larger system that lets humans interact with computers using natural language. From a technical point of view, ASR poses a number of challenges that arise from the need to handle real-life signals produced by different individuals under different conditions. The solutions are usually based on statistical modeling and machine learning.
This course gives insight into the signal processing and statistical methods employed in ASR and in speaker identification.
Speech recognition, speech production, speech analysis, features, statistical modeling of sequences, hidden Markov models, search algorithms, language models, speaker identification.
The course can also be taken at the doctoral level with course number 2F5118. The extra requirements for doctoral-level credits will be discussed on an individual basis.
- Giampiero Salvi, Examiner