The central part of the course concerns how speech can be used in human-computer interaction. Applications such as speaking and speech understanding computers and multimodal dialogue systems are presented. The course describes the basic concepts of human communication regarding speech, language and hearing. The use of digital signal analysis and statistical methods for analysis and classification of speech are also addressed, as are the evaluation, research and development of speech technology methods.
Choose semester and course offering
Choose semester and course offering to see information from the correct course syllabus and course offering.
Content and learning outcomes
The course Speech Technology deals with research and development in speech technology and also povides basic insights in speech, language and hearing. The course contains segments concerning:
- Linguistic theory and phonetics
- Speech production by humans (speech physiology and acoustics) and computers (text-to-speech synthesis)
- Speech perception by humans (hearing and psycholinguistics) and computers (speech recognition)
- Multimodal dialogue systems for human-computer interaction with speech and vision
- Practical speech technology studies and experiments
- Methods for evaluation of and experimentation in speech technology
- Data collection for speech technology research and development
Intended learning outcomes
After completion of the course, participants shall be able to:
- Describe speech from an acoustic, phonetic, and linguistic perspective
- Explain how computers recognize speech with statistical methods, and evaluate the recognition results
- Describe and judge different methods used to produce speech with computers
- Analyze speech-driven dialogue systems with respect to application, components, functionality and user aspects
- Describe how evaluation of speech technology systems work and describe the special requirements that are posed by this type of system
- Design and describe data collections for speech technology research and development
- Give an account of available state-of-the art speech technology and exemplify the current speech research on e.g., mobile systems and IT applications
- Apply the theoretical knowledge in small speech technology projects
Literature and preparations
Single course students: At least 60 ECTS of which 30 ECTS within Mathematics or Computational Lingvistics. Furthermore English B, or equivalent.
Proficiency in mathematics and signal analysis corresponding to the first 2.5 years of Degree Programme in Media Technology.
Examination and completion
If the course is discontinued, students may request to be examined during the following two academic years.
- LABA - Laboratory assignments, 0.5 credits, grading scale: P, F
- LABB - Research laboratory assignment, - credits, grading scale: A, B, C, D, E, FX, F
- PR01 - Project and report, 3.5 credits, grading scale: A, B, C, D, E, FX, F
- TENA - Home exam, 3.5 credits, grading scale: A, B, C, D, E, FX, F
Based on recommendation from KTH’s coordinator for disabilities, the examiner will decide how to adapt an examination for students with documented disability.
The examiner may apply another examination format when re-examining individual students.
Other requirements for final grade
- Active participation in scheduled activities
- Pass on labs (timely)
- Pass on project bid (timely)
- Pass on project presentation (timely)
- E or better on project report (A-E, decides 50% of end grade)
- E or better on home exam (A-E, decides 50% of the end grade)
Pass on labs, project bid, and project presentation are necessary in order to continue the course. Late hand-in of home exam or project report results a grade lowering (one step). One reexam is given, with a limited range of passed grades (B-E).
Opportunity to complete the requirements via supplementary examination
Opportunity to raise an approved grade via renewed examination
- All members of a group are responsible for the group's work.
- In any assessment, every student shall honestly disclose any help received and sources used.
- In an oral assessment, every student shall be able to present and answer questions about the entire assignment and solution.
Further information about the course can be found on the Course web at the link below. Information on the Course web will later be moved to this site.Course web DT2112
Main field of study
In this course, the EECS code of honor applies, see: