Skip to main content
Before choosing courseDT2112 Speech Technology 7.5 creditsAdministrate About course

The central part of the course concerns how speech can be used in human-computer interaction. Applications such as speaking and speech understanding computers and multimodal dialogue systems are presented. The course describes the basic concepts of human communication regarding speech, language and hearing. The use of digital signal analysis and statistical methods for analysis and classification of speech are also addressed, as are the evaluation, research and development of speech technology methods.

Choose semester and course offering

Choose semester and course offering to see information from the correct course syllabus and course offering.

* Retrieved from Course syllabus DT2112 (Spring 2019–)

Content and learning outcomes

Course contents

The course Speech Technology deals with research and development in speech technology and also povides basic insights in speech, language and hearing. The course contains segments concerning:

  • Linguistic theory and phonetics
  • Speech production by humans (speech physiology and acoustics) and computers (text-to-speech synthesis)
  • Speech perception by humans (hearing and psycholinguistics) and computers (speech recognition)
  • Multimodal dialogue systems for human-computer interaction with speech and vision
  • Practical speech technology studies and experiments
  • Methods for evaluation of and experimentation in speech technology
  • Data collection for speech technology research and development

Intended learning outcomes

After completion of the course, participants shall be able to:

  • Describe speech from an acoustic, phonetic, and linguistic perspective
  • Explain how computers recognize speech with statistical methods, and evaluate the recognition results
  • Describe and judge different methods used to produce speech with computers
  • Analyze speech-driven dialogue systems with respect to application, components, functionality and user aspects
  • Describe how evaluation of speech technology systems work and describe the special requirements that are posed by this type of system
  • Design and describe data collections for speech technology research and development
  • Give an account of available state-of-the art speech technology and exemplify the current speech research on e.g., mobile systems and IT applications
  • Apply the theoretical knowledge in small speech technology projects

Course Disposition

  • Four full days of lectures and group labs
  • Project design and project execution in group
  • Project supervision in larger groups
  • Project presentations and project report writing
  • Home exam with result seminar

Literature and preparations

Specific prerequisites

Single course students: At least 60 ECTS of which 30 ECTS within Mathematics or Computational Lingvistics. Furthermore English B, or equivalent.

Recommended prerequisites

Proficiency in mathematics and signal analysis corresponding to the first 2.5 years of Degree Programme in Media Technology.

Equipment

No information inserted

Literature

No information inserted

Examination and completion

If the course is discontinued, students may request to be examined during the following two academic years.

Grading scale

A, B, C, D, E, FX, F

Examination

  • LABA - Laboratory assignments, 0,5 hp, betygsskala: P, F
  • LABB - Research laboratory assignment, - hp, betygsskala: A, B, C, D, E, FX, F
  • PR01 - Project and report, 3,5 hp, betygsskala: A, B, C, D, E, FX, F
  • TENA - Home exam, 3,5 hp, betygsskala: A, B, C, D, E, FX, F

Based on recommendation from KTH’s coordinator for disabilities, the examiner will decide how to adapt an examination for students with documented disability.

The examiner may apply another examination format when re-examining individual students.

In this course all the regulations of the code of honor at the School of Computer science and Communication apply, see: http://www.kth.se/csc/student/hederskodex/1.17237?l=en_UK.

Other requirements for final grade

  • Active participation in scheduled activities
  • Pass on labs (timely)
  • Pass on project bid (timely)
  • Pass on project presentation (timely)
  • E or better on project report (A-E, decides 50% of end grade)
  • E or better on home exam (A-E, decides 50% of the end grade)

Pass on labs, project bid, and project presentation are necessary in order to continue the course. Late hand-in of home exam or project report results a grade lowering (one step). One reexam is given, with a limited range of passed grades (B-E).

Opportunity to complete the requirements via supplementary examination

No information inserted

Opportunity to raise an approved grade via renewed examination

No information inserted

Examiner

Profile picture Jens Edlund

Profile picture Joakim Gustafsson

Ethical approach

  • All members of a group are responsible for the group's work.
  • In any assessment, every student shall honestly disclose any help received and sources used.
  • In an oral assessment, every student shall be able to present and answer questions about the entire assignment and solution.

Further information

Course web

Further information about the course can be found on the Course web at the link below. Information on the Course web will later be moved to this site.

Course web DT2112

Offered by

EECS/Intelligent Systems

Main field of study

Electrical Engineering

Education cycle

Second cycle

Add-on studies

No information inserted

Contact

Jens Edlund, e-post: edlund@speech.kth.se

Supplementary information

In this course, the EECS code of honor applies, see:
http://www.kth.se/en/eecs/utbildning/hederskodex