Skip to main content
Till KTH:s startsida

ID2214 Programming for Data Science 7.5 credits

The course covers the following topics:

  • Syntax and semantics for programming languages that are particularly suited for data science, e.g., Python.
  • Routines for importing, combining, transforming and selecting data.
  • Algorithms for handling missing values, discretisation and dimensionality reduction.
  • Algorithms for supervised machine learning, e.g., naïve Bayes, decision trees, random forests.
  • Algorithms for unsupervised machine learning, e.g., k-means clustering.
  • Libraries for data analysis.
  • Evaluation methods and performance metrics.
  • Visualising and analysing results.

Information per course offering

Choose semester and course offering to see current information and more about the course, such as course syllabus, study period, and application information.

Termin

Information for Autumn 2025 Start 27 Oct 2025 programme students

Course location

KTH Campus

Duration
27 Oct 2025 - 12 Jan 2026
Periods
P2 (7.5 hp)
Pace of study

50%

Application code

50341

Form of study

Normal Daytime

Language of instruction

English

Course memo
Course memo is not published
Number of places

Places are not limited

Target group

Open to TIDAB, TITEH (TIDB) and all master's programmes as long as it can be included in your programme.

Planned modular schedule
[object Object]
Schedule
Schedule is not published

Contact

Examiner
No information inserted
Course coordinator
No information inserted
Teachers
No information inserted

Course syllabus as PDF

Please note: all information from the Course syllabus is available on this page in an accessible format.

Course syllabus ID2214 (Autumn 2021–)
Headings with content from the Course syllabus ID2214 (Autumn 2021–) are denoted with an asterisk ( )

Content and learning outcomes

Course contents

Syntax and semantics for programming languages that are particularly suited for data science, e.g., Python.

Routines to import, combine, convert and make selection of data.

Algorithms for handling of missing values, discretisation and dimensionality reduction.

Algorithms for supervised machine learning, e.g., naïve Bayes, decision trees, random forests.

Algorithms for unsupervised machine learning, e.g., k-means clustering.

Libraries for data analysis.

Evaluation methods and performance metrics.

Visualisation and analysis of results of data analysis.

Intended learning outcomes

Having passed the course, the student should be able to

  • account for and discuss the application of i) technologies to convert data to an appropriate format for data analysis ii) algorithms to analyse data through supervised and unsupervised machine learning as well as iii) technologies and performance metrics for evaluation of data analysis results
  • implement and apply i) technologies to convert data to an appropriate format for data analysis ii) algorithms for supervised and unsupervised machine learning as well as iii) technologies and performance metrics for evaluation of data analysis results.

Literature and preparations

Specific prerequisites

Completed course in programming equivalent to ID1018/DD1310/DD1311/DD1312/DD1314/DD1315/DD1316/DD1318/DD1331/DD1337/DD100N.

Active participation in a course offering where the final examination is not yet reported in Ladok is considered equivalent to completion of the course.

Registering for a course is counted as active participation.

The term 'final examination' encompasses both the regular examination and the first re-examination.

Literature

You can find information about course literature either in the course memo for the course offering or in the course room in Canvas.

Examination and completion

If the course is discontinued, students may request to be examined during the following two academic years.

Grading scale

A, B, C, D, E, FX, F

Examination

  • INL1 - Assignment, 4.5 credits, grading scale: A, B, C, D, E, FX, F
  • TEN1 - Examination, 3.0 credits, grading scale: A, B, C, D, E, FX, F

Based on recommendation from KTH’s coordinator for disabilities, the examiner will decide how to adapt an examination for students with documented disability.

The examiner may apply another examination format when re-examining individual students.

Written examination. Written assignments.

Examiner

Ethical approach

  • All members of a group are responsible for the group's work.
  • In any assessment, every student shall honestly disclose any help received and sources used.
  • In an oral assessment, every student shall be able to present and answer questions about the entire assignment and solution.

Further information

Course room in Canvas

Registered students find further information about the implementation of the course in the course room in Canvas. A link to the course room can be found under the tab Studies in the Personal menu at the start of the course.

Offered by

Main field of study

Computer Science and Engineering

Education cycle

Second cycle

Supplementary information

In this course, the EECS code of honor applies, see: http://www.kth.se/en/eecs/utbildning/hederskodex.