Skip to main content
Till KTH:s startsida

FSK3538 Data-driven Life Sciences 7.5 credits

The future of life science is data-driven, providing major opportunities to explore and understand biology, human health, and ecosystems.

This course introduces students to the rapidly growing field of data-driven life sciences, with a focus on how modern biology, and computational methods come together to transform research. Students will engage with real biological data from genomics, imaging, proteomics, metabolomics, molecular dynamics, and population-scale studies, and learn how models of biological systems are built from these data.

Alongside biological perspectives, the course equips students with practical digital skills for the next generation of scientists. Participants will learn how to:

  • use Generative AI tools (e.g. ChatGPT) to analyze, visualize, and communicate scientific results,

  • practice prompt engineering skills, vibe coding to efficiently use AI coding assistants to perform data analysis

  • design and build their own AI agents to accelerate scientific discovery.

By combining scientific content with hands-on skills in coding, data analysis, and GenAI, the course prepares students for the future role of the AI-empowered scientist. Ethical aspects of data collection, management, and sharing are also emphasized throughout.

This course is offered to master's and PhD-level students.

For more information, please check our course website: https://ddls.aicell.io/

 

Information per course offering

Termin

Information for Autumn 2025 Start 25 Aug 2025 programme students

Course location

KTH Solna

Duration
25 Aug 2025 - 24 Oct 2025
Periods

Autumn 2025: P1 (4 hp), P2 (3.5 hp)

Pace of study

50%

Application code

10519

Form of study

Normal Daytime

Language of instruction

English

Course memo
Course memo is not published
Number of places

Places are not limited

Target group
No information inserted
Planned modular schedule
[object Object]
Schedule
Schedule is not published
Part of programme
No information inserted

Contact

Examiner
No information inserted
Course coordinator
No information inserted
Teachers
No information inserted

Course syllabus as PDF

Please note: all information from the Course syllabus is available on this page in an accessible format.

Course syllabus FSK3538 (Spring 2022–)
Headings with content from the Course syllabus FSK3538 (Spring 2022–) are denoted with an asterisk ( )

Content and learning outcomes

Course disposition

The course is organized into six modules. Each module (except the introductory module) covers a major topic in life science, such as genomics, imaging, proteomics, or molecular dynamics. Every module features invited lectures by leading experts, including international researchers and national fellows from the Data-Driven Life Science (DDLS) program.

In addition to these lectures, the course integrates two recurring activities:

  • Computer lab sessions: Students receive guided instruction in web-based coding and the use of generative AI tools (e.g. ChatGPT, Cursor, Gemini CLI). These sessions focus on practical skills such as building AI agents, designing workflows, training models, and creating MCP tools. Students will learn prompt engineering and context building for AI agents to support scientific research and discovery.

  • Journal clubs: Each week, students read and critically discuss a selected research paper. In small groups, participants analyze the structure, strengths, and limitations of the paper. This format helps students develop critical thinking skills, scientific communication, and the ability to evaluate cutting-edge research. Previous cohorts have highlighted the journal club as one of the most enjoyable and engaging parts of the course.

This combined structure—expert lectures, hands-on labs, and interactive journal discussions—ensures that students not only gain knowledge of data-driven life science but also develop practical skills and critical perspectives needed for future AI-empowered scientists.

Course contents

The course aims to introduce students to the field of computer-driven life sciences by letting them learn about their different application areas.

This course will introduce the student to data sets of different types, such as genomics, proteomics, metabolomics, transcriptomics, biomolecular structure, molecular dynamics simulations, imaging, video / audio recording, organism and habitat monitoring, population scale genetics, biobanks. Models of the biological phenomena and the related scientific breakthroughs based on the analysis of such data sets will be presented, analyzed and discussed.

Analysis techniques that will be introduced and used in this class belong to machine learning, artificial intelligence, other computational techniques for statistical analysis. In addition, visualization techniques will be introduced and discussed.

Another important aspect that will be introduced and discussed is related to ethics for data collection, management, analysis and sharing. The students will be specially trained in good practice related to computer-driven life sciences.

Intended learning outcomes

After passing the course, the student should be able to:

  • describe the field of data-driven life sciences, including an overview of the different application areas, and give examples of applications and their associated analysis methods
  • apply statistical analysis and machine learning analysis to biological data sets and formulate models of biological phenomena
  • present and review scientific literature in the field of computer-driven life sciences
  • reflect on ethical consequences of data-driven life sciences and describe good practice around the computer life cycle (collection, handling, sharing and analysis)

Literature and preparations

Specific prerequisites

Enrolled PhD student.

Literature

You can find information about course literature either in the course memo for the course offering or in the course room in Canvas.

Examination and completion

Grading scale

P, F

Examination

  • LAB1 - Computer lab, 2.0 credits, grading scale: P, F
  • PRO1 - Project, 5.5 credits, grading scale: P, F

Based on recommendation from KTH’s coordinator for disabilities, the examiner will decide how to adapt an examination for students with documented disability.

The examiner may apply another examination format when re-examining individual students.

If the course is discontinued, students may request to be examined during the following two academic years.

Examiner

Ethical approach

  • All members of a group are responsible for the group's work.
  • In any assessment, every student shall honestly disclose any help received and sources used.
  • In an oral assessment, every student shall be able to present and answer questions about the entire assignment and solution.

Further information

Course room in Canvas

Registered students find further information about the implementation of the course in the course room in Canvas. A link to the course room can be found under the tab Studies in the Personal menu at the start of the course.

Offered by

Education cycle

Third cycle

Postgraduate course

Postgraduate courses at SCI/Applied Physics