Skip to main content
Till KTH:s startsida Till KTH:s startsida

SF2930 Regression Analysis 7.5 credits

This course offers an introduction to modern methods of regression analysis with applications.   Regression analysis is a statistical technique for investigating and modelling the relationship between a variable of interests,  the response,  and a set of related predictor variables. Regression techniques are of high practical importance and their extensive use is a hallmark of modern statistical applications. Successful application of regression analysis demands appropriate acquaintance with underlying theory and handling of real world problems. The overall goal of the course meeting the demand is thus twofold:  to acquaint students with the statistical methodology of the regression modelling and to develop advanced practical skills that are necessary for applying regression techniques to a  real-world data analysis problem.

Choose semester and course offering

Choose semester and course offering to see current information and more about the course, such as course syllabus, study period, and application information.

Application

For course offering

Spring 2025 CINEK m.fl. programme students

Application code

60214

Headings with content from the Course syllabus SF2930 (Spring 2022–) are denoted with an asterisk ( )

Content and learning outcomes

Course contents

The course begins with simple and multiple linear regression models for which fitting, parametric and model inference as well as prediction will be explained.  Topics covered are least squares (LS) and generalised LS, the Gauss-Markov theorem,  geometry of least squares and orthogonal projections.  A special attention is paid to the diagnostic strategies which are key components of good model fitting.  Further topics include transformations and weightings to correct model inadequacies, the multicollinearity issue, variable subset  selection and model building techniques.  Later in the course, some general strategies for regression modelling will be presented with a particular focus on the generalized linear models (GLM) using the examples with binary and count response variables.

As the high-dimensional data, order of magnitude larger than those that the classic regression theory is designed for,  are nowadays a rule rather than an exception in computer-age practice  (examples include  information technology,  finance,  genetics and astrophysics,  to name just a few),   regression methodologies  which allow to cope with the high dimensionality are presented.  The emphasis is placed on methods of controlling the regression fit by  regularization (Ridge, Lasso and Elastic-Net),  as well as methods using derived input directions  (Principal Components regression and Partial Least Squares) that allow to tamp down statistical variability in high-dimensional estimation and prediction problems. 

A number of statistical learning procedures with the focus on computer-based algorithms is presented from a regression perspective. 

Computer-aided project work with a variety of datasets forms an essential learning activity.

Intended learning outcomes

To pass the course, the student shallbe able to:

  • Formulate and apply statistical regression theory
  • Formulate and apply advanced methods in statistical regression modeling
  • Design and implement advanced methods in regression analysis for applications

Literature and preparations

Specific prerequisites

  • English B/English 6
  • Completed basic course in numerical analysis (SF1544, SF1545 or equivalent)
  • Completed basic course in probability theory and statistics (SF1922, SF1914 or equivalent)

Recommended prerequisites

A passed course corresponding to SF1811 Optimizatio

Equipment

No information inserted

Literature

See the course web page

Examination and completion

If the course is discontinued, students may request to be examined during the following two academic years.

Grading scale

A, B, C, D, E, FX, F

Examination

  • OVN1 - Assignments, 3.0 credits, grading scale: P, F
  • TENA - Examination, 4.5 credits, grading scale: A, B, C, D, E, FX, F

Based on recommendation from KTH’s coordinator for disabilities, the examiner will decide how to adapt an examination for students with documented disability.

The examiner may apply another examination format when re-examining individual students.

Opportunity to complete the requirements via supplementary examination

No information inserted

Opportunity to raise an approved grade via renewed examination

No information inserted

Examiner

Ethical approach

  • All members of a group are responsible for the group's work.
  • In any assessment, every student shall honestly disclose any help received and sources used.
  • In an oral assessment, every student shall be able to present and answer questions about the entire assignment and solution.

Further information

Course room in Canvas

Registered students find further information about the implementation of the course in the course room in Canvas. A link to the course room can be found under the tab Studies in the Personal menu at the start of the course.

Offered by

Main field of study

Mathematics

Education cycle

Second cycle

Add-on studies

No information inserted

Supplementary information

Replaces SF2950