Till KTH:s startsida Till KTH:s startsida

Project

The project is performed in groups of 4 students in the period December 12, 2016 - January 16, 2017. To make the groups as homogeneous as possible in terms of ambition level for this course, we will divid you into groups dependent on grade performance on Assignment 1.

The final group assignments with assigned paper to each group will be announced on  Project groups, which is only open to students registered for this course round. Each group selects a project together and emails the supervisor see the text under supervision. 

PhD students have been placed in separate groups, and working with papers related to their research.

The task of the project is to reproduce the results presented in a published scientific article, describe the article orally and in written form to your peer students, and argue for and against the method presented in the article. From this you will learn how to read scientific articles, how to implement and use a particular method, how to argue for and against a method, and how to adapt the presentation of a method to different target groups (i.e., adapt the presentation of the method in the article - targeted to active researchers in Machine Learning - so that it is understandable to first year Master students in Machine Learning).

Below, you find lists of scientific articles; Each teacher has provided their own list, and will take responsibility for the supervision of the projects on their papers. 

Supervision

The group should plan and carry out the project autonomously. However, the teacher who listed the article you work with is available to answer questions, should you get stuck. We may also schedule some help sessions. Please, contact you supervisor as soon as the group has selected a paper. Send a single email from the group to the supervisor no later than noon Friday December 16. If you are too unevenly distributed over supervisors, we may move some groups to another supervisor than the one who suggested the paper. We will first move the ones that last emailed their selection. 

Grading

The requirements for pass (i.e., E or higher) are

  • an implementation of the method and recreation of the experiments, according to the instructions below,
  • a written report according to the instructions below,
  • an oral presentation according to the instructions below.

Higher grades (A-D) are set based on

  • how well you are able to repeat the results in the article (or alternatively, how well you argue for deviations between the results in the article and your own results),
  • how well you are able to argue for and against the method in the article, and
  • how well you are able to present the method in the article, making it understandable to your peer students. Both the written and oral presentation are taken into account.

Implementation

Each group should implement the method as described in the paper. You may use any programming language. You should implement the entire algorithm "from scratch" to fully understand all details of it. there are exceptions to this, such as complex optimization schemes, which are too time consuming to implement - these exceptions are listed in connection to each paper.

You are not allowed to contact the authors and ask for their source code.

You will then perform the same experiments as those described in the article. Ideally, you should get the same results. If this is not the case, you must make an argument about possible reasons, and prove your argument by small experiments which you design and carry out yourself. A negative reproduction result together with a good argument gives an equally high grade as a positive reproduction result.

All group members must participate actively in the implementation, i.e., write code. Each piece of code that you write should have a comment stating who contributed to that piece of code. You do not have to submit the code for assessment, but be prepared to show it upon request. (For more information about code plagiarism, se General > Honor code in the menu to the left.)

Written Report

The article, the re-implementation, and your results are presented in a written report, to be sent to your supervisor via email in pdf format before the oral presentation on January 16. The report should be at most 7 pages, with 12 pt font and about 2.5 cm margins, including references, images, and tables. In addition to the 7 pages, the report should have a cover page with title, group number, author list and (optional) abstract. The report should be written in (to a reasonable level) grammatically correct English.

In the report you should first describe the article on such a level of detail that your peer students in this course understand the method, and so that it is clear to the reader that you understand the method too.

You should then present your re-implementation of the method, and your reproduction of the results, again on such a level that your peer students understand what you have done, and so that it is clear to the reader what results you got and if, how, and why they deviate from the results presented in the original article.

Finally you should argue for and against the method, possibly suggesting improvements.

All statements made in the report (e.g., "method X is better than method Y") should be supported by either a reference to the original paper or report where the statement was made, or if the statement originates from you, you should explain why this statement is true.

A technically correct, well organized report with good language and a clear line of argument will receive a high grade. Missed hand-in deadline, violations of the length and formatting requirements, as well as statements not supported by references will have a heavy negative effect on the grade.

All group members must participate actively in the writing of the report. By adding a group member to the author list of the report, you certify that this person has written at least one section of the report.

Oral Presentation

You will also present your project to the other students of the course in a short oral presentation on January 16.

Each presentation is 10 minutes long and should follow the this format:

  • The aim of the article - what problem is addressed?
  • The method
  • The results that they get in the article
  • The results that you get
  • If there are differences - what are the reasons?
  • Arguments for and against the method

A clear, well organised presentation will receive a high grade. We can assure you that you will not get lower grades if you are nervous and make mistakes because of that. However, do not try to cram too much into the presentation - violations of the 10 minute time limit will have a negative effect on the grade.

Slides are optional, but if you have them they should be in pptx or pdf format. Bring the slides to the session on a memory stick, so that they can be uploaded to one computer in the beginning of the session.

All group members must participate actively in the oral presentation, i.e., talk for at least a minute. Those that cannot be present can do their part over Skype. 

Jens Lagergren skapade sidan 19 oktober 2016

kommenterade 8 november 2016

This information seems to be outdated; is there any chance you could update it, in particular the information concerning important dates for the project?

Lärare kommenterade 11 november 2016

Sorry, now it's the correct years. 

kommenterade 13 december 2016

Hello! 

Should the report be at most 7 pages or 10 pages? 

Thanks! 

Lärare kommenterade 13 december 2016

7

kommenterade 15 december 2016

Hi. I have an exam 08-12 on January 16th in Kista, how does the schedule look like for the presentation for the project?

Lärare kommenterade 15 december 2016

Afternoon. 

Assistent kommenterade 17 december 2016

Dear students

Please find the following project assignments (on a personal and group level) concerned with the project papers suggested by Pawel. Since I am away until January 6, I suggest that we communicate via email. I also propose that you create a googledoc where I could address some of your questions and leave comments. As for your work, it is not necessary for you to reproduce all the results but what you create has to be self-consistent and has to stand on its own. For example, in the paper "Text classification using string kernels" it is possible to focus on the first part (without Approximating kernels) as it constitutes a logic entity on its own.

To start with, could each group email me with a plan on what they are going to do and a list of potential questions? Later on, we can share the document and I will certainly answer further questions that will inevitably emerge (please try to group them in packets rather than sending a plethora of single-question emails).

"Text classification using string kernels"

Group 2: JJ, DS, GT, PM

Group 3: PB, PK, ED, BC

Group 4: MV, JH, DM, TB

Group 13: VP, BG, WK, FF

Group 15: JA, NBR, SL, ELM

Group 23: MG, PM, SA, IJ

Group 36: MD, JD, FH, ÞDG

 

“The Relevance Vector Machine”

Group 8: JH, AD, MM

Group 16: HH, DK, OXL, ML, HA

Group 37: OH, PF, HN, OB

 

"Sparse Gaussian processes using pseudo-inputs"

Group 9: SL, LM, MN, HH, CM

"Probabilistic principal components analysis"

Group 22: GC, AT, PM, AD

kommenterade 23 december 2016

Hi,

I have question regarding projects.

In order to implement the method, we need to have access to the data used in the paper.

Do some of u professors have license to access?

Or can we create some synthetic data for numerical simulation purpose?

Thank you!

Motoya

Lärare kommenterade 23 december 2016

The data is sometimes accessible in the paper or in a supplement of it, which you can access from KTH as well as any of us. If you cannot find data, please go ahead and create synthetic data. This is also a good idea if the original paper use synthetic data.

kommenterade 4 januari 2017

Given that the criteria for the higher grades are rather vague, what can those of us who are aiming for a high grade do to ensure that we get the grade we are aiming for?

Also, will the difficulty (theoretical or practical) of the paper be taken into account during grading, assuming that there is a significant difference in difficulty between the papers?

kommenterade 7 januari 2017

The 7 page limit is for a two column report or single column report?

Lärare kommenterade 8 januari 2017

Single column :). 

kommenterade 13 januari 2017

For those of you implementing "Kernel PCA and de-noising in feature spaces", some of the versions found on google do not display some of the figures correctly (for example figure 1 where lots of data has gone missing). Here is a link for a paper with a correct figure 1.

http://doc.ml.tu-berlin.de/publications/publications/MikSchSmoMueRaeSch99.pdf

kommenterade 14 januari 2017

Hello!

I have a question regarding the oral presentations taking place on Jan 16 between 14:00-19:00. Do we have to attend all 5 hours? And is there a presenting schedule we can have access to in advance? 

Best,
Michael

Lärare kommenterade 14 januari 2017

Sorry, it seems that I forgot to add 

   You have to participate during "your hour".

So you only have to be there during 1 hour. 

kommenterade 15 januari 2017

May people not physically present record a video, or is skype presence specifically what you are after? How will this work?

kommenterade 15 januari 2017

Where can I see where my groups "hour" is?

Lärare kommenterade 15 januari 2017

The schedule can be found under schedule. I repeat it below. 

We will run the presentations in parallell sessions. Hedvig will be in charge of the presentations in Q2 and Pawel of those in Q31. I will be running back and fourth between Q2 and Q31. Five groups will present in each lecture hall during each hour. You have to participate during "your hour". The schedule is design without considering who your supervisor is. Check the format under projects (to the left). I'll make sure that, we can use the lecture halls after 18.00 (but if your schedule late keep an eye on this page).

Q31:
14.00-15.00 group 1-5
15.00-16.00 group 11-15
16.00-17.00 group 21-25
17.00-18.00 group 31-25
18.00-18.20 group 41-42

Q2:
14.00-15.00 group 6-10
15.00-16.00 group 16-20
16.00-17.00 group 26-30
17.00-18.00 group 36-40
18.00-18.20 group 43-44

Lärare kommenterade 15 januari 2017

Yes Nikos, missing members should do their part over Skype. The group is in charge of organizing it.

En användare har tagit bort sin kommentar
kommenterade 15 januari 2017

Have I understod it correctly, the missing member should present its part live during the presentation? Is it ok to show a recorded video?

Lärare kommenterade 15 januari 2017

It should be Skype. 

kommenterade 30 januari 2017

When can we expected to get the results for home assignment 1 & 2 and the project?

kommenterade 16 mars 2017

Hello, my results did not show in my Ladok yet.