PhD Courses in Denmark

Machine Learning for SCIENCE (MLS)

PhD School at the Faculty of SCIENCE at University of Copenhagen

Aim and content


The course will take place over five consecutive Tuesdays, starting Apr. 29, 2025 (i.e. April 29. May 6, May 13, May 20, May 27).

The Machine Learning for SCIENCE (MLS) course introduces key analysis methods in Machine Learning. These methods allow investigations of scientific data from most fields, including data from physical measurements, questionnaires, pictures, internet searches, satellites, and biochemical outcomes. We cover data cleaning (e.g. missing data, denoising), feature extraction, machine learning basics (labels, variables, parameter optimization, overfitting, cross-validation), key machine learning and image analysis methods based on both unsupervised and supervised learning, and visualization. Method-wise, we start at Linear Discriminant Analysis and end with Deep Learning.
At the end of the course, the students must write a synopsis with a suggestion for an analysis ideally performed on their own data including a small implementation of a key concept. This synopsis could form the basis for the Data Science Projects PhD course also offered by the Data Science Lab.

Formal requirements

The number of participants is limited at 50, and priority will be given to PhD students enrolled at UCPH-SCIENCE and participants from a previous Data Science Lab course (Introduction to Python or R, Statistical Methods I).

We assume that the students have some experience with Python programming.



Learning outcome

After course completion the students are expected to be able to:

Knowledge:
- Understand key machine learning concepts (parameter training, overfitting).
- Understand key machine learning methods (LDA, (un-) supervised learning).
- Understand key image analysis methods (e.g. feature extraction).

Skills:
- Develop/adapt/extend a computer-based software method for analysis of relevant data.
Competences:
- Propose relevant analysis methods for scientific data science problems.
- Consider cross-disciplinary data science methods in their research.

Literature

Course lecture slides and exercises.
We will use data, examples, and other material from publicly available sources.

Teaching and learning methods

The course is composed of sessions combining lectures and exercises. For each topic, the students will get hands-on experience in applying, modifying, and programming analysis methods.
The programming examples will be implemented using Python in JupyterLab notebooks.

Remarks

Questions:
If you have ay questions please contact course organizer Raghavendra Selvan (raghav@di.ku.dk)

Examination:
The students need to hand in their synopsis (10 days after the final course day). The synopsis must be approved. The students are allowed to work in 2-person groups.

Participation fee:
PhD students enrolled at the PhD School of SCIENCE are exempt from the participation fee.
All other students are required to pay the participation fee of DKK 3600.


Details and Updates:
For details for this and other Data Science Lab courses, see: http://datalab.science.ku.dk/english/course/