PhD Courses in Denmark

Introduction to Machine Learning for the Social Sciences

Copenhagen Graduate School of Social Sciences

Copenhagen Centre for Social Data Science

Dates and time:
27-29 August 2024 from 9 am to 4 pm.

This course will introduce the basics of big data and machine learning and how it can be used in the context of social science research. No prior knowledge is assumed, and it is well-suited for people with no prior experience using big data or machine learning. The course will cover different topics including how big data methods differ from inferential statistics, data cleaning and pre-processing, different machine learning models and explainable AI methods, data ethics and privacy, as well as bias and responsible AI.

Core machine learning principles such as cross-validation, out-of-sample prediction, and hyper-parameter tuning will be introduced. A key focus will be on interpretable machine learning models such as regression-based models, decision trees, and random forests.       

The assessment will involve completing a machine learning analysis, either in Python or in R. Therefore, some prior experience with one of these coding languages is preferred. A basic understanding of inferential statistics is also preferred. Participants can bring their own data or use the data that will be provided. 

Academic Aim:
- To think critically about when Big Data and machine learning should be used for social science research (assess advantages, disadvantages, and limitations)
- To be able to perform a machine learning analysis

Target group: Social science background. To gain most from the course some prior experience with R or Python and a basic understanding of inferential statistics is ideal.

Course organiser: Rosa Ellen Lavelle-Hill, Assistant Professor, Department of Psychology, University of Copenhagen

Programme:

Language: English

ECTS: 2

Max. numbers of participants: 25

Course fee: Free for PhD students at all Danish Universities, except PhD students at CBS who will be charged a fee of DKK 2,400 (1.200 per ECTS).

Registration: Please register via the link in the box no later than 13 August 2024.

Further information: For more information about the PhD course, please contact the PhD Administration (phd@hrsc.ku.dk).

Literature:
Breiman, L. (2001). Statistical modeling: The two cultures (with comments and a rejoinder by the author). Statistical science, 16(3), 199-231.

Yarkoni, T., & Westfall, J. (2017). Choosing prediction over explanation in psychology: Lessons from machine learning. Perspectives on Psychological Science, 12(6), 1100-1122.

Molnar, Christoph. “Interpretable machine learning. A Guide for Making Black Box Models Explainable”, 2019. https://christophm.github.io/interpretable-ml-book/.

Shmueli, G. (2010). To explain or to predict?