PhD Courses in Denmark

Computational Research Methods

CBS PhD School

Course Coordinators: Michel Avital and Jason Burton, Department of Digitalization (DIGI)

 

Faculty

Assistant Professor Jason Burton 

Department of Digitalization 

 

Aims and Objectives

This course is designed for doctoral students who are interested in applying computational research methods for social science research. The overarching objectives of the course are to (1) familiarize students with key concepts in the field of computational social science and (2) equip them with practical knowledge of computational research methods to apply to their own research interests. Special focus is given to the collection and analysis of digital trace data and agent-based modeling and simulation.
 

Learning Objectives

At the end of the course, students should be able to:

  • Critically discuss the emergence of computational social science as a field of research;
  • Reflect on the opportunities and challenges of applying computational research methods to their own domain of interest, and social science at large;
  • Collect digital trace data via APIs and web scraping;
  • Statistically analyze digital trace data and interpret results;
  • Build agent-based models of social systems and run simulations;
  • Develop and present a proof-of-concept study applying computational research methods to their own domain of interest.
     

Structure and Format

Three weekly full-day sessions plus one half-day session to wrap up the course. Each full-day session consists of lectures, group discussions, and hands-on exercises with the R programming language.

  

Course Project

The course project is designed to assess each participant's understanding of the topics covered in class. The course project requires each student to develop a research study that applies one or more of the computational research methods covered. The course project report should be up to ten pages and must include:

·       A well-defined and motivated research question (ideally oriented in their ongoing doctoral work);

·       Justified selection of one or more computational research methods;

·       An overview of potential results and research impact;

·       A proof-of-concept analysis (e.g., descriptive analysis of a newly collected digital trace dataset; a visualization of preliminary agent-based simulations).

On the last day of the course, students will deliver a short oral presentation of their project and receive feedback from the teacher and their peers.
 

Evaluation
 

A Pass/Fail grade will be based on the timely submission of a 10-page course project paper and the quality of the oral presentation in the last session. A retake exam, if necessary, will be administered about a month following the ordinary exam.

 

Prerequisite Statistical Software
 

This course will use the R programming language and the RStudio IDE. Before the first class session, students should download R and RStudio here: https://posit.co/download/rstudio-desktop/. Students with no prior experience using are strongly encouraged to complete an introductory tutorial before starting the course (e.g., sections 1-9 of R for Data Science, and/or the first six of these Posit Primers).
 

Readings
 

This course is research-based, and the required readings primarily consist of academic articles. See the course plan below for assigned readings for each session. Students should complete the readings before each session and be prepared to discuss and answer questions pertaining to the reading material.


Workload
 

Pre-class preparation

54 Hours

Class sessions

21 Hours

Project and presentation preparation

23 Hours

 

 

TOTAL

98 Hours

1 ECTS = 28 hrs

 

Course Plan
 

Session

Week

Time

Topic

1

28 April

09:00-16:00

Introduction to Computational Social Science

2

07 May

09:00-16:00

Analyzing digital trace data

3

08 May

09:00-16:00

Agent-based modeling and simulation

4

09 May

09:00-12:00

Integration and project presentations

 

Session 1: Introduction to Computational Social Science

Overview of CSS as a field: motivation, methods, applications, ethics.

Exercise:

·       Basic web scraping

·       Collecting data from APIs

Required reading:

·       Lazer, D., Pentland, A., Adamic, L., Aral, S., Barabási, A. L., Brewer, D., ... & Van Alstyne, M. (2009). Computational social science. Science323(5915), 721-723. [Link]

·       Kramer, A. D., Guillory, J. E., & Hancock, J. T. (2014). Experimental evidence of massive-scale emotional contagion through social networks. Proceedings of the National Academy of Sciences, 111(24), 8788. [Link]

 

Session 2: Analyzing digital trace data

Recognize key characteristics of digital trace data (e.g., big; nonreactive; incomplete; confounded) and the opportunities and challenges they pose to social science research. Understand different methodological use cases (e.g., prediction vs. explanation) and real-world applications.

Exercise:

·       Basic text analysis

·       Descriptive social network analysis

Required reading:

·       Lazer, D., Hargittai, E., Freelon, D., Gonzalez-Bailon, S., Munger, K., Ognyanova, K., & Radford, J. (2021). Meaningful measures of human society in the twenty-first century. Nature595(7866), 189-196. [Link]

·       Tufekci, Z. (2014). Big questions for social media big data: Representativeness, validity and other methodological pitfalls. In Proceedings of the international AAAI conference on web and social media (Vol. 8, No. 1, pp. 505-514). [Link]

 

Session 3: Agent-based modeling and simulation

Understand what an agent-based model is and what makes a "good" model. Appreciate seminal models, survey real-world applications, and learn how to critique simulation results.

Exercise:

·       Program an agent-based model of opinion dynamics, run simulations, and interpret results.

Required reading:

  • Epstein, J. M. (1999). Agent‐based computational models and generative social science. Complexity4(5), 41-60. [Link]
  • Bonabeau, E. (2002). Agent-based modeling: Methods and techniques for simulating human systems. Proceedings of the National Academy of Sciences of the United States of America99(suppl. 3), 7280–7287. [Link]
  • Smaldino, P. E. (2017). Models are stupid, and we need more of them. In R. R. Vallacher, S. J. Read, & A. Nowak (Eds.), Computational social psychology (pp. 311–331). Routledge. [Link]

 

Session 4: Integration and project presentations

In this session, we will wrap up the course material and provide each student with an opportunity to deliver an oral presentation of their course project.