Computational Research Methods
CBS PhD School
Course Coordinators: Michel Avital and Jason Burton, Department of Digitalization (DIGI)
Faculty
Assistant Professor Jason Burton
Department of Digitalization
Aims and Objectives
This course is designed for doctoral students who are interested in applying computational research methods for social science research. The overarching objectives of the course are to (1) familiarize students with key concepts in the field of computational social science and (2) equip them with practical knowledge of computational research methods to apply to their own research interests. Special focus is given to the collection and analysis of digital trace data and agent-based modeling and simulation.
Learning Objectives
At the end of the course, students should be able to:
- Critically discuss the emergence of computational social science as a field of research;
- Reflect on the opportunities and challenges of applying computational research methods to their own domain of interest, and social science at large;
- Collect digital trace data via APIs and web scraping;
- Statistically analyze digital trace data and interpret results;
- Build agent-based models of social systems and run simulations;
- Develop and present a proof-of-concept study applying computational research methods to their own domain of interest.
Structure and Format
Three weekly full-day sessions plus one half-day session to wrap up the course. Each full-day session consists of lectures, group discussions, and hands-on exercises with the R programming language.
Course Project
The course project is designed to assess each participant's understanding of the topics covered in class. The course project requires each student to develop a research study that applies one or more of the computational research methods covered. The course project report should be up to ten pages and must include:
· A well-defined and motivated research question (ideally oriented in their ongoing doctoral work);
· Justified selection of one or more computational research methods;
· An overview of potential results and research impact;
· A proof-of-concept analysis (e.g., descriptive analysis of a newly collected digital trace dataset; a visualization of preliminary agent-based simulations).
On the last day of the course, students will deliver a short oral presentation of their project and receive feedback from the teacher and their peers.
Evaluation
A Pass/Fail grade will be based on the timely submission of a 10-page course project paper and the quality of the oral presentation in the last session. A retake exam, if necessary, will be administered about a month following the ordinary exam.
Prerequisite Statistical Software
This course will use the R programming language and the RStudio IDE. Before the first class session, students should download R and RStudio here: https://posit.co/download/rstudio-desktop/. Students with no prior experience using are strongly encouraged to complete an introductory tutorial before starting the course (e.g., sections 1-9 of R for Data Science, and/or the first six of these Posit Primers).
Readings
This course is research-based, and the required readings primarily consist of academic articles. See the course plan below for assigned readings for each session. Students should complete the readings before each session and be prepared to discuss and answer questions pertaining to the reading material.
Workload
Pre-class preparation |
54 Hours |
Class sessions |
21 Hours |
Project and presentation preparation |
23 Hours |
|
|
TOTAL |
98 Hours |
1 ECTS = 28 hrs
Course Plan
Session |
Week |
Time |
Topic |
1 |
28 April |
09:00-16:00 |
Introduction to Computational Social Science |
2 |
07 May |
09:00-16:00 |
Analyzing digital trace data |
3 |
08 May |
09:00-16:00 |
Agent-based modeling and simulation |
4 |
09 May |
09:00-12:00 |
Integration and project presentations |
Session 1: Introduction to Computational Social Science
Overview of CSS as a field: motivation, methods, applications, ethics.
Exercise:
· Basic web scraping
· Collecting data from APIs
Required reading:
· Lazer, D., Pentland, A., Adamic, L., Aral, S., Barabási, A. L., Brewer, D., ... & Van Alstyne, M. (2009). Computational social science. Science, 323(5915), 721-723. [Link]
· Kramer, A. D., Guillory, J. E., & Hancock, J. T. (2014). Experimental evidence of massive-scale emotional contagion through social networks. Proceedings of the National Academy of Sciences, 111(24), 8788. [Link]
Session 2: Analyzing digital trace data
Recognize key characteristics of digital trace data (e.g., big; nonreactive; incomplete; confounded) and the opportunities and challenges they pose to social science research. Understand different methodological use cases (e.g., prediction vs. explanation) and real-world applications.
Exercise:
· Basic text analysis
· Descriptive social network analysis
Required reading:
· Lazer, D., Hargittai, E., Freelon, D., Gonzalez-Bailon, S., Munger, K., Ognyanova, K., & Radford, J. (2021). Meaningful measures of human society in the twenty-first century. Nature, 595(7866), 189-196. [Link]
· Tufekci, Z. (2014). Big questions for social media big data: Representativeness, validity and other methodological pitfalls. In Proceedings of the international AAAI conference on web and social media (Vol. 8, No. 1, pp. 505-514). [Link]
Session 3: Agent-based modeling and simulation
Understand what an agent-based model is and what makes a "good" model. Appreciate seminal models, survey real-world applications, and learn how to critique simulation results.
Exercise:
· Program an agent-based model of opinion dynamics, run simulations, and interpret results.
Required reading:
- Epstein, J. M. (1999). Agent‐based computational models and generative social science. Complexity, 4(5), 41-60. [Link]
- Bonabeau, E. (2002). Agent-based modeling: Methods and techniques for simulating human systems. Proceedings of the National Academy of Sciences of the United States of America, 99(suppl. 3), 7280–7287. [Link]
- Smaldino, P. E. (2017). Models are stupid, and we need more of them. In R. R. Vallacher, S. J. Read, & A. Nowak (Eds.), Computational social psychology (pp. 311–331). Routledge. [Link]
Session 4: Integration and project presentations
In this session, we will wrap up the course material and provide each student with an opportunity to deliver an oral presentation of their course project.