Modeling Social Data

  • Syllabus
  • Posts
  • Projects
  • Scribing

Final project reports

May 17, 2019

Below is a list of the final projects for the Spring 2019 semester, including a link to the original paper, the students’ final report, and all code and data necessary to reproduce the final report.

Continue reading

Lecture 12: Causality & Experiments

April 26, 2019

In this lecture we discussed causal inference, randomized experiments, and natural experiments.

Continue reading

Lecture 11: Networks II

April 12, 2019

We spent this lecture discussing representations and characteristics of networks and algorithms for analyzing network data.

Continue reading

Homework 4

April 10, 2019

The fourth homework assignment, posted on Github, is due on Thursday, April 25 by 11:59pm ET.

Continue reading

Lecture 10: Networks I

April 5, 2019

We used this lecture to first go through applications of logistic regression and then to discuss the history of network science.

Continue reading

Lecture 9: Classification

March 29, 2019

In this lecture we covered classification with linear models, specifically naive Bayes and logistics regression.

Continue reading

Homework 3

March 29, 2019

The third homework assignment, posted on Github, is due on Thursday, April 11 by 11:59pm ET.

Continue reading

Lecture 8: Regression, Part 2

March 15, 2019

This was the second lecture on the theory and practice of regression, focused on model complexity and generalization.

Continue reading

Lecture 7: Regression, Part 1

March 8, 2019

This was the first of two lectures on the theory and practice of regression.

Continue reading

Lecture 6: Reproducibility and replication, Part 2

March 1, 2019

This was our second lecture on reproducibility and replication in which we discussed false discoveries, effect sizes, and p-hacking / researcher degrees of freedom.

Continue reading

Homework 2

February 28, 2019

The second homework assignment, posted on Github, is due on Thursday, March 14 by 11:59pm ET.

Continue reading

Lecture 5: Reproducibility and replication, Part 1

February 22, 2019

We discussed the ongoing replication crisis in the sciences, wherein it has proven difficult or impossible for researchers to independently verify results of previously published studies.

Continue reading

Lecture 4: Data Visualization

February 15, 2019

We used this lecture to discuss data manipulation and data visualization in R, specifically focusing on dplyr and ggplot2 from the tidyverse.

Continue reading

Lecture 3: Computational complexity

February 8, 2019

We had a guest lecture from Sid Sen on computational complexity and algorithm analysis.

Continue reading

Homework 1

February 7, 2019

The first homework assignment, posted on Github, is due on Thursday, February 21 by 11:59pm ET.

Continue reading

Lecture 2: Introduction to Counting

February 1, 2019

Counting is surprisingly useful for understanding and summarizing social data. The key is figuring out what to count and how to count it efficiently.

Continue reading

Lecture 1: Overview

January 25, 2019

We used our first lecture to look at case studies in four main areas: exploratory data analysis, classification, regression, and working with network data.

Continue reading

Installing tools

January 24, 2019

This class will involve a good deal of coding, for which you will need some basic tools. Please make sure to set up the following tools after the first day of class.

Continue reading

Powered by Jekyll with Type Theme