Below is a list of the final projects for the Spring 2019 semester, including a link to the original paper, the students’ final report, and all code and data necessary to reproduce the final report.
Continue reading
In this lecture we discussed causal inference, randomized experiments, and natural experiments.
Continue reading
We spent this lecture discussing representations and characteristics of networks and algorithms for analyzing network data.
Continue reading
We used this lecture to first go through applications of logistic regression and then to discuss the history of network science.
Continue reading
In this lecture we covered classification with linear models, specifically naive Bayes and logistics regression.
Continue reading
This was the second lecture on the theory and practice of regression, focused on model complexity and generalization.
Continue reading
This was the first of two lectures on the theory and practice of regression.
Continue reading
This was our second lecture on reproducibility and replication in which we discussed false discoveries, effect sizes, and p-hacking / researcher degrees of freedom.
Continue reading
We discussed the ongoing replication crisis in the sciences, wherein it has proven difficult or impossible for researchers to independently verify results of previously published studies.
Continue reading
Counting is surprisingly useful for understanding and summarizing social data. The key is figuring out what to count and how to count it efficiently.
Continue reading
We used our first lecture to look at case studies in four main areas: exploratory data analysis, classification, regression, and working with network data.
Continue reading
This class will involve a good deal of coding, for which you will need some basic tools. Please make sure to set up the following tools after the first day of class.
Continue reading