This information is for the 2019/20 session.
Teacher responsible
Dr Blake Miller COL.7.14
Availability
This course is available on the MPhil/PhD in Social Research Methods. This course is available with permission as an outside option to students on other programmes where regulations permit.
This course is available to research students only.
Pre-requisites
Applied Regression Analysis (MY452) or equivalent is required.
Course content
Machine learning uses algorithms to find patterns in large datasets and make predictions based on them. This course will use prominent examples from social science research to cover major machine learning tasks including regression, classification, clustering, and dimensionality reduction. A particular emphasis will be placed on the ethical issues surrounding machine learning applications, including privacy, algorithmic bias, and informed consent. Lectures will use case studies to introduce specific machine learning algorithms including LASSO, ridge regression, logistic regression, k-nearest neighbour classification, decision trees, support vector machines, k-means clustering, hierarchical clustering, principal component analysis, and linear discriminant analysis. Students will learn to apply these algorithms to data and validate and evaluate models. Students will work directly with social data and analyse these data using Python or R.
Teaching
20 hours of lectures and 10 hours of computer workshops in the MT.
The workshops will last two hours and take place every two weeks.
Formative coursework
Students will be expected to produce 1 problem set in the LT.
One structured problem set will be provided in the first weeks of the course. Students will start the problem set in the first computer workshop session and complete it outside of class.
Indicative reading
Assessment
In-class assessment (40%) in the MT.
Take-home assessment (60%) in the LT.
Four structured problem sets will be marked and will provide 40% of the mark. For the take home exam, students will be expected to submit a 3000-word paper in which they identify and contextualize a relevant social data science problem related to their dissertation research, find suitable data to address it, plan and conduct extensive machine learning analysis on the data, and present the findings.
Marking of these assessments will be at a level appropriate for PhD students.
Key facts
Department: Methodology
Total students 2018/19: Unavailable
Average class size 2018/19: Unavailable
Value: Half Unit
Personal development skills