Skip to main content

ME314: Introduction to Data Science and Machine Learning

Subject Area: Research Methods, Data Science, and Mathematics

Apply now

Course details

  • Department
    Data Science Institute
  • Application code
    SS-ME314
Dates
Session oneNot running in 2025
Session twoOpen - 14 Jul 2025 - 1 Aug 2025
Session threeNot running in 2025

Apply

Applications are open

We are accepting applications. Apply early to avoid disappointment.

Overview

Data Science and Big Data Analytics are exciting new areas that combine scientific inquiry, statistical knowledge and computer programming.

Organisations are turning to customer data in order to innovate and respond quickly to shifts in the market. Meanwhile, Governments are using to data to help guide policy decisions, making this a prime area for social scientists with an interest in quantitative methods.

This course aims to provide an introduction to the quantitative analysis of data, blending classical statistical methods with recent advances in computational and machine learning. You will cover key topics such as the challenges of analysing big data using statistical methods, and how machine learning and data science can aid in knowledge generation and improve decision-making. You will also explore some of the most exciting frontiers of data science, including large language models, generative AI and “unstructured” data like images and map data.

Engaging with leading faculty, you will cover the main analytical methods from this field and the hands-on application of these methods using example datasets. As a result, the course allows you to gain experience and confidence in using the methods covered during the course in different contexts.

Key information

Prerequisites: Students should be familiar with quantitative methods at an introductory level, up to linear regression analysis, and should be generally comfortable engaging quantitative notation and concepts. The course will use the R and python programming languages, so some exposure to computer programming with either language is encouraged although not formally required.

Students will be expected to have a laptop that they can bring to class, which can run the required statistical software and do the required data analysis—for example, a recent MacBook Pro/Air or Windows/Linux laptop with at least 8 GB RAM, and not a Chromebook or a tablet. Please contact the instructors if you have any questions or concerns.

Level: 300 level. Read more information on levels in our FAQs

Fees: Please see Fees and payments

Lectures: 36 hours

Classes: 18 hours

Assessment: Two take-home assessments

Typical credit: 3-4 credits (US) 7.5 ECTS points (EU)

Please note: Assessment is optional but may be required for credit by your home institution. Your home institution will be able to advise how you can meet their credit requirements. For more information on exams and credit, read Teaching and assessment

Is this course right for you?

This course is suitable if you already have prior training in quantitative methods and coding, and want to enhance this training with the fundamental concepts and techniques of Data Science and Data Analytics. It is also suitable for practitioners from industry, government, or research organisations with some basic training in quantitative analysis or computer programming.

The course surveys diverse techniques and methods, making it an ideal foundation for more advanced or specific training. If you are targeting a role in government, politics, data science, research, law, business management, consulting or marketing you should consider this course.

Outcomes

  • Gain practical experience working with data using statistical methods, R, Python, and modern data science workflows

  • Learn how to draw credible conclusions from data and connect empirical results to real-world questions, theories, and decisions

  • Learn to acquire, clean, transform, store and analyse both structured and unstructured data from diverse sources, including APIs and the web

  • Apply core techniques in data wrangling, visualisation, regression, and statistical reasoning

  • Understand the basics of statistical inference including probability and probability distributions, modelling and experimental design

  • Gain an overview of machine learning methods and related techniques for assessing model fit and cross-validating predictive models

  • Learn techniques for text-as-data analysis, including working with large language models and data from social media

  • Build the ability to use generative AI effectively in applied data science workflows, including critically evaluating outputs

Content

Jonathan Tam, Canada

The fundamentals of my course are covered at my home institution, but the summer school course gives me an extra breadth into how the industry works. It’s been a really good experience in diversifying my skill set.

Faculty

The design of this course is guided by LSE faculty, as well as industry experts, who will share their experience and in-depth knowledge with you throughout the course.

Daniel De Kadt

Dr Daniel De Kadt

Assistant Professor

Ryan Hübert

Dr Ryan Hübert

Associate Professor of Computational Social Science

Department

The Data Science Institute (DSI) forms the institutional cornerstone of data science activity at the London School of Economics and Political Science. Working alongside the academic departments across the School, the DSI's mission is to foster the study of data science and new forms of data with a focus on their social, economic, and political aspects.

The DSI aims to host, facilitate and promote research in social and economic data science through an annual programme of seminars, workshops and research projects delivered by a range of academic experts and research students.

Apply

Applications are open

We are accepting applications. Apply early to avoid disappointment.