##
ST456 Half Unit

Deep Learning

**This information is for the 2022/23 session.**

**Teacher responsible**

Prof Milan Vojnovic COL5.05

**Availability**

This course is available on the MSc in Applicable Mathematics, MSc in Applied Social Data Science, MSc in Data Science, MSc in Geographic Data Science, MSc in Health Data Science, MSc in Management of Information Systems and Digital Innovation, MSc in Operations Research & Analytics, MSc in Quantitative Methods for Risk Management, MSc in Statistics, MSc in Statistics (Financial Statistics), MSc in Statistics (Financial Statistics) (LSE and Fudan), MSc in Statistics (Financial Statistics) (Research), MSc in Statistics (Research), MSc in Statistics (Social Statistics) and MSc in Statistics (Social Statistics) (Research). This course is available with permission as an outside option to students on other programmes where regulations permit.

MSc Data Science students will be given priority for enrollment in this course.

**Pre-requisites**

The course requires some mathematics, in particular some use of vectors and some calculus. Basic knowledge of computer programming is expected, mainly Python.

**Course content**

This course is about deep learning, covering fundamental concepts of deep learning and neural networks, design of neural network architectures, optimisation methods for training neural networks, and neural networks design for particular purposes such as image recognition, sequence modelling, natural language processing and generative models. The course will cover the following topics:

- Introduction – course overview
- Introduction to neural networks – single-layer networks, linear discriminant functions, XOR problem, perceptron, multi-layer perceptron, perceptron learning criteria, perceptron learning algorithm, feedforward neural network architecture
- Optimisation algorithms – empirical loss function minimisation, gradient descent algorithm, stochastic gradient descent algorithm
- Advanced optimisation algorithms – adaptive learning rates, momentum, backpropagation, dropout
- Convolutional neural networks (CNNs) – principles and basic operations of convolutional neural networks, LeNet example
- Modern convolutional neural networks – understanding principles of some modern CNN architectures, including AlexNet, VGGNet, NiN, GoogLeNet, ResNet, and DenseNet
- Recurrent neural networks (RNNs) – RNN models, training RNNs, gated RNNs, GRU, LSTM, Deep RNNs, bidirectional RNNs, vector to sequence models using RNNs
- Sequence to sequence models – machine translation tasks, encoder-decoder architecture, attention mechanisms, transformer
- Autoencoders – introduction to autoencoders, linear factor models, PCA and probabilistic PCA, sparse coding, autoencoders, variational autoencoders
- Generative adversarial networks (GANs) – introduction to GANs, GAN architecture and training, Wasserstein GANs, Wasserstain GANs with gradient penalty

**Teaching**

20 hours of lectures and 15 hours of classes in the LT.

This course will be delivered through a combination of classes, and lectures and Q&A sessions totalling a minimum of 35 hours across Lent Term. This course includes a reading week in Week 6 of Lent Term.

**Formative coursework**

Students will be expected to produce 8 problem sets in the LT.

**Indicative reading**

- Ian Goodfellow, Yoshua Bengio and Aaron Courville, Deep Learning, MIT Press, 2016, https://www.deeplearningbook.org/
- Aston Zhang, Zachary C. Lipton, Mu Li, and Alexander J. Smola, Dive into Deep Learning, https://d2l.ai/
- TensorFlow – An end-to-end open source machine learning platform, https://www.tensorflow.org/

**Assessment**

Project (80%) in the LT.

Continuous assessment (10%) in the MT Week 4.

Continuous assessment (10%) in the MT Week 9.

Two of the problem sets submitted by students weekly will be assessed (20% in total). Each problem set will have an individual mark of 10% and submission will be required in LT Weeks 4 and 9. In addition, there will be a take-home exam (80%) in the form of a group project in which they will demonstrate their ability to develop and evaluate neural network algorithms for solving a prediction or classification task of their choice.

** Key facts **

Department: Statistics

Total students 2021/22: 58

Average class size 2021/22: 29

Controlled access 2021/22: Yes

Lecture capture used 2021/22: Yes (LT)

Value: Half Unit

**Course selection videos**

Some departments have produced short videos to introduce their courses. Please refer to the course selection videos index page for further information.

**Personal development skills**

- Self-management
- Problem solving
- Application of information skills
- Communication
- Application of numeracy skills
- Commercial awareness
- Specialist skills