Social statistics is the branch of statistics devoted to the application and development of quantitative methods tailored to the social and human sciences. Focusing on human behaviour, social structures, and societal trends, it aims to answer questions about populations, relationships, and collective patterns in societies. At the heart of social statistics is the recognition that social-science data often come from complex, messy real-world sources — surveys, longitudinal studies, administrative records, or observational data across time and groups.
Members of our social statistics research group have experience in a range of social science disciplines, including demography, education, epidemiology, psychology and sociology. We work on a range of statistical methods for answering questions across the social and human sciences, such as those relating to out-of-sample prediction based on complex data, description of population relationships using data from surveys and other sources, or causal inference from experimental or observational studies using approaches such as regression discontinuity, interrupted time series and synthetic and negative control designs. Data in these applications are often complex, high-dimensional and challenging to analyse. We develop methods that can cope with this complexity, such as:
· multivariate analysis of high-dimensional data;
· analysis of clustered data with complex correlation structures such as multivariate longitudinal data and multiprocess survival data; detection of outliers;
· analysis of problems with missing data, drop-out, misclassification and measurement error;
· and dealing with non-informative missingness in the presence of time-varying confounding in causal inference; and combining data from multiple different sources.
We develop and employ various statistical frameworks, models, methods of estimation, and computational algorithms such as:
· different types of latent variable, mixture and random effects models for continuous and categorical variables;
· Gaussian processes;
· interpretable machine learning methods;
· marginal modelling;
· composite likelihood methods;
· models for dependence using reproducing kernel Hilbert space methods;
· Markov decision and reinforcement learning methods;
· Bayesian methods;
· and computationally efficient Markov Chain Monte Carlo and sequential Monte Carlo computational techniques to facilitate parameter estimation, statistical inference, model choice, and prediction.
Faculty
Wicher Bergsma - Professor
Reproducing kernels, dependence modelling, graphical models, I-priors; categorical data
Wicher’s current research is focused on statistical modelling and testing using reproducing kernels and (empirical) Bayes techniques. In this context, he developed the I-prior methodology for parametric and nonparametric regression models, which is often simpler and better performing than competing methods. He isparticularly interested in graphical models and conditional independence testing. Wicher is known for his work in categorical data analysis, in particular marginal models which arise when there are nuisance dependencies in the data. With Angelos Dassios he developed the tau-star test, a scale invariant consistent test of independence.
Wicher joined LSE as a Lecturer in 2005, after completing postdoctoral fellowships at Eurandom in Eindhoven and Tilburg University. Prior to that, he studied Mathematics at Leiden University, did his PhD in Social Statistics at Tilburg University, and worked for one year as an Assistant Professor in Statistics at the Central European University in Budapest, Hungary. He has experience in the application of statistical methods in industry, in particular through projects with GlaxoSmithKline on clinical trials, and with Flextronics and Oce on fault detection in photocopying machines.
Yunxiao Chen - Assistant Professor
Latent variable models, high-dimensional multivariate analysis, empirical Bayes, process data sequential decision
Dr Yunxiao Chen's research focuses on the development of statistical and computational methods for solving problems in social and behavioral sciences, under three interrelated topics including (1) large-scale item response data analysis, (2) measurement and predictive modeling based on dynamic behavioral data and (3) sequential design of dynamic systems, with applications to educational assessment and learning.
Before joining LSE, Dr. Chen was an assistant professor in the Department of Psychology and the Institute for Quantitative Theory and Methods at Emory University. He completed his Ph.D. in Statistics at Columbia University in 2016.
Sara’s research interests centre around causal inference. This is the area of statistical methodology concerned with identifying and estimating effects of interventions. She is co-investigator in an MRC project that uses routinely collected medical data in a regression discontinuity design to estimate the effect of drugs in primary care. Methods for making causal inference often involve adjusting for different types of bias: by selection or due to confounding. These in turn can require the use of data from multiple sources to improve inference. Thus evidence synthesis is also an area of research that Sara is interested in. Sara is also a Bayesian and all her research is embedded in this paradigm.
Kostas’ research focuses on developing and applying advanced computational methods, such as Markov Chain and Sequential Monte Carlo, for Bayesian Inference. His methodology has mostly targeted continuous time probability models based on stochastic differential equations driven by standard or fractional Brownian motion. The areas of application include Financial and Econometric Time Series as well biomedical problems such as stochastic epidemic models and analysis of growth curves.
Prior to joining the Statistics Department of LSE, he was a post-doctoral researcher at the University of Cambridge, in the Signal Processing Laboratory of the Engineering Department. He completed his PhD (2007) in the Statistics Department of the Athens University of Economics and Business while spending some time at University of Lancaster.
Jouni Kuha - Professor
Categorical data, incomplete data problems, latent variable modelling, survey data analysis
Jouni Kuha is a social statistician whose methodological research focuses on latent variable modelling, analysis of survey data, problems with measurement error and missing data, and other topics that are motivated by applications of statistics in the social sciences.
He also collaborates with social scientists at the LSE and elsewhere on substantive research projects on a range of topics, such as the role of education in social class mobility, public attitudes to the police, estimating the prevalence of problem gambling, intergenerational exchanges of family support, and availability of essential medicines across the world. He has been a member of the analysis team of the broadcasters’ exit poll for the five most recent UK General Elections.
Before joining the LSE, Jouni was a postdoctoral researcher at Nuffield College, Oxford, and Assistant Professor of Statistics at the Pennsylvania State University. He completed his MSocSc in Statistics at the University of Helsinki in 1992, and his PhD in Social Statistics at the University of Southampton in 1996. He was elected a Fellow of the British Academy in 2021.
Irini Moustaki - Professor and Deputy Head of Department (Teaching)
Latent variable and structural equation models, estimation methods, treatment of missing values, outlier detection, categorical data analysis
Irini’s work is on the development of statistical methodology for analysing large and complex data sets. Her research interests and methodological contributions are in latent variable modelling for categorical and mixed outcomes, structural equation modelling, estimation methods, goodness-of-fit testing, detection of outliers and treatment of missing values and drop out in longitudinal studies. Her areas of application are in social sciences, education, psychiatry and health.
She completed her PhD in 1996 in the Department of Statistics at the LSE. She has co-authored books on latent variable models and multivariate data analysis. She is co-founder of the Psychometric Lab in the Department of Statistics (https://psychometricslab.com/). She received an honorary doctorate from the Faculty of Social Sciences at the University of Uppsala in 2014, she was editor-in-chief of the journal Psychometrika (2014-2018) and President of the Psychometric Society (2022-2023).
Fiona Steele - Professor
Multilevel modelling, longitudinal data analysis, event history analysis, multivariate analysis, dyadic data
Fiona Steele’s research interests are in developments of statistical methods that are motivated by social science problems. Her areas of expertise include longitudinal data analysis, multilevel modelling, event history analysis and dyadic data analysis. She has worked on a range of applications in demography (e.g. residential mobility, union formation and dissolution, and contraceptive use dynamics), education (the consequences of parental divorce for children’s educational outcomes, the impact of school resources on pupil attainment), family psychology (reciprocal influences between parents and children, sibling interactions), and health (child health, mental health and employment transitions, determinants and consequences of stress among nurses).
She has directed several research grants funded by the Economic and Social Research Council (ESRC), including the LEMMA node of the National Centre for Research Methods, a project on the interrelationships between housing transitions and fertility in Britain and Australia, and a project on (co-funded by EPSRC). As part of the LEMMA project, she led the development of a popular online training course on multilevel modelling which currently has over 20,000 registered users worldwide.
Fiona first joined in LSE in 1996 as Lecturer in Statistics and Research Methodology. She then worked at the Institute of Education, (now part of UCL) 2001-2005, followed by the University of Bristol 2005-2013 where she was Professor of Social Statistics and Director of the Centre for Multilevel Modelling. She returned to LSE in 2013.
Fiona was awarded the Royal Statistical Society (RSS) Guy Medal in Bronze in 2008 and elected a Fellow of the British Academy in 2009. She was appointed an Officer of the Order of the British Empire (OBE) in 2011 and a CBE in 2022 for services to social sciences. In 2024 she was awarded the RSS Howard Medal for her contributions to social statistics.
Research students
Motonori Oka Research interests: Statistics and machine/deep learning for the education and social sciences