ST505 Half Unit
Statistical Modelling and Data Analysis
This information is for the 2022/23 session.
Teacher responsible
Prof Wicher Bergsma
Availability
This course is available on the MPhil/PhD in Statistics. This course is available with permission as an outside option to students on other programmes where regulations permit.
Pre-requisites
A knowledge of probability and statistical theory to the level of ST102 and ST206, and linear regression to the level of ST211. Previous programming experience is not required but students who have no previous experience in R must complete an online pre-sessional R course from the Digital Skills Lab before the start of the course (https://moodle.lse.ac.uk/course/view.php?id=7745).
Course content
This course provides an overview of modern applied statistics. It will cover an introduction to quantitative research design and causal inference, exploratory data analysis and data visualisation, generalised linear models, and generalised latent variable models (including mixed effects or multilevel models, longitudinal data analysis, and structural equation models). The course will have an applied emphasis with students gaining hands-on experience in data analysis using R and practice in the interpretation of results.
Teaching
This course will be delivered through a combination of classes, lectures and Q&A sessions totalling a minimum of 30 hours across Michaelmas Term. This course includes a reading week in Week 6 of Michaelmas Term.
Formative coursework
Students will be expected to produce 1 project in the MT.
Students will be given a real dataset and asked to analyse the data to answer scientific questions and then write a report. Students' reports will be marked and feedback will be given.
Indicative reading
Maindonald, J., & Braun, J. (2006). Data analysis and graphics using R: an example-based approach. Cambridge University Press
Gelman, A., & Hill, J. (2006). Data analysis using regression and multilevel/hierarchical models. Cambridge University Press.
Skrondal, A. and Rabe-Hesketh (2004) Generalized latent variable modeling : multilevel, longitudinal, and structural equation models. Chapman & Hall/CRC
Imbens, G. W. and Rubin, D. B. (2015). Causal Inference for Statistics, Social, and Biomedical Sciences : An Introduction
Cambridge University Press
Assessment
Project (30%, 1000 words) in the MT.
Take-home assessment (70%) in the LT.
The summative assessment will be based on one piece of coursework (30%) and one take-home exam (70%). For the coursework, students will be given a dataset in week 6 and asked to analyse the data to answer several scientific questions and submit a report in week 10. The take-home exam will be in January. The take-home exam should be no fewer than 3000 words and students will be asked to submit this within three days.
Key facts
Department: Statistics
Total students 2021/22: 4
Average class size 2021/22: 4
Value: Half Unit
Course selection videos
Some departments have produced short videos to introduce their courses. Please refer to the course selection videos index page for further information.
Personal development skills
- Self-management
- Problem solving
- Application of information skills
- Communication
- Application of numeracy skills
- Specialist skills