DS101L      Half Unit
Fundamentals of Data Science

This information is for the 2021/22 session.

Teacher responsible

Prof Kenneth Benoit PEL.4.01C

Availability

This module is designed for students on social science degree programmes who do not have A-level Mathematics (e.g. in Anthropology, Law, and Social Policy).

Course content

This course is designed to introduce students to data science and its practice: how it works and how it can produce insights from social, political, and economic data. It combines accessible knowledge in data science as a field of study, with practical knowledge about data science as a career path. By combining case studies in applications of both with the study of the content of data science, it aims for a coverage of data science that is both pedagogic but accessible, as well as fundamentally applied and practical. It combines three perspectives: inferential thinking, computational thinking, and real-world relevance.

The topics covered include:

  • the fundamentals of the data science approach, with an emphasis on social scientific analysis and the study of the social, political, and economic worlds;
  • a survey of the forms of data and the challenges of working with data, including an overview of databases;
  • the basis of computational thinking and algorithmic design;
  • an introduction to the logic of statistical inference including probability and probability distributions and how they form the basis for statistical decision-making;
  • a survey of the basic techniques of statistical learning and machine learning, including a comparison of different approaches, including supervised and unsupervised methods;
  • how to integrate the insights from data analytics into knowledge generation and decision-making;
  • examples of methods for working with unstructured data, such as text mining.

Our applications are drawn from the social science fields represented at the LSE but also from private and public sector non-academic examples.

Teaching

16 hours and 40 minutes of lectures and 7 hours and 30 minutes of classes in the LT.

A combination of classes and lectures totalling 30 hours across Lent Term.

Reading week in Week 6.

Formative coursework

Students will be expected to produce 9 other pieces of coursework and 1 other piece of coursework in the LT.

Students will be presented with guided questions to answer in completing each week's reading, and discuss these in each class session. They will participate actively in presenting the answers of the questions to the group.

Indicative reading

  • Saltz, J. S., & Stanton, J. M. (2017). An introduction to data science. Sage Publications.
  • Denning, P. J., & Tedre, M. (2019). Computational thinking. MIT Press.
  • Shan, C. (2015). The Data Science Handbook: Advice and Insights from 25 Amazing Data Scientists. Data Science Bookshelf.
  • Schutt, R., & O'Neil, C. (2014). Doing data science: Straight talk from the frontline. O'Reilly.
  • Knaflic, C. N. (2015). Storytelling with data: A data visualization guide for business professionals. John Wiley & Sons.
  • Mayer-Schönberger, V., & Cukier, K. (2013). Big data: A revolution that will transform how we live, work, and think. Houghton Mifflin Harcourt.

 

Assessment

Essay (30%, 1500 words) and presentation (10%) in the LT.
Essay (60%, 2000 words) in the ST.

Course selection videos

Some departments have produced short videos to introduce their courses. Please refer to the course selection videos index page for further information.

Important information in response to COVID-19

Please note that during 2021/22 academic year some variation to teaching and learning activities may be required to respond to changes in public health advice and/or to account for the differing needs of students in attendance on campus and those who might be studying online. For example, this may involve changes to the mode of teaching delivery and/or the format or weighting of assessments. Changes will only be made if required and students will be notified about any changes to teaching or assessment plans at the earliest opportunity.

Key facts

Department: Data Science Institute

Total students 2020/21: Unavailable

Average class size 2020/21: Unavailable

Capped 2020/21: No

Value: Half Unit

Guidelines for interpreting course guide information

Personal development skills

  • Self-management
  • Team working
  • Problem solving
  • Application of information skills
  • Communication
  • Application of numeracy skills
  • Commercial awareness
  • Specialist skills