Statistical Methods for Causal Inference and Prediction

FaculteitTechniek, Bestuur en Management

NiveauMaster

Studiejaar2025-2026

Beschrijving

Clinical trials in biomedical research aim to determine whether a treatment is effective. Do patients receiving a COVID vaccine fare better than those in the control group who do not receive it? Similarly, A/B tests in online business research seek to evaluate whether a new website design boosts sales: Do visitors viewing page version A spend more than those viewing version B? Social scientists ask causal questions, too: When households face real-time electricity prices that vary by the hour, do they really use less electricity when the price is high? By how much does a feed-in tariff promote investment in sustainable energy sources? Does raising the minimum wage increase unemployment? Does banning digital devices in classrooms improve student performance?

In all these cases, the goal is to estimate a causal effect, and the preferred method is the randomized experiment. However, not all experiments researchers wish to conduct can be done or should be done – there are practical and ethical constraints. As a result, researchers make do with data collected through passive observation. But this raises a challenge: How can you distinguish causation from mere correlation?

The adage “correlation does not imply causation” is true but not always actionable. Scientists and practitioners strive to go beyond correlations to make robust causal claims. Over the past three decades, the “causal revolution” has reshaped empirical research in the health, social, and behavioral sciences. By leveraging counterfactual thinking and tools like causal diagrams, researchers can draw credible causal inferences from observational data.

The modern causal inference toolkit is widely used not only in science but also in business and public policy. Uber and many other companies routinely apply causal methods to enhance customer satisfaction and achieve their strategic goals. During the COVID crisis, causal epidemiological studies based on observational data guided critical policymaking. Governments and international organizations, as part of the evidence-based policy making agenda, conduct ex-post impact evaluations to assess whether policies exert causal effects on outcomes of interest.

This master-level course equips students with the theoretical foundations and practical tools needed to quantify cause-and-effect relationships in complex socio-technical systems. It covers the linear regression model and ordinary least squares estimation, the potential outcomes framework, causal diagrams, and instrumental variables estimation.

The course teaches the statistical theory upon which the statistical methods are based. To build your understanding, the ideas are illustrated with example applications drawn from recent research in different domains (e.g., transport, energy, health) and scientific disciplines (e.g., economics, political science, and epidemiology).

You will acquire coding skills along the way as a by-product of working through the examples and doing a replication. A replication, loosely speaking, is a follow-up study with the goal of evaluating the validity of a previous scientific study. You will replicate the results of a published scientific article. This means downloading the raw data, processing it, and analyzing it to check whether the published results hold up to scrutiny.

Reviews0 reviews

Nog geen reviews voor dit vak. Wees de eerste!

Heb jij dit vak gevolgd?

Deel je ervaring met toekomstige studenten. Inloggen met je TU Delft mailadres duurt één minuut.

Schrijf een review