Research Showcase 2024

Thursday 20 June - Friday 21 June 

Lee Kuan Yew Room, 5th floor, Arundel House
6 Temple Place, London WC2R 2PG

Please note: the location of this event has been changed from the Marshall Building (LSE Campus) to Arundel House (near Temple Station).

This two-day in-person event is an opportunity to find out more about the Department's research. It takes the form of a series of short talks and will provide an overview of the research activities of the Department's four research groups: Data Science, Probability in Finance and Insurance, Social Statistics, and Time Series and Statistical Learning.

The presentations will be accompanied by a poster session on the evening of 20 June, combined with a reception.

This event takes place from 10.00am to 8.00pm on Thursday 20 June and from 10.00am to 3.30pm on Friday 21 June 2024. 

Please register your place!

Thursday 20 June

9.57am-10.00am Zoltan Szabo
Opening Words

10.00am-10.30am Wicher Bergsma (Social Statistics)
Model-based estimation of a Gaussian covariance or precision kernel

In a Gaussian setting, a variety of useful models involve linear restrictions on the covariance kernel or the precision kernel. A key example is graphical models, which involve patterns of zeroes in the precision matrix. Alternatively, stationary Gaussian distributions involve linear restrictions on the covariance kernel or, equivalently, the precision kernel. Furthermore, covariate information can be encoded via linear restrictions, in order to improve both estimation and understanding of the population distribution.

As a mathematical framework for sets of linearly restricted positive definite kernels, incorporating the aforementioned examples, we introduce a class of families of reproducing kernel Krein spaces. For each family, a generalized Wishart/inverse-Wishart prior can serve as a prior on the convex cone of positive definite kernels, allowing an (empirical) Bayes estimator for the covariance or precision kernel. This approach also addresses the difficulty of ensuring that the estimated covariance/precision kernel is positive definite.
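
As a point of reference for the Wishart/inverse-Wishart idea, the sketch below shows its simplest finite-dimensional special case: a conjugate inverse-Wishart prior on the covariance matrix of a zero-mean Gaussian sample. The data, prior parameters, and use of scipy are illustrative assumptions; the reproducing kernel Krein space framework of the talk is not reproduced here.

```python
# Minimal sketch (illustrative only): Bayesian estimation of a covariance
# matrix with a conjugate inverse-Wishart prior, the finite-dimensional
# special case of the generalized Wishart/inverse-Wishart priors above.
import numpy as np
from scipy.stats import invwishart

rng = np.random.default_rng(0)

d, n = 3, 200
true_cov = np.array([[1.0, 0.5, 0.0],
                     [0.5, 1.0, 0.3],
                     [0.0, 0.3, 1.0]])
X = rng.multivariate_normal(np.zeros(d), true_cov, size=n)  # zero-mean data

# Prior: Sigma ~ IW(nu0, Psi0).  With a known zero mean, the posterior is
# IW(nu0 + n, Psi0 + X'X).
nu0, Psi0 = d + 2, np.eye(d)
nu_post, Psi_post = nu0 + n, Psi0 + X.T @ X

# Posterior mean of Sigma (well defined since nu_post > d + 1)
print(Psi_post / (nu_post - d - 1))

# Posterior draws, e.g. for uncertainty quantification
draws = invwishart(df=nu_post, scale=Psi_post).rvs(size=1000)
```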

10.30am-11.00am Tom Dorrington Ward (Engage Smarter)
Evaluating and assuring AI Agents for financial services

The arrival of ChatGPT in November 2022 started a new wave of “generative AI” applications. One area with huge potential for generative AI to make a difference is in helping people to make better financial decisions. However, using AI Agents to provide consumer financial guidance requires assurance: there are risks in providing incorrect guidance, and advice is a regulated activity. In this short talk, Tom Dorrington Ward, CTO & Co-Founder of Engage Smarter AI, will survey emerging techniques, including AI architectures and evaluation processes, for evaluating and assuring AI Agents. He will also highlight the elements which make expert financial guidance a particularly complex use case. Finally, he will describe how Engage Smarter AI’s own framework for evaluating and assuring AI Agents in financial services brings together these different elements.

11.00am-11.30am Ieva Kazlauskaite (Data Science - from August)
Calculating exposure to extreme sea level risk will require high resolution ice sheet models

The West Antarctic Ice Sheet (WAIS) is losing ice and its annual contribution to sea level is increasing. The future behaviour of WAIS will impact societies worldwide, yet deep uncertainty remains in the expected rate of ice loss. High-impact, low-likelihood scenarios of sea level rise are needed by risk-averse stakeholders but are particularly difficult to constrain. In this work, we combine traditional model simulations of the Amundsen Sea sector of WAIS with Gaussian process emulation to show that ice-sheet models capable of resolving kilometre-scale basal topography will be needed to assess the probability of extreme scenarios of sea-level rise. This resolution is finer than that of many state-of-the-art continent-scale simulations. Our ice-sheet model simulations show that coarser resolutions tend to project higher sea-level contributions than finer resolutions, inflating the tails of the distribution. We therefore caution against relying purely upon simulations coarser than 4-5 km when assessing the potential for societally important high-impact sea level rise.
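
For readers unfamiliar with emulation, here is a minimal Gaussian process emulator fitted with scikit-learn to a synthetic design; the input variables, the stand-in "simulator", and the kernel choices are placeholders rather than the ice-sheet ensemble used in this work.

```python
# Minimal sketch of Gaussian process emulation on synthetic data (not the
# authors' setup): fit a GP to (simulator input, sea-level contribution)
# pairs and predict, with uncertainty, at untried inputs.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

rng = np.random.default_rng(1)

# Hypothetical design: columns = (grid resolution in km, basal friction scaling)
X_design = rng.uniform([1.0, 0.5], [10.0, 1.5], size=(30, 2))
# Stand-in "simulator": output grows with coarser resolution
y = (0.3 * X_design[:, 0] + 2.0 * (X_design[:, 1] - 1.0) ** 2
     + rng.normal(scale=0.05, size=30))

kernel = RBF(length_scale=[3.0, 0.5]) + WhiteKernel(noise_level=1e-3)
emulator = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
emulator.fit(X_design, y)

# Predict at new inputs with predictive standard deviations
X_new = np.array([[2.0, 1.0], [8.0, 1.0]])
mean, std = emulator.predict(X_new, return_std=True)
print(mean, std)
```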

(11.30am-12.00pm Break)

12.00pm-12.30pm Despoina Makariou (St. Gallen)
Estimation of heterogeneous treatment effects in the primary catastrophe bond market using causal forests

We introduce a causal random forest approach to predict treatment heterogeneity in alternative capital markets. We focus on predicting the effect of issuance timing on the spreads of an insurance-linked security known as a catastrophe bond. Studying issuance timing is important for optimising the cost of capital and ensuring the success of the bond offering. We construct a causal random forest and find that issuing a catastrophe bond in the first half of a calendar year is associated with a lower spread, and that this result varies with several factors, such as market conditions, the type of the underlying asset, and the size of the issuance.
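
The sketch below illustrates the general technique on synthetic data, using the open-source econml package's CausalForestDML estimator (the package choice and its exact API are assumptions here, not part of the study); it is not the authors' catastrophe-bond analysis.

```python
# Illustrative sketch of heterogeneous treatment effect estimation with a
# causal forest, on synthetic (not cat-bond) data.
import numpy as np
from econml.dml import CausalForestDML

rng = np.random.default_rng(2)
n = 2000

X = rng.normal(size=(n, 3))          # stand-ins for market conditions, asset type, size
T = rng.binomial(1, 0.5, size=n)     # "issued in first half of year" indicator
tau = 0.5 + 0.8 * X[:, 0]            # heterogeneous effect on the spread
Y = (2.0 + X @ np.array([0.3, -0.2, 0.1])
     - tau * T                        # treatment lowers the spread by tau
     + rng.normal(scale=0.5, size=n))

cf = CausalForestDML(discrete_treatment=True, n_estimators=200, random_state=0)
cf.fit(Y, T, X=X)

cate = cf.effect(X)                  # estimated effects (negative = lower spread)
print(cate[:5])
```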

12.30pm-1.00pm Zoltan Szabo (Data Science)
Minimax Rate of HSIC Estimation

Kernel techniques, such as the Hilbert-Schmidt independence criterion (HSIC; also called distance covariance), are among the most powerful approaches in data science and statistics for measuring the statistical independence of M ≥ 2 random variables. Despite the various HSIC estimators designed since its introduction close to two decades ago, the fundamental question of the rate at which HSIC can be estimated is still open; this forms the focus of the talk for translation-invariant kernels on R^d. [This is joint work with Florian Kalinke. Preprint: https://arxiv.org/abs/2403.07735]
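
To make the quantity in the title concrete, here is a plain biased (V-statistic) HSIC estimator with Gaussian kernels written in numpy; it is not the estimator analysed in the preprint.

```python
# Biased (V-statistic) HSIC estimate: HSIC = trace(K H L H) / n^2, where K, L
# are kernel Gram matrices of the two samples and H is the centring matrix.
import numpy as np

def gaussian_gram(X, bandwidth):
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2 * X @ X.T
    return np.exp(-d2 / (2 * bandwidth**2))

def hsic_biased(X, Y, bw_x=1.0, bw_y=1.0):
    n = X.shape[0]
    K = gaussian_gram(X, bw_x)
    L = gaussian_gram(Y, bw_y)
    H = np.eye(n) - np.ones((n, n)) / n          # centring matrix
    return np.trace(K @ H @ L @ H) / n**2

rng = np.random.default_rng(3)
X = rng.normal(size=(500, 2))
Y_dep = X[:, :1] + 0.1 * rng.normal(size=(500, 1))   # dependent on X
Y_ind = rng.normal(size=(500, 1))                     # independent of X
print(hsic_biased(X, Y_dep), hsic_biased(X, Y_ind))   # first should be larger
```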

(1.00pm-2.30pm Lunch)

2.30pm-3.00pm Yining Chen (Data Science / Time Series and Statistical Learning)
Detecting changes in production frontiers

3.00pm-3.30pm Kostas Kardaras (Probability in Finance and Insurance)
Equilibrium models of production and capacity expansion

We consider a model with producers making decisions on how much to produce and how much to invest in expanding the capacity of future production. With demand functions exogenously given, we study a multi-agent setting where prices are formed in equilibrium. Depending on the form of the production function, this leads to either a singular or a standard control problem. The solutions to the latter are either given explicitly or characterised via a second-order non-linear ODE. (Based on works with Junchao Jia, Alexander Pavlis and Michael Zervos.)

3.30pm-4.00pm Dima Karamshuk (Meta)
Content Moderation at Scale – Protecting Integrity of Online Communities on Meta Platforms

To enable content moderation on large social media platforms, it is important to detect harmful viral content in a timely manner. The detection problem is difficult because content virality results from interactions between user interests, content characteristics, feed ranking, and community structure.

This talk will shed light on the design of algorithms that can efficiently solve this problem at Meta's scale.

(4.00pm-4.30pm Break)

4.30pm-5.00pm Giulia Livieri (Probability in Finance and Insurance)
On Mean Field Games and Applications

Mean field game theory is a branch of game theory, namely a set of concepts, mathematical tools, theorems, and algorithms which, like all game theory, helps (micro- or macro-) economists, sociologists, engineers, and even urban planners to model situations of agents who take decisions in a context of strategic interactions. In this talk, I will introduce mean field games through some “toy models” to progressively discover the concepts and the mathematics behind this theory. I may conclude with the presentation of some very preliminary results on a mean field game model of shipping, where the model is also calibrated on real data (co-authors: Michele Bergami, Simone Moawad, Barath Raaj Suria Narayanan (PG students, LSE); Evan Chien Yi Chow (ADIA); Charles-Albert Lehalle).

5.00pm-5.30pm Chengchun Shi (Data Science / Time Series and Statistical Learning)
Switchback designs can enhance policy evaluation in reinforcement learning

Time series experiments, in which experimental units receive a sequence of treatments over time, are prevalent in technological companies, including ride-sharing platforms and trading companies. These companies frequently employ such experiments for A/B testing, to evaluate the performance of a newly developed policy, product, or treatment relative to a baseline control. Many existing solutions require that the experimental environment be fully observed to ensure the data collected satisfies the Markov assumption. This condition, however, is often violated in real-world scenarios. Such a gap between theoretical assumptions and practical realities challenges the reliability of existing approaches and calls for more rigorous investigations of A/B testing procedures.

In this paper, we study the optimal experimental design for A/B testing in partially observable environments. We introduce a controlled (vector) autoregressive moving average model to effectively capture a rich class of partially observable environments. Within this framework, we derive closed-form expressions, i.e., efficiency indicators, to assess the statistical efficiency of various sequential experimental designs in estimating the average treatment effect (ATE). A key innovation of our approach lies in the introduction of a weak signal assumption, which significantly simplifies the computation of the asymptotic mean squared errors of ATE estimators in time series experiments. We next proceed to develop two data-driven algorithms to estimate the optimal design: one utilizing constrained optimization, and the other employing reinforcement learning. We demonstrate the superior performance of our designs using a dispatch simulator and two real datasets from a ride-sharing company. 
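
As a toy illustration of a switchback design (far simpler than the controlled VARMA framework of the paper), the sketch below alternates treatment in fixed-length blocks over an AR(1) environment and estimates the ATE by a difference in means; all quantities are synthetic.

```python
# Toy switchback simulation: treatment alternates every `block` periods, the
# environment carries over through an AR(1) state, and the ATE is estimated
# by a simple difference in means between treated and control periods.
import numpy as np

rng = np.random.default_rng(4)
T_steps, block = 10_000, 50
ate_true, phi = 0.3, 0.8

# Switchback assignment: alternate treatment every `block` periods
A = (np.arange(T_steps) // block) % 2

y = np.zeros(T_steps)
state = 0.0
for t in range(T_steps):
    state = phi * state + rng.normal(scale=0.5)   # carry-over from the past
    y[t] = state + ate_true * A[t] + rng.normal(scale=0.2)

ate_hat = y[A == 1].mean() - y[A == 0].mean()
print(f"estimated ATE: {ate_hat:.3f} (truth {ate_true})")
```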

6.00pm-8.00pm Poster Session & Reception

Friday 21 June

10.00am-10.30am Joshua Loftus (Data Science)
Model-agnostic explanation tools and their limitations

Tools for interpretable machine learning or explainable artificial intelligence can be used to audit algorithms for fairness or other desired properties. In a "black-box" setting (one without access to the algorithm's internal structure), the methods available to an auditor may be model-agnostic. These methods are based on varying inputs while observing differences in outputs, and include some of the most popular interpretability tools like Shapley values and Partial Dependence Plots. Such explanation methods have important limitations. Moreover, their limitations can impact audits, with consequences for outcomes such as fairness. This talk will highlight key lessons that regulators, auditors, or other users of model-agnostic explanation tools must keep in mind when interpreting their output. Although we focus on a selection of tools for interpretation and on fairness as an example auditing goal, our lessons generalize to many other applications of model-agnostic explanations. These tools are increasing in popularity, which makes understanding their limitations an important research direction. That popularity is driven largely by their ease of use and portability. In high-stakes settings like an audit, however, it may be worth the extra work to use tools based on causal modeling that can incorporate background information and be tailored to each specific application.
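
For concreteness, the snippet below computes two such model-agnostic quantities, partial dependence and permutation importance, with scikit-learn on synthetic data; Shapley-value tools follow the same vary-the-inputs logic but are not shown.

```python
# Two model-agnostic explanation tools on a synthetic regression task:
# partial dependence (average prediction as one feature varies) and
# permutation importance (accuracy drop when a feature is shuffled).
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.inspection import partial_dependence, permutation_importance

rng = np.random.default_rng(5)
X = rng.normal(size=(1000, 3))
y = np.sin(X[:, 0]) + 0.5 * X[:, 1] + rng.normal(scale=0.1, size=1000)

model = GradientBoostingRegressor().fit(X, y)

# Partial dependence of the prediction on feature 0
pd_result = partial_dependence(model, X, features=[0], grid_resolution=20)
print(pd_result["average"].shape)

# Permutation importance: how much does shuffling each feature hurt accuracy?
imp = permutation_importance(model, X, y, n_repeats=10, random_state=0)
print(imp.importances_mean)
```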

10.30am-11.00am Anoushka Gupta (Illuma Technology)
Application of Contextual Advertising in a Cookieless World

Anoushka Gupta, an alumna of the MSc Data Science program, Class of 2022, is currently a Senior Data Analyst at Illuma Technology Ltd, a pioneering British AI company specializing in contextual ad targeting. Illuma's innovative technology operates without relying on cookies or identifiers, instead using real-time insights from audience browsing behaviour to identify relevant new audiences at scale. 
 
In her upcoming talk at LSE, Anoushka will delve into the application of Illuma's technology in a cookieless world. She will discuss how Illuma leverages advanced AI to optimize advertising campaigns across the EMEA region. Anoushka's expertise in data science plays a crucial role in enhancing campaign performance, ensuring advertisements reach relevant audiences effectively and efficiently. Her insights will shed light on how businesses can navigate the challenges posed by the phasing out of cookies, demonstrating the potential of AI-driven solutions for successful ad targeting without traditional tracking methods. 

11.00am-11.30am Sara Geneletti (Social Statistics)
Using an interrupted time series design to understand the impact of austerity measures in the UK

This talk gives an overview of the research projects I am currently involved in. The Wellcome grant aims to quantitatively assess the impact of austerity policies on mental health, including cuts to welfare such as the introduction of Universal Credit, as well as the impact of the hostile environment policy on the mental health of minority communities in England. This research uses the Understanding Society dataset. The ESRC grant uses causal inference methods, such as causal DAGs, to formally describe and explore how racial inequalities affect sentencing and remand. This research uses HMCTS data, which are collected for every instance of a court appearance.
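
A minimal interrupted time series (segmented regression) sketch is given below, fitted with statsmodels on simulated monthly data; the variable names and policy date are invented, and the actual analyses use the Understanding Society panel with richer models.

```python
# Segmented regression for an interrupted time series: a level-change term
# ("post") and a slope-change term ("time_since") around the policy date.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(6)
n_months, cut = 120, 60                  # hypothetical policy at month 60

df = pd.DataFrame({"time": np.arange(n_months)})
df["post"] = (df["time"] >= cut).astype(int)            # level change
df["time_since"] = np.maximum(df["time"] - cut, 0)      # slope change
df["score"] = (10 + 0.01 * df["time"] + 1.5 * df["post"]      # simulated
               + 0.05 * df["time_since"]                      # mental-health
               + rng.normal(scale=1.0, size=n_months))        # score

fit = smf.ols("score ~ time + post + time_since", data=df).fit()
print(fit.params)   # 'post' = immediate shift, 'time_since' = trend change
```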

(11.30am-12.00pm Break)

12.00pm-12.30pm Xuewen Yu (Social Statistics)
Causal inference in continuous time

Although randomised controlled trials are the ideal way to assess real-world medication use, they are costly and sometimes unethical. With increasingly available real-world data, e.g. ‘minute-by-minute’ electronic health records, causal inference methods such as the g-methods have become crucial for evaluating treatment effects in observational studies. G-methods, including inverse probability weighting with marginal structural models, the parametric g-formula, and g-estimation of structural nested models, are well developed for longitudinal data, where changes in treatment and confounding occur at a grid of time points common to all individuals. But real-world scenarios often involve sporadic changes at irregular intervals. Although several continuous-time g-methods have been proposed, the literature is dispersed and involves technical complexities. This talk will summarise these methods and demonstrate their application using the UK ‘Towards A CurE for rheumatoid arthritis’ cohort data.
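
As a stepping stone to the longitudinal and continuous-time versions discussed in the talk, here is the simplest g-method, inverse probability weighting at a single time point, on simulated data.

```python
# Inverse probability weighting (single time point): estimate propensity
# scores, form stabilised weights, and compare weighted outcome means.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(7)
n = 5000

L = rng.normal(size=(n, 2))                                  # confounders
p_treat = 1 / (1 + np.exp(-(0.8 * L[:, 0] - 0.5 * L[:, 1])))
A = rng.binomial(1, p_treat)                                 # treatment
Y = 1.0 * A + L[:, 0] + 0.5 * L[:, 1] + rng.normal(size=n)   # true effect = 1

# Estimate propensity scores and form stabilised weights
ps = LogisticRegression().fit(L, A).predict_proba(L)[:, 1]
sw = np.where(A == 1, A.mean() / ps, (1 - A.mean()) / (1 - ps))

# Weighted difference in means = IPW estimate of the marginal effect
ate = (np.average(Y[A == 1], weights=sw[A == 1])
       - np.average(Y[A == 0], weights=sw[A == 0]))
print(ate)
```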

12.30pm-1.00pm Qiwei Yao (Time Series and Statistical Learning)
Autoregressive dynamic networks

We give a brief introduction to the autoregressive (AR) model for dynamic network processes. The model depicts dynamic changes explicitly, and it facilitates simple and efficient statistical inference, such as MLEs and a permutation test for model diagnostic checking. We illustrate how this AR model can serve as a building block to accommodate more complex structures such as stochastic latent blocks and change-points. We also elucidate how some stylized features often observed in real network data, including node heterogeneity, edge sparsity, persistence, transitivity and density dependence, can be embedded in the AR framework. The framework then needs to be extended for dynamic networks with dependent edges, which poses new technical challenges. The practical relevance of the proposed AR framework is also illustrated with real network data.
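
To convey the flavour of an autoregressive edge process (a stylised stand-in, not the full model specification of the talk), the sketch below evolves a network in which absent edges form with probability alpha and existing edges dissolve with probability beta.

```python
# Stylised AR(1) edge process for an undirected dynamic network: each edge
# switches on with probability alpha if absent and off with probability beta
# if present, independently across edges.
import numpy as np

rng = np.random.default_rng(8)
p, T_steps = 30, 50           # number of nodes, number of time points
alpha, beta = 0.05, 0.30      # formation and dissolution probabilities

A = rng.binomial(1, 0.1, size=(p, p))
A = np.triu(A, 1); A = A + A.T                 # symmetric, no self-loops
networks = [A.copy()]

for _ in range(T_steps):
    U = rng.random((p, p))
    form = (A == 0) & (U < alpha)              # new edges appear
    keep = (A == 1) & (U >= beta)              # existing edges persist
    A = (form | keep).astype(int)
    A = np.triu(A, 1); A = A + A.T             # re-symmetrise
    networks.append(A.copy())

print([net.sum() // 2 for net in networks[:5]])   # edge counts over time
```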

(1.00pm-2.30pm Lunch)

2.30pm-3.00pm Kostas Kalogeropoulos (Data Science / Social Statistics / Time Series and Statistical Learning)
Bayesian sequential learning for hidden semi-Markov models

In this work, we explore the class of hidden semi-Markov models (HSMMs), a flexible extension of the popular hidden Markov models (HMMs) that allows the underlying stochastic process to be a semi-Markov chain. HSMMs are typically used less frequently than HMMs due to the increased computational challenges in evaluating the likelihood function. Moreover, despite both families of models being sequential in nature, existing inference methods mainly target batch data settings. We address these issues by developing a computational scheme for Bayesian inference on HSMMs that allows for estimation (1) in a computationally feasible time, (2) in an exact manner, i.e. subject only to Monte Carlo error, and (3) in a sequential setting. Additionally, we explore the performance of HSMMs in two settings: a financial time series application on the VIX index, and stochastic epidemic models on data from the COVID-19 pandemic. In both cases we demonstrate how the developed methodology can be used for tasks such as regime switching, model selection and clustering.
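
The generative side of an HSMM is easy to write down; the sketch below simulates a two-state Gaussian HSMM with Poisson state durations (all parameters invented). The sequential Bayesian inference scheme, which is the contribution of the work, is not attempted here.

```python
# Simulate a two-state hidden semi-Markov model: explicit Poisson sojourn
# times, deterministic jumps between the two states, Gaussian emissions.
import numpy as np

rng = np.random.default_rng(9)

trans = np.array([[0.0, 1.0],       # on leaving a state, where to jump next
                  [1.0, 0.0]])
dur_mean = np.array([20, 5])        # mean sojourn time per state (Poisson)
emit_mu = np.array([0.0, 3.0])      # Gaussian emission means
emit_sd = np.array([1.0, 1.5])

states, obs = [], []
s = 0
while len(obs) < 500:
    d = 1 + rng.poisson(dur_mean[s])            # explicit duration
    states.extend([s] * d)
    obs.extend(rng.normal(emit_mu[s], emit_sd[s], size=d))
    s = rng.choice(2, p=trans[s])               # semi-Markov jump
states, obs = np.array(states[:500]), np.array(obs[:500])
print(obs[:10])
```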

3.00pm-3.30pm Tengyao Wang (Data Science / Time Series and Statistical Learning)
Multiple-output composite quantile regression via optimal transport

Composite quantile regression has been used to obtain robust estimators of regression coefficients in linear models with good statistical efficiency. By revealing an intrinsic link between the composite quantile regression loss function and the Wasserstein distance from the residuals to the set of quantiles, we establish a generalization of composite quantile regression to the multiple-output setting. Theoretical convergence rates of the proposed estimator are derived both under the setting where the additive error possesses only a finite q-th moment (for q > 2) and where it exhibits a sub-Weibull tail. In doing so, we develop novel techniques for analyzing M-estimation problems that involve a Wasserstein distance in the loss. Numerical studies confirm the practical effectiveness of our proposed procedure.
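
As background, the snippet below fits a plain single-output composite quantile regression by minimising the sum of pinball losses over several quantile levels with scipy; the optimal-transport generalisation to multiple outputs, which is the talk's contribution, is not attempted.

```python
# Single-output composite quantile regression: one shared slope vector, one
# intercept per quantile level, fitted by minimising the summed pinball loss.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(10)
n, d = 500, 3
X = rng.normal(size=(n, d))
beta_true = np.array([1.0, -2.0, 0.5])
y = X @ beta_true + rng.standard_t(df=3, size=n)    # heavy-tailed noise

taus = np.array([0.1, 0.3, 0.5, 0.7, 0.9])

def composite_loss(params):
    beta, b = params[:d], params[d:]                # shared slope, per-tau intercepts
    r = y[:, None] - X @ beta[:, None] - b[None, :] # residuals for each tau
    return np.sum(np.maximum(taus * r, (taus - 1) * r))   # pinball losses

res = minimize(composite_loss, x0=np.zeros(d + len(taus)), method="Nelder-Mead",
               options={"maxiter": 20000, "xatol": 1e-6, "fatol": 1e-6})
print(res.x[:d])        # estimated regression coefficients
```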

How to find us:

Have a question? Please contact the event organisers:
Professor Zoltan Szabo
Penny Montague