I am a senior research scientist at the UK AI Security Institute and an incoming Associate Professor of Computational Linguistics at UCL. Prior to this, I was an ETH Fellow, working with Ryan Cotterell at the Institute for Machine Learning in the ETH Zurich Department of Computer Science. I did my PhD at the Institute for Logic, Language and Computation of the University of Amsterdam, where I was advised by Raquel Fernández. I am an associated researcher at the ETH AI Center and a member of the ELLIS Society.

My research explores the computational principles underlying the ability to understand, produce, learn, and use language in interaction—both in natural and in artificial cognitive systems. I’m also deeply committed to improving evaluation standards for AI systems, in the domain of language and across the broader AI landscape. In 2018, I co-authored a paper that, according to the very kind Aaron Mueller, introduced the first mechanistic interpretability method for language models—apparently before it was cool.

News

Working with me

Please have a look at the following list of opportunities. I will keep updating it over the next months. If you are interested in applying for any of these, please get in touch.

Invited talks and guest lectures

Work in progress

Publications

Theses