Projects

Research projects

Project Telos: Modelling, Measuring, and Intervening on Goal-directed Behaviour in AI systems

2025-2026 · Ongoing

Project Telos combines behavioural and representational analyses into a general framework for detecting and measuring goal-directedness in AI systems, an essential step towards solving the alignment problem. Our aim is to enable high-confidence claims about which goals an AI is pursuing and how consistently it acts towards them. Funded by SPAR and Cohere.

Preprint · Introductory post

LM Playschool Workshop & Challenge

2026 · Ongoing

The LM Playschool Workshop and Challenge invites submissions on language agents that learn, adapt, and improve through situated interaction, with a focus on conversational, collaborative, goal-oriented, and multi-turn environments. A collaboration between ELLIS Units at UCL, Edinburgh, Amsterdam, Potsdam, Saarland, Bozen-Bolzano, and Amazon.

Workshop site · Call for papers