Research projects
Project Telos: Modelling, Measuring, and Intervening on Goal-directed Behaviour in AI systems
2025-2026 · Ongoing
Project Telos develops a general framework for detecting and measuring goal-directedness in AI systems, an essential step for solving the alignment problem, by combining behavioural and representational analyses. Our aim is to enable high-confidence claims about which goals an AI is pursuing, and how consistently it acts towards them. Funded by SPAR and Cohere.
LM Playschool Workshop & Challenge
2026 · Ongoing
The LM Playschool Workshop and Challenge invites submissions on language agents that learn, adapt, and improve through situated interaction, with a focus on conversational, collaborative, goal-oriented, and multi-turn environments. A collaboration between ELLIS Units at UCL, Edinburgh, Amsterdam, Potsdam, Saarland, Bozen-Bolzano, and Amazon.