Robert Müller

Core Team

Robert Müller

RL & Self Improving Agents

AI Researcher

About

Robert is a reinforcement learning researcher and founder working at the intersection of RL, agentic AI, and self improving systems. His work focuses on AI systems that do not remain static after deployment, but improve through interaction, feedback, traces, and real world outcomes. Before joining the Agentic Systems Lab, Robert worked on deep reinforcement learning for high speed robotic control at Sony AI and AI driven control for electron microscopy at appliedAI. As founding researcher he build the research team at Convergence, the AI agent startup later acquired by Salesforce. His research spans policy gradient methods, meta learning across task distributions, multi agent environments, and strategic interaction between language model agents, including recent work on agent benchmarks for negotiation and imperfect information settings. Robert is the founder of Aganthos, where he works on learning from experience as a foundation for self improving AI systems. The company explores how single agents and multi agent workflows can learn from accumulated experience, adapt their behavior over time, and become more capable through use. At ASL, Robert focuses on reinforcement learning for self improving agents: systems that can evaluate their own behavior, extract useful learning signals from logs and feedback, and improve prompts, tools, routing decisions, and model weights over time. His broader ambition is to connect the empirical power of foundation models with the adaptive machinery of reinforcement learning.

Connect

robertmueller22

deep_q_learning

robert@aganthos.com

Publications

01Google Scholar

Research Areas

01Reinforcement Learning

02Continual Learning

03Multitask Learning

04Recursive Self Improvement

05Strategic LLM agent evaluation

Other team members

Dr. Robert Jakob

Founder & Co-Director

LECTURER & POSTDOC ETH

Dr. Kevin O'Sullivan

Founder & Co-Director

LECTURER & POSTDOC ETH

Dr. Markus Kreft

AI Engineering Lead

LECTURER & POSTDOC ETH

Dr. Patrick Langer

AI Research Lead

AI Research Scientist

Dr. Fan Wu

Multimodal AI in Healthcare

Nicolas Zumarraga

Multimodal AI

PhD Researcher ETH

Ning Wang

Multimodal AI & RL

PhD Researcher ETH

Thomas Kaar

Multimodal AI

AI researcher Stanford

Max Rosenblattl

Multimodal AI

AI researcher Stanford

Dr. Robin Deuber

AI in HCI & Robotics

David Schaurecker

AI Forecasting

PhD Researcher ETH

Akshaye Shenoi

Agentic AI in Healthcare

PhD Researcher NUS

Maximilian May

Agentic AI in Education

PhD Researcher HSG

Dr. Kevin Riehl

Multimodal AI

PhD Researcher ETH

Felix Moser

Voice AI

PhD Researcher HSG

Prof. Dr. Felix Wortmann

Professor & Scientific Director, University of St. Gallen

Prof. Dr. Elgar Fleisch

Professor, ETH Zurich & University of St. Gallen

Pre-Doctoral Researcher

Pascal Bertrand

Lab Co-Founder / Video Intelligence

Gabor Hollbeck

Agentic AI in Science

Atoof Shakir

Video Intelligence

Riccardo Mansutti

Agentic AI in Energy

Baran Peters

AI Evaluation Frameworks

Connor Larson

Agentic AI in Insurance

Lasse Bærland Strand

Next-Gen RAG & Transformers

Lorenzo Steno

Agentic AI in Science

Alpay Hasanli

Agentic AI in Science

Kilian Bänziger

Next-Gen RAG

Kacper Ozieblowski

Multimodal AI

Anirudhh Ramesh

World Models

Haseeb Raza

AI Research

Maximilian Weigl

Industry Applications of TSLMs

Michael Ameri

Multimodal AI in Telecommunications

Kalil Sama Bouzigues

Browser Agents

Francesco Cavalli

Agentic AI for Regulatory Affairs

Àlex Martí Guiu

Multimodal AI

Joseph Xavier Mertz Echauri

Agentic AI in Finance

Jakob Flunger

Agentic AI in Insurance

Marie-Louise Dugua

Recommender Systems

Behsad Riemer

AI Infrastructure

Ralf Boltshauser

Enterprise Agentic Engineering

Piyushi Goyal

Finetuning domain-specific SLMs

Interested in collaborating?

We are always looking for talented students, researchers and industry partners.