Jonas Hübotter

I am a PhD student in the Learning and Adaptive Systems Group at ETH Zurich working with Andreas Krause. Prior to this, I obtained a Master’s degree in Theoretical Computer Science and Machine Learning from ETH Zurich and a Bachelor’s degree in Computer Science and Mathematics from the Technical University of Munich. I am a recipient of the ETH Medal.

My research aims to leverage foundation models for solving hard tasks through specialization and reinforcement learning. Beyond this, I have broad interests including (approximate) probabilistic inference, optimization, and online learning.

Always feel free to reach out to me with things you find exciting.

Contacts: jhuebotter@ethz.ch Google Scholar GitHub Linkedin

News

Nov, 2025	Gave an invited lecture on test-time training at EPFL. Slides are here.
Sep, 2025	NeurIPS 2025: DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning has been accepted! We will also present our work towards understanding the effectiveness of test-time training in foundation models with an oral presentation at the CCFM workshop.
Jul, 2025	COLM 2025: Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging has been accepted! We will also present our work on test-time scaling via prefix-confidence at the SCALR workshop.
May, 2025	ICML 2025: Active Fine-Tuning of Multi-Task Policies has been accepted! We will also present our work on test-time offline RL at the PUT workshop and our work on curricula for sparse-reward RL at the EXAIT workshop.
Feb, 2025	Very excited to share notes on Probabilistic AI that I have been writing with Andreas Krause!

Selected Publications

Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning

Jonas Hübotter^*, Leander Diaz-Bone^*, Ido Hakimi , and 2 more authors

arXiv preprint arXiv:2510.04786, 2025

Bib PDF Code Models Data

@article{hubotter2025learning,
  title = {Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning},
  author = {Hübotter, Jonas and Diaz-Bone, Leander and Hakimi, Ido and Krause, Andreas and Hardt, Moritz},
  year = {2025},
  journal = {arXiv preprint arXiv:2510.04786},
}

Oral

Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models

Jonas Hübotter^*, Patrik Wolf^*, Alexander Shevchenko^* , and 3 more authors

arXiv preprint arXiv:2509.24510, 2025

Oral Presentation at NeurIPS 2025 Workshop on Continual and Compatible Foundation Model Updates.

Bib PDF Code Models

@article{hubotter2025specialization,
  title = {Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models},
  author = {Hübotter, Jonas and Wolf, Patrik and Shevchenko, Alexander and Jüni, Dennis and Krause, Andreas and Kur, Gil},
  year = {2025},
  journal = {arXiv preprint arXiv:2509.24510},
}

NeurIPS ’25

DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning

Leander Diaz-Bone^*, Marco Bagatella^*, Jonas Hübotter^* , and 1 more author

In Advances in Neural Information Processing Systems (2025) , 2025

Bib PDF Code Poster

@inproceedings{diazbone2025discover,
  title = {DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning},
  author = {Diaz-Bone, Leander and Bagatella, Marco and Hübotter, Jonas and Krause, Andreas},
  year = {2025},
  booktitle = {Advances in Neural Information Processing Systems (2025)},
}

COLM ’25

Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging

Ryo Bertolissi^*, Jonas Hübotter^*, Ido Hakimi , and 1 more author

In Conference on Language Modeling (2025) , 2025

Bib PDF Code Models Poster

@inproceedings{bertolissi2025local,
  title = {Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging},
  author = {Bertolissi, Ryo and Hübotter, Jonas and Hakimi, Ido and Krause, Andreas},
  year = {2025},
  booktitle = {Conference on Language Modeling (2025)},
}

ICLR ’25 Best Paper

Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs

Jonas Hübotter, Sascha Bongni, Ido Hakimi , and 1 more author

In International Conference on Learning Representations (2025) , 2024

Best Paper Award at NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning.

Bib PDF Code Poster Slides

@inproceedings{hubotter2024efficiently,
  title = {Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs},
  author = {H{\"u}botter, Jonas and Bongni, Sascha and Hakimi, Ido and Krause, Andreas},
  year = {2024},
  booktitle = {International Conference on Learning Representations (2025)},
}

NeurIPS ’24 Oral
Transductive Active Learning: Theory and Applications

Jonas Hübotter, Bhavya Sukhija, Lenart Treven , and 2 more authors

In Advances in Neural Information Processing Systems (2024) , 2024

Oral Presentation at ICML 2024 Workshop on Aligning Reinforcement Learning Experimentalists and Theorists.

Bib PDF Code Poster Slides
@inproceedings{hubotter2024transductive, title = {Transductive Active Learning: Theory and Applications}, author = {H{\"u}botter, Jonas and Sukhija, Bhavya and Treven, Lenart and As, Yarden and Krause, Andreas}, year = {2024}, booktitle = {Advances in Neural Information Processing Systems (2024)}, }

Latest Talks

→ all talks

Jan 13, 2026 Invited Talk	Test-Time Training Agents for Deep Exploration BLISS Speaker Series, Berlin
Dec 7, 2025 Contributed Talk	Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models NeurIPS Workshop on Continual and Compatible Foundation Model Updates, San Diego
Nov 18, 2025 Invited Lecture	Test-Time Training 📝 Graduate course “Foundation Models and Generative AI” (hosted by Prof. Charlotte Bunne), EPFL, Lausanne
Nov 5, 2025 Invited Talk	Test-Time Training Agents to Solve Challenging Problems 🎥 📝 heidelberg.ai Speaker Series (hosted by Carsten Lüth), Heidelberg
Oct 17, 2025 Invited Talk	Learning on the Job: Solving Challenging Problems with Test-Time Training Agents NVIDIA

Supervision

I have had the privilege of advising several BSc and MSc students during their theses and semester projects. Some of these projects have led to publications.

Dennis Jüni (MSc): Meta Test-Time Training for Image Classification (with Frederike Lübeck)
Matthias Otth (MSc): Efficient Fine-Tuning and Test-Time Training of Large Language Models for Reasoning Tasks (with Ido Hakimi, SCALR@COLM'25)
Leander Diaz-Bone (MSc): Directed Goal-Conditioned Reinforcement Learning (with Marco Bagatella, NeurIPS'25)
Ryo Bertolissi (BSc): Test-Time Model Merging for Mixture of Local Experts (with Ido Hakimi, COLM'25)
Nicolas Menet (MSc): Efficiently Estimating Gaussian Probability of Maximality (with Parnian Kassraie, AISTATS'25)
Sascha Bongni (BSc): Active Fine-Tuning of Large Language Models (ICLR'25)
Pablo Lahmann (MSc): Safe Control as Inference (with Yarden As)
Anh Duc Nguyen (BSc): Safe Bayesian Optimization without Regret

You can find a list of potential projects of our research group here. If you want to work with me, please send me an email describing your area of interest. Please also attach your CV and up-to-date transcripts.