My research aims to leverage foundation models for solving hard tasks through specialization and reinforcement learning. Beyond this, I have broad interests including (approximate) probabilistic inference, optimization, and online learning.
Always feel free to reach out to me with things you find exciting.
We released three papers on self-distillation: [1] on self-distillation from demonstrations enabling continual learning, [2] on reinforcement learning via self-distillation, and [3] on online self-distillation from raw user interactions.
@article{kleinebuening2026aligning,
  title   = {Aligning Language Models from User Interactions},
  author  = {Kleine Buening, Thomas and Hübotter, Jonas and Pásztor, Barna and Shenfeld, Idan and Ramponi, Giorgia and Krause, Andreas},
  year    = {2026},
  journal = {arXiv preprint arXiv:2603.12273}
}
Oral
Reinforcement Learning via Self-Distillation
Jonas Hübotter, Frederike Lübeck, Lejs Behric, and 8 more authors
arXiv preprint arXiv:2601.20802, 2026
Oral Presentation at ICLR 2026 Workshops on Scaling Post-training for LLMs and on Test-Time Updates.
@article{hubotter2026reinforcement,
  title   = {Reinforcement Learning via Self-Distillation},
  author  = {Hübotter, Jonas and Lübeck, Frederike and Behric, Lejs and Baumann, Anton and Bagatella, Marco and Marta, Daniel and Hakimi, Ido and Shenfeld, Idan and Kleine Buening, Thomas and Guestrin, Carlos and Krause, Andreas},
  year    = {2026},
  journal = {arXiv preprint arXiv:2601.20802}
}
Oral
Self-Distillation Enables Continual Learning
Idan Shenfeld, Mehul Damani, Jonas Hübotter, and 1 more author
arXiv preprint arXiv:2601.19897, 2026
Oral Presentation at ICLR 2026 Workshop on Lifelong Agents.
@article{shenfeld2026self,
  title   = {Self-Distillation Enables Continual Learning},
  author  = {Shenfeld, Idan and Damani, Mehul and Hübotter, Jonas and Agrawal, Pulkit},
  year    = {2026},
  journal = {arXiv preprint arXiv:2601.19897}
}
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
Jonas Hübotter*, Leander Diaz-Bone*, Ido Hakimi, and 2 more authors
@article{hubotter2025learning,
  title   = {Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning},
  author  = {Hübotter, Jonas and Diaz-Bone, Leander and Hakimi, Ido and Krause, Andreas and Hardt, Moritz},
  year    = {2025},
  journal = {arXiv preprint arXiv:2510.04786}
}
ICLR ’26 Oral
Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models
Jonas Hübotter*, Patrik Wolf*, Alexander Shevchenko*, and 3 more authors
In International Conference on Learning Representations (2026), 2025
Oral Presentation at NeurIPS 2025 Workshop on Continual and Compatible Foundation Model Updates.
@inproceedings{hubotter2025specialization,
  title     = {Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models},
  author    = {Hübotter, Jonas and Wolf, Patrik and Shevchenko, Alexander and Jüni, Dennis and Krause, Andreas and Kur, Gil},
  year      = {2025},
  booktitle = {International Conference on Learning Representations (2026)}
}
NeurIPS ’25
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone*, Marco Bagatella*, Jonas Hübotter*, and 1 more author
In Advances in Neural Information Processing Systems (2025), 2025
@inproceedings{diazbone2025discover,
  title     = {DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning},
  author    = {Diaz-Bone, Leander and Bagatella, Marco and Hübotter, Jonas and Krause, Andreas},
  year      = {2025},
  booktitle = {Advances in Neural Information Processing Systems (2025)}
}
COLM ’25
Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging
Ryo Bertolissi*, Jonas Hübotter*, Ido Hakimi, and 1 more author
@inproceedings{bertolissi2025local,
  title     = {Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging},
  author    = {Bertolissi, Ryo and Hübotter, Jonas and Hakimi, Ido and Krause, Andreas},
  year      = {2025},
  booktitle = {Conference on Language Modeling (2025)}
}
ICLR ’25 Best Paper
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs
Jonas Hübotter, Sascha Bongni, Ido Hakimi, and 1 more author
In International Conference on Learning Representations (2025), 2024
Best Paper Award at NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning.
@inproceedings{hubotter2024efficiently,
  title     = {Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs},
  author    = {Hübotter, Jonas and Bongni, Sascha and Hakimi, Ido and Krause, Andreas},
  year      = {2024},
  booktitle = {International Conference on Learning Representations (2025)}
}
NeurIPS ’24 Oral
Transductive Active Learning: Theory and Applications
Jonas Hübotter, Bhavya Sukhija, Lenart Treven, and 2 more authors
In Advances in Neural Information Processing Systems (2024), 2024
Oral Presentation at ICML 2024 Workshop on Aligning Reinforcement Learning Experimentalists and Theorists.
@inproceedings{hubotter2024transductive,
  title     = {Transductive Active Learning: Theory and Applications},
  author    = {Hübotter, Jonas and Sukhija, Bhavya and Treven, Lenart and As, Yarden and Krause, Andreas},
  year      = {2024},
  booktitle = {Advances in Neural Information Processing Systems (2024)}
}
Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models. NeurIPS Workshop on Continual and Compatible Foundation Model Updates, San Diego.
Supervision
I have had the privilege of advising several BSc and MSc students during their theses and semester projects. Some of these projects have led to publications.
Lejs Behric (MSc): Reinforcement Learning via Self-Distillation
Tim Launer (MSc): Evaluating Training Paradigms and Consensus Mechanisms For Learning To Reason (with Marco Bagatella and Ido Hakimi)
Dennis Jüni (MSc): Meta Test-Time Training for Image Classification (with Frederike Lübeck, ICLR'26)
Matthias Otth (MSc): Efficient Fine-Tuning and Test-Time Training of Large Language Models for Reasoning Tasks (with Ido Hakimi, SCALR@COLM'25)
Ryo Bertolissi (BSc): Test-Time Model Merging for Mixture of Local Experts (with Ido Hakimi, COLM'25)
Nicolas Menet (MSc): Efficiently Estimating Gaussian Probability of Maximality (with Parnian Kassraie, AISTATS'25)
Sascha Bongni (BSc): Active Fine-Tuning of Large Language Models (ICLR'25)
Pablo Lahmann (MSc): Safe Control as Inference (with Yarden As)
Anh Duc Nguyen (BSc): Safe Bayesian Optimization without Regret
You can find a list of our research group's potential projects here. If you want to work with me, please send me an email describing your area of interest, and attach your CV and up-to-date transcripts.