Variational Training for Better Models at Lower Cost

KAKENHI Grant-in-Aid for Scientific Research (A), FY 2026-2031. Project no. 26H02541.

About the Project

Modern AI models are highly capable, but as they scale they become costly to train, maintain, and update. In this project, we develop new fundamental learning algorithms for neural networks that drastically reduce the cost of training AI, for example by improving a model's ability to adapt and learn continually, or by enabling sparsity and low-precision training. One focus is on variational Bayesian learning methods, which can address these issues and which our recent research shows to be effective for large neural networks. The project aims to further advance variational learning methods for large deep networks and to demonstrate their application to sustainable AI training.

Research and Open Positions

The project focuses on the following research directions:

  • Effective new variational learning algorithms for large deep networks (e.g., pre-training, sparsity, low-precision)
  • Applications in continual learning, distributed learning, active learning, reinforcement learning, etc.
  • Mechanistic interpretability, sensitivity analysis, and influence functions
  • Theoretical foundations of variational learning (PAC-Bayes, optimization in spaces of measures, etc.)

I am looking to hire interns for fully funded internships of about 5 months or longer to work with me at RIKEN AIP in Tokyo in FY2026 (earliest starting date: October 2026). For eligibility criteria and more information, please see the internship program website. If you are interested, please get in touch by email:

Tutorials, Slides, Other Materials

  • The Improved Variational Online Newton (IVON) Optimizer, GitHub link.

Publications