Johannes Ackermann

prof_pic.jpg

I am a fourth-year PhD student at the University of Tokyo, working on Reinforcement Learning supervised by Professor Masashi Sugiyama.

I am particularly interested in how we can deal with changing or complicated reward functions and dynamics in RL, as encountered in Multi-Agent interaction, Multi-Task settings or due to Non-Stationarity.

I am also employed by RIKEN AIP as a part-time researcher and supported by a Japanese government MEXT scholarship, and a Microsoft Research Asia D-CORE grant.

I previously interned at Sakana AI in 2025 and at Preferred Networks in 2022. Previously I worked as a DSP researcher in Huawei’s Munich Research Center, working on applied Machine Learning. Before that, I received my B.Sc. and M.Sc. in Electrical Engineering and Information Technology from the Technical University of Munich in 2018 and 2021. I wrote my Master’s Thesis about Multi-Task RL in ETH Zurich’s Distributed Computing group.

I’m always happy to chat about research, so feel free to reach out by e-mail or socials!

news

May 10, 2025 Two papers 1, 2 accepted at RLC 2025 :tada:
May 17, 2024 Our work on Offline Reinforcement Learning from Datasets with Structured Non-Stationarity was accepted at RLC 2024:tada:
Dec 05, 2022 I presented our work High-Resolution Image Editing via Multi-Stage Blended Diffusion from my internship in PFN at the NeurIPS Machine Learning for Creativity and Design Workshop :tada:

latest posts

publications

  1. RLC
    recursive_reward_aggregation.png
    Recursive Reward Aggregation
    Yuting Tang, Yivan Zhang, Johannes Ackermann, Yu-Jie Zhang, Soichiro Nishimori, and Masashi Sugiyama
    In Reinforcement Learning Conference (RLC) 2025 , Aug 2025
  2. RLC
    pu_offlinerl.png
    Offline Reinforcement Learning with Domain-Unlabeled Data
    Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, and Masashi Sugiyama
    In Reinforcement Learning Conference (RLC) 2025 , Aug 2025
  3. RLC
    nonstationary_dataset.png
    Offline Reinforcement Learning from Datasets with Structured Non-Stationarity
    Johannes Ackermann, Takayuki Osa, and Masashi Sugiyama
    In Reinforcement Learning Conference (RLC) 2024 , Aug 2024
  4. msbd_overview.jpg
    High-Resolution Image Editing via Multi-Stage Blended Diffusion
    Johannes Ackermann, and Minjun Li
    In NeurIPS Machine Learning for Creativity and Design Workshop 2022 , Dec 2022
  5. Unsupervised Task Clustering for Multi-Task Reinforcement Learning
    Johannes Ackermann, Oliver Richter, and Roger Wattenhofer
    In ECML-PKDD 2021 , Sep 2021
  6. Convolutional Neural Network Based Blind Estimation of Generalized Mutual Information for Optical Communication
    Johannes Ackermann, Maximilian Schädler, and Christian Blümm
    In European Conference on Optical Communication (ECOC) , Dec 2020
  7. estimation_matd3.png
    Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics
    Johannes Ackermann, Volker Gabler, Takayuki Osa, and Masashi Sugiyama
    In Deep Reinforcement Learning Workshop at NeurIPS , Dec 2019