Johannes Ackermann

I am a fourth-year PhD student at the University of Tokyo, working on Reinforcement Learning supervised by Professor Masashi Sugiyama.

I am particularly interested in how we can deal with changing or complicated reward functions and dynamics in RL, as encountered in Multi-Agent interaction, Multi-Task settings or due to Non-Stationarity.

I am also employed by RIKEN AIP as a part-time researcher and supported by a Japanese government MEXT scholarship, and a Microsoft Research Asia D-CORE grant.

I previously interned at Sakana AI in 2025 and at Preferred Networks in 2022. Previously I worked as a DSP researcher in Huawei’s Munich Research Center, working on applied Machine Learning. Before that, I received my B.Sc. and M.Sc. in Electrical Engineering and Information Technology from the Technical University of Munich in 2018 and 2021. I wrote my Master’s Thesis about Multi-Task RL in ETH Zurich’s Distributed Computing group.

I’m always happy to chat about research, so feel free to reach out by e-mail or socials!

news

May 10, 2025	Two papers 1, 2 accepted at RLC 2025
May 17, 2024	Our work on Offline Reinforcement Learning from Datasets with Structured Non-Stationarity was accepted at RLC 2024
Dec 05, 2022	I presented our work High-Resolution Image Editing via Multi-Stage Blended Diffusion from my internship in PFN at the NeurIPS Machine Learning for Creativity and Design Workshop

latest posts

Apr 20, 2022	Building a Text to Image Web App

publications

RLC

Recursive Reward Aggregation

Yuting Tang, Yivan Zhang, Johannes Ackermann, Yu-Jie Zhang, Soichiro Nishimori, and Masashi Sugiyama

In Reinforcement Learning Conference (RLC) 2025 , Aug 2025

PDF
RLC

Offline Reinforcement Learning with Domain-Unlabeled Data

Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, and Masashi Sugiyama

In Reinforcement Learning Conference (RLC) 2025 , Aug 2025

arXiv
RLC

Offline Reinforcement Learning from Datasets with Structured Non-Stationarity

Johannes Ackermann, Takayuki Osa, and Masashi Sugiyama

In Reinforcement Learning Conference (RLC) 2024 , Aug 2024

arXiv Website
High-Resolution Image Editing via Multi-Stage Blended Diffusion

Johannes Ackermann, and Minjun Li

In NeurIPS Machine Learning for Creativity and Design Workshop 2022 , Dec 2022

arXiv Blog
ECML-PKDD

Unsupervised Task Clustering for Multi-Task Reinforcement Learning

Johannes Ackermann, Oliver Richter, and Roger Wattenhofer

In ECML-PKDD 2021 , Sep 2021

PDF Video
ECOC

Convolutional Neural Network Based Blind Estimation of Generalized Mutual Information for Optical Communication

Johannes Ackermann, Maximilian Schädler, and Christian Blümm

In European Conference on Optical Communication (ECOC) , Dec 2020

PDF
Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

Johannes Ackermann, Volker Gabler, Takayuki Osa, and Masashi Sugiyama

In Deep Reinforcement Learning Workshop at NeurIPS , Dec 2019

arXiv