Dr. Thomy Phan Lehrstuhl für Mobile und Verteilte Systeme Ludwig-Maximilians-Universität München, Institut für Informatik Oettingenstraße 67 Raum G004 Telefon: +49 89 / 2180-9167 Fax: +49 89 / 2180-9148 Mail: thomy.phan@ifi.lmu.de |
Research Interests
- Technology and Research on Artificial Intelligence Laboratory (TRAIL)
- Artificial Intelligence
- Multiagent Systems
- Reinforcement Learning
- Adversarial Learning
- Automated Planning
❗For more information about my research, please visit my website
❗I have joined Sven Koenig’s Lab at University of Southern California in July 2023.
Teaching
- Autonomous Systems (Practical Course): SS 2019, WS 2019/20, SS 2020, WS 2020/21, SS 2021, WS 2021/22, SS 2022, WS 2022/23
- Mobile and Distributed Systems (Practical Course): SS 2018, WS 2018/19
- Artificial Intelligence (Working Group): SS 2019, WS 2019/20, SS 2020, WS 2020/21, SS 2021, WS 2021/22, SS 2022, WS 2022/23
Community
- 26th European Conference on Artificial Intelligence (ECAI 2023): Program Committee
- 37th Conference on Neural Information Processing Systems (NeurIPS 2023): Peer Review
- 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023): Program Committee (Main Track and AI & Social Good Track)
- 40th International Conference on Machine Learning (ICML 2023): Peer Review
- 22nd Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023): Program Committee (BlueSky Ideas Track), Peer Review (Main Track)
- 37th AAAI Conference on Artificial Intelligence (AAAI 2023): Program Committee
- 36th Conference on Neural Information Processing Systems (NeurIPS 2022): Peer Review
- 11th International Symposium On Leveraging Applications of Formal Methods (ISoLA 2022): Peer Review (Track – Rigorous Engineering of Collective Adaptive Systems)
- 39th International Conference on Machine Learning (ICML 2022): Peer Review (Top 10%)
- 36th AAAI Conference on Artificial Intelligence (AAAI 2022): Program Committee
- PLOS ONE 2021: Peer Review
- 35th AAAI Conference on Artificial Intelligence (AAAI 2021): Program Committee
- International Journal on Software Tools for Technology Transfer (STTT REoCAS 2020): Peer Review
- 1st International Symposium on Applied Artificial Intelligence (ISAAI 2019): Organizing Committee
- 8th International Symposium On Leveraging Applications of Formal Methods (ISoLA 2018): Peer Review (Track – Rigorous Engineering of Collective Adaptive Systems)
Selected Publications
- Thomy Phan, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Nüßlein, Michael Kölle, Thomas Gabor, and Claudia Linnhoff-Popien, „Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability“, in 40th International Conference on Machine Learning (ICML ’23), 2023, to appear. [PDF][source]
- Philipp Altmann, Leonard Feuchtinger, Fabian Ritz, Jonas Nüßlein, Thomy Phan, and Claudia Linnhof-Popien, „CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing“, in 32nd International Joint Conference on Artificial Intelligence (IJCAI ’23), 2023, to appear.
- Thomy Phan, Fabian Ritz, Jonas Nüßlein, Michael Kölle, Thomas Gabor, and Claudia Linnhoff-Popien, „Attention-Based Recurrence for Multi-Agent Reinforcement Learning under State Uncertainty (Extended Abstract)“, in 22nd Conference on Autonomous Agents and Multiagent Systems (AAMAS ’23), 2023, to appear. [extended preprint][source]
- Felip Guimerà Cuevas, Thomy Phan, Helmut Schmid, „Adaptive Bi-Nonlinear Neural Networks Based on Complex Numbers with Weights Constrained along the Unit Circle“, in 26th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD ’23), 2023. [PDF]
- Thomy Phan, Felix Sommer, Philipp Altmann, Fabian Ritz, Lenz Belzner, and Claudia Linnhoff-Popien, „Emergent Cooperation from Mutual Acknowledgment Exchange“, in 21st Conference on Autonomous Agents and Multiagent Systems (AAMAS ’22), 2022, pp. 1047–1055. [PDF][source][highlight paper]
- Robert Müller, Steffen Illium, Thomy Phan, Tom Haider, and Claudia Linnhoff-Popien, „Towards Anomaly Detection in Reinforcement Learning“, in 21st Conference on Autonomous Agents and Multiagent Systems (AAMAS ’22 Blue Sky Ideas), 2022, pp. 1799–1803. [PDF]
- Thomy Phan, Fabian Ritz, Lenz Belzner, Philipp Altmann, Thomas Gabor, and Claudia Linnhoff-Popien, „VAST: Value Function Factorization with Variable Agent Sub-Teams“, in 35th Conference on Neural Information Processing Systems (NeurIPS ’21), 2021, pp. 24018-24032. [PDF][source]
- Thomy Phan, Lenz Belzner, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, and Claudia Linnhoff-Popien, „Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition“, in 35th AAAI Conference on Artificial Intelligence (AAAI ’21), 2021, pp. 11308–11316. [PDF][source]
- Fabian Ritz, Daniel Ratke, Thomy Phan, Lenz Belzner, and Claudia Linnhoff-Popien, „A Sustainable Ecosystem through Emergent Cooperation in Multi-Agent Reinforcement Learning“, in Conference on Artificial Life (ALIFE ’21), 2021. [PDF][source]
- Thomas Gabor, Thomy Phan, and Claudia Linnhoff-Popien, „Productive Fitness in Diversity-Aware Evolutionary Algorithms“, in Natural Computing (NACO ’21), 2021. [PDF]
- Fabian Ritz, Felix Hohnstein, Robert Müller, Thomy Phan, Thomas Gabor, Claudia Linnhoff-Popien, and Carsten Hahn, „Towards Ecosystem Management from Greedy Reinforcement Learning in a Predator-Prey Setting“, in Conference on Artificial Life (ALIFE ’20), 2020, pp. 518-525. [PDF]
- Carsten Hahn, Fabian Ritz, Paula Wikidal, Thomy Phan, Thomas Gabor, and Claudia Linnhoff-Popien, „Foraging Swarms using Multi-Agent Reinforcement Learning“, in Conference on Artificial Life (ALIFE ’20), 2020, pp. 333–340. [PDF]
- Thomy Phan, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner Schmid, Jan Wieghardt, Marc Zeller, and Claudia Linnhoff-Popien, „Learning and Testing Resilience in Cooperative Multi-Agent Systems“, in 19th Conference on Autonomous Agents and Multiagent Systems (AAMAS ’20), 2020, pp. 1055-1063. [PDF][video]
- Thomy Phan, Lenz Belzner, Kyrill Schmid, Thomas Gabor, Fabian Ritz, Sebastian Feld, and Claudia Linnhoff-Popien, „A Distributed Policy Iteration Scheme for Cooperative Multi-Agent Policy Approximation“, in Adaptive and Learning Agents Workshop (ALA@AAMAS ’20), 2020. [PDF][video]
- Christoph Roch, Thomy Phan, Sebastian Feld, Robert Müller, Thomas Gabor, Carsten Hahn, and Claudia Linnhoff-Popien, „A Quantum Annealing Algorithm for Finding Pure Nash Equilibria in Graphical Games“, in 20th International Conference on Computational Science (ICCS ’20), 2020. [PDF]
- Thomas Gabor, Andreas Sedlmeier, Thomy Phan, Fabian Ritz, Marie Kiermeier, Lenz Belzner, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner Schmid, Jan Wieghardt, Marc Zeller, and Claudia Linnhoff-Popien, „The Scenario Coevolution Paradigm: Adaptive Quality Assurance for Adaptive Systems“, in International Journal on Software Tools for Technology Transfer (STTT ’20), 2020, pp. 457-476. [PDF]
- Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, and Claudia Linnhoff-Popien, „Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning“, in 28th International Joint Conference on Artificial Intelligence (IJCAI ’19), 2019, pp. 5607-5613. [PDF][source]
- Thomas Gabor, Jan Peter, Thomy Phan, Christian Meyer, and Claudia Linnhoff-Popien, „Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search“, in 28th International Joint Conference on Artificial Intelligence (IJCAI ’19), 2019, pp. 5562-5568. [PDF][source]
- Carsten Hahn, Thomy Phan, Thomas Gabor, Lenz Belzner, and Claudia Linnhoff-Popien, „Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning“, in Conference on Artificial Life (ALIFE ’19), 2019, pp. 598–605. [PDF]
- Thomas Gabor, Andreas Sedlmeier, Marie Kiermeier, Thomy Phan, Marcel Henrich, Monika Picklmair, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner Schmid, and Jan Wieghardt, „Scenario Co-Evolution for Reinforcement Learning on a GridWorld Smart Factory Domain“, in 28th Genetic and Evolutionary Computation Conference (GECCO ’19), 2019, pp. 898-906. [PDF]
- Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld, and Claudia Linnhoff-Popien, „Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies (Extended Abstract)“, in 18th Conference on Autonomous Agents and Multiagent Systems (AAMAS ’19), 2019, pp. 2162-2164. [PDF][extended preprint]
- Thomy Phan, Lenz Belzner, Marie Kiermeier, Markus Friedrich, Kyrill Schmid, and Claudia Linnhoff-Popien, „Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling“, in 33rd AAAI Conference on Artificial Intelligence (AAAI ’19), 2019, pp. 7941-7948. [PDF][source]
- Thomy Phan, Lenz Belzner, Thomas Gabor, and Kyrill Schmid, „Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation“, in 17th Conference on Autonomous Agents and Multiagent Systems (AAMAS ’18), 2018, pp. 730-738. [PDF]
- Fang Huang*, George Sirinakis*, Edward S. Allgeyer, Lena K. Schroeder, Whitney C. Duim, Emil B. Kromann, Thomy Phan, Felix E. Rivera-Molina, Jordan R. Myers, Irnov Irnov, Mark Lessard, Yongdeng Zhang, Mary Ann Handel, Christine Jacobs-Wagner, C. Patrick Lusk, James E. Rothman, Derek Toomre, Martin J. Booth, Joerg Bewersdorf, „Ultra-High Resolution 3D Imaging of Whole Cells“, in Cell 166, 2016, pp. 1028-1040. *equal authorship [PDF]
Supervised Theses
Master Theses
- Katharina Winter, Philipp Altmann, Thomy Phan, „Consensus-Based Mutual Acknowledgment Token Exchange“, 2023, ongoing
- Balthasar Schüss, Thomy Phan, Michael Kölle, „Generalizing Agents in the Starcraft Multi-Agent Challenge“, 2023
- Alain Feimer, Thomy Phan, Philipp Altmann, „Generalization in Multi-Agent Reinforcement Learning using Minimax Learning“, 2023
- Alexander Perzl, Thomy Phan, Fabian Ritz, “ A Spatial Social Dilemma Environment for Multi-Agent Reinforcement Learning“, 2023
- Arnold Unterauer, Thomy Phan, Philipp Altmann, „Hidden Attacks in Multi-Agent Reinforcement Learning“, 2023
- Nicolas Kraus, Christoph Roch, Thomy Phan, „Classification of Classical and Quantum Algorithms with Artificial Neural Networks“, 2022
- Hasan Turalic, Robert Müller, Thomy Phan, „Decentralized Multi-Agent Communication by Transmitting Cluster Indices“, 2022
- Liliane Kabboura, Thomas Gabor, Thomy Phan, „Reward Shaping for Optimal Exploration in Reinforcement Learning“, 2022
- Nina Czogalla, Thomy Phan, Fabian Ritz, „Adaptive Resilient Multi-Agent Reinforcement Learning“, 2022
- Felix Sommer, Thomy Phan, Philipp Altmann, „Learning Trust in Multi-Agent Systems“, 2021. [paper]
- Korbinian Blanz, Thomy Phan, Thomas Gabor, „Evaluating Resilience in Antagonist-based Multi-Agent Reinforcement Learning“, 2021
- Manuel Zierl, Carsten Hahn, Thomy Phan, Fabian Ritz, „Analysis and Improvement of Communication in Cooperative Reinforcement Learning“, 2021
- Jonas Nüßlein, Thomas Gabor, Thomy Phan, „Reinforcement Learning for Arbitrary Target Observations“, 2021
- Simon Cronjaeger, Thomy Phan, Thomas Gabor, „Neural Architecture Search using Upside Down Reinforcement Learning“, 2021
- Maximilian Wagner, Thomas Gabor, Thomy Phan, „An Evolutionary Algorithm for API Design Optimization in Software Ecosystems“, 2021
- Patrick Börzel, Thomas Gabor, Thomy Phan, „Randomized Fitness in Evolutionary Algorithms“, 2020
- Philipp Altmann, Thomas Gabor, Thomy Phan, „Adversarial Self-Imitation Learning“, 2020
- Felip Guimera Cuevas, Thomy Phan, Thomas Gabor, „Uncorrelated and Prioritized Reservoir Sampling for Reinforcement Learning with Restricted Memory over Arbitrary Time-Spans“, 2020
- Daniel Hämmerle, Andreas Sedlmeier, Thomy Phan, „Uncertainty-based Selection of DQN Agents“, 2020
- Hila Safi, Thomas Gabor, Thomy Phan, „Analyzing Quadratic Unconstrained Binary Optimization (QUBO) Matrices using Neural Networks for Solving NP-complete Problems“, 2020. [paper]
- Max Peters, Thomy Phan, Fabian Ritz, „Assembly of Multi-Agent Formations using Reinforcement Learning“, 2020
- Andreas Mayer, Sebastian Feld, Thomy Phan, „Competitive Spatio-Temporal Search with Mobile Agents“, 2019
- David Gehring, Thomy Phan, Steffen Illium, „Reinforcement Learning with Graph-Convolutional Neural Networks“, 2018
- Christian Ungnadner, Kyrill Schmid, Thomy Phan, „Deep Experience Planner“, 2018
- Jan Peter, Thomas Gabor, Thomy Phan, „Subgoal-Based Temporal Abstractions in Monte-Carlo Tree Search“, 2018. [paper]
- Judith Mosandl, Kyrill Schmid, Thomy Phan, „Application of Hybrid Reward Functions for Complexity Reduction in Technical Real-World Domains“, 2018
Bachelor Theses
- Felix Topp, Michael Kölle, Thomy Phan, „Quantum Multi-Agent Reinforcement Learning using Evolutionary Optimization“, 2023, ongoing
- Julian Schönberger, Thomas Gabor, Thomy Phan, „Genetic Neuron Selection for Deep Neural Networks“, 2023, ongoing
- Carlos Göhring, Thomy Phan, Kyrill Schmid, „Finding Dominant Strategies and Equilibria in Minority Games Using Reinforcement Learning“, 2022
- Paul Seipl, Thomy Phan, Fabian Ritz, „Evaluation of Aggregation Mechanisms for Federated Reinforcement Learning“, 2022
- Weronika Maniszewska, Thomy Phan, Fabian Ritz, „Multi-Agent Reinforcement Learning with Transformer-Based Policies“, 2022
- Marco Börner, Thomy Phan, Philipp Altmann, „Predicting the Optimal Approximation Level for Quantum Annealing“, 2022
- Hyeri An, Thomy Phan, Thomas Gabor, „Evolutionary Subgoal-Discovery for Temporal Abstract Planning“, 2022
- Florian Grzonka, Thomas Gabor, Thomy Phan, „Predicting Hyperparameters for Evolutionary Algorithms using Neural Networks“, 2022
- Tim Hesse, Thomas Gabor, Thomy Phan, „Reinforcement Learning for Imperfect Information Games Using the Trick Card Game Schafkopf as a Case Study“, 2021
- Caroline Reinig, Thomy Phan, Fabian Ritz, „Pretraining of Reinforcement Learning Models for Federated Learning“, 2021
- David Müller, Thomas Gabor, Thomy Phan, „Representing Algorithmic Properties of Learning Algorithms within their Reward Function“, 2021
- Louis Mackenzie-Smith, Fabian Ritz, Thomy Phan, „Creation of a Standard Challenge Dataset for Scalable Supervision in Reinforcement Learning“, 2021
- Christian Reff, Fabian Ritz, Thomy Phan, „Adapting MCTS Planning Algorithms to a Continuous Domain“, 2021
- Marian Lingsch, Thomas Gabor, Thomy Phan, „AI for Programming: Prediction of Code Snippets from UML“, 2021
- David Hansmair, Thomy Phan, Fabian Ritz, „Hexar.io as a Challenging Benchmark for Multi-Agent Reinforcement Learning“, 2020
- Miriam Fischer, Thomy Phan, Robert Müller, Steffen Illium, „Stackelberg Routing with Fairness Considerations“, 2020
- Antonia Halbig, Thomas Gabor, Thomy Phan, „Reinforcement Learning Targeting Arbitrary Expected Rewards“, 2020
- Marc Leichsenring, Thomas Gabor, Thomy Phan, „Multi-headed Modification to the Monte Carlo Tree Search with Variable Search Depth“, 2020
- Marco Philip, Thomy Phan, Thomas Gabor, „Adversarial Planning for Pursuit-Evasion Scenarios“, 2020
- Patricia Gschoßmann, Thomy Phan, Thomas Gabor, „Learning to Play Pommerman with Emergent Communication“, 2020
- Adam Mahmoud, Thomas Gabor, Thomy Phan, „Explainability of a Reinforcement Learning Agent for Autonomous Driving“, 2020
- Elena Terzieva, Thomy Phan, Robert Müller, „Multi-Step Deep Q-Networks with Stacked Target Networks“, 2019
- Sonja Geffert, Thomy Phan, Fabian Ritz, „Pooling of Target Networks in Deep Reinforcement Learning“, 2019
Project Work
- Daniel Ratke, Fabian Ritz, Thomy Phan, „A Sustainable Ecosystem through Emergent Cooperation in Multi-Agent Reinforcement Learning“, 2021. [paper]
- Felip Guimera Cuevas, Thomy Phan, Thomas Gabor, „Complex Value-based Deep Learning and Applications to Reinforcement Learning“, 2021. [paper]
- Daniel Ratke, Thomy Phan, Fabian Ritz, „GRAB Zero“, 2020
- Simon Cronjaeger, Thomy Phan, Robert Müller, „Development of a Predator-Prey Domain for Multi-Agent Reinforcement Learning“, 2020
- Jonas Nüßlein, Thomas Gabor, Thomy Phan, „Upside-Down Reinforcement Learning and Related Approaches“, 2020
- Korbinian Blanz, Thomy Phan, Thomas Gabor, „Implementation of a Testbed for Adversarial Multi-Agent Reinforcement Learning“, 2020
- Michael Kessler, Thomy Phan, Thomas Gabor, „Enhancing Value-based Reinforcement Learning with Tree Search“, 2020
- Johannes Handloser, Kyrill Schmid, Thomy Phan, „Implementation of a Test Environment for Multi-Agent Reinforcement Learning Algorithms“, 2018
on Google Scholar, ResearchGate, and LinkedIn