Filippos Christianos

University of Edinburgh, School of Informatics.

Currently on an internship with NVIDIA Research, on autonomous vehicles.

I am a PhD student in the CDT for Robotics and Autonomous Agents, advised by Stefano Albrecht (University of Edinburgh) and a member of the Autonomous Agents Research Group.

My PhD research is in the area of Multi-Agent Deep Reinforcement Learning. In particular, I study how multiple agents can efficiently explore and learn in environments with sparse rewards.

I am the author and maintainer of the Multi-Robot Warehouse environment for multi-agent RL research. I also developed and maintain the Python version of Level-based Foraging. Our group has been using both environments to develop new and exciting algorithms for MARL. I am the first author of two such algorithms: Shared Experience Actor-Critic (SEAC), and Selective Parameter Sharing (SePS) that have been published in NeurIPS (2020) and ICML (2021) respectively.

Keywords: Machine Learning, Deep Reinforcement Learning (RL), Multi-agent Systems, Exploration in RL.


Jun 23, 2022 I joined NVIDIA Research for a three month internship on autonomous vehicles!
Dec 20, 2021 :newspaper_roll: Our paper titled “Decoupling Exploitation and Intrinsically-Motivated Exploration in Reinforcement Learning” has been accepted in AAMAS 2022!
Sep 27, 2021 :newspaper_roll: Another paper accepted at NeurIPS 2021: Agent Modelling under Partial Observability for Deep Reinforcement Learning.
Jul 29, 2021 :newspaper_roll: Our benchmarking paper for MARL has been accepted at NeurIPS 2021: Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks.
May 10, 2021 :newspaper_roll::newspaper_roll: Two new papers accepted at ICML 2021: Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing and Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning.

selected publications

  1. ICML
    Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
    Christianos Filippos, Papoudakis Georgios, Rahman Arrasy, and Albrecht Stefano
    In Proceedings of the 38th International Conference on Machine Learning, 2021
  2. NeurIPS
    Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
    Papoudakis Georgios *, Christianos Filippos *, Schäfer Lukas, and Albrecht Stefano V.
    In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, 2021
  3. NeurIPS
    Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
    Christianos Filippos, Schäfer Lukas, and Albrecht Stefano
    In Advances in Neural Information Processing Systems, 2020
  4. ECAI
    Employing Hypergraphs for Efficient Coalition Formation with Application to the V2G Problem
    Christianos Filippos, and Chalkiadakis Georgios
    In Proceedings of the Twenty-Second European Conference on Artificial Intelligence, 2016