Expected Return Symmetries: A New Frontier in Artificial Intelligence Research

Wednesday 19 March 2025


The pursuit of symmetry has long been a fundamental aspect of artificial intelligence research, particularly in the realm of deep learning. By incorporating symmetries into models, researchers have demonstrated improved generalization and robustness across various domains. However, the efficient discovery of environment symmetries remains an open problem, especially for decentralized partially observable Markov decision processes.


Enter expected return symmetries (ER symmetries), a newly proposed class of previously unexplored symmetries that contain environmental symmetries as a subset. By training agents to be compatible under this broader group, researchers have achieved better zero-shot coordination results compared to traditional environment symmetry-based methods.


The key innovation lies in the development of algorithms designed to learn ER symmetries from data. These algorithms, such as Algorithm 1 and Algorithm 2, employ policy gradients to optimize expected return and enforce compositional closure and invertibility. This is achieved by iteratively sampling local action transpositions, updating parameters, and evaluating performance.


The authors’ approach has been tested on the challenging Hanabi game, a cooperative card game that requires players to work together under partial information. By applying ER symmetries to Hanabi OP agents, researchers have demonstrated improved coordination and reduced exploitation of asymmetric information.


One notable aspect of this research is its focus on interpretability. The authors provide a conditional action matrix for OPΦMDP-optimal and OPΦER-optimal policies, illustrating the differences in strategy between these two approaches. This level of transparency allows for a deeper understanding of how ER symmetries influence agent behavior.


The implications of this work are far-reaching. By exploring the realm of expected return symmetries, researchers can develop more effective coordination strategies for decentralized decision-making problems. This has significant potential applications in fields such as robotics, autonomous vehicles, and multi-agent systems.


While there is still much to be discovered about ER symmetries, the authors’ efforts have laid a solid foundation for further research. As the field of AI continues to evolve, it will be exciting to see how this work inspires new approaches to coordination and symmetry in deep learning.


Cite this article: “Expected Return Symmetries: A New Frontier in Artificial Intelligence Research”, The Science Archive, 2025.


Artificial Intelligence, Deep Learning, Symmetry, Markov Decision Processes, Partial Observability, Decentralization, Expected Return, Coordination, Robustness, Generalization


Reference: Darius Muglich, Johannes Forkel, Elise van der Pol, Jakob Foerster, “Expected Return Symmetries” (2025).


Leave a Reply