TY - GEN
T1 - Measuring Cooperation with Counterfactual Planning
AU - Barnett, Samuel A.
AU - Wantlin, Kathryn
AU - Adams, Ryan P.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026
Y1 - 2026
N2 - Cooperative behavior is commonly understood as that which is conducive to the good of the group: it is increasingly seen as a crucial component of advancing the capabilities as well as mitigating the harms of multi-agent AI systems [6, 10, 21]. Yet an “I’ll-know-it-when-I-see-it” approach is often taken when evaluating the cooperativeness of a sequence of actions, and even when cooperation is formalized, the definitions lead to category errors, conceptual confusions, and erroneous conclusions [11, 22, 52, 56]. We propose a formal measure of cooperation in stochastic games that avoids these pitfalls by being counterfactually contrastive, contextual, and customizable: in particular, cooperation is defined in contrast to the outcome that a self-interested actor would have effected in a similar circumstance, in the context of other agents’ behavior, and within a specified time and space horizon. This measure is simple to compute: it is dependent only on solving a reduction of the multi-agent game to a single-agent Markov decision process. We apply this measure to a diverse pool of behaviors in a number of mixed-motive social dilemmas and sequential predator-prey environments that have been studied in the multi-agent systems literature [4, 15, 26, 34, 36]. Our results demonstrate the importance of defining cooperation clearly, and provide a useful metric for builders of cooperative systems to use when establishing the cooperative nature of the system behavior.
AB - Cooperative behavior is commonly understood as that which is conducive to the good of the group: it is increasingly seen as a crucial component of advancing the capabilities as well as mitigating the harms of multi-agent AI systems [6, 10, 21]. Yet an “I’ll-know-it-when-I-see-it” approach is often taken when evaluating the cooperativeness of a sequence of actions, and even when cooperation is formalized, the definitions lead to category errors, conceptual confusions, and erroneous conclusions [11, 22, 52, 56]. We propose a formal measure of cooperation in stochastic games that avoids these pitfalls by being counterfactually contrastive, contextual, and customizable: in particular, cooperation is defined in contrast to the outcome that a self-interested actor would have effected in a similar circumstance, in the context of other agents’ behavior, and within a specified time and space horizon. This measure is simple to compute: it is dependent only on solving a reduction of the multi-agent game to a single-agent Markov decision process. We apply this measure to a diverse pool of behaviors in a number of mixed-motive social dilemmas and sequential predator-prey environments that have been studied in the multi-agent systems literature [4, 15, 26, 34, 36]. Our results demonstrate the importance of defining cooperation clearly, and provide a useful metric for builders of cooperative systems to use when establishing the cooperative nature of the system behavior.
KW - Cooperation
KW - Multi-agent Systems
KW - Reinforcement Learning
KW - Social Dilemmas
UR - https://www.scopus.com/pages/publications/105020265069
UR - https://www.scopus.com/pages/publications/105020265069#tab=citedBy
U2 - 10.1007/978-3-032-08064-6_2
DO - 10.1007/978-3-032-08064-6_2
M3 - Conference contribution
AN - SCOPUS:105020265069
SN - 9783032080639
T3 - Lecture Notes in Computer Science
SP - 23
EP - 41
BT - Game Theory and AI for Security - 16th International Conference, GameSec 2025, Proceedings
A2 - Baras, John S.
A2 - Papavassiliou, Symeon
A2 - Tsiropoulou, Eirini Eleni
A2 - Sayin, Muhammed O.
PB - Springer Science and Business Media Deutschland GmbH
T2 - 16th International Conference on Game Theory and AI for Security, GameSec 2025
Y2 - 13 October 2025 through 15 October 2025
ER -