Unilateral incentive alignment in two-agent stochastic games

  • Alex McAvoy
  • Udari Madhushani Sehwag
  • Christian Hilbe
  • Krishnendu Chatterjee
  • Wolfram Barfuss
  • Qi Su
  • Naomi Ehrich Leonard
  • Joshua B. Plotkin

Research output: Contribution to journal › Article › peer-review

Abstract

Multiagent learning is challenging when agents face mixed-motivation interactions, where conflicts of interest arise as agents independently try to optimize their respective outcomes. Recent advancements in evolutionary game theory have identified a class of “zero-determinant” strategies, which confer an agent with significant unilateral control over outcomes in repeated games. Building on these insights, we present a comprehensive generalization of zero-determinant strategies to stochastic games, encompassing dynamic environments. We propose an algorithm that allows an agent to discover strategies enforcing predetermined linear (or approximately linear) payoff relationships. Of particular interest is the relationship in which both payoffs are equal, which serves as a proxy for fairness in symmetric games. We demonstrate that an agent can discover strategies enforcing such relationships through experience alone, without coordinating with an opponent. In finding and using such a strategy, an agent (“enforcer”) can incentivize optimal and equitable outcomes, circumventing potential exploitation. In particular, from the opponent’s viewpoint, the enforcer transforms a mixed-motivation problem into a cooperative problem, paving the way for more collaboration and fairness in multiagent systems.
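
To make the idea of unilaterally enforcing a linear payoff relationship concrete, the following is a minimal sketch in the classic repeated prisoner's dilemma, not the stochastic-game algorithm described in the abstract. It assumes standard payoff values (T = 5, R = 3, P = 1, S = 0), undiscounted long-run average payoffs, and two hypothetical memory-one opponents chosen only for illustration. The enforcer plays tit-for-tat, which is commonly cited as a fair zero-determinant strategy in this setting: because it copies the opponent's previous action, both players cooperate equally often in the long run, so their average payoffs coincide regardless of the opponent's strategy. All function and variable names are illustrative assumptions.

```python
# Illustrative sketch only: classic repeated prisoner's dilemma,
# not the stochastic-game algorithm from the article.

# States are the previous round's actions, written (enforcer, opponent).
STATES = ["CC", "CD", "DC", "DD"]

# Assumed standard payoff values with T > R > P > S.
R, S, T, P = 3.0, 0.0, 5.0, 1.0
PAYOFF_X = {"CC": R, "CD": S, "DC": T, "DD": P}  # enforcer
PAYOFF_Y = {"CC": R, "CD": T, "DC": S, "DD": P}  # opponent


def transition_matrix(p, q):
    """Build the 4x4 Markov chain for two memory-one strategies.

    p[s] is the enforcer's probability of cooperating after state s.
    q[s] is the opponent's probability of cooperating after state s,
    with s still written as (enforcer action, opponent action).
    """
    matrix = []
    for s in STATES:
        px, qy = p[s], q[s]
        matrix.append([
            px * qy,              # next state CC
            px * (1 - qy),        # next state CD
            (1 - px) * qy,        # next state DC
            (1 - px) * (1 - qy),  # next state DD
        ])
    return matrix


def stationary_distribution(matrix, iterations=5000):
    """Approximate the stationary distribution by power iteration."""
    v = [0.25] * 4
    for _ in range(iterations):
        v = [sum(v[i] * matrix[i][j] for i in range(4)) for j in range(4)]
    return v


def long_run_payoffs(p, q):
    """Return the long-run average payoffs (enforcer, opponent)."""
    v = stationary_distribution(transition_matrix(p, q))
    pi_x = sum(v[i] * PAYOFF_X[s] for i, s in enumerate(STATES))
    pi_y = sum(v[i] * PAYOFF_Y[s] for i, s in enumerate(STATES))
    return pi_x, pi_y


# Tit-for-tat: cooperate exactly when the opponent cooperated last round.
TIT_FOR_TAT = {"CC": 1.0, "CD": 0.0, "DC": 1.0, "DD": 0.0}

# Two hypothetical opponents with fully mixed strategies, so the
# resulting Markov chain has a unique stationary distribution.
OPPONENTS = [
    {"CC": 0.9, "CD": 0.6, "DC": 0.3, "DD": 0.1},
    {"CC": 0.2, "CD": 0.8, "DC": 0.5, "DD": 0.4},
]

for q in OPPONENTS:
    pi_x, pi_y = long_run_payoffs(TIT_FOR_TAT, q)
    print(f"enforcer: {pi_x:.4f}  opponent: {pi_y:.4f}")
```

In this sketch the two printed payoffs should agree for each opponent, although their common value depends on how the opponent plays. That dependence is the incentive-alignment effect the abstract refers to: the opponent can raise its own payoff only by raising the enforcer's as well.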

Original language: English (US)
Article number: e2319927121
Journal: Proceedings of the National Academy of Sciences of the United States of America
Volume: 122
Issue number: 25
DOIs
State: Published - Jun 24 2025

All Science Journal Classification (ASJC) codes

  • General

Keywords

  • fairness
  • mixed-motivation interaction
  • stochastic game
  • zero-determinant strategy
