TY - GEN
T1 - An adaptive-learning framework for semi-cooperative multi-agent coordination
AU - Boukhtouta, Abdeslem
AU - Berger, Jean
AU - Powell, Warren Buckler
AU - George, Abraham
PY - 2011
Y1 - 2011
N2 - Complex problems involving multiple agents exhibit varying degrees of cooperation. The levels of cooperation might reflect both differences in information as well as differences in goals. In this research, we develop a general mathematical model for distributed, semi-cooperative planning and suggest a solution strategy which involves decomposing the system into subproblems, each of which is specified at a certain period in time and controlled by an agent. The agents communicate marginal values of resources to each other, possibly with distortion. We design experiments to demonstrate the benefits of communication between the agents and show that, with communication, the solution quality approaches that of the ideal situation where the entire problem is controlled by a single agent.
AB - Complex problems involving multiple agents exhibit varying degrees of cooperation. The levels of cooperation might reflect both differences in information as well as differences in goals. In this research, we develop a general mathematical model for distributed, semi-cooperative planning and suggest a solution strategy which involves decomposing the system into subproblems, each of which is specified at a certain period in time and controlled by an agent. The agents communicate marginal values of resources to each other, possibly with distortion. We design experiments to demonstrate the benefits of communication between the agents and show that, with communication, the solution quality approaches that of the ideal situation where the entire problem is controlled by a single agent.
KW - Multi-agent
KW - approximate dynamic programming
KW - cooperative
KW - learning
UR - http://www.scopus.com/inward/record.url?scp=80052228198&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80052228198&partnerID=8YFLogxK
U2 - 10.1109/ADPRL.2011.5967386
DO - 10.1109/ADPRL.2011.5967386
M3 - Conference contribution
AN - SCOPUS:80052228198
SN - 9781424498888
T3 - IEEE SSCI 2011: Symposium Series on Computational Intelligence - ADPRL 2011: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
SP - 324
EP - 331
BT - IEEE SSCI 2011
T2 - Symposium Series on Computational Intelligence, IEEE SSCI2011 - 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2011
Y2 - 11 April 2011 through 15 April 2011
ER -