SCOPUS Information Retrieval Platform

Journal of Artificial Intelligence Research

Volume 33, 2008, Pages 521-549

A multiagent reinforcement learning algorithm with non-linear dynamics

(2) Abdallah, Sherief (a); Lesser, Victor (b)

(a) The British University in Dubai (United Arab Emirates)

(b) University of Massachusetts Amherst (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATION THEORY; DIFFERENTIAL EQUATIONS; DYNAMICS; FERTILIZERS; GAME THEORY; INTELLIGENT AGENTS; MACHINE LEARNING; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING;

DYNAMICS DIFFERENTIAL EQUATIONS; LEARNING AGENTS; MULTI-AGENT REINFORCEMENT LEARNING; MULTIAGENT REINFORCEMENT LEARNING ALGORITHM; NASH EQUILIBRIA; NON-LINEAR DYNAMICS; PIECEWISE LINEAR; STATE-OF-THE-ART ALGORITHMS;

LEARNING ALGORITHMS;

EID: 70350699723 PISSN: None EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.2628 Document Type: Article

Times cited : (86)

References (21)

1
- 34247227200
- Learning the task allocation game
- Abdallah, S., & Lesser, V. (2006). Learning the task allocation game. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 850-857.
- (2006) Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 850-857
- Abdallah, S.¹ Lesser, V.²

2
- 57349106030
- Multiagent reinforcement learning and self-organization in a network of agents
- Abdallah, S., & Lesser, V. (2007). Multiagent reinforcement learning and self-organization in a network of agents. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems.
- (2007) Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems
- Abdallah, S.¹ Lesser, V.²

3
- 84899897951
- Non-linear dynamics in multiagent reinforcement learning algorithms
- Abdallah, S., & Lesser, V. (2008). Non-linear dynamics in multiagent reinforcement learning algorithms. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 1321-1324.
- (2008) Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 1321-1324
- Abdallah, S.¹ Lesser, V.²

4
- 1142280919
- Adaptive policy gradient in multiagent learning
- Banerjee, B., & Peng, J. (2003). Adaptive policy gradient in multiagent learning. In Proceedings of the International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 686-692.
- (2003) Proceedings of the International Joint Conference on Autonomous Agents and Multi Agent Systems , pp. 686-692
- Banerjee, B.¹ Peng, J.²

5
- 35248823118
- Generalized multiagent learning with performance bound
- Banerjee, B., & Peng, J. (2007). Generalized multiagent learning with performance bound. Autonomous Agents and Multiagent Systems, 15(3), 281-312.
- (2007) Autonomous Agents and Multiagent Systems , vol.15 , Issue.3 , pp. 281-312
- Banerjee, B.¹ Peng, J.²

6
- 31844436490
- Tech. rep., University of Alberta
- Bowling, M. (2004). Convergence and no-regret in multiagent learning. Tech. rep., University of Alberta.
- (2004) Convergence and No-regret in Multiagent Learning
- Bowling, M.¹

7
- 84899027977
- Convergence and no-regret in multiagent learning
- Bowling, M. (2005). Convergence and no-regret in multiagent learning. In Proceedings of the Annual Conference on Advances in Neural Information Processing Systems, pp. 209-216.
- (2005) Proceedings of the Annual Conference on Advances in Neural Information Processing Systems , pp. 209-216
- Bowling, M.¹

8
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M., & Veloso, M. (2002). Multiagent learning using a variable learning rate. Artificial Intelligence, 136(2), 215-250.
- (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

9
- 0000719863
- Packet routing in dynamically changing networks: A reinforcement learning approach
- Boyan, J. A., & Littman, M. L. (1994). Packet routing in dynamically changing networks: A reinforcement learning approach. In Proceedings of the Annual Conference on Advances in Neural Information Processing Systems, pp. 671-678.
- (1994) Proceedings of the Annual Conference on Advances in Neural Information Processing Systems , pp. 671-678
- Boyan, J.A.¹ Littman, M.L.²

10
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C., & Boutilier, C. (1998). The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the National Conference on Artificial Intelligence/Innovative Applications of Artificial Intelligence, pp. 746-752.
- (1998) Proceedings of the National Conference on Artificial Intelligence/Innovative Applications of Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

11
- 34147159616
- AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
- Conitzer, V., & Sandholm, T. (2007). AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. Machine Learning, 67(1-2), 23-43.
- (2007) Machine Learning , vol.67 , Issue.1-2 , pp. 23-43
- Conitzer, V.¹ Sandholm, T.²

12
- 31144432283
- Cooperative information sharing to improve distributed learning in multi-agent systems
- Dutta, P. S., Jennings, N. R., & Moreau, L. (2005). Cooperative information sharing to improve distributed learning in multi-agent systems. Journal of Artificial Intelligence Research, 24, 407-463.
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 407-463
- Dutta, P.S.¹ Jennings, N.R.² Moreau, L.³

13
- 4644369748
- Nash Q-learning for general-sum stochastic games
- Hu, J., & Wellman, M. P. (2003). Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 4, 1039-1069.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.P.²

14
- 0004178386
- Prentice-Hall, Upper Saddle River, NJ, USA
- Khalil, H. K. (2002). Nonlinear Systems. Prentice-Hall, Upper Saddle River, NJ, USA.
- (2002) Nonlinear Systems
- Khalil, H.K.¹

15
- 0001547175
- Value-function reinforcement learning in Markov games
- Littman, M. (2001). Value-function reinforcement learning in Markov games. Cognitive Systems Research, 2(12), 55-66.
- (2001) Cognitive Systems Research , vol.2 , Issue.12 , pp. 55-66
- Littman, M.¹

16
- 0012646255
- Learning to cooperate via policy search
- Peshkin, L., Kim, K.-E., Meuleau, N., & Kaelbling, L. P. (2000). Learning to cooperate via policy search. In Proceedings of the Conference on Uncertainty in Artificial Intelligence, pp. 307-314.
- (2000) Proceedings of the Conference on Uncertainty in Artificial Intelligence , pp. 307-314
- Peshkin, L.¹ Kim, K.-E.² Meuleau, N.³ Kaelbling, L.P.⁴

17
- 0001644761
- Nash convergence of gradient dynamics in general-sum games
- Singh, S., Kearns, M., & Mansour, Y. (2000). Nash convergence of gradient dynamics in general-sum games. In Proceedings of the Conference on Uncertainty in Artificial Intelligence, pp. 541-548.
- (2000) Proceedings of the Conference on Uncertainty in Artificial Intelligence , pp. 541-548
- Singh, S.¹ Kearns, M.² Mansour, Y.³

18
- 0004102479
- MIT Press
- Sutton, R., & Barto, A. (1999). Reinforcement Learning: An Introduction. MIT Press.
- (1999) Reinforcement Learning: An Introduction.
- Sutton, R.¹ Barto, A.²

19
- 31344450384
- An evolutionary dynamical analysis of multi-agent learning in iterated games
- Tuyls, K., 't Hoen, P. J., & Vanschoenwinkel, B. (2006). An evolutionary dynamical analysis of multi-agent learning in iterated games. Autonomous Agents and Multi-Agent Systems, 12(1), 115-153.
- (2006) Autonomous Agents and Multi-Agent Systems , vol.12 , Issue.1 , pp. 115-153
- Tuyls, K.¹ 'T Hoen, P.J.² Vanschoenwinkel, B.³

20
- 27744448185
- Reinforcement learning to play an optimal Nash equilibrium in team Markov games
- Wang, X., & Sandholm, T. (2003). Reinforcement learning to play an optimal Nash equilibrium in team Markov games. In Proceedings of the Annual Conference on Advances in Neural Information Processing Systems, pp. 1571-1578.
- (2003) Proceedings of the Annual Conference on Advances in Neural Information Processing Systems , pp. 1571-1578
- Wang, X.¹ Sandholm, T.²

21
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Zinkevich, M. (2003). Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the International Conference on Machine Learning, pp. 928-936.
- (2003) Proceedings of the International Conference on Machine Learning , pp. 928-936
- Zinkevich, M.¹

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.