SCOPUS 정보 검색 플랫폼

International Journal of Knowledge-Based and Intelligent Engineering Systems

Volumn 15, Issue 1, 2011, Pages 55-64

The world of independent learners is not markovian

(3) Laurent, Guillaume J a Matignon, Laëtitia a Fort Piat, N Le a

a CNRS (France)

Author keywords

machine learning; Multi agent system; reinforcement learning

Indexed keywords

FORMAL CONCEPTS; LEARNING AGENTS; LEARNING METHODS; LEARNING PATHS; MARKOVIAN; NON-MARKOVIAN; SINGLE-AGENT;

INTELLIGENT AGENTS; LEARNING ALGORITHMS; MULTI AGENT SYSTEMS;

LEARNING SYSTEMS;

EID: 80052079894 PISSN: 13272314 EISSN: 18758827 Source Type: Journal
DOI: 10.3233/KES-2010-0206 Document Type: Article

Times cited : (137)

References (41)

1
- 0001700171
- A markov decision process
- R. Bellman, A markov decision process, Journal of Mathematical Mechanics 6(1957), 679-684.
- (1957) Journal of Mathematical Mechanics , vol.6 , pp. 679-684
- Bellman, R.¹

2
- 18744371204
- Reinforcement learning in markovian evolutionary games
- V. S. Borkar, Reinforcement learning in markovian evolutionary games, Advances in Complex Systems 5(1) (2002), 55-72.
- (2002) Advances in Complex Systems , vol.5 , Issue.1 , pp. 55-72
- Borkar, V.S.¹

3
- 22944447799
- PhD thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, May
- M. Bowling, Multiagent Learning in the Presence of Agents with Limitations, PhD thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, May 2003.
- (2003) Multiagent Learning in the Presence of Agents with Limitations
- Bowling, M.¹

4
- 0036531878
- Multiagent learning using a variable learning rate
- DOI 10.1016/S0004-3702(02)00121-2, PII S0004370202001212
- M. Bowling and M. Veloso, Multiagent learning using a variable learning rate, Artificial Intelligence 136(2002), 215-250. (Pubitemid 34232184)
- (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

5
- 0003863106
- An analysis of stochastic game theory for multiagent reinforcement learning
- Computer Science Department, Carnegie Mellon University
- M. Bowling and M. M. Veloso, An analysis of stochastic game theory for multiagent reinforcement learning, Technical Report CMU-CS-00-165, Computer Science Department, Carnegie Mellon University, 2000.
- (2000) Technical Report CMU-CS-00-165
- Bowling, M.¹ Veloso, M.M.²

6
- 40949147745
- A comprehensive survey of multiagent reinforcement learning
- DOI 10.1109/TSMCC.2007.913919
- L. Busoniu, R. Babuska and B. De Schutter, A comprehensive survey of multiagent reinforcement learning, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 3838(2) (2008), 156-172. (Pubitemid 351404112)
- (2008) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

7
- 34547223380
- Decentralized reinforcement learning control of a robotic manipulator
- L. Busoniu, R. Babuska and B. De Schutter, Decentralized Reinforcement Learning Control of a Robotic Manipulator, In Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision, pages 1347-1352, 2006.
- (2006) Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision , pp. 1347-1352
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

8
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- AAAI
- C. Claus and C. Boutilier, The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, In Proc. of the National Conference on Artificial Intelligence, pages 746-752. AAAI, 1998.
- (1998) Proc. of the National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

9
- 0003259931
- Improving elevator performance using reinforcement learning
- Cambridge, MA. the MIT Press
- R. H. Crites and A. G. Barto, Improving Elevator Performance using Reinforcement Learning, In Proc. of Advances in Neural Information Processing Systems, Cambridge, MA, 1996. The MIT Press.
- (1996) Proc. of Advances in Neural Information Processing Systems
- Crites, R.H.¹ Barto, A.G.²

10
- 80052097767
- Decentralized reinforcement learning for the online optimization of distributed systems
- C. Weber, M. Elshaw and N. M. Mayer, eds, I-TECH Education and Publishing
- J. Dowling and S. Haridi, Decentralized reinforcement learning for the online optimization of distributed systems, in: Reinforcement Learning: Theory and Applications, C. Weber, M. Elshaw and N. M. Mayer, eds, I-TECH Education and Publishing, 2008, pp. 143-166.
- (2008) Reinforcement Learning: Theory and Applications , pp. 143-166
- Dowling, J.¹ Haridi, S.²

11
- 84880861539
- Predicting and preventing coordination problems in cooperative Q-learning systems
- N. Fulda and D. Ventura, Predicting and Preventing Coordination Problems in Cooperative Q-Learning Systems, In Proceedings of the International Joint Conference on Artificial Intelligence, 2007.
- Proceedings of the International Joint Conference on Artificial Intelligence, 2007
- Fulda, N.¹ Ventura, D.²

12
- 0036932299
- Improving on the reinforcement learning of coordination in cooperative multi-agent systems
- S. Kapetanakis and D. Kudenko, Improving on the Reinforcement Learning of Coordination in Cooperative Multi-Agent Systems, In Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems, Imperial College, London, April 2002.
- Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems, Imperial College, London, April 2002
- Kapetanakis, S.¹ Kudenko, D.²

13
- 33748543203
- Collaborative multiagent reinforcement learning by payoff propagation
- J. R. Kok and N. Vlassis, Collaborative multiagent reinforcement learning by payoff propagation, Journal of Machine Learning Research 7(2006), 1789-1828. (Pubitemid 44373693)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1789-1828
- Kok, J.R.¹ Vlassis, N.²

14
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- Morgan Kaufmann
- M. Lauer and M. Riedmiller, An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems, In Proc. of the Int. Conf. on Machine Learning, pages 535-542. Morgan Kaufmann, 2000.
- (2000) Proc. of the Int. Conf. on Machine Learning , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

15
- 4544226982
- Reinforcement learning for stochastic cooperative multi-agent systems
- M. Lauer and M. Riedmiller, Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems, In Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems, pages 1516-1517, 2004.
- (2004) Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems , pp. 1516-1517
- Lauer, M.¹ Riedmiller, M.²

16
- 0001547175
- Value-function reinforcement learning in Markov games
- PII S1389041701000158
- M. L. Littman, Value-function reinforcement learning in markov games, Journal of Cognitive Systems Research 2(1) (2001), 55-66. (Pubitemid 33718550)
- (2001) Cognitive Systems Research , vol.2 , Issue.1 , pp. 55-66
- Littman, M.L.¹

17
- 51349117828
- Hysteretic Q-learning: An algorithm for decentralized reinforcement learning in cooperative multi-agent teams
- San Diego, CA, USA, Oct. 29-Nov. 2
- L. Matignon, G. J. Laurent and N. Le Fort-Piat, Hysteretic Q-Learning: An Algorithm for Decentralized Reinforcement Learning in Cooperative Multi-Agent Teams, In Proc. of the IEEE Int. Conf. on Intelligent Robots and Systems, pages 64-69, San Diego, CA, USA, Oct. 29-Nov. 2 2007.
- (2007) Proc. of the IEEE Int. Conf. on Intelligent Robots and Systems , pp. 64-69
- Matignon, L.¹ Laurent, G.J.² Le Fort-Piat, N.³

18
- 77955659466
- Coordination of independent learners in cooperative markov games
- Institut FEMTO-ST/UFC-ENSMM-UTBMCNRS, Besançon, France, March
- L. Matignon, G. J. Laurent and N. Le Fort-Piat, Coordination of independent learners in cooperative markov games, Technical Report RR-2009-01, Institut FEMTO-ST/UFC-ENSMM-UTBMCNRS, Besançon, France, March 2009. http://hal.archives-ouvertes.fr/hal-00370889/fr/.
- (2009) Technical Report RR-2009-01
- Matignon, L.¹ Laurent, G.J.² Le Fort-Piat, N.³

19
- 77955654600
- Designing decentralized controllers for distributed-airjet mems-based micromanipulators by reinforcement learning
- L. Matignon, G. J. Laurent, N. L. Fort-Piat and Y.-A. Chapuis, Designing decentralized controllers for distributed-airjet mems-based micromanipulators by reinforcement learning, Journal of Intelligent and Robotic Systems 59(2) (2010), 145-166.
- (2010) Journal of Intelligent and Robotic Systems , vol.59 , Issue.2 , pp. 145-166
- Matignon, L.¹ Laurent, G.J.² Fort-Piat, N.L.³ Chapuis, Y.-A.⁴

20
- 38349032850
- Convergence of independent adaptive learners
- Springer-Verlag
- F. S. Melo and M. C. Lopes, Convergence of independent adaptive learners. In Progress in Artificial Intelligence: 13th Portuguese Conf. on Artificial Intelligence, Lecture Notes in Artificial Intelligence, volume 4874, pages 555-567. Springer-Verlag, 2007.
- (2007) Progress in Artificial Intelligence: 13th Portuguese Conf. on Artificial Intelligence, Lecture Notes in Artificial Intelligence , vol.4874 , pp. 555-567
- Melo, F.S.¹ Lopes, M.C.²

21
- 34247191514
- Learning to cooperate in multi-agent social dilemmas
- DOI 10.1145/1160633.1160770, Proceedings of the Fifth International Joint Conference on Autonomous Agents and Multiagent Systems
- J. E. Munoz, A. Lazaric and M. Restelli, Learning to Cooperate in Multi-Agent Social Dilemmas, In Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems, pages 783-785, 2006. (Pubitemid 46609551)
- (2006) Proceedings of the International Conference on Autonomous Agents , vol.2006 , pp. 783-785
- De Cote, E.M.¹ Lazaric, A.² Restelli, M.³ Bonarini, A.⁴

22
- 0003792179
- John Wiley and Sons
- J. Von Neumann and O. Morgenstern, Theory of Games and Economic Behavior. John Wiley and Sons, 1944.
- (1944) Theory of Games and Economic Behavior
- Von Neumann, J.¹ Morgenstern, O.²

23
- 0003427725
- MIT Press
- M. J. Osborne and A. Rubinstein, A Course in Game Theory. The MIT Press, 1994.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

24
- 26444601262
- Cooperative multi-agent learning: The state of the art
- DOI 10.1007/s10458-005-2631-2
- L. Panait and S. Luke, Cooperative multi-agent learning: The state of the art, Autonomous Agents and Multi-Agent Systems 11(3) (2005), 387-434. (Pubitemid 41425094)
- (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
- Panait, L.¹ Luke, S.²

25
- 41549123971
- Theoretical advantages of lenient learners: An evolutionary game theoretic perspective
- L. Panait, K. Tuyls and S. Luke, Theoretical advantages of lenient learners: An evolutionary game theoretic perspective, Journal of Machine Learning Research 9(2008), 423-457. (Pubitemid 351469016)
- (2008) Journal of Machine Learning Research , vol.9 , pp. 423-457
- Panait, L.¹ Tuyls, K.² Luke, S.³

26
- 34249045960
- Perspectives on multiagent learning
- T. Sandholm, Perspectives on multiagent learning, Artificial Intelligence 171(2007), 382-392.
- (2007) Artificial Intelligence , vol.171 , pp. 382-392
- Sandholm, T.¹

27
- 0028555752
- Learning to coordinate without sharing information
- AAAI
- I. Sen, M. Sekaran and J. Hale, Learning to Coordinate Without Sharing Information, In Proc. of the National Conference on Artificial Intelligence, pages 426-431. AAAI, 1994.
- (1994) Proc. of the National Conference on Artificial Intelligence , pp. 426-431
- Sen, I.¹ Sekaran, M.² Hale, J.³

28
- 0000392613
- Reprinted in Kuhn, 1997
- L. S. Shapley, Stochastic games, PNAS 39(1953), 1095-1100, Reprinted in (Kuhn, 1997).
- (1953) Stochastic Games, PNAS , vol.39 , pp. 1095-1100
- Shapley, L.S.¹

29
- 0004102479
- MIT Press, Cambridge
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, The MIT Press, Cambridge, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

30
- 85152198941
- Multiagent reinforcement learning: Independent vs. cooperative agents
- M. Tan, Multiagent Reinforcement Learning: Independent vs. Cooperative Agents, In Proc. of the Int. Conf. on Machine Learning, pages 330-337, 1993.
- (1993) Proc. of the Int. Conf. on Machine Learning , pp. 330-337
- Tan, M.¹

31
- 33847379922
- Reinforcement learning in autonomic computing: A manifesto and case studies
- DOI 10.1109/MIC.2007.21
- G. Tesauro, Reinforcement learning in autonomic computing: A manifesto and case studies, IEEE Internet Computing 11(2) (2007), 22-30. (Pubitemid 46335538)
- (2007) IEEE Internet Computing , vol.11 , Issue.1 , pp. 22-30
- Tesauro, G.¹

32
- 31344450384
- An evolutionary dynamical analysis of multi-agent learning in iterated games
- DOI 10.1007/s10458-005-3783-9
- K. Tuyls, P. Jan, T. Hoen and B. Vanschoenwinkel, An evolutionary dynamical analysis of multi-agent learning in iterated games, Autonomous Agents and Multi-Agent Systems 12(2006), 115-153. (Pubitemid 43146342)
- (2006) Autonomous Agents and Multi-Agent Systems , vol.12 , Issue.1 , pp. 115-153
- Tuyls, K.¹ T Hoen, P.J.² Vanschoenwinkel, B.³

33
- 34247642270
- Exploring selfish reinforcement learning in repeated games with stochastic rewards
- K. Verbeeck, A. Nowé, J. Parent and K. Tuyls, Exploring selfish reinforcement learning in repeated games with stochastic rewards, Proc of the Int Conf on Autonomous Agents and Multiagent Systems 14(3) (2007), 239-269.
- (2007) Proc of the Int Conf on Autonomous Agents and Multiagent Systems , vol.14 , Issue.3 , pp. 239-269
- Verbeeck, K.¹ Nowé, A.² Parent, J.³ Tuyls, K.⁴

34
- 58049194007
- Multi-agent reinforcement learning in stochastic single and multi-stage games
- K. Verbeeck, A. Nowé, M. Peeters and K. Tuyls, Multi-agent reinforcement learning in stochastic single and multi-stage games. Lecture Notes in Computer Science, Adaptive Agents and Multi-Agent Systems III 3394(2005), 275-294.
- (2005) Lecture Notes in Computer Science, Adaptive Agents and Multi-Agent Systems III , vol.3394 , pp. 275-294
- Verbeeck, K.¹ Nowé, A.² Peeters, M.³ Tuyls, K.⁴

35
- 80052081144
- A concise introduction to multiagent systems and distributed artificial intelligence
- N. Vlassis, A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence, In Ronald Brachman and Thomas Dietterich, editors, Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan & Claypool, 2007.
- Ronald Brachman and Thomas Dietterich, Editors, Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan & Claypool, 2007
- Vlassis, N.¹

36
- 43549119106
- A machine-learning approach to multi-robot coordination
- DOI 10.1016/j.engappai.2007.05.006, PII S0952197607000693
- Y. Wang and C. W. de Silva, A machine-learning approach to multi-robot coordination, Engineering Applications of Artificial Intelligence 21(3) (2008), 470-484. (Pubitemid 351680683)
- (2008) Engineering Applications of Artificial Intelligence , vol.21 , Issue.3 , pp. 470-484
- Wang, Y.¹ De Silva, C.W.²

37
- 34249833101
- Technical note: Q-learning
- C. J. C. H. Watkins and P. Dayan, Technical note: Q-learning, Machine Learning 8(1992), 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

38
- 4544231144
- Best-response multiagent learning in non-stationary environments
- M. Weinberg and J. Rosenschein, Best-Response Multiagent Learning in Non-Stationary Environments, In Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems, pages 506-513, 2004.
- (2004) Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems , pp. 506-513
- Weinberg, M.¹ Rosenschein, J.²

39
- 33746826183
- Multiagent reinforcement learning for multirobot systems: A survey
- Department of Computer Science, University of Essex
- E. Yang and D. Gu, Multiagent reinforcement learning for multirobot systems: A survey. Technical report, Department of Computer Science, University of Essex, 2004.
- (2004) Technical Report
- Yang, E.¹ Gu, D.²

40
- 34249001282
- The possible and the impossible in multi-agent learning
- DOI 10.1016/j.artint.2006.10.015, PII S0004370207000367, Foundations of Multi-Agent Learning
- H. P. Young, The possible and the impossible in multi-agent learning, Artificial Intelligence 171(2007), 429-433. (Pubitemid 46802420)
- (2007) Artificial Intelligence , vol.171 , Issue.7 , pp. 429-433
- Young, H.P.¹

41
- 80052092844
- Multiagent reinforcement learning for a planetary exploration multirobot system
- Z. Zheng, M. Shu-gen, C. Bing-gang, Z. Li-Ping and L. Bin, Multiagent Reinforcement Learning for a Planetary Exploration Multirobot System, In Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems, pages 339-350, 2006.
- (2006) Proc. of the Int. Conf. on Autonomous Agents and Multiagent Systems , pp. 339-350
- Zheng, Z.¹ Shu-gen, M.² Bing-gang, C.³ Li-Ping, Z.⁴ Bin, L.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.