SCOPUS Information Retrieval Platform

Knowledge Engineering Review

Volume 27, Issue 1, 2012, Pages 1-31

Independent reinforcement learners in cooperative Markov games: A survey regarding coordination problems

(3) Matignon, Laetitia ᵃ; Laurent, Guillaume J ᵃ; Le Fort Piat, Nadine ᵃ

ᵃ FEMTO ST INSTITUTE (France)

Author keywords

[No Author keywords available]

Indexed keywords

COORDINATION PROBLEMS; DISTRIBUTED Q-LEARNING; FREQUENCY MAXIMA; HILL CLIMBING; INDEPENDENT AGENTS; MARKOV GAMES; MATRIX GAME; MULTI AGENT SYSTEM (MAS); MULTI STATE; MULTI-AGENT APPLICATIONS; NON-STATIONARITIES; PURSUIT DOMAIN; Q-LEARNING; Q-VALUES; STOCHASTICITY;

MULTI AGENT SYSTEMS; REINFORCEMENT;

LEARNING ALGORITHMS;

EID: 84857861863 PISSN: 02698889 EISSN: 14698005 Source Type: Journal
DOI: 10.1017/S0269888912000057 Document Type: Review

Times cited : (500)

References (62)

1
- 70350699723
- A multiagent reinforcement learning algorithm with non-linear dynamics
- Abdallah, S. & Lesser, V. 2008. A multiagent reinforcement learning algorithm with non-linear dynamics. Journal of Artificial Intelligence Research 33, 521-549.
- (2008) Journal of Artificial Intelligence Research , vol.33 , pp. 521-549
- Abdallah, S.¹ Lesser, V.²

2
- 33646016529
- Multi-agent reward analysis for learning in noisy domains
- ACM
- Agogino, A. & Turner, K. 2005. Multi-agent reward analysis for learning in noisy domains. In Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS'05, 81-88. ACM.
- (2005) Proceedings of the 4th International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS'05 , pp. 81-88
- Agogino, A.¹ Turner, K.²

3
- 58149280068
- Multi-agent reinforcement learning in common interest and fixed sum stochastic games: An experimental study
- Bab, A. & Brafman, R. I. 2008. Multi-agent reinforcement learning in common interest and fixed sum stochastic games: An experimental study. Journal of Machine Learning Research 9, 2635-2675.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 2635-2675
- Bab, A.¹ Brafman, R.I.²

4
- 0028745178
- Communication in reactive multiagent robotic systems
- Balch, T. & Arkin, R. C. 1994. Communication in reactive multiagent robotic systems. Autonomous Robots 1(1), 27-52.
- (1994) Autonomous Robots , vol.1 , Issue.1 , pp. 27-52
- Balch, T.¹ Arkin, R.C.²

5
- 1142280919
- Adaptive policy gradient in multiagent learning
- ACM
- Banerjee, B. & Peng, J. 2003. Adaptive policy gradient in multiagent learning. In AAMAS '03: Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multiagent Systems, 686-692. ACM.
- (2003) AAMAS '03: Proceedings of the 2nd International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 686-692
- Banerjee, B.¹ Peng, J.²

6
- 9144256373
- On-policy concurrent reinforcement learning
- Banerjee, B., Sen, S. & Peng, J. 2004. On-policy concurrent reinforcement learning. Journal of Experimental & Theoretical Artificial Intelligence 16(4), 245-260.
- (2004) Journal of Experimental & Theoretical Artificial Intelligence , vol.16 , Issue.4 , pp. 245-260
- Banerjee, B.¹ Sen, S.² Peng, J.³

7
- 0003529066
- Boeing Advanced Technology Center, Boeing Computing Services
- Benda, M., Jagannathan, V. & Dodhiawala, R. 1986. On Optimal Cooperation of Knowledge Sources - an Experimental Investigation. Technical report BCS-G2010-280, Boeing Advanced Technology Center, Boeing Computing Services.
- (1986) On Optimal Cooperation of Knowledge Sources - An Experimental Investigation. Technical report BCS-G2010-280
- Benda, M.¹ Jagannathan, V.² Dodhiawala, R.³

8
- 0002500351
- Planning, learning and coordination in multiagent decision processes
- Morgan Kaufmann Publishers Inc.
- Boutilier, C. 1996. Planning, learning and coordination in multiagent decision processes. In Theoretical Aspects of Rationality and Knowledge, Morgan Kaufmann Publishers Inc., 195-201.
- (1996) Theoretical Aspects of Rationality and Knowledge , pp. 195-201
- Boutilier, C.¹

9
- 84880690163
- Sequential optimality and coordination in multiagent systems
- Morgan Kaufmann Publishers Inc.
- Boutilier, C. 1999. Sequential optimality and coordination in multiagent systems. In IJCAI, Morgan Kaufmann Publishers Inc., 478-485.
- (1999) IJCAI , pp. 478-485
- Boutilier, C.¹

10
- 84899027977
- Convergence and no-regret in multiagent learning
- Saul, L. K., Weiss, Y. & Bottou, L. (eds). MIT Press
- Bowling, M. 2005. Convergence and no-regret in multiagent learning. In Advances in Neural Information Processing Systems, Saul, L. K., Weiss, Y. & Bottou, L. (eds). MIT Press, 209-216.
- (2005) Advances in Neural Information Processing Systems , pp. 209-216
- Bowling, M.¹

11
- 0003863106
- Technical report, Computer Science Department, Carnegie Mellon University
- Bowling, M. & Veloso, M. 2000. An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning. Technical report, Computer Science Department, Carnegie Mellon University.
- (2000) An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning
- Bowling, M.¹ Veloso, M.²

12
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M. & Veloso, M. 2002. Multiagent learning using a variable learning rate. Artificial Intelligence 136, 215-250.
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

13
- 4644369644
- Learning to coordinate efficiently: A model-based approach
- Brafman, R. I. & Tennenholtz, M. 2003. Learning to coordinate efficiently: A model-based approach. Journal of Artificial Intelligence Research 19, 11-23.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 11-23
- Brafman, R.I.¹ Tennenholtz, M.²

14
- 34547223380
- Decentralized reinforcement learning control of a robotic manipulator
- Singapore
- Busoniu, L., Babuska, R. & De Schutter, B. 2006. Decentralized reinforcement learning control of a robotic manipulator. In Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision (ICARCV 2006), 1347-1352. Singapore.
- (2006) Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision (ICARCV 2006) , pp. 1347-1352
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

15
- 40949147745
- A comprehensive survey of multiagent reinforcement learning
- Busoniu, L., Babuska, R. & De Schutter, B. 2008. A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 38(2), 156-172.
- (2008) IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

16
- 77649261098
- Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems
- Springer, Lecture Notes in Computer Science
- Carpenter, M. & Kudenko, D. 2005. Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems. In Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning, Lecture Notes in Computer Science, 3394, 55-72. Springer.
- (2005) Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning , vol.3394 , pp. 55-72
- Carpenter, M.¹ Kudenko, D.²

17
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- American Association for Artificial Intelligence
- Claus, C. & Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence, 746-752, American Association for Artificial Intelligence.
- (1998) Proceedings of the 15th National Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

18
- 33750270145
- Building autonomic systems using collaborative reinforcement learning
- Dowling, J., Cunningham, R., Curran, E. & Cahill, V. 2006. Building autonomic systems using collaborative reinforcement learning. Knowledge Engineering Review 21(3), 231-238.
- (2006) Knowledge Engineering Review , vol.21 , Issue.3 , pp. 231-238
- Dowling, J.¹ Cunningham, R.² Curran, E.³ Cahill, V.⁴

19
- 84880861539
- Predicting and preventing coordination problems in cooperative q-learning systems
- Morgan Kaufmann Publishers Inc
- Fulda, N. & Ventura, D. 2007. Predicting and preventing coordination problems in cooperative q-learning systems. In Proceedings of the International Joint Conference on Artificial Intelligence. Morgan Kaufmann Publishers Inc.
- (2007) Proceedings of the International Joint Conference on Artificial Intelligence
- Fulda, N.¹ Ventura, D.²

20
- 33751020264
- Multi-agent case-based reasoning for cooperative reinforcement learners
- Springer
- Gabel, T. & Riedmiller, M. 2006. Multi-agent case-based reasoning for cooperative reinforcement learners. In Proceedings of the ECCBR, 32-46. Springer.
- (2006) Proceedings of the ECCBR , pp. 32-46
- Gabel, T.¹ Riedmiller, M.²

21
- 71149097863
- Dynamic analysis of multiagent Q-learning with ε-greedy exploration
- ACM
- Gomes, E. R. & Kowalczyk, R. 2009. Dynamic analysis of multiagent Q-learning with ε-greedy exploration. In ICML'09: Proceedings of the 26th International Conference on Machine Learning, 47. ACM.
- (2009) ICML'09: Proceedings of the 26th International Conference on Machine Learning , vol.47
- Gomes, E.R.¹ Kowalczyk, R.²

22
- 4644369748
- Nash q-learning for general-sum stochastic games
- Hu, J. & Wellman, M. P. 2003. Nash q-learning for general-sum stochastic games. Journal of Machine Learning Research 4, 1039-1069.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.P.²

23
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. & Moore, A. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.² Moore, A.³

24
- 0036932299
- Reinforcement learning of coordination in cooperative multi-agent systems
- Dechter, R., Kearns, M. & Sutton, R. (eds.). Edmonton, Alberta, Canada
- Kapetanakis, S. & Kudenko, D. 2002. Reinforcement learning of coordination in cooperative multi-agent systems. In Proceedings of the 9th NCAI, Dechter, R., Kearns, M. & Sutton, R. (eds.). Edmonton, Alberta, Canada.
- (2002) Proceedings of the 9th NCAI
- Kapetanakis, S.¹ Kudenko, D.²

25
- 4544251885
- Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems
- IEEE Computer Society
- Kapetanakis, S. & Kudenko, D. 2004. Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems. In AAMAS '04: Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems, 1258-1259. IEEE Computer Society.
- (2004) AAMAS '04: Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 1258-1259
- Kapetanakis, S.¹ Kudenko, D.²

26
- 33645306139
- Learning to coordinate using commitment sequences in cooperative multi-agent systems
- Springer, Lecture Notes in Computer Science
- Kapetanakis, S., Kudenko, D. & Strens, M. J. A. 2005. Learning to coordinate using commitment sequences in cooperative multi-agent systems. In Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning, Lecture Notes in Computer Science, 106-118. Springer.
- (2005) Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning , pp. 106-118
- Kapetanakis, S.¹ Kudenko, D.² Strens, M.J.A.³

27
- 56049125779
- Multiagent reinforcement learning for urban traffic control using coordination graphs
- Lecture Notes in Computer Science, Springer
- Kuyer, L., Whiteson, S., Bakker, B. & Vlassis, N. 2008. Multiagent reinforcement learning for urban traffic control using coordination graphs. In ECML PKDD '08: Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I, Lecture Notes in Computer Science, 5211, 656-671. Springer.
- (2008) ECML PKDD '08: Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I , vol.5211 , pp. 656-671
- Kuyer, L.¹ Whiteson, S.² Bakker, B.³ Vlassis, N.⁴

28
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multiagent systems
- Morgan Kaufmann
- Lauer, M. & Riedmiller, M. 2000. An algorithm for distributed reinforcement learning in cooperative multiagent systems. In Proceedings of the 17th International Conference on Machine Learning, 535-542. Morgan Kaufmann.
- (2000) Proceedings of the 17th International Conference on Machine Learning , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

29
- 4544226982
- Reinforcement learning for stochastic cooperative multi-agent systems
- Lauer, M. & Riedmiller, M. 2004. Reinforcement learning for stochastic cooperative multi-agent systems. Autonomous Agents and Multi-Agent Systems 03, 1516-1517.
- (2004) Autonomous Agents and Multi-Agent Systems , vol.3 , pp. 1516-1517
- Lauer, M.¹ Riedmiller, M.²

30
- 80052079894
- The world of independent learners is not Markovian
- IOS Press
- Laurent, G. J., Matignon, L. & Le Fort-Piat, N. 2010. The world of independent learners is not Markovian. Innovation in Knowledge-Based & Intelligent Engineering Systems 15, IOS Press.
- (2010) Innovation in Knowledge-Based & Intelligent Engineering Systems , vol.15
- Laurent, G.J.¹ Matignon, L.² Le Fort-Piat, N.³

31
- 0001547175
- Value-function reinforcement learning in Markov games
- Littman, M. 2001. Value-function reinforcement learning in Markov games. Journal of Cognitive Systems Research 2, 55-66.
- (2001) Journal of Cognitive Systems Research , vol.2 , pp. 55-66
- Littman, M.¹

32
- 0035410806
- Distributed manipulation using discrete actuator arrays
- Luntz, J. E., Messner, W. & Choset, H. 2001. Distributed manipulation using discrete actuator arrays. The International Journal of Robotics Research 20(7), 553-583.
- (2001) The International Journal of Robotics Research , vol.20 , Issue.7 , pp. 553-583
- Luntz, J.E.¹ Messner, W.² Choset, H.³

33
- 0032335478
- Using communication to reduce locality in distributed multiagent learning
- Mataric, M. J. 1998. Using communication to reduce locality in distributed multiagent learning. Journal of Experimental & Theoretical Artificial Intelligence 10(3), 357-369.
- (1998) Journal of Experimental & Theoretical Artificial Intelligence , vol.10 , Issue.3 , pp. 357-369
- Mataric, M.J.¹

34
- 33749869379
- Reward function and initial values: Better choices for accelerated goal-directed reinforcement learning
- Springer
- Matignon, L., Laurent, G. J. & Le Fort-Piat, N. 2006. Reward function and initial values: better choices for accelerated goal-directed reinforcement learning. In Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN'06), Lecture Notes in Computer Science, 4131, 840-849. Springer.
- (2006) Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN'06), Lecture Notes in Computer Science , vol.4131 , pp. 840-849
- Matignon, L.¹ Laurent, G.J.² Le Fort-Piat, N.³

35
- 51349117828
- Hysteretic q-learning: An algorithm for decentralized reinforcement learning in cooperative multi-agent teams
- Matignon, L., Laurent, G. J. & Le Fort-Piat, N. 2007. Hysteretic q-learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams. In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems IROS 2007, 64-69.
- (2007) Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems IROS 2007 , pp. 64-69
- Matignon, L.¹ Laurent, G.J.² Le Fort-Piat, N.³

36
- 84858041504
- A study of FMQ heuristic in cooperative multiagent games
- Estoril, Portugal
- Matignon, L., Laurent, G. J. & Le Fort-Piat, N. 2008. A study of FMQ heuristic in cooperative multiagent games. In Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems. Workshop 10: Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domains (AAMAS 08), Estoril, Portugal.
- (2008) Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems. Workshop 10: Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domains (AAMAS 08)
- Matignon, L.¹ Laurent, G.J.² Le Fort-Piat, N.³

37
- 77955654600
- Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning
- Matignon, L., Laurent, G. J., Le Fort-Piat, N. & Chapuis, Y. A. 2010. Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning. Journal of Intelligent and Robotic Systems 59(2), 145-166.
- (2010) Journal of Intelligent and Robotic Systems , vol.59 , Issue.2 , pp. 145-166
- Matignon, L.¹ Laurent, G.J.² Le Fort-Piat, N.³ Chapuis, Y.A.⁴

38
- 70349595296
- Learning to cooperate in multi-agent systems by combining q-learning and evolutionary strategy
- McGlohon, M. & Sen, S. 2005. Learning to cooperate in multi-agent systems by combining q-learning and evolutionary strategy. International Journal on Lateral Computing 1(2), 58-64.
- (2005) International Journal on Lateral Computing , vol.1 , Issue.2 , pp. 58-64
- McGlohon, M.¹ Sen, S.²

39
- 38349032850
- Convergence of independent adaptive learners
- Springer-Verlag
- Melo, F. S. & Lopes, M. C. 2007. Convergence of independent adaptive learners. In Progress in Artificial Intelligence: 13th Portuguese Conference on Artificial Intelligence, Lecture Notes in Artificial Intelligence, 4874, 555-567. Springer-Verlag.
- (2007) Progress in Artificial Intelligence: 13th Portuguese Conference on Artificial Intelligence, Lecture Notes in Artificial Intelligence , vol.4874 , pp. 555-567
- Melo, F.S.¹ Lopes, M.C.²

40
- 0002021736
- Equilibrium points in n-person games
- Nash, J. F. 1950. Equilibrium points in n-person games. In Proceedings of the National Academy of Sciences of the United States of America 36, 48-49.
- (1950) Proceedings of the National Academy of Sciences of the United States of America , vol.36 , pp. 48-49
- Nash, J.F.¹

41
- 0003427725
- MIT Press
- Osborne, M. J. & Rubinstein, A. 1994. A Course in Game Theory. MIT Press.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

42
- 34247189655
- Lenient learners in cooperative multiagent systems
- ACM Press
- Panait, L., Sullivan, K. & Luke, S. 2006. Lenient learners in cooperative multiagent systems. In AAMAS '06: Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, 801-803. ACM Press.
- (2006) AAMAS '06: Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 801-803
- Panait, L.¹ Sullivan, K.² Luke, S.³

43
- 41549123971
- Theoretical advantages of lenient learners: An evolutionary game theoretic perspective
- Panait, L., Tuyls, K. & Luke, S. 2008. Theoretical advantages of lenient learners: An evolutionary game theoretic perspective. Journal of Machine Learning Research 9, 423-457.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 423-457
- Panait, L.¹ Tuyls, K.² Luke, S.³

44
- 0012646255
- Learning to cooperate via policy search
- Morgan Kaufmann
- Peshkin, L., Kim, K.-E., Meuleau, N. & Kaelbling, L. P. 2000. Learning to cooperate via policy search. In 16th Conference on Uncertainty in Artificial Intelligence, 307-314. Morgan Kaufmann.
- (2000) 16th Conference on Uncertainty in Artificial Intelligence , pp. 307-314
- Peshkin, L.¹ Kim, K.-E.² Meuleau, N.³ Kaelbling, L.P.⁴

45
- 0032359707
- Individual learning of coordination knowledge
- Sen, S. & Sekaran, M. 1998. Individual learning of coordination knowledge. Journal of Experimental & Theoretical Artificial Intelligence 10(3), 333-356.
- (1998) Journal of Experimental & Theoretical Artificial Intelligence , vol.10 , Issue.3 , pp. 333-356
- Sen, S.¹ Sekaran, M.²

46
- 0028555752
- Learning to coordinate without sharing information
- Seattle, WA
- Sen, S., Sekaran, M. & Hale, J. 1994. Learning to coordinate without sharing information. In Proceedings of the 12th National Conference on Artificial Intelligence, 426-431, Seattle, WA.
- (1994) Proceedings of the 12th National Conference on Artificial Intelligence , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

47
- 0000392613
- Stochastic games
- Shapley, L. 1953. Stochastic games. Proceedings of the National Academy of Sciences of the United States of America 39, 1095-1100.
- (1953) Proceedings of the National Academy of Sciences of the United States of America , vol.39 , pp. 1095-1100
- Shapley, L.¹

48
- 0033901602
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Singh, S. P., Jaakkola, T., Littman, M. L. & Szepesvari, C. 2000. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning 38(3), 287-308.
- (2000) Machine Learning , vol.38 , Issue.3 , pp. 287-308
- Singh, S.P.¹ Jaakkola, T.² Littman, M.L.³ Szepesvari, C.⁴

49
- 0034205975
- Multiagent systems: A survey from a machine learning perspective
- Stone, P. & Veloso, M. M. 2000. Multiagent systems: A survey from a machine learning perspective. Autonomous Robots 8(3), 345-383.
- (2000) Autonomous Robots , vol.8 , Issue.3 , pp. 345-383
- Stone, P.¹ Veloso, M.M.²

50
- 0004102479
- The MIT Press
- Sutton, R. S. & Barto, A. G. 1998. Reinforcement Learning: An Introduction. The MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

51
- 85152198941
- Multiagent reinforcement learning: Independent vs. cooperative agents
- Morgan Kaufmann
- Tan, M. 1993. Multiagent reinforcement learning: independent vs. cooperative agents. In Proceedings of the 10th International Conference on Machine Learning, 330-337. Morgan Kaufmann.
- (1993) Proceedings of the 10th International Conference on Machine Learning , pp. 330-337
- Tan, M.¹

52
- 84855309978
- A multiagent approach to managing air traffic flow
- Tumer, K. & Agogino, A. K. 2010. A multiagent approach to managing air traffic flow. Journal of Autonomous Agents and Multi-Agent Systems 24, 1-25.
- (2010) Journal of Autonomous Agents and Multi-Agent Systems , vol.24 , pp. 1-25
- Tumer, K.¹ Agogino, A.K.²

53
- 34548072657
- Distributed agent-based air traffic flow management
- ACM
- Tumer, K. & Agogino, A. 2007. Distributed agent-based air traffic flow management. In AAMAS '07: Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems, 1-8. ACM.
- (2007) AAMAS '07: Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 1-8
- Tumer, K.¹ Agogino, A.²

54
- 28544446213
- Evolutionary game theory and multi-agent reinforcement learning
- Tuyls, K. & Nowé, A. 2005. Evolutionary game theory and multi-agent reinforcement learning. Knowledge Engineering Review 20(1), 63-90.
- (2005) Knowledge Engineering Review , vol.20 , Issue.1 , pp. 63-90
- Tuyls, K.¹ Nowé, A.²

55
- 34247642270
- Exploring selfish reinforcement learning in repeated games with stochastic rewards
- Verbeeck, K., Nowé, A., Parent, J. & Tuyls, K. 2007. Exploring selfish reinforcement learning in repeated games with stochastic rewards. Autonomous Agents and Multi-Agent Systems 14(3), 239-269.
- (2007) Autonomous Agents and Multi-Agent Systems , vol.14 , Issue.3 , pp. 239-269
- Verbeeck, K.¹ Nowé, A.² Parent, J.³ Tuyls, K.⁴

56
- 34250651573
- Multi-robot box-pushing: Single-agent q-learning vs. team q-learning
- Wang, Y. & de Silva, C. W. 2006. Multi-robot box-pushing: single-agent q-learning vs. team q-learning. In Proceedings of the IROS, 3694-3699.
- (2006) Proceedings of the IROS , pp. 3694-3699
- Wang, Y.¹ De Silva, C.W.²

57
- 43549119106
- A machine-learning approach to multi-robot coordination
- Wang, Y. & de Silva, C. W. 2008. A machine-learning approach to multi-robot coordination. Engineering Applications of Artificial Intelligence 21(3), 470-484.
- (2008) Engineering Applications of Artificial Intelligence , vol.21 , Issue.3 , pp. 470-484
- Wang, Y.¹ De Silva, C.W.²

58
- 34249833101
- Technical note: Q-learning
- Watkins, C. & Dayan, P. 1992. Technical note: Q-learning. Machine Learning 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

59
- 0004320981
- Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center
- Wolpert, D. H. & Tumer, K. 1999. An Introduction to Collective Intelligence. Technical Report NASA-ARC-IC-99-63, NASA Ames Research Center.
- (1999) An Introduction to Collective Intelligence. Technical Report NASA-ARC-IC-99-63
- Wolpert, D.H.¹ Tumer, K.²

60
- 0001309161
- Optimal payoff functions for members of collectives
- Wolpert, D. H. & Tumer, K. 2001. Optimal payoff functions for members of collectives. Advances in Complex Systems 04(02), 265-279.
- (2001) Advances in Complex Systems , vol.4 , Issue.2 , pp. 265-279
- Wolpert, D.H.¹ Tumer, K.²

61
- 77956556543
- Classes of multiagent q-learning dynamics with epsilon-greedy exploration
- Omni Press
- Wunder, M., Littman, M. L. & Babes, M. 2010. Classes of multiagent q-learning dynamics with epsilon-greedy exploration. In ICML'10: Proceedings of the 27th International Conference on Machine Learning, 1167-1174. Omni Press.
- (2010) ICML'10: Proceedings of the 27th International Conference on Machine Learning , pp. 1167-1174
- Wunder, M.¹ Littman, M.L.² Babes, M.³

62
- 33746826183
- Department of Computer Science. University of Essex
- Yang, E. & Gu, D. 2004. Multiagent Reinforcement Learning for Multi-Robot Systems: A Survey. Department of Computer Science, University of Essex.
- (2004) Multiagent Reinforcement Learning for Multi-Robot Systems: A Survey
- Yang, E.¹ Gu, D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.