SCOPUS 정보 검색 플랫폼

Advances in Complex Systems

Volumn 12, Issue 4-5, 2009, Pages 455-473

Learning from actions not taken in multiagent systems

(2) Tumer, Kagan a Khani, Newsha a

a OREGON STATE UNIVERSITY (United States)

Author keywords

Counterfactual reward; Difference reward; Multiagent learning

Indexed keywords

EID: 70349592320 PISSN: 02195259 EISSN: None Source Type: Journal
DOI: 10.1142/s0219525909002301 Document Type: Article

Times cited : (11)

References (45)

1
- 33646001120
- Handling communication restrictions and team formation in congestion games
- Agogino, A. K. and Tumer, K., Handling communication restrictions and team formation in congestion games, J. Auton. Agents Multi Agent Syst. 13 (2006) 97-115.
- (2006) J. Auton. Agents Multi Agent Syst. , vol.13 , pp. 97-115
- Agogino, A.K.¹ Tumer, K.²

2
- 51649111408
- Analyzing and visualizing multiagent rewards in dynamic and stochastic environments
- Agogino, A. K. and Tumer, K., Analyzing and visualizing multiagent rewards in dynamic and stochastic environments, J. Auton. Agents Multi Agent Syst. 17 (2008) 320-338.
- (2008) J. Auton. Agents Multi Agent Syst. , vol.17 , pp. 320-338
- Agogino, A.K.¹ Tumer, K.²

3
- 47749108686
- Efficient evaluation functions for evolving coordination
- Agogino, A. K. and Tumer, K., Efficient evaluation functions for evolving coordination, Evol. Comput. 16 (2008) 257-288.
- (2008) Evol. Comput. , vol.16 , pp. 257-288
- Agogino, A.K.¹ Tumer, K.²

4
- 84962053074
- Multi-agent reinforcement learning for planning and scheduling multiple goals
- Arai, S., Sycara, K. and Payne, T., Multi-agent reinforcement learning for planning and scheduling multiple goals, in Proc. Fourth Int. Conf. on Multiagent Syst. (2000), pp. 359-360.
- (2000) Proc. Fourth Int. Conf. on Multiagent Syst. , pp. 359-360
- Arai, S.¹ Sycara, K.² Payne, T.³

5
- 0002135590
- Complexity in economic theory: Inductive reasoning and bounded rationality
- Arthur, W. B., Complexity in economic theory: Inductive reasoning and bounded rationality, Am. Econ. Rev. 84 (1994) 406-411.
- (1994) Am. Econ. Rev. , vol.84 , pp. 406-411
- Arthur, W.B.¹

6
- 1142280924
- Coordination in multiagent reinforcement learning: A bayesian approach
- Melbourne, Australia
- Chalkiadakis, G. and Boutilier, C., Coordination in multiagent reinforcement learning: A bayesian approach, in Proc. Second Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS-03) (Melbourne, Australia, 2003).
- (2003) Proc. Second Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS-03)
- Chalkiadakis, G.¹ Boutilier, C.²

7
- 0032138856
- On the minority game: Analytical and numerical studies
- Challet, D. and Zhang, Y. C., On the minority game: Analytical and numerical studies, Physica A 256 (1998) 514.
- (1998) Physica A , vol.256 , pp. 514
- Challet, D.¹ Zhang, Y.C.²

8
- 0031630561
- The dynamics of reinforcement learning cooperative multiagent systems
- Madison, WI
- Claus, C. and Boutilier, C., The dynamics of reinforcement learning cooperative multiagent systems, in Proc. Fifteenth National Conf. on Artificial Intelligence (Madison, WI, 1998), pp. 746-752.
- (1998) Proc. Fifteenth National Conf. on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

9
- 0002825556
- Competition, efficiency and collective behavior in the "el Farol" bar model
- de Cara, M. A. R., Pla, O. and Guinea, F., Competition, efficiency and collective behavior in the "El Farol" bar model, Eur. Phys. J. B 10 (1999) 187.
- (1999) Eur. Phys. J. , vol.B10 , pp. 187
- De Cara, M.A.R.¹ Pla, O.² Guinea, F.³

10
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich, T. G., Hierarchical reinforcement learning with the MAXQ value function decomposition, J. Artif. Intell. 13 (2000) 227-303.
- (2000) J. Artif. Intell. , vol.13 , pp. 227-303
- Dietterich, T.G.¹

11
- 4544236179
- Coordinated reinforcement learning
- Guestrin, C., Lagoudakis, M. and Parr, R., Coordinated reinforcement learning, in Proc. 19th Int. Conf. on Machine Learning (2002).
- (2002) Proc. 19th Int. Conf. on Machine Learning
- Guestrin, C.¹ Lagoudakis, M.² Parr, R.³

12
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu, J. andWellman, M. P., Multiagent reinforcement learning: Theoretical framework and an algorithm, in Proc. Fifteenth Int. Conf. on Machine Learning (1998), pp. 242- 250.
- (1998) Proc. Fifteenth Int. Conf. on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

13
- 0031706903
- Online learning about other agents in a dynamic multiagent system, in Proc.
- Hu, J. and Wellman, M. P., Online learning about other agents in a dynamic multiagent system, in Proc. Second Int. Conf. on Autonomous Agents (1998), pp. 239-246.
- (1998) Second Int. Conf. on Autonomous Agents , pp. 239-246
- Hu, J.¹ Wellman, M.P.²

14
- 41349093315
- Deterministic dynamics in the minority game
- Jefferies, P., Hart, M. L. and Johnson, N. F., Deterministic dynamics in the minority game, Phys. Rev. E 65(016105) (2002).
- (2002) Phys. Rev. e , vol.65 , pp. 016105
- Jefferies, P.¹ Hart, M.L.² Johnson, N.F.³

15
- 0032329151
- A roadmap of agent research and development
- Jennings, N. R., Sycara, K. and Wooldridge, M., A roadmap of agent research and development, Auton. Agents Multi-Agent Syst. 1 (1998) 7-38.
- (1998) Auton. Agents Multi-Agent Syst. , vol.1 , pp. 7-38
- Jennings, N.R.¹ Sycara, K.² Wooldridge, M.³

16
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M. L. and Moore, A.W., Reinforcement learning: A survey, J. Artif. Intell. Res. 4 (1996) 237-285.
- (1996) J. Artif. Intell. Res. , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

17
- 70349594910
- Fast multiagent learning: Cashing in on team knowledge
- ASME, St. Louis
- Khani, N. and Tumer, K., Fast multiagent learning: Cashing in on team knowledge, in Artificial Neural Networks in Engineering (ASME, St. Louis, 2008), pp. 3-10.
- (2008) Artificial Neural Networks in Engineering , pp. 3-10
- Khani, N.¹ Tumer, K.²

18
- 34250202829
- Klügl, F. Bazzan, A. and Ossowski, S. (eds.) Springer
- Klügl, F., Bazzan, A. and Ossowski, S. (eds.), Applications of Agent Technology in Traffic and Transportation (Springer, 2005).
- (2005) Applications of Agent Technology in Traffic and Transportation

19
- 0002218603
- Coordination and learning in multi-robot systems
- Mataric, M. J., Coordination and learning in multi-robot systems, in IEEE Intelligent Systems (1998), pp. 6-8.
- (1998) IEEE Intelligent Systems , pp. 6-8
- Mataric, M.J.¹

20
- 70349595296
- Learning to cooperate in multi-agent systems by combining Q-learning and evolutionary strategy
- McGlohon, M. and Sen, S., Learning to cooperate in multi-agent systems by combining Q-learning and evolutionary strategy, Int. J. Lateral Comput. 1 (2005) 58-64.
- (2005) Int. J. Lateral Comput. , vol.1 , pp. 58-64
- McGlohon, M.¹ Sen, S.²

21
- 0003891507
- Prentice Hall
- Narendra, K. S. and Thathachar, M. A. L., Learning Automata: An Introduction (Prentice Hall, 1989).
- (1989) Learning Automata: An Introduction
- Narendra, K.S.¹ Thathachar, M.A.L.²

22
- 41549123971
- Theoretical advantages of lenient learners: An evolutionary game theoretic perspective
- Panait, L., Tuyls, K. and Luke, S., Theoretical advantages of lenient learners: An evolutionary game theoretic perspective, J. Mach. Learn. Res. 9 (2008) 423-457.
- (2008) J. Mach. Learn. Res. , vol.9 , pp. 423-457
- Panait, L.¹ Tuyls, K.² Luke, S.³

23
- 70349596643
- On learnable mechanism design
- Springer
- Parkes, D., On learnable mechanism design, in Collectives and the Design of Complex Systems (Springer, 2004).
- (2004) Collectives and the Design of Complex Systems
- Parkes, D.¹

24
- 1142292938
- The communicative multiagent team decision problem: Analyzing teamwork theories and models
- Pynadath, D. and Tambe, M., The communicative multiagent team decision problem: Analyzing teamwork theories and models, J. Artif. Intell. Res. 16 (2002) 389-423.
- (2002) J. Artif. Intell. Res. , vol.16 , pp. 389-423
- Pynadath, D.¹ Tambe, M.²

25
- 33744792672
- Three automated stock-trading agents: A comparative study
- Lecture Notes in Artificial Intelligence Springer Verlag, Berlin
- Sherstov, A. and Stone, P., Three automated stock-trading agents: A comparative study, in Agent Mediated Electronic Commerce VI: Theories for and Engineering of Distributed Mechanisms and Systems (AMEC 2004), Lecture Notes in Artificial Intelligence (Springer Verlag, Berlin, 2005), pp. 173-187.
- (2005) Agent Mediated Electronic Commerce VI: Theories for and Engineering of Distributed Mechanisms and Systems (AMEC 2004) , pp. 173-187
- Sherstov, A.¹ Stone, P.²

26
- 0003401114
- MIT Press, Cambridge, MA
- Stone, P., Layered Learning in Multi-Agent Systems: A Winning Approach to Robotic Soccer (MIT Press, Cambridge, MA, 2000).
- (2000) Layered Learning in Multi-Agent Systems: A Winning Approach to Robotic Soccer
- Stone, P.¹

27
- 27544506565
- Reinforcement learning for RoboCupsoccer keepaway
- Stone, P., Sutton, R. S. and Kuhlmann, G., Reinforcement learning for RoboCupsoccer keepaway, Adapt. Behav. (2005).
- (2005) Adapt. Behav.
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

28
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R. S. and Barto, A. G., Reinforcement Learning: An Introduction (MIT Press, Cambridge, MA, 1998).
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

29
- 0001936250
- Towards flexible teamwork
- Tambe, M., Towards flexible teamwork, J. Artif. Intell. Res. 7 (1997) 83-124.
- (1997) J. Artif. Intell. Res. , vol.7 , pp. 83-124
- Tambe, M.¹

30
- 33750259111
- Comparing evolutionary and temporal difference methods for reinforcement learning
- Seattle, WA
- Taylor, M. E., Whiteson, S. and Stone, P., Comparing evolutionary and temporal difference methods for reinforcement learning, in Proc. Genetic and Evolutionary Computation Conf. (Seattle, WA, 2006), pp. 1321-1328.
- (2006) Proc. Genetic and Evolutionary Computation Conf. , pp. 1321-1328
- Taylor, M.E.¹ Whiteson, S.² Stone, P.³

31
- 2542485629
- Practical issues in temporal difference learning
- eds. Moody, J., Hanson, S. and Lippmann, R. Morgan Kaufmann
- Tesauro, G., Practical issues in temporal difference learning, in Advances in Neural Information Processing Systems, Vol.4, eds. Moody, J., Hanson, S. and Lippmann, R. (Morgan Kaufmann, 1992), pp. 259-266.
- (1992) Advances in Neural Information Processing Systems , vol.4 , pp. 259-266
- Tesauro, G.¹

32
- 34548072657
- Distributed agent-based air traffic flow management
- Honolulu, HI
- Tumer, K. and Agogino, A., Distributed agent-based air traffic flow management, in Proc. Sixth Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (Honolulu, HI, 2007), pp. 330-337.
- (2007) Proc. Sixth Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems , pp. 330-337
- Tumer, K.¹ Agogino, A.²

33
- 0036355687
- Learning sequences of actions in collectives of autonomous agents
- Bologna, Italy
- Tumer, K., Agogino, A. and Wolpert, D., Learning sequences of actions in collectives of autonomous agents, in Proc. First Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (Bologna, Italy, 2002), pp. 378-385.
- (2002) Proc. First Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems , pp. 378-385
- Tumer, K.¹ Agogino, A.² Wolpert, D.³

34
- 84899955497
- Aligning social welfare and agent preferences to alleviate traffic congestion
- Estoril, Portugal
- Tumer, K., Welch, Z. T. and Agogino, A., Aligning social welfare and agent preferences to alleviate traffic congestion, in Proc. Seventh Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (Estoril, Portugal, 2008).
- (2008) Proc. Seventh Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems
- Tumer, K.¹ Welch, Z.T.² Agogino, A.³

35
- 4544388719
- Tumer, K. and Wolpert, D. (eds.) Springer, New York
- Tumer, K. and Wolpert, D. (eds.), Collectives and the Design of Complex Systems (Springer, New York, 2004).
- (2004) Collectives and the Design of Complex Systems

36
- 85158118268
- Collective intelligence and Braess' paradox
- Austin, TX
- Tumer, K. and Wolpert, D. H., Collective intelligence and Braess' paradox, in Proc. Seventeenth National Conf. on Artificial Intelligence (Austin, TX, 2000), pp. 104- 109.
- (2000) Proc. Seventeenth National Conf. on Artificial Intelligence , pp. 104-109
- Tumer, K.¹ Wolpert, D.H.²

37
- 34249064148
- What evolutionary game theory tells us about multiagent learning
- Tuyls, K. and Parsons, S., What evolutionary game theory tells us about multiagent learning, Artif. Intell. 171 (2007) 406-416.
- (2007) Artif. Intell. , vol.171 , pp. 406-416
- Tuyls, K.¹ Parsons, S.²

38
- 33644810030
- Coordinated exploration in multi-agent reinforcement learning: An application to load balancing
- Utrecht, The Netherlands
- Verbeeck, K., Nowe, A. and Tuyls, K., Coordinated exploration in multi-agent reinforcement learning: An application to load balancing, in Proc. Fourth Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (Utrecht, The Netherlands, 2005).
- (2005) Proc. Fourth Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems
- Verbeeck, K.¹ Nowe, A.² Tuyls, K.³

39
- 58049194007
- Reinforcement learning in stochastic single and multi-stage games
- Springer Verlag, Berlin
- Verbeeck, K., Peeters, M., Nowe, A. and Tuyls, K., Reinforcement learning in stochastic single and multi-stage games, in Adaptive Agents and Multi-Agent Systems II, Lecture Notes in Artificial Intelligence (Springer Verlag, Berlin, 2005), pp. 275-294.
- (2005) Adaptive Agents and Multi-Agent Systems II, Lecture Notes in Artificial Intelligence , pp. 275-294
- Verbeeck, K.¹ Peeters, M.² Nowe, A.³ Tuyls, K.⁴

40
- 70349601485
- Multiagent coordination using a distributed combinatorial auction
- Vidal, J. M., Multiagent coordination using a distributed combinatorial auction, in AAAI Workshop on Auction Mechanism for Robot Coordination (2006).
- (2006) AAAI Workshop on Auction Mechanism for Robot Coordination
- Vidal, J.M.¹

41
- 84962044659
- The moving target function problem in multi-agent learning
- AAAI/MIT press
- Vidal, J. M. and Durfee, E. H., The moving target function problem in multi-agent learning, in Proc. Third Int. Conf. on Multi-Agent Systems (AAAI/MIT press, 1998), pp. 317-324.
- (1998) Proc. Third Int. Conf. on Multi-Agent Systems , pp. 317-324
- Vidal, J.M.¹ Durfee, E.H.²

42
- 34249833101
- Q-learning
- Watkins, C. and Dayan, P., Q-learning, Mach. Learn. 8 (1992) 279-292.
- (1992) Mach. Learn. , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

43
- 0348156830
- Trading agents competing: Performance, progress, and market effectiveness
- Wellman, M. P., Cheng, S.-F., Reeves, D. M. and Lochne, K. M., Trading agents competing: Performance, progress, and market effectiveness, IEEE Intell. Syst. 18 (2003) 48-53.
- (2003) IEEE Intell. Syst. , vol.18 , pp. 48-53
- Wellman, M.P.¹ Cheng, S.-F.² Reeves, D.M.³ Lochne, K.M.⁴

44
- 33847264400
- Empirical studies in action selection for reinforcement learning
- Whiteson, S., Taylor, M. E. and Stone, P., Empirical studies in action selection for reinforcement learning, Adapt. Behav. 15 (2007).
- (2007) Adapt. Behav. , vol.15
- Whiteson, S.¹ Taylor, M.E.² Stone, P.³

45
- 0001309161
- Optimal reward functions for members of collectives
- Wolpert, D. H. and Tumer, K., Optimal reward functions for members of collectives, Adv. Complex Syst. 4 (2001) 265-279.
- (2001) Adv. Complex Syst. , vol.4 , pp. 265-279
- Wolpert, D.H.¹ Tumer, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.