SCOPUS 정보 검색 플랫폼

Autonomous Agents and Multi-Agent Systems

Volumn 17, Issue 2, 2008, Pages 320-338

Analyzing and visualizing multiagent rewards in dynamic and stochastic domains

(2) Agogino, Adrian K a Tumer, Kagan b

a UNIVERSITY OF CALIFORNIA (United States)

b OREGON STATE UNIVERSITY (United States)

Author keywords

Multiagent learning; Reinforcement learning; Reward analysis; Visualization

Indexed keywords

EID: 51649111408 PISSN: 13872532 EISSN: 15737454 Source Type: Journal
DOI: 10.1007/s10458-008-9046-9 Document Type: Article

Times cited : (121)

References (32)

1
- 51649110789
- Principal curve classifier-A nonlinear approach to pattern classification
- Anchorage, Alaska
- Agogino, A., Martin, C., & Ghosh, J. (1998). Principal curve classifier-A nonlinear approach to pattern classification. In Proceedings of International Joint Conference on Neural Networks, Anchorage, Alaska.
- (1998) Proceedings of International Joint Conference on Neural Networks
- Agogino, A.¹ Martin, C.² Ghosh, J.³

2
- 0033341042
- Visualization of radial basis function networks
- Washington, DC
- Agogino, A., Martin, C., & Ghosh, J. (1999). Visualization of radial basis function networks. In Proceedings of International Joint Conference on Neural Networks. Washington, DC.
- (1999) Proceedings of International Joint Conference on Neural Networks
- Agogino, A.¹ Martin, C.² Ghosh, J.³

3
- 35048882017
- Efficient evaluation functions for multi-rover systems
- Seattle, WA
- Agogino, A., & Tumer, K. (2004). Efficient evaluation functions for multi-rover systems. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004) (pp. 1-12). Seattle, WA.
- (2004) Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004) , pp. 1-12
- Agogino, A.¹ Tumer, K.²

4
- 33644813420
- Multi agent reward analysis for learning in noisy domains
- Utrecht, Netherlands
- Agogino, A., & Tumer, K. (2005). Multi agent reward analysis for learning in noisy domains. In Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multi-Agent Systems, Utrecht, Netherlands.
- (2005) Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multi-Agent Systems
- Agogino, A.¹ Tumer, K.²

5
- 84898958374
- Gradient descent for general reinforcement learning
- Cambridge, MA.
- Baird, L., & Moore, A. (1999). Gradient descent for general reinforcement learning. In Advances in Neural Information Processing Systems (NIPS) (pp. 968-974). Cambridge, MA.
- (1999) Advances in Neural Information Processing Systems (NIPS) , pp. 968-974
- Baird, L.¹ Moore, A.²

6
- 0345833118
- Visualization methods for neural networks
- The Hague, Netherlands
- Bishof, H., Pinz, A., & Kropatsch, W. G. (1992). Visualization methods for neural networks. In 11th International Conference on Pattern Recognition (pp. 581-585). The Hague, Netherlands.
- (1992) 11th International Conference on Pattern Recognition , pp. 581-585
- Bishof, H.¹ Pinz, A.² Kropatsch, W.G.³

7
- 0003487601
- Oxford University Press New York
- Bishop C.M. (1995). Neural networks for pattern recognition. Oxford University Press, New York
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

8
- 1142280924
- Coordination in multiagent reinforcement learning: A Bayesian approach
- Melbourne, Australia
- Chalkiadakis, G., & Boutilier, C. (2003). Coordination in multiagent reinforcement learning: A Bayesian approach. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-03), Melbourne, Australia.
- Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-03) , pp. 2003
- Chalkiadakis, G.¹ Boutilier, C.²

9
- 85156187730
- Improving elevator performance using reinforcement learning
- MIT Press Cambridge, MA
- Crites R.H. and Barto A.G. (1996). Improving elevator performance using reinforcement learning. In: Touretzky, D.S., Mozer, M.C. and Hasselmo, M.E. (eds) Advances in neural information processing systems-8, pp 1017-1023. MIT Press, Cambridge, MA
- (1996) Advances in Neural Information Processing systems-8 , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.² Touretzky, D.S.³ Mozer, M.C.⁴ Hasselmo, M.E.⁵

10
- 3543121818
- The dynamic selection of coordination mechanisms
- 1-2
- Excelente-Toledo C.B. and Jennings N.R. (2004). The dynamic selection of coordination mechanisms. Journal of Autonomous Agents and Multi-Agent Systems 9(1-2): 55-85
- (2004) Journal of Autonomous Agents and Multi-Agent Systems , vol.9 , pp. 55-85
- Excelente-Toledo, C.B.¹ Jennings, N.R.²

11
- 0348037748
- Visualization of learning in neural networks using principal component analysis
- Gallagher, M., & Downs, T. (1997). Visualization of learning in neural networks using principal component analysis. In International Conference on Computational Intelligence and Multimedia Applications (pp. 327-331).
- (1997) International Conference on Computational Intelligence and Multimedia Applications , pp. 327-331
- Gallagher, M.¹ Downs, T.²

12
- 29344475738
- Solving factored MDPs with continuous and discrete variables
- Guestrin, C., Hauskrecht, M., & Kveton, B. (2004). Solving factored MDPs with continuous and discrete variables. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (pp. 235-242).
- (2004) Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence , pp. 235-242
- Guestrin, C.¹ Hauskrecht, M.² Kveton, B.³

13
- 84880898477
- Max-norm projections for factored MDPs
- Guestrin, C., Koller, D., & Parr, R. (2001a). Max-norm projections for factored MDPs. In Proceedings of the International Joint Conference on Artificial Intelligence.
- (2001) Proceedings of the International Joint Conference on Artificial Intelligence
- Guestrin, C.¹ Koller, D.² Parr, R.³

14
- 0012296128
- Multiagent planning with factored MDPs
- Guestrin, C., Koller, D., & Parr, R. (2001b). Multiagent planning with factored MDPs. In NIPS-14.
- (2001) NIPS-14
- Guestrin, C.¹ Koller, D.² Parr, R.³

15
- 4544236179
- Coordinated reinforcement learning
- Guestrin, C., Lagoudakis, M., & Parr, R. (2002). Coordinated reinforcement learning. In Proceedings of the 19th International Conference on Machine Learning.
- (2002) Proceedings of the 19th International Conference on Machine Learning
- Guestrin, C.¹ Lagoudakis, M.² Parr, R.³

16
- 0024732792
- Connectionist learning procedures
- Hinton G. (1986). Connectionist learning procedures. Artificial Intelligence 40: 185-234
- (1986) Artificial Intelligence , vol.40 , pp. 185-234
- Hinton, G.¹

17
- 4544307742
- Simulation and visualization of a market-based model for logistics management in transportation
- New York, NY.
- Hoen, P., Redekar, H. L. P. G., & Robu, V. (2004). Simulation and visualization of a market-based model for logistics management in transportation. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multi-Agent Systems (pp. 1218-1219). New York, NY.
- (2004) Proceedings of the Third International Joint Conference on Autonomous Agents and Multi-Agent Systems , pp. 1218-1219
- Hoen, P.¹ Redekar, H.L.P.G.² Robu, V.³

18
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- Hu, J., & Wellman, M. P. (1998). Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proceedings of the Fifteenth International Conference on Machine Learning (pp. 242-250).
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

19
- 0003946510
- 2 Springer New York
- Jolliffe I. (2002). Principal component analysis (2nd ed). Springer, New York
- (2002) Principal Component Analysis
- Jolliffe, I.¹

20
- 84880677563
- Efficient reinforcement learning in factored MDPs
- Kearns, M., & Koller, D. (1999). Efficient reinforcement learning in factored MDPs. In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (pp. 740-747).
- (1999) Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence , pp. 740-747
- Kearns, M.¹ Koller, D.²

21
- 0002218603
- Coordination and learning in multi-robot systems
- Mataric, M. J. (1998). Coordination and learning in multi-robot systems. In IEEE Intelligent Systems (pp. 6-8).
- (1998) IEEE Intelligent Systems , pp. 6-8
- Mataric, M.J.¹

22
- 0034205975
- Multiagent systems: A survey from a machine learning perspective
- 3
- Stone P. and Veloso M. (2000). Multiagent systems: A survey from a machine learning perspective. Autonomous Robots 8(3): 345-383
- (2000) Autonomous Robots , vol.8 , pp. 345-383
- Stone, P.¹ Veloso, M.²

23
- 0004102479
- MIT Press Cambridge, MA
- Sutton R.S. and Barto A.G. (1998). Reinforcement learning: An introduction. MIT Press, Cambridge, MA
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

24
- 51649111071
- Designing agent utilities for coordinated, scalable and robust multi-agent systems
- Scerri, P. Mailler, R., & R. Vincent (Eds.) Springer (to appear)
- Tumer, K. (2005). Designing agent utilities for coordinated, scalable and robust multi-agent systems. In Scerri, P. Mailler, R., & R. Vincent (Eds.), Challenges in the coordination of large scale multiagent Systems. Springer (to appear).
- (2005) Challenges in the Coordination of Large Scale Multiagent Systems
- Tumer, K.¹

25
- 34548072657
- Distributed agent-based air traffic flow management
- Honolulu, HI (Best paper award)
- Tumer, K., & Agogino, A. (2007). Distributed agent-based air traffic flow management. In Proceedings of the Sixth International Joint Conference on Autonomous Agents and Multi-Agent Systems (pp. 330-337). Honolulu, HI (Best paper award).
- (2007) Proceedings of the Sixth International Joint Conference on Autonomous Agents and Multi-Agent Systems , pp. 330-337
- Tumer, K.¹ Agogino, A.²

26
- 0036355687
- Learning sequences of actions in collectives of autonomous agents
- Bologna, Italy
- Tumer, K., Agogino, A., & Wolpert, D. (2002). Learning sequences of actions in collectives of autonomous agents. In Proceedings of the First International Joint Conference on Autonomous Agents and Multi-Agent Systems, Bologna, Italy (pp. 378-385).
- (2002) Proceedings of the First International Joint Conference on Autonomous Agents and Multi-Agent Systems , pp. 378-385
- Tumer, K.¹ Agogino, A.² Wolpert, D.³

27
- 4544388719
- Tumer K. and Wolpert D. (Eds). Springer New York
- Tumer K. and Wolpert D. (Eds). (2004a). Collectives and the design of complex systems. Springer, New York
- (2004) Collectives and the Design of Complex Systems

28
- 32444447473
- A survey of collectives
- Springer
- Tumer, K., & Wolpert, D. (2004b). A survey of collectives. In Collectives and the design of complex systems (pp. 1-42). Springer.
- (2004) Collectives and the Design of Complex Systems , pp. 1-42
- Tumer, K.¹ Wolpert, D.²

29
- 85158118268
- Collective intelligence and Braess Paradox
- Tumer, K., & Wolpert, D. H. (2000). Collective intelligence and Braess Paradox. In Proceedings of the Seventeeth National Conference on Artificial Intelligence (pp. 104-109).
- (2000) Proceedings of the Seventeeth National Conference on Artificial Intelligence , pp. 104-109
- Tumer, K.¹ Wolpert, D.H.²

30
- 0025867614
- Visualizing processes in neural networks
- Wejchert J. and Tesauro G. (1991). Visualizing processes in neural networks. IBM Journal of Research and Development 35: 244-253
- (1991) IBM Journal of Research and Development , vol.35 , pp. 244-253
- Wejchert, J.¹ Tesauro, G.²

31
- 0001309161
- Optimal payoff functions for members of collectives
- 2/3
- Wolpert D.H. and Tumer K. (2001). Optimal payoff functions for members of collectives. Advances in Complex Systems 4(2/3): 265-279
- (2001) Advances in Complex Systems , vol.4 , pp. 265-279
- Wolpert, D.H.¹ Tumer, K.²

32
- 1842531912
- Improving search algorithms by using intelligent coordinates
- Wolpert D.H., Tumer K. and Bandari E. (2004). Improving search algorithms by using intelligent coordinates. Physical Review E 69: 017701
- (2004) Physical Review e , vol.69 , pp. 017701
- Wolpert, D.H.¹ Tumer, K.² Bandari, E.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.