SCOPUS Information Search Platform

Proceedings of the International Conference on Autonomous Agents

Volume , Issue , 2005, Pages 233-240

Multi-agent reward analysis for learning in noisy domains

(2) Agogino, Adrian (a); Tumer, Kagan (a)

(a) NASA AMES RESEARCH CENTER (United States)

Author keywords

Multiagent Systems; Reinforcement Learning; Visualization

Indexed keywords

AUTONOMOUS AGENTS; COMPUTER SIMULATION; LEARNING SYSTEMS; PROBLEM SOLVING; VISUALIZATION;

MULTI-AGENT LEARNING; REINFORCEMENT LEARNING; REWARD EFFICIENCY VISUALIZATION;

MULTI AGENT SYSTEMS;

EID: 33644813420 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (10)

References (23)

1
- 35048882017
- Efficient evaluation functions for multi-rover systems
- Seattle, WA
- A. Agogino and K. Tumer. Efficient evaluation functions for multi-rover systems. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004), pages 1-12, Seattle, WA, 2004.
- (2004) Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004) , pp. 1-12
- Agogino, A.¹ Tumer, K.²

2
- 0033341042
- Visualization of radial basis function networks
- Washington, DC
- Adrian Agogino, Cheryl Martin, and Joydeep Ghosh. Visualization of radial basis function networks. In Proceedings of International Joint Conference on Neural Networks, Washington, DC, 1999.
- (1999) Proceedings of International Joint Conference on Neural Networks
- Agogino, A.¹ Martin, C.² Ghosh, J.³

3
- 84898958374
- Gradient descent for general reinforcement learning
- Cambridge, MA. The MIT Press
- L. Baird and A. Moore. Gradient descent for general reinforcement learning. In Advances in Neural Information Processing Systems (NIPS), pages 968-974, Cambridge, MA, 1999. The MIT Press.
- (1999) Advances in Neural Information Processing Systems (NIPS) , pp. 968-974
- Baird, L.¹ Moore, A.²

4
- 0345833118
- Visualization methods for neural networks
- The Hague, Netherlands
- Horst Bischof, Axel Pinz, and Walter G. Kropatsch. Visualization methods for neural networks. In 11th International Conference on Pattern Recognition, pages 581-585, The Hague, Netherlands, 1992.
- (1992) 11th International Conference on Pattern Recognition , pp. 581-585
- Bischof, H.¹ Pinz, A.² Kropatsch, W.G.³

5
- 0003487601
- Oxford University Press, New York
- C. M. Bishop. Neural Networks for Pattern Recognition. Oxford University Press, New York, 1995.
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

6
- 85156187730
- Improving elevator performance using reinforcement learning
- D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, MIT Press
- R. H. Crites and A. G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems - 8, pages 1017-1023. MIT Press, 1996.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

7
- 3543121818
- The dynamic selection of coordination mechanisms
- C. B. Excelente-Toledo and N. R. Jennings. The dynamic selection of coordination mechanisms. J. of Autonomous Agents and Multi-Agent Systems, 9(1-2), 2004.
- (2004) J. of Autonomous Agents and Multi-agent Systems , vol.9 , Issue.1-2
- Excelente-Toledo, C.B.¹ Jennings, N.R.²

8
- 0348037748
- Visualization of learning in neural networks using principal component analysis
- Marcus Gallagher and Tom Downs. Visualization of learning in neural networks using principal component analysis. In International Conference on Computational Intelligence and Multimedia Applications, pages 327-331, 1997.
- (1997) International Conference on Computational Intelligence and Multimedia Applications , pp. 327-331
- Gallagher, M.¹ Downs, T.²

9
- 0024732792
- Connectionist learning procedures
- G. Hinton. Connectionist learning procedures. Artificial Intelligence, 40:185-234, 1986.
- (1986) Artificial Intelligence , vol.40 , pp. 185-234
- Hinton, G.¹

10
- 4544307742
- Simulation and visualization of a market-based model for logistics management in transportation
- New York, NY, July
- P. Hoen, G. Redekar, V. Robu, and H. La Poutre. Simulation and visualization of a market-based model for logistics management in transportation. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multi-Agent Systems, pages 1218-1219, New York, NY, July 2004.
- (2004) Proceedings of the Third International Joint Conference on Autonomous Agents and Multi-agent Systems , pp. 1218-1219
- Hoen, P.¹ Redekar, G.² Robu, V.³ La Poutre, H.⁴

11
- 0000929496
- Multiagent reinforcement learning: Theoretical framework and an algorithm
- June
- J. Hu and M. P. Wellman. Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proceedings of the Fifteenth International Conference on Machine Learning, pages 242-250, June 1998.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

12
- 0002218603
- Coordination and learning in multi-robot systems
- March
- Maja J. Mataric. Coordination and learning in multi-robot systems. In IEEE Intelligent Systems, pages 6-8, March 1998.
- (1998) IEEE Intelligent Systems , pp. 6-8
- Mataric, M.J.¹

13
- 0004102479
- MIT Press, Cambridge, MA
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

14
- 51649111071
- Designing agent utilities for coordinated, scalable and robust multi-agent systems
- P. Scerri, R. Mailler, and R. Vincent, editors, Springer. to appear
- K. Tumer. Designing agent utilities for coordinated, scalable and robust multi-agent systems. In P. Scerri, R. Mailler, and R. Vincent, editors, Challenges in the Coordination of Large Scale Multiagent Systems. Springer, 2005. to appear.
- (2005) Challenges in the Coordination of Large Scale Multiagent Systems
- Tumer, K.¹

15
- 0036355687
- Learning sequences of actions in collectives of autonomous agents
- Bologna, Italy, July
- K. Tumer, A. Agogino, and D. Wolpert. Learning sequences of actions in collectives of autonomous agents. In Proceedings of the First International Joint Conference on Autonomous Agents and Multi-Agent Systems, pages 378-385, Bologna, Italy, July 2002.
- (2002) Proceedings of the First International Joint Conference on Autonomous Agents and Multi-agent Systems , pp. 378-385
- Tumer, K.¹ Agogino, A.² Wolpert, D.³

16
- 4544388719
- K. Tumer and D. Wolpert, editors. Springer, New York
- K. Tumer and D. Wolpert, editors. Collectives and the Design of Complex Systems. Springer, New York, 2004.
- (2004) Collectives and the Design of Complex Systems

17
- 32444447473
- A survey of collectives
- Springer
- K. Tumer and D. Wolpert. A survey of collectives. In Collectives and the Design of Complex Systems, pages 1-42. Springer, 2004.
- (2004) Collectives and the Design of Complex Systems , pp. 1-42
- Tumer, K.¹ Wolpert, D.²

18
- 85158118268
- Collective intelligence and Braess' paradox
- K. Tumer and D. H. Wolpert. Collective intelligence and Braess' paradox. In Proceedings of the Seventeenth National Conference on Artificial Intelligence, pages 104-109, 2000.
- (2000) Proceedings of the Seventeenth National Conference on Artificial Intelligence , pp. 104-109
- Tumer, K.¹ Wolpert, D.H.²

19
- 0025867614
- Visualizing processes in neural networks
- J. Wejchert and G. Tesauro. Visualizing processes in neural networks. IBM Journal of Research and Development, 35:244-253, 1991.
- (1991) IBM Journal of Research and Development , vol.35 , pp. 244-253
- Wejchert, J.¹ Tesauro, G.²

20
- 0033705642
- Adaptivity in agent-based routing for data networks
- D. H. Wolpert, S. Kirshner, C. J. Merz, and K. Tumer. Adaptivity in agent-based routing for data networks. In Proceedings of the Fourth International Conference of Autonomous Agents, pages 396-403, 2000.
- (2000) Proceedings of the Fourth International Conference of Autonomous Agents , pp. 396-403
- Wolpert, D.H.¹ Kirshner, S.² Merz, C.J.³ Tumer, K.⁴

21
- 0001309161
- Optimal payoff functions for members of collectives
- D. H. Wolpert and K. Tumer. Optimal payoff functions for members of collectives. Advances in Complex Systems, 4(2/3):265-279, 2001.
- (2001) Advances in Complex Systems , vol.4 , Issue.2-3 , pp. 265-279
- Wolpert, D.H.¹ Tumer, K.²

22
- 1842531912
- Improving search algorithms by using intelligent coordinates
- D. H. Wolpert, K. Tumer, and E. Bandari. Improving search algorithms by using intelligent coordinates. Physical Review E, 69:017701, 2004.
- (2004) Physical Review E , vol.69 , pp. 017701
- Wolpert, D.H.¹ Tumer, K.² Bandari, E.³

23
- 0034635650
- Collective intelligence for control of distributed dynamical systems
- March
- D. H. Wolpert, K. Wheeler, and K. Tumer. Collective intelligence for control of distributed dynamical systems. Europhysics Letters, 49(6), March 2000.
- (2000) Europhysics Letters , vol.49 , Issue.6
- Wolpert, D.H.¹ Wheeler, K.² Tumer, K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.