SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2008, Pages 154-161

A study of reinforcement learning in a new multiagent domain

Author keywords

[No Author keywords available]

Indexed keywords

EDUCATION; MULTI AGENT SYSTEMS; REINFORCEMENT; REINFORCEMENT LEARNING;

BELLMAN ERRORS; EMPIRICAL RESULTS; KEEPAWAY; MULTI AGENTS; ROBOCUP; STATE SPACES; VALUE FUNCTIONS;

LEARNING ALGORITHMS;

EID: 62949148941 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/WIIAT.2008.114 Document Type: Conference Paper

Times cited : (5)

References (12)

2
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- P. Stone, R. S. Sutton, and G. Kuhlmann, "Reinforcement learning for RoboCup-soccer keepaway," Adaptive Behavior, vol. 13, no. 3, pp. 165-188, 2005.
- (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

3
- 4544247283
- M. E. Harmon and S. S. Harmon, "Reinforcement learning: a tutorial," 1996.
- (1996) Reinforcement learning: A tutorial
- Harmon, M.E.¹ Harmon, S.S.²

5
- 62949232970
- Advantage (λ) learning,
- B. Bakker, "Advantage (λ) learning," technical report, 2002.
- (2002) technical report
- Bakker, B.¹

6
- 11244263867
- Reinforcement learning for visual servoing of a mobile robot
- C. Gaskett, L. Fletcher, and E. Zelinsky, "Reinforcement learning for visual servoing of a mobile robot," in Proceedings of the Australian Conference on Robotics and Automation, 2000.
- (2000) Proceedings of the Australian Conference on Robotics and Automation
- Gaskett, C.¹ Fletcher, L.² Zelinsky, E.³

7
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

8
- 17444414191
- Basis function adaptation in temporal difference reinforcement learning
- I. Menache, S. Mannor, and N. Shimkin, "Basis function adaptation in temporal difference reinforcement learning," Annals of Operations Research, vol. 134, pp. 215-238, 2005.
- (2005) Annals of Operations Research , vol.134 , pp. 215-238
- Menache, I.¹ Mannor, S.² Shimkin, N.³

9
- 29344433509
- Samuel meets amarel: Automating value function approximation using global state space analysis
- S. Mahadevan, "Samuel meets amarel: Automating value function approximation using global state space analysis," in Proceedings of AAAI, pp. 1000-1005, 2005.
- (2005) Proceedings of AAAI , pp. 1000-1005
- Mahadevan, S.¹

10
- 34250706852
- Automatic basis function construction for approximate dynamic programming and reinforcement learning
- P. W. Keller, S. Mannor, and D. Precup, "Automatic basis function construction for approximate dynamic programming and reinforcement learning," in Proceedings of the 23rd International Conference on Machine Learning, 2006.
- (2006) Proceedings of the 23rd International Conference on Machine Learning
- Keller, P.W.¹ Mannor, S.² Precup, D.³

11
- 0024680419
- Adaptive aggregation methods for infinite horizon dynamic programming
- D. Bertsekas and D. non, "Adaptive aggregation methods for infinite horizon dynamic programming," IEEE Transactions on Automatic Control, vol. 34, pp. 589-598, 1989.
- (1989) IEEE Transactions on Automatic Control , vol.34 , pp. 589-598
- Bertsekas, D.¹ non, D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.