SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 21 - Proceedings of the 2008 Conference

Volumn , Issue , 2009, Pages 1177-1184

Fitted Q-iteration by advantage weighted regression

(2) Neumann, Gerhard a Peters, Jan b

a GRAZ UNIVERSITY OF TECHNOLOGY (Austria)

b MAX PLANCK INSTITUTE FOR BIOLOGICAL CYBERNETICS (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS;

ACTION SELECTION; ACTION SPACES; CONTINUOUS ACTIONS; GREEDY ACTION; HIGH QUALITY; LEARNING PROCESS; OPTIMISATIONS; REAL-WORLD TASK; TECHNICAL APPLICATIONS; WEIGHTED REGRESSION;

ITERATIVE METHODS;

EID: 70049104729 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (62)

References (14)

1
- 0004102479
- Boston, MA: MIT Press
- R. Sutton and A. Barto, Reinforcement Learning. Boston, MA: MIT Press, 1998.
- (1998) Reinforcement Learning
- Sutton, R.¹ Barto, A.²

2
- 85153940465
- Generalization in reinforcement learning: Safely approximating the value function
- MIT Press
- J. A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems 7, pp. 369-376, MIT Press, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 369-376
- Boyan, J.A.¹ Moore, A.W.²

3
- 0029248341
- Minimum-jerk, two-thirds power law, and isochrony: Converging approaches to movement planning
- P. Viviani and T. Flash, "Minimum-jerk, two-thirds power law, and isochrony: Converging approaches to movement planning," Journal of Experimental Psychology: Human Perception and Performance, vol. 21, no. 1, pp. 32-53, 1995.
- (1995) Journal of Experimental Psychology: Human Perception and Performance , vol.21 , Issue.1 , pp. 32-53
- Viviani, P.¹ Flash, T.²

4
- 0031064725
- A minimum energy cost hypothesis for human arm trajectories
- R. M. Alexander, "A minimum energy cost hypothesis for human arm trajectories," Biological Cybernetics, vol. 76, pp. 97-105, 1997. (Pubitemid 127665170)
- (1997) Biological Cybernetics , vol.76 , Issue.2 , pp. 97-105
- Alexander, R.McN.¹

5
- 0032552114
- Signal-dependent noise determines motor planning
- DOI 10.1038/29528
- C. M. Harris and D. M. Wolpert, "Signal-dependent noise determines motor planning.," Nature, vol. 394, pp. 780-784, August 1998. (Pubitemid 28391641)
- (1998) Nature , vol.394 , Issue.6695 , pp. 780-784
- Harris, C.M.¹ Wolpert, D.M.²

6
- 33646687423
- Neural fitted Q-iteration - First experiences with a data efficient neural reinforcement learning method
- M. Riedmiller, "Neural fitted Q-iteration - first experiences with a data efficient neural reinforcement learning method," in Proceedings of the European Conference on Machine Learning (ECML), 2005.
- (2005) Proceedings of the European Conference on Machine Learning (ECML)
- Riedmiller, M.¹

7
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- MIT Press
- R. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Advances in Neural Information Processing Systems 8, pp. 1038-1044, MIT Press, 1996.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
- Sutton, R.¹

8
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," J. Mach. Learn. Res., vol. 6, pp. 503-556, 2005.
- (2005) J. Mach. Learn. Res. , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

9
- 85161978146
- Fitted Q-iteration in continuous action-space MDPs
- Cambridge, MA: MIT Press
- A. Antos, R. Munos, and C. Szepesvari, "Fitted Q-iteration in continuous action-space MDPs," in Advances in Neural Information Processing Systems 20, pp. 9-16, Cambridge, MA: MIT Press, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 9-16
- Antos, A.¹ Munos, R.² Szepesvari, C.³

10
- 34548767315
- S. Timmer and M. Riedmiller, "Fitted Q-iteration with CMACs," pp. 1-8, 2007.
- (2007) Fitted Q-iteration with CMACs , pp. 1-8
- Timmer, S.¹ Riedmiller, M.²

11
- 69249227158
- Policy learning formotor skills
- J. Peters and S. Schaal, "Policy learning formotor skills," in Proceedings of 14th International Conference on Neural Information Processing (ICONIP), 2007.
- (2007) Proceedings of 14th International Conference on Neural Information Processing (ICONIP)
- Peters, J.¹ Schaal, S.²

12
- 17444409624
- A tutorial on the cross-entropy method
- DOI 10.1007/s10479-005-5724-z
- P.-T. de Boer, D. Kroese, S. Mannor, and R. Rubinstein, "A tutorial on the cross-entropy method," Annals of Operations Research, vol. 134, pp. 19-67, January 2005. (Pubitemid 40550039)
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 19-67
- De Boer, P.-T.¹ Kroese, D.P.² Mannor, S.³ Rubinstein, R.Y.⁴

13
- 34547964788
- Reinforcement learning by reward-weighted regression for operational space control
- J. Peters and S. Schaal, "Reinforcement learning by reward-weighted regression for operational space control," in Proceedings of the International Conference on Machine Learning (ICML), 2007.
- (2007) Proceedings of the International Conference on Machine Learning (ICML)
- Peters, J.¹ Schaal, S.²

14
- 0031074521
- Locally weighted learning
- C. G. Atkeson, A. W. Moore, and S. Schaal, "Locally weighted learning," Artificial Intelligence Review, vol. 11, no. 1-5, pp. 11-73, 1997. (Pubitemid 127508233)
- (1997) Artificial Intelligence Review , vol.11 , Issue.1-5 , pp. 11-73
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.