SCOPUS 정보 검색 플랫폼

Volumn 1747, Issue , 1999, Pages 417-428

Q-learning in continuous state and action spaces

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; COMPUTERS;

ACTION SPACES; CONTINUOUS ACTIONS; CONTINUOUS STATE; CONTROL POLICY; CONTROL TASK; DISCRETE STATE; ENHANCE LEARNING; NON-HOLONOMIC CONTROL;

ARTIFICIAL INTELLIGENCE;

EID: 84957629024 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-46695-9_35 Document Type: Conference Paper

Times cited : (96)

References (17)

1
- 0016556021
- A new approach to manipulator control: The cerebrellar model articulated controller (CMAC)
- [1] J. S. Albus. A new approach to manipulator control: the cerebrellar model articulated controller (CMAC). J. Dynamic Systems, Measurement and Control, 97:220-227, 1975.
- (1975) J. Dynamic Systems, Measurement and Control , vol.97 , pp. 220-227
- Albus, J.S.¹

4
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- [4] A. G. Barto, R. S. Sutton, and C. W. Anderson. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans on systems, man and cybernetics, SMC, 13:834-846, 1983.
- (1983) IEEE Trans on Systems, Man and Cybernetics, SMC , vol.13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

5
- 84855331760
- Reinforcement learning applied to the control of an autonomous underwater vehicle
- [5] Chris Gaskett, David Wettergreen, and Alexander Zelinsky. Reinforcement learning applied to the control of an autonomous underwater vehicle. In Proceedings of the Australian Conference on Robotics and Automation (AuCRA99), 1999.
- (1999) Proceedings of the Australian Conference on Robotics and Automation (Aucra99)
- Gaskett, C.¹ Wettergreen, D.² Zelinsky, A.³

7
- 37249077378
- Residual advantage learning applied to a differential game
- [7] Mance E. Harmon and Leemon C. Baird. Residual advantage learning applied to a differential game. In Proceedings of the International Conference on Neural Networks, Washington D.C, 1995.
- (1995) Proceedings of the International Conference on Neural Networks, Washington D.C
- Harmon, M.E.¹ Baird, L.C.²

8
- 0003527079
- Springer, Berlin, third edition
- [8] T. Kohonen. Self-Organization and Associative Memory. Springer, Berlin, third edition, 1989.
- (1989) Self-Organization and Associative Memory
- Kohonen, T.¹

9
- 0003759880
- Academic Press
- [9] Peter Lancaster and Kestutis Salkauskas. Curve and Surface Fitting, an Introduction. Academic Press, 1986.
- (1986) Curve and Surface Fitting, an Introduction
- Lancaster, P.¹ Salkauskas, K.²

10
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- [10] Long-Ji Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning Journal, 8(3/4), 1992.
- (1992) Machine Learning Journal , vol.8 , Issue.3-4
- Lin, L.-J.¹

12
- 0031231885
- Experiments with reinforcement learning in problems with continuous state and action spaces
- [12] Juan C. Santamaria, Richard S. Sutton, and Ashwin Ram. Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behaviour, 6(2):163-218, 1998.
- (1998) Adaptive Behaviour , vol.6 , Issue.2 , pp. 163-218
- Santamaria, J.C.¹ Sutton, R.S.² Ram, A.³

14
- 0004102479
- Bradford Books, MIT
- [14] Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. Bradford Books, MIT, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

15
- 0031341345
- Neural reinforcement learning for behaviour synthesis
- [15] Claude F. Touzet. Neural reinforcement learning for behaviour synthesis. Robotics and Autonomous Systems, 22(3-4):251-81, 1997.
- (1997) Robotics and Autonomous Systems , vol.22 , Issue.3-4 , pp. 251-281
- Touzet, C.F.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.