SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Machine Learning

Volumn 49, Issue 2-3, 2002, Pages 247-265

Continuous-action Q-learning

(3) Millán, José Del R a Posenato, Daniele a Dedieu, Eric a

a JOINT RESEARCH CENTRE (Italy)

Author keywords

Continuous domains; Incremental topology preserving maps; Real time operation; Reinforcement learning

Indexed keywords

AUTONOMOUS AGENTS; ESTIMATION; OPTIMIZATION; REAL TIME SYSTEMS; TOPOLOGY;

INCREMENTAL TOPOLOGY PRESERVING MAPS; Q LEARNING; REINFORCEMENT LEARNING; TEMPORAL DIFFERENCE METHODS;

LEARNING ALGORITHMS;

EID: 0036832960 PISSN: 08856125 EISSN: None Source Type: Journal
DOI: 10.1023/A:1017988514716 Document Type: Article

Times cited : (116)

References (23)

1
- 85151728371
- Residual algorithms: Reinforcement learning with function approximation
- (1995) Proceedings of the 12th International Conference on Machine Learning , pp. 30-37
- Baird, L.C.¹

2
- 0020970738
- Neuronlike elements that can solve difficult learning control problems
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.13 , pp. 835-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

3
- 0008851557
- Efficient occupancy grids for variable resolution map building
- (1998) Proceedings of the 6th International Symposium on Intelligent Robotic Systems , pp. 195-203
- Dedieu, E.¹ Millán, J.D.R.²

4
- 85135470835
- A growing neural gas network learns topologies
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 625-632
- Fritzke, B.¹

5
- 0003410791
- Berlin: Springer-Verlag
- (1997) Self-Organizing Maps (2nd Edn.)
- Kohonen, T.¹

6
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

7
- 0029752592
- Average reward reinforcement learning: Foundations, algorithms, and empirical results
- (1996) Machine Learning , vol.22 , pp. 159-195
- Mahadevan, S.¹

8
- 0032029373
- Learning reaching strategies through reinforcement for a sensor-based manipulator
- (1998) Neural Networks , vol.11 , pp. 359-376
- Martín, P.¹ Millán, J.D.R.²

9
- 0026882311
- Integration of representation into goal-driven behavior-based robots
- (1992) IEEE Transactions on Robotics and Automation , vol.8 , pp. 304-312
- Matarić, M.J.¹

10
- 0008852323
- A reinforcement connectionist learning approach to robot path finding
- Ph.D. Thesis, Software Dept., Universitat Politècnica de Catalunya, Barcelona, Spain
- (1992)
- Millán, J.D.R.¹

11
- 0030171602
- Rapid, safe, and incremental learning of navigation strategies
- (1996) IEEE Transactions on Systems, Man, and Cybernetics-Part B , vol.26 , pp. 408-420
- Millán, J.D.R.¹

12
- 84956708276
- Incremental acquisition of local networks for the control of autonomous robots
- (1997) Proceedings of the 7th International Conference on Artificial Neural Networks , pp. 739-744
- Millán, J.D.R.¹

13
- 0000714373
- A reinforcement connectionist approach to robot path finding in non-maze-like environments
- (1992) Machine Learning , vol.8 , pp. 363-395
- Millán, J.D.R.¹ Torras, C.²

14
- 0031231885
- Experiments with reinforcement learning in problems with continuous state and action spaces
- (1998) Adaptive Behavior , vol.6 , pp. 163-217
- Santamaría, J.C.¹ Sutton, R.S.² Ram, A.³

15
- 85152626183
- A reinforcement learning method for maximizing undiscounted rewards
- (1993) Proceedings of the 10th International Conference on Machine Learning , pp. 298-305
- Schwartz, A.¹

16
- 0029753630
- Reinforcement learning with replacing eligibility traces
- (1996) Machine Learning , vol.22 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

17
- 33847202724
- Learning to predict by the methods of temporal differences
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

18
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
- Sutton, R.S.¹

19
- 0004102479
- Cambridge, MA: MIT Press
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

20
- 0032050241
- Model-based average reward reinforcement learning
- (1998) Artificial Intelligence , vol.100 , pp. 177-224
- Tadepalli, P.¹ Ok, D.²

21
- 0029390263
- Reinforcement learning of multiple tasks using a hierarchical CMAC architecture
- (1995) Robotics and Autonomous Systems , vol.15 , pp. 247-274
- Tham, C.L.¹

22
- 0002210775
- The role of exploration in learning control
- In D. A. White & D. A. Sofge (Eds.); New York: Van Nostrand Reinhold
- (1992) Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches , pp. 527-559
- Thrun, S.B.¹

23
- 0004049895
- Learning with delayed rewards
- Ph.D. Thesis, Cambridge University, England, UK
- (1989)
- Watkins, C.J.C.H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.