SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2007, Pages 1-8

Fitted Q iteration with CMACs

Author keywords

[No Author keywords available]

Indexed keywords

CONVERGENCE OF NUMERICAL METHODS; DATA ACQUISITION; ITERATIVE METHODS; LEARNING ALGORITHMS; STATE SPACE METHODS;

CMAC ARCHITECTURE; FUNCTION APPROXIMATORS; Q-LEARNING;

REINFORCEMENT LEARNING;

EID: 34548767315 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2007.368162 Document Type: Conference Paper

Times cited : (21)

References (15)

2
- 0003487482
- Belmont Massachusetts: Athena Scientific
- D. P. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming. Belmont Massachusetts: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.²

6
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

7
- 33646398129
- Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method
- M. Riedmiller, "Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method," in Proceedings of the Sixteenth European Conference on Machine Learning, Porto, Portugal, 2005, pp. 317-328.
- (2005) Proceedings of the Sixteenth European Conference on Machine Learning, Porto, Portugal , pp. 317-328
- Riedmiller, M.¹

9
- 0036832950
- Technical update: Least-squares temporal difference learning
- J. A Boyan, "Technical update: Least-squares temporal difference learning," Machine Learning, vol. 49, no. 2-3, pp. 233-246, 2002.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 233-246
- Boyan, J.A.¹

11
- 0029753630
- Reinforcement learning with replacing eligibility traces
- S. P. Singh and R. S. Sutton, "Reinforcement learning with replacing eligibility traces," Machine Learning, vol. 22, no. 1-3, pp. 123-158, 1996.
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

12
- 0038595393
- Carnegie Mellon University, Pittsburgh, PA 15213, Tech. Rep. CMU-CS95-103, January
- G. J. Gordon, "Stable function approximation in dynamic programming," Carnegie Mellon University, Pittsburgh, PA 15213, Tech. Rep. CMU-CS95-103, January 1995.
- (1995) Stable function approximation in dynamic programming
- Gordon, G.J.¹

13
- 0004098975
- New York: Wiley
- D. Luenberger, Optimization by Vector Space Methods. New York: Wiley, 1969.
- (1969) Optimization by Vector Space Methods
- Luenberger, D.¹

14
- 0344961876
- Reinforcement learning on explicitly specified time-scales
- R. Schoknecht and M. Riedmiller, "Reinforcement learning on explicitly specified time-scales," Neural Computing, vol. 12, no. 2, pp. 61-80, 2003.
- (2003) Neural Computing , vol.12 , Issue.2 , pp. 61-80
- Schoknecht, R.¹ Riedmiller, M.²

15
- 0004102479
- Cambridge, Massachusetts: The MIT Press
- R. Sutton and A. Barto, Reinforcement Learning, An Introduction. Cambridge, Massachusetts: The MIT Press, 1998.
- (1998) Reinforcement Learning, An Introduction
- Sutton, R.¹ Barto, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.