메뉴 건너뛰기




Volumn , Issue , 2007, Pages 1-8

Fitted Q iteration with CMACs

Author keywords

[No Author keywords available]

Indexed keywords

CONVERGENCE OF NUMERICAL METHODS; DATA ACQUISITION; ITERATIVE METHODS; LEARNING ALGORITHMS; STATE SPACE METHODS;

EID: 34548767315     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2007.368162     Document Type: Conference Paper
Times cited : (21)

References (15)
  • 1
    • 0004049893 scopus 로고
    • Learning from delayed rewards,
    • Ph.D. dissertation, University of Cambridge, England
    • C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, University of Cambridge, England, 1989.
    • (1989)
    • Watkins, C.1
  • 4
    • 84880694195 scopus 로고
    • Stable function approximation in dynamic programming
    • A. Prieditis and S. Russell, Eds. San Francisco, CA: Morgan Kaufmann
    • G. J. Gordon, "Stable function approximation in dynamic programming," in Proceedings of the Twelfth International Conference on Machine Learning, A. Prieditis and S. Russell, Eds. San Francisco, CA: Morgan Kaufmann, 1995, pp. 261-268.
    • (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 261-268
    • Gordon, G.J.1
  • 6
    • 0036832956 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning
    • D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
    • Ormoneit, D.1    Sen, S.2
  • 9
    • 0036832950 scopus 로고    scopus 로고
    • Technical update: Least-squares temporal difference learning
    • J. A Boyan, "Technical update: Least-squares temporal difference learning," Machine Learning, vol. 49, no. 2-3, pp. 233-246, 2002.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 233-246
    • Boyan, J.A.1
  • 10
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds, The MIT Press
    • R. S. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Proceedings of the International Conference on Advances in Neural Information Processing Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds., vol. 8. The MIT Press, 1996, pp. 1038-1044.
    • (1996) Proceedings of the International Conference on Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 11
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • S. P. Singh and R. S. Sutton, "Reinforcement learning with replacing eligibility traces," Machine Learning, vol. 22, no. 1-3, pp. 123-158, 1996.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 12
  • 14
    • 0344961876 scopus 로고    scopus 로고
    • Reinforcement learning on explicitly specified time-scales
    • R. Schoknecht and M. Riedmiller, "Reinforcement learning on explicitly specified time-scales," Neural Computing, vol. 12, no. 2, pp. 61-80, 2003.
    • (2003) Neural Computing , vol.12 , Issue.2 , pp. 61-80
    • Schoknecht, R.1    Riedmiller, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.