Volume , Issue , 2006, Pages 41-46

Dynamic exploration in Q(λ)-learning

Author keywords

[No Author keywords available]

Indexed keywords

DYNAMIC MODELS; FUNCTION EVALUATION; OPTIMIZATION; PROBLEM SOLVING; STATE SPACE METHODS;

EID: 40649111409     PISSN: 10987576     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 3

References (12)
  • 2
    • 10944228202
    • Reinforcement learning using neural networks, with applications to motor control,
    • Ph.D. dissertation, Institut National Polytechnique de Grenoble, June
    • R. Coulom, "Reinforcement learning using neural networks, with applications to motor control," Ph.D. dissertation, Institut National Polytechnique de Grenoble, June 2002.
    • (2002)
    • Coulom, R.1
  • 5
    • 19744362457
    • Combining reinforcement learning with a local control algorithm
    • Morgan Kaufmann, San Francisco
    • J. Randlov, A. Barto, and M. Rosenstein, "Combining reinforcement learning with a local control algorithm," Proceedings of the Seventeenth International Conference on Machine Learning, pages 775-782, Morgan Kaufmann, San Francisco, 2000. [Online]. Available: citeseer.ist.psu.edu/randlov00combining.html
    • (2000) Proceedings of the Seventeenth International Conference on Machine Learning , pp. 775-782
    • Randlov, J.1    Barto, A.2    Rosenstein, M.3
  • 6
    • 40649092084
    • S. Thrun, "The role of exploration in learning control," in Handbook for Intelligent Control: Neural, Fuzzy and Adaptive Approaches. D. White and D. Sofge, Eds. Florence, Kentucky 41022: Van Nostrand Reinhold, 1992.
  • 8
    • 0345161977
    • Explorations in efficient reinforcement learning,
    • Ph.D. dissertation, University of Amsterdam, IDSIA. February
    • M. A. Wiering, "Explorations in efficient reinforcement learning," Ph.D. dissertation, University of Amsterdam / IDSIA. February 1999.
    • (1999)
    • Wiering, M.A.1
  • 9
    • 84899015857
    • Reinforcement learning with Long Short-Term Memory
    • T. G. Dietterich, S. Becker, and Z. Ghahramani, Eds. Cambridge, MA: MIT Press
    • B. Bakker, "Reinforcement learning with Long Short-Term Memory," in Advances in Neural Information Processing Systems 14, T. G. Dietterich, S. Becker, and Z. Ghahramani, Eds. Cambridge, MA: MIT Press, 2002.
    • (2002) Advances in Neural Information Processing Systems 14
    • Bakker, B.1
  • 11
    • 34249833101
    • Q-learning
    • C. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3/4, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.1    Dayan, P.2
  • 12
    • 40649118042
    • S. Thrun and K. Möller, "Active exploration in dynamic environments," in Advances in Neural Information Processing Systems 4, [NIPS Conference, Denver, Colorado, USA, December 2-5, 1991], J. E. Moody, S. J. Hanson, and R. Lippmann, Eds. Morgan Kaufmann, 1992, pp. 531-538.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.