메뉴 건너뛰기




Volumn , Issue , 2009, Pages 1177-1184

Fitted Q-iteration by advantage weighted regression

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS;

EID: 70049104729     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (62)

References (14)
  • 2
    • 85153940465 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • MIT Press
    • J. A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems 7, pp. 369-376, MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 369-376
    • Boyan, J.A.1    Moore, A.W.2
  • 3
    • 0029248341 scopus 로고
    • Minimum-jerk, two-thirds power law, and isochrony: Converging approaches to movement planning
    • P. Viviani and T. Flash, "Minimum-jerk, two-thirds power law, and isochrony: Converging approaches to movement planning," Journal of Experimental Psychology: Human Perception and Performance, vol. 21, no. 1, pp. 32-53, 1995.
    • (1995) Journal of Experimental Psychology: Human Perception and Performance , vol.21 , Issue.1 , pp. 32-53
    • Viviani, P.1    Flash, T.2
  • 4
    • 0031064725 scopus 로고    scopus 로고
    • A minimum energy cost hypothesis for human arm trajectories
    • R. M. Alexander, "A minimum energy cost hypothesis for human arm trajectories," Biological Cybernetics, vol. 76, pp. 97-105, 1997. (Pubitemid 127665170)
    • (1997) Biological Cybernetics , vol.76 , Issue.2 , pp. 97-105
    • Alexander, R.McN.1
  • 5
    • 0032552114 scopus 로고    scopus 로고
    • Signal-dependent noise determines motor planning
    • DOI 10.1038/29528
    • C. M. Harris and D. M. Wolpert, "Signal-dependent noise determines motor planning.," Nature, vol. 394, pp. 780-784, August 1998. (Pubitemid 28391641)
    • (1998) Nature , vol.394 , Issue.6695 , pp. 780-784
    • Harris, C.M.1    Wolpert, D.M.2
  • 6
    • 33646687423 scopus 로고    scopus 로고
    • Neural fitted Q-iteration - First experiences with a data efficient neural reinforcement learning method
    • M. Riedmiller, "Neural fitted Q-iteration - first experiences with a data efficient neural reinforcement learning method," in Proceedings of the European Conference on Machine Learning (ECML), 2005.
    • (2005) Proceedings of the European Conference on Machine Learning (ECML)
    • Riedmiller, M.1
  • 7
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • MIT Press
    • R. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Advances in Neural Information Processing Systems 8, pp. 1038-1044, MIT Press, 1996.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.1
  • 8
    • 21844465127 scopus 로고    scopus 로고
    • Tree-based batch mode reinforcement learning
    • D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," J. Mach. Learn. Res., vol. 6, pp. 503-556, 2005.
    • (2005) J. Mach. Learn. Res. , vol.6 , pp. 503-556
    • Ernst, D.1    Geurts, P.2    Wehenkel, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.