메뉴 건너뛰기




Volumn 6408 LNAI, Issue , 2010, Pages 207-218

Revisiting natural actor-critics with value function approximation

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING; DYNAMIC PROGRAMMING;

EID: 79956274048     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-16292-3_21     Document Type: Conference Paper
Times cited : (10)

References (16)
  • 2
    • 0004049893 scopus 로고
    • PhD thesis, Cambridge University, Cambridge, England
    • Watkins, C.: Learning from Delayed Rewards. PhD thesis, Cambridge University, Cambridge, England (1989)
    • (1989) Learning from Delayed Rewards
    • Watkins, C.1
  • 8
    • 0000396062 scopus 로고    scopus 로고
    • Natural gradient works efficiently in learning
    • Amari, S.I.: Natural gradient works efficiently in learning. Neural Computation 10, 251-276 (1998)
    • (1998) Neural Computation , vol.10 , pp. 251-276
    • Amari, S.I.1
  • 13
    • 0001771345 scopus 로고    scopus 로고
    • Linear Least-Squares algorithms for temporal difference learning
    • Bradtke, S.J., Barto, A.G.: Linear Least-Squares algorithms for temporal difference learning. Machine Learning 22, 33-57 (1996)
    • (1996) Machine Learning , vol.22 , pp. 33-57
    • Bradtke, S.J.1    Barto, A.G.2
  • 14
    • 76649127744 scopus 로고    scopus 로고
    • Tracking in reinforcement learning
    • Leung, C.S., Lee, M., Chan, J.H. (eds.) ICONIP 2009. Springer, Heidelberg
    • Geist, M., Pietquin, O., Fricout, G.: Tracking in reinforcement learning. In: Leung, C.S., Lee, M., Chan, J.H. (eds.) ICONIP 2009. LNCS, vol. 5863, pp. 502-511. Springer, Heidelberg (2009)
    • (2009) LNCS , vol.5863 , pp. 502-511
    • Geist, M.1    Pietquin, O.2    Fricout, G.3
  • 15
    • 33646831159 scopus 로고    scopus 로고
    • An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm
    • Hao, Y., Liu, J., Wang, Y.-P., Cheung, Y.-m., Yin, H., Jiao, L., Ma, J., Jiao, Y.-C. (eds.) CIS 2005. Springer, Heidelberg
    • Park, J., Kim, J., Kang, D.: An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm. In: Hao, Y., Liu, J., Wang, Y.-P., Cheung, Y.-m., Yin, H., Jiao, L., Ma, J., Jiao, Y.-C. (eds.) CIS 2005. LNCS (LNAI), vol. 3801, pp. 65-72. Springer, Heidelberg (2005)
    • (2005) LNCS (LNAI) , vol.3801 , pp. 65-72
    • Park, J.1    Kim, J.2    Kang, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.