메뉴 건너뛰기




Volumn , Issue , 2008, Pages

Stable dual dynamic programming

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION ALGORITHMS; REINFORCEMENT LEARNING;

EID: 85161971158     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (10)
  • 5
    • 85151728371 scopus 로고
    • Residual algorithms: Reinforcement learning with function approximation
    • L. C. Baird. Residual algorithms: Reinforcement learning with function approximation. In International Conference on Machine Learning, pages 30-37, 1995.
    • (1995) International Conference on Machine Learning , pp. 30-37
    • Baird, L.C.1
  • 7
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • PII S0018928697034375
    • J. Tsitsiklis and B. Van Roy. An analysis of temporal-difference learning with function approximation. IEEE Trans. Automat. Control, 42(5):674-690, 1997. (Pubitemid 127760263)
    • (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 8
    • 0034342516 scopus 로고    scopus 로고
    • On the existence of fixed points for approximate value iteration and temporal-difference learning
    • D. de Farias and B. Van Roy. On the existence of fixed points for approximate value iteration and temporal-difference learning. J. Optimization Theory and Applic., 105(3):589-608, 2000.
    • (2000) J. Optimization Theory and Applic. , vol.105 , Issue.3 , pp. 589-608
    • De Farias, D.1    Van Roy, B.2
  • 9
    • 85153940465 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In NIPS 7, pages 369-376, 1995.
    • (1995) NIPS , vol.7 , pp. 369-376
    • Boyan, J.A.1    Moore, A.W.2
  • 10
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems, pages 1038-1044, 1996.
    • (1996) Advances in Neural Information Processing Systems , pp. 1038-1044
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.