메뉴 건너뛰기




Volumn , Issue , 2007, Pages 44-51

Dual representations for dynamic programming and reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ITERATIVE METHODS; LEARNING ALGORITHMS; PROBLEM SOLVING; REINFORCEMENT LEARNING;

EID: 34548784027     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2007.368168     Document Type: Conference Paper
Times cited : (44)

References (10)
  • 5
    • 0001158047 scopus 로고
    • Improving generalisation for temporal difference learning: The successor representation
    • P. Dayan, "Improving generalisation for temporal difference learning: The successor representation," Neural Computation, vol. 5, pp. 613-624, 1993.
    • (1993) Neural Computation , vol.5 , pp. 613-624
    • Dayan, P.1
  • 9
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • J. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Transactions on Automatic Control, vol. 42, no. 5, pp. 674-690, 1997.
    • (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.1    Van Roy, B.2
  • 10
    • 85151728371 scopus 로고
    • Residual algorithms: Reinforcement learning with function approximation
    • L. Baird, "Residual algorithms: Reinforcement learning with function approximation," in Proceedings ICML, 1995.
    • (1995) Proceedings ICML
    • Baird, L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.