1997, Pages 1075-1081

Analysis of temporal-difference learning with function approximation

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION ERRORS; FINITE STATE; FUNCTION APPROXIMATION; FUNCTION APPROXIMATORS; LINEAR FUNCTIONS; PARAMETER VECTORS; PARAMETERIZED; TEMPORAL DIFFERENCE LEARNING;

EID: 84887003012     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (261)

References (10)
  • 1
    • 85151728371
    • Residual algorithms: Reinforcement learning with function approximation
    • Prieditis & Russell, eds., 9-12 July, Morgan Kaufmann Publishers, San Francisco, CA
    • Baird, L. C. (1995). "Residual Algorithms: Reinforcement Learning with Function Approximation," in Prieditis & Russell, eds. Machine Learning: Proceedings of the Twelfth International Conference, 9-12 July, Morgan Kaufmann Publishers, San Francisco, CA.
    • (1995) Machine Learning: Proceedings of the Twelfth International Conference
    • Baird, L.C.
  • 2
    • 0000268954
    • A counter-example to temporal-difference learning
    • Bertsekas, D. P. (1994) "A Counter-Example to Temporal-Difference Learning," Neural Computation, vol. 7, pp. 270-279.
    • (1994) Neural Computation , vol.7 , pp. 270-279
    • Bertsekas, D.P.
  • 7
    • 84899012767
    • personal communication
    • Gurvits, L. (1996) personal communication.
    • (1996)
    • Gurvits, L.
  • 8
    • 33847202724
    • Learning to predict by the method of temporal differences
    • Sutton, R. S., (1988) "Learning to Predict by the Method of Temporal Differences," Machine Learning, vol. 3, pp. 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.
  • 9
    • 33746944751
    • On the virtues of linear learning and trajectory distributions
    • Boyan, Moore, and Sutton, Eds., Technical Report CMU-CS-95-206, Carnegie Mellon University, Pittsburgh, PA 15213
    • Sutton, R.S. (1995) "On the Virtues of Linear Learning and Trajectory Distributions," Proceedings of the Workshop on Value Function Approximation, Machine Learning Conference 1995, Boyan, Moore, and Sutton, Eds., p. 85. Technical Report CMU-CS-95-206, Carnegie Mellon University, Pittsburgh, PA 15213.
    • (1995) Proceedings of the Workshop on Value Function Approximation, Machine Learning Conference 1995 , p. 85
    • Sutton, R.S.
  • 10
    • 0008813539
    • An analysis of temporal-difference learning with function approximation
    • to appear in the IEEE Transactions on Automatic Control
    • Tsitsiklis, J. N. & Van Roy, B. (1996) "An Analysis of Temporal-Difference Learning with Function Approximation," to appear in the IEEE Transactions on Automatic Control.
    • (1996) IEEE Transactions on Automatic Control
    • Tsitsiklis, J.N.; Van Roy, B.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.