



Volume 35, Issue 11, 1999, Pages 1799-1808

Average cost temporal-difference learning

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; BOUNDARY CONDITIONS; COMPUTER SIMULATION; CONVERGENCE OF NUMERICAL METHODS; MARKOV PROCESSES; TABLE LOOKUP;

EID: 0033221519     PISSN: 00051098     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0005-1098(99)00099-0     Document Type: Article
Times cited: 144

References (11)
  • 4
    • 0000430514
    • The convergence of TD(λ) for general λ
    • Dayan, P. D. (1992). The convergence of TD(λ) for general λ. Machine Learning, 8, 341-362.
    • (1992) Machine Learning , vol.8 , pp. 341-362
    • Dayan, P.D.1
  • 7
    • 0029752592
    • Average reward reinforcement learning: Foundations, algorithms, and empirical results
    • Mahadevan, S. (1996). Average reward reinforcement learning: Foundations, algorithms, and empirical results. Machine Learning, 22, 1-38.
    • (1996) Machine Learning , vol.22 , pp. 1-38
    • Mahadevan, S.1
  • 8
    • 0032266562
    • Call admission control and routing in integrated service networks using reinforcement learning
    • Tampa, FL
    • Marbach, P., Mihatsch, O., & Tsitsiklis, J. N. (1998). Call admission control and routing in integrated service networks using reinforcement learning. In Proceedings of the 1998 IEEE CDC, Tampa, FL.
    • (1998) Proceedings of the 1998 IEEE CDC
    • Marbach, P.1    Mihatsch, O.2    Tsitsiklis, J.N.3
  • 10
    • 33847202724
    • Learning to predict by the method of temporal differences
    • Sutton, R. S. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3, 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 11
    • 0031143730
    • An Analysis of Temporal-Difference Learning with Function Approximation
    • Tsitsiklis, J. N., & Van Roy, B. (1997). An Analysis of Temporal-Difference Learning with Function Approximation. IEEE Transactions on Automatic Control, 42(5), 674-690.
    • (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.