메뉴 건너뛰기




Volumn , Issue , 2010, Pages 458-465

Eligibility traces through colored noises

Author keywords

Colored noise; Neural networks; Reinforcement learning; Statistical modeling; Value function approximation

Indexed keywords

NEURAL NETWORKS; REINFORCEMENT LEARNING; STATISTICAL METHODS; STOCHASTIC SYSTEMS;

EID: 79951485912     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICUMT.2010.5676597     Document Type: Conference Paper
Times cited : (6)

References (17)
  • 6
    • 85024429815 scopus 로고
    • A new approach to linear filtering and prediction problems
    • R. E. Kalman, "A new approach to linear filtering and prediction problems," Transactions of the ASME-Journal of Basic Engineering, vol. 82, no. Series D, pp. 35-45, 1960.
    • (1960) Transactions of the ASME-Journal of Basic Engineering , vol.82 , Issue.SERIES D , pp. 35-45
    • Kalman, R.E.1
  • 7
    • 21244437999 scopus 로고    scopus 로고
    • Unscented filtering and nonlinear estimation
    • S. J. Julier and J. K. Uhlmann, "Unscented filtering and nonlinear estimation," Proceedings of the IEEE, vol. 92, no. 3, pp. 401-422, 2004.
    • (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 401-422
    • Julier, S.J.1    Uhlmann, J.K.2
  • 9
    • 40849145988 scopus 로고    scopus 로고
    • Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
    • April
    • A. Antos, C. Szepesvári, and R. Munos, "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path," Machine Learning, vol. 71, no. 1, pp. 89-129, April 2008.
    • (2008) Machine Learning , vol.71 , Issue.1 , pp. 89-129
    • Antos, A.1    Szepesvári, C.2    Munos, R.3
  • 17
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • J. N. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Transactions on Automatic Control, vol. 42, pp. 674-690, 1997.
    • (1997) IEEE Transactions on Automatic Control , vol.42 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.