메뉴 건너뛰기




Volumn , Issue , 2001, Pages

Kernel-based reinforcement learning in average-cost problems: An application to optimal portfolio choice

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MARKOV PROCESSES; OPTIMIZATION; REINFORCEMENT LEARNING;

EID: 0003327481     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (10)
  • 2
    • 0001133021 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In NIPS 7, 1995.
    • (1995) NIPS , vol.7
    • Boyan, J.A.1    Moore, A.W.2
  • 4
    • 84898944389 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning in average-cost problems
    • In preparation
    • D. Ormoneit and P. Glynn. Kernel-based reinforcement learning in average-cost problems. Working paper, Stanford University. In preparation.
    • Working Paper, Stanford University
    • Ormoneit, D.1    Glynn, P.2
  • 5
    • 84898988649 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning
    • To appear
    • D. Ormoneit and S. Sen. Kernel-based reinforcement learning. Machine Learning, 2001. To appear.
    • (2001) Machine Learning
    • Ormoneit, D.1    Sen, S.2
  • 6
    • 0001509947 scopus 로고    scopus 로고
    • Using randomization to break the curse of dimensionality
    • J. Rust. Using randomization to break the curse of dimensionality. Econometrica, 65(3):487-516, 1997.
    • (1997) Econometrica , vol.65 , Issue.3 , pp. 487-516
    • Rust, J.1
  • 7
    • 84898939480 scopus 로고    scopus 로고
    • Policy gradient methods for reinforcement learning with function approximation
    • R. S. Sutton, D. Mc Allester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In NIPS 12, 2000.
    • (2000) NIPS , vol.12
    • Sutton, R.S.1    McAllester, D.2    Singh, S.3    Mansour, Y.4
  • 8
    • 0029752470 scopus 로고    scopus 로고
    • Feature-based methods for large-scale dynamic programming
    • J.N. Tsitsiklis and B. Van Roy. Feature-based methods for large-scale dynamic programming. Machine Learning, 22:59-94, 1996.
    • (1996) Machine Learning , vol.22 , pp. 59-94
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 9
    • 0033221519 scopus 로고    scopus 로고
    • Average cost temporal-difference learning
    • J. N. Tsitsiklis and B. Van Roy. Average cost temporal-difference learning. Automatica, 35(11):1799-1808, 1999.
    • (1999) Automatica , vol.35 , Issue.11 , pp. 1799-1808
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 10


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.