



Volume 49, Issue 2-3, 2002, Pages 161-178

Kernel-based reinforcement learning

Author keywords

Kernel smoothing; Kernel based learning; Lazy learning; Local averaging; Markov decision process; Reinforcement learning

Indexed keywords

APPROXIMATION THEORY; ASYMPTOTIC STABILITY; CONVERGENCE OF NUMERICAL METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; NEURAL NETWORKS; OPTIMIZATION; PARAMETER ESTIMATION; REGRESSION ANALYSIS; STATE SPACE METHODS;

EID: 0036832956     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1017928328829     Document Type: Article
Times cited: 416

References (35)
  • 2
    • 0003477315
    • Reinforcement learning with high-dimensional, continuous actions
    • Technical Report WL-TR-93-1147, Wright Laboratory, Wright-Patterson Air Force Base, Ohio
    • (1993)
    • Baird, L.C.; Klopf, A.H.
  • 7
    • 0040348531
    • Estimating portfolio and consumption choice: A conditional Euler equations approach
    • (1999) Journal of Finance, vol. 54, issue 5, pp. 1609-1645
    • Brandt, M.W.
  • 11
    • 0003989207
    • Approximate solutions to Markov decision processes
    • Ph.D. Thesis, Computer Science Department, Carnegie Mellon University, Pittsburgh, PA
    • (1999)
    • Gordon, G.
  • 14
    • 0003754075
    • Reinforcement learning and distributed local model synthesis
    • Ph.D. Thesis, Linköping University
    • (1997)
    • Landelius, T.
  • 15
    • 0003485741
    • Valuing American options by simulations: A simple least-squares approach
    • Technical Report 25-98, Department of Finance, UCLA
    • (1998)
    • Longstaff, F.A.; Schwartz, E.S.
  • 23
    • 0001509947
    • Using randomization to break the curse of dimensionality
    • (1997) Econometrica, vol. 65, issue 3, pp. 487-516
    • Rust, J.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.