SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems

Volumn , Issue , 2001, Pages

Kernel-based reinforcement learning in average-cost problems: An application to optimal portfolio choice

(2) Ormoneit, Dirk a Glynn, Peter b

a Stanford University (United States)

b STANFORD UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MARKOV PROCESSES; OPTIMIZATION; REINFORCEMENT LEARNING;

AVERAGE COST; KERNEL BASED APPROACH; MARKOV DECISION PROCESSES; OPTIMAL COSTS; OPTIMAL PORTFOLIOS; PARAMETRIC FUNCTIONS; TEMPORAL DIFFERENCE LEARNING; VALUE FUNCTIONS;

COSTS;

EID: 0003327481 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (5)

References (10)

1
- 0003565783
- Athena Scientific
- D. P. Bertsekas. Dynamic Programming and Optimal Control, Volume 1 and 2. Athena Scientific, 1995.
- (1995) Dynamic Programming and Optimal Control , vol.1-2
- Bertsekas, D.P.¹

2
- 0001133021
- Generalization in reinforcement learning: Safely approximating the value function
- J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In NIPS 7, 1995.
- (1995) NIPS , vol.7
- Boyan, J.A.¹ Moore, A.W.²

3
- 0003989207
- PhD thesis, Computer Science Department, Carnegie Mellon University
- G. Gordon. Approximate Solutions to Markov Decision Processes. PhD thesis, Computer Science Department, Carnegie Mellon University, 1999.
- (1999) Approximate Solutions to Markov Decision Processes
- Gordon, G.¹

4
- 84898944389
- Kernel-based reinforcement learning in average-cost problems
- In preparation
- D. Ormoneit and P. Glynn. Kernel-based reinforcement learning in average-cost problems. Working paper, Stanford University. In preparation.
- Working Paper, Stanford University
- Ormoneit, D.¹ Glynn, P.²

5
- 84898988649
- Kernel-based reinforcement learning
- To appear
- D. Ormoneit and S. Sen. Kernel-based reinforcement learning. Machine Learning, 2001. To appear.
- (2001) Machine Learning
- Ormoneit, D.¹ Sen, S.²

6
- 0001509947
- Using randomization to break the curse of dimensionality
- J. Rust. Using randomization to break the curse of dimensionality. Econometrica, 65(3):487-516, 1997.
- (1997) Econometrica , vol.65 , Issue.3 , pp. 487-516
- Rust, J.¹

7
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- R. S. Sutton, D. Mc Allester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In NIPS 12, 2000.
- (2000) NIPS , vol.12
- Sutton, R.S.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

8
- 0029752470
- Feature-based methods for large-scale dynamic programming
- J.N. Tsitsiklis and B. Van Roy. Feature-based methods for large-scale dynamic programming. Machine Learning, 22:59-94, 1996.
- (1996) Machine Learning , vol.22 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

9
- 0033221519
- Average cost temporal-difference learning
- J. N. Tsitsiklis and B. Van Roy. Average cost temporal-difference learning. Automatica, 35(11):1799-1808, 1999.
- (1999) Automatica , vol.35 , Issue.11 , pp. 1799-1808
- Tsitsiklis, J.N.¹ Van Roy, B.²

10
- 84898938510
- Actor-critic algorithms
- J. N. Tsitsiklis V. R. Konda. Actor-critic algorithms. In NIPS 12, 2000.
- (2000) NIPS , vol.12
- Tsitsiklis, J.N.¹ Konda, V.R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.