SCOPUS 정보 검색 플랫폼

Proceedings, Twenty-First International Conference on Machine Learning, ICML 2004

Volumn , Issue , 2004, Pages 791-798

Interpolation-based Q-learning

(2) Szepesvári, Csaba a Smart, William D b

a INSTITUTE OF EXPERIMENTAL MEDICINE (Hungary)

b WASHINGTON UNIVERSITY IN ST LOUIS (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; COMPUTER SIMULATION; CONVERGENCE OF NUMERICAL METHODS; DYNAMIC PROGRAMMING; FUNCTIONS; INTERPOLATION; MARKOV PROCESSES; PROBLEM SOLVING; Q FACTOR MEASUREMENT; TRIANGULATION;

BAYCENTRIC INTERPOLATORS; OPTIMAL VALUE FUNCTIONS; Q-LEARNING; REINFORCEMENT LEARNING (RL);

LEARNING SYSTEMS;

EID: 14344263882 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (55)

References (10)

1
- 14344266002
- Learning rates for Q-learning
- Even-Dar, E., & Mansour, Y. (2003). Learning rates for Q-learning. Journal of Machine Learning Research, 5, 1-25.
- (2003) Journal of Machine Learning Research , vol.5 , pp. 1-25
- Even-Dar, E.¹ Mansour, Y.²

2
- 84880694195
- Stable function approximation in dynamic programming
- Morgan Kaufmann
- Gordon, G. J. (1995). Stable function approximation in dynamic programming. Proc. of ICML 20 (pp. 261-268). Morgan Kaufmann.
- (1995) Proc. of ICML , vol.20 , pp. 261-268
- Gordon, G.J.¹

3
- 85014758967
- Estimation of the density and the regression function under mixing conditions
- Liebscher, E. (2001). Estimation of the density and the regression function under mixing conditions. Statistics & Decisions, 19, 9-26.
- (2001) Statistics & Decisions , vol.19 , pp. 9-26
- Liebscher, E.¹

4
- 0003637131
- Springer-Verlag
- Meyn, S., & Tweedie, R. (1996). Markov chains and stochastic stability. Springer-Verlag.
- (1996) Markov Chains and Stochastic Stability
- Meyn, S.¹ Tweedie, R.²

5
- 84880680664
- Variable resolution discretization for high-accuracy solutions of optimal control problems
- Munos, R., &: Moore, A. (1999). Variable resolution discretization for high-accuracy solutions of optimal control problems. Proc. of IJCAI (pp. 1348-1355).
- (1999) Proc. of IJCAI , pp. 1348-1355
- Munos, R.¹ Moore, A.²

6
- 0036832956
- Kernel-based reinforcement learning
- Ormoneit, D., & Sen, S. (2002). Kernel-based reinforcement learning. Machine Learning, 49, 161-178.
- (2002) Machine Learning , vol.49 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

7
- 85153965130
- Reinforcement learning with soft state aggregation
- MIT Press
- Singh, S., Jaakkola, T., & Jordan, M. (1995). Reinforcement learning with soft state aggregation. NIPS 7 (pp. 361-368). MIT Press.
- (1995) NIPS , vol.7 , pp. 361-368
- Singh, S.¹ Jaakkola, T.² Jordan, M.³

8
- 0029753630
- Reinforcement learning with replacing eligibility traces
- Singh, S., &: Sutton, R. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 32, 123-158.
- (1996) Machine Learning , vol.32 , pp. 123-158
- Singh, S.¹ Sutton, R.²

9
- 0033570798
- A unified analysis of value-function-based reinforcement-learning algorithms
- Szepesvári, C., & Littman, M. (1999). A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation, 11, 2017-2059.
- (1999) Neural Computation , vol.11 , pp. 2017-2059
- Szepesvári, C.¹ Littman, M.²

10
- 0029752470
- Feature-based methods for large scale dynamic programming
- Tsitsiklis, J. N., & Van Roy, B. (1996). Feature-based methods for large scale dynamic programming. Machine Learning, 22, 59-94.
- (1996) Machine Learning , vol.22 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.