SCOPUS 정보 검색 플랫폼

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)

Volumn 2837, Issue , 2003, Pages 96-107

Iteratively extending time horizon reinforcement learning

(3) Ernst, Damien a Geurts, Pierre a Wehenkel, Louis a

a UNIVERSITY OF LIÈGE (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; CLOSED LOOP CONTROL SYSTEMS; CONVERGENCE OF NUMERICAL METHODS; FUNCTIONS; ITERATIVE METHODS; OPTIMAL CONTROL SYSTEMS; PROBLEM SOLVING; RANDOM PROCESSES; STANDARDS; TIME DOMAIN ANALYSIS; VECTORS; APPROXIMATION ALGORITHMS; ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS; SUPERVISED LEARNING;

OPTIMAL CONTROL POLICY; REINFORCEMENT LEARNING; REWARD FUNCTIONS; TIME HORIZONS; CONTROL PROBLEMS; INFINITE TIME HORIZON; OPTIMAL CONTROLS; OPTIMALITY CRITERIA; REGRESSION TREES; STOCHASTIC APPROXIMATIONS; SUPERVISED LEARNING PROBLEMS;

LEARNING SYSTEMS; LEARNING ALGORITHMS;

EID: 9444250519 PISSN: 03029743 EISSN: None Source Type: Conference Proceeding
DOI: 10.1007/978-3-540-39857-8_11 Document Type: Conference Paper

Times cited : (19)

References (11)

1
- 0003565783
- Athena Scientific, Belmont, MA, 2nd edition
- D. Bertsekas. Dynamic Programming and Optimal Control, volume I. Athena Scientific, Belmont, MA, 2nd edition, 2000.
- (2000) Dynamic Programming and Optimal Control , vol.1
- Bertsekas, D.¹

2
- 0030211964
- Bagging predictors
- L. Breiman. Bagging predictors. Machine Learning, 24(2): 123-140, 1996.
- (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
- Breiman, L.¹

3
- 0035478854
- Random forests
- L. Breiman. Random forests. Machine Learning, 45:5-32, 2001.
- (2001) Machine Learning , vol.45 , pp. 5-32
- Breiman, L.¹

4
- 0003802343
- Wadsworth International (California)
- L. Breiman, J. Friedman, R. Olsen, and C. Stone. Classification and Regression Trees. Wadsworth International (California), 1984.
- (1984) Classification and Regression Trees
- Breiman, L.¹ Friedman, J.² Olsen, R.³ Stone, C.⁴

5
- 1442288723
- PhD thesis, University of Liège, Belgium, March
- D. Ernst. Near optimal closed-loop control. Application to electric power systems. PhD thesis, University of Liège, Belgium, March 2003.
- (2003) Near Optimal Closed-loop Control. Application to Electric Power Systems.
- Ernst, D.¹

6
- 2442476951
- PhD thesis, University of Liège, Belgium, May
- P. Geurts. Contributions to decision tree induction: bias/variance tradeoff and time series classification. PhD thesis, University of Liège, Belgium, May 2002.
- (2002) Contributions to Decision Tree Induction: Bias/variance Tradeoff and Time Series Classification
- Geurts, P.¹

7
- 9444276220
- Extremely randomized trees
- University of Liège
- P. Geurts. Extremely randomized trees. Technical report, University of Liège, 2003.
- (2003) Technical Report
- Geurts, P.¹

8
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- A. Moore and C. Atkeson. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time. Machine Learning, 13:103-130, 1993.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.²

9
- 8744262572
- Supervised learning combined with an actor-critic architecture
- University of Massachusetts, Department of Computer Science
- M. T. Rosenstein and A. G. Barto. Supervised learning combined with an actor-critic architecture. Technical report, University of Massachusetts, Department of Computer Science, 2002.
- (2002) Technical Report
- Rosenstein, M.T.¹ Barto, A.G.²

10
- 0001898381
- Practical reinforcement learning in continuous spaces
- W. Smart and L. Kaelbling. Practical Reinforcement Learning in Continuous Spaces. In Proceedings of the Sixteenth International Conference on Machine Learning, 2000.
- (2000) Proceedings of the Sixteenth International Conference on Machine Learning
- Smart, W.¹ Kaelbling, L.²

11
- 34249833101
- Q-learning
- C. Watkins and P. Dayan. Q-learning. Machine learning, 8:279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.