SCOPUS 정보 검색 플랫폼

2010 IEEE International Conference on Automation, Quality and Testing, Robotics, AQTR 2010 - Proceedings

Volumn 1, Issue , 2010, Pages 44-49

Using prior knowledge to accelerate online least-squares policy iteration

(4) Buşoniu, Lucian a De Schutter, Bart a Babuška, Robert a Ernst, Damien b

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b UNIVERSITY OF LIÈGE (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL POLICY; EMPIRICAL EVALUATIONS; IN-CONTROL; LEAST SQUARE; MONOTONICITY; ONLINE LEARNING; OPTIMAL CONTROLS; POLICY ITERATION; PRIOR KNOWLEDGE; SYSTEM STATE;

ITERATIVE METHODS;

ROBOTICS;

EID: 77958522395 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/AQTR.2010.5520917 Document Type: Conference Paper

Times cited : (8)

References (12)

1
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

2
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

3
- 0003565783
- Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control, 3rd ed. Athena Scientific, 2007, vol. 2.
- (2007) Dynamic Programming and Optimal Control, 3rd Ed. , vol.2
- Bertsekas, D.P.¹

4
- 0036832950
- Technical update: Least-squares temporal difference learning
- J. Boyan, "Technical update: Least-squares temporal difference learning, " Machine Learning, vol. 49, pp. 233-246, 2002.
- (2002) Machine Learning , vol.49 , pp. 233-246
- Boyan, J.¹

5
- 0037288398
- Least-squares policy evaluation algorithms with linear function approximation
- A. Nedić and D. P. Bertsekas, "Least-squares policy evaluation algorithms with linear function approximation, " Discrete Event Dynamic Systems: Theory and Applications, vol. 13, no. 1-2, pp. 79-110, 2003.
- (2003) Discrete Event Dynamic Systems: Theory and Applications , vol.13 , Issue.1-2 , pp. 79-110
- Nedić, A.¹ Bertsekas, D.P.²

6
- 4644323293
- Least-squares policy iteration
- M. G. Lagoudakis and R. Parr, "Least-squares policy iteration, " Journal of Machine Learning Research, vol. 4, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

7
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning, " Journal ofMachine Learning Research, vol. 6, pp. 503-556, 2005.
- (2005) Journal OfMachine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

8
- 67949109470
- Convergence results for some temporal difference methods based on least squares
- H. Yu and D. P. Bertsekas, "Convergence results for some temporal difference methods based on least squares, " IEEE Transactions on Automatic Control, vol. 54, no. 7, pp. 1515-1531, 2009.
- (2009) IEEE Transactions on Automatic Control , vol.54 , Issue.7 , pp. 1515-1531
- Yu, H.¹ Bertsekas, D.P.²

9
- 77957782880
- Online least-squares policy iteration for reinforcement learning control
- Baltimore, US, 30 June - 2 July, accepted for publication
- L. Buşoniu, D. Ernst, B. De Schutter, and R. Babǔska, "Online least-squares policy iteration for reinforcement learning control, " in Proceedings 2010 American Control Conference (ACC-10), Baltimore, US, 30 June - 2 July 2010, accepted for publication.
- (2010) Proceedings 2010 American Control Conference (ACC-10)
- Buşoniu, L.¹ Ernst, D.² Schutter, B.D.³ Babǔska, R.⁴

10
- 33847202724
- Learning to predict by the method of temporal differences
- R. S. Sutton, "Learning to predict by the method of temporal differences, " Machine Learning, vol. 3, pp. 9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

11
- 84899834143
- Online exploration in least-squares policy iteration
- Budapest, Hungary, 10-15 May
- L. Li, M. L. Littman, and C. R. Mansley, "Online exploration in least-squares policy iteration, " in Proceedings 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS- 09), vol. 2, Budapest, Hungary, 10-15 May 2009, pp. 733-739.
- (2009) Proceedings 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS- 09) , vol.2 , pp. 733-739
- Li, L.¹ Littman, M.L.² Mansley, C.R.³

12
- 34548765672
- Kernelizing LSPE(λ?)
- Honolulu, US, 1-5 April
- T. Jung and D. Polani, "Kernelizing LSPE(λ?), " in Proceedings 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL-07), Honolulu, US, 1-5 April 2007, pp. 338-345.
- (2007) Proceedings 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL-07) , pp. 338-345
- Jung, T.¹ Polani, D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.