SCOPUS 정보 검색 플랫폼

Proceedings of the 25th International Conference on Machine Learning

Volumn , Issue , 2008, Pages 1208-1215

Preconditioned temporal difference learning

(2) Yao, Hengshuai a Liu, Zhi Qiaug a

a CITY UNIVERSITY OF HONG KONG (Hong Kong)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL COMPLEXITY; ITERATIVE METHODS; MACHINE LEARNING; STOCHASTIC MODELS; STOCHASTIC SYSTEMS; EDUCATION; LEARNING SYSTEMS; ROBOT LEARNING;

LEAST SQUARE; LEAST-SQUARES TEMPORAL DIFFERENCES; POLICY EVALUATION; PRECONDITIONING TECHNIQUES; RELATED ALGORITHMS; STEP SIZE; TEMPORAL DIFFERENCE LEARNING;

LEARNING ALGORITHMS;

LEAST-SQUARES; MODEL EQUATIONS; POLICY EVALUATIONS; PRECONDITIONING TECHNIQUES; TEMPORAL DIFFERENCE LEARNING; TEMPORAL DIFFERENCES;

EID: 56449123618 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1390156.1390308 Document Type: Conference Paper

Times cited : (20)

References (13)

1
- 0038595396
- Least-squares temporal difference learning
- Morgan Kaufmann
- Boyan, J. A. (1999). Least-squares temporal difference learning. Proceedings of the Sixteenth International Conference on Machine Learning (pp. 49 56). Morgan Kaufmann.
- (1999) Proceedings of the Sixteenth International Conference on Machine Learning , pp. 49-56
- Boyan, J.A.¹

2
- 0001771345
- Linear least-squares algorithms for temporal difference learning
- Bradtke, S., & Barto, A. G. (1996). Linear least-squares algorithms for temporal difference learning. Machine Learning, 22, 33--57.
- (1996) Machine Learning , vol.22 , pp. 33-57
- Bradtke, S.¹ Barto, A.G.²

3
- 33750737011
- Incremental least-squares temporal difference learning
- AAAI Press
- Geramifard, A., Bowling, M., & Sutton, R. S. (2006a). Incremental least-squares temporal difference learning. Twenty-First National Conference on Artificial Intelligence (AAAI-06) (pp. 356 361). AAAI Press.
- (2006) Twenty-First National Conference on Artificial Intelligence (AAAI-06) , pp. 356-361
- Geramifard, A.¹ Bowling, M.² Sutton, R.S.³

4
- 56449115872
- iLSTD: Eligibility traces and convergence analysis
- Geramifard, A., Bowling, M., Zinkevich, M., & Sutton, R. S. (2006b). iLSTD: Eligibility traces and convergence analysis. Advances in Neural Information Processing Systems 19 (pp. 441-448).
- (2006) Advances in Neural Information Processing Systems , vol.19 , pp. 441-448
- Geramifard, A.¹ Bowling, M.² Zinkevich, M.³ Sutton, R.S.⁴

5
- 0012331016
- Memory approaches to reinforcement learning in non-markovian domains
- CMU-CS-92-138, Carnegie Mellon University, Pittsburgh, PA 15213
- Lin, L.-J., & Mitchell, T. M. (1992). Memory approaches to reinforcement learning in non-markovian domains (Technical Report CMU-CS-92-138). Carnegie Mellon University, Pittsburgh, PA 15213.
- (1992) Technical Report
- Lin, L.-J.¹ Mitchell, T.M.²

6
- 0037288398
- Least-squares policy evaluation algorithms with linear function approximation
- Nedić, A., & Bertsekas, D. P. (2003). Least-squares policy evaluation algorithms with linear function approximation. Journal of Discrete Event Systems, 13, 79-110.
- (2003) Journal of Discrete Event Systems , vol.13 , pp. 79-110
- Nedić, A.¹ Bertsekas, D.P.²

7
- 1842829625
- SIAM
- Saad, Y. (2003). Iterative methods for sparse linear systems. SIAM.
- (2003) Iterative methods for sparse linear systems
- Saad, Y.¹

8
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

9
- 0004102479
- MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

10
- 0035283402
- On the convergence of temporal-difference learning with linear function approximation
- Tadić, V. (2001). On the convergence of temporal-difference learning with linear function approximation. Machine Learning, 42, 241-267.
- (2001) Machine Learning , vol.42 , pp. 241-267
- Tadić, V.¹

11
- 0031143730
- An analysis of temporal-difference learning with function approximation
- Tsitsiklis, J. N., & Van Roy, B. (1997). An analysis of temporal-difference learning with function approximation. IEEE Transactions on Automatic Control, 42, 674-690.
- (1997) IEEE Transactions on Automatic Control , vol.42 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

12
- 0041345290
- Efficient reinforcement learning using recursive least-squares methods
- Xu, X., He, H., & Hu, D. (2002). Efficient reinforcement learning using recursive least-squares methods. Journal of Artificial Intelligence Research, 16, 259-292.
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 259-292
- Xu, X.¹ He, H.² Hu, D.³

13
- 56449128935
- Preconditioned temporal difference learning
- CityU-SCM-MCG-0408, City University of Hong Kong
- Yao, H., & Liu, Z. (2008). Preconditioned temporal difference learning (Technical Report CityU-SCM-MCG-0408). City University of Hong Kong.
- (2008) Technical Report
- Yao, H.¹ Liu, Z.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.