SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 7188 LNAI, Issue , 2012, Pages 102-114

Regularized least squares temporal difference learning with nested ℓ 2 and ℓ 1 penalization

(4) Hoffman, Matthew W a Lazaric, Alessandro b Ghavamzadeh, Mohammad b Munos, Rémi b

a UNIVERSITY OF BRITISH COLUMBIA (Canada)

b INRIA (France)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE VALUE FUNCTION; APPROXIMATION SPACES; CENTRAL PROBLEMS; HIGH-DIMENSIONAL FEATURE SPACE; LEAST SQUARE; NUMBER OF SAMPLES; OVERFITTING; POLICY EVALUATION; PREDICTION PERFORMANCE; PROJECTION OPERATOR; REGULARIZED LEAST SQUARES; REGULARIZED METHOD; TEMPORAL DIFFERENCE LEARNING;

ARTIFICIAL INTELLIGENCE;

REINFORCEMENT LEARNING;

EID: 84861687861 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-29946-9_13 Document Type: Conference Paper

Times cited : (19)

References (15)

1
- 40849145988
- Learning near-optimal policies with Bellmanresidual minimization based fitted policy iteration and a single sample path
- Antos, A., Szepesvári, C., Munos, R.: Learning near-optimal policies with Bellmanresidual minimization based fitted policy iteration and a single sample path. Machine Learning 71(1) (2008)
- (2008) Machine Learning , vol.71 , Issue.1
- Antos, A.¹ Szepesvári, C.² Munos, R.³

2
- 0001771345
- Linear least-squares algorithms for temporal difference learning
- Bradtke, S., Barto, A.: Linear least-squares algorithms for temporal difference learning. Machine Learning 22, 33-57 (1996)
- (1996) Machine Learning , vol.22 , pp. 33-57
- Bradtke, S.¹ Barto, A.²

3
- 50849114939
- Sparsity oracle inequalities for the lasso
- Bunea, F., Tsybakov, A., Wegkamp, M.: Sparsity oracle inequalities for the lasso. Electronic Journal of Statistics 1, 169-194 (2007)
- (2007) Electronic Journal of Statistics , vol.1 , pp. 169-194
- Bunea, F.¹ Tsybakov, A.² Wegkamp, M.³

4
- 3242708140
- Least angle regression
- Efron, B., Hastie, T., Johnstone, I., Tibshirani, R.: Least angle regression. Annals of Statistics 32(2) (2004)
- (2004) Annals of Statistics , vol.32 , Issue.2
- Efron, B.¹ Hastie, T.² Johnstone, I.³ Tibshirani, R.⁴

5
- 70049096468
- Regularized policy iteration
- Farahmand, A., Ghavamzadeh, M., Szepesvari, C., Mannor, S.: Regularized policy iteration. In: Advances in Neural Information Processing Systems 21 (2009)
- (2009) Advances in Neural Information Processing Systems , vol.21
- Farahmand, A.¹ Ghavamzadeh, M.² Szepesvari, C.³ Mannor, S.⁴

6
- 45849107328
- Pathwise coordinate optimization
- Friedman, J., Hastie, T., Höfling, H., Tibshirani, R.: Pathwise coordinate optimization. The Annals of Applied Statistics 1(2), 302-332 (2007)
- (2007) The Annals of Applied Statistics , vol.1 , Issue.2 , pp. 302-332
- Friedman, J.¹ Hastie, T.² Höfling, H.³ Tibshirani, R.⁴

7
- 0003684449
- Springer, Heidelberg
- Friedman, J., Hastie, T., Tibshirani, R.: The elements of statistical learning. Springer, Heidelberg (2001)
- (2001) The Elements of Statistical Learning
- Friedman, J.¹ Hastie, T.² Tibshirani, R.³

8
- 84876124578
- 1-penalized projected bellman residual
- 1-penalized projected bellman residual. In: European Workshop on Reinforcement Learning (2011)
- European Workshop on Reinforcement Learning (2011)
- Geist, M.¹ Scherrer, B.²

9
- 80053440025
- Finite-sample analysis of Lasso-TD
- Ghavamzadeh, M., Lazaric, A., Munos, R., Hoffman, M.: Finite-sample analysis of Lasso-TD. In: Proceedings of the International Conference on Machine Learning (2011)
- Proceedings of the International Conference on Machine Learning (2011)
- Ghavamzadeh, M.¹ Lazaric, A.² Munos, R.³ Hoffman, M.⁴

10
- 85162069759
- Linear complementarity for regularized policy evaluation and improvement
- Johns, J., Painter-Wakefield, C., Parr, R.: Linear complementarity for regularized policy evaluation and improvement. In: Advances in Neural Information Processing Systems 23 (2010)
- (2010) Advances in Neural Information Processing Systems , vol.23
- Johns, J.¹ Painter-Wakefield, C.² Parr, R.³

11
- 71149121683
- Regularization and feature selection in least-squares temporal difference learning
- Kolter, J.Z., Ng, A.Y.: Regularization and feature selection in least-squares temporal difference learning. In: Proceedings of the International Conference on Machine Learning (2009)
- Proceedings of the International Conference on Machine Learning (2009)
- Kolter, J.Z.¹ Ng, A.Y.²

12
- 4644323293
- Least-squares policy iteration
- Lagoudakis, M.G., Parr, R.: Least-squares policy iteration. Journal of Machine Learning Research 4 (2003)
- (2003) Journal of Machine Learning Research , vol.4
- Lagoudakis, M.G.¹ Parr, R.²

13
- 80053133834
- Ph.D. thesis, University of British Columbia
- Schmidt, M.: Graphical Model Structure Learning with l1-Regularization. Ph.D. thesis, University of British Columbia (2010)
- (2010) Graphical Model Structure Learning with L1-Regularization
- Schmidt, M.¹

14
- 0004102479
- MIT Press
- Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

15
- 85194972808
- Regression shrinkage and selection via the lasso
- Tibshirani, R.: Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) 58(1), 267-288 (1996)
- (1996) Journal of the Royal Statistical Society. Series B (Methodological) , vol.58 , Issue.1 , pp. 267-288
- Tibshirani, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.