SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 5863 LNCS, Issue PART 1, 2009, Pages 502-511

Tracking in reinforcement learning

(3) Geist, Matthieu a,b,c Pietquin, Olivier a Fricout, Gabriel b

a UMI Georgia Tech CNRS 2958 (France)

b ARCELORMITTAL (France)

c INRIA (France)

Author keywords

Kalman filtering; Reinforcement learning; Tracking; Value function approximation

Indexed keywords

ITERATIVE METHODS; SURFACE DISCHARGES;

CONVERGENCE ANALYSIS; EMPIRICAL INVESTIGATION; KALMAN-FILTERING; NON-STATIONARITIES; NON-STATIONARY ENVIRONMENT; OPTIMAL SOLUTIONS; TEMPORAL DIFFERENCES; VALUE FUNCTION APPROXIMATION;

REINFORCEMENT LEARNING;

EID: 76649127744 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-10677-4_57 Document Type: Conference Paper

Times cited : (16)

References (18)

1
- 0004102479
- MIT Press, Cambridge
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1996)
- (1996) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 34547974097
- Tracking Value Function Dynamics to Improve Reinforcement Learning with Piecewise Linear Function Approximation
- Phua, C.W., Fitch, R.: Tracking Value Function Dynamics to Improve Reinforcement Learning with Piecewise Linear Function Approximation. In: International Conference on Machine Learning, ICML 2007 (2007)
- (2007) International Conference on Machine Learning, ICML
- Phua, C.W.¹ Fitch, R.²

3
- 34547991608
- On the role of tracking in stationary environments
- Sutton, R.S., Koop, A., Silver, D.: On the role of tracking in stationary environments. In: Proceedings of the 24th international conference on Machine learning, pp. 871-878 (2007)
- (2007) Proceedings of the 24th international conference on Machine learning , pp. 871-878
- Sutton, R.S.¹ Koop, A.² Silver, D.³

4
- 67650458797
- Kalman Temporal Differences: The deterministic case
- Nashville, TN, USA April
- Geist, M., Pietquin, O., Fricout, G.: Kalman Temporal Differences: the deterministic case. In: Proceedings of the IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA (April 2009)
- (2009) Proceedings of the IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL
- Geist, M.¹ Pietquin, O.² Fricout, G.³

5
- 85024429815
- Kalman, R.E.: A New Approach to Linear Filtering and Prediction Problems. Transactions of the ASME-Journal of Basic Engineering 82(Series D), 35-45 (1960)
- Kalman, R.E.: A New Approach to Linear Filtering and Prediction Problems. Transactions of the ASME-Journal of Basic Engineering 82(Series D), 35-45 (1960)

6
- 21244437999
- Unscented filtering and nonlinear estimation
- Julier, S.J., Uhlmann, J.K.: Unscented filtering and nonlinear estimation. Proceedings of the IEEE 92(3), 401-422 (2004)
- (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 401-422
- Julier, S.J.¹ Uhlmann, J.K.²

7
- 8344287766
- PhD thesis, Oregon Health & Science University, Portland, USA
- van der Merwe, R.: Sigma-Point Kalman Filters for Probabilistic Inference in Dynamic State-Space Models. PhD thesis, Oregon Health & Science University, Portland, USA (2004)
- (2004) Sigma-Point Kalman Filters for Probabilistic Inference in Dynamic State-Space Models
- van der Merwe, R.¹

8
- 0001771345
- Linear Least-Squares Algorithms for Temporal Difference Learning
- Bradtke, S.J., Barto, A.G.: Linear Least-Squares Algorithms for Temporal Difference Learning. Machine Learning 22(1-3), 33-57 (1996)
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 33-57
- Bradtke, S.J.¹ Barto, A.G.²

9
- 85151728371
- Residual Algorithms: Reinforcement Learning with Function Approximation
- Baird, L.C.: Residual Algorithms: Reinforcement Learning with Function Approximation. In: Proceedings of the International Conference on Machine Learning, pp. 30-37 (1995)
- (1995) Proceedings of the International Conference on Machine Learning , pp. 30-37
- Baird, L.C.¹

10
- 40849145988
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Antos, A., Szepesvári, C., Munos, R.: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path. Machine Learning 71(1), 89-129 (2008)
- (2008) Machine Learning , vol.71 , Issue.1 , pp. 89-129
- Antos, A.¹ Szepesvári, C.² Munos, R.³

11
- 76649113839
- Kakade, S.: A natural policy gradient. In: Advances in Neural Information Processing Systems 14 (NIPS 2001), Vancouver, British Columbia, Canada, pp. 1531-1538 (2001)
- Kakade, S.: A natural policy gradient. In: Advances in Neural Information Processing Systems 14 (NIPS 2001), Vancouver, British Columbia, Canada, pp. 1531-1538 (2001)

12
- 33646413135
- Peters, J., Vijayakumar, S., Schaal, S.: Natural actor-critic. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds.) ECML 2005. LNCS (LNAI), 3720, pp. 280-291. Springer, Heidelberg (2005)
- Peters, J., Vijayakumar, S., Schaal, S.: Natural actor-critic. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L. (eds.) ECML 2005. LNCS (LNAI), vol. 3720, pp. 280-291. Springer, Heidelberg (2005)

13
- 0036832950
- Technical Update: Least-Squares Temporal Difference Learning
- Boyan, J.A.: Technical Update: Least-Squares Temporal Difference Learning. Machine Learning 49(2-3), 233-246 (1999)
- (1999) Machine Learning , vol.49 , Issue.2-3 , pp. 233-246
- Boyan, J.A.¹

14
- 4644323293
- Least-Squares Policy Iteration
- Lagoudakis, M.G., Parr, R.: Least-Squares Policy Iteration. Journal of Machine Learning Research 4, 1107-1149 (2003)
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

15
- 20544433674
- Consistent Normalized Least Mean Square Filtering with Noisy Data Matrix
- Jo, S., Kim, S.W.: Consistent Normalized Least Mean Square Filtering with Noisy Data Matrix. IEEE Transactions on Signal Processing 53(6), 2112-2123 (2005)
- (2005) IEEE Transactions on Signal Processing , vol.53 , Issue.6 , pp. 2112-2123
- Jo, S.¹ Kim, S.W.²

16
- 31844451013
- Reinforcement Learning with Gaussian Processes
- Engel, Y., Mannor, S., Meir, R.: Reinforcement Learning with Gaussian Processes. In: Proceedings of Internation Conference on Machine Learning, ICML 2005 (2005)
- (2005) Proceedings of Internation Conference on Machine Learning, ICML
- Engel, Y.¹ Mannor, S.² Meir, R.³

17
- 85162049326
- Incremental Natural Actor-Critic Algorithms
- Vancouver
- Bhatnagar, S., Sutton, R.S., Ghavamzadeh, M., Lee, M.: Incremental Natural Actor-Critic Algorithms. In: Advances in Neural Information Processing Systems, Vancouver, vol. 21 (2008)
- (2008) In: Advances in Neural Information Processing Systems , vol.21
- Bhatnagar, S.¹ Sutton, R.S.² Ghavamzadeh, M.³ Lee, M.⁴

18
- 58449117448
- Geist, M., Pietquin, O., Fricout, G.: Bayesian Reward Filtering. In: Girgin, S., Loth, M., Munos, R., Preux, P., Ryabko, D. (eds.) EWRL 2008. LNCS (LNAI), 5323, pp. 96-109. Springer, Heidelberg (2008)
- Geist, M., Pietquin, O., Fricout, G.: Bayesian Reward Filtering. In: Girgin, S., Loth, M., Munos, R., Preux, P., Ryabko, D. (eds.) EWRL 2008. LNCS (LNAI), vol. 5323, pp. 96-109. Springer, Heidelberg (2008)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.