SCOPUS Information Search Platform

2010 International Congress on Ultra Modern Telecommunications and Control Systems and Workshops, ICUMT 2010

Volume , Issue , 2010, Pages 458-465

Eligibility traces through colored noises

(2) Geist, Matthieu (a); Pietquin, Olivier (a)

(a) UMI Georgia Tech CNRS 2958 (France)

Author keywords

Colored noise; Neural networks; Reinforcement learning; Statistical modeling; Value function approximation

Indexed keywords

NEURAL NETWORKS; REINFORCEMENT LEARNING; STATISTICAL METHODS; STOCHASTIC SYSTEMS;

COLORED NOISE; ELIGIBILITY TRACES; GAUSSIAN PROCESSES; MONTE CARLO ESTIMATES; NONLINEAR PARAMETERIZATIONS; STATISTICAL MODELING; TEMPORAL DIFFERENCES; VALUE FUNCTION APPROXIMATION;

WHITE NOISE;

EID: 79951485912 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICUMT.2010.5676597 Document Type: Conference Paper

Times cited : (6)

References (17)

1
- 0004102479
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning), 3rd ed. The MIT Press, March 1998.
- (1998) Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning)
- Sutton, R.S.¹ Barto, A.G.²

2
- 1942421151
- Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
- Y. Engel, S. Mannor, and R. Meir, "Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning," in Proceedings of the International Conference on Machine Learning (ICML 03), 2003, pp. 154-161.
- Proceedings of the International Conference on Machine Learning (ICML 03), 2003 , pp. 154-161
- Engel, Y.¹ Mannor, S.² Meir, R.³

3
- 67650458797
- Kalman Temporal Differences: The deterministic case
- M. Geist, O. Pietquin, and G. Fricout, "Kalman Temporal Differences: the deterministic case," in IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA, April 2009.
- IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA, April 2009
- Geist, M.¹ Pietquin, O.² Fricout, G.³

4
- 85151728371
- Residual Algorithms: Reinforcement Learning with Function Approximation
- L. C. Baird, "Residual Algorithms: Reinforcement Learning with Function Approximation," in Proceedings of the International Conference on Machine Learning (ICML 95), 1995, pp. 30-37.
- Proceedings of the International Conference on Machine Learning (ICML 95), 1995 , pp. 30-37
- Baird, L.C.¹

5
- 31844451013
- Reinforcement Learning with Gaussian Processes
- Y. Engel, S. Mannor, and R. Meir, "Reinforcement Learning with Gaussian Processes," in Proceedings of International Conference on Machine Learning (ICML-05), 2005.
- Proceedings of International Conference on Machine Learning (ICML-05), 2005
- Engel, Y.¹ Mannor, S.² Meir, R.³

6
- 85024429815
- A new approach to linear filtering and prediction problems
- R. E. Kalman, "A new approach to linear filtering and prediction problems," Transactions of the ASME-Journal of Basic Engineering, vol. 82, no. Series D, pp. 35-45, 1960.
- (1960) Transactions of the ASME-Journal of Basic Engineering , vol.82 , Issue.SERIES D , pp. 35-45
- Kalman, R.E.¹

7
- 21244437999
- Unscented filtering and nonlinear estimation
- S. J. Julier and J. K. Uhlmann, "Unscented filtering and nonlinear estimation," Proceedings of the IEEE, vol. 92, no. 3, pp. 401-422, 2004.
- (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 401-422
- Julier, S.J.¹ Uhlmann, J.K.²

8
- 76649127744
- Tracking in Reinforcement Learning
- Springer
- M. Geist, O. Pietquin, and G. Fricout, "Tracking in Reinforcement Learning," in Proceedings of the 16th International Conference on Neural Information Processing (ICONIP 2009). Bangkok (Thailand): Springer, 2009.
- (2009) Proceedings of the 16th International Conference on Neural Information Processing (ICONIP 2009). Bangkok (Thailand)
- Geist, M.¹ Pietquin, O.² Fricout, G.³

9
- 40849145988
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- A. Antos, C. Szepesvári, and R. Munos, "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path," Machine Learning, vol. 71, no. 1, pp. 89-129, April 2008.
- (2008) Machine Learning , vol.71 , Issue.1 , pp. 89-129
- Antos, A.¹ Szepesvári, C.² Munos, R.³

10
- 31844456714
- Y. Engel, "Algorithms and Representations for Reinforcement Learning," Ph.D. dissertation, Hebrew University, April 2005.
- (2005) Algorithms and Representations for Reinforcement Learning
- Engel, Y.¹

11
- 0038595396
- Least-squares temporal difference learning
- Morgan Kaufmann, San Francisco, CA
- J. A. Boyan, "Least-squares temporal difference learning," in Proceedings of the 16th International Conference on Machine Learning (ICML 99). Morgan Kaufmann, San Francisco, CA, 1999, pp. 49-56.
- (1999) Proceedings of the 16th International Conference on Machine Learning (ICML 99) , pp. 49-56
- Boyan, J.A.¹

12
- 84889830739
- D. Simon, Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches, 1st ed. Wiley & Sons, August 2006.
- (2006) Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches
- Simon, D.¹

13
- 8344287766
- R. van der Merwe, "Sigma-point Kalman filters for probabilistic inference in dynamic state-space models," Ph.D. dissertation, OGI School of Science & Engineering, Oregon Health & Science University, Portland, OR, USA, April 2004.
- (2004) Sigma-point Kalman Filters for Probabilistic Inference in Dynamic State-space Models
- Van Der Merwe, R.¹

14
- 84966204836
- Methods for Modifying Matrix Factorization
- P. E. Gill, G. H. Golub, W. Murray, and M. A. Saunders, "Methods for Modifying Matrix Factorization," Mathematics of Computation, vol. 28, no. 126, pp. 505-535, April 1974.
- (1974) Mathematics of Computation , vol.28 , Issue.126 , pp. 505-535
- Gill, P.E.¹ Golub, G.H.² Murray, W.³ Saunders, M.A.⁴

15
- 4644323293
- Least-squares policy iteration
- M. G. Lagoudakis and R. Parr, "Least-squares policy iteration," Journal of Machine Learning Research, vol. 4, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

16
- 79951474303
- Managing Uncertainty within Value Function Approximation in Reinforcement Learning
- M. Geist and O. Pietquin, "Managing Uncertainty within Value Function Approximation in Reinforcement Learning," in Active Learning and Experimental Design workshop (collocated with AISTATS 2010), Sardinia, Italy, May 2010.
- Active Learning and Experimental Design Workshop (Collocated with AISTATS 2010), Sardinia, Italy, May 2010
- Geist, M.¹ Pietquin, O.²

17
- 0031143730
- An analysis of temporal-difference learning with function approximation
- J. N. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Transactions on Automatic Control, vol. 42, pp. 674-690, 1997.
- (1997) IEEE Transactions on Automatic Control , vol.42 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.