SCOPUS Information Search Platform

Proceedings - 7th International Conference on Machine Learning and Applications, ICMLA 2008

2008, Pages 75-82

Basis function construction in reinforcement learning using cascade-correlation learning architecture

Girgin, Sertan (a); Preux, Philippe (b)

(a) INRIA (France)

(b) UNIV LILLE (France)

Author keywords

[No Author keywords available]

Indexed keywords

BASIS FUNCTIONS; BENCH-MARK PROBLEMS; INPUT DATUM; LEARNING ARCHITECTURES; LEAST SQUARES POLICY ITERATIONS; NEW APPROACHES; PROBLEM COMPLEXITY; VALUE FUNCTIONS;

APPROXIMATION ALGORITHMS; BENCHMARKING; CURVE FITTING; LEAST SQUARES APPROXIMATIONS; REINFORCEMENT; REINFORCEMENT LEARNING; ROBOT LEARNING;

EDUCATION;

EID: 60649120535
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ICMLA.2008.24
Document Type: Conference Paper

Times cited: 4

References (15)

1
- 10944228202
- R. Coulom. Reinforcement Learning Using Neural Networks with Applications to Motor Control. PhD thesis, Institut National Polytechnique de Grenoble, 2002.

2
- 0000155950
- S. E. Fahlman and C. Lebiere. The cascade-correlation learning architecture. In D. S. Touretzky, editor, Advances in NIPS, volume 2, pages 524-532. Morgan Kaufmann, 1990.

3
- 84868894519
- S. Girgin and P. Preux. Basis expansion in natural actor critic methods. In European Workshop on Reinforcement Learning, June 2008.

4
- 47249145227
- S. Girgin and P. Preux. Feature discovery in reinforcement learning using genetic programming. In Proc. of Euro-GP, pages 218-229. Springer-Verlag, Mar. 2008.

5
- 34547971381
- J. Johns and S. Mahadevan. Constructing basis functions from directed graphs for value function approximation. In ICML, pages 385-392, NY, USA, 2007. ACM.

6
- 33749263205
- P. W. Keller, S. Mannor, and D. Precup. Automatic basis function construction for approximate dynamic programming and reinforcement learning. In ICML, pages 449-456, NY, USA, 2006. ACM.

7
- 4644323293
- M. G. Lagoudakis and R. Parr. Least-squares policy iteration. J. of Machine Learning Research, 4:1107-1149, 2003.

8
- 34548803187
- M. Loth, M. Davy, and P. Preux. Sparse temporal difference learning using LASSO. In Proc. of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, Apr. 2007.

9
- 35748957806
- S. Mahadevan and M. Maggioni. Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes. J. of Machine Learning Research, 8:2169-2231, 2007.

10
- 17444414191
- I. Menache, S. Mannor, and N. Shimkin. Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134:215-238, 2005.

11
- 34547982545
- R. Parr, C. Painter-Wakefield, L. Li, and M. Littman. Analyzing feature generation for value-function approximation. In ICML, pages 737-744, NY, USA, 2007. ACM.

12
- 0003998452
- M. Puterman. Markov Decision Processes - Discrete Stochastic Dynamic Programming. Probability and mathematical statistics. Wiley, 1994.

13
- 84943274699
- M. Riedmiller and H. Braun. A direct adaptive method for faster backpropagation learning: The RPROP algorithm. Vol. 1, pages 586-591, 1993.

14
- 1942516829
- F. Rivest and D. Precup. Combining TD-learning with cascade-correlation networks. In T. Fawcett and N. Mishra, editors, ICML, pages 632-639. AAAI Press, 2003.

15
- 0004102479
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998. A Bradford Book.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.