



Volume 2533, 2002, Pages 403-413

Feedforward neural networks in reinforcement learning applied to high-dimensional motor control

Author keywords

[No Author keywords available]

Indexed keywords

BACKPROPAGATION ALGORITHMS; GRADIENT METHODS; MACHINE LEARNING; REINFORCEMENT LEARNING;

EID: 84942750244     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-36169-3_32     Document Type: Conference Paper
Times cited: 11

References (22)
  • 1
    • Charles W. Anderson. Strategy learning with multilayer connectionist representations. In Proceedings of the Fourth International Workshop on Machine Learning, pages 103–114, Irvine, CA, 1987. Morgan Kaufmann.
  • 2
    • Andrew R. Barron. Universal approximation bounds for superpositions of a sigmoidal function. IEEE Transactions on Information Theory, 39(3):930–945, May 1993.
  • 3
    • Richard Bellman. Dynamic Programming. Princeton University Press, Princeton, New Jersey, 1957.
  • 6
    • Kenji Doya. Reinforcement learning in continuous time and space. Neural Computation, 12:243–269, 2000.
  • 10
    • Martin F. Møller. A scaled conjugate gradient algorithm for fast supervised learning. Neural Networks, 6:525–533, 1993.
  • 12
    • Ralph Neuneier and Hans-Georg Zimmermann. How to train neural networks. In Genevieve B. Orr and Klaus-Robert Müller, editors, Neural Networks: Tricks of the Trade. Springer, 1998.
  • 15
    • Stefan Schaal and Christopher G. Atkeson. Robot juggling: An implementation of memory-based learning. Control Systems Magazine, 14:57–71, 1994.
  • 18
    • Richard S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9–44, 1988.
  • 19
    • Richard S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, pages 1038–1044. MIT Press, 1996.
  • 21
    • Gerald Tesauro. Temporal difference learning and TD-Gammon. Communications of the ACM, 38(3):58–68, March 1995.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.