SCOPUS 정보 검색 플랫폼

Robotics and Autonomous Systems

Volumn 22, Issue 3-4, 1997, Pages 251-281

Neural reinforcement learning for behaviour synthesis

(1) Touzet, Claude F a,b

a AIX MARSEILLE UNIVERSITY (France)

b OAK RIDGE NATIONAL LABORATORY (United States)

Author keywords

Autonomous robotics; Neural Q learning; Obstacle avoidance behaviour; Reinforcement learning; Self organising map

Indexed keywords

COLLISION AVOIDANCE; LEARNING SYSTEMS; NEURAL NETWORKS; ROBOTICS;

AUTONOMOUS ROBOTICS; NEURAL Q LEARNING; REINFORCEMENT LEARNING;

MOBILE ROBOTS;

EID: 0031341345 PISSN: 09218890 EISSN: None Source Type: Journal
DOI: 10.1016/S0921-8890(97)00042-0 Document Type: Article

Times cited : (96)

References (27)

1
- 0000500817
- Interactions between learning and evolution
- C.G. Langton et al., eds., SFI Studies Sc. Complexity, Addison-Wesley, Reading, MA
- D. Ackley and M. Littman, Interactions between learning and evolution, in: C.G. Langton et al., eds., Artificial Life II, SFI Studies Sc. Complexity, Vol X (Addison-Wesley, Reading, MA, 1991) 487-509.
- (1991) Artificial Life II , vol.10 , pp. 487-509
- Ackley, D.¹ Littman, M.²

2
- 0016556021
- A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
- J.S. Albus, A new approach to manipulator control: The cerebellar model articulation controller (CMAC), Transactions of the ASME (1975).
- (1975) Transactions of the ASME
- Albus, J.S.¹

3
- 84927461265
- Pattern recognizing stochastic leaning automata
- A.G. Barto and P. Anandan, Pattern recognizing stochastic leaning automata, IEEE Transactions on Systems, Man and Cybernetics SMC-15 (1985) 360-375.
- (1985) IEEE Transactions on Systems, Man and Cybernetics SMC-15 , pp. 360-375
- Barto, A.G.¹ Anandan, P.²

4
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A.G. Barto, R.S. Sutton and C.W. Anderson, Neuronlike adaptive elements that can solve difficult learning control problems, IEEE Transactions on Systems, Man and Cybernetics SMC-13 (1983) 834-846.
- (1983) IEEE Transactions on Systems, Man and Cybernetics , vol.SMC-13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

5
- 0003727264
- MIT Press, Cambridge, MA
- V. Braitenberg, Vehicles: Experiments in Synthetic Psychology (MIT Press, Cambridge, MA, 1986).
- (1986) Vehicles: Experiments in Synthetic Psychology
- Braitenberg, V.¹

6
- 30244470118
- The ART of adaptive pattern recognition by a self-roganizing neural network
- G.A. Carpenter and S. Grossberg, The ART of adaptive pattern recognition by a self-roganizing neural network, Proceedings of the IEEE (1988).
- (1988) Proceedings of the IEEE
- Carpenter, G.A.¹ Grossberg, S.²

7
- 0030167564
- Behavior analysis and training - A methodology for behavior engineering
- M. Dorigo, ed.
- M. Colombetti, M. Dorigo and G. Borghi, Behavior analysis and training - A methodology for behavior engineering, in: M. Dorigo, ed., Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics SMC-part B 26 (3) (1996) 365-380.
- (1996) Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics SMC-part B , vol.26 , Issue.3 , pp. 365-380
- Colombetti, M.¹ Dorigo, M.² Borghi, G.³

8
- 0030171602
- Rapid, safe and incremental learning of navigation strategies
- M. Dorigo, ed.
- J. del R. Millàn, Rapid, safe and incremental learning of navigation strategies, in: M. Dorigo, ed., Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics, SMC-part B 26 (3) (1996) 408-420.
- (1996) Special Issue on Learning Autonomous Robots, IEEE Transactions on Systems, Man and Cybernetics, SMC-part B , vol.26 , Issue.3 , pp. 408-420
- Millàn, J.D.R.¹

9
- 0028739953
- Robot shaping: Developing autonomous agents through learning
- M. Dorigo and M. Colombetti, Robot shaping: Developing autonomous agents through learning, Artificial Intelligence 71 (2) (1994) 321-370.
- (1994) Artificial Intelligence , vol.71 , Issue.2 , pp. 321-370
- Dorigo, M.¹ Colombetti, M.²

10
- 30244434915
- Extending the adaptive heuristic critic and Q-learning: From facts to implications
- I. Alexander and J. Taylor, eds., Elsevier, Amsterdam
- O. Holland and M. Snaith, Extending the adaptive heuristic critic and Q-learning: From facts to implications, in: I. Alexander and J. Taylor, eds., Artificial Neural Networks, Vol. 2 (Elsevier, Amsterdam, 1992) 599-602.
- (1992) Artificial Neural Networks , vol.2 , pp. 599-602
- Holland, O.¹ Snaith, M.²

11
- 0004280606
- MIT Press, Cambridge, MA
- L. Kaelbling, Learning in Embedded Systems (MIT Press, Cambridge, MA, 1993).
- (1993) Learning in Embedded Systems
- Kaelbling, L.¹

12
- 0029679044
- Reinforcement learning: A survey
- L. Kaelbling, M. Littman and A. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research 4 (1996) 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.² Moore, A.³

13
- 0003527079
- Springer, Berlin
- T. Kohonen, Self-Organisation and Associative Memory (Springer, Berlin, 1984).
- (1984) Self-organisation and Associative Memory
- Kohonen, T.¹

14
- 0000123778
- Self-improving reative agents based on reinforcement learning, planning and teaching
- L.J. Lin, Self-improving reative agents based on reinforcement learning, planning and teaching, Machine Learning 8 (1992) 293-321.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.J.¹

15
- 0003673017
- Ph.D. Thesis, in: C.G. Langton et al., eds., Carnegie Mellon University, Pittsburgh, CMU-CS-93-103
- L.-J. Lin, Reinforcement learning for robots using neural networks, Ph.D. Thesis, in: C.G. Langton et al., eds., Carnegie Mellon University, Pittsburgh, CMU-CS-93-103, 1993.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.-J.¹

16
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- S. Mahadevan and J. Connell, Automatic programming of behavior-based robots using reinforcement learning, Artificial Intelligence 55 (2) (1991) 311-365.
- (1991) Artificial Intelligence , vol.55 , Issue.2 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

17
- 84957895797
- Reward function for accelerated learning
- W.W. Cohen and H. Hirsh, eds., Morgan Kaufman, LOS Altos, CA
- M. Mataric, Reward function for accelerated learning, in: W.W. Cohen and H. Hirsh, eds., Proc. of 11th Int. Conf. on Machine Learning (Morgan Kaufman, LOS Altos, CA, 1994).
- (1994) Proc. of 11th Int. Conf. on Machine Learning
- Mataric, M.¹

18
- 84957622922
- Using transitional proximity for faster reinforcement learning
- Morgan Kaufman, LOS Altos, CA
- R.A. McCallum, Using transitional proximity for faster reinforcement learning, Proc. 9th Int. Conf. on Machine Learning (Morgan Kaufman, LOS Altos, CA, 1992).
- (1992) Proc. 9th Int. Conf. on Machine Learning
- McCallum, R.A.¹

19
- 0001825948
- Mobile robot miniaturisation: A tool for investigation in control algorithms
- Kyoto, Japan
- F. Mondada, E. Franzi and P. Ienne, Mobile robot miniaturisation: A tool for investigation in control algorithms, Proc. 3rd Int. Symp. on Experimental Robotics, Kyoto, Japan (1993).
- (1993) Proc. 3rd Int. Symp. on Experimental Robotics
- Mondada, F.¹ Franzi, E.² Ienne, P.³

20
- 0000646059
- Learning internal representations by error propagation
- D. Rumelhart and J. Mc Clelland, eds., MIT Press, Cambridge, MA
- D. Rumelhart, G. Hinton and R. Williams, Learning internal representations by error propagation, in: D. Rumelhart and J. Mc Clelland, eds., Parallel Distributed Processing, Vol. 1 (MIT Press, Cambridge, MA, 1986) 318-362.
- (1986) Parallel Distributed Processing , vol.1 , pp. 318-362
- Rumelhart, D.¹ Hinton, G.² Williams, R.³

21
- 0009283096
- Reinforcement learning and neural reinforcement learning
- M. Verleysen, ed., D-Facto Publication, Brussels
- S. Sehad and C. Touzet, Reinforcement learning and neural reinforcement learning, in: M. Verleysen, ed., ESANN 94 (D-Facto Publication, Brussels, 1994).
- (1994) ESANN 94
- Sehad, S.¹ Touzet, C.²

22
- 0011200414
- Reinforcement learning architectures for a animats
- J.-A. Meyer and S.W. Wilson, eds., MIT Press, Cambridge, MA
- R.S. Sutton, Reinforcement learning architectures for a animats, in: J.-A. Meyer and S.W. Wilson, eds., Proc. 1st Int. Conf. on Simulation of Adaptive Behavior, From Animals to Animats (MIT Press, Cambridge, MA, 1991) 288-296.
- (1991) Proc. 1st Int. Conf. on Simulation of Adaptive Behavior, From Animals to Animats , pp. 288-296
- Sutton, R.S.¹

23
- 30244464358
- The connectionist sequential machine: A general model of sequential networks
- Canberra P. Leong and M. Jabri, eds., Sydney University Electrical Engineering NSW 2006
- C. Touzet and N. Giambiasi, The connectionist sequential machine: A general model of sequential networks, in: Canberra P. Leong and M. Jabri, eds., Australian Conf. on Neural Networks (Sydney University Electrical Engineering NSW 2006 (1992).
- (1992) Australian Conf. on Neural Networks
- Touzet, C.¹ Giambiasi, N.²

24
- 30244440305
- Application of connectionist models to fuzzy inference systems
- B. Fronhöfer and G. Wrightson, eds., Lectures Notes in Artifical Intelligence, Springer, Berlin
- C. Touzet and N. Giambiasi, Application of connectionist models to fuzzy inference systems, in: B. Fronhöfer and G. Wrightson, eds., Parallelization in Inference Systems, Lectures Notes in Artifical Intelligence, Vol. 590 (Springer, Berlin, 1992).
- (1992) Parallelization in Inference Systems , vol.590
- Touzet, C.¹ Giambiasi, N.²

25
- 30244514264
- Improving reinforcement learning of obstacle avoidance behavior with forbidden sequences of actions
- Cancun, Mexico
- C. Touzet, S. Sehad and N. Giambiasi, Improving reinforcement learning of obstacle avoidance behavior with forbidden sequences of actions, Int. Conf. on Robotics and Manufacturing, Cancun, Mexico (1995).
- (1995) Int. Conf. on Robotics and Manufacturing
- Touzet, C.¹ Sehad, S.² Giambiasi, N.³

26
- 0003117086
- An imitation of life
- W.G. Walter, An imitation of life, Scientific American 182 (5) (1950) 42-45.
- (1950) Scientific American , vol.182 , Issue.5 , pp. 42-45
- Walter, W.G.¹

27
- 0004049893
- Ph.D. Thesis, King's College, Cambridge, UK
- C.J.C.H. Watkins, Learning from delayed rewards, Ph.D. Thesis, King's College, Cambridge, UK, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.