메뉴 건너뛰기




Volumn 22, Issue 3-4, 1997, Pages 251-281

Neural reinforcement learning for behaviour synthesis

Author keywords

Autonomous robotics; Neural Q learning; Obstacle avoidance behaviour; Reinforcement learning; Self organising map

Indexed keywords

COLLISION AVOIDANCE; LEARNING SYSTEMS; NEURAL NETWORKS; ROBOTICS;

EID: 0031341345     PISSN: 09218890     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0921-8890(97)00042-0     Document Type: Article
Times cited : (96)

References (27)
  • 1
    • 0000500817 scopus 로고
    • Interactions between learning and evolution
    • C.G. Langton et al., eds., SFI Studies Sc. Complexity, Addison-Wesley, Reading, MA
    • D. Ackley and M. Littman, Interactions between learning and evolution, in: C.G. Langton et al., eds., Artificial Life II, SFI Studies Sc. Complexity, Vol X (Addison-Wesley, Reading, MA, 1991) 487-509.
    • (1991) Artificial Life II , vol.10 , pp. 487-509
    • Ackley, D.1    Littman, M.2
  • 2
    • 0016556021 scopus 로고
    • A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
    • J.S. Albus, A new approach to manipulator control: The cerebellar model articulation controller (CMAC), Transactions of the ASME (1975).
    • (1975) Transactions of the ASME
    • Albus, J.S.1
  • 6
    • 30244470118 scopus 로고
    • The ART of adaptive pattern recognition by a self-roganizing neural network
    • G.A. Carpenter and S. Grossberg, The ART of adaptive pattern recognition by a self-roganizing neural network, Proceedings of the IEEE (1988).
    • (1988) Proceedings of the IEEE
    • Carpenter, G.A.1    Grossberg, S.2
  • 9
    • 0028739953 scopus 로고
    • Robot shaping: Developing autonomous agents through learning
    • M. Dorigo and M. Colombetti, Robot shaping: Developing autonomous agents through learning, Artificial Intelligence 71 (2) (1994) 321-370.
    • (1994) Artificial Intelligence , vol.71 , Issue.2 , pp. 321-370
    • Dorigo, M.1    Colombetti, M.2
  • 10
    • 30244434915 scopus 로고
    • Extending the adaptive heuristic critic and Q-learning: From facts to implications
    • I. Alexander and J. Taylor, eds., Elsevier, Amsterdam
    • O. Holland and M. Snaith, Extending the adaptive heuristic critic and Q-learning: From facts to implications, in: I. Alexander and J. Taylor, eds., Artificial Neural Networks, Vol. 2 (Elsevier, Amsterdam, 1992) 599-602.
    • (1992) Artificial Neural Networks , vol.2 , pp. 599-602
    • Holland, O.1    Snaith, M.2
  • 14
    • 0000123778 scopus 로고
    • Self-improving reative agents based on reinforcement learning, planning and teaching
    • L.J. Lin, Self-improving reative agents based on reinforcement learning, planning and teaching, Machine Learning 8 (1992) 293-321.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.J.1
  • 15
    • 0003673017 scopus 로고
    • Ph.D. Thesis, in: C.G. Langton et al., eds., Carnegie Mellon University, Pittsburgh, CMU-CS-93-103
    • L.-J. Lin, Reinforcement learning for robots using neural networks, Ph.D. Thesis, in: C.G. Langton et al., eds., Carnegie Mellon University, Pittsburgh, CMU-CS-93-103, 1993.
    • (1993) Reinforcement Learning for Robots Using Neural Networks
    • Lin, L.-J.1
  • 16
    • 0026880130 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • S. Mahadevan and J. Connell, Automatic programming of behavior-based robots using reinforcement learning, Artificial Intelligence 55 (2) (1991) 311-365.
    • (1991) Artificial Intelligence , vol.55 , Issue.2 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2
  • 17
    • 84957895797 scopus 로고
    • Reward function for accelerated learning
    • W.W. Cohen and H. Hirsh, eds., Morgan Kaufman, LOS Altos, CA
    • M. Mataric, Reward function for accelerated learning, in: W.W. Cohen and H. Hirsh, eds., Proc. of 11th Int. Conf. on Machine Learning (Morgan Kaufman, LOS Altos, CA, 1994).
    • (1994) Proc. of 11th Int. Conf. on Machine Learning
    • Mataric, M.1
  • 18
    • 84957622922 scopus 로고
    • Using transitional proximity for faster reinforcement learning
    • Morgan Kaufman, LOS Altos, CA
    • R.A. McCallum, Using transitional proximity for faster reinforcement learning, Proc. 9th Int. Conf. on Machine Learning (Morgan Kaufman, LOS Altos, CA, 1992).
    • (1992) Proc. 9th Int. Conf. on Machine Learning
    • McCallum, R.A.1
  • 20
    • 0000646059 scopus 로고
    • Learning internal representations by error propagation
    • D. Rumelhart and J. Mc Clelland, eds., MIT Press, Cambridge, MA
    • D. Rumelhart, G. Hinton and R. Williams, Learning internal representations by error propagation, in: D. Rumelhart and J. Mc Clelland, eds., Parallel Distributed Processing, Vol. 1 (MIT Press, Cambridge, MA, 1986) 318-362.
    • (1986) Parallel Distributed Processing , vol.1 , pp. 318-362
    • Rumelhart, D.1    Hinton, G.2    Williams, R.3
  • 21
    • 0009283096 scopus 로고
    • Reinforcement learning and neural reinforcement learning
    • M. Verleysen, ed., D-Facto Publication, Brussels
    • S. Sehad and C. Touzet, Reinforcement learning and neural reinforcement learning, in: M. Verleysen, ed., ESANN 94 (D-Facto Publication, Brussels, 1994).
    • (1994) ESANN 94
    • Sehad, S.1    Touzet, C.2
  • 23
    • 30244464358 scopus 로고
    • The connectionist sequential machine: A general model of sequential networks
    • Canberra P. Leong and M. Jabri, eds., Sydney University Electrical Engineering NSW 2006
    • C. Touzet and N. Giambiasi, The connectionist sequential machine: A general model of sequential networks, in: Canberra P. Leong and M. Jabri, eds., Australian Conf. on Neural Networks (Sydney University Electrical Engineering NSW 2006 (1992).
    • (1992) Australian Conf. on Neural Networks
    • Touzet, C.1    Giambiasi, N.2
  • 24
    • 30244440305 scopus 로고
    • Application of connectionist models to fuzzy inference systems
    • B. Fronhöfer and G. Wrightson, eds., Lectures Notes in Artifical Intelligence, Springer, Berlin
    • C. Touzet and N. Giambiasi, Application of connectionist models to fuzzy inference systems, in: B. Fronhöfer and G. Wrightson, eds., Parallelization in Inference Systems, Lectures Notes in Artifical Intelligence, Vol. 590 (Springer, Berlin, 1992).
    • (1992) Parallelization in Inference Systems , vol.590
    • Touzet, C.1    Giambiasi, N.2
  • 25
    • 30244514264 scopus 로고
    • Improving reinforcement learning of obstacle avoidance behavior with forbidden sequences of actions
    • Cancun, Mexico
    • C. Touzet, S. Sehad and N. Giambiasi, Improving reinforcement learning of obstacle avoidance behavior with forbidden sequences of actions, Int. Conf. on Robotics and Manufacturing, Cancun, Mexico (1995).
    • (1995) Int. Conf. on Robotics and Manufacturing
    • Touzet, C.1    Sehad, S.2    Giambiasi, N.3
  • 26
    • 0003117086 scopus 로고
    • An imitation of life
    • W.G. Walter, An imitation of life, Scientific American 182 (5) (1950) 42-45.
    • (1950) Scientific American , vol.182 , Issue.5 , pp. 42-45
    • Walter, W.G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.