메뉴 건너뛰기




Volumn 4, Issue 1, 1997, Pages 73-83

Reinforcement Learning in the Multi-Robot Domain

Author keywords

Group behavior; Multi agent systems; Reinforcement learning; Robot learning; Robotics

Indexed keywords

ALGORITHMS; MOBILE ROBOTS; MOTION PLANNING; ROBOT PROGRAMMING; ROBOTICS; SENSORS;

EID: 0030647149     PISSN: 09295593     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1008819414322     Document Type: Article
Times cited : (372)

References (29)
  • 4
    • 2342593717 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A.G., Bradtke, S.J., and Singh, S.P. 1993. Learning to act using real-time dynamic programming. AI Journal.
    • (1993) AI Journal
    • Barto, A.G.1    Bradtke, S.J.2    Singh, S.P.3
  • 5
    • 0022688781 scopus 로고
    • A robust layered control system for a mobile robot
    • Brooks, R.A. 1986. A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation, RA-2:14-23.
    • (1986) IEEE Journal of Robotics and Automation , vol.RA-2 , pp. 14-23
    • Brooks, R.A.1
  • 6
    • 0003645589 scopus 로고
    • Technical Report AIM-1227, MIT Artificial Intelligence Lab.
    • Brooks, R.A. 1990. The behavior language; user's guide. Technical Report AIM-1227, MIT Artificial Intelligence Lab.
    • (1990) The Behavior Language; User's Guide
    • Brooks, R.A.1
  • 9
    • 85151437138 scopus 로고
    • Programming robots using reinforcement learning and teaching
    • Pittsburgh, PA
    • Lin, L.-J. 1991a. Programming robots using reinforcement learning and teaching. In Proceedings, AAAI-91, Pittsburgh, PA, pp. 781-786.
    • (1991) Proceedings, AAAI-91 , pp. 781-786
    • Lin, L.-J.1
  • 11
    • 84976813028 scopus 로고
    • Learning to coordinate behaviors
    • Boston, MA
    • Maes, P. and Brooks, R.A. 1990. Learning to coordinate behaviors. In Proceedings, AAAI-91, Boston, MA, pp. 796-802.
    • (1990) Proceedings, AAAI-91 , pp. 796-802
    • Maes, P.1    Brooks, R.A.2
  • 13
    • 0002386181 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Pittsburgh, PA
    • Mahadevan, S. and Connell, J. 1991a. Automatic programming of behavior-based robots using reinforcement learning. In Proceedings, AAAI-91, Pittsburgh, PA, pp. 8-14.
    • (1991) Proceedings, AAAI-91 , pp. 8-14
    • Mahadevan, S.1    Connell, J.2
  • 14
    • 79955966783 scopus 로고
    • Scaling reinforcement learning to robotics by exploiting the subsumption architecture
    • Morgan Kaufmann
    • Mahadevan, S. and Connell, J. 1991b. Scaling reinforcement learning to robotics by exploiting the subsumption architecture. In Eighth International Workshop on Machine Learning, Morgan Kaufmann, pp. 328-337.
    • (1991) Eighth International Workshop on Machine Learning , pp. 328-337
    • Mahadevan, S.1    Connell, J.2
  • 20
    • 0010527819 scopus 로고
    • Learning reactive sequences from basic reflexes
    • The MIT Press: Brighton, England
    • Millán, J.D.R. 1994. Learning reactive sequences from basic reflexes. In Proceedings, Simulation of Adaptive Behavior SAB-94, The MIT Press: Brighton, England, pp. 266-274.
    • (1994) Proceedings, Simulation of Adaptive Behavior SAB-94 , pp. 266-274
    • Millán, J.D.R.1
  • 21
    • 0003971885 scopus 로고
    • Fast, robust adaptive control by learning only forward models
    • Moore, A.W 1992. Fast, robust adaptive control by learning only forward models. Advances in Neural Information Processing, 4:571-579.
    • (1992) Advances in Neural Information Processing , vol.4 , pp. 571-579
    • Moore, A.W.1
  • 24
    • 0028374275 scopus 로고
    • Robot juggling: An implementation of memory-bassed learning
    • Schaal, S. and Atkeson, C.C. 1994. Robot juggling: An implementation of memory-bassed learning. Control Systems Magazine, 14:57-71.
    • (1994) Control Systems Magazine , vol.14 , pp. 57-71
    • Schaal, S.1    Atkeson, C.C.2
  • 25
    • 33847202724 scopus 로고
    • Learning to predict by method of temporal differences
    • Sutton, R. 1988. Learning to predict by method of temporal differences. Machine Learning, 3(1):9-44.
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.1
  • 26
    • 85152198941 scopus 로고
    • Multi-agent reinforcement learning: Independent vs. cooperative agents
    • Amherst, MA
    • Tan, M. 1993. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings, Tenth International Conference on Machine Learning, Amherst, MA, pp. 330-337.
    • (1993) Proceedings, Tenth International Conference on Machine Learning , pp. 330-337
    • Tan, M.1
  • 27
    • 0007651192 scopus 로고
    • Integrating inductive neural network learning and explanation-based learning
    • Chambery, France
    • Thrun, S.B. and Mitchell, T.M. 1993. Integrating inductive neural network learning and explanation-based learning. In Proceedings, IJCAI-93, Chambery, France.
    • (1993) Proceedings, IJCAI-93
    • Thrun, S.B.1    Mitchell, T.M.2
  • 29
    • 0003326518 scopus 로고
    • Learning multiple goal behavior via task decomposition and dynamic policy merging
    • J.H. Connell and S. Mahadevan (Eds.), Kluwer Academic Publishers
    • Whitehead, S.D., Karlsson, J., and Tenenberg, J. 1993. Learning multiple goal behavior via task decomposition and dynamic policy merging. In Robot Learning, J.H. Connell and S. Mahadevan (Eds.), Kluwer Academic Publishers, pp. 45-78.
    • (1993) Robot Learning , pp. 45-78
    • Whitehead, S.D.1    Karlsson, J.2    Tenenberg, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.