



Volume , Issue , 2006, Pages

Decentralized reinforcement learning control of a robotic manipulator

Author keywords

Decentralized control; Multi agent learning; Reinforcement learning

Indexed keywords

DECENTRALIZED CONTROL; MOBILE ROBOTS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING; ROBOTICS; TELECOMMUNICATION;

EID: 34547223380     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICARCV.2006.345351     Document Type: Conference Paper
Times cited: 53

References (22)
  • 1
  • 2
    • 0034205975
    • Multiagent systems: A survey from the machine learning perspective
    • P. Stone and M. Veloso, "Multiagent systems: A survey from the machine learning perspective," Autonomous Robots, vol. 8, no. 3, pp. 345-383, 2000.
    • (2000) Autonomous Robots , vol.8 , Issue.3 , pp. 345-383
    • Stone, P.1    Veloso, M.2
  • 3
    • 84949949419
    • Learning in multi-robot systems
    • G. Weiß and S. Sen, Eds. Springer Verlag
    • M. J. Matarić, "Learning in multi-robot systems," in Adaptation and Learning in Multi-Agent Systems, G. Weiß and S. Sen, Eds. Springer Verlag, 1996, pp. 152-163.
    • (1996) Adaptation and Learning in Multi-Agent Systems , pp. 152-163
    • Matarić, M.J.1
  • 4
    • 0032208335
    • Elevator group control using multiple reinforcement learning agents
    • R. H. Crites and A. G. Barto, "Elevator group control using multiple reinforcement learning agents," Machine Learning, vol. 33, no. 2-3, pp. 235-262, 1998.
    • (1998) Machine Learning , vol.33 , Issue.2-3 , pp. 235-262
    • Crites, R.H.1    Barto, A.G.2
  • 7
    • 26444601262
    • Cooperative multi-agent learning: The state of the art
    • November
    • L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," Autonomous Agents and Multi-Agent Systems, vol. 11, no. 3, pp. 387-434, November 2005.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 9
    • 34547224685
    • G. Chalkiadakis, "Multiagent reinforcement learning: Stochastic games with multiple learning players," Dept. of Computer Science, University of Toronto, Canada, Tech. Rep., 25 March 2003, URL: http://www.es.toronto.edu/~gehalk/DepthReport/DepthReport.ps.
  • 10
    • 28544446213
    • Evolutionary game theory and multi-agent reinforcement learning
    • K. Tuyls and A. Nowé, "Evolutionary game theory and multi-agent reinforcement learning," The Knowledge Engineering Review, vol. 20, no. 1, pp. 63-90, 2005.
    • (2005) The Knowledge Engineering Review , vol.20 , Issue.1 , pp. 63-90
    • Tuyls, K.1    Nowé, A.2
  • 11
    • 0036531878
    • Multiagent learning using a variable learning rate
    • M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
    • (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 12
    • 34249833101
    • Technical note: Q-learning
    • C. J. C. H. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 15
    • 84899027977
    • Convergence and no-regret in multiagent learning
    • Vancouver, Canada, 13-18 December
    • M. Bowling, "Convergence and no-regret in multiagent learning," in Advances in Neural Information Processing Systems 17 (NIPS-04), Vancouver, Canada, 13-18 December 2004, pp. 209-216.
    • (2004) Advances in Neural Information Processing Systems 17 (NIPS-04) , pp. 209-216
    • Bowling, M.1
  • 16
    • 0001547175
    • Value-function reinforcement learning in Markov games
    • M. L. Littman, "Value-function reinforcement learning in Markov games," Journal of Cognitive Systems Research, vol. 2, pp. 55-66, 2001.
    • (2001) Journal of Cognitive Systems Research , vol.2 , pp. 55-66
    • Littman, M.L.1
  • 19
    • 0002500351
    • Planning, learning and coordination in multiagent decision processes
    • De Zeeuwse Stromen, The Netherlands, 17-20 March
    • C. Boutilier, "Planning, learning and coordination in multiagent decision processes," in Proc. Sixth Conference on Theoretical Aspects of Rationality and Knowledge (TARK-96), De Zeeuwse Stromen, The Netherlands, 17-20 March 1996, pp. 195-210.
    • (1996) Proc. Sixth Conference on Theoretical Aspects of Rationality and Knowledge (TARK-96) , pp. 195-210
    • Boutilier, C.1
  • 20
    • 34547220537
    • M. T. J. Spaan, N. Vlassis, and F. C. A. Groen, "High level coordination of agents based on multiagent Markov decision processes with roles," in Workshop on Cooperative Robotics, 2002 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-02), Lausanne, Switzerland, 1 October 2002, pp. 66-73.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.