Volume 15, Issue 2, 2010, Pages 261-268

Sequential Q-learning with Kalman filtering for multirobot cooperative transportation

Author keywords

Decision making; Multirobot systems; Q learning

Indexed keywords

COOPERATIVE TRANSPORTATION; CREDIT ASSIGNMENT; DISTRIBUTED Q-LEARNING; EXPERIMENTAL SYSTEM; KALMAN-FILTERING; LEARNING PROCESS; MULTI-ROBOT COOPERATION; MULTI-ROBOT DOMAINS; MULTI-ROBOT SYSTEMS; MULTIROBOTS; Q-LEARNING; Q-LEARNING ALGORITHMS; Q-VALUES; SINGLE-AGENT

EID: 77950596388     PISSN: 10834435     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMECH.2009.2024681     Document Type: Article
Times cited: 39

References (17)
  • 1
    • 0036817725
    • Guest editorial: Advances in multirobot systems
    • T. Arai, E. Pagello, and L. E. Parker, "Guest editorial: Advances in multirobot systems," IEEE Trans. Robot. Autom., vol. 18, no. 5, pp. 655-661, Oct. 2002.
    • (2002) IEEE Trans. Robot. Autom., vol. 18, Issue 5, pp. 655-661
    • Arai, T.1    Pagello, E.2    Parker, L.E.3
  • 2
    • 0036817712
    • Cooperative transport by multiple mobile robots in unknown static environments associated with real-time task assignment
    • N. Miyata, J. Ota, T. Arai, and H. Asama, "Cooperative transport by multiple mobile robots in unknown static environments associated with real-time task assignment," IEEE Trans. Robot. Autom., vol. 18, no. 5, pp. 769-780, Oct. 2002.
    • (2002) IEEE Trans. Robot. Autom., vol. 18, Issue 5, pp. 769-780
    • Miyata, N.1    Ota, J.2    Arai, T.3    Asama, H.4
  • 3
    • 0344961687
    • Campout: A control architecture for tightly coupled coordination of multi-robot systems for planetary surface exploration
    • T. Huntsberger, P. Pirjanian, A. Trebi-Ollennu et al., "Campout: A control architecture for tightly coupled coordination of multi-robot systems for planetary surface exploration," IEEE Trans. Syst., Man, Cybern., vol. 33, no. 5, pp. 550-559, Sep. 2003.
    • (2003) IEEE Trans. Syst., Man, Cybern., vol. 33, Issue 5, pp. 550-559
    • Huntsberger, T.1    Pirjanian, P.2    Trebi-Ollennu, A.3
  • 7
    • 34250651573
    • Multi-robot box-pushing: Single-agent Q-learning vs. team Q-learning
    • Y. Wang and C. W. de Silva, "Multi-robot box-pushing: Single-agent Q-learning vs. team Q-learning," in Proc. 2006 IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS), Beijing, China, pp. 3694-3699.
    • (2006) Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst. (IROS), pp. 3694-3699
    • Wang, Y.1    De Silva, C.W.2
  • 8
    • 0001547175
    • Value-function reinforcement learning in Markov games
    • M. L. Littman, "Value-function reinforcement learning in Markov games," J. Cogn. Syst. Res., vol. 2, no. 1, pp. 55-66, 2001.
    • (2001) J. Cogn. Syst. Res., vol. 2, Issue 1, pp. 55-66
    • Littman, M.L.1
  • 9
    • 85149834820
    • Markov games as a framework for multi-agent reinforcement learning
    • M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. 11th Int. Conf. Mach. Learn. (ML 1994), New Brunswick, NJ, pp. 157-163.
    • (1994) Proc. 11th Int. Conf. Mach. Learn., pp. 157-163
    • Littman, M.L.1
  • 10
    • 33746826183
    • Multiagent reinforcement learning for multi-robot systems: A survey
    • E. Yang and D. Gu. (2004). "Multiagent reinforcement learning for multi-robot systems: A survey," Tech. Rep., Univ. Essex, Colchester, U.K. [Online]. Available: http://robotics.usc.edu/~maja/teaching/cs584/papers/yang04multiagent.pdf
    • (2004) Tech. Rep., Univ. Essex
    • Yang, E.1    Gu, D.2
  • 11
    • 34347384820
    • Multirobot-based nanoassembly planning with automated path generation
    • X. Yuan and S. X. Yang, "Multirobot-based nanoassembly planning with automated path generation," IEEE/ASME Trans. Mechatronics, vol. 12, no. 3, pp. 352-356, Jun. 2007.
    • (2007) IEEE/ASME Trans. Mechatronics, vol. 12, Issue 3, pp. 352-356
    • Yuan, X.1    Yang, S.X.2
  • 12
    • 0036493758
    • Manipulating rigid payloads with multiple robots using compliant grippers
    • D. Sun and J. K. Mills, "Manipulating rigid payloads with multiple robots using compliant grippers," IEEE/ASME Trans. Mechatronics, vol. 7, no. 1, pp. 23-34, Mar. 2002.
    • (2002) IEEE/ASME Trans. Mechatronics, vol. 7, Issue 1, pp. 23-34
    • Sun, D.1    Mills, J.K.2
  • 13
    • 13244252467
    • Cooperative teleoperation of a multirobot system with force reflection via Internet
    • DOI 10.1109/TMECH.2004.839040
    • W.-T. Lo, Y. Liu, I. H. Elhajj, N. Xi, Y. Wang, and T. Fukuda, "Cooperative teleoperation of a multirobot system with force reflection via Internet," IEEE/ASME Trans. Mechatronics, vol. 9, no. 4, pp. 661-670, Dec. 2004. (Pubitemid 40181620)
    • (2004) IEEE/ASME Transactions on Mechatronics, vol. 9, Issue 4, pp. 661-670
    • Lo, W.-T.1    Liu, Y.2    Elhajj, I.H.3    Xi, N.4    Wang, Y.5    Fukuda, T.6
  • 14
    • 43549119106
    • A machine learning approach to multi-robot coordination
    • Y. Wang and C. W. de Silva, "A machine learning approach to multi-robot coordination," Eng. Appl. Artif. Intell., vol. 21, no. 3, pp. 470-484, 2008.
    • (2008) Eng. Appl. Artif. Intell., vol. 21, Issue 3, pp. 470-484
    • Wang, Y.1    De Silva, C.W.2
  • 15
    • 26444603425
    • All learning is local: Multi-agent learning in global reward games
    • Y.-H. Chang, T. Ho, and L. P. Kaelbling, "All learning is local: Multi-agent learning in global reward games," presented at the Neural Inf. Process. Syst. (NIPS), Whistler, BC, Canada, 2003.
    • (2003) Neural Inf. Process. Syst. (NIPS)
    • Chang, Y.-H.1    Ho, T.2    Kaelbling, L.P.3
  • 16
    • 85024429815
    • A new approach to linear filtering and prediction problems
    • R. E. Kalman, "A new approach to linear filtering and prediction problems," Trans. ASME, J. Basic Eng., vol. 82, ser. D, pp. 35-45, 1960.
    • (1960) Trans. ASME, J. Basic Eng., vol. 82, pp. 35-45
    • Kalman, R.E.1
  • 17
    • 56449116921
    • Mobile robot localization and object pose estimation using optical encoder, vision and laser sensors
    • H. Lang, Y. Wang, and C. W. de Silva, "Mobile robot localization and object pose estimation using optical encoder, vision and laser sensors," in Proc. IEEE Int. Conf. Autom. Logistics, Qingdao, China, Sep. 2008, pp. 617-622.
    • (2008) Proc. IEEE Int. Conf. Autom. Logistics, pp. 617-622
    • Lang, H.1    Wang, Y.2    De Silva, C.W.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.