Volume 37, Issue 2, 2007, Pages 398-409

A study on expertise of agents and its effects on cooperative Q-learning

Author keywords

Area of expertise (AOE); Cooperative Q learning agents; Cooperative Q learning using AOE; Extraction of AOE; Multiagent systems (MASs)

Indexed keywords

ARTIFICIAL INTELLIGENCE; EXPERT SYSTEMS; LEARNING ALGORITHMS; LEARNING SYSTEMS

EID: 34047122064     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2006.883264     Document Type: Article
Times cited: 28

References (24)
  • 1
    • 0034205975
    • "Multiagent systems: A survey from a machine learning perspective"
    • Jun
    • P. Stone and M. Veloso, "Multiagent systems: A survey from a machine learning perspective," Auton. Robots, vol. 8, no. 3, pp. 345-383, Jun. 2000.
    • (2000) Auton. Robots , vol.8 , Issue.3 , pp. 345-383
    • Stone, P.1    Veloso, M.2
  • 3
    • 0142229013
    • Agent Platform Special Interest Group, Sep. Needham, MA: Object Management Group (OMG)
    • Agent Platform Special Interest Group, Agent technology green paper, Sep. 2000, Needham, MA: Object Management Group (OMG).
    • (2000) Agent Technology Green Paper
  • 5
    • 85152198941
    • "Multi-agent reinforcement learning: Independent vs. cooperative agents"
    • in Amherst, MA, Jun
    • M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. 10th Int. Conf. Mach. Learning, Amherst, MA, Jun. 1993, pp. 330-337.
    • (1993) Proc. 10th Int. Conf. Mach. Learning , pp. 330-337
    • Tan, M.1
  • 6
    • 33745834823
    • "Distributed lazy Q-learning for cooperative mobile robots"
    • Jan
    • C. F. Touzet, "Distributed lazy Q-learning for cooperative mobile robots," Int. J. Adv. Robot. Syst., vol. 1, no. 1, pp. 5-13, Jan. 2004.
    • (2004) Int. J. Adv. Robot. Syst. , vol.1 , Issue.1 , pp. 5-13
    • Touzet, C.F.1
  • 7
    • 18144414808
    • "SWARM: Cooperative reinforcement learning for routing in ad-hoc networks"
    • M.S. thesis, Comput. Sci. Dept., Univ. Dublin, Dublin, Ireland, Sep
    • E. Curran, "SWARM: Cooperative reinforcement learning for routing in ad-hoc networks," M.S. thesis, Comput. Sci. Dept., Univ. Dublin, Dublin, Ireland, Sep. 2003.
    • (2003)
    • Curran, E.1
  • 8
    • 14344250637
    • "Sparse cooperative Q-learning"
    • in Banff, AB, Canada, Jul. 4-8
    • J. R. Kok and N. Vlassis, "Sparse cooperative Q-learning," in Proc. 21st Int. Conf. Mach. Learning, Banff, AB, Canada, Jul. 4-8, 2004, pp. 481-488.
    • (2004) Proc. 21st Int. Conf. Mach. Learning , pp. 481-488
    • Kok, J.R.1    Vlassis, N.2
  • 9
    • 0004049893
    • "Learning with delayed rewards"
    • Ph.D. dissertation, Psychol. Dept., Univ. Cambridge, Cambridge, U.K
    • C. J. C. H. Watkins, "Learning with delayed rewards," Ph.D. dissertation, Psychol. Dept., Univ. Cambridge, Cambridge, U.K., 1989.
    • (1989)
    • Watkins, C.J.C.H.1
  • 12
    • 0003787146
    • Princeton, NJ: Princeton Univ. Press
    • R. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 13
    • 0036057598
    • "The necessity of average rewards in cooperative multirobot learning"
    • in Washington, DC, May 11-15
    • P. Tangamchit, J. M. Dolan, and P. K. Khosla, "The necessity of average rewards in cooperative multirobot learning," in Proc. IEEE Int. Conf. Robot. Autom., Washington, DC, May 11-15, 2002, pp. 1296-1301.
    • (2002) Proc. IEEE Int. Conf. Robot. Autom. , pp. 1296-1301
    • Tangamchit, P.1    Dolan, J.M.2    Khosla, P.K.3
  • 14
    • 1142280399
    • "Advice-exchange amongst heterogeneous learning agents: Experiments in the pursuit domain"
    • in Melbourne, Australia, Jul. 14-18
    • L. Nunes and E. Oliveira, "Advice-exchange amongst heterogeneous learning agents: Experiments in the pursuit domain," in Proc. 2nd Int. Joint Conf. Auton. Agents and Multiagent Syst., Melbourne, Australia, Jul. 14-18, 2003, pp. 1084-1086.
    • (2003) Proc. 2nd Int. Joint Conf. Auton. Agents and Multiagent Syst. , pp. 1084-1086
    • Nunes, L.1    Oliveira, E.2
  • 15
    • 4544316833
    • "Cooperative learning using advice exchange"
    • in E. Alonso, D. Kazakov, and D. Kudenko, Eds. New York: Springer-Verlag, Apr
    • L. Nunes and E. Oliveira, "Cooperative learning using advice exchange," in Lecture Notes in Artificial Intelligence, vol. 2636, E. Alonso, D. Kazakov, and D. Kudenko, Eds. New York: Springer-Verlag, Apr. 2003, pp. 33-48.
    • (2003) Lecture Notes in Artificial Intelligence , vol.2636 , pp. 33-48
    • Nunes, L.1    Oliveira, E.2
  • 17
    • 0031630561
    • "The dynamics of reinforcement learning in cooperative multiagent systems"
    • in Menlo Park, CA, Aug
    • C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," in Proc. 15th Int. Conf. Artif. Intell., Menlo Park, CA, Aug. 1998, pp. 746-752.
    • (1998) Proc. 15th Int. Conf. Artif. Intell. , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 18
    • 0342683320
    • "A general method for multi-agent reinforcement learning in unrestricted environments"
    • AAAI, Menlo Park, CA, Tech. Rep. SS-96-01, Mar
    • J. Schmidhuber, "A general method for multi-agent reinforcement learning in unrestricted environments," AAAI, Menlo Park, CA, pp. 84-87, Tech. Rep. SS-96-01, Mar. 1996.
    • (1996) , pp. 84-87
    • Schmidhuber, J.1
  • 19
    • 85149834820
    • "Markov games as a framework for multi-agent reinforcement learning"
    • in San Francisco, CA
    • M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. 11th Int. Conf. Mach. Learning, San Francisco, CA, 1994, pp. 157-163.
    • (1994) Proc. 11th Int. Conf. Mach. Learning , pp. 157-163
    • Littman, M.L.1
  • 20
    • 0000929496
    • "Multi-agent reinforcement learning: Theoretical framework and an algorithm"
    • in Madison, WI, Jul
    • J. Hu and M. P. Wellman, "Multi-agent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Mach. Learning, Madison, WI, Jul. 1998, pp. 242-250.
    • (1998) Proc. 15th Int. Conf. Mach. Learning , pp. 242-250
    • Hu, J.1    Wellman, M.P.2
  • 21
  • 24
    • 34047191980
    • "Knowledge based multiagent credit assignment: A study on task type and critic information"
    • to be published
    • A. Harati, M. N. Ahmadabadi, and B. N. Araabi, "Knowledge based multiagent credit assignment: A study on task type and critic information," Intelligent Automation and Soft Computing (AutoSoft), vol. 13, no. 3, 2007, to be published.
    • (2007) Intelligent Automation and Soft Computing (AutoSoft) , vol.13 , Issue.3
    • Harati, A.1    Ahmadabadi, M.N.2    Araabi, B.N.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.