SCOPUS Information Search Platform

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics

Volume 37, Issue 2, 2007, Pages 398-409

A study on expertise of agents and its effects on cooperative Q-learning

(3) Araabi, Babak Nadjar (a); Mastoureshgh, Sahar (a); Ahmadabadi, Majid Nili (a, b)

Author keywords

Area of expertise (AOE); Cooperative Q-learning agents; Cooperative Q-learning using AOE; Extraction of AOE; Multiagent systems (MASs)

Indexed keywords

ARTIFICIAL INTELLIGENCE; EXPERT SYSTEMS; LEARNING ALGORITHMS; LEARNING SYSTEMS;

Q-LEARNING; Q-TABLES; REINFORCEMENT-LEARNING HOMOGENEOUS AGENTS; STRATEGY SHARING (SS);

MULTI AGENT SYSTEMS;

ALGORITHM; ARTICLE; ARTIFICIAL INTELLIGENCE; AUTOMATED PATTERN RECOGNITION; COMPUTER SIMULATION; COOPERATION; DECISION SUPPORT SYSTEM; EXPERT SYSTEM; METHODOLOGY; THEORETICAL MODEL;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; COOPERATIVE BEHAVIOR; DECISION SUPPORT TECHNIQUES; EXPERT SYSTEMS; MODELS, THEORETICAL; PATTERN RECOGNITION, AUTOMATED;

EID: 34047122064 PISSN: 10834419 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCB.2006.883264 Document Type: Article

Times cited: 28

References (24)

1
- 0034205975
- "Multiagent systems: A survey from a machine learning perspective"
- Jun
- P. Stone and M. Veloso, "Multiagent systems: A survey from a machine learning perspective," Auton. Robots, vol. 8, no. 3, pp. 345-383, Jun. 2000.
- (2000) Auton. Robots , vol.8 , Issue.3 , pp. 345-383
- Stone, P.¹ Veloso, M.²

2
- 0003744207
- 2nd ed. Cambridge, MA: MIT Press
- G. Weiss, Multi Agent Systems: A Modern Approach to Distributed Artificial Intelligence, 2nd ed. Cambridge, MA: MIT Press, 2000.
- (2000) Multi Agent Systems: A Modern Approach to Distributed Artificial Intelligence
- Weiss, G.¹

3
- 0142229013
- Agent Platform Special Interest Group, Sep. Needham, MA: Object Management Group (OMG)
- Agent Platform Special Interest Group, Agent technology green paper, Sep. 2000, Needham, MA: Object Management Group (OMG).
- (2000) Agent Technology Green Paper

4
- 0036465258
- "Expertness-based cooperative Q-learning"
- Feb
- M. N. Ahmadabadi and M. Asadpour, "Expertness-based cooperative Q-learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 32, no. 1, pp. 66-77, Feb. 2002.
- (2002) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.32 , Issue.1 , pp. 66-77
- Ahmadabadi, M.N.¹ Asadpour, M.²

5
- 85152198941
- "Multi-agent reinforcement learning: Independent vs. cooperative agents"
- in Amherst, MA, Jun
- M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. 10th Int. Conf. Mach. Learning, Amherst, MA, Jun. 1993, pp. 330-337.
- (1993) Proc. 10th Int. Conf. Mach. Learning , pp. 330-337
- Tan, M.¹

6
- 33745834823
- "Distributed lazy Q-learning for cooperative mobile robots"
- Jan
- C. F. Touzet, "Distributed lazy Q-learning for cooperative mobile robots," Int. J. Adv. Robot. Syst., vol. 1, no. 1, pp. 5-13, Jan. 2004.
- (2004) Int. J. Adv. Robot. Syst. , vol.1 , Issue.1 , pp. 5-13
- Touzet, C.F.¹

7
- 18144414808
- "SWARM: Cooperative reinforcement learning for routing in ad-hoc networks"
- M.S. thesis, Comput. Sci. Dept., Univ. Dublin, Dublin, Ireland, Sep
- E. Curran, "SWARM: Cooperative reinforcement learning for routing in ad-hoc networks," M.S. thesis, Comput. Sci. Dept., Univ. Dublin, Dublin, Ireland, Sep. 2003.
- (2003)
- Curran, E.¹

8
- 14344250637
- "Sparse cooperative Q-learning"
- in Banff, AB, Canada, Jul. 4-8
- J. R. Kok and N. Vlassis, "Sparse cooperative Q-learning," in Proc. 21st Int. Conf. Mach. Learning, Banff, AB, Canada, Jul. 4-8, 2004, pp. 481-488.
- (2004) Proc. 21st Int. Conf. Mach. Learning , pp. 481-488
- Kok, J.R.¹ Vlassis, N.²

9
- 0004049893
- "Learning with delayed rewards"
- Ph.D. dissertation, Psychol. Dept., Univ. Cambridge, Cambridge, U.K
- C. J. C. H. Watkins, "Learning with delayed rewards," Ph.D. dissertation, Psychol. Dept., Univ. Cambridge, Cambridge, U.K., 1989.
- (1989)
- Watkins, C.J.C.H.¹

10
- 0029679044
- "Reinforcement learning: A survey"
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-285, 1996.
- (1996) J. Artif. Intell. Res. , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

11
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

12
- 0003787146
- Princeton, NJ: Princeton Univ. Press
- R. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.¹

13
- 0036057598
- "The necessity of average rewards in cooperative multirobot learning"
- in Washington, DC, May 11-15
- P. Tangamchit, J. M. Dolan, and P. K. Khosla, "The necessity of average rewards in cooperative multirobot learning," in Proc. IEEE Int. Conf. Robot. Autom., Washington, DC, May 11-15, 2002, pp. 1296-1301.
- (2002) Proc. IEEE Int. Conf. Robot. Autom. , pp. 1296-1301
- Tangamchit, P.¹ Dolan, J.M.² Khosla, P.K.³

14
- 1142280399
- "Advice-exchange amongst heterogeneous learning agents: Experiments in the pursuit domain"
- in Melbourne, Australia, Jul. 14-18
- L. Nunes and E. Oliveira, "Advice-exchange amongst heterogeneous learning agents: Experiments in the pursuit domain," in Proc. 2nd Int. Joint Conf. Auton. Agents and Multiagent Syst., Melbourne, Australia, Jul. 14-18, 2003, pp. 1084-1086.
- (2003) Proc. 2nd Int. Joint Conf. Auton. Agents and Multiagent Syst. , pp. 1084-1086
- Nunes, L.¹ Oliveira, E.²

15
- 4544316833
- "Cooperative learning using advice exchange"
- in E. Alonso, D. Kazakov, and D. Kudenko, Eds. New York: Springer-Verlag, Apr
- L. Nunes and E. Oliveira, "Cooperative learning using advice exchange," in Lecture Notes in Artificial Intelligence, vol. 2636, E. Alonso, D. Kazakov, and D. Kudenko, Eds. New York: Springer-Verlag, Apr. 2003, pp. 33-48.
- (2003) Lecture Notes in Artificial Intelligence , vol.2636 , pp. 33-48
- Nunes, L.¹ Oliveira, E.²

16
- 0036967052
- "Cooperative Q-learning with heterogeneity in actions"
- in Hammamet, Tunisia, Oct. 6-9
- S. M. R. Mirfattah and M. N. Ahmadabadi, "Cooperative Q-learning with heterogeneity in actions," in Proc. IEEE Int. Conf. Syst. Man Cybern., Hammamet, Tunisia, Oct. 6-9, 2002.
- (2002) Proc. IEEE Int. Conf. Syst. Man Cybern.
- Mirfattah, S.M.R.¹ Ahmadabadi, M.N.²

17
- 0031630561
- "The dynamics of reinforcement learning in cooperative multiagent systems"
- in Menlo Park, CA, Aug
- C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," in Proc. 15th Int. Conf. Artif. Intell., Menlo Park, CA, Aug. 1998, pp. 746-752.
- (1998) Proc. 15th Int. Conf. Artif. Intell. , pp. 746-752
- Claus, C.¹ Boutilier, C.²

18
- 0342683320
- "A general method for multi-agent reinforcement learning in unrestricted environments"
- AAAI, Menlo Park, CA, Tech. Rep. SS-96-01, Mar
- J. Schmidhuber, "A general method for multi-agent reinforcement learning in unrestricted environments," AAAI, Menlo Park, CA, pp. 84-87, Tech. Rep. SS-96-01, Mar. 1996.
- (1996) , pp. 84-87
- Schmidhuber, J.¹

19
- 85149834820
- "Markov games as a framework for multi-agent reinforcement learning"
- in San Francisco, CA
- M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. 11th Int. Conf. Mach. Learning, San Francisco, CA, 1994, pp. 157-163.
- (1994) Proc. 11th Int. Conf. Mach. Learning , pp. 157-163
- Littman, M.L.¹

20
- 0000929496
- "Multi-agent reinforcement learning: Theoretical framework and an algorithm"
- in Madison, WI, Jul
- J. Hu and M. P. Wellman, "Multi-agent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Mach. Learning, Madison, WI, Jul. 1998, pp. 242-250.
- (1998) Proc. 15th Int. Conf. Mach. Learning , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

21
- 34250632505
- "Cooperative Q-learning through state transitions: A method for cooperation based on area of expertise"
- in Singapore, Nov
- S. Mastoureshgh, B. N. Araabi, and M. N. Ahmadabadi, "Cooperative Q-learning through state transitions: A method for cooperation based on area of expertise," in Proc. 4th Asia-Pacific Conf. Simul. Evolution and Learning, Singapore, Nov. 2002, pp. 61-65.
- (2002) Proc. 4th Asia-Pacific Conf. Simul. Evolution and Learning , pp. 61-65
- Mastoureshgh, S.¹ Araabi, B.N.² Ahmadabadi, M.N.³

22
- 34250645675
- "Knowledge-based extraction of area of expertise for cooperation in learning"
- in Beijing, China, Oct
- A. Imanipour, M. Nili Ahmadabadi, B. N. Araabi, M. Asadpour, and R. Siegwart, "Knowledge-based extraction of area of expertise for cooperation in learning," in Proc. IEEE/RSJ Int. Conf. Intell. Robots and Syst. (IROS), Beijing, China, Oct. 2006, pp. 3700-3705.
- (2006) Proc. IEEE/RSJ Int. Conf. Intell. Robots and Syst. (IROS) , pp. 3700-3705
- Imanipour, A.¹ Nili Ahmadabadi, M.² Araabi, B.N.³ Asadpour, M.⁴ Siegwart, R.⁵

23
- 0032264491
- "Just talk to me: A field study of expertise location"
- in Seattle, WA, Nov. 14-18
- D. W. McDonald and M. S. Ackerman, "Just talk to me: A field study of expertise location," in Proc. ACM Conf. Comput.-Supported Cooperative Work, Seattle, WA, Nov. 14-18, 1998, pp. 315-324.
- (1998) Proc. ACM Conf. Comput.-Supported Cooperative Work , pp. 315-324
- McDonald, D.W.¹ Ackerman, M.S.²

24
- 34047191980
- "Knowledge based multiagent credit assignment: A study on task type and critic information"
- to be published
- A. Harati, M. N. Ahmadabadi, and B. N. Araabi, "Knowledge based multiagent credit assignment: A study on task type and critic information," Intelligent Automation and Soft Computing (AutoSoft), vol. 13, no. 3, 2007, to be published.
- (2007) Intelligent Automation and Soft Computing (AutoSoft) , vol.13 , Issue.3
- Harati, A.¹ Ahmadabadi, M.N.² Araabi, B.N.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.