-
1
-
-
0034205975
-
"Multiagent systems: A survey from a machine learning perspective"
-
Jun
-
P. Stone and M. Veloso, "Multiagent systems: A survey from a machine learning perspective," Auton. Robots, vol. 8, no. 3, pp. 345-383, Jun. 2000.
-
(2000)
Auton. Robots
, vol.8
, Issue.3
, pp. 345-383
-
-
Stone, P.1
Veloso, M.2
-
3
-
-
0142229013
-
-
Agent Platform Special Interest Group, Sep. Needham, MA: Object Management Group (OMG)
-
Agent Platform Special Interest Group, Agent technology green paper, Sep. 2000, Needham, MA: Object Management Group (OMG).
-
(2000)
Agent Technology Green Paper
-
-
-
4
-
-
0036465258
-
"Expertness-based cooperative Q-learning"
-
Feb
-
M. N. Ahmadabadi and M. Asadpour, "Expertness-based cooperative Q-learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 32, no. 1, pp. 66-77, Feb. 2002.
-
(2002)
IEEE Trans. Syst., Man, Cybern. B, Cybern.
, vol.32
, Issue.1
, pp. 66-77
-
-
Ahmadabadi, M.N.1
Asadpour, M.2
-
5
-
-
85152198941
-
"Multi-agent reinforcement learning: Independent vs. cooperative agents"
-
in Amherst, MA, Jun
-
M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. 10th Int. Conf. Mach. Learning, Amherst, MA, Jun. 1993, pp. 330-337.
-
(1993)
Proc. 10th Int. Conf. Mach. Learning
, pp. 330-337
-
-
Tan, M.1
-
6
-
-
33745834823
-
"Distributed lazy Q-learning for cooperative mobile robots"
-
Jan
-
C. F. Touzet, "Distributed lazy Q-learning for cooperative mobile robots," Int. J. Adv. Robot. Syst., vol. 1, no. 1, pp. 5-13, Jan. 2004.
-
(2004)
Int. J. Adv. Robot. Syst.
, vol.1
, Issue.1
, pp. 5-13
-
-
Touzet, C.F.1
-
7
-
-
18144414808
-
"SWARM: Cooperative reinforcement learning for routing in ad-hoc networks"
-
M.S. thesis, Comput. Sci. Dept., Univ. Dublin, Dublin, Ireland, Sep
-
E. Curran, "SWARM: Cooperative reinforcement learning for routing in ad-hoc networks," M.S. thesis, Comput. Sci. Dept., Univ. Dublin, Dublin, Ireland, Sep. 2003.
-
(2003)
-
-
Curran, E.1
-
8
-
-
14344250637
-
"Sparse cooperative Q-learning"
-
in Banff, AB, Canada, Jul. 4-8
-
J. R. Kok and N. Vlassis, "Sparse cooperative Q-learning," in Proc. 21st Int. Conf. Mach. Learning, Banff, AB, Canada, Jul. 4-8, 2004, pp. 481-488.
-
(2004)
Proc. 21st Int. Conf. Mach. Learning
, pp. 481-488
-
-
Kok, J.R.1
Vlassis, N.2
-
9
-
-
0004049893
-
"Learning from delayed rewards"
-
Ph.D. dissertation, Psychol. Dept., Univ. Cambridge, Cambridge, U.K
-
C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Psychol. Dept., Univ. Cambridge, Cambridge, U.K., 1989.
-
(1989)
-
-
Watkins, C.J.C.H.1
-
10
-
-
0029679044
-
"Reinforcement learning: A survey"
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-285, 1996.
-
(1996)
J. Artif. Intell. Res.
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
12
-
-
0003787146
-
-
Princeton, NJ: Princeton Univ. Press
-
R. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
13
-
-
0036057598
-
"The necessity of average rewards in cooperative multirobot learning"
-
in Washington, DC, May 11-15
-
P. Tangamchit, J. M. Dolan, and P. K. Khosla, "The necessity of average rewards in cooperative multirobot learning," in Proc. IEEE Int. Conf. Robot. Autom., Washington, DC, May 11-15, 2002, pp. 1296-1301.
-
(2002)
Proc. IEEE Int. Conf. Robot. Autom.
, pp. 1296-1301
-
-
Tangamchit, P.1
Dolan, J.M.2
Khosla, P.K.3
-
14
-
-
1142280399
-
"Advice-exchange amongst heterogeneous learning agents: Experiments in the pursuit domain"
-
in Melbourne, Australia, Jul. 14-18
-
L. Nunes and E. Oliveira, "Advice-exchange amongst heterogeneous learning agents: Experiments in the pursuit domain," in Proc. 2nd Int. Joint Conf. Auton. Agents and Multiagent Syst., Melbourne, Australia, Jul. 14-18, 2003, pp. 1084-1086.
-
(2003)
Proc. 2nd Int. Joint Conf. Auton. Agents and Multiagent Syst.
, pp. 1084-1086
-
-
Nunes, L.1
Oliveira, E.2
-
15
-
-
4544316833
-
"Cooperative learning using advice exchange"
-
in E. Alonso, D. Kazakov, and D. Kudenko, Eds. New York: Springer-Verlag, Apr
-
L. Nunes and E. Oliveira, "Cooperative learning using advice exchange," in Lecture Notes in Artificial Intelligence, vol. 2636, E. Alonso, D. Kazakov, and D. Kudenko, Eds. New York: Springer-Verlag, Apr. 2003, pp. 33-48.
-
(2003)
Lecture Notes in Artificial Intelligence
, vol.2636
, pp. 33-48
-
-
Nunes, L.1
Oliveira, E.2
-
16
-
-
0036967052
-
"Cooperative Q-learning with heterogeneity in actions"
-
in Hammamet, Tunisia, Oct. 6-9
-
S. M. R. Mirfattah and M. N. Ahmadabadi, "Cooperative Q-learning with heterogeneity in actions," in Proc. IEEE Int. Conf. Syst. Man Cybern., Hammamet, Tunisia, Oct. 6-9, 2002.
-
(2002)
Proc. IEEE Int. Conf. Syst. Man Cybern.
-
-
Mirfattah, S.M.R.1
Ahmadabadi, M.N.2
-
17
-
-
0031630561
-
"The dynamics of reinforcement learning in cooperative multiagent systems"
-
in Menlo Park, CA, Aug
-
C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," in Proc. 15th Int. Conf. Artif. Intell., Menlo Park, CA, Aug. 1998, pp. 746-752.
-
(1998)
Proc. 15th Int. Conf. Artif. Intell.
, pp. 746-752
-
-
Claus, C.1
Boutilier, C.2
-
18
-
-
0342683320
-
"A general method for multi-agent reinforcement learning in unrestricted environments"
-
AAAI, Menlo Park, CA, Tech. Rep. SS-96-01, Mar
-
J. Schmidhuber, "A general method for multi-agent reinforcement learning in unrestricted environments," AAAI, Menlo Park, CA, pp. 84-87, Tech. Rep. SS-96-01, Mar. 1996.
-
(1996)
, pp. 84-87
-
-
Schmidhuber, J.1
-
19
-
-
85149834820
-
"Markov games as a framework for multi-agent reinforcement learning"
-
in San Francisco, CA
-
M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. 11th Int. Conf. Mach. Learning, San Francisco, CA, 1994, pp. 157-163.
-
(1994)
Proc. 11th Int. Conf. Mach. Learning
, pp. 157-163
-
-
Littman, M.L.1
-
20
-
-
0000929496
-
"Multi-agent reinforcement learning: Theoretical framework and an algorithm"
-
in Madison, WI, Jul
-
J. Hu and M. P. Wellman, "Multi-agent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Mach. Learning, Madison, WI, Jul. 1998, pp. 242-250.
-
(1998)
Proc. 15th Int. Conf. Mach. Learning
, pp. 242-250
-
-
Hu, J.1
Wellman, M.P.2
-
21
-
-
34250632505
-
"Cooperative Q-learning through state transitions: A method for cooperation based on area of expertise"
-
in Singapore, Nov
-
S. Mastoureshgh, B. N. Araabi, and M. N. Ahmadabadi, "Cooperative Q-learning through state transitions: A method for cooperation based on area of expertise," in Proc. 4th Asia-Pacific Conf. Simul. Evolution and Learning, Singapore, Nov. 2002, pp. 61-65.
-
(2002)
Proc. 4th Asia-Pacific Conf. Simul. Evolution and Learning
, pp. 61-65
-
-
Mastoureshgh, S.1
Araabi, B.N.2
Ahmadabadi, M.N.3
-
22
-
-
34250645675
-
"Knowledge-based extraction of area of expertise for cooperation in learning"
-
in Beijing, China, Oct
-
A. Imanipour, M. Nili Ahmadabadi, B. N. Araabi, M. Asadpour, and R. Siegwart, "Knowledge-based extraction of area of expertise for cooperation in learning," in Proc. IEEE/RSJ Int. Conf. Intell. Robots and Syst. (IROS), Beijing, China, Oct. 2006, pp. 3700-3705.
-
(2006)
Proc. IEEE/RSJ Int. Conf. Intell. Robots and Syst. (IROS)
, pp. 3700-3705
-
-
Imanipour, A.1
Nili Ahmadabadi, M.2
Araabi, B.N.3
Asadpour, M.4
Siegwart, R.5
-
23
-
-
0032264491
-
"Just talk to me: A field study of expertise location"
-
in Seattle, WA, Nov. 14-18
-
D. W. McDonald and M. S. Ackerman, "Just talk to me: A field study of expertise location," in Proc. ACM Conf. Comput.-Supported Cooperative Work, Seattle, WA, Nov. 14-18, 1998, pp. 315-324.
-
(1998)
Proc. ACM Conf. Comput.-Supported Cooperative Work
, pp. 315-324
-
-
McDonald, D.W.1
Ackerman, M.S.2
-
24
-
-
34047191980
-
"Knowledge based multiagent credit assignment: A study on task type and critic information"
-
to be published
-
A. Harati, M. N. Ahmadabadi, and B. N. Araabi, "Knowledge based multiagent credit assignment: A study on task type and critic information," Intelligent Automation and Soft Computing (AutoSoft), vol. 13, no. 3, 2007, to be published.
-
(2007)
Intelligent Automation and Soft Computing (AutoSoft)
, vol.13
, Issue.3
-
-
Harati, A.1
Ahmadabadi, M.N.2
Araabi, B.N.3