-
1
-
-
70350699723
-
A multiagent reinforcement learning algorithm with non-linear dynamics
-
Abdallah, S. & Lesser, V. 2008. A multiagent reinforcement learning algorithm with non-linear dynamics. Journal of Artificial Intelligence Research 33, 521-549.
-
(2008)
Journal of Artificial Intelligence Research
, vol.33
, pp. 521-549
-
-
Abdallah, S.1
Lesser, V.2
-
3
-
-
58149280068
-
Multi-agent reinforcement learning in common interest and fixed sum stochastic games: An experimental study
-
Bab, A. & Brafman, R. I. 2008. Multi-agent reinforcement learning in common interest and fixed sum stochastic games: An experimental study. Journal of Machine Learning Research 9, 2635-2675
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 2635-2675
-
-
Bab, A.1
Brafman, R.I.2
-
4
-
-
0028745178
-
Communication in reactive multiagent robotic systems
-
Balch, T. & Arkin, R. C. 1994. Communication in reactive multiagent robotic systems. Autonomous Robots 1(1), 27-52.
-
(1994)
Autonomous Robots
, vol.1
, Issue.1
, pp. 27-52
-
-
Balch, T.1
Arkin, R.C.2
-
6
-
-
9144256373
-
On-policy concurrent reinforcement learning
-
Banerjee, B., Sen, S. & Peng, J. 2004. On-policy concurrent reinforcement learning. Journal of Experimental & Theoretical Artificial Intelligence 16(4), 245-260.
-
(2004)
Journal of Experimental & Theoretical Artificial Intelligence
, vol.16
, Issue.4
, pp. 245-260
-
-
Banerjee, B.1
Sen, S.2
Peng, J.3
-
7
-
-
0003529066
-
-
Boeing Advanced Technology Center, Boeing Computing Services
-
Benda, M., Jagannathan, V. & Dodhiawala, R. 1986. On Optimal Cooperation of Knowledge Sources - an Experimental Investigation. Technical report BCS-G2010-280, Boeing Advanced Technology Center, Boeing Computing Services.
-
(1986)
On Optimal Cooperation of Knowledge Sources - An Experimental Investigation. Technical report BCS-G2010-280
-
-
Benda, M.1
Jagannathan, V.2
Dodhiawala, R.3
-
8
-
-
0002500351
-
Planning, learning and coordination in multiagent decision processes
-
Morgan Kaufmann Publishers Inc.
-
Boutilier, C. 1996. Planning, learning and coordination in multiagent decision processes. In Theoretical Aspects of Rationality and Knowledge, Morgan Kaufmann Publishers Inc., 195-201.
-
(1996)
Theoretical Aspects of Rationality and Knowledge
, pp. 195-201
-
-
Boutilier, C.1
-
9
-
-
84880690163
-
Sequential optimality and coordination in multiagent systems
-
Morgan Kaufmann Publishers Inc.
-
Boutilier, C. 1999. Sequential optimality and coordination in multiagent systems. In IJCAI, Morgan Kaufmann Publishers Inc., 478-485.
-
(1999)
IJCAI
, pp. 478-485
-
-
Boutilier, C.1
-
10
-
-
84899027977
-
Convergence and no-regret in multiagent learning
-
Saul, L. K., Weiss, Y. & Bottou, L. (eds). MIT Press
-
Bowling, M. 2005. Convergence and no-regret in multiagent learning. In Advances in Neural Information Processing Systems, Saul, L. K., Weiss, Y. & Bottou, L. (eds). MIT Press, 209-216.
-
(2005)
Advances in Neural Information Processing Systems
, pp. 209-216
-
-
Bowling, M.1
-
12
-
-
0036531878
-
Multiagent learning using a variable learning rate
-
Bowling, M. & Veloso, M. 2002. Multiagent learning using a variable learning rate. Artificial Intelligence 136, 215-250.
-
(2002)
Artificial Intelligence
, vol.136
, pp. 215-250
-
-
Bowling, M.1
Veloso, M.2
-
14
-
-
34547223380
-
Decentralized reinforcement learning control of a robotic manipulator
-
Singapore
-
Busoniu, L., Babuska, R. & De Schutter, B. 2006. Decentralized reinforcement learning control of a robotic manipulator. In Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision (ICARCV 2006), 1347-1352. Singapore.
-
(2006)
Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision (ICARCV 2006)
, pp. 1347-1352
-
-
Busoniu, L.1
Babuska, R.2
De Schutter, B.3
-
15
-
-
40949147745
-
A comprehensive survey of multiagent reinforcement learning
-
Busoniu, L., Babuska, R. & De Schutter, B. 2008. A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 38(2), 156-172.
-
(2008)
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
, vol.38
, Issue.2
, pp. 156-172
-
-
Busoniu, L.1
Babuska, R.2
De Schutter, B.3
-
16
-
-
77649261098
-
Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems
-
Springer, Lecture Notes in Computer Science
-
Carpenter, M. & Kudenko, D. 2005. Baselines for joint-action reinforcement learning of coordination in cooperative multi-agent systems. In Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning, Lecture Notes in Computer Science, 3394, 55-72. Springer.
-
(2005)
Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning
, vol.3394
, pp. 55-72
-
-
Carpenter, M.1
Kudenko, D.2
-
17
-
-
0031630561
-
The dynamics of reinforcement learning in cooperative multiagent systems
-
American Association for Artificial Intelligence
-
Claus, C. & Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the 15th National Conference on Artificial Intelligence, 746-752, American Association for Artificial Intelligence.
-
(1998)
Proceedings of the 15th National Conference on Artificial Intelligence
, pp. 746-752
-
-
Claus, C.1
Boutilier, C.2
-
18
-
-
33750270145
-
Building autonomic systems using collaborative reinforcement learning
-
Dowling, J., Cunningham, R., Curran, E. & Cahill, V. 2006. Building autonomic systems using collaborative reinforcement learning. Knowledge Engineering Review 21(3), 231-238.
-
(2006)
Knowledge Engineering Review
, vol.21
, Issue.3
, pp. 231-238
-
-
Dowling, J.1
Cunningham, R.2
Curran, E.3
Cahill, V.4
-
20
-
-
33751020264
-
Multi-agent case-based reasoning for cooperative reinforcement learners
-
Springer
-
Gabel, T. & Riedmiller, M. 2006. Multi-agent case-based reasoning for cooperative reinforcement learners. In Proceedings of the ECCBR, 32-46. Springer.
-
(2006)
Proceedings of the ECCBR
, pp. 32-46
-
-
Gabel, T.1
Riedmiller, M.2
-
23
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L. P., Littman, M. & Moore, A. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.2
Moore, A.3
-
24
-
-
0036932299
-
Reinforcement learning of coordination in cooperative multi-agent systems
-
Dechter, R., Kearns, M. & Sutton, R. (eds.). Edmonton, Alberta, Canada
-
Kapetanakis, S. & Kudenko, D. 2002. Reinforcement learning of coordination in cooperative multi-agent systems. In Proceedings of the 9th NCAI, Dechter, R., Kearns, M. & Sutton, R. (eds.). Edmonton, Alberta, Canada.
-
(2002)
Proceedings of the 9th NCAI
-
-
Kapetanakis, S.1
Kudenko, D.2
-
26
-
-
33645306139
-
Learning to coordinate using commitment sequences in cooperative multi-agent systems
-
Springer, Lecture Notes in Computer Science
-
Kapetanakis, S., Kudenko, D. & Strens, M. J. A. 2005. Learning to coordinate using commitment sequences in cooperative multi-agent systems. In Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning, Lecture Notes in Computer Science, 106-118. Springer.
-
(2005)
Adaptive Agents and Multi-Agent Systems II: Adaptation and Multi-Agent Learning
, pp. 106-118
-
-
Kapetanakis, S.1
Kudenko, D.2
Strens, M.J.A.3
-
27
-
-
56049125779
-
Multiagent reinforcement learning for urban traffic control using coordination graphs
-
Lecture Notes in Computer Science, Springer
-
Kuyer, L., Whiteson, S., Bakker, B. & Vlassis, N. 2008. Multiagent reinforcement learning for urban traffic control using coordination graphs. In ECML PKDD '08: Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I, Lecture Notes in Computer Science, 5211, 656-671. Springer.
-
(2008)
ECML PKDD '08: Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
, vol.5211
, pp. 656-671
-
-
Kuyer, L.1
Whiteson, S.2
Bakker, B.3
Vlassis, N.4
-
29
-
-
4544226982
-
Reinforcement learning for stochastic cooperative multi-agent systems
-
Lauer, M. & Riedmiller, M. 2004. Reinforcement learning for stochastic cooperative multi-agent systems. Autonomous Agents and Multi-Agent Systems 03, 1516-1517.
-
(2004)
Autonomous Agents and Multi-Agent Systems
, vol.3
, pp. 1516-1517
-
-
Lauer, M.1
Riedmiller, M.2
-
31
-
-
0001547175
-
Value-function reinforcement learning in Markov games
-
Littman, M. 2001. Value-function reinforcement learning in Markov games. Journal of Cognitive Systems Research 2, 55-66.
-
(2001)
Journal of Cognitive Systems Research
, vol.2
, pp. 55-66
-
-
Littman, M.1
-
32
-
-
0035410806
-
Distributed manipulation using discrete actuator arrays
-
Luntz, J. E., Messner, W. & Choset, H. 2001. Distributed manipulation using discrete actuator arrays. The International Journal of Robotics Research 20(7), 553-583.
-
(2001)
The International Journal of Robotics Research
, vol.20
, Issue.7
, pp. 553-583
-
-
Luntz, J.E.1
Messner, W.2
Choset, H.3
-
34
-
-
33749869379
-
Reward function and initial values: Better choices for accelerated goal-directed reinforcement learning
-
Springer
-
Matignon, L., Laurent, G. J. & Le Fort-Piat, N. 2006. Reward function and initial values: better choices for accelerated goal-directed reinforcement learning. In Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN'06), Lecture Notes in Computer Science, 4131, 840-849. Springer.
-
(2006)
Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN'06), Lecture Notes in Computer Science
, vol.4131
, pp. 840-849
-
-
Matignon, L.1
Laurent, G.J.2
Le Fort-Piat, N.3
-
36
-
-
84858041504
-
A study of FMQ heuristic in cooperative multiagent games
-
Estoril, Portugal
-
Matignon, L., Laurent, G. J. & Le Fort-Piat, N. 2008. A study of FMQ heuristic in cooperative multiagent games. In Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems. Workshop 10: Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domains (AAMAS 08), Estoril, Portugal.
-
(2008)
Proceedings of the 7th International Conference on Autonomous Agents and Multiagent Systems. Workshop 10: Multi-Agent Sequential Decision Making in Uncertain Multi-Agent Domains (AAMAS 08)
-
-
Matignon, L.1
Laurent, G.J.2
Le Fort-Piat, N.3
-
37
-
-
77955654600
-
Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning
-
Matignon, L., Laurent, G. J., Le Fort-Piat, N. & Chapuis, Y. A. 2010. Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning. Journal of Intelligent and Robotic Systems 59(2), 145-166.
-
(2010)
Journal of Intelligent and Robotic Systems
, vol.59
, Issue.2
, pp. 145-166
-
-
Matignon, L.1
Laurent, G.J.2
Le Fort-Piat, N.3
Chapuis, Y.A.4
-
38
-
-
70349595296
-
Learning to cooperate in multi-agent systems by combining Q-learning and evolutionary strategy
-
McGlohon, M. & Sen, S. 2005. Learning to cooperate in multi-agent systems by combining Q-learning and evolutionary strategy. International Journal on Lateral Computing 1(2), 58-64.
-
(2005)
International Journal on Lateral Computing
, vol.1
, Issue.2
, pp. 58-64
-
-
McGlohon, M.1
Sen, S.2
-
39
-
-
38349032850
-
Convergence of independent adaptive learners
-
Springer-Verlag
-
Melo, F. S. & Lopes, M. C. 2007. Convergence of independent adaptive learners. In Progress in Artificial Intelligence: 13th Portuguese Conference on Artificial Intelligence, Lecture Notes in Artificial Intelligence, 4874, 555-567. Springer-Verlag.
-
(2007)
Progress in Artificial Intelligence: 13th Portuguese Conference on Artificial Intelligence, Lecture Notes in Artificial Intelligence
, vol.4874
, pp. 555-567
-
-
Melo, F.S.1
Lopes, M.C.2
-
43
-
-
41549123971
-
Theoretical advantages of lenient learners: An evolutionary game theoretic perspective
-
Panait, L., Tuyls, K. & Luke, S. 2008. Theoretical advantages of lenient learners: An evolutionary game theoretic perspective. Journal of Machine Learning Research 9, 423-457.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 423-457
-
-
Panait, L.1
Tuyls, K.2
Luke, S.3
-
44
-
-
0012646255
-
Learning to cooperate via policy search
-
Morgan Kaufmann
-
Peshkin, L., Kim, K.-E., Meuleau, N. & Kaelbling, L. P. 2000. Learning to cooperate via policy search. In 16th Conference on Uncertainty in Artificial Intelligence, 307-314. Morgan Kaufmann.
-
(2000)
16th Conference on Uncertainty in Artificial Intelligence
, pp. 307-314
-
-
Peshkin, L.1
Kim, K.-E.2
Meuleau, N.3
Kaelbling, L.P.4
-
46
-
-
0028555752
-
Learning to coordinate without sharing information
-
Seattle, WA
-
Sen, S., Sekaran, M. & Hale, J. 1994. Learning to coordinate without sharing information. In Proceedings of the 12th National Conference on Artificial Intelligence, 426-431, Seattle, WA.
-
(1994)
Proceedings of the 12th National Conference on Artificial Intelligence
, pp. 426-431
-
-
Sen, S.1
Sekaran, M.2
Hale, J.3
-
48
-
-
0033901602
-
Convergence results for single-step on-policy reinforcement-learning algorithms
-
Singh, S. P., Jaakkola, T., Littman, M. L. & Szepesvari, C. 2000. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning 38(3), 287-308.
-
(2000)
Machine Learning
, vol.38
, Issue.3
, pp. 287-308
-
-
Singh, S.P.1
Jaakkola, T.2
Littman, M.L.3
Szepesvari, C.4
-
49
-
-
0034205975
-
Multiagent systems: A survey from a machine learning perspective
-
Stone, P. & Veloso, M. M. 2000. Multiagent systems: A survey from a machine learning perspective. Autonomous Robots 8(3), 345-383.
-
(2000)
Autonomous Robots
, vol.8
, Issue.3
, pp. 345-383
-
-
Stone, P.1
Veloso, M.M.2
-
51
-
-
85152198941
-
Multiagent reinforcement learning: Independent vs. cooperative agents
-
Morgan Kaufmann
-
Tan, M. 1993. Multiagent reinforcement learning: independent vs. cooperative agents. In Proceedings of the 10th International Conference on Machine Learning, 330-337. Morgan Kaufmann.
-
(1993)
Proceedings of the 10th International Conference on Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
54
-
-
28544446213
-
Evolutionary game theory and multi-agent reinforcement learning
-
Tuyls, K. & Nowé, A. 2005. Evolutionary game theory and multi-agent reinforcement learning. Knowledge Engineering Review 20(1), 63-90.
-
(2005)
Knowledge Engineering Review
, vol.20
, Issue.1
, pp. 63-90
-
-
Tuyls, K.1
Nowé, A.2
-
55
-
-
34247642270
-
Exploring selfish reinforcement learning in repeated games with stochastic rewards
-
Verbeeck, K., Nowé, A., Parent, J. & Tuyls, K. 2007. Exploring selfish reinforcement learning in repeated games with stochastic rewards. Autonomous Agents and Multi-Agent Systems 14(3), 239-269.
-
(2007)
Autonomous Agents and Multi-Agent Systems
, vol.14
, Issue.3
, pp. 239-269
-
-
Verbeeck, K.1
Nowé, A.2
Parent, J.3
Tuyls, K.4
-
56
-
-
34250651573
-
Multi-robot box-pushing: Single-agent Q-learning vs. team Q-learning
-
Wang, Y. & de Silva, C. W. 2006. Multi-robot box-pushing: single-agent Q-learning vs. team Q-learning. In Proceedings of the IROS, 3694-3699.
-
(2006)
Proceedings of the IROS
, pp. 3694-3699
-
-
Wang, Y.1
De Silva, C.W.2
-
58
-
-
34249833101
-
Technical note: Q-learning
-
Watkins, C. & Dayan, P. 1992. Technical note: Q-learning. Machine Learning 8, 279-292.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
60
-
-
0001309161
-
Optimal payoff functions for members of collectives
-
Wolpert, D. H. & Tumer, K. 2001. Optimal payoff functions for members of collectives. Advances in Complex Systems 4(2), 265-279.
-
(2001)
Advances in Complex Systems
, vol.4
, Issue.2
, pp. 265-279
-
-
Wolpert, D.H.1
Tumer, K.2