-
4
-
-
0004255876
-
-
Boston, Massachusetts:Addison-Wesley Longman Publishing Co., Inc., 2nd ed, ISBN = 0201558661
-
Astrom, K. J. and Wittenmark, B., Adaptive Control. Boston, Massachusetts:Addison-Wesley Longman Publishing Co., Inc., 2nd ed., 1994, ISBN = 0201558661.
-
(1994)
Adaptive Control
-
-
Astrom, K.J.1
Wittenmark, B.2
-
5
-
-
40949147745
-
A comprehensive survey of multiagent reinforcement learning
-
L. Buşoniu and R. Babuška, and B. D. Schutter, "A comprehensive survey of multiagent reinforcement learning," IEEE Trans. Syst. Man Cybern. Part C, Vol. 38, no. 2, pp. 156-172, 2008.
-
(2008)
IEEE Trans. Syst. Man Cybern. Part C
, vol.38
, Issue.2
, pp. 156-172
-
-
Buşoniu, L.1
Babuška, R.2
Schutter, B.D.3
-
8
-
-
0000016172
-
A stochastic approximation method
-
H. Robbins and S. Monro, "A stochastic approximation method," Annals of Mathematical Statistics, vol. 22, no. 3, pp. 400-407, 1951.
-
(1951)
Annals of Mathematical Statistics
, vol.22
, Issue.3
, pp. 400-407
-
-
Robbins, H.1
Monro, S.2
-
9
-
-
5244366647
-
On the stochastic approximation method of robbins and monro
-
J. Wolfowitz, "On the stochastic approximation method of robbins and monro," Annals of Mathematical Statistics, vol. 23, no. 3, pp. 457-461, 1952.
-
(1952)
Annals of Mathematical Statistics
, vol.23
, Issue.3
, pp. 457-461
-
-
Wolfowitz, J.1
-
12
-
-
0036531878
-
Multiagent learning using a variable learning rate
-
M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
-
(2002)
Artificial Intelligence
, vol.136
, Issue.2
, pp. 215-250
-
-
Bowling, M.1
Veloso, M.2
-
13
-
-
85012688561
-
-
Princeton, New Jersey: Princeton University Press
-
R. Bellman, Dynamic Programming. Princeton, New Jersey: Princeton University Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
16
-
-
34249833101
-
Q-learning
-
C. J. C. H. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3, pp. 279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
17
-
-
0000439891
-
On the convergence of stochasticiterative dynamic programming algorithms
-
T. Jaakkola, M. Jordan, and S. Singh, "On the convergence of stochasticiterative dynamic programming algorithms," Neural Computation, vol. 6, no. 6, pp. 1185-1201, 1994.
-
(1994)
Neural Computation
, vol.6
, Issue.6
, pp. 1185-1201
-
-
Jaakkola, T.1
Jordan, M.2
Singh, S.3
-
19
-
-
0004260006
-
-
San Diego, California: Academic Press
-
G. Owen, Game Theory. San Diego, California: Academic Press, 1995.
-
(1995)
Game Theory
-
-
Owen, G.1
-
20
-
-
0036531878
-
Multiagent learning using a variable learning rate
-
M. Bowling andM. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
-
(2002)
Artificial Intelligence
, vol.136
, Issue.2
, pp. 215-250
-
-
Bowling, M.1
Veloso, M.2
-
22
-
-
0028423534
-
Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information
-
P. Sastry, V. Phansalkar, and M. Thathachar, "Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information," IEEE Transactions on Systems, Man, and Cybernetics, vol. 24, no. 5, pp. 769-777, 1994.
-
(1994)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.24
, Issue.5
, pp. 769-777
-
-
Sastry, P.1
Phansalkar, V.2
Thathachar, M.3
-
24
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
New Brunswick, United States), July, 1994
-
M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in 11th International Conference on Machine Learning, (New Brunswick, United States), July 1994, pp. 157-163, 1994.
-
(1994)
11th International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
25
-
-
84906988849
-
On Multi-Agent Reinforcement Learning in Games
-
Ph.D. Thesis, Carleton University, Ottawa, ON
-
X. Lu, Ph.D., "On Multi-Agent Reinforcement Learning in Games." Ph.D. Thesis, Carleton University, Ottawa, ON, 2012.
-
(2012)
-
-
Lu, X.1
-
26
-
-
0001644761
-
Nash convergence of gradient dynamics in general-sum games
-
Stanford University, Stanford, California, USA, June 30 - July 3. 2000
-
S. P. Singh, M. J. Kearns, and Y. Mansour, "Nash convergence of gradient dynamics in general-sum games," in UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30 - July 3, 2000, pp. 541-548, 2000.
-
(2000)
UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence
, pp. 541-548
-
-
Singh, S.P.1
Kearns, M.J.2
Mansour, Y.3
-
27
-
-
34249833101
-
Q-learning
-
C. J. C. H. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3, pp. 279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
28
-
-
65149099581
-
A survey on multiagent reinforcement learning towards multi-robot systems
-
Proceedings of IEEE Symposium on Computational Intelligence and Games
-
E. Yang and D. Gu, "A survey on multiagent reinforcement learning towards multi-robot systems," in Proceedings of IEEE Symposium on Computational Intelligence and Games, 2005.
-
(2005)
-
-
Yang, E.1
Gu, D.2
-
29
-
-
34547192059
-
Multiagent reinforcement learning: a survey
-
9th International Conference on Control, Automation, Robotics and Vision (ICARCV)
-
L. Buşoniu, R. Babuška, and B. D. Schutter, "Multiagent reinforcement learning: a survey," 9th International Conference on Control, Automation, Robotics and Vision (ICARCV), pp. 1-6, 2006.
-
(2006)
, pp. 1-6
-
-
Buşoniu, L.1
Babuška, R.2
Schutter, B.D.3
-
30
-
-
77957757132
-
An investigation of guarding a territory problem in a grid world
-
American Control Conference
-
X. Lu and H. M. Schwartz, "An investigation of guarding a territory problem in a grid world," in American Control Conference, pp. 3204-3210, 2010.
-
(2010)
, pp. 3204-3210
-
-
Lu, X.1
Schwartz, H.M.2
-
31
-
-
0000268071
-
Learning algorithms for two-person zero-sum stochastic games with incomplete information
-
S. Lakshmivarahan and K. S. Narendra, "Learning algorithms for two-person zero-sum stochastic games with incomplete information," Mathematics of Operations Research, vol. 6, no. 3, pp. 379-386, 1981.
-
(1981)
Mathematics of Operations Research
, vol.6
, Issue.3
, pp. 379-386
-
-
Lakshmivarahan, S.1
Narendra, K.S.2
-
32
-
-
0020159814
-
Learning algorithms for two-person zero-sum stochastic games with incomplete information: a unified approach
-
S. Lakshmivarahan and K. S. Narendra, "Learning algorithms for two-person zero-sum stochastic games with incomplete information: a unified approach," SIAM Journal on Control and Optimization, vol. 20, no. 4, pp. 541-552, 1982.
-
(1982)
SIAM Journal on Control and Optimization
, vol.20
, Issue.4
, pp. 541-552
-
-
Lakshmivarahan, S.1
Narendra, K.S.2
-
33
-
-
0036778915
-
The lagging anchor algorithm: reinforcement learning in two-player zero-sum games with imperfect information
-
F. A. Dahl, "The lagging anchor algorithm: reinforcement learning in two-player zero-sum games with imperfect information," Machine Learning, vol. 49, pp. 5-37, 2002.
-
(2002)
Machine Learning
, vol.49
, pp. 5-37
-
-
Dahl, F.A.1
-
34
-
-
19644371249
-
The lagging anchor model for game learning-a solution to the crawford puzzle
-
F. A. Dahl, "The lagging anchor model for game learning-a solution to the crawford puzzle," Journal of Economic Behavior & Organization, vol. 57, pp. 287-303, 2005.
-
(2005)
Journal of Economic Behavior & Organization
, vol.57
, pp. 287-303
-
-
Dahl, F.A.1
-
37
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
New Brunswick, United States), July, 1994
-
M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in 11th International Conference on Machine Learning, (New Brunswick, United States), July 1994, pp. 157-163, 1994.
-
(1994)
11th International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
38
-
-
0036531878
-
Multiagent learning using a variable learning rate
-
M. Bowling andM. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
-
(2002)
Artificial Intelligence
, vol.136
, Issue.2
, pp. 215-250
-
-
Bowling, M.1
Veloso, M.2
-
40
-
-
22944447799
-
Multiagent Learning in the Presence of Agents with Limitations
-
PhD thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, May
-
M. Bowling, Multiagent Learning in the Presence of Agents with Limitations. PhD thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, May 2003.
-
(2003)
-
-
Bowling, M.1
-
41
-
-
84906988849
-
On Multi-Agent Reinforcement Learning in Games
-
Ph.D. Thesis Carleton University, Ottawa, ON, Canada
-
X. Lu, "On Multi-Agent Reinforcement Learning in Games," Ph.D. Thesis Carleton University, Ottawa, ON, Canada, 2012.
-
(2012)
-
-
Lu, X.1
-
42
-
-
0001961616
-
A generalized reinforcement-learning model: Convergence and applications
-
(Bari, Italy), July, 1996
-
M. L. Littman and C. Szepesvári, "A generalized reinforcement-learning model: Convergence and applications," in Proceedings of the 13th International Conference on Machine Learning, (Bari, Italy), July 1996, pp. 310-318, 1996.
-
(1996)
Proceedings of the 13th International Conference on Machine Learning
, pp. 310-318
-
-
Littman, M.L.1
Szepesvári, C.2
-
43
-
-
0000929496
-
Multiagent reinforcement learning: theoretical framework and an algorithm
-
Madison, Wisconsin, USA, July 24-27, 1998
-
J. Hu and M. P. Wellman, "Multiagent reinforcement learning: theoretical framework and an algorithm," in Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), Madison, Wisconsin, USA, July 24-27, 1998, pp. 242-250, 1998.
-
(1998)
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998)
, pp. 242-250
-
-
Hu, J.1
Wellman, M.P.2
-
44
-
-
4644369748
-
Nash q-learning for general-sum stochastic games
-
J. Hu andM. P.Wellman, "Nash q-learning for general-sum stochastic games," Journal of Machine Learning Research, vol. 4, pp. 1039-1069, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.4
, pp. 1039-1069
-
-
Hu, J.1
Wellman, M.P.2
-
48
-
-
0000176346
-
Equilibrium points of bimatrix games
-
C. E. Lemke and J. J. T. Howson, "Equilibrium points of bimatrix games," SIAM Journal on Applied Mathematics, vol. 12, no. 2, pp. 413-423, 1964.
-
(1964)
SIAM Journal on Applied Mathematics
, vol.12
, Issue.2
, pp. 413-423
-
-
Lemke, C.E.1
Howson, J.J.T.2
-
49
-
-
0042471966
-
-
Englewood Cliffs, New Jersey: Prentice-Hall
-
D. D. Meredith, K.W.Wong, R.W.Woodhead, and R. H.Wortman, Design and Planning of Engineering Systems. Englewood Cliffs, New Jersey: Prentice-Hall, 1973.
-
(1973)
Design and Planning of Engineering Systems
-
-
Meredith, D.D.1
Wong, K.W.2
Woodhead, R.W.3
Wortman, R.H.4
-
50
-
-
0003735415
-
The linear complimentary problem
-
San Diego, California: Academic Press, Inc
-
R.W. Cottle, J.-S. Pang, and R. E. Stone, "The linear complimentary problem," Computer Science and Scientific Computing, San Diego, California: Academic Press, Inc., 1992.
-
(1992)
Computer Science and Scientific Computing
-
-
Cottle, R.W.1
Pang, J.-S.2
Stone, R.E.3
-
51
-
-
84926397182
-
Study ofMultiple Multiagent Reinforcement Learning Algorithms in Grid Games
-
Master's thesis, Carleton University, Ottawa, ON, Canada
-
P. De Beck-Courcelle, "Study ofMultiple Multiagent Reinforcement Learning Algorithms in Grid Games", Master's thesis, Carleton University, Ottawa, ON, Canada, 2013.
-
(2013)
-
-
De Beck-Courcelle, P.1
-
53
-
-
34547192059
-
Multiagent reinforcement learning: a survey
-
L. Buşoniu, R. Babuška, and B. D. Schutter, "Multiagent reinforcement learning: a survey," in 9th International Conference on Control, Automation, Robotics and Vision (ICARCV), pp. 1-6, 2006.
-
(2006)
9th International Conference on Control, Automation, Robotics and Vision (ICARCV)
, pp. 1-6
-
-
Buşoniu, L.1
Babuška, R.2
Schutter, B.D.3
-
54
-
-
77957757132
-
An investigation of guarding a territory problem in a grid world
-
X. Lu and H. M. Schwartz, "An investigation of guarding a territory problem in a grid world," in American Control Conference, pp. 3204-3210, 2010.
-
(2010)
American Control Conference
, pp. 3204-3210
-
-
Lu, X.1
Schwartz, H.M.2
-
55
-
-
38249001350
-
A first approach to fuzzy differential game problem: guarding a territory
-
K. H. Hsia and J. G. Hsieh, "A first approach to fuzzy differential game problem: guarding a territory," Fuzzy Sets and Systems, vol. 55, pp. 157-167, 1993.
-
(1993)
Fuzzy Sets and Systems
, vol.55
, pp. 157-167
-
-
Hsia, K.H.1
Hsieh, J.G.2
-
56
-
-
0036721662
-
A strategy for a payoff-switching differential game based on fuzzy reasoning
-
Y. S. Lee, K. H. Hsia, and J. G. Hsieh, "A strategy for a payoff-switching differential game based on fuzzy reasoning," Fuzzy Sets and Systems, vol. 130, no. 2, pp. 237-251, 2002.
-
(2002)
Fuzzy Sets and Systems
, vol.130
, Issue.2
, pp. 237-251
-
-
Lee, Y.S.1
Hsia, K.H.2
Hsieh, J.G.3
-
57
-
-
40949147745
-
-
L. Buşoniu, R. Babuška, and B. D. Schutter, "A comprehensive survey of multiagent reinforcement learning," IEEE Transactions on Systems, Man, and Cybernetics Part C, vol. 38, no. 2, pp. 156-172, 2008.
-
(2008)
IEEE Transactions on Systems, Man, and Cybernetics Part C
, vol.38
, Issue.2
, pp. 156-172
-
-
Buşoniu, L.1
Babuška, R.2
Schutter, B.D.3
-
58
-
-
0034205975
-
Multiagent systems: a survey from a machine learning perspective
-
P. Stone and M. Veloso, "Multiagent systems: a survey from a machine learning perspective," Autonomous Robots, vol. 8, no. 3, pp. 345-383, 2000.
-
(2000)
Autonomous Robots
, vol.8
, Issue.3
, pp. 345-383
-
-
Stone, P.1
Veloso, M.2
-
59
-
-
0032207552
-
Colearning in differential games
-
J. W. Sheppard, "Colearning in differential games," Machine Learning, vol. 33, pp. 201-233, 1998.
-
(1998)
Machine Learning
, vol.33
, pp. 201-233
-
-
Sheppard, J.W.1
-
60
-
-
84891544020
-
Exponential Moving Average Q-Learning Algorithm
-
Proceedings of the IEEE Symposium Series on Computational Intelligence, Singapore, April 15-19
-
M. Awheda, and Schwartz, H.M., "Exponential Moving Average Q-Learning Algorithm", Proceedings of the IEEE Symposium Series on Computational Intelligence, Singapore, April 15-19, 2013.
-
(2013)
-
-
Awheda, M.1
Schwartz, H.M.2
-
61
-
-
70350566689
-
Effective learning in the presence of adaptive counterparts
-
A. Burkov and B. Chaib-draa, "Effective learning in the presence of adaptive counterparts," Journal of Algorithms, vol. 64, no. 4, pp. 127-138, 2009.
-
(2009)
Journal of Algorithms
, vol.64
, Issue.4
, pp. 127-138
-
-
Burkov, A.1
Chaib-draa, B.2
-
62
-
-
84898941549
-
Extending q-learning to general adaptive multi-agent systems
-
(S. Thrun, L. K. Saul and B. Schölkopf, eds.), (Cambridge, Massachusetts), MIT Press
-
G. Tesauro, "Extending q-learning to general adaptive multi-agent systems," in Advances in Neural Information Processing Systems 16 (S. Thrun, L. K. Saul and B. Schölkopf, eds.), (Cambridge, Massachusetts), pp. 215-250, MIT Press, 2004.
-
(2004)
Advances in Neural Information Processing Systems 16
, pp. 215-250
-
-
Tesauro, G.1
-
63
-
-
84899027977
-
Convergence and no-regret in multiagent learning
-
(L. K. Saul, Y.Weiss and L. Bottou, eds.), (Cambridge, Massachusetts), MIT Press
-
M. Bowling, "Convergence and no-regret in multiagent learning," in Advances in Neural Information Processing Systems 17 (L. K. Saul, Y.Weiss and L. Bottou, eds.), (Cambridge, Massachusetts), pp. 209-216, MIT Press, 2005.
-
(2005)
Advances in Neural Information Processing Systems 17
, pp. 209-216
-
-
Bowling, M.1
-
64
-
-
70350699723
-
Amultiagent reinforcement learning algorithm with non-linear dynamics
-
S. Abdallah and V. Lesser, "Amultiagent reinforcement learning algorithm with non-linear dynamics," Journal of Artificial Intelligence Research, vol. 33, pp. 521-549, 2008.
-
(2008)
Journal of Artificial Intelligence Research
, vol.33
, pp. 521-549
-
-
Abdallah, S.1
Lesser, V.2
-
65
-
-
85099723578
-
Multi-agent learning with policy prediction
-
Atlanta, GA, USA
-
C. Zhang and V. Lesser, "Multi-agent learning with policy prediction," in Proceedings of the 24th National Conference on Artificial Intelligence (AAAI'10), Atlanta, GA, USA, pp. 746-752, 2010.
-
(2010)
Proceedings of the 24th National Conference on Artificial Intelligence (AAAI'10
, pp. 746-752
-
-
Zhang, C.1
Lesser, V.2
-
66
-
-
78951475039
-
Mission-driven robotic intelligent sensor agents for territorial security
-
R. Abielmona, E. Petriu, M. Harb, and S. Wesolkowski, "Mission-driven robotic intelligent sensor agents for territorial security," IEEE Computational Intelligence Magazine, vol. 6, no. 1, pp. 55-67, 2011.
-
(2011)
IEEE Computational Intelligence Magazine
, vol.6
, Issue.1
, pp. 55-67
-
-
Abielmona, R.1
Petriu, E.2
Harb, M.3
Wesolkowski, S.4
-
68
-
-
79957749002
-
Reinforcement learning applied to a differential game
-
M. E. Harmon, L. C. Baird III, and A. H. Klopf, "Reinforcement learning applied to a differential game," Adaptive Behavior, vol. 4, no. 1, pp. 3-28, 1995.
-
(1995)
Adaptive Behavior
, vol.4
, Issue.1
, pp. 3-28
-
-
Harmon, M.E.1
Baird, L.C.2
Klopf, A.H.3
-
69
-
-
0032207552
-
Colearning in differential games
-
J. W. Sheppard, "Colearning in differential games," Machine Learning, vol. 33, pp. 201-233, 1998.
-
(1998)
Machine Learning
, vol.33
, pp. 201-233
-
-
Sheppard, J.W.1
-
70
-
-
77953027402
-
A reinforcement learning adaptive fuzzy controller for differential games
-
S. N. Givigi, H. M. Schwartz, and X. Lu, "A reinforcement learning adaptive fuzzy controller for differential games," Journal of Intelligent and Robotic Systems, vol. 59, pp. 3-30, 2010.
-
(2010)
Journal of Intelligent and Robotic Systems
, vol.59
, pp. 3-30
-
-
Givigi, S.N.1
Schwartz, H.M.2
Lu, X.3
-
71
-
-
78649832013
-
Self-learning fuzzy logic controllers for pursuit-evasion differential games
-
S. F. Desouky and H. M. Schwartz, "Self-learning fuzzy logic controllers for pursuit-evasion differential games," Robotics and Autonomous Systems, vol. 59, pp. 22-33, 2011.
-
(2011)
Robotics and Autonomous Systems
, vol.59
, pp. 22-33
-
-
Desouky, S.F.1
Schwartz, H.M.2
-
72
-
-
38249001350
-
A first approach to fuzzy differential game problem: guarding a territory
-
K. H. Hsia and J. G. Hsieh, "A first approach to fuzzy differential game problem: guarding a territory," Fuzzy Sets and Systems, vol. 55, pp. 157-167, 1993.
-
(1993)
Fuzzy Sets and Systems
, vol.55
, pp. 157-167
-
-
Hsia, K.H.1
Hsieh, J.G.2
-
73
-
-
0036721662
-
A strategy for a payoff-switching differential game based on fuzzy reasoning
-
Y. S. Lee, K. H. Hsia, and J. G. Hsieh, "A strategy for a payoff-switching differential game based on fuzzy reasoning," Fuzzy Sets and Systems, vol. 130, no. 2, pp. 237-251, 2002.
-
(2002)
Fuzzy Sets and Systems
, vol.130
, Issue.2
, pp. 237-251
-
-
Lee, Y.S.1
Hsia, K.H.2
Hsieh, J.G.3
-
74
-
-
0032140718
-
Fuzzy inference system learning by reinforcement methods
-
L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Transactions on Systems, Man, and Cybernetics Part C, vol. 28, no. 3, pp. 338-355, 1998.
-
(1998)
IEEE Transactions on Systems, Man, and Cybernetics Part C
, vol.28
, Issue.3
, pp. 338-355
-
-
Jouffe, L.1
-
75
-
-
0003947619
-
-
Boston, Massachusetts: Addison-Wesley Longman Publishing Co., Inc., 1st ed.
-
K. M. Passino and S. Yurkovich, Fuzzy Control. Boston, Massachusetts: Addison-Wesley Longman Publishing Co., Inc., 1st ed., 1998.
-
(1998)
Fuzzy Control
-
-
Passino, K.M.1
Yurkovich, S.2
-
77
-
-
34248666540
-
Fuzzy sets
-
L. A. Zadeh, "Fuzzy sets," Information and Control, vol. 8, no. 3, pp. 338-353, 1965.
-
(1965)
Information and Control
, vol.8
, Issue.3
, pp. 338-353
-
-
Zadeh, L.A.1
-
78
-
-
84926397180
-
Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning
-
Master's thesis, Carleton University, Ottawa, ON, Canada
-
B. Al Faiya, "Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning," Master's thesis, Carleton University, Ottawa, ON, Canada, 2012.
-
(2012)
-
-
Al Faiya, B.1
-
79
-
-
0021892282
-
Fuzzy identification of systems and its applications to modelling and control
-
SMC-15
-
T. Takagi and M. Sugeno, "Fuzzy identification of systems and its applications to modelling and control," IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-15, pp. 116-132, 1985.
-
(1985)
IEEE Transactions on Systems, Man, and Cybernetics
, pp. 116-132
-
-
Takagi, T.1
Sugeno, M.2
-
81
-
-
0004105094
-
Design of Fuzzy Controllers
-
Technical Univ. of Denmark: Technical Report (No:98-E864) Department of Automation, Hoboken, NJ
-
J. Jantzen, Design of Fuzzy Controllers. Technical Univ. of Denmark: Technical Report (No:98-E864) Department of Automation, Hoboken, NJ, 1999.
-
(1999)
-
-
Jantzen, J.1
-
83
-
-
84906988849
-
On Multi-Agent Reinforcement Learning in Games
-
Ph.D. Thesis Carleton University, Ottawa, ON, Canada
-
X. Lu, "On Multi-Agent Reinforcement Learning in Games." Ph.D. Thesis Carleton University, Ottawa, ON, Canada, 2012.
-
(2012)
-
-
Lu, X.1
-
84
-
-
27744536933
-
An approach to tune fuzzy contorllers based on reinforcement learning for autonomous vehicle control
-
X. Dai, C. Li, and A. Rad, "An approach to tune fuzzy contorllers based on reinforcement learning for autonomous vehicle control," IEEE Transactions on Intelligent Transportation Systems, vol. 6, no. 3, pp. 285-293, 2005.
-
(2005)
IEEE Transactions on Intelligent Transportation Systems
, vol.6
, Issue.3
, pp. 285-293
-
-
Dai, X.1
Li, C.2
Rad, A.3
-
86
-
-
0021892282
-
Fuzzy identification of systems and its application to modeling and control
-
T. Takagi and M. Sugeno, "Fuzzy identification of systems and its application to modeling and control," IEEE Transactions on Systems, Man, and Cybernetics, vol. 15, pp. 116-132, 1985.
-
(1985)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.15
, pp. 116-132
-
-
Takagi, T.1
Sugeno, M.2
-
89
-
-
74849131377
-
An experimental adaptive fuzzy controller for differential games
-
San Antonio, United States), Oct
-
S. N. Givigi, H. M. Schwartz, and X. Lu, "An experimental adaptive fuzzy controller for differential games," in Proceedings IEEE Systems, Man and Cybernetics'09, (San Antonio, United States), Oct 2009.
-
(2009)
Proceedings IEEE Systems, Man and Cybernetics'09
-
-
Givigi, S.N.1
Schwartz, H.M.2
Lu, X.3
-
90
-
-
0032072654
-
Adaptive fuzzy control of satellite attitude by reinforcement learning
-
W. M. van Buijtenen,G. Schram, R. Babuska, and H. B. Verbruggen, "Adaptive fuzzy control of satellite attitude by reinforcement learning," IEEE Transactions on Fuzzy Systems, vol. 6, no. 2, pp. 185-194, 1998.
-
(1998)
IEEE Transactions on Fuzzy Systems
, vol.6
, Issue.2
, pp. 185-194
-
-
Van Buijtenen, W.M.1
Schram, G.2
Babuska, R.3
Verbruggen, H.B.4
-
91
-
-
0004251759
-
-
New York: John Wiley & Sons, Inc.
-
R. Isaacs, Differential Games. New York: John Wiley & Sons, Inc., 1965.
-
(1965)
Differential Games
-
-
Isaacs, R.1
-
92
-
-
85003220328
-
The homicidal chauffeur
-
A. MERZ, "The homicidal chauffeur," AIAA Journal, vol. 12, pp. 259-260, 1974.
-
(1974)
AIAA Journal
, vol.12
, pp. 259-260
-
-
Merz, A.1
-
94
-
-
3042555488
-
A time-optimal control strategy for pursuit-evasion games problems
-
New Orleans, LA), April
-
S. H. Lim, T. Furukawa, G. Dissanayake, and H. F. D. Whyte, "A time-optimal control strategy for pursuit-evasion games problems," in Proceedings of the 2004 IEEE International Conference on Robotics and Automation, (New Orleans, LA), April 2004.
-
(2004)
Proceedings of the 2004 IEEE International Conference on Robotics and Automation
-
-
Lim, S.H.1
Furukawa, T.2
Dissanayake, G.3
Whyte, H.F.D.4
-
95
-
-
74849091001
-
Hybrid intelligent systems applied to the pursuit-evasion game
-
IEEE International Conference on, (San Antonio, TX, October, 2009
-
S. Desouky and H. Schwartz, "Hybrid intelligent systems applied to the pursuit-evasion game," in Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on, (San Antonio, TX, October 2009, pp. 2603-2608, 2009.
-
(2009)
Systems, Man and Cybernetics, 2009. SMC 2009
, pp. 2603-2608
-
-
Desouky, S.1
Schwartz, H.2
-
96
-
-
78751479831
-
Q(λ)-learning fuzzy logic controller for a multi-robot system
-
SMC 2010, (Istanbul, Turkey), October
-
S. Desouky and H. Schwartz, "Q(λ)-learning fuzzy logic controller for a multi-robot system," in IEEE International Conference on Systems, Man and Cybernetics, 2010. SMC 2010, (Istanbul, Turkey), October 2010.
-
(2010)
IEEE International Conference on Systems, Man and Cybernetics, 2010
-
-
Desouky, S.1
Schwartz, H.2
-
97
-
-
70449675033
-
Genetic based fuzzy logic controller for a wall-following mobile robot
-
(St. Louis, MO), June 2009, IEEE
-
S. F. Desouky and H. M. Schwartz, "Genetic based fuzzy logic controller for a wall-following mobile robot," in American Control Conference, 2009 ACC 2009, (St. Louis, MO), June 2009, pp. 3555-3560, IEEE, 2009.
-
(2009)
American Control Conference, 2009 ACC 2009
, pp. 3555-3560
-
-
Desouky, S.F.1
Schwartz, H.M.2
-
98
-
-
77950298151
-
Online learning of shaping rewards in reinforcement learning
-
M. Grzés and D. Kudenko, "Online learning of shaping rewards in reinforcement learning," Neural Networks, vol. 23, pp. 541-550, 2010.
-
(2010)
Neural Networks
, vol.23
, pp. 541-550
-
-
Grzés, M.1
Kudenko, D.2
-
99
-
-
0030647149
-
Reinforcement learning in the multi-robot domain
-
M. J. Matarić, "Reinforcement learning in the multi-robot domain," Autonomous Robots, vol. 4, pp. 73-83, 1997.
-
(1997)
Autonomous Robots
, vol.4
, pp. 73-83
-
-
Matarić, M.J.1
-
102
-
-
34250608222
-
La Reconstruction du nid et les Coordinations Inter-Individuelles chez Bellicosistermes Natalensis et Cubetermes sp. La théorie de la Stigmergie: Essai d'interprétation du Comportement des Termites Constructeurs
-
P. P. Grassé, "La Reconstruction du nid et les Coordinations Inter-Individuelles chez Bellicosistermes Natalensis et Cubetermes sp. La théorie de la Stigmergie: Essai d'interprétation du Comportement des Termites Constructeurs," Insectes Sociaux, vol. 6, pp. 41-80, 1959.
-
(1959)
Insectes Sociaux
, vol.6
, pp. 41-80
-
-
Grassé, P.P.1
-
103
-
-
2642586048
-
Cofields: a physically inspired approach to motion cordination
-
M. Mamei, F. Zambonelli, and L. Leonardi, "Cofields: a physically inspired approach to motion cordination," IEEE Pervasive Computing, vol. 3, no. 2, pp. 52-61, 2004.
-
(2004)
IEEE Pervasive Computing
, vol.3
, Issue.2
, pp. 52-61
-
-
Mamei, M.1
Zambonelli, F.2
Leonardi, L.3
-
104
-
-
19944387938
-
Cooperating unmanned vehicles
-
R. Chalmers, D. Scheidt, T. Neighoff, S. Witwicki, and R. Bamberger, "Cooperating unmanned vehicles," in AIAA 1st Intelligent Systems Technical Conference, 2004.
-
(2004)
AIAA 1st Intelligent Systems Technical Conference
-
-
Chalmers, R.1
Scheidt, D.2
Neighoff, T.3
Witwicki, S.4
Bamberger, R.5
-
105
-
-
0024733382
-
Real time obstable avoidance for fast mobile robots
-
J. Borenstein and Y. Koren, "Real time obstable avoidance for fast mobile robots," IEEE Transactions on Systems, Man, and Cybernetics, vol. 19, no. 5, pp. 1179-1187, 1989.
-
(1989)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.19
, Issue.5
, pp. 1179-1187
-
-
Borenstein, J.1
Koren, Y.2
-
106
-
-
0004240310
-
-
New York, New York: Simon & Schuster
-
M. Mynsk, The Society of Mind. New York, New York: Simon & Schuster, 1986.
-
(1986)
The Society of Mind
-
-
Mynsk, M.1
-
107
-
-
34547152789
-
A game theoretic approach to swarm robotics
-
S. N. Givigi and H.M. Schwartz, "A game theoretic approach to swarm robotics," Applied Bionics and Biomechanics, vol. 3, no. 3, pp. 131-142, 2006.
-
(2006)
Applied Bionics and Biomechanics
, vol.3
, Issue.3
, pp. 131-142
-
-
Givigi, S.N.1
Schwartz, H.M.2
-
109
-
-
0038068981
-
Self-organizing multi-robot system based on personality evolution
-
D. Yingying, H. Yan, and J. Jing-ping, "Self-organizing multi-robot system based on personality evolution," in IEEE International Conference on Systems, Man, and Cybernetics, vol. 5, 2002.
-
(2002)
IEEE International Conference on Systems, Man, and Cybernetics
, vol.5
-
-
Yingying, D.1
Yan, H.2
Jing-ping, J.3
-
110
-
-
27744444690
-
Evolutionary swarm intelligence applied to robotics
-
Prooceedings of the IEEE International Conference on Mechatronics andAutomation
-
S. Givigi and H. M. Schwartz, "Evolutionary swarm intelligence applied to robotics," in Prooceedings of the IEEE International Conference on Mechatronics andAutomation, pp. 1005-1010, 2005.
-
(2005)
, pp. 1005-1010
-
-
Givigi, S.1
Schwartz, H.M.2
-
111
-
-
23944525907
-
Swarm robotics: from sources of inspiration to domains of application
-
(E. Sahin and W. M. Spears, eds.), (Berlin, Heidelberg), Springer-Verlag
-
E. Sahin, "Swarm robotics: from sources of inspiration to domains of application," in Swarm Robotics: SAB 2004 International Workshop, Santa Monica, CA, USA, July 17, 2004: revised selected papers (E. Sahin and W. M. Spears, eds.), (Berlin, Heidelberg), pp. 10-20, Springer-Verlag, 2005.
-
(2005)
Swarm Robotics: SAB 2004 International Workshop, Santa Monica, CA, USA, July 17, 2004: revised selected papers
, pp. 10-20
-
-
Sahin, E.1
-
112
-
-
23944509105
-
Fromswarmintelligence to swarmrobotics
-
(E. Sahin and W. M. Spears, eds.), (Berlin, Heidelberg), Springer-Verlag
-
G. Beni, "Fromswarmintelligence to swarmrobotics," in SwarmRobotics: SAB 2004 International workshop, Santa Monica, CA, USA, July 17, 2004: revised selected papers (E. Sahin and W. M. Spears, eds.), (Berlin, Heidelberg), pp. 1-9, Springer-Verlag, 2005.
-
(2005)
SwarmRobotics: SAB 2004 International workshop, Santa Monica, CA, USA, July 17, 2004: revised selected papers
, pp. 1-9
-
-
Beni, G.1
-
113
-
-
3142710864
-
Evolving self-organizing behaviours for a swarm-bot
-
M. Dorigo, V. Trianni, E. Şahin, R. Groß, T. Labella, G. Baldassarre, S. Nolfi, J. Deneubourg, F. Mondada, D. Floreano, and L. Gambardella, "Evolving self-organizing behaviours for a swarm-bot," Autonomous Robots, vol. 17, pp. 223-245, 2004.
-
(2004)
Autonomous Robots
, vol.17
, pp. 223-245
-
-
Dorigo, M.1
Trianni, V.2
Şahin, E.3
Groß, R.4
Labella, T.5
Baldassarre, G.6
Nolfi, S.7
Deneubourg, J.8
Mondada, F.9
Floreano, D.10
Gambardella, L.11
-
114
-
-
0004187979
-
-
Washington, District of Columbia: The Mathematical Association of America
-
P. D. Straffin, Game Theory and Strategy. Washington, District of Columbia: The Mathematical Association of America, 1993.
-
(1993)
Game Theory and Strategy
-
-
Straffin, P.D.1
-
115
-
-
3843059082
-
Unified convergence proofs of continuous-time fictitious play
-
J. S. Shamma and G. Arslan, "Unified convergence proofs of continuous-time fictitious play," IEEE Transactions on Automatic Control, vol. 49, no. 7, pp. 1137-1142, 2004.
-
(2004)
IEEE Transactions on Automatic Control
, vol.49
, Issue.7
, pp. 1137-1142
-
-
Shamma, J.S.1
Arslan, G.2
-
117
-
-
0001402950
-
An iterative method of solving a game
-
J. Robinson, "An iterative method of solving a game," Annals of Mathmatics, vol. 54, no. 2, pp. 296-301, 1951.
-
(1951)
Annals of Mathmatics
, vol.54
, Issue.2
, pp. 296-301
-
-
Robinson, J.1
-
119
-
-
0029679044
-
Reinforcement learning: a survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: a survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
120
-
-
0004149207
-
-
New York, New York: Oxford University Press, new ed.
-
R. Dawkins, The Selfish Gene. New York, New York: Oxford University Press, new ed., 1989.
-
(1989)
The Selfish Gene
-
-
Dawkins, R.1
-
121
-
-
84926400379
-
Analysis and Design of Swarm Based Robots Using Game Theory
-
Ph.D. Thesis, Ottawa, ON: Carleton University, Sept
-
G. Sidney, "Analysis and Design of Swarm Based Robots Using Game Theory". Ph.D. Thesis, Ottawa, ON: Carleton University, Sept. 2009.
-
(2009)
-
-
Sidney, G.1
|