



Volume 9781118362082, Issue , 2014, Pages 1-242

Multi-Agent Machine Learning: A Reinforcement Approach

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; ENGINEERING EDUCATION; FUZZY CONTROL; GAME THEORY; LEARNING ALGORITHMS; LEAST SQUARES APPROXIMATIONS; MEAN SQUARE ERROR; MULTI AGENT SYSTEMS; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS; STUDENTS;

EID: 84924356033     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9781118884614     Document Type: Book
Times cited : (116)

References (121)
  • 4
    • 0004255876
    • Boston, Massachusetts: Addison-Wesley Longman Publishing Co., Inc., 2nd ed., ISBN 0201558661
    • Astrom, K. J. and Wittenmark, B., Adaptive Control. Boston, Massachusetts: Addison-Wesley Longman Publishing Co., Inc., 2nd ed., 1994, ISBN 0201558661.
    • (1994) Adaptive Control
    • Astrom, K.J.1    Wittenmark, B.2
  • 5
    • 40949147745
    • A comprehensive survey of multiagent reinforcement learning
    • L. Buşoniu, R. Babuška, and B. D. Schutter, "A comprehensive survey of multiagent reinforcement learning," IEEE Trans. Syst. Man Cybern. Part C, vol. 38, no. 2, pp. 156-172, 2008.
    • (2008) IEEE Trans. Syst. Man Cybern. Part C , vol.38 , Issue.2 , pp. 156-172
    • Buşoniu, L.1    Babuška, R.2    Schutter, B.D.3
  • 8
    • 0000016172
    • A stochastic approximation method
    • H. Robbins and S. Monro, "A stochastic approximation method," Annals of Mathematical Statistics, vol. 22, no. 3, pp. 400-407, 1951.
    • (1951) Annals of Mathematical Statistics , vol.22 , Issue.3 , pp. 400-407
    • Robbins, H.1    Monro, S.2
  • 9
    • 5244366647
    • On the stochastic approximation method of Robbins and Monro
    • J. Wolfowitz, "On the stochastic approximation method of Robbins and Monro," Annals of Mathematical Statistics, vol. 23, no. 3, pp. 457-461, 1952.
    • (1952) Annals of Mathematical Statistics , vol.23 , Issue.3 , pp. 457-461
    • Wolfowitz, J.1
  • 12
    • 0036531878
    • Multiagent learning using a variable learning rate
    • M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
    • (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 13
    • 85012688561
    • Princeton, New Jersey: Princeton University Press
    • R. Bellman, Dynamic Programming. Princeton, New Jersey: Princeton University Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 17
    • 0000439891
    • On the convergence of stochastic iterative dynamic programming algorithms
    • T. Jaakkola, M. Jordan, and S. Singh, "On the convergence of stochastic iterative dynamic programming algorithms," Neural Computation, vol. 6, no. 6, pp. 1185-1201, 1994.
    • (1994) Neural Computation , vol.6 , Issue.6 , pp. 1185-1201
    • Jaakkola, T.1    Jordan, M.2    Singh, S.3
  • 19
    • 0004260006
    • San Diego, California: Academic Press
    • G. Owen, Game Theory. San Diego, California: Academic Press, 1995.
    • (1995) Game Theory
    • Owen, G.1
  • 20
    • 0036531878
    • Multiagent learning using a variable learning rate
    • M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
    • (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 22
    • 0028423534
    • Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information
    • P. Sastry, V. Phansalkar, and M. Thathachar, "Decentralized learning of Nash equilibria in multi-person stochastic games with incomplete information," IEEE Transactions on Systems, Man, and Cybernetics, vol. 24, no. 5, pp. 769-777, 1994.
    • (1994) IEEE Transactions on Systems, Man, and Cybernetics , vol.24 , Issue.5 , pp. 769-777
    • Sastry, P.1    Phansalkar, V.2    Thathachar, M.3
  • 24
    • 85149834820
    • Markov games as a framework for multi-agent reinforcement learning
    • (New Brunswick, United States), July 1994
    • M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in 11th International Conference on Machine Learning, (New Brunswick, United States), July 1994, pp. 157-163, 1994.
    • (1994) 11th International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 25
    • 84906988849
    • On Multi-Agent Reinforcement Learning in Games
    • Ph.D. Thesis, Carleton University, Ottawa, ON
    • X. Lu, "On Multi-Agent Reinforcement Learning in Games," Ph.D. Thesis, Carleton University, Ottawa, ON, 2012.
    • (2012)
    • Lu, X.1
  • 28
    • 65149099581
    • A survey on multiagent reinforcement learning towards multi-robot systems
    • Proceedings of IEEE Symposium on Computational Intelligence and Games
    • E. Yang and D. Gu, "A survey on multiagent reinforcement learning towards multi-robot systems," in Proceedings of IEEE Symposium on Computational Intelligence and Games, 2005.
    • (2005)
    • Yang, E.1    Gu, D.2
  • 29
    • 34547192059
    • Multiagent reinforcement learning: a survey
    • 9th International Conference on Control, Automation, Robotics and Vision (ICARCV)
    • L. Buşoniu, R. Babuška, and B. D. Schutter, "Multiagent reinforcement learning: a survey," 9th International Conference on Control, Automation, Robotics and Vision (ICARCV), pp. 1-6, 2006.
    • (2006) , pp. 1-6
    • Buşoniu, L.1    Babuška, R.2    Schutter, B.D.3
  • 30
    • 77957757132
    • An investigation of guarding a territory problem in a grid world
    • American Control Conference
    • X. Lu and H. M. Schwartz, "An investigation of guarding a territory problem in a grid world," in American Control Conference, pp. 3204-3210, 2010.
    • (2010) , pp. 3204-3210
    • Lu, X.1    Schwartz, H.M.2
  • 31
    • 0000268071
    • Learning algorithms for two-person zero-sum stochastic games with incomplete information
    • S. Lakshmivarahan and K. S. Narendra, "Learning algorithms for two-person zero-sum stochastic games with incomplete information," Mathematics of Operations Research, vol. 6, no. 3, pp. 379-386, 1981.
    • (1981) Mathematics of Operations Research , vol.6 , Issue.3 , pp. 379-386
    • Lakshmivarahan, S.1    Narendra, K.S.2
  • 32
    • 0020159814
    • Learning algorithms for two-person zero-sum stochastic games with incomplete information: a unified approach
    • S. Lakshmivarahan and K. S. Narendra, "Learning algorithms for two-person zero-sum stochastic games with incomplete information: a unified approach," SIAM Journal on Control and Optimization, vol. 20, no. 4, pp. 541-552, 1982.
    • (1982) SIAM Journal on Control and Optimization , vol.20 , Issue.4 , pp. 541-552
    • Lakshmivarahan, S.1    Narendra, K.S.2
  • 33
    • 0036778915
    • The lagging anchor algorithm: reinforcement learning in two-player zero-sum games with imperfect information
    • F. A. Dahl, "The lagging anchor algorithm: reinforcement learning in two-player zero-sum games with imperfect information," Machine Learning, vol. 49, pp. 5-37, 2002.
    • (2002) Machine Learning , vol.49 , pp. 5-37
    • Dahl, F.A.1
  • 34
    • 19644371249
    • The lagging anchor model for game learning: a solution to the Crawford puzzle
    • F. A. Dahl, "The lagging anchor model for game learning: a solution to the Crawford puzzle," Journal of Economic Behavior & Organization, vol. 57, pp. 287-303, 2005.
    • (2005) Journal of Economic Behavior & Organization , vol.57 , pp. 287-303
    • Dahl, F.A.1
  • 37
    • 85149834820
    • Markov games as a framework for multi-agent reinforcement learning
    • (New Brunswick, United States), July 1994
    • M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in 11th International Conference on Machine Learning, (New Brunswick, United States), July 1994, pp. 157-163, 1994.
    • (1994) 11th International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 38
    • 0036531878
    • Multiagent learning using a variable learning rate
    • M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
    • (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 40
    • 22944447799
    • Multiagent Learning in the Presence of Agents with Limitations
    • PhD thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, May
    • M. Bowling, Multiagent Learning in the Presence of Agents with Limitations. PhD thesis, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, May 2003.
    • (2003)
    • Bowling, M.1
  • 41
    • 84906988849
    • On Multi-Agent Reinforcement Learning in Games
    • Ph.D. Thesis Carleton University, Ottawa, ON, Canada
    • X. Lu, "On Multi-Agent Reinforcement Learning in Games," Ph.D. Thesis Carleton University, Ottawa, ON, Canada, 2012.
    • (2012)
    • Lu, X.1
  • 44
    • 4644369748
    • Nash q-learning for general-sum stochastic games
    • J. Hu and M. P. Wellman, "Nash q-learning for general-sum stochastic games," Journal of Machine Learning Research, vol. 4, pp. 1039-1069, 2003.
    • (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
    • Hu, J.1    Wellman, M.P.2
  • 51
    • 84926397182
    • Study of Multiple Multiagent Reinforcement Learning Algorithms in Grid Games
    • Master's thesis, Carleton University, Ottawa, ON, Canada
    • P. De Beck-Courcelle, "Study of Multiple Multiagent Reinforcement Learning Algorithms in Grid Games," Master's thesis, Carleton University, Ottawa, ON, Canada, 2013.
    • (2013)
    • De Beck-Courcelle, P.1
  • 54
    • 77957757132
    • An investigation of guarding a territory problem in a grid world
    • X. Lu and H. M. Schwartz, "An investigation of guarding a territory problem in a grid world," in American Control Conference, pp. 3204-3210, 2010.
    • (2010) American Control Conference , pp. 3204-3210
    • Lu, X.1    Schwartz, H.M.2
  • 55
    • 38249001350
    • A first approach to fuzzy differential game problem: guarding a territory
    • K. H. Hsia and J. G. Hsieh, "A first approach to fuzzy differential game problem: guarding a territory," Fuzzy Sets and Systems, vol. 55, pp. 157-167, 1993.
    • (1993) Fuzzy Sets and Systems , vol.55 , pp. 157-167
    • Hsia, K.H.1    Hsieh, J.G.2
  • 56
    • 0036721662
    • A strategy for a payoff-switching differential game based on fuzzy reasoning
    • Y. S. Lee, K. H. Hsia, and J. G. Hsieh, "A strategy for a payoff-switching differential game based on fuzzy reasoning," Fuzzy Sets and Systems, vol. 130, no. 2, pp. 237-251, 2002.
    • (2002) Fuzzy Sets and Systems , vol.130 , Issue.2 , pp. 237-251
    • Lee, Y.S.1    Hsia, K.H.2    Hsieh, J.G.3
  • 58
    • 0034205975
    • Multiagent systems: a survey from a machine learning perspective
    • P. Stone and M. Veloso, "Multiagent systems: a survey from a machine learning perspective," Autonomous Robots, vol. 8, no. 3, pp. 345-383, 2000.
    • (2000) Autonomous Robots , vol.8 , Issue.3 , pp. 345-383
    • Stone, P.1    Veloso, M.2
  • 59
    • 0032207552
    • Colearning in differential games
    • J. W. Sheppard, "Colearning in differential games," Machine Learning, vol. 33, pp. 201-233, 1998.
    • (1998) Machine Learning , vol.33 , pp. 201-233
    • Sheppard, J.W.1
  • 60
    • 84891544020
    • Exponential Moving Average Q-Learning Algorithm
    • Proceedings of the IEEE Symposium Series on Computational Intelligence, Singapore, April 15-19
    • M. Awheda and H. M. Schwartz, "Exponential Moving Average Q-Learning Algorithm," Proceedings of the IEEE Symposium Series on Computational Intelligence, Singapore, April 15-19, 2013.
    • (2013)
    • Awheda, M.1    Schwartz, H.M.2
  • 61
    • 70350566689
    • Effective learning in the presence of adaptive counterparts
    • A. Burkov and B. Chaib-draa, "Effective learning in the presence of adaptive counterparts," Journal of Algorithms, vol. 64, no. 4, pp. 127-138, 2009.
    • (2009) Journal of Algorithms , vol.64 , Issue.4 , pp. 127-138
    • Burkov, A.1    Chaib-draa, B.2
  • 62
    • 84898941549
    • Extending q-learning to general adaptive multi-agent systems
    • (S. Thrun, L. K. Saul and B. Schölkopf, eds.), (Cambridge, Massachusetts), MIT Press
    • G. Tesauro, "Extending q-learning to general adaptive multi-agent systems," in Advances in Neural Information Processing Systems 16 (S. Thrun, L. K. Saul and B. Schölkopf, eds.), (Cambridge, Massachusetts), pp. 215-250, MIT Press, 2004.
    • (2004) Advances in Neural Information Processing Systems 16 , pp. 215-250
    • Tesauro, G.1
  • 63
    • 84899027977
    • Convergence and no-regret in multiagent learning
    • (L. K. Saul, Y. Weiss and L. Bottou, eds.), (Cambridge, Massachusetts), MIT Press
    • M. Bowling, "Convergence and no-regret in multiagent learning," in Advances in Neural Information Processing Systems 17 (L. K. Saul, Y. Weiss and L. Bottou, eds.), (Cambridge, Massachusetts), pp. 209-216, MIT Press, 2005.
    • (2005) Advances in Neural Information Processing Systems 17 , pp. 209-216
    • Bowling, M.1
  • 64
    • 70350699723
    • A multiagent reinforcement learning algorithm with non-linear dynamics
    • S. Abdallah and V. Lesser, "A multiagent reinforcement learning algorithm with non-linear dynamics," Journal of Artificial Intelligence Research, vol. 33, pp. 521-549, 2008.
    • (2008) Journal of Artificial Intelligence Research , vol.33 , pp. 521-549
    • Abdallah, S.1    Lesser, V.2
  • 68
    • 79957749002
    • Reinforcement learning applied to a differential game
    • M. E. Harmon, L. C. Baird III, and A. H. Klopf, "Reinforcement learning applied to a differential game," Adaptive Behavior, vol. 4, no. 1, pp. 3-28, 1995.
    • (1995) Adaptive Behavior , vol.4 , Issue.1 , pp. 3-28
    • Harmon, M.E.1    Baird, L.C.2    Klopf, A.H.3
  • 69
    • 0032207552
    • Colearning in differential games
    • J. W. Sheppard, "Colearning in differential games," Machine Learning, vol. 33, pp. 201-233, 1998.
    • (1998) Machine Learning , vol.33 , pp. 201-233
    • Sheppard, J.W.1
  • 70
    • 77953027402
    • A reinforcement learning adaptive fuzzy controller for differential games
    • S. N. Givigi, H. M. Schwartz, and X. Lu, "A reinforcement learning adaptive fuzzy controller for differential games," Journal of Intelligent and Robotic Systems, vol. 59, pp. 3-30, 2010.
    • (2010) Journal of Intelligent and Robotic Systems , vol.59 , pp. 3-30
    • Givigi, S.N.1    Schwartz, H.M.2    Lu, X.3
  • 71
    • 78649832013
    • Self-learning fuzzy logic controllers for pursuit-evasion differential games
    • S. F. Desouky and H. M. Schwartz, "Self-learning fuzzy logic controllers for pursuit-evasion differential games," Robotics and Autonomous Systems, vol. 59, pp. 22-33, 2011.
    • (2011) Robotics and Autonomous Systems , vol.59 , pp. 22-33
    • Desouky, S.F.1    Schwartz, H.M.2
  • 72
    • 38249001350
    • A first approach to fuzzy differential game problem: guarding a territory
    • K. H. Hsia and J. G. Hsieh, "A first approach to fuzzy differential game problem: guarding a territory," Fuzzy Sets and Systems, vol. 55, pp. 157-167, 1993.
    • (1993) Fuzzy Sets and Systems , vol.55 , pp. 157-167
    • Hsia, K.H.1    Hsieh, J.G.2
  • 73
    • 0036721662
    • A strategy for a payoff-switching differential game based on fuzzy reasoning
    • Y. S. Lee, K. H. Hsia, and J. G. Hsieh, "A strategy for a payoff-switching differential game based on fuzzy reasoning," Fuzzy Sets and Systems, vol. 130, no. 2, pp. 237-251, 2002.
    • (2002) Fuzzy Sets and Systems , vol.130 , Issue.2 , pp. 237-251
    • Lee, Y.S.1    Hsia, K.H.2    Hsieh, J.G.3
  • 74
    • 0032140718
    • Fuzzy inference system learning by reinforcement methods
    • L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Transactions on Systems, Man, and Cybernetics Part C, vol. 28, no. 3, pp. 338-355, 1998.
    • (1998) IEEE Transactions on Systems, Man, and Cybernetics Part C , vol.28 , Issue.3 , pp. 338-355
    • Jouffe, L.1
  • 75
    • 0003947619
    • Boston, Massachusetts: Addison-Wesley Longman Publishing Co., Inc., 1st ed.
    • K. M. Passino and S. Yurkovich, Fuzzy Control. Boston, Massachusetts: Addison-Wesley Longman Publishing Co., Inc., 1st ed., 1998.
    • (1998) Fuzzy Control
    • Passino, K.M.1    Yurkovich, S.2
  • 77
    • 34248666540
    • Fuzzy sets
    • L. A. Zadeh, "Fuzzy sets," Information and Control, vol. 8, no. 3, pp. 338-353, 1965.
    • (1965) Information and Control , vol.8 , Issue.3 , pp. 338-353
    • Zadeh, L.A.1
  • 78
    • 84926397180
    • Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning
    • Master's thesis, Carleton University, Ottawa, ON, Canada
    • B. Al Faiya, "Learning in Pursuit-Evasion Differential Games Using Reinforcement Fuzzy Learning," Master's thesis, Carleton University, Ottawa, ON, Canada, 2012.
    • (2012)
    • Al Faiya, B.1
  • 79
    • 0021892282
    • Fuzzy identification of systems and its applications to modelling and control
    • SMC-15
    • T. Takagi and M. Sugeno, "Fuzzy identification of systems and its applications to modelling and control," IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-15, pp. 116-132, 1985.
    • (1985) IEEE Transactions on Systems, Man, and Cybernetics , pp. 116-132
    • Takagi, T.1    Sugeno, M.2
  • 81
    • 0004105094
    • Design of Fuzzy Controllers
    • Technical Univ. of Denmark: Technical Report (No:98-E864) Department of Automation, Hoboken, NJ
    • J. Jantzen, Design of Fuzzy Controllers. Technical Univ. of Denmark: Technical Report (No:98-E864) Department of Automation, Hoboken, NJ, 1999.
    • (1999)
    • Jantzen, J.1
  • 83
    • 84906988849
    • On Multi-Agent Reinforcement Learning in Games
    • Ph.D. Thesis Carleton University, Ottawa, ON, Canada
    • X. Lu, "On Multi-Agent Reinforcement Learning in Games." Ph.D. Thesis Carleton University, Ottawa, ON, Canada, 2012.
    • (2012)
    • Lu, X.1
  • 84
    • 27744536933
    • An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control
    • X. Dai, C. Li, and A. Rad, "An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control," IEEE Transactions on Intelligent Transportation Systems, vol. 6, no. 3, pp. 285-293, 2005.
    • (2005) IEEE Transactions on Intelligent Transportation Systems , vol.6 , Issue.3 , pp. 285-293
    • Dai, X.1    Li, C.2    Rad, A.3
  • 86
    • 0021892282
    • Fuzzy identification of systems and its application to modeling and control
    • T. Takagi and M. Sugeno, "Fuzzy identification of systems and its application to modeling and control," IEEE Transactions on Systems, Man, and Cybernetics, vol. 15, pp. 116-132, 1985.
    • (1985) IEEE Transactions on Systems, Man, and Cybernetics , vol.15 , pp. 116-132
    • Takagi, T.1    Sugeno, M.2
  • 89
  • 91
    • 0004251759
    • New York: John Wiley & Sons, Inc.
    • R. Isaacs, Differential Games. New York: John Wiley & Sons, Inc., 1965.
    • (1965) Differential Games
    • Isaacs, R.1
  • 92
    • 85003220328
    • The homicidal chauffeur
    • A. Merz, "The homicidal chauffeur," AIAA Journal, vol. 12, pp. 259-260, 1974.
    • (1974) AIAA Journal , vol.12 , pp. 259-260
    • Merz, A.1
  • 95
    • 74849091001
    • Hybrid intelligent systems applied to the pursuit-evasion game
    • IEEE International Conference on, (San Antonio, TX), October 2009
    • S. Desouky and H. Schwartz, "Hybrid intelligent systems applied to the pursuit-evasion game," in Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on, (San Antonio, TX), October 2009, pp. 2603-2608, 2009.
    • (2009) Systems, Man and Cybernetics, 2009. SMC 2009 , pp. 2603-2608
    • Desouky, S.1    Schwartz, H.2
  • 97
    • 70449675033
    • Genetic based fuzzy logic controller for a wall-following mobile robot
    • (St. Louis, MO), June 2009, IEEE
    • S. F. Desouky and H. M. Schwartz, "Genetic based fuzzy logic controller for a wall-following mobile robot," in American Control Conference, 2009 ACC 2009, (St. Louis, MO), June 2009, pp. 3555-3560, IEEE, 2009.
    • (2009) American Control Conference, 2009 ACC 2009 , pp. 3555-3560
    • Desouky, S.F.1    Schwartz, H.M.2
  • 98
    • 77950298151
    • Online learning of shaping rewards in reinforcement learning
    • M. Grześ and D. Kudenko, "Online learning of shaping rewards in reinforcement learning," Neural Networks, vol. 23, pp. 541-550, 2010.
    • (2010) Neural Networks , vol.23 , pp. 541-550
    • Grześ, M.1    Kudenko, D.2
  • 99
    • 0030647149
    • Reinforcement learning in the multi-robot domain
    • M. J. Matarić, "Reinforcement learning in the multi-robot domain," Autonomous Robots, vol. 4, pp. 73-83, 1997.
    • (1997) Autonomous Robots , vol.4 , pp. 73-83
    • Matarić, M.J.1
  • 102
    • 34250608222
    • La Reconstruction du nid et les Coordinations Inter-Individuelles chez Bellicositermes Natalensis et Cubitermes sp. La théorie de la Stigmergie: Essai d'interprétation du Comportement des Termites Constructeurs
    • P. P. Grassé, "La Reconstruction du nid et les Coordinations Inter-Individuelles chez Bellicositermes Natalensis et Cubitermes sp. La théorie de la Stigmergie: Essai d'interprétation du Comportement des Termites Constructeurs," Insectes Sociaux, vol. 6, pp. 41-80, 1959.
    • (1959) Insectes Sociaux , vol.6 , pp. 41-80
    • Grassé, P.P.1
  • 103
    • 2642586048
    • Cofields: a physically inspired approach to motion coordination
    • M. Mamei, F. Zambonelli, and L. Leonardi, "Cofields: a physically inspired approach to motion coordination," IEEE Pervasive Computing, vol. 3, no. 2, pp. 52-61, 2004.
    • (2004) IEEE Pervasive Computing , vol.3 , Issue.2 , pp. 52-61
    • Mamei, M.1    Zambonelli, F.2    Leonardi, L.3
  • 106
    • 0004240310
    • New York, New York: Simon & Schuster
    • M. Minsky, The Society of Mind. New York, New York: Simon & Schuster, 1986.
    • (1986) The Society of Mind
    • Minsky, M.1
  • 107
    • 34547152789
    • A game theoretic approach to swarm robotics
    • S. N. Givigi and H.M. Schwartz, "A game theoretic approach to swarm robotics," Applied Bionics and Biomechanics, vol. 3, no. 3, pp. 131-142, 2006.
    • (2006) Applied Bionics and Biomechanics , vol.3 , Issue.3 , pp. 131-142
    • Givigi, S.N.1    Schwartz, H.M.2
  • 110
    • 27744444690
    • Evolutionary swarm intelligence applied to robotics
    • Proceedings of the IEEE International Conference on Mechatronics and Automation
    • S. Givigi and H. M. Schwartz, "Evolutionary swarm intelligence applied to robotics," in Proceedings of the IEEE International Conference on Mechatronics and Automation, pp. 1005-1010, 2005.
    • (2005) , pp. 1005-1010
    • Givigi, S.1    Schwartz, H.M.2
  • 114
    • 0004187979
    • Washington, District of Columbia: The Mathematical Association of America
    • P. D. Straffin, Game Theory and Strategy. Washington, District of Columbia: The Mathematical Association of America, 1993.
    • (1993) Game Theory and Strategy
    • Straffin, P.D.1
  • 115
    • 3843059082
    • Unified convergence proofs of continuous-time fictitious play
    • J. S. Shamma and G. Arslan, "Unified convergence proofs of continuous-time fictitious play," IEEE Transactions on Automatic Control, vol. 49, no. 7, pp. 1137-1142, 2004.
    • (2004) IEEE Transactions on Automatic Control , vol.49 , Issue.7 , pp. 1137-1142
    • Shamma, J.S.1    Arslan, G.2
  • 117
    • 0001402950
    • An iterative method of solving a game
    • J. Robinson, "An iterative method of solving a game," Annals of Mathematics, vol. 54, no. 2, pp. 296-301, 1951.
    • (1951) Annals of Mathematics , vol.54 , Issue.2 , pp. 296-301
    • Robinson, J.1
  • 120
    • 0004149207
    • New York, New York: Oxford University Press, new ed.
    • R. Dawkins, The Selfish Gene. New York, New York: Oxford University Press, new ed., 1989.
    • (1989) The Selfish Gene
    • Dawkins, R.1
  • 121
    • 84926400379
    • Analysis and Design of Swarm Based Robots Using Game Theory
    • Ph.D. Thesis, Ottawa, ON: Carleton University, Sept
    • S. N. Givigi, "Analysis and Design of Swarm Based Robots Using Game Theory," Ph.D. Thesis, Ottawa, ON: Carleton University, Sept. 2009.
    • (2009)
    • Givigi, S.N.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.