Volume 12, Issue 4, 1998, Pages 263-282

Learning Team Strategies: Soccer Case Studies

Author keywords

Coevolution; Evaluation functions; Multiagent reinforcement learning; Probabilistic incremental program evolution; Soccer; TD Q learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; FUNCTION EVALUATION; GAME THEORY; LEARNING ALGORITHMS; PROBABILITY DISTRIBUTIONS

EID: 0032208296     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/a:1007570708568     Document Type: Article
Times cited: 38

References (41)
  • 1
    • 0016556021
    • A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
    • Albus, J.S. (1975). A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, 97, 220-227.
    • (1975) Dynamic Systems, Measurement and Control , vol.97 , pp. 220-227
    • Albus, J.S.1
  • 4
    • 85011779332
    • Removing the genetics from the standard genetic algorithm
    • A. Prieditis, & S. Russell (Eds.), San Francisco, CA: Morgan Kaufmann Publishers
    • Baluja, S., & Caruana, R. (1995). Removing the genetics from the standard genetic algorithm. In A. Prieditis, & S. Russell (Eds.), Machine Learning: Proceedings of the Twelfth International Conference (pp. 38-46). San Francisco, CA: Morgan Kaufmann Publishers.
    • (1995) Machine Learning: Proceedings of the Twelfth International Conference , pp. 38-46
    • Baluja, S.1    Caruana, R.2
  • 6
    • 0002258659
    • A representation for the adaptive generation of simple sequential programs
    • J. Grefenstette (Ed.), Hillsdale, NJ: Lawrence Erlbaum Associates
    • Cramer, N.L. (1985). A representation for the adaptive generation of simple sequential programs. In J. Grefenstette (Ed.), Proceedings of an International Conference on Genetic Algorithms and their Applications (pp. 183-187). Hillsdale, NJ: Lawrence Erlbaum Associates.
    • (1985) Proceedings of An International Conference on Genetic Algorithms and Their Applications , pp. 183-187
    • Cramer, N.L.1
  • 7
    • 85156187730
    • Improving elevator performance using reinforcement learning
    • D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Cambridge, MA: MIT Press
    • Crites, R., & Barto, A. (1996). Improving elevator performance using reinforcement learning. In D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 1017-1023). Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
    • Crites, R.1    Barto, A.2
  • 11
    • 0000202647
    • Universal sequential search problems
    • Levin, L.A. (1973). Universal sequential search problems. Problems of Information Transmission, 9(3), 265-266.
    • (1973) Problems of Information Transmission , vol.9 , Issue.3 , pp. 265-266
    • Levin, L.A.1
  • 12
    • 0021404339
    • Randomness conservation inequalities: Information and independence in mathematical theories
    • Levin, L.A. (1984). Randomness conservation inequalities: Information and independence in mathematical theories. Information and Control, 61, 15-37.
    • (1984) Information and Control , vol.61 , pp. 15-37
    • Levin, L.A.1
  • 15
    • 85149834820
    • Markov games as a framework for multi-agent reinforcement learning
    • A. Prieditis, & S. Russell (Eds.), San Francisco, CA: Morgan Kaufmann Publishers
    • Littman, M.L. (1994). Markov games as a framework for multi-agent reinforcement learning. In A. Prieditis, & S. Russell (Eds.), Machine Learning: Proceedings of the Eleventh International Conference (pp. 157-163). San Francisco, CA: Morgan Kaufmann Publishers.
    • (1994) Machine Learning: Proceedings of the Eleventh International Conference , pp. 157-163
    • Littman, M.L.1
  • 19
    • 0001765492
    • Simplifying neural networks by soft weight sharing
    • Nowlan, S.J., & Hinton, G.E. (1992). Simplifying neural networks by soft weight sharing. Neural Computation, 4, 173-193.
    • (1992) Neural Computation , vol.4 , pp. 173-193
    • Nowlan, S.J.1    Hinton, G.E.2
  • 20
    • 0000955979
    • Incremental multi-step Q-learning
    • Peng, J., & Williams, R. (1996). Incremental multi-step Q-learning. Machine Learning, 22, 283-290.
    • (1996) Machine Learning , vol.22 , pp. 283-290
    • Peng, J.1    Williams, R.2
  • 24
    • 0345593940
    • On learning soccer strategies
    • W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), Berlin Heidelberg: Springer-Verlag
    • Sałustowicz, R.P., Wiering, M.A., & Schmidhuber, J. (1997b). On learning soccer strategies. In W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), volume 1327 of Lecture Notes in Computer Science (pp. 769-774). Berlin Heidelberg: Springer-Verlag.
    • (1997) Lecture Notes in Computer Science , vol.1327 , pp. 769-774
    • Sałustowicz, R.P.1    Wiering, M.A.2    Schmidhuber, J.3
  • 25
    • 0031194381
    • Discovering neural nets with low Kolmogorov complexity and high generalization capability
    • Schmidhuber, J. (1997a). Discovering neural nets with low Kolmogorov complexity and high generalization capability. Neural Networks, 10(5), 857-873.
    • (1997) Neural Networks , vol.10 , Issue.5 , pp. 857-873
    • Schmidhuber, J.1
  • 26
    • 0007918330
    • A general method for incremental self-improvement and multi-agent learning in unrestricted environments
    • X. Yao (Ed.), Singapore: Scientific Publ. Co., in press
    • Schmidhuber, J. (1997b). A general method for incremental self-improvement and multi-agent learning in unrestricted environments. In X. Yao (Ed.), Evolutionary Computation: Theory and Applications. Singapore: Scientific Publ. Co., in press.
    • (1997) Evolutionary Computation: Theory and Applications
    • Schmidhuber, J.1
  • 27
    • 0000156236
    • Reinforcement learning with self-modifying policies
    • S. Thrun & L. Pratt (Eds.), Boston, MA: Kluwer
    • Schmidhuber, J., Zhao, J., & Schraudolph, N. (1997a). Reinforcement learning with self-modifying policies. In S. Thrun & L. Pratt (Eds.), Learning to Learn (pp. 293-309). Boston, MA: Kluwer.
    • (1997) Learning to Learn , pp. 293-309
    • Schmidhuber, J.1    Zhao, J.2    Schraudolph, N.3
  • 28
    • 0031186687
    • Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement
    • Schmidhuber, J., Zhao, J., & Wiering, M. (1997b). Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement. Machine Learning, 28, 105-130.
    • (1997) Machine Learning , vol.28 , pp. 105-130
    • Schmidhuber, J.1    Zhao, J.2    Wiering, M.3
  • 29
    • 0022825723
    • An application of algorithmic probability to problems in artificial intelligence
    • L.N. Kanal & J.F. Lemmer (Eds.), Elsevier Science Publishers
    • Solomonoff, R. (1986). An application of algorithmic probability to problems in artificial intelligence. In L.N. Kanal & J.F. Lemmer (Eds.), Uncertainty in Artificial Intelligence (pp. 473-491). Elsevier Science Publishers.
    • (1986) Uncertainty in Artificial Intelligence , pp. 473-491
    • Solomonoff, R.1
  • 30
    • 85156255116
    • Beating a defender in robotic soccer: Memory-based learning of a continuous function
    • G. Tesauro, D.S. Touretzky, & T.K. Leen (Eds.), Cambridge, MA: MIT Press
    • Stone, P., & Veloso, M. (1996a). Beating a defender in robotic soccer: Memory-based learning of a continuous function. In G. Tesauro, D.S. Touretzky, & T.K. Leen (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 896-902). Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 896-902
    • Stone, P.1    Veloso, M.2
  • 31
    • 0032020927
    • A layered approach to learning client behaviors in the robocup soccer server
    • 1998, to appear
    • Stone, P., & Veloso, M. (1996b). A layered approach to learning client behaviors in the robocup soccer server. Applied Artificial Intelligence (AAI), 1998, to appear.
    • (1996) Applied Artificial Intelligence (AAI)
    • Stone, P.1    Veloso, M.2
  • 32
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 33
    • 85156221438
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Cambridge, MA: MIT Press
    • Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 1038-1045). Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1045
    • Sutton, R.S.1
  • 35
    • 0000985504
    • TD-gammon, a self-teaching backgammon program, achieves master-level play
    • Tesauro, G. (1994). TD-gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2), 215-219.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1
  • 36
    • 84890657801
    • Learning real team solutions
    • G. Weiss (Ed.), DAI Meets Machine Learning, Berlin: Springer-Verlag
    • Versino, C., & Gambardella, L.M. (1997). Learning real team solutions. In G. Weiss (Ed.), DAI Meets Machine Learning, volume 1221 of Lecture Notes in Artificial Intelligence (pp. 40-61). Berlin: Springer-Verlag.
    • (1997) Lecture Notes in Artificial Intelligence , vol.1221 , pp. 40-61
    • Versino, C.1    Gambardella, L.M.2
  • 38
    • 84949977009
    • Adaptation and learning in multi-agent systems: Some remarks and a bibliography
    • G. Weiss & S. Sen (Eds.), Adaptation and Learning in Multi-Agent Systems, Berlin Heidelberg: Springer-Verlag
    • Weiss, G. (1996). Adaptation and learning in multi-agent systems: Some remarks and a bibliography. In G. Weiss & S. Sen (Eds.), Adaptation and Learning in Multi-Agent Systems, volume 1042 of Lecture Notes in Artificial Intelligence (pp. 1-21). Berlin Heidelberg: Springer-Verlag.
    • (1996) Lecture Notes in Artificial Intelligence , vol.1042 , pp. 1-21
    • Weiss, G.1
  • 39
    • 0002278965
    • Adaptive switching circuits
    • New York: IRE. Reprinted in Anderson and Rosenfeld (1988)
    • Widrow, B., & Hoff, M.E. (1960). Adaptive switching circuits. 1960 IRE WESCON Convention Record (Vol. 4, pp. 96-104). New York: IRE. Reprinted in Anderson and Rosenfeld (1988).
    • (1960) 1960 IRE WESCON Convention Record , vol.4 , pp. 96-104
    • Widrow, B.1    Hoff, M.E.2
  • 41
    • 2542471725
    • Fast online Q(λ)
    • (Technical Report IDSIA-21-97), IDSIA, Lugano, Switzerland
    • Wiering, M.A., & Schmidhuber, J. (1997). Fast online Q(λ) (Technical Report IDSIA-21-97), IDSIA, Lugano, Switzerland.
    • (1997) Fast Online Q(λ)
    • Wiering, M.A.1    Schmidhuber, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.