SCOPUS Information Retrieval Platform

Volume 12, Issue 4, 1998, Pages 263-282

Learning Team Strategies: Soccer Case Studies

(3) Sałustowicz, Rafał P. (a); Wiering, Marco A. (a); Schmidhuber, Jürgen (a)

(a) Dalle Molle Institute for Artificial Intelligence (IDSIA), Switzerland

Author keywords

Coevolution; Evaluation functions; Multiagent reinforcement learning; Probabilistic incremental program evolution; Soccer; TD Q learning

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; FUNCTION EVALUATION; GAME THEORY; LEARNING ALGORITHMS; PROBABILITY DISTRIBUTIONS;

MULTIAGENT REINFORCEMENT LEARNING; PROBABILISTIC INCREMENTAL PROGRAM EVOLUTION (PIPE);

LEARNING SYSTEMS;

EID: 0032208296 PISSN: 08856125 EISSN: None Source Type: Journal
DOI: 10.1023/a:1007570708568 Document Type: Article

Times cited: (38)

References (41)

1
- 0016556021
- A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
- Albus, J.S. (1975). A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, 97, 220-227.
- (1975) Dynamic Systems, Measurement and Control , vol.97 , pp. 220-227
- Albus, J.S.¹

2
- 2542445555
- A vision-based reinforcement learning for coordination of soccer playing behaviors
- Asada, M., Uchibe, E., Noda, S., Tawaratsumida, S., & Hosoda, K. (1994). A vision-based reinforcement learning for coordination of soccer playing behaviors. Proceedings of AAAI-94 Workshop on AI and A-life and Entertainment (pp. 16-21).
- (1994) Proceedings of AAAI-94 Workshop on AI and A-life and Entertainment , pp. 16-21
- Asada, M.¹ Uchibe, E.² Noda, S.³ Tawaratsumida, S.⁴ Hosoda, K.⁵

3
- 0003984832
- (Technical Report CMU-CS-94-163), Carnegie Mellon University, Pittsburgh
- Baluja, S. (1994). Population-based incremental learning: A method for integrating genetic search based function optimization and competitive learning (Technical Report CMU-CS-94-163), Carnegie Mellon University, Pittsburgh.
- (1994) Population-based Incremental Learning: A Method for Integrating Genetic Search Based Function Optimization and Competitive Learning
- Baluja, S.¹

4
- 85011779332
- Removing the genetics from the standard genetic algorithm
- A. Prieditis, & S. Russell (Eds.), San Francisco, CA: Morgan Kaufmann Publishers
- Baluja, S., & Caruana, R. (1995). Removing the genetics from the standard genetic algorithm. In A. Prieditis, & S. Russell (Eds.), Machine Learning: Proceedings of the Twelfth International Conference (pp. 38-46). San Francisco, CA: Morgan Kaufmann Publishers.
- (1995) Machine Learning: Proceedings of the Twelfth International Conference , pp. 38-46
- Baluja, S.¹ Caruana, R.²

5
- 0003487482
- Belmont, MA: Athena Scientific
- Bertsekas, D.P., & Tsitsiklis, J.N. (1996). Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

6
- 0002258659
- A representation for the adaptive generation of simple sequential programs
- J. Grefenstette (Ed.), Hillsdale, NJ: Lawrence Erlbaum Associates
- Cramer, N.L. (1985). A representation for the adaptive generation of simple sequential programs. In J. Grefenstette (Ed.), Proceedings of an International Conference on Genetic Algorithms and their Applications (pp. 183-187). Hillsdale, NJ: Lawrence Erlbaum Associates.
- (1985) Proceedings of An International Conference on Genetic Algorithms and Their Applications , pp. 183-187
- Cramer, N.L.¹

7
- 85156187730
- Improving elevator performance using reinforcement learning
- D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Cambridge, MA: MIT Press
- Crites, R., & Barto, A. (1996). Improving elevator performance using reinforcement learning. In D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 1017-1023). Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
- Crites, R.¹ Barto, A.²

8
- 0344299454
- Fortgeschrittenenpraktikum, Institut für Informatik, Lehrstuhl Prof. Radig, Technische Universität München
- Dickmanns, D., Schmidhuber, J., & Winklhofer, A. (1987). Der genetische Algorithmus: Eine Implementierung in Prolog. Fortgeschrittenenpraktikum, Institut für Informatik, Lehrstuhl Prof. Radig, Technische Universität München.
- (1987) Der Genetische Algorithmus: Eine Implementierung in Prolog
- Dickmanns, D.¹ Schmidhuber, J.² Winklhofer, A.³

9
- 0003924942
- Cambridge, MA: MIT Press
- Gallant, S.I. (1993). Neural Network Learning and Expert Systems. Cambridge, MA: MIT Press.
- (1993) Neural Network Learning and Expert Systems
- Gallant, S.I.¹

10
- 0003882343
- Cambridge, MA: MIT Press
- Koza, J.R. (1992). Genetic Programming - On the Programming of Computers by Means of Natural Selection. Cambridge, MA: MIT Press.
- (1992) Genetic Programming - On the Programming of Computers by Means of Natural Selection
- Koza, J.R.¹

11
- 0000202647
- Universal sequential search problems
- Levin, L.A. (1973). Universal sequential search problems. Problems of Information Transmission, 9(3), 265-266.
- (1973) Problems of Information Transmission , vol.9 , Issue.3 , pp. 265-266
- Levin, L.A.¹

12
- 0021404339
- Randomness conservation inequalities: Information and independence in mathematical theories
- Levin, L.A. (1984). Randomness conservation inequalities: Information and independence in mathematical theories. Information and Control, 61, 15-37.
- (1984) Information and Control , vol.61 , pp. 15-37
- Levin, L.A.¹

13
- 0003680739
- New York, NY: Springer-Verlag
- Li, M., & Vitányi, P.M.B. (1993). An Introduction to Kolmogorov Complexity and its Applications. New York, NY: Springer-Verlag.
- (1993) An Introduction to Kolmogorov Complexity and Its Applications
- Li, M.¹ Vitányi, P.M.B.²

14
- 0003673017
- Ph.D. thesis, Carnegie Mellon University, Pittsburgh, PA
- Lin, L.J. (1993). Reinforcement learning for robots using neural networks. Ph.D. thesis, Carnegie Mellon University, Pittsburgh, PA.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.J.¹

15
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- A. Prieditis, & S. Russell (Eds.), San Francisco, CA: Morgan Kaufmann Publishers
- Littman, M.L. (1994). Markov games as a framework for multi-agent reinforcement learning. In A. Prieditis, & S. Russell (Eds.), Machine Learning: Proceedings of the Eleventh International Conference (pp. 157-163). San Francisco, CA: Morgan Kaufmann Publishers.
- (1994) Machine Learning: Proceedings of the Eleventh International Conference , pp. 157-163
- Littman, M.L.¹

16
- 0003322602
- Co-evolving soccer softbot team coordination with genetic programming
- Luke, S., Hohn, C., Farris, J., Jackson, G., & Hendler, J. (1997). Co-evolving soccer softbot team coordination with genetic programming. Proceedings of the First International Workshop on RoboCup, at the International Joint Conference on Artificial Intelligence (IJCAI-97).
- (1997) Proceedings of the First International Workshop on RoboCup, at the International Joint Conference on Artificial Intelligence (IJCAI-97)
- Luke, S.¹ Hohn, C.² Farris, J.³ Jackson, G.⁴ Hendler, J.⁵

17
- 0002650559
- Learning of cooperative actions in multi-agent systems: A case study of pass play in soccer
- S. Sen (Ed.), Menlo Park, CA: AAAI Press
- Matsubara, H., Noda, I., & Hiraki, K. (1996). Learning of cooperative actions in multi-agent systems: A case study of pass play in soccer. In S. Sen (Ed.), Working Notes for the AAAI-96 Spring Symposium on Adaptation, Coevolution and Learning in Multi-agent Systems (pp. 63-67). Menlo Park, CA: AAAI Press.
- (1996) Working Notes for the AAAI-96 Spring Symposium on Adaptation, Coevolution and Learning in Multi-agent Systems , pp. 63-67
- Matsubara, H.¹ Noda, I.² Hiraki, K.³

18
- 84890686131
- Correlating internal parameters and external performance: Learning soccer agents
- G. Weiss (Ed.), Berlin: Springer-Verlag
- Nadella, R., & Sen, S. (1996). Correlating internal parameters and external performance: Learning soccer agents. In G. Weiss (Ed.), Distributed Artificial Intelligence Meets Machine Learning. Learning in Multi-Agent Environments (pp. 137-150). Berlin: Springer-Verlag.
- (1996) Distributed Artificial Intelligence Meets Machine Learning. Learning in Multi-Agent Environments , pp. 137-150
- Nadella, R.¹ Sen, S.²

19
- 0001765492
- Simplifying neural networks by soft weight sharing
- Nowlan, S.J., & Hinton, G.E. (1992). Simplifying neural networks by soft weight sharing. Neural Computation, 4, 173-193.
- (1992) Neural Computation , vol.4 , pp. 173-193
- Nowlan, S.J.¹ Hinton, G.E.²

20
- 0000955979
- Incremental multi-step Q-learning
- Peng, J., & Williams, R. (1996). Incremental multi-step Q-learning. Machine Learning, 22, 283-290.
- (1996) Machine Learning , vol.22 , pp. 283-290
- Peng, J.¹ Williams, R.²

21
- 0004206036
- Master's thesis, University of British Columbia
- Sahota, M. (1993). Real-time intelligent behaviour in dynamic environments: Soccer-playing robots. Master's thesis, University of British Columbia.
- (1993) Real-time Intelligent Behaviour in Dynamic Environments: Soccer-playing Robots
- Sahota, M.¹

22
- 0000108169
- Probabilistic incremental program evolution
- Sałustowicz, R.P., & Schmidhuber, J. (1997). Probabilistic incremental program evolution. Evolutionary Computation, 5(2), 123-141.
- (1997) Evolutionary Computation , vol.5 , Issue.2 , pp. 123-141
- Sałustowicz, R.P.¹ Schmidhuber, J.²

23
- 0345161984
- Evolving soccer strategies
- Singapore: Springer-Verlag
- Sałustowicz, R.P., Wiering, M.A., & Schmidhuber, J. (1997a). Evolving soccer strategies. Proceedings of the Fourth International Conference on Neural Information Processing (ICONIP'97) (pp. 502-506). Singapore: Springer-Verlag.
- (1997) Proceedings of the Fourth International Conference on Neural Information Processing (ICONIP'97) , pp. 502-506
- Sałustowicz, R.P.¹ Wiering, M.A.² Schmidhuber, J.³

24
- 0345593940
- On learning soccer strategies
- W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), Berlin Heidelberg: Springer-Verlag
- Sałustowicz, R.P., Wiering, M.A., & Schmidhuber, J. (1997b). On learning soccer strategies. In W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), volume 1327 of Lecture Notes in Computer Science (pp. 769-774). Berlin Heidelberg: Springer-Verlag.
- (1997) Lecture Notes in Computer Science , vol.1327 , pp. 769-774
- Sałustowicz, R.P.¹ Wiering, M.A.² Schmidhuber, J.³

25
- 0031194381
- Discovering neural nets with low Kolmogorov complexity and high generalization capability
- Schmidhuber, J. (1997a). Discovering neural nets with low Kolmogorov complexity and high generalization capability. Neural Networks, 10(5), 857-873.
- (1997) Neural Networks , vol.10 , Issue.5 , pp. 857-873
- Schmidhuber, J.¹

26
- 0007918330
- A general method for incremental self-improvement and multi-agent learning in unrestricted environments
- X. Yao (Ed.), Singapore: Scientific Publ. Co., in press
- Schmidhuber, J. (1997b). A general method for incremental self-improvement and multi-agent learning in unrestricted environments. In X. Yao (Ed.), Evolutionary Computation: Theory and Applications. Singapore: Scientific Publ. Co., in press.
- (1997) Evolutionary Computation: Theory and Applications
- Schmidhuber, J.¹

27
- 0000156236
- Reinforcement learning with self-modifying policies
- S. Thrun & L. Pratt (Eds.), Boston, MA: Kluwer
- Schmidhuber, J., Zhao, J., & Schraudolph, N. (1997a). Reinforcement learning with self-modifying policies. In S. Thrun & L. Pratt (Eds.), Learning to Learn (pp. 293-309). Boston, MA: Kluwer.
- (1997) Learning to Learn , pp. 293-309
- Schmidhuber, J.¹ Zhao, J.² Schraudolph, N.³

28
- 0031186687
- Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement
- Schmidhuber, J., Zhao, J., & Wiering, M. (1997b). Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement. Machine Learning, 28, 105-130.
- (1997) Machine Learning , vol.28 , pp. 105-130
- Schmidhuber, J.¹ Zhao, J.² Wiering, M.³

29
- 0022825723
- An application of algorithmic probability to problems in artificial intelligence
- L.N. Kanal & J.F. Lemmer (Eds.), Elsevier Science Publishers
- Solomonoff, R. (1986). An application of algorithmic probability to problems in artificial intelligence. In L.N. Kanal & J.F. Lemmer (Eds.), Uncertainty in Artificial Intelligence (pp. 473-491). Elsevier Science Publishers.
- (1986) Uncertainty in Artificial Intelligence , pp. 473-491
- Solomonoff, R.¹

30
- 85156255116
- Beating a defender in robotic soccer: Memory-based learning of a continuous function
- G. Tesauro, D.S. Touretzky, & T.K. Leen (Eds.), Cambridge, MA: MIT Press
- Stone, P., & Veloso, M. (1996a). Beating a defender in robotic soccer: Memory-based learning of a continuous function. In G. Tesauro, D.S. Touretzky, & T.K. Leen (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 896-902). Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 896-902
- Stone, P.¹ Veloso, M.²

31
- 0032020927
- A layered approach to learning client behaviors in the RoboCup soccer server
- 1998, to appear
- Stone, P., & Veloso, M. (1996b). A layered approach to learning client behaviors in the RoboCup soccer server. Applied Artificial Intelligence (AAI), 1998, to appear.
- (1996) Applied Artificial Intelligence (AAI)
- Stone, P.¹ Veloso, M.²

32
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R.S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

33
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Cambridge, MA: MIT Press
- Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 1038-1045). Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1045
- Sutton, R.S.¹

34
- 2542439929
- Sutton, R.S. (1997). Personal communication at the Seventh International Conference on Artificial Neural Networks (ICANN'97).
- (1997) Personal Communication at the Seventh International Conference on Artificial Neural Networks (ICANN'97)
- Sutton, R.S.¹

35
- 0000985504
- TD-gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. (1994). TD-gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2), 215-219.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

36
- 84890657801
- Learning real team solutions
- G. Weiss (Ed.), DAI Meets Machine Learning, Berlin: Springer-Verlag
- Versino, C., & Gambardella, L.M. (1997). Learning real team solutions. In G. Weiss (Ed.), DAI Meets Machine Learning, volume 1221 of Lecture Notes in Artificial Intelligence (pp. 40-61). Berlin: Springer-Verlag.
- (1997) Lecture Notes in Artificial Intelligence , vol.1221 , pp. 40-61
- Versino, C.¹ Gambardella, L.M.²

37
- 0004049893
- PhD thesis, King's College, Cambridge
- Watkins, C. (1989). Learning from Delayed Rewards. PhD thesis, King's College, Cambridge.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

38
- 84949977009
- Adaptation and learning in multi-agent systems: Some remarks and a bibliography
- G. Weiss & S. Sen (Eds.), Adaptation and Learning in Multi-Agent Systems, Berlin Heidelberg: Springer-Verlag
- Weiss, G. (1996). Adaptation and learning in multi-agent systems: Some remarks and a bibliography. In G. Weiss & S. Sen (Eds.), Adaptation and Learning in Multi-Agent Systems, volume 1042 of Lecture Notes in Artificial Intelligence (pp. 1-21). Berlin Heidelberg: Springer-Verlag.
- (1996) Lecture Notes in Artificial Intelligence , vol.1042 , pp. 1-21
- Weiss, G.¹

39
- 0002278965
- Adaptive switching circuits
- New York: IRE. Reprinted in Anderson and Rosenfeld (1988)
- Widrow, B., & Hoff, M.E. (1960). Adaptive switching circuits. 1960 IRE WESCON Convention Record (Vol. 4, pp. 96-104). New York: IRE. Reprinted in Anderson and Rosenfeld (1988).
- (1960) 1960 IRE WESCON Convention Record , vol.4 , pp. 96-104
- Widrow, B.¹ Hoff, M.E.²

40
- 0010888394
- Solving POMDPs with Levin search and EIRA
- L. Saitta (Ed.), San Francisco, CA: Morgan Kaufmann Publishers
- Wiering, M.A., & Schmidhuber, J. (1996). Solving POMDPs with Levin search and EIRA. In L. Saitta (Ed.), Machine Learning: Proceedings of the Thirteenth International Conference (pp. 534-542). San Francisco, CA: Morgan Kaufmann Publishers.
- (1996) Machine Learning: Proceedings of the Thirteenth International Conference , pp. 534-542
- Wiering, M.A.¹ Schmidhuber, J.²

41
- 2542471725
- (Technical Report IDSIA-21-97), IDSIA, Lugano, Switzerland
- Wiering, M.A., & Schmidhuber, J. (1997). Fast online Q(λ) (Technical Report IDSIA-21-97), IDSIA, Lugano, Switzerland.
- (1997) Fast Online Q(λ)
- Wiering, M.A.¹ Schmidhuber, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.