SCOPUS 정보 검색 플랫폼

Autonomous Robots

Volumn 7, Issue 1, 1999, Pages 77-88

Reinforcement learning soccer teams with incomplete world models

(3) Wiering, Marco a Sałustowicz, Rafał a Schmidhuber, Jürgen a

a DALLE MOLLE INSTITUTE FOR ARTIFICIAL INTELLIGENCE IDSIA (Switzerland)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; MATHEMATICAL MODELS; MOBILE ROBOTS; PROBABILITY;

REINFORCEMENT LEARNING (RL); WORLD MODELS (WM);

ROBOT LEARNING;

EID: 0345073177 PISSN: 09295593 EISSN: None Source Type: Journal
DOI: 10.1023/A:1008921914343 Document Type: Article

Times cited : (26)

References (38)

1
- 0016556021
- A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
- Albus, J.S. 1975. A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, pp. 220-227.
- (1975) Dynamic Systems, Measurement and Control , pp. 220-227
- Albus, J.S.¹

2
- 85011779332
- Removing the genetics from the standard genetic algorithm
- A. Prieditis and S. Russell (Eds.), Morgan Kaufmann Publishers: San Francisco, CA
- Baluja, S. and Caruana, R. 1995. Removing the genetics from the standard genetic algorithm. In Machine Learning: Proceedings of the Twelfth International Conference, A. Prieditis and S. Russell (Eds.), Morgan Kaufmann Publishers: San Francisco, CA, pp. 38-46.
- (1995) Machine Learning: Proceedings of the Twelfth International Conference , pp. 38-46
- Baluja, S.¹ Caruana, R.²

3
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Barto, A.G., Sutton, R.S., and Anderson, C.W. 1983. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, SMC-13:834-846.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.SMC13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

4
- 0004020376
- Princeton University Press
- Bellman, R. 1961. Adaptive Control Processes, Princeton University Press.
- (1961) Adaptive Control Processes
- Bellman, R.¹

5
- 0003487482
- Athena Scientific: Belmont, MA
- Bertsekas, D.P. and Tsitsiklis, J.N. 1996. Neuro-Dynamic Programming, Athena Scientific: Belmont, MA.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

6
- 0002192119
- Input generalization in delayed reinforcement learning
- Morgan Kaufman
- Chapman, D. and Kaelbling, L.P. 1991. Input generalization in delayed reinforcement learning. In Proceedings of the 13th International Joint Conference on Artificial Intelligence (IJCAI), Morgan Kaufman, Vol. 2, pp. 726-731.
- (1991) Proceedings of the 13th International Joint Conference on Artificial Intelligence (IJCAI) , vol.2 , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

7
- 0002258659
- A representation for the adaptive generation of simple sequential programs
- J.J. Grefenstette (Ed.), Lawrence Erlbaum Associates: Hillsdale, NJ
- Cramer, N.L. 1985. A representation for the adaptive generation of simple sequential programs. In Proceedings of an International Conference on Genetic Algorithms and Their Applications, J.J. Grefenstette (Ed.), Lawrence Erlbaum Associates: Hillsdale, NJ, pp. 183-187.
- (1985) Proceedings of an International Conference on Genetic Algorithms and Their Applications , pp. 183-187
- Cramer, N.L.¹

8
- 0344299454
- Fortgeschrittenenpraktikum, Institut für Informatik, Lehrstuhl Prof. Radig, Technische Universität München
- Dickmanns, D., Schmidhuber, J., and Winklhofer, A. 1986. Der genetische Algorithmus: Eine Implementierung in Prolog. Fortgeschrittenenpraktikum, Institut für Informatik, Lehrstuhl Prof. Radig, Technische Universität München.
- (1986) Der Genetische Algorithmus: Eine Implementierung in Prolog
- Dickmanns, D.¹ Schmidhuber, J.² Winklhofer, A.³

9
- 0003463297
- University of Michigan Press: Ann Arbor
- Holland, J.H. 1975. Adaptation in Natural and Artificial Systems, University of Michigan Press: Ann Arbor.
- (1975) Adaptation in Natural and Artificial Systems
- Holland, J.H.¹

10
- 0004280606
- MIT Press
- Kaelbling, L. 1993. Learning in Embedded Systems, MIT Press.
- (1993) Learning in Embedded Systems
- Kaelbling, L.¹

11
- 84899026236
- Finite-sample convergence rates for Q-learning and indirect algorithms
- M. Kearns, S.A. Solla, and D. Cohn (Eds.), MIT Press: Cambridge, MA
- Kearns, M. and Singh, S. 1999. Finite-sample convergence rates for Q-learning and indirect algorithms. In Advances in Neural Information Processing Systems 12, M. Kearns, S.A. Solla, and D. Cohn (Eds.), MIT Press: Cambridge, MA.
- (1999) Advances in Neural Information Processing Systems 12 , vol.12
- Kearns, M.¹ Singh, S.²

12
- 0001334736
- Genetic evolution and co-evolution of computer programs
- C.G. Langton, C. Taylor, J.D. Farmer, and S. Rasmussen (Eds.), Addison Wesley Publishing Company
- Koza, J. R. 1992. Genetic evolution and co-evolution of computer programs. In Artificial Life II, C.G. Langton, C. Taylor, J.D. Farmer, and S. Rasmussen (Eds.), Addison Wesley Publishing Company, pp. 313-324.
- (1992) Artificial Life II , vol.2 , pp. 313-324
- Koza, J.R.¹

13
- 0003673017
- Ph.D. Thesis, Carnegie Mellon University, Pittsburgh
- Lin, L.-J. 1993. Reinforcement Learning for Robots Using Neural Networks. Ph.D. Thesis, Carnegie Mellon University, Pittsburgh.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.-J.¹

14
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- Moore, A. and Atkeson, C.G. 1993. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13:103-130.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.G.²

15
- 0001765492
- Simplifying neural networks by soft weight sharing
- Nowlan, S.J. and Hinton, G.E. 1992. Simplifying neural networks by soft weight sharing. Neural Computation, 4:173-193.
- (1992) Neural Computation , vol.4 , pp. 173-193
- Nowlan, S.J.¹ Hinton, G.E.²

16
- 0000955979
- Incremental multi-step Q-learning
- Peng, J. and Williams, R. 1996. Incremental multi-step Q-learning. Machine Learning, 22:283-290.
- (1996) Machine Learning , vol.22 , pp. 283-290
- Peng, J.¹ Williams, R.²

17
- 0003502414
- Dissertation, Published in 1973 by Fromman-Holzboog
- Rechenberg, I. 1971. Evolutions strategie - Optimierung technischer Systeme nach Prinzipien der biologischen Evolution, Dissertation, Published in 1973 by Fromman-Holzboog.
- (1971) Evolutions Strategie - Optimierung Technischer Systeme Nach Prinzipien der Biologischen Evolution
- Rechenberg, I.¹

18
- 0345161982
- Technical Report CUED/F-INFENG-TR 166, Cambridge University, UK
- Rummery, G.A. and Niranjan, M. 1994. On-line Q-learning using connectionist sytems. Technical Report CUED/F-INFENG-TR 166, Cambridge University, UK.
- (1994) On-line Q-learning Using Connectionist Sytems
- Rummery, G.A.¹ Niranjan, M.²

19
- 0000108169
- Probabilistic incremental program evolution
- Sałustowicz, R.P. and Schmidhuber, J. 1997. Probabilistic incremental program evolution. Evolutionary Computation, 5(2):123-141.
- (1997) Evolutionary Computation , vol.5 , Issue.2 , pp. 123-141
- Sałustowicz, R.P.¹ Schmidhuber, J.²

20
- 0345161984
- Evolving soccer strategies
- Springer-Verlag: Singapore
- Sałustowicz, R.P., Wiering, M.A., and Schmidhuber, J. 1997a. Evolving soccer strategies. In Proceedings of the Fourth International Conference on Neural Information Processing (ICONIP'97), Springer-Verlag: Singapore, pp. 502-506.
- (1997) Proceedings of the Fourth International Conference on Neural Information Processing (ICONIP'97) , pp. 502-506
- Sałustowicz, R.P.¹ Wiering, M.A.² Schmidhuber, J.³

21
- 0345593940
- On learning soccer strategies
- W. Gerstner, A. Germond, M. Hasler, and J.-D. Nicoud (Eds.), Springer-Verlag: Berlin, Heidelberg
- Sałustowicz, R.P., Wiering, M.A., and Schmidhuber, J. 1997b. On learning soccer strategies. In Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), volume 1327 of Lecture Notes in Computer Science, W. Gerstner, A. Germond, M. Hasler, and J.-D. Nicoud (Eds.), Springer-Verlag: Berlin, Heidelberg, pp. 769-774.
- (1997) Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), Volume 1327 of Lecture Notes in Computer Science , vol.1327 , pp. 769-774
- Sałustowicz, R.P.¹ Wiering, M.A.² Schmidhuber, J.³

22
- 0032208296
- Learning team strategies: Soccer case studies
- Sałustowicz, R.P., Wiering, M.A., and Schmidhuber, J. 1998. Learning team strategies: Soccer case studies. Machine Learning, 33(2/3):263-282.
- (1998) Machine Learning , vol.33 , Issue.2-3 , pp. 263-282
- Sałustowicz, R.P.¹ Wiering, M.A.² Schmidhuber, J.³

23
- 0001201756
- Some studies in machine learning using the game of checkers
- Samuel, A.L. 1959. Some studies in machine learning using the game of checkers. IBM Journal on Research and Development, 3:210-229.
- (1959) IBM Journal on Research and Development , vol.3 , pp. 210-229
- Samuel, A.L.¹

24
- 0007908166
- Technical Report CIONS 96-088, Georgia Institute of Technology, Atlanta
- Santamaria, J.C., Sutton, R.S., and Ram, A. 1996. Experiments with reinforcement learning in problems with continuous state and action spaces. Technical Report CIONS 96-088, Georgia Institute of Technology, Atlanta.
- (1996) Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces
- Santamaria, J.C.¹ Sutton, R.S.² Ram, A.³

25
- 0345161979
- Technical Report FKI-198-94, Fakultät für Informatik, Technische Universität München, Revised January 1995
- Schmidhuber, J. 1995. On learning how to learn learning strategies. Technical Report FKI-198-94, Fakultät für Informatik, Technische Universität München, Revised January 1995.
- (1995) On Learning How to Learn Learning Strategies
- Schmidhuber, J.¹

26
- 0000156236
- Reinforcement learning with self-modifying policies
- S. Thrun and L. Pratt (Eds.), Kluwer
- Schmidhuber, J., Zhao, J., and Schraudolph, N. 1997a. Reinforcement learning with self-modifying policies. In Learning to Learn, S. Thrun and L. Pratt (Eds.), Kluwer, pp. 293-309.
- (1997) Learning to Learn , pp. 293-309
- Schmidhuber, J.¹ Zhao, J.² Schraudolph, N.³

27
- 0031186687
- Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement
- Schmidhuber, J., Zhao, J., and Wiering, M. 1997b. Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement. Machine Learning, 28:105-130.
- (1997) Machine Learning , vol.28 , pp. 105-130
- Schmidhuber, J.¹ Zhao, J.² Wiering, M.³

28
- 0029753630
- Reinforcement learning with replacing eligibility traces
- Singh, S.P. and Sutton, R.S. 1996. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158.
- (1996) Machine Learning , vol.22 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

29
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R.S. 1988. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

30
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- D.S. Touretzky, M.C. Mozer, and M.E. Hasselmo (Eds.), MIT Press: Cambridge, MA
- Sutton, R.S. 1996. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, D.S. Touretzky, M.C. Mozer, and M.E. Hasselmo (Eds.), MIT Press: Cambridge, MA, pp. 1038-1045.
- (1996) Advances in Neural Information Processing Systems 8 , vol.8 , pp. 1038-1045
- Sutton, R.S.¹

31
- 0004102479
- MIT Press/Bradford Books
- Sutton, R.S. and Barto, A.G. 1988. Reinforcement Learning: An Introduction, MIT Press/Bradford Books.
- (1988) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

32
- 0032044899
- A probabilistic approach to concurrent mapping and localization for mobile robots
- Thrun, S., Fox, D., and Burgard, W 1998. A probabilistic approach to concurrent mapping and localization for mobile robots. Machine Learning, (31):29-53. Also appeared in Autonomous Robots, 5:253-271, 1998 as joint issue.
- (1998) Machine Learning , Issue.31 , pp. 29-53
- Thrun, S.¹ Fox, D.² Burgard, W.³

33
- 0032120083
- as joint issue
- Thrun, S., Fox, D., and Burgard, W 1998. A probabilistic approach to concurrent mapping and localization for mobile robots. Machine Learning, (31):29-53. Also appeared in Autonomous Robots, 5:253-271, 1998 as joint issue.
- (1998) Autonomous Robots , vol.5 , pp. 253-271

34
- 0004049893
- Ph.D. Thesis, King's College, Cambridge, England
- Watkins, C.J.C.H. 1989. Learning from Delayed Rewards. Ph.D. Thesis, King's College, Cambridge, England.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

35
- 34249833101
- Q-learning
- Watkins, C.J.C.H. and Dayan, P. 1992. Q-learning. Machine Learning, 8:279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

36
- 0345161977
- Ph.D. Thesis, University of Amsterdam/IDSIA
- Wiering, M.A. 1999. Explorations in Efficient Reinforcement Learning. Ph.D. Thesis, University of Amsterdam/IDSIA.
- (1999) Explorations in Efficient Reinforcement Learning
- Wiering, M.A.¹

37
- 0345161973
- Efficient model-based exploration
- J.A. Meyer and S.W. Wilson (Eds.), MIT Press/Bradford Books
- Wiering, M.A. and Schmidhuber, J. 1998a. Efficient model-based exploration. In Proceedings of the Sixth International Conference on Simulation of Adaptive Behavior: From Animals to Animats 6, J.A. Meyer and S.W. Wilson (Eds.), MIT Press/Bradford Books, pp. 223-228.
- (1998) Proceedings of the Sixth International Conference on Simulation of Adaptive Behavior: From Animals to Animats 6 , vol.6 , pp. 223-228
- Wiering, M.A.¹ Schmidhuber, J.²

38
- 0032182997
- Fast online Q(λ)
- Wiering, M.A. and Schmidhuber, J. 1998b. Fast online Q(λ). Machine Learning, 33(1):105-116.
- (1998) Machine Learning , vol.33 , Issue.1 , pp. 105-116
- Wiering, M.A.¹ Schmidhuber, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.