메뉴 건너뛰기




Volumn 7, Issue 1, 1999, Pages 77-88

Reinforcement learning soccer teams with incomplete world models

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; MATHEMATICAL MODELS; MOBILE ROBOTS; PROBABILITY;

EID: 0345073177     PISSN: 09295593     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1008921914343     Document Type: Article
Times cited : (26)

References (38)
  • 1
    • 0016556021 scopus 로고
    • A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
    • Albus, J.S. 1975. A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, pp. 220-227.
    • (1975) Dynamic Systems, Measurement and Control , pp. 220-227
    • Albus, J.S.1
  • 2
    • 85011779332 scopus 로고
    • Removing the genetics from the standard genetic algorithm
    • A. Prieditis and S. Russell (Eds.), Morgan Kaufmann Publishers: San Francisco, CA
    • Baluja, S. and Caruana, R. 1995. Removing the genetics from the standard genetic algorithm. In Machine Learning: Proceedings of the Twelfth International Conference, A. Prieditis and S. Russell (Eds.), Morgan Kaufmann Publishers: San Francisco, CA, pp. 38-46.
    • (1995) Machine Learning: Proceedings of the Twelfth International Conference , pp. 38-46
    • Baluja, S.1    Caruana, R.2
  • 7
    • 0002258659 scopus 로고
    • A representation for the adaptive generation of simple sequential programs
    • J.J. Grefenstette (Ed.), Lawrence Erlbaum Associates: Hillsdale, NJ
    • Cramer, N.L. 1985. A representation for the adaptive generation of simple sequential programs. In Proceedings of an International Conference on Genetic Algorithms and Their Applications, J.J. Grefenstette (Ed.), Lawrence Erlbaum Associates: Hillsdale, NJ, pp. 183-187.
    • (1985) Proceedings of an International Conference on Genetic Algorithms and Their Applications , pp. 183-187
    • Cramer, N.L.1
  • 11
    • 84899026236 scopus 로고    scopus 로고
    • Finite-sample convergence rates for Q-learning and indirect algorithms
    • M. Kearns, S.A. Solla, and D. Cohn (Eds.), MIT Press: Cambridge, MA
    • Kearns, M. and Singh, S. 1999. Finite-sample convergence rates for Q-learning and indirect algorithms. In Advances in Neural Information Processing Systems 12, M. Kearns, S.A. Solla, and D. Cohn (Eds.), MIT Press: Cambridge, MA.
    • (1999) Advances in Neural Information Processing Systems 12 , vol.12
    • Kearns, M.1    Singh, S.2
  • 12
    • 0001334736 scopus 로고
    • Genetic evolution and co-evolution of computer programs
    • C.G. Langton, C. Taylor, J.D. Farmer, and S. Rasmussen (Eds.), Addison Wesley Publishing Company
    • Koza, J. R. 1992. Genetic evolution and co-evolution of computer programs. In Artificial Life II, C.G. Langton, C. Taylor, J.D. Farmer, and S. Rasmussen (Eds.), Addison Wesley Publishing Company, pp. 313-324.
    • (1992) Artificial Life II , vol.2 , pp. 313-324
    • Koza, J.R.1
  • 14
    • 0027684215 scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less time
    • Moore, A. and Atkeson, C.G. 1993. Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13:103-130.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.1    Atkeson, C.G.2
  • 15
    • 0001765492 scopus 로고
    • Simplifying neural networks by soft weight sharing
    • Nowlan, S.J. and Hinton, G.E. 1992. Simplifying neural networks by soft weight sharing. Neural Computation, 4:173-193.
    • (1992) Neural Computation , vol.4 , pp. 173-193
    • Nowlan, S.J.1    Hinton, G.E.2
  • 16
    • 0000955979 scopus 로고    scopus 로고
    • Incremental multi-step Q-learning
    • Peng, J. and Williams, R. 1996. Incremental multi-step Q-learning. Machine Learning, 22:283-290.
    • (1996) Machine Learning , vol.22 , pp. 283-290
    • Peng, J.1    Williams, R.2
  • 23
    • 0001201756 scopus 로고
    • Some studies in machine learning using the game of checkers
    • Samuel, A.L. 1959. Some studies in machine learning using the game of checkers. IBM Journal on Research and Development, 3:210-229.
    • (1959) IBM Journal on Research and Development , vol.3 , pp. 210-229
    • Samuel, A.L.1
  • 25
    • 0345161979 scopus 로고
    • Technical Report FKI-198-94, Fakultät für Informatik, Technische Universität München, Revised January 1995
    • Schmidhuber, J. 1995. On learning how to learn learning strategies. Technical Report FKI-198-94, Fakultät für Informatik, Technische Universität München, Revised January 1995.
    • (1995) On Learning How to Learn Learning Strategies
    • Schmidhuber, J.1
  • 26
    • 0000156236 scopus 로고    scopus 로고
    • Reinforcement learning with self-modifying policies
    • S. Thrun and L. Pratt (Eds.), Kluwer
    • Schmidhuber, J., Zhao, J., and Schraudolph, N. 1997a. Reinforcement learning with self-modifying policies. In Learning to Learn, S. Thrun and L. Pratt (Eds.), Kluwer, pp. 293-309.
    • (1997) Learning to Learn , pp. 293-309
    • Schmidhuber, J.1    Zhao, J.2    Schraudolph, N.3
  • 27
    • 0031186687 scopus 로고    scopus 로고
    • Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement
    • Schmidhuber, J., Zhao, J., and Wiering, M. 1997b. Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement. Machine Learning, 28:105-130.
    • (1997) Machine Learning , vol.28 , pp. 105-130
    • Schmidhuber, J.1    Zhao, J.2    Wiering, M.3
  • 28
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • Singh, S.P. and Sutton, R.S. 1996. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 29
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S. 1988. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 30
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • D.S. Touretzky, M.C. Mozer, and M.E. Hasselmo (Eds.), MIT Press: Cambridge, MA
    • Sutton, R.S. 1996. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, D.S. Touretzky, M.C. Mozer, and M.E. Hasselmo (Eds.), MIT Press: Cambridge, MA, pp. 1038-1045.
    • (1996) Advances in Neural Information Processing Systems 8 , vol.8 , pp. 1038-1045
    • Sutton, R.S.1
  • 32
    • 0032044899 scopus 로고    scopus 로고
    • A probabilistic approach to concurrent mapping and localization for mobile robots
    • Thrun, S., Fox, D., and Burgard, W 1998. A probabilistic approach to concurrent mapping and localization for mobile robots. Machine Learning, (31):29-53. Also appeared in Autonomous Robots, 5:253-271, 1998 as joint issue.
    • (1998) Machine Learning , Issue.31 , pp. 29-53
    • Thrun, S.1    Fox, D.2    Burgard, W.3
  • 33
    • 0032120083 scopus 로고    scopus 로고
    • as joint issue
    • Thrun, S., Fox, D., and Burgard, W 1998. A probabilistic approach to concurrent mapping and localization for mobile robots. Machine Learning, (31):29-53. Also appeared in Autonomous Robots, 5:253-271, 1998 as joint issue.
    • (1998) Autonomous Robots , vol.5 , pp. 253-271


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.