Adaptive Behavior, Volume 13, Issue 3, 2005, Pages 165-188

Reinforcement learning for RoboCup soccer keepaway

Author keywords

Machine learning; Multiagent learning; Multiagent systems; Reinforcement learning; Robot soccer

EID: 27544506565     PISSN: 1059-7123     EISSN: None     Source Type: Journal
DOI: 10.1177/105971230501300301     Document Type: Article
Times cited: 318

References (52)
  • 2
    • Andou, T. (1998). Refinement of soccer agents' positions using reinforcement learning. In H. Kitano (Ed.), RoboCup-97: Robot soccer world cup I (pp. 373-388). Berlin: Springer.
  • 3
    • Andre, D., & Russell, S. J. (2001). Programmable reinforcement learning agents. In T. K. Leen, T. G. Dietterich, & V. Tresp (Eds.), Advances in neural information processing systems (Vol. 13, pp. 1019-1025). Cambridge, MA: MIT Press.
  • 4
    • Andre, D., & Russell, S. J. (2002). State abstraction for programmable reinforcement learning agents. In R. Dechter, M. Kearns, & R. S. Sutton (Eds.), Proceedings of the 18th National Conference on Artificial Intelligence (pp. 119-125). Menlo Park, CA: AAAI Press.
  • 5
    • Andre, D., & Teller, A. (1999). Evolving team Darwin United. In M. Asada & H. Kitano (Eds.), RoboCup-98: Robot soccer world cup II (pp. 346-351). Berlin: Springer.
  • 6
    • Bagnell, J. A., & Schneider, J. (2001). Autonomous helicopter control using reinforcement learning policy search methods. In International Conference on Robotics and Automation (pp. 1615-1620). IEEE.
  • 7
    • Baird, L. C., & Moore, A. W. (1999). Gradient descent for general reinforcement learning. In M. J. Kearns, S. A. Solla, & D. A. Cohn (Eds.), Advances in neural information processing systems (Vol. 11, pp. 968-974). Cambridge, MA: MIT Press.
  • 10
    • Boutilier, C., Dean, T., & Hanks, S. (1999). Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11, 1-94.
  • 11
    • Bradtke, S. J., & Duff, M. O. (1995). Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, & T. Leen (Eds.), Advances in neural information processing systems (Vol. 7, pp. 393-400). San Mateo, CA: Morgan Kaufmann.
  • 13
    • Crites, R. H., & Barto, A. G. (1996). Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Advances in neural information processing systems (Vol. 8, pp. 1017-1023). Cambridge, MA: MIT Press.
  • 14
    • Dean, T., Basye, K., & Shewchuk, J. (1992). Reinforcement learning for planning and control. In S. Minton (Ed.), Machine learning methods for planning and scheduling (pp. 67-92). San Mateo, CA: Morgan Kaufmann.
  • 15
    • Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
  • 16
    • Gordon, G. (2001). Reinforcement learning with function approximation converges to a region. In T. K. Leen, T. G. Dietterich, & V. Tresp (Eds.), Advances in neural information processing systems (Vol. 13, pp. 1040-1046). Cambridge, MA: MIT Press.
  • 17
    • Guestrin, C., Koller, D., & Parr, R. (2002). Multiagent planning with factored MDPs. In T. G. Dietterich, S. Becker, & Z. Ghahramani (Eds.), Advances in neural information processing systems (Vol. 14, pp. 1523-1530). Cambridge, MA: MIT Press.
  • 18
    • Hsu, W. H., & Gustafson, S. M. (2002). Genetic programming and multi-agent layered learning by reinforcements. In W. B. Langdon et al. (Eds.), Genetic and Evolutionary Computation Conference (New York) (pp. 764-771). San Mateo, CA: Morgan Kaufmann.
  • 21
    • Lin, C.-S., & Kim, H. (1991). CMAC-based adaptive critic self-learning control. IEEE Transactions on Neural Networks, 2, 530-533.
  • 22
    • Luke, S., Hohn, C., Farris, J., Jackson, G., & Hendler, J. (1998). Co-evolving soccer softbot team coordination with genetic programming. In H. Kitano (Ed.), RoboCup-97: Robot soccer world cup I (pp. 398-411). Berlin: Springer.
  • 24
    • McAllester, D., & Stone, P. (2001). Keeping the ball from CMUnited-99. In P. Stone, T. Balch, & G. Kraetzschmar (Eds.), RoboCup-2000: Robot soccer world cup IV (pp. 333-338). Berlin: Springer.
  • 27
    • Perkins, T. J., & Precup, D. (2003). A convergent form of approximate policy iteration. In S. Becker, S. Thrun, & K. Obermayer (Eds.), Advances in neural information processing systems (Vol. 15, pp. 1595-1602). Cambridge, MA: MIT Press.
  • 31
    • Riedmiller, M., Merke, A., Meier, D., Hoffman, A., Sinner, A., Thate, O., & Ehrmann, R. (2001). Karlsruhe Brainstormers - A reinforcement learning approach to robotic soccer. In P. Stone, T. Balch, & G. Kraetzschmar (Eds.), RoboCup-2000: Robot soccer world cup IV (pp. 367-372). Berlin: Springer.
  • 33
    • Rummery, G. A., & Niranjan, M. (1994). On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Cambridge University Engineering Department.
  • 35
    • Stone, P., & McAllester, D. (2001). An architecture for action selection in robotic soccer. In E. Andre, S. Sen, C. Frasson, & J. P. Muller (Eds.), Proceedings of the Fifth International Conference on Autonomous Agents (pp. 316-323). New York, NY: ACM Press.
  • 36
    • Stone, P., & Sutton, R. S. (2001). Scaling reinforcement learning toward RoboCup soccer. In C. E. Brodley & A. P. Danyluk (Eds.), Proceedings of the Eighteenth International Conference on Machine Learning (pp. 537-544). San Francisco, CA: Morgan Kaufmann.
  • 37
    • Stone, P., & Sutton, R. S. (2002). Keepaway soccer: A machine learning testbed. In A. Birk, S. Coradeschi, & S. Tadokoro (Eds.), RoboCup-2001: Robot soccer world cup V (pp. 214-223). Berlin: Springer.
  • 38
    • Stone, P., Sutton, R. S., & Singh, S. (2001). Reinforcement learning for 3 vs. 2 keepaway. In P. Stone, T. Balch, & G. Kraetzschmar (Eds.), RoboCup-2000: Robot soccer world cup IV (pp. 249-258). Berlin: Springer.
  • 39
    • Stone, P., & Veloso, M. (1999). Team-partitioned, opaque-transition reinforcement learning. In M. Asada & H. Kitano (Eds.), RoboCup-98: Robot soccer world cup II (pp. 261-272). Berlin: Springer.
  • 40
    • Sutton, R. S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, & M. E. Hasselmo (Eds.), Advances in neural information processing systems (Vol. 8, pp. 1038-1044). Cambridge, MA: MIT Press.
  • 42
    • Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. In S. A. Solla, T. K. Leen, & K. R. Muller (Eds.), Advances in neural information processing systems (Vol. 12, pp. 1057-1063). Cambridge, MA: MIT Press.
  • 43
    • Sutton, R., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
  • 44
    • Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning (pp. 330-337). Morgan Kaufmann.
  • 45
    • Taylor, M. E., & Stone, P. (2005). Behavior transfer for value-function-based reinforcement learning. In V. Dignum, S. Koenig, S. Kraus, M. P. Singh, & M. Wooldridge (Eds.), The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems (pp. 53-59). New York, NY: ACM Press.
  • 46
    • Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(1), 215-219.
  • 47
    • Tsitsiklis, J. N., & Van Roy, B. (1997). An analysis of temporal-difference learning with function approximation. IEEE Transactions on Automatic Control, 42, 674-690.
  • 49
    • Uchibe, E., Yanase, M., & Asada, M. (2001). Evolution for behavior selection accelerated by activation/termination constraints. In H. Beyer, E. Cantú-Paz, D. Goldberg, Parmee, L. Spector, & D. Whitley (Eds.), Proceedings of the Genetic and Evolutionary Computation Conference (pp. 1122-1129). Morgan Kaufmann.
  • 50
    • Veloso, M., Stone, P., & Bowling, M. (1999). Anticipation as a key for collaboration in a team of agents: A case study in robotic soccer. In P. S. Schenker & G. T. McKee (Eds.), Proceedings of SPIE Sensor Fusion and Decentralized Control in Robotic Systems II (Vol. 3839) (Boston, MA). Bellingham, WA: SPIE.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.