메뉴 건너뛰기




Volumn 2, Issue , 2005, Pages 880-885

Value functions for RL-based behavior transfer: A comparative study

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIOR TRANSFER; TEMPORAL DIFFERENCE (TD);

EID: 29444435242     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (49)

References (25)
  • 3
    • 0141727204 scopus 로고    scopus 로고
    • Evolving team Darwin United
    • Asada, M., and Kitano, H., eds. Berlin: Springer Verlag
    • Andre, D., and Teller, A. 1999. Evolving team Darwin United. In Asada, M., and Kitano, H., eds., RoboCup-98: Robot Soccer World Cup II. Berlin: Springer Verlag.
    • (1999) RoboCup-98: Robot Soccer World Cup II
    • Andre, D.1    Teller, A.2
  • 5
    • 24844475430 scopus 로고
    • Robot Shaping: Developing Situated Agents through Learning
    • International Computer Science Institute, Berkeley, CA
    • Colombetti, M., and Dorigo, M. 1993. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA.
    • (1993) Technical Report , vol.TR-92-040
    • Colombetti, M.1    Dorigo, M.2
  • 6
    • 0003259931 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds. Cambridge, MA: MIT Press
    • Crites, R. H., and Barto, A. G. 1996. Improving elevator performance using reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8
    • Crites, R.H.1    Barto, A.G.2
  • 8
    • 0043247546 scopus 로고    scopus 로고
    • Accelerating reinforcement learning by composing solutions of automatically identified subtasks
    • Drummond, C. 2002. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research 16:59-104.
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
    • Drummond, C.1
  • 10
    • 22944468731 scopus 로고    scopus 로고
    • Approximate policy iteration with a policy language bias
    • Thrun, S.; Saul, L.; and Schölkopf, B., eds. Cambridge, MA: MIT Press
    • Fern, A.; Yoon, S.; and Givan, R. 2004. Approximate policy iteration with a policy language bias. In Thrun, S.; Saul, L.; and Schölkopf, B., eds., Advances in Neural Information Processing Systems 16. Cambridge, MA: MIT Press.
    • (2004) Advances in Neural Information Processing Systems , vol.16
    • Fern, A.1    Yoon, S.2    Givan, R.3
  • 15
  • 17
    • 0003229379 scopus 로고    scopus 로고
    • Karlsruhe brainstormers - A reinforcement learning approach to robotic soccer
    • Stone, P.; Balch, T.; and Kraetszchmar, G., eds. Berlin: Springer Verlag
    • Riedmiller, M.; Merke, A.; Meier, D.; Hoffman, A.; Sinner, A.; Thate, O.; and Ehrmann, R. 2001. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In Stone, P.; Balch, T.; and Kraetszchmar, G., eds., RoboCup-2000: Robot Soccer World Cup IV. Berlin: Springer Verlag.
    • (2001) RoboCup-2000: Robot Soccer World Cup IV
    • Riedmiller, M.1    Merke, A.2    Meier, D.3    Hoffman, A.4    Sinner, A.5    Thate, O.6    Ehrmann, R.7
  • 19
    • 0001027894 scopus 로고
    • Transfer of learning by composing solutions of elemental sequential tasks
    • Singh, S. P. 1992. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning 8:323-339.
    • (1992) Machine Learning , vol.8 , pp. 323-339
    • Singh, S.P.1
  • 21
    • 84944901151 scopus 로고    scopus 로고
    • The CMUnited-99 champion simulator team
    • Veloso, M.; Pagello, E.; and Kitano, H., eds. Berlin: Springer
    • Stone, P.; Riley, P.; and Veloso, M. 2000. The CMUnited-99 champion simulator team. In Veloso, M.; Pagello, E.; and Kitano, H., eds., RoboCup-99: Robot Soccer World Cup III. Berlin: Springer. 35-48.
    • (2000) RoboCup-99: Robot Soccer World Cup III , pp. 35-48
    • Stone, P.1    Riley, P.2    Veloso, M.3
  • 22
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • To appear
    • Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior. To appear.
    • (2005) Adaptive Behavior
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 25
    • 0000985504 scopus 로고
    • TD-Gammon, a self-teaching backgammon program, achieves master-level play
    • Tesauro, G. 1994. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation 6(2):215-219.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.