



Volume , Issue , 2005, Pages 201-207

Behavior transfer for value-function-based reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING METHODS; TEMPORAL DIFFERENCE (TD); TRAINING TIME;

EID: 33644807975     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (28)

References (23)
  • 3
    • 0141727204
    • Evolving team Darwin United
    • M. Asada and H. Kitano, editors Springer Verlag, Berlin
    • D. Andre and A. Teller. Evolving team Darwin United. In M. Asada and H. Kitano, editors, RoboCup-98: Robot Soccer World Cup II. Springer Verlag, Berlin, 1999.
    • (1999) RoboCup-98: Robot Soccer World Cup II
    • Andre, D.1    Teller, A.2
  • 6
    • 24844475430
    • Robot shaping: Developing situated agents through learning
    • International Computer Science Institute, Berkeley, CA
    • M. Colombetti and M. Dorigo. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA, 1993.
    • (1993) Technical Report , vol.TR-92-040
    • Colombetti, M.1    Dorigo, M.2
  • 7
    • 0043247546
    • Accelerating reinforcement learning by composing solutions of automatically identified subtasks
    • C. Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research, 16:59-104, 2002.
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
    • Drummond, C.1
  • 9
    • 22944468731
    • Approximate policy iteration with a policy language bias
    • S. Thrun, L. Saul, and B. Schölkopf, editors. MIT Press, Cambridge, MA
    • A. Fern, S. Yoon, and R. Givan. Approximate policy iteration with a policy language bias. In S. Thrun, L. Saul, and B. Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
    • (2004) Advances in Neural Information Processing Systems , vol.16
    • Fern, A.1    Yoon, S.2    Givan, R.3
  • 16
    • 0003229379
    • Karlsruhe brainstormers - A reinforcement learning approach to robotic soccer
    • P. Stone, T. Balch, and G. Kraetzschmar, editors. Springer Verlag, Berlin
    • M. Riedmiller, A. Merke, D. Meier, A. Hoffman, A. Sinner, O. Thate, and R. Ehrmann. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In P. Stone, T. Balch, and G. Kraetzschmar, editors, RoboCup-2000: Robot Soccer World Cup IV. Springer Verlag, Berlin, 2001.
    • (2001) RoboCup-2000: Robot Soccer World Cup IV
    • Riedmiller, M.1    Merke, A.2    Meier, D.3    Hoffman, A.4    Sinner, A.5    Thate, O.6    Ehrmann, R.7
  • 18
    • 0001027894
    • Transfer of learning by composing solutions of elemental sequential tasks
    • S. P. Singh. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8:323-339, 1992.
    • (1992) Machine Learning , vol.8 , pp. 323-339
    • Singh, S.P.1
  • 20
    • 84944901151
    • The CMUnited-99 champion simulator team
    • M. Veloso, E. Pagello, and H. Kitano, editors. Springer, Berlin
    • P. Stone, P. Riley, and M. Veloso. The CMUnited-99 champion simulator team. In M. Veloso, E. Pagello, and H. Kitano, editors, RoboCup-99: Robot Soccer World Cup III, pages 35-48. Springer, Berlin, 2000.
    • (2000) RoboCup-99: Robot Soccer World Cup III , pp. 35-48
    • Stone, P.1    Riley, P.2    Veloso, M.3
  • 21
    • 27544506565
    • Reinforcement learning for RoboCup-soccer keepaway
    • To appear
    • P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 2005. To appear.
    • (2005) Adaptive Behavior
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 23
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.