메뉴 건너뛰기




Volumn 8, Issue , 2007, Pages 2125-2167

Transfer learning via inter-task mappings for temporal difference learning

Author keywords

Inter task mapping; Reinforcement learning; Temporal difference methods; Transfer learning; Value function approximation

Indexed keywords

APPROXIMATION THEORY; CONFORMAL MAPPING; FUNCTION EVALUATION; MATHEMATICAL MODELS; NEURAL NETWORKS; TEMPORAL LOGIC;

EID: 34848816477     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (243)

References (40)
  • 3
    • 84947424101 scopus 로고    scopus 로고
    • Evolving team Darwin United
    • Minoru Asada and Hiroaki Kitano, editors, Springer Verlag, Berlin
    • David Andre and Astro Teller. Evolving team Darwin United. In Minoru Asada and Hiroaki Kitano, editors, RoboCup-98: Robot Soccer World Cup II, pages 346-351. Springer Verlag, Berlin, 1999.
    • (1999) RoboCup-98: Robot Soccer World Cup II , pp. 346-351
    • Andre, D.1    Teller, A.2
  • 5
    • 85150714688 scopus 로고
    • Reinforcement learning methods for continuous-time Markov decision problems
    • G. Tesauro, D. Touretzky, and T. Leen, editors, San Mateo, CA, Morgan Kaufmann
    • Steven J. Bradtke and Michael O. Duff. Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 393-400, San Mateo, CA, 1995. Morgan Kaufmann.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-400
    • Bradtke, S.J.1    Duff, M.O.2
  • 6
    • 34848884838 scopus 로고    scopus 로고
    • Mao Chen, Ehsan Foroughi, Fredrik Heintz, Spiros Kapetanakis, Kostas Kostiadis, Johan Kummeneje, Itsuki Noda, Oliver Obst, Patrick Riley, Timo Steffens, Yi Wang, and Xiang Yin. Users manual: RoboCup soccer server manual for soccer server version 7.07 and later, 2003. Available at http://sourceforge. net/projects/sserver/.
    • Mao Chen, Ehsan Foroughi, Fredrik Heintz, Spiros Kapetanakis, Kostas Kostiadis, Johan Kummeneje, Itsuki Noda, Oliver Obst, Patrick Riley, Timo Steffens, Yi Wang, and Xiang Yin. Users manual: RoboCup soccer server manual for soccer server version 7.07 and later, 2003. Available at http://sourceforge. net/projects/sserver/.
  • 7
    • 24844475430 scopus 로고
    • Robot Shaping: Developing Situated Agents through Learning
    • Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA
    • Marco Colombetti and Marco Dorigo. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA, 1993.
    • (1993)
    • Colombetti, M.1    Dorigo, M.2
  • 8
    • 85156187730 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Cambridge, MA, MIT Press
    • Robert H. Crites and Andrew G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1017-1023, Cambridge, MA, 1996. MIT Press.
    • (1996) Advances in Neural Information Processing Systems 8 , pp. 1017-1023
    • Crites, R.H.1    Barto, A.G.2
  • 9
    • 0043247546 scopus 로고    scopus 로고
    • Accelerating reinforcement learning by composing solutions of automatically identified subtasks
    • Chris Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research, 16:59-104, 2002.
    • (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
    • Drummond, C.1
  • 10
    • 22944468731 scopus 로고    scopus 로고
    • Approximate policy iteration with a policy language bias
    • Sebastian Thrun, Lawrence Saul, and Bernhard Schölkopf, editors, MIT Press, Cambridge, MA
    • Alan Fern, Sungwook Yoon, and Robert Givan. Approximate policy iteration with a policy language bias. In Sebastian Thrun, Lawrence Saul, and Bernhard Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
    • (2004) Advances in Neural Information Processing Systems 16
    • Fern, A.1    Yoon, S.2    Givan, R.3
  • 19
    • 27344432348 scopus 로고    scopus 로고
    • Accelerating reinforcement learning through implicit imitation
    • Bob Price and Craig Boutilier. Accelerating reinforcement learning through implicit imitation. Journal of Artificial Intelligence Research, 19:569-629, 2003.
    • (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 569-629
    • Price, B.1    Boutilier, C.2
  • 21
    • 84867471400 scopus 로고    scopus 로고
    • Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer
    • Peter Stone, Tucker Balch, and Gerhard Kraetszchmar, editors, Springer Verlag, Berlin
    • Martin Riedmiller, Author Merke, David Meier, Andreas Hoffman, Alex Sinner, Ortwin Thate, and Ralf Ehrmann. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In Peter Stone, Tucker Balch, and Gerhard Kraetszchmar, editors, RoboCup-2000: Robot Soccer World Cup IV, pages 367-372. Springer Verlag, Berlin, 2001.
    • (2001) RoboCup-2000: Robot Soccer World Cup IV , pp. 367-372
    • Riedmiller, M.1    Merke, A.2    Meier, D.3    Hoffman, A.4    Sinner, A.5    Thate, O.6    Ehrmann, R.7
  • 22
    • 0003636089 scopus 로고
    • On-line Q-learning using connectionist systems
    • Engineering Department, Cambridge University
    • Gavin Rummery and Mahesan Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG-RT 116, Engineering Department, Cambridge University, 1994.
    • (1994) Technical Report CUED/F-INFENG-RT , vol.116
    • Rummery, G.1    Niranjan, M.2
  • 24
    • 0001027894 scopus 로고
    • Transfer of learning by composing solutions of elemental sequential tasks
    • Satinder P. Singh. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8:323-339, 1992.
    • (1992) Machine Learning , vol.8 , pp. 323-339
    • Singh, S.P.1
  • 25
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • Satinder P. Singh and Richard S. Sutton. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158, 1996.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 28
    • 84867470253 scopus 로고    scopus 로고
    • Keepaway soccer: A machine learning testbed
    • Andreas Birk, Silvia Coradeschi, and Satoshi Tadokoro, editors, RoboCup-2001: Robot Soccer World Cup V, of, Springer Verlag, Berlin
    • Peter Stone and Richard S. Sutton. Keepaway soccer: a machine learning testbed. In Andreas Birk, Silvia Coradeschi, and Satoshi Tadokoro, editors, RoboCup-2001: Robot Soccer World Cup V, volume 2377 of Lecture Notes in Artificial Intelligence, pages 214-223. Springer Verlag, Berlin, 2002.
    • (2002) Lecture Notes in Artificial Intelligence , vol.2377 , pp. 214-223
    • Stone, P.1    Sutton, R.S.2
  • 29
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 30
    • 37249034293 scopus 로고    scopus 로고
    • Keepaway soccer: From machine learning testbed to benchmark
    • Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, Springer Verlag, Berlin
    • Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu. Keepaway soccer: From machine learning testbed to benchmark. In Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, RoboCup-2005: Robot Soccer World Cup IX, volume 4020, pages 93-105. Springer Verlag, Berlin, 2006.
    • (2006) RoboCup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
    • Stone, P.1    Kuhlmann, G.2    Taylor, M.E.3    Liu, Y.4
  • 33
    • 27544473171 scopus 로고    scopus 로고
    • Behavior transfer for value-function-based reinforcement learning
    • Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, New York, NY, July, ACM Press
    • Matthew E. Taylor and Peter Stone. Behavior transfer for value-function-based reinforcement learning. In Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.
    • (2005) The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 53-59
    • Taylor, M.E.1    Stone, P.2
  • 37
    • 34848888015 scopus 로고    scopus 로고
    • Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
    • Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.