SCOPUS Information Retrieval Platform

Volume 8, 2007, Pages 2125-2167

Transfer learning via inter-task mappings for temporal difference learning

(3) Taylor, Matthew E.; Stone, Peter; Liu, Yaxin

Author keywords

Inter task mapping; Reinforcement learning; Temporal difference methods; Transfer learning; Value function approximation

Indexed keywords

APPROXIMATION THEORY; CONFORMAL MAPPING; FUNCTION EVALUATION; MATHEMATICAL MODELS; NEURAL NETWORKS; TEMPORAL LOGIC;

INTERTASK MAPPING; TEMPORAL DIFFERENCE METHODS; TRANSFER LEARNING; VALUE FUNCTION APPROXIMATION;

REINFORCEMENT LEARNING;

EID: 34848816477 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (248)

References (40)

1
- 0003942195
- Byte Books, Peterborough, NH
- James S. Albus. Brains, Behavior, and Robotics. Byte Books, Peterborough, NH, 1981.
- (1981) Brains, Behavior, and Robotics
- Albus, J.S.¹

2
- 0036927201
- State abstraction for programmable reinforcement learning agents
- David Andre and Stuart J. Russell. State abstraction for programmable reinforcement learning agents. In Proc. of the Eighteenth National Conference on Artificial Intelligence, pages 119-125, 2002.
- (2002) Proc. of the Eighteenth National Conference on Artificial Intelligence , pp. 119-125
- Andre, D.¹ Russell, S.J.²

3
- 84947424101
- Evolving team Darwin United
- Minoru Asada and Hiroaki Kitano, editors, Springer Verlag, Berlin
- David Andre and Astro Teller. Evolving team Darwin United. In Minoru Asada and Hiroaki Kitano, editors, RoboCup-98: Robot Soccer World Cup II, pages 346-351. Springer Verlag, Berlin, 1999.
- (1999) RoboCup-98: Robot Soccer World Cup II , pp. 346-351
- Andre, D.¹ Teller, A.²

4
- 0006221144
- Vision-based behavior acquisition for a shooting robot by using a reinforcement learning
- Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, and Koh Hosoda. Vision-based behavior acquisition for a shooting robot by using a reinforcement learning. In Proc. of IAPR/IEEE Workshop on Visual Behaviors-1994, pages 112-118, 1994.
- (1994) Proc. of IAPR/IEEE Workshop on Visual Behaviors-1994 , pp. 112-118
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

5
- 85150714688
- Reinforcement learning methods for continuous-time Markov decision problems
- G. Tesauro, D. Touretzky, and T. Leen, editors, San Mateo, CA, Morgan Kaufmann
- Steven J. Bradtke and Michael O. Duff. Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 393-400, San Mateo, CA, 1995. Morgan Kaufmann.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-400
- Bradtke, S.J.¹ Duff, M.O.²

6
- 34848884838
- Mao Chen, Ehsan Foroughi, Fredrik Heintz, Spiros Kapetanakis, Kostas Kostiadis, Johan Kummeneje, Itsuki Noda, Oliver Obst, Patrick Riley, Timo Steffens, Yi Wang, and Xiang Yin. Users manual: RoboCup soccer server manual for soccer server version 7.07 and later, 2003. Available at http://sourceforge.net/projects/sserver/.

7
- 24844475430
- Robot Shaping: Developing Situated Agents through Learning
- Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA
- Marco Colombetti and Marco Dorigo. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA, 1993.
- (1993)
- Colombetti, M.¹ Dorigo, M.²

8
- 85156187730
- Improving elevator performance using reinforcement learning
- D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Cambridge, MA, MIT Press
- Robert H. Crites and Andrew G. Barto. Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1017-1023, Cambridge, MA, 1996. MIT Press.
- (1996) Advances in Neural Information Processing Systems 8 , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

9
- 0043247546
- Accelerating reinforcement learning by composing solutions of automatically identified subtasks
- Chris Drummond. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research, 16:59-104, 2002.
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
- Drummond, C.¹

10
- 22944468731
- Approximate policy iteration with a policy language bias
- Sebastian Thrun, Lawrence Saul, and Bernhard Schölkopf, editors, MIT Press, Cambridge, MA
- Alan Fern, Sungwook Yoon, and Robert Givan. Approximate policy iteration with a policy language bias. In Sebastian Thrun, Lawrence Saul, and Bernhard Schölkopf, editors, Advances in Neural Information Processing Systems 16. MIT Press, Cambridge, MA, 2004.
- (2004) Advances in Neural Information Processing Systems 16
- Fern, A.¹ Yoon, S.² Givan, R.³

11
- 34247199512
- Probabilistic policy reuse in a reinforcement learning agent
- Fernando Fernandez and Manuela Veloso. Probabilistic policy reuse in a reinforcement learning agent. In Proceedings of the 5th International Conference on Autonomous Agents and Multiagent Systems, pages 720-727, 2006.
- (2006) Proceedings of the 5th International Conference on Autonomous Agents and Multiagent Systems , pp. 720-727
- Fernandez, F.¹ Veloso, M.²

12
- 84982556581
- Analogical problem-solving
- Mary L. Gick and Keith J. Holyoak. Analogical problem-solving. Cognitive Psychology, 12:306-355, 1980.
- (1980) Cognitive Psychology , vol.12 , pp. 306-355
- Gick, M.L.¹ Holyoak, K.J.²

13
- 84880803349
- Generalizing plans to new environments in relational MDPs
- Acapulco, Mexico, August
- Carlos Guestrin, Daphne Koller, Chris Gearhart, and Neal Kanodia. Generalizing plans to new environments in relational MDPs. In International Joint Conference on Artificial Intelligence (IJCAI-03), Acapulco, Mexico, August 2003.
- (2003) International Joint Conference on Artificial Intelligence (IJCAI-03)
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

14
- 33749243349
- Autonomous shaping: Knowledge transfer in reinforcement learning
- George Konidaris and Andrew Barto. Autonomous shaping: Knowledge transfer in reinforcement learning. In Proceedings of the 23rd International Conference on Machine Learning, pages 489-496, 2006.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 489-496
- Konidaris, G.¹ Barto, A.²

15
- 33750742257
- Value-function-based transfer for reinforcement learning using structure mapping
- July
- Yaxin Liu and Peter Stone. Value-function-based transfer for reinforcement learning using structure mapping. In Proceedings of the Twenty-First National Conference on Artificial Intelligence, pages 415-420, July 2006.
- (2006) Proceedings of the Twenty-First National Conference on Artificial Intelligence , pp. 415-420
- Liu, Y.¹ Stone, P.²

16
- 84957895797
- Reward functions for accelerated learning
- Maja J. Mataric. Reward functions for accelerated learning. In International Conference on Machine Learning, pages 181-189, 1994.
- (1994) International Conference on Machine Learning , pp. 181-189
- Mataric, M.J.¹

17
- 0003496531
- MIT Press, Cambridge, MA, USA, ISBN 0-262-13328-8
- Kishan Mehrotra, Chilukuri K. Mohan, and Sanjay Ranka. Elements of Artificial Neural Networks. MIT Press, Cambridge, MA, USA, 1997. ISBN 0-262-13328-8.
- (1997) Elements of Artificial Neural Networks
- Mehrotra, K.¹ Mohan, C.K.² Ranka, S.³

18
- 0032021222
- Soccer server: A tool for research on multiagent systems
- Itsuki Noda, Hitoshi Matsubara, Kazuo Hiraki, and Ian Frank. Soccer server: A tool for research on multiagent systems. Applied Artificial Intelligence, 12:233-250, 1998.
- (1998) Applied Artificial Intelligence , vol.12 , pp. 233-250
- Noda, I.¹ Matsubara, H.² Hiraki, K.³ Frank, I.⁴

19
- 27344432348
- Accelerating reinforcement learning through implicit imitation
- Bob Price and Craig Boutilier. Accelerating reinforcement learning through implicit imitation. Journal of Artificial Intelligence Research, 19:569-629, 2003.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 569-629
- Price, B.¹ Boutilier, C.²

20
- 0003998452
- John Wiley & Sons, Inc, ISBN 0471619779
- Martin L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc., 1994. ISBN 0471619779.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

21
- 84867471400
- Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer
- Peter Stone, Tucker Balch, and Gerhard Kraetzschmar, editors, Springer Verlag, Berlin
- Martin Riedmiller, Artur Merke, David Meier, Andreas Hoffman, Alex Sinner, Ortwin Thate, and Ralf Ehrmann. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In Peter Stone, Tucker Balch, and Gerhard Kraetzschmar, editors, RoboCup-2000: Robot Soccer World Cup IV, pages 367-372. Springer Verlag, Berlin, 2001.
- (2001) RoboCup-2000: Robot Soccer World Cup IV , pp. 367-372
- Riedmiller, M.¹ Merke, A.² Meier, D.³ Hoffman, A.⁴ Sinner, A.⁵ Thate, O.⁶ Ehrmann, R.⁷

22
- 0003636089
- On-line Q-learning using connectionist systems
- Engineering Department, Cambridge University
- Gavin Rummery and Mahesan Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG-RT 116, Engineering Department, Cambridge University, 1994.
- (1994) Technical Report CUED/F-INFENG-RT , vol.116
- Rummery, G.¹ Niranjan, M.²

23
- 0344752303
- Training and tracking in robotics
- Oliver G. Selfridge, Richard S. Sutton, and Andrew G. Barto. Training and tracking in robotics. In Proceedings of the Ninth International Joint Conference on Artificial Intelligence, pages 670-672, 1985.
- (1985) Proceedings of the Ninth International Joint Conference on Artificial Intelligence , pp. 670-672
- Selfridge, O.G.¹ Sutton, R.S.² Barto, A.G.³

24
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- Satinder P. Singh. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8:323-339, 1992.
- (1992) Machine Learning , vol.8 , pp. 323-339
- Singh, S.P.¹

25
- 0029753630
- Reinforcement learning with replacing eligibility traces
- Satinder P. Singh and Richard S. Sutton. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158, 1996.
- (1996) Machine Learning , vol.22 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

26
- 0004144751
- Collier-Macmillan, ISBN 0029290406
- Burrhus F. Skinner. Science and Human Behavior. Collier-Macmillan, 1953. ISBN 0029290406.
- (1953) Science and Human Behavior
- Skinner, B.F.¹

27
- 33750690679
- Using homomorphisms to transfer options across continuous reinforcement learning domains
- July
- Vishal Soni and Satinder Singh. Using homomorphisms to transfer options across continuous reinforcement learning domains. In Proceedings of the Twenty First National Conference on Artificial Intelligence, July 2006.
- (2006) Proceedings of the Twenty First National Conference on Artificial Intelligence
- Soni, V.¹ Singh, S.²

28
- 84867470253
- Keepaway soccer: A machine learning testbed
- Andreas Birk, Silvia Coradeschi, and Satoshi Tadokoro, editors, RoboCup-2001: Robot Soccer World Cup V, Springer Verlag, Berlin
- Peter Stone and Richard S. Sutton. Keepaway soccer: a machine learning testbed. In Andreas Birk, Silvia Coradeschi, and Satoshi Tadokoro, editors, RoboCup-2001: Robot Soccer World Cup V, volume 2377 of Lecture Notes in Artificial Intelligence, pages 214-223. Springer Verlag, Berlin, 2002.
- (2002) Lecture Notes in Artificial Intelligence , vol.2377 , pp. 214-223
- Stone, P.¹ Sutton, R.S.²

29
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
- (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

30
- 37249034293
- Keepaway soccer: From machine learning testbed to benchmark
- Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, Springer Verlag, Berlin
- Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu. Keepaway soccer: From machine learning testbed to benchmark. In Itsuki Noda, Adam Jacoff, Ansgar Bredenfeld, and Yasutake Takahashi, editors, RoboCup-2005: Robot Soccer World Cup IX, volume 4020, pages 93-105. Springer Verlag, Berlin, 2006.
- (2006) RoboCup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
- Stone, P.¹ Kuhlmann, G.² Taylor, M.E.³ Liu, Y.⁴

31
- 0003420416
- MIT Press, ISBN 0262193981
- Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998. ISBN 0262193981.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

32
- 84880892531
- An experts algorithm for transfer learning
- Erik Talvitie and Satinder Singh. An experts algorithm for transfer learning. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, 2007.
- (2007) Proceedings of the Twentieth International Joint Conference on Artificial Intelligence
- Talvitie, E.¹ Singh, S.²

33
- 27544473171
- Behavior transfer for value-function-based reinforcement learning
- Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, New York, NY, July, ACM Press
- Matthew E. Taylor and Peter Stone. Behavior transfer for value-function-based reinforcement learning. In Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.
- (2005) The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 53-59
- Taylor, M.E.¹ Stone, P.²

34
- 38349005230
- Cross-domain transfer for reinforcement learning
- June
- Matthew E. Taylor and Peter Stone. Cross-domain transfer for reinforcement learning. In Proceedings of the Twenty-Fourth International Conference on Machine Learning, June 2007.
- (2007) Proceedings of the Twenty-Fourth International Conference on Machine Learning
- Taylor, M.E.¹ Stone, P.²

35
- 29444435242
- Value functions for RL-based behavior transfer: A comparative study
- July
- Matthew E. Taylor, Peter Stone, and Yaxin Liu. Value functions for RL-based behavior transfer: A comparative study. In Proceedings of the Twentieth National Conference on Artificial Intelligence, July 2005.
- (2005) Proceedings of the Twentieth National Conference on Artificial Intelligence
- Taylor, M.E.¹ Stone, P.² Liu, Y.³

36
- 60349107400
- Transfer via inter-task mappings in policy search reinforcement learning
- May
- Matthew E. Taylor, Shimon Whiteson, and Peter Stone. Transfer via inter-task mappings in policy search reinforcement learning. In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems, May 2007.
- (2007) The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems
- Taylor, M.E.¹ Whiteson, S.² Stone, P.³

37
- 34848888015
- Gerald Tesauro. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.

38
- 33750366539
- Using advice to transfer knowledge acquired in one reinforcement learning task to another
- Lisa Torrey, Trevor Walker, Jude Shavlik, and Richard Maclin. Using advice to transfer knowledge acquired in one reinforcement learning task to another. In Proceedings of the Sixteenth European Conference on Machine Learning, 2005.
- (2005) Proceedings of the Sixteenth European Conference on Machine Learning
- Torrey, L.¹ Walker, T.² Shavlik, J.³ Maclin, R.⁴

39
- 0004049893
- PhD thesis, King's College, Cambridge, UK
- Christopher J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, King's College, Cambridge, UK, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

40
- 34547994508
- Multi-task reinforcement learning: A hierarchical Bayesian approach
- New York, NY, USA, ACM Press
- Aaron Wilson, Alan Fern, Soumya Ray, and Prasad Tadepalli. Multi-task reinforcement learning: a hierarchical Bayesian approach. In ICML '07: Proceedings of the 24th international conference on Machine learning, pages 1015-1022, New York, NY, USA, 2007. ACM Press.
- (2007) ICML '07: Proceedings of the 24th international conference on Machine learning , pp. 1015-1022
- Wilson, A.¹ Fern, A.² Ray, S.³ Tadepalli, P.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.