SCOPUS 정보 검색 플랫폼

Proceedings of the National Conference on Artificial Intelligence

Volumn 2, Issue , 2005, Pages 880-885

Value functions for RL-based behavior transfer: A comparative study

(3) Taylor, Matthew E a Stone, Peter a Liu, Yaxin a

a University of Texas at Austin (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIOR TRANSFER; TEMPORAL DIFFERENCE (TD);

ALGORITHMS; APPROXIMATION THEORY; FUNCTIONS; LEARNING SYSTEMS;

BEHAVIORAL RESEARCH;

EID: 29444435242 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (49)

References (25)

1
- 0003942195
- Peterborough, NH: Byte Books
- Albus, J. S. 1981. Brains, Behavior, and Robotics. Peterborough, NH: Byte Books.
- (1981) Brains, Behavior, and Robotics
- Albus, J.S.¹

2
- 0036927201
- State abstraction for programmable reinforcement learning agents
- Andre, D., and Russell, S. J. 2002. State abstraction for programmable reinforcement learning agents. In Proceedings of the Eighteenth National Conference on Artificial Intelligence, 119-125.
- (2002) Proceedings of the Eighteenth National Conference on Artificial Intelligence , pp. 119-125
- Andre, D.¹ Russell, S.J.²

3
- 0141727204
- Evolving team Darwin United
- Asada, M., and Kitano, H., eds. Berlin: Springer Verlag
- Andre, D., and Teller, A. 1999. Evolving team Darwin United. In Asada, M., and Kitano, H., eds., RoboCup-98: Robot Soccer World Cup II. Berlin: Springer Verlag.
- (1999) RoboCup-98: Robot Soccer World Cup II
- Andre, D.¹ Teller, A.²

4
- 0006221144
- Vision-based behavior acquisition for a shooting robot by using a reinforcement learning
- Asada, M.; Noda, S.; Tawaratsumida, S.; and Hosoda, K. 1994. Vision-based behavior acquisition for a shooting robot by using a reinforcement learning. In Proc. of IAPK/IEEE Workshop on Visual Behaviors-1994, 112-118.
- (1994) Proc. of IAPK/IEEE Workshop on Visual Behaviors-1994 , pp. 112-118
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

5
- 24844475430
- Robot Shaping: Developing Situated Agents through Learning
- International Computer Science Institute, Berkeley, CA
- Colombetti, M., and Dorigo, M. 1993. Robot Shaping: Developing Situated Agents through Learning. Technical Report TR-92-040, International Computer Science Institute, Berkeley, CA.
- (1993) Technical Report , vol.TR-92-040
- Colombetti, M.¹ Dorigo, M.²

6
- 0003259931
- Improving elevator performance using reinforcement learning
- Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds. Cambridge, MA: MIT Press
- Crites, R. H., and Barto, A. G. 1996. Improving elevator performance using reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8
- Crites, R.H.¹ Barto, A.G.²

7
- 0242504775
- Master's thesis, University of Amsterdam, The Netherlands
- de Boer, R., and Kok, J. R. 2002. The incremental development of a synthetic multi-agent system: The uva trileam 2001 robotic soccer simulation team. Master's thesis, University of Amsterdam, The Netherlands.
- (2002) The Incremental Development of A Synthetic Multi-agent System: The Uva Trileam 2001 Robotic Soccer Simulation Team
- Boer, R.¹ Kok, J.R.²

8
- 0043247546
- Accelerating reinforcement learning by composing solutions of automatically identified subtasks
- Drummond, C. 2002. Accelerating reinforcement learning by composing solutions of automatically identified subtasks. Journal of Artificial Intelligence Research 16:59-104.
- (2002) Journal of Artificial Intelligence Research , vol.16 , pp. 59-104
- Drummond, C.¹

9
- 0035312760
- Relational reinforcement learning
- Dzeroski, S.; Raedt, L. D.; and Driessens, K. 2001. Relational reinforcement learning. Machine Learning 43:7-52.
- (2001) Machine Learning , vol.43 , pp. 7-52
- Dzeroski, S.¹ Raedt, L.D.² Driessens, K.³

10
- 22944468731
- Approximate policy iteration with a policy language bias
- Thrun, S.; Saul, L.; and Schölkopf, B., eds. Cambridge, MA: MIT Press
- Fern, A.; Yoon, S.; and Givan, R. 2004. Approximate policy iteration with a policy language bias. In Thrun, S.; Saul, L.; and Schölkopf, B., eds., Advances in Neural Information Processing Systems 16. Cambridge, MA: MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.16
- Fern, A.¹ Yoon, S.² Givan, R.³

11
- 84880803349
- Generalizing plans to new environments in relational mdps
- Guestrin, C.; Koller, D.; Gearhart, C.; and Kanodia, N. 2003. Generalizing plans to new environments in relational mdps. In International Joint Conference on Artificial Intelligence (IJCAI-03).
- (2003) International Joint Conference on Artificial Intelligence (IJCAI-03)
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

12
- 84957895797
- Reward functions for accelerated learning
- Mataric, M. J. 1994. Reward functions for accelerated learning. In International Conference on Machine Learning, 181-189.
- (1994) International Conference on Machine Learning , pp. 181-189
- Mataric, M.J.¹

13
- 4444326434
- Scaling up reinforcement learning with a relational representation
- Morales, E. F. 2003. Scaling up reinforcement learning with a relational representation. In Proc. of the Workshop on Adaptability in Multi-agent Systems.
- (2003) Proc. of the Workshop on Adaptability in Multi-agent Systems
- Morales, E.F.¹

14
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Ng, A. Y.; Harada, D.; and Russell, S. 1999. Policy invariance under reward transformations: Theory and application to reward shaping. In Proc. 16th International Conf. on Machine Learning.
- (1999) Proc. 16th International Conf. on Machine Learning
- Ng, A.Y.¹ Harada, D.² Russell, S.³

15
- 27344432348
- Accelerating reinforcement learning through implicit imitation
- Price, B., and Boutilier, C. 2003. Accelerating reinforcement learning through implicit imitation. Journal of Artificial Intelligence Research 19:569-629.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 569-629
- Price, B.¹ Boutilier, C.²

16
- 0003998452
- John Wiley & Sons, Inc.
- Puterman, M. L. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

17
- 0003229379
- Karlsruhe brainstormers - A reinforcement learning approach to robotic soccer
- Stone, P.; Balch, T.; and Kraetszchmar, G., eds. Berlin: Springer Verlag
- Riedmiller, M.; Merke, A.; Meier, D.; Hoffman, A.; Sinner, A.; Thate, O.; and Ehrmann, R. 2001. Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer. In Stone, P.; Balch, T.; and Kraetszchmar, G., eds., RoboCup-2000: Robot Soccer World Cup IV. Berlin: Springer Verlag.
- (2001) RoboCup-2000: Robot Soccer World Cup IV
- Riedmiller, M.¹ Merke, A.² Meier, D.³ Hoffman, A.⁴ Sinner, A.⁵ Thate, O.⁶ Ehrmann, R.⁷

18
- 0344752303
- Training and tracking in robotics
- Selfridge, O.; Sutton, R. S.; and Barto, A. G. 1985. Training and tracking in robotics. Proceedings of the Ninth International Joint Conference on Artificial Intelligence 670-672.
- (1985) Proceedings of the Ninth International Joint Conference on Artificial Intelligence , pp. 670-672
- Selfridge, O.¹ Sutton, R.S.² Barto, A.G.³

19
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- Singh, S. P. 1992. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning 8:323-339.
- (1992) Machine Learning , vol.8 , pp. 323-339
- Singh, S.P.¹

20
- 29444445207
- Keepaway soccer: From machine learning testbed to benchmark
- To appear
- Stone, P.; Kuhlmann, G.; Taylor, M.; and Liu, Y. 2005. Keepaway soccer: From machine learning testbed to benchmark. In Proceedings of RoboCup International Symposium. To appear.
- (2005) Proceedings of RoboCup International Symposium
- Stone, P.¹ Kuhlmann, G.² Taylor, M.³ Liu, Y.⁴

21
- 84944901151
- The CMUnited-99 champion simulator team
- Veloso, M.; Pagello, E.; and Kitano, H., eds. Berlin: Springer
- Stone, P.; Riley, P.; and Veloso, M. 2000. The CMUnited-99 champion simulator team. In Veloso, M.; Pagello, E.; and Kitano, H., eds., RoboCup-99: Robot Soccer World Cup III. Berlin: Springer. 35-48.
- (2000) RoboCup-99: Robot Soccer World Cup III , pp. 35-48
- Stone, P.¹ Riley, P.² Veloso, M.³

22
- 27544506565
- Reinforcement learning for RoboCup-soccer keepaway
- To appear
- Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior. To appear.
- (2005) Adaptive Behavior
- Stone, P.¹ Sutton, R.S.² Kuhlmann, G.³

23
- 0003420416
- MIT Press
- Sutton, R. S., and Barto, A. G. 1998. Introduction to Reinforcement Learning. MIT Press.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

24
- 33644807975
- Behavior transfer for value-function-based reinforcement learning
- To appear
- Taylor, M. E., and Stone, P. 2005. Behavior transfer for value-function-based reinforcement learning. In The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems. To appear.
- (2005) The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems
- Taylor, M.E.¹ Stone, P.²

25
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. 1994. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation 6(2):215-219.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.