SCOPUS 정보 검색 플랫폼

ACM International Conference Proceeding Series

Volumn 148, Issue , 2006, Pages 489-496

Autonomous shaping: Knowledge transfer in reinforcement learning

(2) Konidaris, George a Barto, Andrew a

a Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AUTONOMOUS AGENTS; PREDICTIVE CONTROL SYSTEMS; REINFORCEMENT LEARNING;

ROD POSITIONING TASK; SHAPING FUNCTION;

KNOWLEDGE ACQUISITION;

EID: 34250719248 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1143844.1143906 Document Type: Conference Paper

Times cited : (94)

References (24)

1
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A., Bradtke, S., & Singh, S. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.¹ Bradtke, S.² Singh, S.³

2
- 33749244036
- Reusing old policies to accelerate learning on new MDPs
- UM-CS-1999-026, Department of Computer Science, University of Massachusetts at Amherst
- Bernstein, D. (1999). Reusing old policies to accelerate learning on new MDPs (Technical Report UM-CS-1999-026). Department of Computer Science, University of Massachusetts at Amherst.
- (1999) Technical Report
- Bernstein, D.¹

3
- 0003782780
- MIT Press/Bradford Books
- Dorigo, M., & Colombetti, M. (1998). Robot shaping: An experiment in behavior engineering. MIT Press/Bradford Books.
- (1998) Robot shaping: An experiment in behavior engineering
- Dorigo, M.¹ Colombetti, M.²

4
- 0002479021
- Exploring unknown environments with real-time search or reinforcement learning
- Koenig, S. (1999). Exploring unknown environments with real-time search or reinforcement learning. Advances in Neural Information Processing Systems (NIPS) 12 (pp. 1003-1009).
- (1999) Advances in Neural Information Processing Systems (NIPS) , vol.12 , pp. 1003-1009
- Koenig, S.¹

5
- 0029751419
- The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
- Koenig, S., & Simmons, R. (1996). The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms. Machine Learning, 22, 227-250.
- (1996) Machine Learning , vol.22 , pp. 227-250
- Koenig, S.¹ Simmons, R.²

6
- 33749264347
- Estimating future reward in reinforcement learning animals using associative learning
- Konidaris, G., & Hayes, G. (2004). Estimating future reward in reinforcement learning animals using associative learning. From Animals to Animals 8: Proceedings of the 8th International Conference on the Simulation of Adaptive Behavior (pp. 297-304).
- (2004) From Animals to Animals 8: Proceedings of the 8th International Conference on the Simulation of Adaptive Behavior , pp. 297-304
- Konidaris, G.¹ Hayes, G.²

7
- 0025400088
- Real-time heuristic search
- Korf, R. (1990). Real-time heuristic search. Artificial Intelligence, 42, 189-211.
- (1990) Artificial Intelligence , vol.42 , pp. 189-211
- Korf, R.¹

8
- 31844433360
- Proto-value functions: Developmental reinforcement learning
- Mahadevan, S. (2005). Proto-value functions: Developmental reinforcement learning. Proceedings of the Twenty Second International Conference on Machine Learning (ICML 05).
- (2005) Proceedings of the Twenty Second International Conference on Machine Learning (ICML 05)
- Mahadevan, S.¹

9
- 0030647149
- Reinforcement learning in the multi-robot domain
- Matarić, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4, 73-83.
- (1997) Autonomous Robots , vol.4 , pp. 73-83
- Matarić, M.¹

10
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- Moore, A., & Atkeson, C. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103-130.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.²

11
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Ng, A., Harada, D., & Russell, S. (1999). Policy invariance under reward transformations: theory and application to reward shaping. Proceedings of the 16th International Conference on Machine Learning (pp. 278-287).
- (1999) Proceedings of the 16th International Conference on Machine Learning , pp. 278-287
- Ng, A.¹ Harada, D.² Russell, S.³

12
- 33749265081
- Robot shaping - principles, methods and architectures
- Perkins, S., & Hayes, G. (1996). Robot shaping - principles, methods and architectures. Artificial Intelligence and Simulation of Behaviour 1996 - Workshop on Learning in Robots and Animals.
- (1996) Artificial Intelligence and Simulation of Behaviour 1996 - Workshop on Learning in Robots and Animals
- Perkins, S.¹ Hayes, G.²

13
- 14344250461
- Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning
- Pickett, M., & Barto, A. (2002). Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning. Proceedings of the Nineteenth International Conference of Machine Learning (ICML 02) (pp. 506-513).
- (2002) Proceedings of the Nineteenth International Conference of Machine Learning (ICML 02) , pp. 506-513
- Pickett, M.¹ Barto, A.²

14
- 1642401055
- Learning to drive a bicycle using reinforcement learning and shaping
- Randløv, J., & Alstrøm, P. (1998). Learning to drive a bicycle using reinforcement learning and shaping. Proceedings of the 15th International Conference on Machine Learning (pp. 463-471).
- (1998) Proceedings of the 15th International Conference on Machine Learning , pp. 463-471
- Randløv, J.¹ Alstrøm, P.²

15
- 0344752303
- Training and tracking in robotics
- Selfridge, O., Sutton, R. S., & Barto, A. G. (1985). Training and tracking in robotics. Proceedings of the Ninth International Joint Conference on Artificial Intelligence (pp. 670-672).
- (1985) Proceedings of the Ninth International Joint Conference on Artificial Intelligence , pp. 670-672
- Selfridge, O.¹ Sutton, R.S.² Barto, A.G.³

16
- 0003880401
- New York: Appleton-Century-Crofts
- Skinner, B. F. (1938). The behavior of organisms: An experimental analysis. New York: Appleton-Century-Crofts.
- (1938) The behavior of organisms: An experimental analysis
- Skinner, B.F.¹

17
- 84974678409
- Layered learning
- Barcelona, Spain: Springer, Berlin
- Stone, P., & Veloso, M. (2000). Layered learning. Proceedings of the 11th European Conference on Machine Learning (pp. 369-381). Barcelona, Spain: Springer, Berlin.
- (2000) Proceedings of the 11th European Conference on Machine Learning , pp. 369-381
- Stone, P.¹ Veloso, M.²

18
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.¹ Barto, A.²

19
- 29444435242
- Value functions for RL-based behavior transfer: A comparative study
- Taylor, M., Stone, P., & Liu, Y. (2005). Value functions for RL-based behavior transfer: a comparative study. Proceedings of the Twentieth National Conference on Artificial Intelligence (AAAI-05).
- (2005) Proceedings of the Twentieth National Conference on Artificial Intelligence (AAAI-05)
- Taylor, M.¹ Stone, P.² Liu, Y.³

20
- 0003411271
- Efficient exploration in reinforcement learning
- CS-92-102, Carnegie Mellon University
- Thrun, S. (1992). Efficient exploration in reinforcement learning (Technical Report CS-92-102). Carnegie Mellon University.
- (1992) Technical Report
- Thrun, S.¹

21
- 33749882712
- Finding structure in reinforcement learning
- The MIT Press
- Thrun, S., & Schwartz, A. (1995). Finding structure in reinforcement learning. Advances in Neural Information Processing Systems (pp. 385-392). The MIT Press.
- (1995) Advances in Neural Information Processing Systems , pp. 385-392
- Thrun, S.¹ Schwartz, A.²

22
- 0003787427
- Doctoral dissertation, Massachusetts Institute of Technology
- Van Roy, B. (1998). Learning and value function approximation in complex decision processes. Doctoral dissertation, Massachusetts Institute of Technology.
- (1998) Learning and value function approximation in complex decision processes
- Van Roy, B.¹

23
- 0035951444
- Autonomous mental development by robots and animals
- Weng, J., McClelland, J., Pentland, A., Sporns, O., Stockman, I., Sur, M., & Thelen, E. (2000). Autonomous mental development by robots and animals. Science, 291, 599-600.
- (2000) Science , vol.291 , pp. 599-600
- Weng, J.¹ McClelland, J.² Pentland, A.³ Sporns, O.⁴ Stockman, I.⁵ Sur, M.⁶ Thelen, E.⁷

24
- 27344453198
- Potential-based shaping and Q-value initialization are equivalent
- Wiewiora, E. (2003). Potential-based shaping and Q-value initialization are equivalent. Journal of Artificial Intelligence Research, 19, 205-208.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 205-208
- Wiewiora, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.