SCOPUS Information Search Platform

Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference

Volume (none), Issue (none), 2009, Pages 1015-1023

Skill discovery in continuous reinforcement learning domains using skill chaining

(2) Konidaris, George (a); Barto, Andrew (a)

(a) The Manning College of Information and Computer Sciences (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS DOMAIN; CONTINUOUS REINFORCEMENT; PERFORMANCE BENEFITS; REINFORCEMENT LEARNINGS; REINFORCEMENT LEARNING

EID: 80055032021; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited: 290

References (26)

1
- 0037288370
- Recent advances in hierarchical reinforcement learning
- A.G. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:41-77, 2003. Special Issue on Reinforcement Learning.
- (2003) Discrete Event Dynamic Systems , vol.13 , pp. 41-77
- Barto, A.G.¹ Mahadevan, S.²

2
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R.S. Sutton, D. Precup, and S.P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

3
- 33745609140
- Intrinsically motivated reinforcement learning
- S. Singh, A.G. Barto, and N. Chentanez. Intrinsically motivated reinforcement learning. In Proceedings of the 18th Annual Conference on Neural Information Processing Systems, 2004.
- (2004) Proceedings of the 18th Annual Conference on Neural Information Processing Systems
- Singh, S.¹ Barto, A.G.² Chentanez, N.³

4
- 0004102479
- R.S. Sutton and A.G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

5
- 0004782095
- Learning hierarchical control structures for multiple tasks and changing environments
- B.L. Digney. Learning hierarchical control structures for multiple tasks and changing environments. In From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior. MIT Press, 1998.
- (1998) From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior
- Digney, B.L.¹

6
- 0013465187
- Automatic discovery of subgoals in reinforcement learning using diverse density
- A. McGovern and A.G. Barto. Automatic discovery of subgoals in reinforcement learning using diverse density. In Proceedings of the 18th International Conference on Machine Learning, pages 361-368, 2001.
- (2001) Proceedings of the 18th International Conference on Machine Learning , pp. 361-368
- McGovern, A.¹ Barto, A.G.²

7
- 14344261491
- Using relative novelty to identify useful temporal abstractions in reinforcement learning
- Ö. Şimşek and A.G. Barto. Using relative novelty to identify useful temporal abstractions in reinforcement learning. In Proceedings of the 21st International Conference on Machine Learning, pages 751-758, 2004.
- (2004) Proceedings of the 21st International Conference on Machine Learning , pp. 751-758
- Şimşek, Ö.¹ Barto, A.G.²

8
- 78651097494
- Skill characterization based on betweenness
- Ö. Şimşek and A.G. Barto. Skill characterization based on betweenness. In Advances in Neural Information Processing Systems 22, 2009.
- (2009) Advances in Neural Information Processing Systems , vol.22
- Şimşek, Ö.¹ Barto, A.G.²

9
- 84945250000
- Q-Cut - dynamic discovery of sub-goals in reinforcement learning
- I. Menache, S. Mannor, and N. Shimkin. Q-Cut - dynamic discovery of sub-goals in reinforcement learning. In Proceedings of the 13th European Conference on Machine Learning, pages 295-306, 2002.
- (2002) Proceedings of the 13th European Conference on Machine Learning , pp. 295-306
- Menache, I.¹ Mannor, S.² Shimkin, N.³

10
- 14344250635
- Dynamic abstraction in reinforcement learning via clustering
- S. Mannor, I. Menache, A. Hoze, and U. Klein. Dynamic abstraction in reinforcement learning via clustering. In Proceedings of the 21st International Conference on Machine Learning, pages 560-567, 2004.
- (2004) Proceedings of the 21st International Conference on Machine Learning , pp. 560-567
- Mannor, S.¹ Menache, I.² Hoze, A.³ Klein, U.⁴

11
- 31844447221
- Identifying useful subgoals in reinforcement learning by local graph partitioning
- Ö. Şimşek, A.P. Wolfe, and A.G. Barto. Identifying useful subgoals in reinforcement learning by local graph partitioning. In Proceedings of the 22nd International Conference on Machine Learning, 2005.
- (2005) Proceedings of the 22nd International Conference on Machine Learning
- Şimşek, Ö.¹ Wolfe, A.P.² Barto, A.G.³

12
- 0013465036
- Discovering hierarchy in reinforcement learning with HEXQ
- B. Hengst. Discovering hierarchy in reinforcement learning with HEXQ. In Proceedings of the 19th International Conference on Machine Learning, pages 243-250, 2002.
- (2002) Proceedings of the 19th International Conference on Machine Learning , pp. 243-250
- Hengst, B.¹

13
- 31844455449
- A causal approach to hierarchical decomposition of factored MDPs
- A. Jonsson and A.G. Barto. A causal approach to hierarchical decomposition of factored MDPs. In Proceedings of the 22nd International Conference on Machine Learning, 2005.
- (2005) Proceedings of the 22nd International Conference on Machine Learning
- Jonsson, A.¹ Barto, A.G.²

14
- 33749882712
- Finding structure in reinforcement learning
- S. Thrun and A. Schwartz. Finding structure in reinforcement learning. In Advances in Neural Information Processing Systems, volume 7, pages 385-392. The MIT Press, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 385-392
- Thrun, S.¹ Schwartz, A.²

15
- 33749244036
- Reusing old policies to accelerate learning on new MDPs
- D.S. Bernstein. Reusing old policies to accelerate learning on new MDPs. Technical Report UM-CS-1999-026, Department of Computer Science, University of Massachusetts at Amherst, April 1999.
- (1999) Technical Report UM-CS-1999-026, Department of Computer Science
- Bernstein, D.S.¹

16
- 0142121953
- Using options for knowledge transfer in reinforcement learning
- T.J. Perkins and D. Precup. Using options for knowledge transfer in reinforcement learning. Technical Report UM-CS-1999-034, Department of Computer Science, University of Massachusetts Amherst, 1999.
- (1999) Technical Report UM-CS-1999-034, Department of Computer Science
- Perkins, T.J.¹ Precup, D.²

17
- 14344250461
- PolicyBlocks: An algorithm for creating useful macro-actions in reinforcement learning
- M. Pickett and A.G. Barto. PolicyBlocks: An algorithm for creating useful macro-actions in reinforcement learning. In Proceedings of the 19th International Conference on Machine Learning, pages 506-513, 2002.
- (2002) Proceedings of the 19th International Conference on Machine Learning , pp. 506-513
- Pickett, M.¹ Barto, A.G.²

18
- 78751697580
- Autonomously learning an action hierarchy using a learned qualitative state representation
- J. Mugan and B. Kuipers. Autonomously learning an action hierarchy using a learned qualitative state representation. In Proceedings of the 21st International Joint Conference on Artificial Intelligence, 2009.
- (2009) Proceedings of the 21st International Joint Conference on Artificial Intelligence
- Mugan, J.¹ Kuipers, B.²

19
- 70049112930
- Learning complex motions by sequencing simpler motion templates
- G. Neumann, W. Maass, and J. Peters. Learning complex motions by sequencing simpler motion templates. In Proceedings of the 26th International Conference on Machine Learning, 2009.
- (2009) Proceedings of the 26th International Conference on Machine Learning
- Neumann, G.¹ Maass, W.² Peters, J.³

20
- 0032647341
- Sequential composition of dynamically dextrous robot behaviors
- R.R. Burridge, A.A. Rizzi, and D.E. Koditschek. Sequential composition of dynamically dextrous robot behaviors. International Journal of Robotics Research, 18(6):534-555, 1999.
- (1999) International Journal of Robotics Research , vol.18 , Issue.6 , pp. 534-555
- Burridge, R.R.¹ Rizzi, A.A.² Koditschek, D.E.³

21
- 77957761338
- LQR-trees: Feedback motion planning on sparse randomized trees
- R. Tedrake. LQR-Trees: Feedback motion planning on sparse randomized trees. In Proceedings of Robotics: Science and Systems, 2009.
- (2009) Proceedings of Robotics: Science and Systems
- Tedrake, R.¹

22
- 84880873347
- Building portable options: Skill transfer in reinforcement learning
- G.D. Konidaris and A.G. Barto. Building portable options: Skill transfer in reinforcement learning. In Proceedings of the 20th International Joint Conference on Artificial Intelligence, 2007.
- (2007) Proceedings of the 20th International Joint Conference on Artificial Intelligence
- Konidaris, G.D.¹ Barto, A.G.²

23
- 77956435931
- Value function approximation in reinforcement learning using the Fourier basis
- G.D. Konidaris and S. Osentoski. Value function approximation in reinforcement learning using the Fourier basis. Technical Report UM-CS-2008-19, Department of Computer Science, University of Massachusetts Amherst, June 2008.
- (2008) Technical Report UM-CS-2008-19, Department of Computer Science
- Konidaris, G.D.¹ Osentoski, S.²

24
- 70349322784
- Learning representation and control in Markov decision processes: New frontiers
- S. Mahadevan. Learning representation and control in Markov Decision Processes: New frontiers. Foundations and Trends in Machine Learning, 1(4):403-565, 2009.
- (2009) Foundations and Trends in Machine Learning , vol.1 , Issue.4 , pp. 403-565
- Mahadevan, S.¹

25
- 67650127165
- Sensorimotor abstraction selection for efficient, autonomous robot skill acquisition
- G.D. Konidaris and A.G. Barto. Sensorimotor abstraction selection for efficient, autonomous robot skill acquisition. In Proceedings of the 7th IEEE International Conference on Development and Learning, 2008.
- (2008) Proceedings of the 7th IEEE International Conference on Development and Learning
- Konidaris, G.D.¹ Barto, A.G.²

26
- 78751681641
- Efficient skill learning using abstraction selection
- G.D. Konidaris and A.G. Barto. Efficient skill learning using abstraction selection. In Proceedings of the 21st International Joint Conference on Artificial Intelligence, July 2009.
- (2009) Proceedings of the 21st International Joint Conference on Artificial Intelligence
- Konidaris, G.D.¹ Barto, A.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.