SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010

Volumn , Issue , 2010, Pages

Constructing skill trees for reinforcement learning agents from demonstration trajectories

(4) Konidaris, George a Kuindersmay, Scott a Barto, Andrew a Grupen, Roderic a

a Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; CHAINS; DEMONSTRATIONS; FORESTRY; MANIPULATORS; REINFORCEMENT LEARNING; TREES (MATHEMATICS);

CHANGE POINT DETECTION; CONTINUOUS DOMAIN; CONTINUOUS REINFORCEMENT; DETECTION METHODS; MOBILE MANIPULATOR; REINFORCEMENT LEARNING AGENT; REINFORCEMENT LEARNINGS;

TRAJECTORIES;

FORESTRY; MATHEMATICS; TREES;

EID: 85162033542 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (63)

References (26)

1
- 0037288370
- Recent advances in hierarchical reinforcement learning
- Special Issue on Reinforcement Learning
- A.G. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:41-77, 2003. Special Issue on Reinforcement Learning.
- (2003) Discrete Event Dynamic Systems , vol.13 , pp. 41-77
- Barto, A.G.¹ Mahadevan, S.²

2
- 0004102479
- MIT Press, Cambridge, MA
- R.S. Sutton and A.G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

3
- 80055032021
- Skill discovery in continuous reinforcement learning domains using skill chaining
- G.D. Konidaris and A.G. Barto. Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pages 1015-1023, 2009.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1015-1023
- Konidaris, G.D.¹ Barto, A.G.²

4
- 78751681641
- Efficient skill learning using abstraction selection
- July
- G.D. Konidaris and A.G. Barto. Efficient skill learning using abstraction selection. In Proceedings of the Twenty First International Joint Conference on Artificial Intelligence, July 2009.
- (2009) Proceedings of the Twenty First International Joint Conference on Artificial Intelligence
- Konidaris, G.D.¹ Barto, A.G.²

5
- 63149159130
- A survey of robot learning from demonstration
- B. Argall, S. Chernova, M. Veloso, and B. Browning. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57:469-483, 2009.
- (2009) Robotics and Autonomous Systems , vol.57 , pp. 469-483
- Argall, B.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

6
- 0031343489
- A feedback control structure for on-line learning tasks
- M. Huber and R.A. Grupen. A feedback control structure for on-line learning tasks. Robotics and Autonomous Systems, 22(3-4):303-315, 1997.
- (1997) Robotics and Autonomous Systems , vol.22 , Issue.3-4 , pp. 303-315
- Huber, M.¹ Grupen, R.A.²

7
- 84979715630
- Supervised actor-critic reinforcement learning
- J. Si, A.G. Barto, A. Powell, and D. Wunsch, editors. John Wiley & Sons, Inc., New York
- M. Rosenstein and A.G. Barto. Supervised actor-critic reinforcement learning. In J. Si, A.G. Barto, A. Powell, and D. Wunsch, editors, Learning and Approximate Dynamic Programming: Scaling up the Real World, pages 359-380. John Wiley & Sons, Inc., New York, 2004.
- (2004) Learning and Approximate Dynamic Programming: Scaling Up the Real World , pp. 359-380
- Rosenstein, M.¹ Barto, A.G.²

8
- 34547837455
- On-line inference for multiple changepoint problems
- P. Fearnhead and Z. Liu. On-line inference for multiple changepoint problems. Journal of the Royal Statistical Society B, 69:589-605, 2007.
- (2007) Journal of the Royal Statistical Society B , vol.69 , pp. 589-605
- Fearnhead, P.¹ Liu, Z.²

9
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R.S. Sutton, D. Precup, and S.P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

10
- 0021390267
- Automatic synthesis of fine-motion strategies for robots
- T. Lozano-Perez, M.T. Mason, and R.H. Taylor. Automatic synthesis of fine-motion strategies for robots. The International Journal of Robotics Research, 3(1):3-24, 1984.
- (1984) The International Journal of Robotics Research , vol.3 , Issue.1 , pp. 3-24
- Lozano-Perez, T.¹ Mason, M.T.² Taylor, R.H.³

11
- 0032647341
- Sequential composition of dynamically dextrous robot behaviors
- R.R. Burridge, A.A. Rizzi, and D.E. Koditschek. Sequential composition of dynamically dextrous robot behaviors. International Journal of Robotics Research, 18(6):534-555, 1999.
- (1999) International Journal of Robotics Research , vol.18 , Issue.6 , pp. 534-555
- Burridge, R.R.¹ Rizzi, A.A.² Koditschek, D.E.³

12
- 5844409947
- Adaptive targeting of chaos
- S. Boccaletti, A. Farini, E.J. Kostelich, and F.T. Arecchi. Adaptive targeting of chaos. Physical Review E, 55(5):4845-4848, 1997.
- (1997) Physical Review e , vol.55 , Issue.5 , pp. 4845-4848
- Boccaletti, S.¹ Farini, A.² Kostelich, E.J.³ Arecchi, F.T.⁴

13
- 77956435931
- Value function approximation in reinforcement learning using the Fourier basis
- University of Massachusetts Amherst, June
- G.D. Konidaris and S. Osentoski. Value function approximation in reinforcement learning using the Fourier basis. Technical Report UM-CS-2008-19, Department of Computer Science, University of Massachusetts Amherst, June 2008.
- (2008) Technical Report UM-CS-2008-19, Department of Computer Science
- Konidaris, G.D.¹ Osentoski, S.²

14
- 56449130136
- Automatic discovery and transfer of MAXQ hierarchies
- N. Mehta, S. Ray, P. Tadepalli, and T. Dietterich. Automatic discovery and transfer of MAXQ hierarchies. In Proceedings of the Twenty Fifth International Conference on Machine Learning, pages 648-655, 2008.
- (2008) Proceedings of the Twenty Fifth International Conference on Machine Learning , pp. 648-655
- Mehta, N.¹ Ray, S.² Tadepalli, P.³ Dietterich, T.⁴

15
- 78751697580
- Autonomously learning an action hierarchy using a learned qualitative state representation
- J. Mugan and B. Kuipers. Autonomously learning an action hierarchy using a learned qualitative state representation. In Proceedings of the 21st International Joint Conference on Artificial Intelligence, 2009.
- (2009) Proceedings of the 21st International Joint Conference on Artificial Intelligence
- Mugan, J.¹ Kuipers, B.²

16
- 70049112930
- Learning complex motions by sequencing simpler motion templates
- G. Neumann, W. Maass, and J. Peters. Learning complex motions by sequencing simpler motion templates. In Proceedings of the 26th International Conference on Machine Learning, 2009.
- (2009) Proceedings of the 26th International Conference on Machine Learning
- Neumann, G.¹ Maass, W.² Peters, J.³

17
- 77957761338
- LQR-Trees: Feedback motion planning on sparse randomized trees
- R. Tedrake. LQR-Trees: Feedback motion planning on sparse randomized trees. In Proceedings of Robotics: Science and Systems, pages 18-24, 2009.
- (2009) Proceedings of Robotics: Science and Systems , pp. 18-24
- Tedrake, R.¹

18
- 77956512686
- Modeling changing dependency structure in multivariate time series
- X. Xuan and K. Murphy. Modeling changing dependency structure in multivariate time series. In Proceedings of the Twenty-Fourth International Conference on Machine Learning, 2007.
- (2007) Proceedings of the Twenty-Fourth International Conference on Machine Learning
- Xuan, X.¹ Murphy, K.²

19
- 84858766922
- Nonparametric Bayesian learning of switching linear dynamical systems
- E.B. Fox, E.B. Sudderth, M.I. Jordan, and A.S. Willsky. Nonparametric Bayesian learning of switching linear dynamical systems. In Advances in Neural Information Processing Systems 21, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.21
- Fox, E.B.¹ Sudderth, E.B.² Jordan, M.I.³ Willsky, A.S.⁴

20
- 17144391260
- Performance-derived behavior vocabularies: Data-driven acquisition of skills from motion
- O.C. Jenkins and M. Matarić. Performance-derived behavior vocabularies: data-driven acquisition of skills from motion. International Journal of Humanoid Robotics, 1(2):237-288, 2004.
- (2004) International Journal of Humanoid Robotics , vol.1 , Issue.2 , pp. 237-288
- Jenkins, O.C.¹ Matarić, M.²

21
- 78651471279
- Incremental learning of subtasks from unsegmented demonstration
- D.H. Grollman and O.C. Jenkins. Incremental learning of subtasks from unsegmented demonstration. In International Conference on Intelligent Robots and Systems, 2010.
- (2010) International Conference on Intelligent Robots and Systems
- Grollman, D.H.¹ Jenkins, O.C.²

22
- 79851473084
- Learning from demonstration using a multi-valued function regressor for time-series data
- J. Butterfield, S. Osentoski, G. Jay, and O.C. Jenkins. Learning from demonstration using a multi-valued function regressor for time-series data. In Proceedings of the Tenth IEEE-RAS International Conference on Humanoid Robots, 2010.
- (2010) Proceedings of the Tenth IEEE-RAS International Conference on Humanoid Robots
- Butterfield, J.¹ Osentoski, S.² Jay, G.³ Jenkins, O.C.⁴

23
- 40649106649
- Natural actor-critic
- J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
- (2008) Neurocomputing , vol.71 , Issue.7-9 , pp. 1180-1190
- Peters, J.¹ Schaal, S.²

24
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- P. Abbeel and A.Y. Ng. Apprenticeship learning via inverse reinforcement learning. In Proceedings of the 21st International Conference on Machine Learning, 2004.
- (2004) Proceedings of the 21st International Conference on Machine Learning
- Abbeel, P.¹ Ng, A.Y.²

25
- 60349110367
- Confidence-based policy learning from demonstration using Gaussian mixture models
- S. Chernova and M. Veloso. Confidence-based policy learning from demonstration using Gaussian mixture models. In Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems, 2007.
- (2007) Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems
- Chernova, S.¹ Veloso, M.²

26
- 84880873347
- Building portable options: Skill transfer in reinforcement learning
- G.D. Konidaris and A.G. Barto. Building portable options: Skill transfer in reinforcement learning. In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, 2007.
- (2007) Proceedings of the Twentieth International Joint Conference on Artificial Intelligence
- Konidaris, G.D.¹ Barto, A.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.