SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

Volumn , Issue , 2011, Pages

Clustering via Dirichlet process mixture models for portable skill discovery

(2) Niekum, Scott a Barto, Andrew G a

a University of Massachusetts Amherst (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MIXTURES;

AGENT SPACE; CLUSTERINGS; DIRICHLET PROCESS MIXTURE MODEL; DISCOVERY ALGORITHM; REINFORCEMENT LEARNINGS; SINGLE STATE; STATE-SPACE; SUBGOALS; TERMINATION CONDITION;

REINFORCEMENT LEARNING;

EID: 85162360219 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (21)

References (22)

1
- 33845876447
- Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization
- Bram Bakker and Jürgen Schmidhuber. Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization. In Proc. of the 8th Conference on Intelligent Autonomous Systems, pages 438-445, 2004.
- (2004) Proc. of the 8th Conference on Intelligent Autonomous Systems , pp. 438-445
- Bakker, B.¹ Schmidhuber, J.²

2
- 33749651693
- Intrinsically motivated learning of hierarchical collections of skills
- A. G. Barto, S. Singh, and N. Chentanez. Intrinsically motivated learning of hierarchical collections of skills. In Proc. of the International Conference on Developmental Learning, pages 112-119, 2004.
- (2004) Proc. of the International Conference on Developmental Learning , pp. 112-119
- Barto, A.G.¹ Singh, S.² Chentanez, N.³

3
- 0004782095
- Learning hierarchical control structures for multiple tasks and changing environments
- MIT Press
- Bruce L. Digney. Learning hierarchical control structures for multiple tasks and changing environments. In Proc. of the 5th Conference on the Simulation of Adaptive Behavior. MIT Press, 1998.
- (1998) Proc. of the 5th Conference on the Simulation of Adaptive Behavior
- Digney, B.L.¹

4
- 0000324169
- Adaptive rejection sampling for gibbs sampling
- W. R. Gilks and P. Wild. Adaptive Rejection Sampling for Gibbs Sampling. Journal of the Royal Statistical Society, Series C, 41(2):337-348, 1992.
- (1992) Journal of the Royal Statistical Society, Series C , vol.41 , Issue.2 , pp. 337-348
- Gilks, W.R.¹ Wild, P.²

5
- 58049128403
- Npclu: An approach for clustering spatially extended objects
- December
- M. Halkidi and M. Vazirgiannis. Npclu: An approach for clustering spatially extended objects. Intell. Data Anal., 12:587-606, December 2008.
- (2008) Intell. Data Anal. , vol.12 , pp. 587-606
- Halkidi, M.¹ Vazirgiannis, M.²

6
- 52649148744
- Self-optimizing memory controllers: A reinforcement learning approach
- Engin Ipek, Onur Mutlu, Jose F. Martinez, and Rich Caruana. Self-optimizing memory controllers: A reinforcement learning approach. Computer Architecture, International Symposium on, 0:39-50, 2008.
- (2008) Computer Architecture International Symposium on , pp. 39-50
- Ipek, E.¹ Mutlu, O.² Martinez, J.F.³ Caruana, R.⁴

7
- 33750705246
- Causal graph based decomposition of factored mdps
- December
- Anders Jonsson and Andrew Barto. Causal graph based decomposition of factored mdps. J. Mach. Learn. Res., 7:2259-2301, December 2006.
- (2006) J. Mach. Learn. Res. , vol.7 , pp. 2259-2301
- Jonsson, A.¹ Barto, A.²

8
- 80055028007
- Value function approximation in reinforcement learning using the fourier basis
- G.D. Konidaris, S. Osentoski, and P.S. Thomas. Value function approximation in reinforcement learning using the fourier basis. In Proceedings of the Twenty-Fifth Conference on Artificial Intelligence, 2011.
- (2011) Proceedings of the Twenty-Fifth Conference on Artificial Intelligence
- Konidaris, G.D.¹ Osentoski, S.² Thomas, P.S.³

9
- 84880873347
- Building portable options: Skill transfer in reinforcement learning
- George Konidaris and Andrew G. Barto. Building portable options: Skill transfer in reinforcement learning. In Proc. of the 20th International Joint Conference on Artificial Intelligence, pages 895-900, 2007.
- (2007) Proc. of the 20th International Joint Conference on Artificial Intelligence , pp. 895-900
- Konidaris, G.¹ Barto, A.G.²

10
- 80055032021
- Skill discovery in continuous reinforcement learning domains using skill chaining
- George Konidaris and Andrew G. Barto. Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pages 1015-1023, 2009.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1015-1023
- Konidaris, G.¹ Barto, A.G.²

11
- 0013465187
- Automatic discovery of subgoals in reinforcement learning using diverse density
- Amy McGovern and Andrew G. Barto. Automatic discovery of subgoals in reinforcement learning using diverse density. In ICML, pages 361-368, 2001.
- (2001) ICML , pp. 361-368
- McGovern, A.¹ Barto, A.G.²

12
- 77958566186
- Reinforcement learning for closed-loop propofol anesthesia: A human volunteer study
- Brett Moore, Periklis Panousis, Vivek Kulkarni, Larry Pyeatt, and Anthony Doufas. Reinforcement learning for closed-loop propofol anesthesia: A human volunteer study. In Innovative Applications of Artificial Intelligence, 2010.
- (2010) Innovative Applications of Artificial Intelligence
- Moore, B.¹ Panousis, P.² Kulkarni, V.³ Pyeatt, L.⁴ Doufas, A.⁵

13
- 77950032550
- Markov chain sampling methods for Dirichlet process mixture models
- R.M. Neal. Markov chain sampling methods for Dirichlet process mixture models. Journal of computational and graphical statistics, 9(2):249-265, 2000.
- (2000) Journal of Computational and Graphical Statistics , vol.9 , Issue.2 , pp. 249-265
- Neal, R.M.¹

14
- 0041875229
- On spectral clustering: Analysis and an algorithm
- MIT Press
- Andrew Y. Ng, Michael I. Jordan, and Yair Weiss. On spectral clustering: Analysis and an algorithm. In Advances in Neural Information Processing Systems, pages 849-856. MIT Press, 2001.
- (2001) Advances in Neural Information Processing Systems , pp. 849-856
- Ng, A.Y.¹ Jordan, M.I.² Weiss, Y.³

15
- 14344250461
- Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning
- Marc Pickett and Andrew G. Barto. Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning. In ICML, pages 506-513, 2002.
- (2002) ICML , pp. 506-513
- Pickett, M.¹ Barto, A.G.²

16
- 79955803023
- The infinite Gaussian mixture model
- MIT Press
- Carl Edward Rasmussen. The infinite Gaussian mixture model. In Advances in Neural Information Processing Systems 12, pages 554-560. MIT Press, 2000.
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 554-560
- Rasmussen, C.E.¹

17
- 14344261491
- Using relative novelty to identify useful temporal abstractions in reinforcement learning
- Özgür Ş imşek and Andrew G. Barto. Using relative novelty to identify useful temporal abstractions in reinforcement learning. In Proc. of the Twenty-First International Conference on Machine Learning, pages 751-758, 2004.
- (2004) Proc. of the Twenty-First International Conference on Machine Learning , pp. 751-758
- Şimşek, O.¹ Barto, A.G.²

18
- 78651097494
- Skill characterization based on betweenness
- Özgür Ş imşek and Andrew G. Barto. Skill characterization based on betweenness. In NIPS, pages 1497-1504, 2008.
- (2008) NIPS , pp. 1497-1504
- Şimşek, O.¹ Barto, A.G.²

19
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Richard Sutton, Doina Precup, and Satinder Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.¹ Precup, D.² Singh, S.³

20
- 0004102479
- MIT Press
- Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

21
- 33749882712
- Finding structure in reinforcement learning
- MIT Press
- Sebastian Thrun and Anton Schwartz. Finding structure in reinforcement learning. In Advances in Neural Information Processing Systems 7, pages 385-392. MIT Press, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 385-392
- Thrun, S.¹ Schwartz, A.²

22
- 34547994508
- Multi-task reinforcement learning: A hierarchical bayesian approach
- ACM Press
- Aaron Wilson, Alan Fern, Soumya Ray, and Prasad Tadepalli. Multi-task reinforcement learning: A hierarchical bayesian approach. In In: ICML 07: Proceedings of the 24th international conference on Machine learning, page 1015. ACM Press, 2007.
- (2007) ICML 07: Proceedings of the 24th International Conference on Machine Learning , pp. 1015
- Wilson, A.¹ Fern, A.² Ray, S.³ Tadepalli, P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.