메뉴 건너뛰기




Volumn , Issue , 2011, Pages

Clustering via Dirichlet process mixture models for portable skill discovery

Author keywords

[No Author keywords available]

Indexed keywords

MIXTURES;

EID: 85162360219     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (21)

References (22)
  • 5
    • 58049128403 scopus 로고    scopus 로고
    • Npclu: An approach for clustering spatially extended objects
    • December
    • M. Halkidi and M. Vazirgiannis. Npclu: An approach for clustering spatially extended objects. Intell. Data Anal., 12:587-606, December 2008.
    • (2008) Intell. Data Anal. , vol.12 , pp. 587-606
    • Halkidi, M.1    Vazirgiannis, M.2
  • 7
    • 33750705246 scopus 로고    scopus 로고
    • Causal graph based decomposition of factored mdps
    • December
    • Anders Jonsson and Andrew Barto. Causal graph based decomposition of factored mdps. J. Mach. Learn. Res., 7:2259-2301, December 2006.
    • (2006) J. Mach. Learn. Res. , vol.7 , pp. 2259-2301
    • Jonsson, A.1    Barto, A.2
  • 10
    • 80055032021 scopus 로고    scopus 로고
    • Skill discovery in continuous reinforcement learning domains using skill chaining
    • George Konidaris and Andrew G. Barto. Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pages 1015-1023, 2009.
    • (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1015-1023
    • Konidaris, G.1    Barto, A.G.2
  • 11
    • 0013465187 scopus 로고    scopus 로고
    • Automatic discovery of subgoals in reinforcement learning using diverse density
    • Amy McGovern and Andrew G. Barto. Automatic discovery of subgoals in reinforcement learning using diverse density. In ICML, pages 361-368, 2001.
    • (2001) ICML , pp. 361-368
    • McGovern, A.1    Barto, A.G.2
  • 13
    • 77950032550 scopus 로고    scopus 로고
    • Markov chain sampling methods for Dirichlet process mixture models
    • R.M. Neal. Markov chain sampling methods for Dirichlet process mixture models. Journal of computational and graphical statistics, 9(2):249-265, 2000.
    • (2000) Journal of Computational and Graphical Statistics , vol.9 , Issue.2 , pp. 249-265
    • Neal, R.M.1
  • 15
    • 14344250461 scopus 로고    scopus 로고
    • Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning
    • Marc Pickett and Andrew G. Barto. Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning. In ICML, pages 506-513, 2002.
    • (2002) ICML , pp. 506-513
    • Pickett, M.1    Barto, A.G.2
  • 18
    • 78651097494 scopus 로고    scopus 로고
    • Skill characterization based on betweenness
    • Özgür Ş imşek and Andrew G. Barto. Skill characterization based on betweenness. In NIPS, pages 1497-1504, 2008.
    • (2008) NIPS , pp. 1497-1504
    • Şimşek, O.1    Barto, A.G.2
  • 19
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Richard Sutton, Doina Precup, and Satinder Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.