



2011, Pages 1565-1570

Bayesian policy search with policy priors

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACT STRUCTURES; ACTION SEQUENCES; FINITE-STATE CONTROLLERS; MARKOV CHAIN MONTE-CARLO; MOTOR PRIMITIVES; OPTIMAL POLICIES; PRIMITIVE ACTIONS; RECURSIVE PROCESS

EID: 84881042664     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5591/978-1-57735-516-8/IJCAI11-263     Document Type: Conference Paper
Times cited: 31

References (12)
  • 5
    • 0032073263
    • Planning and acting in partially observable stochastic domains
    • Leslie P. Kaelbling, Michael L. Littman, and Anthony R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 7
    • 0036025698
    • Poisson-Dirichlet and GEM invariant distributions for split-and-merge transformations of an interval partition
    • Jim Pitman. Poisson-Dirichlet and GEM invariant distributions for split-and-merge transformations of an interval partition. Combinatorics, Probability and Computing, 11:501-514, 2002.
    • (2002) Combinatorics, Probability and Computing , vol.11 , pp. 501-514
    • Pitman, J.1
  • 8
    • 84957069070
    • Theoretical results on reinforcement learning with temporally abstract options
    • Doina Precup, Richard S. Sutton, and Satinder Singh. Theoretical results on reinforcement learning with temporally abstract options. In European Conference on Machine Learning (ECML), pages 382-393, 1998.
    • (1998) European Conference on Machine Learning (ECML) , pp. 382-393
    • Precup, D.1    Sutton, R.S.2    Singh, S.3
  • 12
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.