SCOPUS Information Retrieval Platform

IJCAI International Joint Conference on Artificial Intelligence

Volume , Issue , 2011, Pages 1565-1570

Bayesian policy search with policy priors

(5) Wingate, David (a); Goodman, Noah D. (b); Roy, Daniel M. (a); Kaelbling, Leslie P. (a); Tenenbaum, Joshua B. (a)

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

b STANFORD UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACT STRUCTURES; ACTION SEQUENCES; FINITE-STATE CONTROLLERS; MARKOV CHAIN MONTE-CARLO; MOTOR PRIMITIVES; OPTIMAL POLICIES; PRIMITIVE ACTIONS; RECURSIVE PROCESS;

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS;

INFERENCE ENGINES;

EID: 84881042664 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: 10.5591/978-1-57735-516-8/IJCAI11-263 Document Type: Conference Paper

Times cited : (31)

References (12)

1
- 70049104354
- Goal-directed decision making in prefrontal cortex: A computational framework
- Matthew Botvinick and James An. Goal-directed decision making in prefrontal cortex: a computational framework. In Advances in Neural Information Processing Systems (NIPS), 2008.
- (2008) Advances in Neural Information Processing Systems (NIPS)
- Botvinick, M.¹ An, J.²

2
- 84881060367
- Achieving Master Level in 9x9 Go
- Sylvain Gelly and David Silver. Achieving Master Level in 9x9 Go. In National Conference on Artificial Intelligence (AAAI), 2008.
- National Conference on Artificial Intelligence (AAAI), 2008
- Gelly, S.¹ Silver, D.²

3
- 84881068992
- Learning a theory of causality
- Noah D. Goodman, Tomer Ullman, and Joshua B. Tenenbaum. Learning a theory of causality. In Proceedings of the Thirty-First Annual Conference of the Cognitive Science Society, 2009.
- Proceedings of the Thirty-First Annual Conference of the Cognitive Science Society, 2009
- Goodman, N.D.¹ Ullman, T.² Tenenbaum, J.B.³

4
- 0003644124
- Dynamic Programming and Markov Processes
- Ronald A. Howard. Dynamic Programming and Markov Processes. The MIT Press, Cambridge, Massachusetts, 1960.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

5
- 0032073263
- Planning and acting in partially observable stochastic domains
- Leslie P. Kaelbling, Michael L. Littman, and Anthony R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

6
- 85161968592
- Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods
- Alessandro Lazaric, Marcello Restelli, and Andrea Bonarini. Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods. In Advances in Neural Information Processing Systems (NIPS), 2007.
- (2007) Advances in Neural Information Processing Systems (NIPS)
- Lazaric, A.¹ Restelli, M.² Bonarini, A.³

7
- 0036025698
- Poisson-Dirichlet and GEM invariant distributions for split-and-merge transformations of an interval partition
- Jim Pitman. Poisson-Dirichlet and GEM invariant distributions for split-and-merge transformations of an interval partition. Combinatorics, Probability and Computing, 11:501-514, 2002.
- (2002) Combinatorics, Probability and Computing , vol.11 , pp. 501-514
- Pitman, J.¹

8
- 84957069070
- Theoretical results on reinforcement learning with temporally abstract options
- Doina Precup, Richard S. Sutton, and Satinder Singh. Theoretical results on reinforcement learning with temporally abstract options. In European Conference on Machine Learning (ECML), pages 382-393, 1998.
- (1998) European Conference on Machine Learning (ECML) , pp. 382-393
- Precup, D.¹ Sutton, R.S.² Singh, S.³

9
- 33749249312
- Hierarchical Dirichlet processes
- Yee Whye Teh, Michael I. Jordan, Matthew J. Beal, and David M. Blei. Hierarchical Dirichlet processes. Journal of the American Statistical Association, 101:1566-1581, 2006.
- (2006) Journal of the American Statistical Association , vol.101 , pp. 1566-1581
- Teh, Y.W.¹ Jordan, M.I.² Beal, M.J.³ Blei, D.M.⁴

10
- 51349153274
- Probabilistic inference for solving (PO)MDPs
- Marc Toussaint, Stefan Harmeling, and Amos Storkey. Probabilistic inference for solving (PO)MDPs. Technical Report EDI-INF-RR-0934, University of Edinburgh, 2006.
- (2006) Probabilistic Inference for Solving (PO)MDPs
- Toussaint, M.¹ Harmeling, S.² Storkey, A.³

11
- 67349102783
- Hierarchical POMDP Controller Optimization by Likelihood Maximization
- Marc Toussaint, Laurent Charlin, and Pascal Poupart. Hierarchical POMDP Controller Optimization by Likelihood Maximization. In Uncertainty in Artificial Intelligence, 2008.
- (2008) Uncertainty in Artificial Intelligence
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

12
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- R. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.