SCOPUS 정보 검색 플랫폼

Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence, UAI 2009

Volumn , Issue , 2009, Pages 223-231

New inference strategies for solving Markov Decision Processes using reversible jump MCMC

(4) Hoffman, Matt a Kueck, Hendrik a De Freitas, Nando a Doucet, Arnaud a

a UNIVERSITY OF BRITISH COLUMBIA (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; MONTE CARLO METHODS;

HIGHER-DIMENSIONAL; MARKOV CHAIN MONTE CARLO METHOD; MARKOV DECISION PROCESSES; OPTIMAL POLICIES; PARAMETERIZED CONTROL; REVERSIBLE JUMP MCMC; STRONG CORRELATION;

MARKOV CHAINS;

EID: 78751705157 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (21)

References (20)

1
- 33749242151
- Planning by probabilistic inference
- H. Attias. Planning by probabilistic inference. In UAI, 2003.
- (2003) UAI
- Attias, H.¹

2
- 0029357425
- Mean shift, mode seeking, and clustering
- Y. Cheng. Mean shift, mode seeking, and clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(8):790-799, 1995.
- (1995) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.17 , Issue.8 , pp. 790-799
- Cheng, Y.¹

3
- 0346982426
- Using em for reinforcement learning
- P. Dayan and G. Hinton. Using EM for reinforcement learning. Neural Computation, 9:271-278, 1997.
- (1997) Neural Computation , vol.9 , pp. 271-278
- Dayan, P.¹ Hinton, G.²

4
- 80053145677
- Technical Report CUED-F-INFENG 444 Cambridge University Engineering Department
- A. Doucet and V. Tadic. On solving integral equations using Markov Chain Monte Carlo methods. Technical Report CUED-F-INFENG 444, Cambridge University Engineering Department, 2004.
- (2004) On Solving Integral Equations Using Markov Chain Monte Carlo Methods
- Doucet, A.¹ Tadic, V.²

5
- 0141567816
- Marginal maximum a posteriori estimation using Markov chain Monte Carlo
- DOI 10.1023/A:1013172322619
- A. Doucet, S. Godsill, and C. Robert. Marginal maximum a posteriori estimation using Markov chain Monte Carlo. Statistics and Computing, 12(1):77-84, 2002. (Pubitemid 37132839)
- (2002) Statistics and Computing , vol.12 , Issue.1 , pp. 77-84
- Doucet, A.¹ Godsill, S.J.² Robert, C.P.³

6
- 77956889087
- Reversible jump Markov Chain Monte Carlo computation and Bayesian model determination
- P. Green. Reversible jump Markov Chain Monte Carlo computation and Bayesian model determination. Biometrika, 82(4):711-732, 1995.
- (1995) Biometrika , vol.82 , Issue.4 , pp. 711-732
- Green, P.¹

7
- 70350090880
- Bayesian policy learning with trans-dimensional MCMC
- M. Hoffman, A. Doucet, N. de Freitas, and A. Jasra. Bayesian policy learning with trans-dimensional MCMC. In NIPS, 2007a.
- (2007) NIPS
- Hoffman, M.¹ Doucet, A.² De Freitas, N.³ Jasra, A.⁴

8
- 68749104962
- Technical Report TR-2007-04, University of British Columbia, Computer Science
- M. Hoffman, A. Doucet, N. de Freitas, and A. Jasra. On solving general state-space sequential decision problems using inference algorithms. Technical Report TR-2007-04, University of British Columbia, Computer Science, 2007b.
- (2007) On Solving General State-space Sequential Decision Problems Using Inference Algorithms
- Hoffman, M.¹ Doucet, A.² De Freitas, N.³ Jasra, A.⁴

9
- 84867572211
- An expectation maximization algorithm for continuous Markov Decision Processes with arbitrary reward
- M. Hoffman, N. de Freitas, A. Doucet, and J. Peters. An expectation maximization algorithm for continuous Markov Decision Processes with arbitrary reward. In AI-STATS, 2009.
- (2009) AI-STATS
- Hoffman, M.¹ De Freitas, N.² Doucet, A.³ Peters, J.⁴

10
- 80053157514
- Policy search for motor primitives in robotics
- P. Müller. Simulation based optimal design. In Bayesian Statistics 6
- J. Kober and J. Peters. Policy search for motor primitives in robotics. In NIPS, 2008. P. Müller. Simulation based optimal design. In Bayesian Statistics 6, 1998.
- (1998) NIPS 2008
- Kober, J.¹ Peters, J.²

11
- 4944254628
- Optimal bayesian design by inhomogeneous Markov chain simulation
- DOI 10.1198/016214504000001123
- P.Müller, B. Sansó, and M. de Iorio. Optimal Bayesian design by inhomogeneous Markov chain simulation. Journal of the American Statistical Association, 99: 788-798, 2004. (Pubitemid 39332860)
- (2004) Journal of the American Statistical Association , vol.99 , Issue.467 , pp. 788-798
- Muller, P.¹ Sanso, B.² De Iorio, M.³

12
- 0141819580
- PEGASUS: A policy search method for large MDPs and POMDPs
- A. Ng and M. Jordan. PEGASUS: A policy search method for large MDPs and POMDPs. In UAI, pages 406-415, 2000.
- (2000) UAI , pp. 406-415
- Ng, A.¹ Jordan, M.²

13
- 2442627902
- Noncentered parameterisations for hierarchical models and data augmentation
- O. Papaspiliopoulos, G. Roberts, and M. Sköld. Noncentered parameterisations for hierarchical models and data augmentation. Bayesian Statistics, 7, 2003.
- (2003) Bayesian Statistics , vol.7
- Papaspiliopoulos, O.¹ Roberts, G.² Sköld, M.³

14
- 36348971133
- Reinforcement learning for operational space control
- J. Peters and S. Schaal. Reinforcement learning for operational space control. In ICRA, 2007.
- (2007) ICRA
- Peters, J.¹ Schaal, S.²

15
- 0013025914
- Wiley-Interscience
- J. Spall. Introduction to stochastic search and optimization: estimation, simulation, and control. Wiley-Interscience, 2005.
- (2005) Introduction to Stochastic Search and Optimization: Estimation, Simulation, and Control
- Spall, J.¹

16
- 33749234798
- Probabilistic inference for solving discrete and continuous state Markov decision processes
- M. Toussaint and A. Storkey. Probabilistic inference for solving discrete and continuous state Markov Decision Processes. In ICML, 2006.
- (2006) ICML
- Toussaint, M.¹ Storkey, A.²

17
- 51349153274
- Technical Report EDI-INF-RR-0934, University of Edinburgh, School of Informatics
- M. Toussaint, S. Harmeling, and A. Storkey. Probabilistic inference for solving (PO)MDPs. Technical Report EDI-INF-RR-0934, University of Edinburgh, School of Informatics, 2006.
- (2006) Probabilistic Inference for Solving (PO)MDPs
- Toussaint, M.¹ Harmeling, S.² Storkey, A.³

18
- 67349102783
- Hierarchical POMDP controller optimization by likelihood maximization
- M. Toussaint, L. Charlin, and P. Poupart. Hierarchical POMDP controller optimization by likelihood maximization. In UAI, pages 562-570, 2008.
- (2008) UAI , pp. 562-570
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

19
- 34250613841
- Planning and acting in uncertain environments using probabilistic inference
- D. Verma and R. Rao. Planning and acting in uncertain environments using probabilistic inference. In IROS, 2006.
- (2006) IROS
- Verma, D.¹ Rao, R.²

20
- 67049132520
- Planning and moving in dynamic environments: A statistical machine learning approach
- Sendhoff, Koerner, Sporns, Ritter, and Doya, editors, LNAI. Springer-Verlag
- S. Vijayakumar, M. Toussaint, G. Petkos, and M. Howard. Planning and moving in dynamic environments: A statistical machine learning approach. In Sendhoff, Koerner, Sporns, Ritter, and Doya, editors, Creating Brain Like Intelligence: From Principles to Complex Intelligent Systems, LNAI-Vol. 5436. Springer-Verlag, 2009.
- (2009) Creating Brain Like Intelligence: From Principles to Complex Intelligent Systems , vol.5436
- Vijayakumar, S.¹ Toussaint, M.² Petkos, G.³ Howard, M.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.