SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 20 - Proceedings of the 2007 Conference

Volumn , Issue , 2008, Pages

Bayes-adaptive POMDPs

(3) Ross, Stéphane a Chaib Draa, Brahim b Pineau, Joelle a

a MCGILL UNIVERSITY (Canada)

b UNIVERSITÉ LAVAL (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ECONOMIC AND SOCIAL EFFECTS; ESTIMATION; REINFORCEMENT LEARNING;

BAYESIAN REINFORCEMENT LEARNING; EXPLORATION/EXPLOITATION; HIDDEN VARIABLE; MARKOV DECISION PROCESSES; MODEL ESTIMATES; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; PARTIALLY OBSERVABLE MDP; PROCESS FRAMEWORK; REINFORCEMENT LEARNINGS; TRADE OFF;

MARKOV PROCESSES;

EID: 85162018872 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (108)

References (12)

1
- 1142281527
- Model based bayesian exploration
- R. Dearden, N. Friedman, and N. Andre. Model based bayesian exploration. In UAI, 1999.
- (1999) UAI
- Dearden, R.¹ Friedman, N.² Andre, N.³

2
- 1942450858
- PhD thesis, University of Massachusetts, Amherst, USA
- M. Duff. Optimal Learning: Computational Procedure for Bayes-Adaptive Markov Decision Processes. PhD thesis, University of Massachusetts, Amherst, USA, 2002.
- (2002) Optimal Learning: Computational Procedure for Bayes-Adaptive Markov Decision Processes
- Duff, M.¹

3
- 33749251297
- An analytic solution to discrete bayesian reinforcement learning
- P. Poupart, N. Vlassis, J. Hoey, and K. Regan. An analytic solution to discrete bayesian reinforcement learning. In Proc. ICML, 2006.
- (2006) Proc. ICML
- Poupart, P.¹ Vlassis, N.² Hoey, J.³ Regan, K.⁴

4
- 39649090194
- Active learning in partially observable markov decision processes
- R. Jaulmes, J. Pineau, and D. Precup. Active learning in partially observable markov decision processes. In ECML, 2005.
- (2005) ECML
- Jaulmes, R.¹ Pineau, J.² Precup, D.³

5
- 0032073263
- Planning and acting in partially observable stochastic domains
- PII S000437029800023X
- L. P. Kaelbling,M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998. (Pubitemid 128387390)
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

6
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Acapulco, Mexico
- J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: an anytime algorithm for POMDPs. In IJCAI, pages 1025-1032, Acapulco, Mexico, 2003.
- (2003) IJCAI , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

7
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- M. Spaan and N. Vlassis. Perseus: randomized point-based value iteration for POMDPs. JAIR, 24:195-220, 2005. (Pubitemid 43130936)
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.²

8
- 31144465830
- Heuristic search value iteration for POMDPs
- Banff, Canada
- T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In UAI, Banff, Canada, 2004.
- (2004) UAI
- Smith, T.¹ Simmons, R.²

9
- 44649095768
- An online POMDP algorithm for complex multiagent environments
- S. Paquet, L. Tobin, and B. Chaib-draa. An online POMDP algorithm for complex multiagent environments. In AAMAS, 2005.
- (2005) AAMAS
- Paquet, S.¹ Tobin, L.² Chaib-Draa, B.³

10
- 0013535965
- Infinite-horizon policy-gradient estimation
- Jonathan Baxter and Peter L. Bartlett. Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research (JAIR), 15:319-350, 2001.
- (2001) Journal of Artificial Intelligence Research (JAIR) , vol.15 , pp. 319-350
- Baxter, J.¹ Bartlett, P.L.²

11
- 84858778653
- Technical Report SOCSTR-2007.6 McGill University
- Stéphane Ross, Brahim Chaib-draa, and Joelle Pineau. Bayes-adaptive pomdps. Technical Report SOCSTR-2007.6, McGill University, 2007.
- (2007) Bayes-adaptive Pomdps
- Ross, S.¹ Chaib-Draa, B.² Pineau, J.³

12
- 0003665481
- Springer
- A. Doucet, N. de Freitas, and N. Gordon. Sequential Monte Carlo Methods In Practice. Springer, 2001.
- (2001) Sequential Monte Carlo Methods in Practice
- Doucet, A.¹ De Freitas, N.² Gordon, N.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.