SCOPUS Information Search Platform

Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference

2009, Pages 198-206

Learning to explore and exploit in POMDPs

Cai, Chenghui; Liao, Xuejun; Carin, Lawrence

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE LEARNING; AGENT BEHAVIOR; ASSOCIATED COSTS; EXPLORATION AND EXPLOITATION; EXPLORATION/EXPLOITATION; OPTIMAL ACTIONS; PARTIALLY OBSERVABLE ENVIRONMENTS; REINFORCEMENT LEARNINGS; SPECIFIC PROBLEMS; THEORETICAL GUARANTEES;

REINFORCEMENT LEARNING;

EID: 80055053199
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 24

References (10)

1
- 3543081155
- PhD thesis, Gatsby Computational Neuroscience Unit, University College London
- M. J. Beal. Variational Algorithms for Approximate Bayesian Inference. PhD thesis, Gatsby Computational Neuroscience Unit, University College London, 2003.
- (2003) Variational Algorithms for Approximate Bayesian Inference
- Beal, M.J.¹

2
- 0041965975
- R-max - A general polynomial time algorithm for near-optimal reinforcement learning
- R. I. Brafman and M. Tennenholtz. R-max - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3(OCT):213-231, 2002.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

3
- 56449086386
- Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs
- F. Doshi, J. Pineau, and N. Roy. Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs. In Proceedings of the 25th International Conference on Machine Learning, pages 256-263. ACM, 2008.
- (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 256-263
- Doshi, F.¹ Pineau, J.² Roy, N.³

4
- 84880677563
- Efficient reinforcement learning in factored MDPs
- M. Kearns and D. Koller. Efficient reinforcement learning in factored MDPs. In Proc. of the Sixteenth International Joint Conference on Artificial Intelligence, pages 740-747, 1999.
- (1999) Proc. of the Sixteenth International Joint Conference on Artificial Intelligence , pp. 740-747
- Kearns, M.¹ Koller, D.²

5
- 0012257655
- Near-optimal performance for reinforcement learning in polynomial time
- M. Kearns and S. P. Singh. Near-optimal performance for reinforcement learning in polynomial time. In Proc. ICML, pages 260-268, 1998.
- (1998) Proc. ICML , pp. 260-268
- Kearns, M.¹ Singh, S.P.²

6
- 66849131425
- Multi-task reinforcement learning in partially observable stochastic environments
- H. Li, X. Liao, and L. Carin. Multi-task reinforcement learning in partially observable stochastic environments. Journal of Machine Learning Research, 10:1131-1186, 2009.
- (2009) Journal of Machine Learning Research , vol.10 , pp. 1131-1186
- Li, H.¹ Liao, X.² Carin, L.³

7
- 85138579181
- Learning policies for partially observable environments: Scaling up
- M. L. Littman, A. R. Cassandra, and L. P. Kaelbling. Learning policies for partially observable environments: Scaling up. In ICML, 1995.
- (1995) ICML
- Littman, M.L.¹ Cassandra, A.R.² Kaelbling, L.P.³

8
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI), pages 1025-1032, August 2003.
- (2003) Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI) , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

9
- 77950356463
- Model-based Bayesian reinforcement learning in partially observable domains
- P. Poupart and N. Vlassis. Model-based Bayesian reinforcement learning in partially observable domains. In International Symposium on Artificial Intelligence and Mathematics (ISAIM), 2008.
- (2008) International Symposium on Artificial Intelligence and Mathematics (ISAIM)
- Poupart, P.¹ Vlassis, N.²

10
- 0004102479
- MIT Press, Cambridge, MA
- R. Sutton and A. Barto. Reinforcement learning: An introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.