Volume 1, Issue , 2012, Pages 97-104

Near-optimal BRL using optimistic local transitions

Author keywords

[No Author keywords available]

Indexed keywords

COMBINATORIAL EXPLOSION; HIGH PROBABILITY; LOCAL TRANSITIONS; SAMPLE COMPLEXITY; TRANSITION FUNCTIONS; UNKNOWN ENVIRONMENTS;

EID: 84867138336     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (25)

References (15)
  • 3
    • Brafman, R.I. and Tennenholtz, M. R-max - A general polynomial time algorithm for near-optimal reinforcement learning. JMLR, 3:213-231, 2003.
  • 5
    • Kearns, M. and Singh, S. Near-optimal reinforcement learning in polynomial time. In Machine Learning, pp. 260-268, 1998.
  • 6
    • Kolter, J. and Ng, A. Near-Bayesian exploration in polynomial time. In Proc. of ICML, 2009.
  • 9
    • Sorg, J., Singh, S., and Lewis, R. Variance-based rewards for approximate Bayesian reinforcement learning. In Proc. of UAI, 2010.
  • 10
    • Strehl, A.L., Li, L., and Littman, M.L. Reinforcement learning in finite MDPs: PAC analysis. JMLR, 10:2413-2444, December 2009.
  • 11
    • Strens, M.J.A. A Bayesian framework for reinforcement learning. In Proc. of ICML, 2000.
  • 13
    • Szita, I. and Szepesvári, C. Model-based reinforcement learning with nearly tight exploration complexity bounds. In Proc. of ICML, 2010.
  • 15
    • Walsh, T.J., Szita, I., Diuk, C., and Littman, M.L. Exploring compact reinforcement-learning representations with linear regression. In Proc. of UAI, 2009.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.