SCOPUS Information Search Platform

2010 IEEE 9th International Conference on Development and Learning, ICDL-2010 - Conference Program

Volume , Issue , 2010, Pages 191-196

Real time targeted exploration in large domains

(2) Hester, Todd (a); Stone, Peter (a)

(a) University of Texas at Austin (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACTION EFFECT; BAYESIAN METHODS; COMPUTATION TIME; INTRINSIC MOTIVATION; LARGE DOMAIN; MODEL FREE; MODEL-BASED; NOVEL DOMAIN; REAL TIME; REAL-WORLD TASK;

ALGORITHMS; BAYESIAN NETWORKS;

MATHEMATICAL MODELS;

EID: 78149247074 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/DEVLRN.2010.5578845 Document Type: Conference Paper

Times cited: 15

References (15)

1
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 33750293964
- Bandit based monte-carlo planning
- Number 4212 in LNCS. Springer
- L. Kocsis and C. Szepesvári, "Bandit based monte-carlo planning," in ECML-06. Number 4212 in LNCS. Springer, 2006, pp. 282-293.
- (2006) ECML-06 , pp. 282-293
- Kocsis, L.¹ Szepesvári, C.²

3
- 84899784589
- Generalized model learning for reinforcement learning in factored domains
- May
- T. Hester and P. Stone, "Generalized model learning for reinforcement learning in factored domains," in AAMAS, May 2009.
- (2009) AAMAS
- Hester, T.¹ Stone, P.²

4
- 78149266799
- Efficient reinforcement learning for motor control
- Hluboka nad Vltavou, Czech Republic, Sept.
- M. P. Deisenroth and C. E. Rasmussen, "Efficient reinforcement learning for motor control," in 10th International PhD Workshop on Systems and Control, Hluboka nad Vltavou, Czech Republic, Sept. 2009.
- (2009) 10th International PhD Workshop on Systems and Control
- Deisenroth, M.P.¹ Rasmussen, C.E.²

5
- 84880854156
- R-MAX - A general polynomial time algorithm for near-optimal reinforcement learning
- R. I. Brafman and M. Tennenholtz, "R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning," in IJCAI, 2001, pp. 953-958.
- (2001) IJCAI , pp. 953-958
- Brafman, R.I.¹ Tennenholtz, M.²

6
- 33749242809
- Learning the structure of factored markov decision processes in reinforcement learning problems
- New York, NY, USA: ACM
- T. Degris, O. Sigaud, and P.-H. Wuillemin, "Learning the structure of factored markov decision processes in reinforcement learning problems," in ICML '06. New York, NY, USA: ACM, 2006, pp. 257-264.
- (2006) ICML '06 , pp. 257-264
- Degris, T.¹ Sigaud, O.² Wuillemin, P.-H.³

7
- 33749251297
- An analytic solution to discrete bayesian reinforcement learning
- New York, NY, USA: ACM
- P. Poupart, N. Vlassis, J. Hoey, and K. Regan, "An analytic solution to discrete bayesian reinforcement learning," in ICML '06. New York, NY, USA: ACM, 2006, pp. 697-704.
- (2006) ICML '06 , pp. 697-704
- Poupart, P.¹ Vlassis, N.² Hoey, J.³ Regan, K.⁴

8
- 14344258433
- A Bayesian framework for reinforcement learning
- M. Strens, "A Bayesian framework for reinforcement learning," in ICML '00, 2000, pp. 943-950.
- (2000) ICML '00 , pp. 943-950
- Strens, M.¹

9
- 78649507911
- A bayesian sampling approach to exploration in reinforcement learning
- J. Asmuth, L. Li, M. L. Littman, A. Nouri, and D. Wingate, "A bayesian sampling approach to exploration in reinforcement learning," in UAI, 2009.
- (2009) UAI
- Asmuth, J.¹ Li, L.² Littman, M.L.³ Nouri, A.⁴ Wingate, D.⁵

10
- 34250703734
- An intrinsic reward mechanism for efficient exploration
- O. Şimşek and A. G. Barto, "An intrinsic reward mechanism for efficient exploration," in ICML, 2006, pp. 833-840.
- (2006) ICML , pp. 833-840
- Şimşek, O.¹ Barto, A.G.²

11
- 34047267520
- Intrinsic motivation systems for autonomous mental development
- P.-Y. Oudeyer, F. Kaplan, and V. V. Hafner, "Intrinsic motivation systems for autonomous mental development." IEEE Trans. Evolutionary Computation, vol. 11, no. 2, pp. 265-286, 2007.
- (2007) IEEE Trans. Evolutionary Computation , vol.11 , Issue.2 , pp. 265-286
- Oudeyer, P.-Y.¹ Kaplan, F.² Hafner, V.V.³

12
- 56449122733
- Knows what it knows: A framework for self-aware learning
- L. Li, M. L. Littman, and T. J. Walsh, "Knows what it knows: a framework for self-aware learning," in ICML, 2008, pp. 568-575.
- (2008) ICML , pp. 568-575
- Li, L.¹ Littman, M.L.² Walsh, T.J.³

13
- 0035478854
- Random forests
- L. Breiman, "Random forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.
- (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
- Breiman, L.¹

14
- 33744584654
- Induction of decision trees
- J. R. Quinlan, "Induction of decision trees," Machine Learning, vol. 1, pp. 81-106, 1986.
- (1986) Machine Learning , vol.1 , pp. 81-106
- Quinlan, J.R.¹

15
- 0004049893
- Ph.D. dissertation, University of Cambridge
- C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, University of Cambridge, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.