SCOPUS 정보 검색 플랫폼

ACM International Conference Proceeding Series

Volumn 382, Issue , 2009, Pages

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

(3) Diuk, Carlos a Li, Lihong a Leffler, Bethany R a

a RUTGERS UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARK DOMAINS; FEATURE SELECTION; LOWER BOUNDS; NAVIGATION PROBLEM; STATE-OF-THE-ART ALGORITHMS; STRUCTURE-LEARNING; UPPER BOUND;

ADAPTIVE ALGORITHMS; CONTROL THEORY; EDUCATION; REINFORCEMENT; ROBOT LEARNING;

LEARNING ALGORITHMS;

EID: 70049104382 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1553374.1553406 Document Type: Conference Paper

Times cited : (8)

References (21)

1
- 33747670266
- Learning factor graphs in polynomial time and sample complexity
- Abbeel, P., Koller, D., & Ng, A. Y. (2006). Learning factor graphs in polynomial time and sample complexity. Journal of Machine Learning Research, 7, 1743-1788.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1743-1788
- Abbeel, P.¹ Koller, D.² Ng, A.Y.³

2
- 0346942368
- Decisiontheoretic planning: Structural assumptions and computational leverage
- Boutilier, C., Dean, T., & Hanks, S. (1999). Decisiontheoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11, 1-94.
- (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

3
- 0041965975
- R-max a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., & Tennenholtz, M. (2002). R-max a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3, 213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

4
- 70049084399
- CORL: A continuous-state offsetdynamics reinforcement learner
- Brunskill, E., Leffler, B. R., Li, L., Littman, M. L., & Roy, N. (2008). CORL: A continuous-state offsetdynamics reinforcement learner. Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI-08).
- (2008) Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI-08)
- Brunskill, E.¹ Leffler, B.R.² Li, L.³ Littman, M.L.⁴ Roy, N.⁵

5
- 0031140246
- How to use expert advice
- Cesa-Bianchi, N., Freund, Y., Haussler, D., Helmbold, D. P., Schapire, R. E., & Warmuth, M. K. (1997). How to use expert advice. Journal of the ACM, 44, 427-485.
- (1997) Journal of the ACM , vol.44 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Haussler, D.³ Helmbold, D.P.⁴ Schapire, R.E.⁵ Warmuth, M.K.⁶

6
- 84990553353
- A model for reasoning about persistence and causation
- Dean, T., & Kanazawa, K. (1989). A model for reasoning about persistence and causation. Computational Intelligence, 5, 142-150.
- (1989) Computational Intelligence , vol.5 , pp. 142-150
- Dean, T.¹ Kanazawa, K.²

7
- 4544318426
- Efficient solution algorithms for factored MDPs
- Guestrin, C., Koller, D., Parr, R., & Venkataraman, S. (2003). Efficient solution algorithms for factored MDPs. Journal of Artificial Intelligence Research, 19, 399-468.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 399-468
- Guestrin, C.¹ Koller, D.² Parr, R.³ Venkataraman, S.⁴

8
- 23244466805
- Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London, UK
- Kakade, S. (2003). On the sample complexity of reinforcement learning. Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London, UK.
- (2003) On the sample complexity of reinforcement learning
- Kakade, S.¹

9
- 84880677563
- Efficient reinforcement learning in factored MDPs
- Kearns, M. J., & Koller, D. (1999). Efficient reinforcement learning in factored MDPs. Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99) (pp. 740-747).
- (1999) Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99) , pp. 740-747
- Kearns, M.J.¹ Koller, D.²

10
- 0028460231
- Efficient distribution-free learning of probabilistic concepts
- Kearns, M. J., & Schapire, R. E. (1994). Efficient distribution-free learning of probabilistic concepts. Journal of Computer and System Sciences, 48, 464-497.
- (1994) Journal of Computer and System Sciences , vol.48 , pp. 464-497
- Kearns, M.J.¹ Schapire, R.E.²

11
- 0036832954
- Near-optimal reinforcement learning in polynomial time
- Kearns, M. J., & Singh, S. P. (2002). Near-optimal reinforcement learning in polynomial time. Machine Learning, 49, 209-232.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.J.¹ Singh, S.P.²

12
- 36349026477
- Efficient reinforcement learning with relocatable action models
- Leffler, B. R., Littman, M. L., & Edmunds, T. (2007). Efficient reinforcement learning with relocatable action models. Proceedings of the Twenty-Second Conference on Artificial Intelligence (AAAI-07) (pp. 572-577).
- (2007) Proceedings of the Twenty-Second Conference on Artificial Intelligence (AAAI-07) , pp. 572-577
- Leffler, B.R.¹ Littman, M.L.² Edmunds, T.³

13
- 70049090614
- Li, L. (2009). A unifying framework for computational reinforcement learning theory. Doctoral dissertation, Department of Computer Science, Rutgers University, New Brunswick, NJ. Li, L., Littman, M. L., & Walsh, T. J. (2008). Knows whatit knows: A framework for self-aware learning. Proceedings of the Twenty-Fifth International Conference on Machine Learning (ICML-08) (pp. 568-575).
- Li, L. (2009). A unifying framework for computational reinforcement learning theory. Doctoral dissertation, Department of Computer Science, Rutgers University, New Brunswick, NJ. Li, L., Littman, M. L., & Walsh, T. J. (2008). Knows whatit knows: A framework for self-aware learning. Proceedings of the Twenty-Fifth International Conference on Machine Learning (ICML-08) (pp. 568-575).

14
- 30044441333
- The sample complexity of exploration in the multi-armed bandit problem
- Mannor, S., & Tsitsiklis, J. N. (2004). The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research, 5, 623- 648.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 623-648
- Mannor, S.¹ Tsitsiklis, J.N.²

15
- 85102627959
- New York: Wiley-Interscience
- Puterman, M. L. (1994). Markov decision processes: Discrete stochastic dynamic programming. New York: Wiley-Interscience.
- (1994) Markov decision processes: Discrete stochastic dynamic programming
- Puterman, M.L.¹

16
- 34548763246
- Model-based reinforcement learning in factored-state MDPs
- Strehl, A. L. (2007). Model-based reinforcement learning in factored-state MDPs. Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (pp. 103-110).
- (2007) Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning , pp. 103-110
- Strehl, A.L.¹

17
- 36348930987
- Efficient structure learning in factored-state MDPs
- Strehl, A. L., Diuk, C., & Littman, M. L. (2007). Efficient structure learning in factored-state MDPs. Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence (AAAI-07) (pp. 645-650).
- (2007) Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence (AAAI-07) , pp. 645-650
- Strehl, A.L.¹ Diuk, C.² Littman, M.L.³

18
- 34548745051
- Incremental model-based learners with formal learning-time guarantees
- Strehl, A. L., Li, L., & Littman, M. L. (2006a). Incremental model-based learners with formal learning-time guarantees. Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI-06) (pp. 485-493).
- (2006) Proceedings of the Twenty-Second Conference on Uncertainty in Artificial Intelligence (UAI-06) , pp. 485-493
- Strehl, A.L.¹ Li, L.² Littman, M.L.³

19
- 33749255382
- PAC model-free reinforcement learning
- Strehl, A. L., Li, L.,Wiewiora, E., Langford, J., & Littman, M. L. (2006b). PAC model-free reinforcement learning. Proceedings of the Twenty-Third International Conference on Machine Learning (ICML-06) (pp. 881-888).
- (2006) Proceedings of the Twenty-Third International Conference on Machine Learning (ICML-06) , pp. 881-888
- Strehl, A.L.¹ Li, L.² Wiewiora, E.³ Langford, J.⁴ Littman, M.L.⁵

20
- 0021518106
- A theory of the learnable
- Valiant, L. G. (1984). A theory of the learnable. Communications of the ACM, 27, 1134-1142.
- (1984) Communications of the ACM , vol.27 , pp. 1134-1142
- Valiant, L.G.¹

21
- 0000819141
- A learning criterion for stochastic rules
- Yamanishi, K. (1992). A learning criterion for stochastic rules. Machine Learning, 9, 165-203.
- (1992) Machine Learning , vol.9 , pp. 165-203
- Yamanishi, K.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.