SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2008, Pages 568-575

Knows what it knows: A framework for self-aware learning

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING SYSTEMS; REINFORCEMENT; ROBOT LEARNING;

LEARNING FRAMEWORKS; LEARNING PROBLEMS; LEARNING SETTINGS; OPEN PROBLEMS; TRAINING EXAMPLES; REINFORCEMENT-LEARNING;

EDUCATION;

EID: 56449122733 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (87)

References (21)

1
- 0742284346
- Queries revisited
- Angluin, D. (2004). Queries revisited. Theoretical Computer Science, 313, :175-194.
- (2004) Theoretical Computer Science , vol.313 , pp. 175-194
- Angluin, D.¹

2
- 1942450194
- Technical Report CMU-RI-TR-01-25, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA
- Bagnell, J., Ng, A. Y., & Schneider, J. (2001). Solving uncertain Markov decision problems (Technical Report CMU-RI-TR-01-25). Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
- (2001) Solving uncertain Markov decision problems
- Bagnell, J.¹ Ng, A.Y.² Schneider, J.³

3
- 0028517062
- Separating distribution-free and mistake-bound learning models over the Boolean domain
- Blum, A. (1994). Separating distribution-free and mistake-bound learning models over the Boolean domain. SIAM Journal on Computing, 23, 990-1000.
- (1994) SIAM Journal on Computing , vol.23 , pp. 990-1000
- Blum, A.¹

4
- 0041965975
- R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., & Tennenholtz, M. (2002). R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3, 213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

5
- 33745738567
- Worst-case analysis of selective sampling for linear classification
- Cesa-Bianchi, N., Gentile, C., & Zaniboni, L. (2006). Worst-case analysis of selective sampling for linear classification. Journal of Machine Learning Research, 7, 1205-1230.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1205-1230
- Cesa-Bianchi, N.¹ Gentile, C.² Zaniboni, L.³

6
- 20544462399
- Minimizing regret with label efficient prediction
- Cesa-Bianchi, N., Lugosi, G., & Stoltz, G. (2005). Minimizing regret with label efficient prediction. IEEE Transactions on Information Theory, 51, 2152-2162.
- (2005) IEEE Transactions on Information Theory , vol.51 , pp. 2152-2162
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

7
- 0028424239
- Improving generalization with active learning
- Cohn, D. A., Atlas, L., & Ladner, R. E. (1994). Improving generalization with active learning. Machine Learning, 15, 201-221.
- (1994) Machine Learning , vol.15 , pp. 201-221
- Cohn, D.A.¹ Atlas, L.² Ladner, R.E.³

8
- 78650606637
- A quantitative study of hypothesis selection
- Fong, P. W. L. (1995). A quantitative study of hypothesis selection. Proceedings of the Twelfth International Conference on Machine Learning (ICML-95) (pp. 226-234).
- (1995) Proceedings of the Twelfth International Conference on Machine Learning (ICML-95) , pp. 226-234
- Fong, P.W.L.¹

9
- 0030643068
- Using and combining predictors that specialize
- Freund, Y., Schapire, R. E., Singer, Y., & Warmuth, M. K. (1997). Using and combining predictors that specialize. STOC '97: Proceedings of the twenty-ninth annual ACM symposium on Theory of computing (pp. 334-343).
- (1997) STOC '97: Proceedings of the twenty-ninth annual ACM symposium on Theory of computing , pp. 334-343
- Freund, Y.¹ Schapire, R.E.² Singer, Y.³ Warmuth, M.K.⁴

10
- 0034666805
- Apple tasting
- Helmbold, D. P., Littlestone, N., & Long, P. M. (2000). Apple tasting. Information and Computation, 161, 85-139.
- (2000) Information and Computation , vol.161 , pp. 85-139
- Helmbold, D.P.¹ Littlestone, N.² Long, P.M.³

11
- 1942452450
- Exploration in metric state spaces
- Kakade, S., Kearns, M., & Langford, J. (2003). Exploration in metric state spaces. Proceedings of the 20th International Conference on Machine Learning.
- (2003) Proceedings of the 20th International Conference on Machine Learning
- Kakade, S.¹ Kearns, M.² Langford, J.³

12
- 23244466805
- Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London
- Kakade, S. M. (2003). On the sample complexity of reinforcement learning. Doctoral dissertation, Gatsby Computational Neuroscience Unit, University College London.
- (2003) On the sample complexity of reinforcement learning
- Kakade, S.M.¹

13
- 84880677563
- Efficient reinforcement learning in factored MDPs
- Kearns, M. J., & Koller, D. (1999). Efficient reinforcement learning in factored MDPs. Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI) (pp. 740-747).
- (1999) Proceedings of the 16th International Joint Conference on Artificial Intelligence (IJCAI) , pp. 740-747
- Kearns, M.J.¹ Koller, D.²

14
- 0036832954
- Near-optimal reinforcement learning in polynomial time
- Kearns, M. J., & Singh, S. P. (2002). Near-optimal reinforcement learning in polynomial time. Machine Learning, 49, 209-232.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.J.¹ Singh, S.P.²

15
- 0037400054
- An empirical study of two approaches to sequence learning for anomaly detection
- Lane, T., & Brodley, C. E. (2003). An empirical study of two approaches to sequence learning for anomaly detection. Machine Learning, 51, 73-107.
- (2003) Machine Learning , vol.51 , pp. 73-107
- Lane, T.¹ Brodley, C.E.²

16
- 34250091945
- Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
- Littlestone, N. (1987). Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2, 285-318.
- (1987) Machine Learning , vol.2 , pp. 285-318
- Littlestone, N.¹

17
- 36348930987
- Efficient structure learning in factored-state MDPs
- Strehl, A. L., Diuk, C., & Littman, M. L. (2007). Efficient structure learning in factored-state MDPs. Proceedings of the Twenty-Second National Conference on Artificial Intelligence (AAAI-07).
- (2007) Proceedings of the Twenty-Second National Conference on Artificial Intelligence (AAAI-07)
- Strehl, A.L.¹ Diuk, C.² Littman, M.L.³

18
- 85162058047
- Online linear regression and its application to model-based reinforcement learning
- Strehl, A. L., & Littman, M. L. (2008). Online linear regression and its application to model-based reinforcement learning. Advances in Neural Information Processing Systems 20.
- (2008) Advances in Neural Information Processing Systems , vol.20
- Strehl, A.L.¹ Littman, M.L.²

19
- 33749242078
- Experience-efficient learning in associative bandit problems
- Strehl, A. L., Mesterharm, C., Littman, M. L., & Hirsh, H. (2006). Experience-efficient learning in associative bandit problems. Proceedings of the Twenty-third International Conference on Machine Learning (ICML-06).
- (2006) Proceedings of the Twenty-third International Conference on Machine Learning (ICML-06)
- Strehl, A.L.¹ Mesterharm, C.² Littman, M.L.³ Hirsh, H.⁴

20
- 0021518106
- A theory of the learnable
- Valiant, L. G. (1984). A theory of the learnable. Communications of the ACM, 27, 1134-1142.
- (1984) Communications of the ACM , vol.27 , pp. 1134-1142
- Valiant, L.G.¹

21
- 49549125826
- Maximizing classifier utility when training data is costly
- Weiss, G. M., & Tian, Y. (2006). Maximizing classifier utility when training data is costly. SIGKDD Explorations, 8, 31-38.
- (2006) SIGKDD Explorations , vol.8 , pp. 31-38
- Weiss, G.M.¹ Tian, Y.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.