SCOPUS 정보 검색 플랫폼

International Conference on Information and Knowledge Management, Proceedings

Volumn , Issue , 2013, Pages 989-998

Towards minimizing the annotation cost of certified text classification

(4) Bagdouri, Mossaab a Webber, William a Lewis, David D b Oard, Douglas W a

a UNIVERSITY OF MARYLAND (United States)

b David D Lewis Consulting (United States)

Author keywords

E discovery; Evaluation; Supervised learning; Text categorization

Indexed keywords

ALLOCATION POLICIES; ANALYTIC APPROXIMATION; CONFIDENCE INTERVAL; E DISCOVERIES; EVALUATION; STATISTICAL VALIDITY; TEXT CATEGORIZATION; TEXT CLASSIFICATION;

KNOWLEDGE MANAGEMENT; SUPERVISED LEARNING; TEXT PROCESSING;

CLASSIFICATION (OF INFORMATION);

EID: 84889598884 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2505515.2505708 Document Type: Conference Paper

Times cited : (17)

References (32)

1
- 84925604888
- No unbiased estimator of the variance of k-fold cross-validation
- September
- Y. Bengio and Y. Grandvalet. No unbiased estimator of the variance of k-fold cross-validation. Journal of Machine Learning Research, 5:1089-1105, September 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 1089-1105
- Bengio, Y.¹ Grandvalet, Y.²

2
- 0004134209
- Cambirdge University Press
- A. Borodin and R. El-Yaniv. Online Computation and Competitive Analysis. Cambirdge University Press, 2005.
- (2005) Online Computation and Competitive Analysis
- Borodin, A.¹ El-Yaniv, R.²

3
- 13344280339
- One-sided confidence intervals in discrete distributions
- T. T. Cai. One-sided confidence intervals in discrete distributions. Journal of Statistical Planning and Inference, 131:63-88, 2005.
- (2005) Journal of Statistical Planning and Inference , vol.131 , pp. 63-88
- Cai, T.T.¹

4
- 33644696493
- Reducing workload in systematic review preparation using automated citation classification
- A. M. Cohen, W. R. Hersh, K. Peterson, and P.-Y. Yen. Reducing workload in systematic review preparation using automated citation classification. Journal of the American Medical Informatics Association, 13:206-219, 2006.
- (2006) Journal of the American Medical Informatics Association , vol.13 , pp. 206-219
- Cohen, A.M.¹ Hersh, W.R.² Peterson, K.³ Yen, P.-Y.⁴

5
- 0003577917
- Lawrence Erlbaum Associates, 2nd edition
- J. Cohen. Statistical Power Analysis for the Behavioral Sciences. Lawrence Erlbaum Associates, 2nd edition, 1988.
- (1988) Statistical Power Analysis for the Behavioral Sciences
- Cohen, J.¹

6
- 50349090268
- Cross-validation and bootstrapping are unreliable in small sample classification
- A. Isaksson, M. Wallman, H. Göransson, and M. G. Gustafsson. Cross-validation and bootstrapping are unreliable in small sample classification. Pattern Recognition Letters, 29:1960-1965, 2008.
- (2008) Pattern Recognition Letters , vol.29 , pp. 1960-1965
- Isaksson, A.¹ Wallman, M.² Göransson, H.³ Gustafsson, M.G.⁴

7
- 84957069814
- Text categorization with support vector machines: Learning with many relevant features
- T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In ECML, pages 137-142, 1998.
- (1998) ECML , pp. 137-142
- Joachims, T.¹

8
- 31844446804
- A support vector method for multivariate performance measures
- T. Joachims. A support vector method for multivariate performance measures. In ICML, pages 377-384, 2005.
- (2005) ICML , pp. 377-384
- Joachims, T.¹

9
- 33749563073
- Training linear svms in linear time
- T. Joachims. Training linear svms in linear time. In KDD, pages 217-226, 2006.
- (2006) KDD , pp. 217-226
- Joachims, T.¹

10
- 68949154453
- Sparse kernel svms via cutting-plane training
- T. Joachims and C.-N. J. Yu. Sparse kernel svms via cutting-plane training. In ECML PKDD: Part I, 2009.
- (2009) ECML PKDD: Part I
- Joachims, T.¹ Yu, C.-N.J.²

11
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4:237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

12
- 79951773917
- Technical report, Electronic Discovery Institute October
- A. Kershaw and J. Howie. eDiscovery institute survey on predictive coding. Technical report, Electronic Discovery Institute (http://www. ediscoveryinstitute.org/pubs/PredictiveCodingSurvey.pdf), October 2010.
- (2010) EDiscovery Institute Survey on Predictive Coding
- Kershaw, A.¹ Howie, J.²

13
- 80053225505
- Combining train set and test set bounds
- J. Langford. Combining train set and test set bounds. In ICML, pages 331-338, 2002.
- (2002) ICML , pp. 331-338
- Langford, J.¹

14
- 21844462365
- Tutorial on practical prediction theory for classification
- J. Langford. Tutorial on practical prediction theory for classification. Journal of Machine Learning Research, 6(1):273-306, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , Issue.1 , pp. 273-306
- Langford, J.¹

15
- 0004143760
- Springer, 3rd edition
- E. L. Lehmann and J. P. Romano. Testing Statistical Hypotheses. Springer, 3rd edition, 2005.
- (2005) Testing Statistical Hypotheses
- Lehmann, E.L.¹ Romano, J.P.²

16
- 85013879626
- A sequential algorithm for training text classifiers
- D. D. Lewis and W. A. Gale. A sequential algorithm for training text classifiers. In SIGIR, pages 3-12, 1994.
- (1994) SIGIR , pp. 3-12
- Lewis, D.D.¹ Gale, W.A.²

17
- 84876811202
- RCV1: A new benchmark collection for text categorization research
- December
- D. D. Lewis, Y. Yang, T. G. Rose, and F. Li. RCV1: A new benchmark collection for text categorization research. Journal of Machine Learning Research, 5:361-397, December 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 361-397
- Lewis, D.D.¹ Yang, Y.² Rose, T.G.³ Li, F.⁴

18
- 0004031293
- McGraw-Hill, 3rd edition
- A. M. Mood, F. A. Graybill, and D. C. Boes. Introduction to the Theory of Statistics. McGraw-Hill, 3rd edition, 1974.
- (1974) Introduction to the Theory of Statistics
- Mood, A.M.¹ Graybill, F.A.² Boes, D.C.³

19
- 0042847140
- Inference for the generalization error
- C. Nadeau and Y. Bengio. Inference for the generalization error. Machine Learning, 52:239-281, 2003.
- (2003) Machine Learning , vol.52 , pp. 239-281
- Nadeau, C.¹ Bengio, Y.²

20
- 79951773558
- Evaluation of information retrieval for e-discovery
- D. W. Oard, J. R. Baron, B. Hedin, D. D. Lewis, and S. Tomlinson. Evaluation of information retrieval for e-discovery. Artificial Intelligence and Law, 18(4):347-386, 2010.
- (2010) Artificial Intelligence and Law , vol.18 , Issue.4 , pp. 347-386
- Oard, D.W.¹ Baron, J.R.² Hedin, B.³ Lewis, D.D.⁴ Tomlinson, S.⁵

21
- 84887928716
- Where the money goes: Understanding litigant expenditures for producing electronic discovery
- Santa Monica, CA
- N. M. Pace and L. Zakaras. Where the money goes: Understanding litigant expenditures for producing electronic discovery. Technical report, RAND Institute for Civil Justice, Santa Monica, CA, 2012.
- (2012) Technical Report, RAND Institute for Civil Justice
- Pace, N.M.¹ Zakaras, L.²

22
- 0002515248
- Efficient progressive sampling
- F. Provost, D. Jensen, and T. Oates. Efficient progressive sampling. In KDD, pages 23-32, 1999.
- (1999) KDD , pp. 23-32
- Provost, F.¹ Jensen, D.² Oates, T.³

23
- 45549117987
- Term-weighting approaches in automatic text retrieval
- G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513-523, 1988.
- (1988) Information Processing and Management , vol.24 , Issue.5 , pp. 513-523
- Salton, G.¹ Buckley, C.²

24
- 0037245343
- Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification
- January
- R. Simon, M. D. Radmacher, K. Dobbin, and L. M. McShane. Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification. Journal of the National Cancer Institute, 95(1):14-18, January 2003.
- (2003) Journal of the National Cancer Institute , vol.95 , Issue.1 , pp. 14-18
- Simon, R.¹ Radmacher, M.D.² Dobbin, K.³ McShane, L.M.⁴

25
- 0003449101
- University Science Books, 2nd edition
- J. R. Taylor. Introduction to error analysis. University Science Books, 2nd edition, 1997.
- (1997) Introduction to Error Analysis
- Taylor, J.R.¹

26
- 0346868087
- Wiley, 2nd edition
- S. K. Thompson. Sampling. Wiley, 2nd edition, 2002.
- (2002) Sampling
- Thompson, S.K.¹

27
- 0016082639
- Bibliography on estimation of misclassification
- July
- G. T. Toussaint. Bibliography on estimation of misclassification. IEEE Transactions on Information Theory, IT-20(4):472-479, July 1974.
- (1974) IEEE Transactions on Information Theory , vol.IT-20 , Issue.4 , pp. 472-479
- Toussaint, G.T.¹

28
- 0004217877
- Butterworth
- C. J. van Rijsbergen. Information Retrieval. Butterworth, 1979.
- (1979) Information Retrieval
- Van Rijsbergen, C.J.¹

29
- 84873884780
- Approximate recall confidence intervals
- W. Webber. Approximate recall confidence intervals. ACM Transactions on Information Systems, 31(1):2:1-33, 2013.
- (2013) ACM Transactions on Information Systems , vol.31 , Issue.1-2 , pp. 1-33
- Webber, W.¹

30
- 84883057552
- Sequential testing in classifier evaluation yields biased estimates of effectiveness
- July
- W. Webber, M. Bagdouri, D. D. Lewis, and D. W. Oard. Sequential testing in classifier evaluation yields biased estimates of effectiveness. In SIGIR, pages 933-936, July 2013.
- (2013) SIGIR , pp. 933-936
- Webber, W.¹ Bagdouri, M.² Lewis, D.D.³ Oard, D.W.⁴

31
- 33645784163
- BMC Bioinformatics
- U. Wickenberg-Bolin, H. Göransson, M. Fryknäs, M. G. Gustafsson, and A. Isaksson. Improved variance estimation of classification performance via reduction of bias caused by small sample size. BMC Bioinformatics, 2006.
- (2006) Improved Variance Estimation of Classification Performance Via Reduction of Bias Caused by Small Sample Size
- Wickenberg-Bolin, U.¹ Göransson, H.² Fryknäs, M.³ Gustafsson, M.G.⁴ Isaksson, A.⁵

32
- 85024373635
- A re-examination of text categorization methods
- Y. Yang and X. Liu. A re-examination of text categorization methods. In SIGIR, pages 42-49, 1999.
- (1999) SIGIR , pp. 42-49
- Yang, Y.¹ Liu, X.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.