메뉴 건너뛰기




Volumn 5, Issue , 2004, Pages 361-397

RCV1: A new benchmark collection for text categorization research

Author keywords

Applications; Automated indexing; Controlled vocabulary indexing; Effectiveness measures; Evaluation; Feature selection; K NN; Methodology; Multiclass; Multilabel; Nearest neighbor; News articles; Operational systems; Rocchio; SCut; SCutFBR; Support vector machines; SVMs; Term weighting; Test collection; Text classification; Thresholding

Indexed keywords

APPLICATIONS; CLASSIFICATION (OF INFORMATION); FEATURE EXTRACTION; INDEXING (OF INFORMATION); QUALITY CONTROL; SEMANTICS; SUPPORT VECTOR MACHINES; TAXONOMIES;

EID: 84876811202     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (2564)

References (45)
  • 2
    • 9444224089 scopus 로고    scopus 로고
    • KNN, Rocchio and metrics for information filtering at TREC-10
    • Gaithersburg, MD 20899-0001, National Institute of Standards and Technology
    • T. Ault and Y. Yang. kNN, Rocchio and metrics for information filtering at TREC-10. In The Tenth Text REtrieval Conference (TREC 2001), pages 84-93, Gaithersburg, MD 20899-0001, 2002. National Institute of Standards and Technology. http://trec.nist.gov/pubs/trec10/papers/cmucatcorrect.pdf.
    • (2002) The Tenth Text REtrieval Conference (TREC 2001) , pp. 84-93
    • Ault, T.1    Yang, Y.2
  • 7
    • 83055194679 scopus 로고
    • Design of the MUC-6 evaluation
    • Defense Advanced Research Projects Agency, Morgan Kaufmann
    • R. Grishman and B. Sundheim. Design of the MUC-6 evaluation. In Sixth Message Understanding Evaluation (MUC-6), pages 1-12. Defense Advanced Research Projects Agency, Morgan Kaufmann, 1995.
    • (1995) Sixth Message Understanding Evaluation (MUC-6) , pp. 1-12
    • Grishman, R.1    Sundheim, B.2
  • 11
    • 84957069814 scopus 로고    scopus 로고
    • Text categorization with support vector machines: Learning with many relevant features
    • Berlin
    • T. Joachims. Text categorization with support vector machines: Learning with many relevant features. In European Conference on Machine Learning (ECML'98), pages 137-142, Berlin, 1998.
    • (1998) European Conference on Machine Learning (ECML'98) , pp. 137-142
    • Joachims, T.1
  • 12
    • 0001938951 scopus 로고    scopus 로고
    • Transductive inference for text classification using support vector machines
    • San Francisco, CA
    • T. Joachims. Transductive inference for text classification using support vector machines. In International Conference on Machine Learning (ICML'99), pages 200-209, San Francisco, CA, 1999.
    • (1999) International Conference on Machine Learning (ICML'99) , pp. 200-209
    • Joachims, T.1
  • 17
    • 0002131932 scopus 로고
    • Evaluating text categorization
    • Defense Advanced Research Projects Agency, Morgan Kaufmann
    • D. D. Lewis. Evaluating text categorization. In Proceedings of Speech and Natural Language Workshop, pages 312-318. Defense Advanced Research Projects Agency, Morgan Kaufmann, 1991.
    • (1991) Proceedings of Speech and Natural Language Workshop , pp. 312-318
    • Lewis, D.D.1
  • 21
    • 1542268653 scopus 로고    scopus 로고
    • Applying support vector machines to the TREC-2001 batch filtering and routing tasks
    • Gaithersburg, MD 20899-0001, National Institute of Standards and Technology
    • D. D. Lewis. Applying support vector machines to the TREC-2001 batch filtering and routing tasks. In The Tenth Text REtrieval Conference (TREC 2001), pages 286-292, Gaithersburg, MD 20899-0001, 2002. National Institute of Standards and Technology. http://trec.nist.gov/pubs/trec10/papers/daviddlewis-trec2001-draft4.pdf
    • (2002) The Tenth Text REtrieval Conference (TREC 2001) , pp. 286-292
    • Lewis, D.D.1
  • 24
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130-137, 1980.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.F.1
  • 26
    • 0344689278 scopus 로고    scopus 로고
    • The TREC 2001 filtering track report
    • Gaithersburg, MD 20899-0001, National Institute of Standards and Technology
    • S. Robertson and I. Soboroff. The TREC 2001 filtering track report. In The Tenth Text REtrieval Conference (TREC 2001), pages 26-37, Gaithersburg, MD 20899-0001, 2002. National Institute of Standards and Technology. http://trec.nist.gov/pubs/trec10/papers/filtering2track.pdf
    • (2002) The Tenth Text REtrieval Conference (TREC 2001) , pp. 26-37
    • Robertson, S.1    Soboroff, I.2
  • 34
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • F. Sebastiani. Machine learning in automated text categorization. ACM Computing Surveys, 34(1):1-47, 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 35
    • 0343397223 scopus 로고
    • The pragmatics of information retrieval experimentation
    • K. Sparck Jones, editor, chapter 5. Butterworths
    • J. M. Tague. The pragmatics of information retrieval experimentation. In K. Sparck Jones, editor, Information Retrieval Experiment, chapter 5. Butterworths, 1981.
    • (1981) Information Retrieval Experiment
    • Tague, J.M.1
  • 40
    • 27144441097 scopus 로고    scopus 로고
    • An evaluation of statistical approaches to text categorization
    • Y. Yang. An evaluation of statistical approaches to text categorization. Information Retrieval, 1(1/2):67-88, 1999.
    • (1999) Information Retrieval , vol.1 , Issue.1-2 , pp. 67-88
    • Yang, Y.1
  • 45
    • 0001868572 scopus 로고    scopus 로고
    • Text categorization based on regularized linear classification methods
    • T. Zhang and F. J. Oles. Text categorization based on regularized linear classification methods. Information Retrieval, 4(1):5-31, 2001.
    • (2001) Information Retrieval , vol.4 , Issue.1 , pp. 5-31
    • Zhang, T.1    Oles, F.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.