Volume 12, Issue 3, 2009, Pages 400-415

Using the Web as corpus for self-training text categorization

Author keywords

Authorship attribution; Self training; Semi supervised learning; Text categorization; Web as corpus

EID: 64749095041     PISSN: 13864564     EISSN: 15737659     Source Type: Journal    
DOI: 10.1007/s10791-008-9083-7     Document Type: Article
Times cited: 20

References (28)
  • 3
    • 33750242283
    • Tech. Rep. IR-408. Center of Intelligent Information Retrieval, UMass Amherst
    • Bekkerman, R., & Allan, J. (2004). Using bigrams in text categorization. Tech. Rep. IR-408. Center of Intelligent Information Retrieval, UMass Amherst.
    • (2004) Using Bigrams in Text Categorization
    • Bekkerman, R.1    Allan, J.2
  • 4
    • 34047230751
    • Who's at the keyboard: Authorship attribution in digital evidence investigations
    • Chaski, C. (2005). Who's at the keyboard: Authorship attribution in digital evidence investigations. International Journal of Digital Evidence, 4(1), 1-13.
    • (2005) International Journal of Digital Evidence , vol.4 , Issue.1 , pp. 1-13
    • Chaski, C.1
  • 5
    • 27144549260
    • Editorial: Special issue on learning from imbalanced data sets
    • Chawla, N. V., Japkowicz, N., & Kotcz, A. (2004). Editorial: Special issue on learning from imbalanced data sets. SIGKDD Explorations, 6(1), 1-6.
    • (2004) SIGKDD Explorations , vol.6 , Issue.1 , pp. 1-6
    • Chawla, N.V.1    Japkowicz, N.2    Kotcz, A.3
  • 6
    • 33845185878
    • Authorship attribution using word sequences
    • J. F. Martínez-Trinidad, J. A. Carrasco-Ochoa, & J. Kittler (Eds.), CIARP Springer
    • Coyotl-Morales, R. M., Villaseñor-Pineda, L., Montes-y-Gómez, M., & Rosso, P. (2006). Authorship attribution using word sequences. In J. F. Martínez-Trinidad, J. A. Carrasco-Ochoa, & J. Kittler (Eds.), CIARP (Vol. 4225, pp. 844-853). Springer, Lecture Notes in Computer Science.
    • (2006) Lecture Notes in Computer Science , vol.4225 , pp. 844-853
    • Coyotl-Morales, R.M.1    Villaseñor-Pineda, L.2    Montes-y-Gómez, M.3    Rosso, P.4
  • 11
    • 0001938951
    • Transductive inference for text classification using support vector machines
    • San Francisco, CA: Morgan Kaufmann
    • Joachims, T. (1999). Transductive inference for text classification using support vector machines. In Proceedings of the 16th International Conference on Machine Learning (pp. 200-209). San Francisco, CA: Morgan Kaufmann.
    • (1999) Proceedings of the 16th International Conference on Machine Learning , pp. 200-209
    • Joachims, T.1
  • 13
    • 0344154403
    • Introduction to the Special issue of the Web as Corpus
    • Kilgarriff, A., & Grefenstette, G. (2003). Introduction to the Special issue of the Web as Corpus. Computational Linguistics, 29(2), 333-347.
    • (2003) Computational Linguistics , vol.29 , Issue.2 , pp. 333-347
    • Kilgarriff, A.1    Grefenstette, G.2
  • 14
    • 77952507676
    • Authorship attribution of texts: A review
    • R. Ahlswede, L. Bäumer, N. Cai, H. K. Aydinian, V. Blinovsky, C. Deppe, & H. Mashurian (Eds.) Springer, Lecture Notes in Computer Science
    • Malyutov, M. B. (2006). Authorship attribution of texts: A review. In R. Ahlswede, L. Bäumer, N. Cai, H. K. Aydinian, V. Blinovsky, C. Deppe, & H. Mashurian (Eds.), GTIT-C (Vol. 4123, pp. 362-380). Springer, Lecture Notes in Computer Science.
    • (2006) GTIT-C , vol.4123 , pp. 362-380
    • Malyutov, M.B.1
  • 15
    • 35048879815
    • Complex linguistic features for text classification: A comprehensive study
    • S. McDonald & J. Tait (Eds.) Sunderland, UK: Springer, Lecture Notes in Computer Science
    • Moschitti, A., & Basili, R. (2004). Complex linguistic features for text classification: A comprehensive study. In S. McDonald & J. Tait (Eds.), Proceedings of the 26th European Conference on Information Retrieval (ECIR 2004) (Vol. 2997, pp. 181-196). Sunderland, UK: Springer, Lecture Notes in Computer Science.
    • (2004) Proceedings of the 26th European Conference on Information Retrieval (ECIR 2004) , vol.2997 , pp. 181-196
    • Moschitti, A.1    Basili, R.2
  • 16
    • 0033886806
    • Text classification from labeled and unlabeled documents using EM
    • Nigam, K., McCallum, A. K., Thrun, S., & Mitchell, T. (2000). Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3), 103-134.
    • (2000) Machine Learning , vol.39 , Issue.2-3 , pp. 103-134
    • Nigam, K.1    McCallum, A.K.2    Thrun, S.3    Mitchell, T.4
  • 17
    • 3843083955
    • Augmenting naive Bayes classifiers with statistical language models
    • Peng, F., Schuurmans, D., & Wang, S. (2004). Augmenting naive Bayes classifiers with statistical language models. Information Retrieval, 7(3-4), 317-345.
    • (2004) Information Retrieval , vol.7 , Issue.3-4 , pp. 317-345
    • Peng, F.1    Schuurmans, D.2    Wang, S.3
  • 18
    • 0002442796
    • Machine learning in automated text categorization
    • Sebastiani, F. (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1), 1-47.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 25
    • 84942780408
    • Integrating background knowledge into nearest-neighbor text classification
    • S. Craw & A. D. Preece (Eds.) Springer, Lecture Notes in Computer Science
    • Zelikovitz, S., & Hirsh, H. (2002). Integrating background knowledge into nearest-neighbor text classification. In S. Craw & A. D. Preece (Eds.), ECCBR (Vol. 2416, pp. 1-5). Springer, Lecture Notes in Computer Science.
    • (2002) ECCBR , vol.2416 , pp. 1-5
    • Zelikovitz, S.1    Hirsh, H.2
  • 26
    • 33746097784
    • Using web searches on important words to create background sets for LSI classification
    • G. Sutcliffe & R. Goebel (Eds.) AAAI Press
    • Zelikovitz, S., & Kogan, M. (2006). Using web searches on important words to create background sets for LSI classification. In G. Sutcliffe & R. Goebel (Eds.), FLAIRS Conference (pp. 598-603). AAAI Press.
    • (2006) FLAIRS Conference , pp. 598-603
    • Zelikovitz, S.1    Kogan, M.2
  • 27
    • 33646123296
    • Effective and scalable authorship attribution using function words
    • G. G. Lee, A. Yamada, H. Meng, & S. H. Myaeng (Eds.) Springer, Lecture Notes in Computer Science
    • Zhao, Y., & Zobel, J. (2005). Effective and scalable authorship attribution using function words. In G. G. Lee, A. Yamada, H. Meng, & S. H. Myaeng (Eds.), AIRS (Vol. 3689, pp. 174-189). Springer, Lecture Notes in Computer Science.
    • (2005) AIRS , vol.3689 , pp. 174-189
    • Zhao, Y.1    Zobel, J.2
  • 28
    • 33745456231
    • Tech. Rep. Computer Sciences, University of Wisconsin-Madison
    • Zhu, X. (2005). Semi-supervised learning literature survey. Tech. Rep. Computer Sciences, University of Wisconsin-Madison.
    • (2005) Semi-supervised Learning Literature Survey
    • Zhu, X.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.