



Volume 12, Issue 5, 2009, Pages 509-525

The ineffectiveness of within-document term frequency in text classification

Author keywords

Bag of words; Within document frequency; Word burstiness


EID: 80052839148     PISSN: 13864564     EISSN: 15737659     Source Type: Journal    
DOI: 10.1007/s10791-008-9069-5     Document Type: Article
Times cited: 26

References (25)
  • 4
    • 33749257142
    • Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution
    • Pittsburgh, Pennsylvania: ACM Press
    • Elkan, C. (2006). Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution. In 23rd International Conference on Machine Learning. Pittsburgh, Pennsylvania: ACM Press.
    • (2006) 23rd International Conference On Machine Learning
    • Elkan, C.1
  • 14
    • 0141596527
    • Retrieved September 18, 2007, from
    • Minka, T. P. (2003). Estimating a Dirichlet distribution. Retrieved September 18, 2007, from https://research.microsoft.com/~minka/papers/dirichlet/
    • (2003) Estimating a Dirichlet Distribution
    • Minka, T.P.1
  • 18
    • 0003882234
    • Reading, Massachusetts: Addison-Wesley Publishing Company
    • Salton, G. (1989). Automatic text processing. Reading, Massachusetts: Addison-Wesley Publishing Company.
    • (1989) Automatic Text Processing
    • Salton, G.1
  • 21
    • 84893733030
    • United States National Library of Medicine, National Institutes of Health, Health & Human Services
    • National Library of Medicine (U.S.), Medical Subject Headings Section. (2004). MeSH tree structures [electronic resource]. United States National Library of Medicine, National Institutes of Health, Health & Human Services.
    • (2004) MeSH Tree Structures [Electronic Resource]
    • National Library of Medicine (U.S.), Medical Subject Headings Section
  • 22
    • 1542754043
    • Empirical development of an exponential probabilistic model for text retrieval: Using textual analysis to build a better model
    • Toronto, Canada
    • Teevan, J., & Karger, D. R. (2003). Empirical development of an exponential probabilistic model for text retrieval: Using textual analysis to build a better model. In ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada.
    • (2003) ACM SIGIR Conference On Research and Development In Information Retrieval
    • Teevan, J.1    Karger, D.R.2
  • 23
    • 0003990972
    • San Francisco: Morgan-Kaufmann Publishers, Inc
    • Witten, I. H., Moffat, A., et al. (1999). Managing gigabytes. San Francisco: Morgan-Kaufmann Publishers, Inc.
    • (1999) Managing Gigabytes
    • Witten, I.H.1    Moffat, A.2
  • 25
    • 0001868572
    • Text categorization based on regularized linear classification methods
    • doi:10.1023/A:1011441423217
    • Zhang, T., & Oles, F. J. (2001). Text categorization based on regularized linear classification methods. Information Retrieval, 4(1), 5-31. doi:10.1023/A:1011441423217.
    • (2001) Information Retrieval , vol.4 , Issue.1 , pp. 5-31
    • Zhang, T.1    Oles, F.J.2


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.