메뉴 건너뛰기




Volumn 39, Issue 1, 2003, Pages 45-65

An information-theoretic perspective of tf-idf measures

Author keywords

Information theory; Term weighting theories; Text categorization; tf idf

Indexed keywords

COMPUTATIONAL METHODS; INFORMATION THEORY; PROBABILITY; TEXT PROCESSING;

EID: 0037213089     PISSN: 03064573     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0306-4573(02)00021-3     Document Type: Article
Times cited : (1058)

References (45)
  • 3
    • 0012026398 scopus 로고    scopus 로고
    • Semantic information retrieval
    • F. Crestani, M. Lalmas, & C.J. Van Rijsbergen (Eds.), Boston: Kluwer Academic Press
    • Amati G., Van Rijsbergen K. Semantic information retrieval. Crestani F., Lalmas M., Van Rijsbergen C.J. Information retrieval: uncertainty and logics. 1998;189-219 Kluwer Academic Press, Boston.
    • (1998) Information retrieval: Uncertainty and logics , pp. 189-219
    • Amati, G.1    Van Rijsbergen, K.2
  • 6
    • 0012083369 scopus 로고
    • The Shannon model of IR systems
    • Brookes B.C. The Shannon model of IR systems. Journal of Documentation. 28:1972;160-162.
    • (1972) Journal of Documentation , vol.28 , pp. 160-162
    • Brookes, B.C.1
  • 7
    • 4243442188 scopus 로고    scopus 로고
    • Inverse document frequency (IDF): A measure of deviations from Poisson
    • Boston: Kluwer Academic Press
    • Church K.W., Gale W. Inverse document frequency (IDF): a measure of deviations from Poisson. Natural language processing using very large corpora. 1999;283-295 Kluwer Academic Press, Boston.
    • (1999) Natural language processing using very large corpora , pp. 283-295
    • Church, K.W.1    Gale, W.2
  • 8
    • 84936824188 scopus 로고
    • Word association norms, mutual information and lexicography
    • Church K.W., Hanks P. Word association norms, mutual information and lexicography. Computational Linguistics. 6(1):1990;22-29.
    • (1990) Computational Linguistics , vol.6 , Issue.1 , pp. 22-29
    • Church, K.W.1    Hanks, P.2
  • 10
    • 0002028294 scopus 로고    scopus 로고
    • Exploiting the similarity of non-matching terms at retrieval time
    • Crestani F. Exploiting the similarity of non-matching terms at retrieval time. Journal of Information Retrieval. 2(1):2000;23-43.
    • (2000) Journal of Information Retrieval , vol.2 , Issue.1 , pp. 23-43
    • Crestani, F.1
  • 11
    • 0018711255 scopus 로고
    • Using probabilistic models of document retrieval without relevance information
    • Croft W.B., Harper D.J. Using probabilistic models of document retrieval without relevance information. Journal of Documentation. 35(4):1979;285-295.
    • (1979) Journal of Documentation , vol.35 , Issue.4 , pp. 285-295
    • Croft, W.B.1    Harper, D.J.2
  • 12
    • 84888616628 scopus 로고
    • The construction of a thesaurus automatically from a sample of text
    • M.E. Stevens, V.E. Giuliano, & L.B. Heilprin (Eds.), (Miscellaneous publication 269). Washington, DC: National Bureau of Standards
    • Dennis S.F. The construction of a thesaurus automatically from a sample of text. Stevens M.E., Giuliano V.E., Heilprin L.B. Statistical association methods for mechanized documentation, symposium proceedings (Miscellaneous publication 269). 1964;National Bureau of Standards, Washington, DC.
    • (1964) Statistical association methods for mechanized documentation, symposium proceedings
    • Dennis, S.F.1
  • 13
    • 85055298348 scopus 로고
    • Accurate methods for the statistics of surprise and coincidence
    • Dunning T. Accurate methods for the statistics of surprise and coincidence. Computational Linguistics. 19(1):1993;61-74.
    • (1993) Computational Linguistics , vol.19 , Issue.1 , pp. 61-74
    • Dunning, T.1
  • 14
    • 0024868803 scopus 로고
    • Models for retrieval with probabilistic indexing
    • Fuhr N. Models for retrieval with probabilistic indexing. Information Processing and Management. 25(1):1989;55-72.
    • (1989) Information Processing and Management , vol.25 , Issue.1 , pp. 55-72
    • Fuhr, N.1
  • 15
    • 0030678849 scopus 로고    scopus 로고
    • A technical word and term translation aid using noisy parallel corpora across language groups
    • Fung P., McKeown K. A technical word and term translation aid using noisy parallel corpora across language groups. The Machine Translation Journal. 12(1-2):1996;53-87.
    • (1996) The Machine Translation Journal , vol.12 , Issue.1-2 , pp. 53-87
    • Fung, P.1    McKeown, K.2
  • 18
    • 0000600049 scopus 로고    scopus 로고
    • A probabilistic justification for using tf×idf term weighting in information retrieval
    • Hiemstra D. A probabilistic justification for using. tf×idf term weighting in information retrieval International Journal on Digital Libraries. 3(2):2000;131-139.
    • (2000) International Journal on Digital Libraries , vol.3 , Issue.2 , pp. 131-139
    • Hiemstra, D.1
  • 20
    • 84989380187 scopus 로고    scopus 로고
    • Methods of automatic term recognition: A review
    • Kageura K., Umino B. Methods of automatic term recognition: a review. Terminology. 3(2):1996;259-289.
    • (1996) Terminology , vol.3 , Issue.2 , pp. 259-289
    • Kageura, K.1    Umino, B.2
  • 26
    • 0000159640 scopus 로고
    • A statistical approach to mechanized encoding and searching of literary information
    • Luhn H.P. A statistical approach to mechanized encoding and searching of literary information. IBM Journal of Research and Development. 1(4):1957;309-317.
    • (1957) IBM Journal of Research and Development , vol.1 , Issue.4 , pp. 309-317
    • Luhn, H.P.1
  • 31
    • 0012057428 scopus 로고
    • An automated method for the extraction of important words from Japanese scientific documents
    • Nagao M., Mizutani M., Ikeda H. An automated method for the extraction of important words from Japanese scientific documents. Transactions of Information Processing Society of Japan. 17(2):1976;110-117.
    • (1976) Transactions of Information Processing Society of Japan , vol.17 , Issue.2 , pp. 110-117
    • Nagao, M.1    Mizutani, M.2    Ikeda, H.3
  • 33
    • 0001737422 scopus 로고
    • On term selection for query expansion
    • Robertson S.E. On term selection for query expansion. Journal of Documentation. 46(4):1990;359-364.
    • (1990) Journal of Documentation , vol.46 , Issue.4 , pp. 359-364
    • Robertson, S.E.1
  • 34
    • 21844486560 scopus 로고
    • Query-document symmetry and dual models
    • Robertson S.E. Query-document symmetry and dual models. Journal of Documentation. 50(3):1994;233-238.
    • (1994) Journal of Documentation , vol.50 , Issue.3 , pp. 233-238
    • Robertson, S.E.1
  • 39
    • 84974675567 scopus 로고
    • Retrieving collocations from text: Xtract
    • Smadja F. Retrieving collocations from text: Xtract. Computational Linguistics. 19(1):1993;143-178.
    • (1993) Computational Linguistics , vol.19 , Issue.1 , pp. 143-178
    • Smadja, F.1
  • 40
    • 84953744816 scopus 로고
    • A statistical interpretation of term specificity and its application in retrieval
    • Sparck-Jones K. A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation. 28(1):1972;11-21.
    • (1972) Journal of Documentation , vol.28 , Issue.1 , pp. 11-21
    • Sparck-Jones, K.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.