2015, Pages 960-970

N-gram IDF: A global term weighting scheme based on information distance

Author keywords

IDF; Information Distance; Kolmogorov Complexity; MED; Multiword Expression; Term Weighting

Indexed keywords

INVERSE PROBLEMS

EID: 84968732930     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2736277.2741628     Document Type: Conference Paper
Times cited: 35

References (48)
  • 2
    • 0037213089
    • An information-theoretic perspective of tf-idf measures
    • A. Aizawa. An Information-Theoretic Perspective of TF-IDF Measures. Information Processing and Management, 39(1):45-65, Jan. 2003.
    • (2003) Information Processing and Management , vol.39 , Issue.1 , pp. 45-65
    • Aizawa, A.
  • 9
    • 84936824188
    • Word association norms, mutual information, and lexicography
    • K. W. Church and P. Hanks. Word Association Norms, Mutual Information, and Lexicography. Computational Linguistics, 16(1):22-29, Mar. 1990.
    • (1990) Computational Linguistics , vol.16 , Issue.1 , pp. 22-29
    • Church, K.W.; Hanks, P.
  • 12
    • 4944243732
    • A local maxima method and a fair dispersion normalization for extracting multi-word units from corpora
    • J. F. da Silva and J. G. P. Lopes. A Local Maxima Method and a Fair Dispersion Normalization for Extracting Multi-word Units from Corpora. In Proceedings of Meeting on Mathematics of Language (MOL), pages 369-381, July 1999.
    • (1999) Proceedings of Meeting on Mathematics of Language (MOL) , pp. 369-381
    • Da Silva, J.F.; Lopes, J.G.P.
  • 15
    • 84857913430
    • New algorithms on wavelet trees and applications to information retrieval
    • T. Gagie, G. Navarro, and S. J. Puglisi. New Algorithms on Wavelet Trees and Applications to Information Retrieval. Theoretical Computer Science, 426-427:25-41, Apr. 2012.
    • (2012) Theoretical Computer Science , vol.426-427 , pp. 25-41
    • Gagie, T.; Navarro, G.; Puglisi, S.J.
  • 18
    • 0016526522
    • A probabilistic approach to automatic keyword indexing. Part I. On the distribution of specialty words in a technical literature
    • S. P. Harter. A Probabilistic Approach to Automatic Keyword Indexing. Part I. On the Distribution of Specialty Words in a Technical Literature. Journal of the American Society for Information Science, 26(4):197-206, July/Aug. 1975.
    • (1975) Journal of the American Society for Information Science , vol.26 , Issue.4 , pp. 197-206
    • Harter, S.P.
  • 20
    • 0000600049
    • A probabilistic justification for using tf×idf term weighting in information retrieval
    • D. Hiemstra. A Probabilistic Justification for Using TF×IDF Term Weighting in Information Retrieval. International Journal on Digital Libraries, 3(2):131-139, Aug. 2000.
    • (2000) International Journal on Digital Libraries , vol.3 , Issue.2 , pp. 131-139
    • Hiemstra, D.
  • 21
    • 84953744816
    • A statistical interpretation of term specificity and its application in retrieval
    • K. S. Jones. A Statistical Interpretation of Term Specificity and its Application in Retrieval. Journal of Documentation, 28:11-21, 1972.
    • (1972) Journal of Documentation , vol.28 , pp. 11-21
    • Jones, K.S.
  • 22
    • 0007424433
    • On tables of random numbers
    • A. Kolmogorov. On Tables of Random Numbers. Sankhya Ser. A, 25:369-376, 1963.
    • (1963) Sankhya Ser. A , vol.25 , pp. 369-376
    • Kolmogorov, A.
  • 24
    • 84966611720
    • Irreversibility and heat generation in the computing process
    • R. Landauer. Irreversibility and Heat Generation in the Computing Process. IBM Journal of Research and Development, 5(3):183-191, July 1961.
    • (1961) IBM Journal of Research and Development , vol.5 , Issue.3 , pp. 183-191
    • Landauer, R.
  • 33
    • 84859887839
    • An extensive empirical study of collocation extraction methods
    • P. Pecina. An Extensive Empirical Study of Collocation Extraction Methods. In Proceedings of ACL Student Research Workshop, pages 13-18, June 2005.
    • (2005) Proceedings of ACL Student Research Workshop , pp. 13-18
    • Pecina, P.
  • 36
    • 8844253324
    • Understanding inverse document frequency: On theoretical arguments for IDF
    • S. Robertson. Understanding Inverse Document Frequency: On theoretical arguments for IDF. Journal of Documentation, 60(5):503-520, 2004.
    • (2004) Journal of Documentation , vol.60 , Issue.5 , pp. 503-520
    • Robertson, S.
  • 42
    • 0016572913
    • A vector space model for automatic indexing
    • G. Salton, A. Wong, and C.-S. Yang. A Vector Space Model for Automatic Indexing. Communications of the ACM, 18(11):613-620, 1975.
    • (1975) Communications of the ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.; Wong, A.; Yang, C.-S.
  • 47
    • 0038632285
    • Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
    • M. Yamamoto and K. W. Church. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus. Computational Linguistics, 27(1):1-30, Mar. 2001.
    • (2001) Computational Linguistics , vol.27 , Issue.1 , pp. 1-30
    • Yamamoto, M.; Church, K.W.
  • 48
    • 67349246433
    • Improving effectiveness of mutual information for substantival multiword expression extraction
    • W. Zhang, T. Yoshida, X. Tang, and T.-B. Ho. Improving Effectiveness of Mutual Information for Substantival Multiword Expression Extraction. Expert Systems with Applications, 36(8):10919-10930, Oct. 2009.
    • (2009) Expert Systems with Applications , vol.36 , Issue.8 , pp. 10919-10930
    • Zhang, W.; Yoshida, T.; Tang, X.; Ho, T.-B.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.