Volume 8, Issue 2, 2009, Pages 249-265

Distribution of multi-words in Chinese and English documents

Author keywords

G distribution; Multi word; Poisson distribution; Term distribution; Zero inflated distribution

EID: 67749101286     PISSN: 02196220     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0219622009003399     Document Type: Article
Times cited: 6

References (25)
  • 1
    • 1542754043
    • Empirical development of an exponential probabilistic model for text retrieval
    • Toronto, Canada
    • J. Teevan and D. R. Karger, Empirical development of an exponential probabilistic model for text retrieval, in Proc. 26th Int. ACM SIGIR Conf. (Toronto, Canada, 2003), pp. 18-25.
    • (2003) Proc. 26th Int. ACM SIGIR Conf , pp. 18-25
    • Teevan, J.1    Karger, D.R.2
  • 2
    • 31844437086
    • Modeling word burstiness using the Dirichlet distribution
    • Bonn, Germany
    • R. E. Madsen, D. Kauchak and C. Elkan, Modeling word burstiness using the Dirichlet distribution, in Proc. 22nd Int. Conf. Machine Learning (Bonn, Germany, 2005), pp. 545-552.
    • (2005) Proc. 22nd Int. Conf. Machine Learning , pp. 545-552
    • Madsen, R.E.1    Kauchak, D.2    Elkan, C.3
  • 3
    • 36448955332
    • A study of Poisson query generation model for information retrieval
    • Amsterdam, The Netherlands
    • Q. Mei, H. Fang and C. Zhai, A study of Poisson query generation model for information retrieval, in Proc. 30th Int. ACM SIGIR Conf. (Amsterdam, The Netherlands, 2007), pp. 319-326.
    • (2007) Proc. 30th Int. ACM SIGIR Conf , pp. 319-326
    • Mei, Q.1    Fang, H.2    Zhai, C.3
  • 4
    • 58149235001
    • A descriptive framework for the field of data mining and knowledge discovery
    • Y. Peng, G. Kou, Y. Shi and Z. Chen, A descriptive framework for the field of data mining and knowledge discovery, Int. J. Inform. Technol. Decision Making 7(4) (2008).
    • (2008) Int. J. Inform. Technol. Decision Making , vol.7 , Issue.4
    • Peng, Y.1    Kou, G.2    Shi, Y.3    Chen, Z.4
  • 8
    • 54949149162
    • Text classification based on multi-word with support vector machine
    • in press
    • W. Zhang, T. Yoshida and X. J. Tang, Text classification based on multi-word with support vector machine, Knowl.-Based Syst., in press.
    • Knowl.-Based Syst
    • Zhang, W.1    Yoshida, T.2    Tang, X.J.3
  • 9
    • 26444438045
    • True reason for Zipf's law in language
    • D. Wang, M. Li and Z. Di, True reason for Zipf's law in language, Physica A 358 (2005) 545-550.
    • (2005) Physica A , vol.358 , pp. 545-550
    • Wang, D.1    Li, M.2    Di, Z.3
  • 10
    • 0013039398
    • Zipf's law and random texts
    • R. F. Cancho and R. V. Sole, Zipf's law and random texts, Adv. Complex Syst. 5(1) (2002) 1-6.
    • (2002) Adv. Complex Syst , vol.5 , Issue.1 , pp. 1-6
    • Cancho, R.F.1    Sole, R.V.2
  • 11
    • 0030092468
    • Distribution of content words and phrases in texts and language modeling
    • S. Katz, Distribution of content words and phrases in texts and language modeling, Nat. Language Eng. 2(1) (1996) 15-59.
    • (1996) Nat. Language Eng , vol.2 , Issue.1 , pp. 15-59
    • Katz, S.1
  • 13
    • 67749136370
    • Frequent term distribution measures for dataset profiling
    • Lisbon, Portugal
    • A. D. Roeck, A. Sarkar and P. Garthwaite, Frequent term distribution measures for dataset profiling, in Proc. 4th Int. Conf. Language Resources and Evaluation (Lisbon, Portugal, 2004), pp. 30-37.
    • (2004) Proc. 4th Int. Conf. Language Resources and Evaluation , pp. 30-37
    • Roeck, A.D.1    Sarkar, A.2    Garthwaite, P.3
  • 17
    • 34548696880
    • Text classification toward a scientific forum
    • W. Zhang, X. J. Tang and T. Yoshida, Text classification toward a scientific forum, J. Syst. Sci. Syst. Eng. 16(3) (2007) 356-369.
    • (2007) J. Syst. Sci. Syst. Eng , vol.16 , Issue.3 , pp. 356-369
    • Zhang, W.1    Tang, X.J.2    Yoshida, T.3
  • 18
    • 1042268329
    • Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
    • Montreal, Canada
    • M. Yamamoto and K. W. Church, Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus, in Proc. 6th Workshop on Very Large Corpora (Montreal, Canada, 1998), pp. 285-313.
    • (1998) Proc. 6th Workshop on Very Large Corpora , pp. 285-313
    • Yamamoto, M.1    Church, K.W.2
  • 20
    • 84936824188
    • Word association norms, mutual information, and lexicography
    • K. W. Church and P. Hanks, Word association norms, mutual information, and lexicography, Comput. Linguist. 16(1) (1990) 22-29.
    • (1990) Comput. Linguist , vol.16 , Issue.1 , pp. 22-29
    • Church, K.W.1    Hanks, P.2
  • 21
    • 84974295346
    • Technical terminology: Some linguistic properties and an algorithm for identification in text
    • J. S. Justeson and S. M. Katz, Technical terminology: Some linguistic properties and an algorithm for identification in text, Nat. Language Eng. 1(1) (1995) 9-27.
    • (1995) Nat. Language Eng , vol.1 , Issue.1 , pp. 9-27
    • Justeson, J.S.1    Katz, S.M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.