메뉴 건너뛰기




Volumn 4394 LNCS, Issue , 2007, Pages 611-622

Clustering narrow-domain short texts by using the Kullback-Leibler distance

Author keywords

[No Author keywords available]

Indexed keywords

DATA ACQUISITION; INFORMATION RETRIEVAL SYSTEMS; NUMERICAL METHODS; PROBABILITY DISTRIBUTIONS; PROBLEM SOLVING;

EID: 37149013306     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-70939-8_54     Document Type: Conference Paper
Times cited : (40)

References (27)
  • 1
    • 25144495542 scopus 로고    scopus 로고
    • An Approach to Clustering Abstracts
    • Proceedings of the 10th International Conference NLDB-05, of, Springer-Verlag
    • M. Alexandrov, A. Gelbukh, and P. Rosso: An Approach to Clustering Abstracts, In Proceedings of the 10th International Conference NLDB-05, volume 3513 of Lecture Notes in Computer Science, pages 275-285, Springer-Verlag, 2005.
    • (2005) Lecture Notes in Computer Science , vol.3513 , pp. 275-285
    • Alexandrov, M.1    Gelbukh, A.2    Rosso, P.3
  • 3
    • 35248874304 scopus 로고    scopus 로고
    • Vocabulary and Language Model Adaptation using Information Retrieval
    • Proceedings of the ECIR-2003, of, Springer-Verlag
    • B. Bigi, Y. Huang, R. d. Mori: Vocabulary and Language Model Adaptation using Information Retrieval, In Proceedings of the ECIR-2003, volume 2633 of Lecture Notes in Computer Science, pages 305-319, Springer-Verlag, 2003.
    • (2003) Lecture Notes in Computer Science , vol.2633 , pp. 305-319
    • Bigi, B.1    Huang, Y.2    Mori, R.D.3
  • 4
    • 35248874304 scopus 로고    scopus 로고
    • Using Kullback-Leibler Distance for Text Categorization
    • Proceedings of the ECIR-2003, of, Springer-Verlag
    • B. Bigi: Using Kullback-Leibler Distance for Text Categorization, In Proceedings of the ECIR-2003, volume 2633 of Lecture Notes in Computer Science, pages 305-319, Springer-Verlag, 2003.
    • (2003) Lecture Notes in Computer Science , vol.2633 , pp. 305-319
    • Bigi, B.1
  • 6
    • 5244360269 scopus 로고
    • A Law of Occurrences for Words of Low Frequency
    • A. D. Booth: A Law of Occurrences for Words of Low Frequency, Information and control, 10(4):386-393, 1967.
    • (1967) Information and control , vol.10 , Issue.4 , pp. 386-393
    • Booth, A.D.1
  • 7
    • 0000354976 scopus 로고
    • A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
    • P. Burman, A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods, Biometrika 76(3):503-514, 1989.
    • (1989) Biometrika , vol.76 , Issue.3 , pp. 503-514
    • Burman, P.1
  • 9
    • 0032633485 scopus 로고    scopus 로고
    • Similarity-based models of word cooccurrence probabilities
    • I. Dagan, L. Lee, F. Pereira: Similarity-based models of word cooccurrence probabilities, Machine Learning, 34(1-3):43-69, 1999.
    • (1999) Machine Learning , vol.34 , Issue.1-3 , pp. 43-69
    • Dagan, I.1    Lee, L.2    Pereira, F.3
  • 10
    • 5044248550 scopus 로고    scopus 로고
    • Jensen-Shannon Divergence and Hubert space embedding
    • Information Theory
    • B. Fuglede, F. Topse: Jensen-Shannon Divergence and Hubert space embedding, IEEE Int Sym. Information Theory, 2004.
    • (2004) IEEE Int Sym
    • Fuglede, B.1    Topse, F.2
  • 11
    • 37149039872 scopus 로고    scopus 로고
    • Uso del punto de transición en la selección de términos índice para agrupamiento de textos cortos, Procesamiento del Lenguaje Natural, 35
    • H. Jiménez, D. Pinto, and P. Rosso: Uso del punto de transición en la selección de términos índice para agrupamiento de textos cortos, Procesamiento del Lenguaje Natural, 35(1):114-118, 2005 (in Spanish).
    • (2005) Spanish) , vol.114-118
    • Jiménez, H.1    Pinto, D.2    Rosso, P.3
  • 12
    • 0014129195 scopus 로고
    • Hierarchical Clustering Schemes
    • S. C Johnson: Hierarchical Clustering Schemes, Psychometrika, 2:241-254, 1967.
    • (1967) Psychometrika , vol.2 , pp. 241-254
    • Johnson, S.C.1
  • 14
    • 1942516906 scopus 로고    scopus 로고
    • An evaluation on feature selection for text clustering, In T. Fawcett and N
    • AAAI Press
    • T. Liu, S. Liu, Z. Chen, and W. Ma: An evaluation on feature selection for text clustering, In T. Fawcett and N. Mishra, editors, ICML, pages 488-495, AAAI Press, 2003.
    • (2003) Mishra, editors, ICML , pp. 488-495
    • Liu, T.1    Liu, S.2    Chen, Z.3    Ma, W.4
  • 15
    • 22944482209 scopus 로고    scopus 로고
    • Clustering Abstracts instead of Full Texts
    • Proceedings of the Seventh International Conference on Text, Speech and Dialogue TSD, of, Springer-Verlag
    • P. Makagonov, M. Alexandrov, and A. Gelbukh: Clustering Abstracts instead of Full Texts, In Proceedings of the Seventh International Conference on Text, Speech and Dialogue (TSD 2004), volume 3206 of Lecture Notes in Artificial Intelligence, pages 129-135, Springer-Verlag, 2004.
    • (2004) Lecture Notes in Artificial Intelligence , vol.3206 , pp. 129-135
    • Makagonov, P.1    Alexandrov, M.2    Gelbukh, A.3
  • 19
    • 33745547773 scopus 로고    scopus 로고
    • D. Pinto, H. Jiménez-Salazar, and P. Rosso: Clustering abstracts of scientific texts using the transition point technique, In Alexander F. Gelbukh, editor, CICLing, 3878 of Lecture Notes in Computer Science, pages 536-546. Springer-Verlang, 2006.
    • D. Pinto, H. Jiménez-Salazar, and P. Rosso: Clustering abstracts of scientific texts using the transition point technique, In Alexander F. Gelbukh, editor, CICLing, volume 3878 of Lecture Notes in Computer Science, pages 536-546. Springer-Verlang, 2006.
  • 20
    • 37149028209 scopus 로고    scopus 로고
    • A Comparative Study of Clustering Algorithms on Narrow-Domain Abstracts
    • D. Pinto, P. Rosso, A. Juan, and H. Jiménez, : A Comparative Study of Clustering Algorithms on Narrow-Domain Abstracts, Procesamiento del Lenguaje Natural, 37(1):43-49, 2006.
    • (2006) Procesamiento del Lenguaje Natural , vol.37 , Issue.1 , pp. 43-49
    • Pinto, D.1    Rosso, P.2    Juan, A.3    Jiménez, H.4
  • 21
    • 37149025749 scopus 로고    scopus 로고
    • KnCr: A Short-Text Narrow-Domain Sub-Corpus of Medline
    • D. Pinto, and P. Rosso: KnCr: A Short-Text Narrow-Domain Sub-Corpus of Medline, In Proceedings of TLH-ENC06, pages 266-269, 2006.
    • (2006) Proceedings of TLH-ENC06 , pp. 266-269
    • Pinto, D.1    Rosso, P.2
  • 22
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • M. F. Porter: An algorithm for suffix stripping, In Program, 14(3), 1980.
    • (1980) In Program , vol.14 , Issue.3
    • Porter, M.F.1
  • 23
    • 84958045693 scopus 로고    scopus 로고
    • K. Shin and S. Y. Han: Fast clustering algorithm for information organization, In A. F. Gelbukh, editor, CICLing, 2588 of Lecture Notes in Computer Science, pages 619-622, Springer-Verlang, 2003.
    • K. Shin and S. Y. Han: Fast clustering algorithm for information organization, In A. F. Gelbukh, editor, CICLing, volume 2588 of Lecture Notes in Computer Science, pages 619-622, Springer-Verlang, 2003.
  • 24
    • 0004217877 scopus 로고
    • 2nd edition, Dept. of Computer Science, University of Glasgow
    • C. J. Van Rijsbergen: Information Retrieval, 2nd edition, Dept. of Computer Science, University of Glasgow, 1979.
    • (1979) Information Retrieval
    • Van Rijsbergen, C.J.1
  • 25
    • 0029180724 scopus 로고
    • Noise reduction in a statistical approach to text categorization
    • Y. Yang: Noise reduction in a statistical approach to text categorization, In Proceedings of SIGIR-ACM, pages 256-263, 1995.
    • (1995) Proceedings of SIGIR-ACM , pp. 256-263
    • Yang, Y.1
  • 26
    • 0003141935 scopus 로고    scopus 로고
    • A comparative study on feature selection in text categorization
    • Y. Yang , J. O. Pedersen. A comparative study on feature selection in text categorization. In Proc. ICML, pages 412-420, 1997.
    • (1997) Proc. ICML , pp. 412-420
    • Yang, Y.1    Pedersen, J.O.2
  • 27
    • 0027632406 scopus 로고
    • A measure of relative entropy between individual sequences with application to universal classification
    • J. Ziv and N. Merhav: A measure of relative entropy between individual sequences with application to universal classification, IEEE Transactions on Information Theory, 39(4):1270-1279, 1993.
    • (1993) IEEE Transactions on Information Theory , vol.39 , Issue.4 , pp. 1270-1279
    • Ziv, J.1    Merhav, N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.