Volume 33, Issue 3, 2007, Pages 600-605

A fuzzy clustering approach for finding similar documents using a novel similarity measure

Author keywords

Distance based similarity; Document similarity; Fuzzy clustering; Fuzzy similarity measure; Text mining

Indexed keywords

FEATURE EXTRACTION; INFORMATION RETRIEVAL SYSTEMS; ONLINE SEARCHING; TEXT PROCESSING

EID: 33847660060     PISSN: 09574174     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.eswa.2006.06.002     Document Type: Article
Times cited: (49)

References (29)
  • 1
    • 33847660695
    • Apte, C., Damerau, F., & Weiss, S. (1998). Text mining with decision rules and decision trees. In Proceedings of the conference on automated learning and discovery, CMU.
  • 2
    • 33847643901
    • Clarke, C. L. A., Cormack, G. V., Kisman, D. I. E., & Lynam, T. R. (2000). Question answering by passage selection. In The ninth text retrieval conference, Gaithersburg.
  • 4
    • 33847659847
    • Dumais, S., Platt, J., Heckerman, D., & Sahami, M. (1998). Inductive learning algorithms and representations for text categorization. In Proceedings of the 1998 ACM 7th international conference on information and knowledge management (pp. 148-155).
  • 5
    • 33847679228
    • Elworthy, D. (2000). Question answering using a large NLP system. In The ninth text retrieval conference, Gaithersburg.
  • 6
    • 0003690475
    • Guzman, A. (1998). Finding the main themes in a Spanish document. Expert Systems with Applications, 14, 139-148.
  • 10
    • 33847656396
    • Joachims, T. (1997). Probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization. In Proceedings of the international conference on machine learning (ICML'97) (pp. 143-151).
  • 12
    • 84916622253
    • Kou, H., & Gardarin, G. (2002). Similarity model and term association for document categorization. In Proceedings of the 13th international workshop on database and expert systems applications (DEXA'02).
  • 14
    • 0026986166
    • Masand, B., Linoff, G., & Waltz, D. (1992). Classifying news stories using memory based reasoning. In Proceedings of the 15th annual ACM/SIGIR conference on research and development in information retrieval (pp. 59-65).
  • 16
    • 0036346203
    • Miyamoto, S. (2001). Fuzzy multisets and fuzzy clustering of documents. In Proceedings of the IEEE International Conference on Fuzzy Systems, FUZZ-IEEE.
  • 17
    • 84994334226
    • Murata, M., Ma, Q., Uchimoto, K., Ozaku, H., Utiyama, M., & Isahara, H. (2000). Japanese probabilistic information retrieval using location and category information. In Proceedings of the fifth international workshop on information retrieval with Asian language.
  • 18
  • 19
    • 33847615431
    • Sahami, M., Dumais, S., Heckerman, D., & Horvitz, E. (1998). A Bayesian approach to filtering junk e-mail. In AAAI 98, workshops on text categorization.
  • 20
    • 17844387127
    • Tan, S. (2005). Neighbor-weighted K-nearest neighbor for unbalanced text corpus. Expert Systems with Applications, 28, 667-671.
  • 21
    • 28544439958
    • Tan, S. (2005). An effective refinement strategy for KNN text classifier. Expert Systems with Applications, 30, 290-298.
  • 23
    • 0027718886
    • Tzeras, K., & Hartmann, S. (1993). Automatic indexing based on Bayesian inference networks. In Proceedings of the 16th annual ACM/SIGIR conference on research and development in information retrieval (pp. 22-34).
  • 24
    • 0141887128
    • Weng, S. S., & Lin, Y. J. (2003). A study on searching for similar documents based on multiple concepts and distribution of concepts. Expert Systems with Applications, 25(3), 355-368.
  • 25
    • 1442335145
    • Weng, S. S., & Liu, C. K. (2004). Using text classification and multiple concepts to answer e-mails. Expert Systems with Applications, 26(4), 529-543.
  • 27
    • 33847635012
    • Wiener, E., Pedersen, J., & Weigend, A. (1995). A neural network approach to topic spotting. In Fourth annual symposium on document analysis and information retrieval.
  • 28
    • 4544318080
    • Yang, H. C., & Lee, C. H. (2004). A text mining approach on automatic generation of web directories and hierarchies. Expert Systems with Applications, 27, 645-663.
  • 29
    • 25844442530
    • Yang, H. C., & Lee, C. H. (2005). A text mining approach on automatic construction of hypertexts. Expert Systems with Applications, 29, 723-734.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.