메뉴 건너뛰기




Volumn 20, Issue 9, 2008, Pages 1217-1229

Efficient phrase-based document similarity for clustering

Author keywords

Document clustering; Document model; Similarity measure; Suffix tree

Indexed keywords

BENCHMARKING; CLUSTER ANALYSIS; FLOW OF SOLIDS; IMAGE RETRIEVAL; INFORMATION RETRIEVAL; LEARNING SYSTEMS; STANDARDS; VECTORS;

EID: 50649091328     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2008.50     Document Type: Article
Times cited : (128)

References (29)
  • 2
    • 0010226876 scopus 로고
    • Organizing and Searching Large Files of Documents,
    • PhD dissertation, Univ. of Cambridge
    • W.B. Croft, "Organizing and Searching Large Files of Documents," PhD dissertation, Univ. of Cambridge, 1978.
    • (1978)
    • Croft, W.B.1
  • 3
    • 0016572913 scopus 로고
    • A Vector Space Model for Automatic Indexing
    • G. Salton, A. Wong, and C.S. Yang, "A Vector Space Model for Automatic Indexing," Comm. ACM, vol. 18, no. 11, pp. 613-620, 1975.
    • (1975) Comm. ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.S.3
  • 5
    • 0033294891 scopus 로고    scopus 로고
    • Grouper: A Dynamic Clustering Interface to Web Search Results
    • O. Zamir and O. Etzioni, "Grouper: A Dynamic Clustering Interface to Web Search Results," Computer Networks, vol. 31, nos. 11-16, pp. 1361-1374, 1999.
    • (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1361-1374
    • Zamir, O.1    Etzioni, O.2
  • 7
    • 0038632285 scopus 로고    scopus 로고
    • Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus
    • M. Yamamoto and K.W. Church, "Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus," Computational Linguistics, vol. 27, no. 1, pp. 1-30, 2001.
    • (2001) Computational Linguistics , vol.27 , Issue.1 , pp. 1-30
    • Yamamoto, M.1    Church, K.W.2
  • 8
    • 0027681165 scopus 로고
    • Suffix Arrays: A New Method for On-Line String Searches
    • U. Manber and G. Myers, "Suffix Arrays: A New Method for On-Line String Searches," SIAM J. Computing, vol. 22, no. 5, pp. 935-948, 1993.
    • (1993) SIAM J. Computing , vol.22 , Issue.5 , pp. 935-948
    • Manber, U.1    Myers, G.2
  • 9
    • 13844267502 scopus 로고    scopus 로고
    • Efficient Phrase-Based Document Indexing for Web Document Clustering
    • Oct
    • K.M. Hammouda and M.S. Kamel, "Efficient Phrase-Based Document Indexing for Web Document Clustering," IEEE Trans. Knowledge and Data Eng., vol. 16, no. 10, pp. 1279-1296, Oct. 2004.
    • (2004) IEEE Trans. Knowledge and Data Eng , vol.16 , Issue.10 , pp. 1279-1296
    • Hammouda, K.M.1    Kamel, M.S.2
  • 12
    • 50649106294 scopus 로고    scopus 로고
    • D.S. Sven Meyer zu Eissen and M. Potthast, The Suffix Tree Document Model Revisited, Proc. Fifth Int'l Conf. Knowledge Management (I-Know '05), pp. 596-603, 2005.
    • D.S. Sven Meyer zu Eissen and M. Potthast, "The Suffix Tree Document Model Revisited," Proc. Fifth Int'l Conf. Knowledge Management (I-Know '05), pp. 596-603, 2005.
  • 13
    • 84876811202 scopus 로고    scopus 로고
    • RCV1: A New Benchmark Collection for Text Categorization Research
    • D.D. Lewis, Y. Yang, and F. Li, "RCV1: A New Benchmark Collection for Text Categorization Research," J. Machine Learning Research, vol. 5, pp. 361-397, 2004.
    • (2004) J. Machine Learning Research , vol.5 , pp. 361-397
    • Lewis, D.D.1    Yang, Y.2    Li, F.3
  • 15
    • 3543147086 scopus 로고
    • Recent Trends in Hierarchic Document Clustering: A Critical Review
    • P. Willett, "Recent Trends in Hierarchic Document Clustering: A Critical Review," Information Processing and Management, vol. 24, no. 5, pp. 577-597, 1988.
    • (1988) Information Processing and Management , vol.24 , Issue.5 , pp. 577-597
    • Willett, P.1
  • 20
    • 0024050355 scopus 로고
    • A New Distance Metric on Strings Computable in Linear Time
    • A. Ehrenfeucht and D. Haussier, "A New Distance Metric on Strings Computable in Linear Time," Discrete Applied Math., vol. 40, 1988.
    • (1988) Discrete Applied Math , vol.40
    • Ehrenfeucht, A.1    Haussier, D.2
  • 21
    • 33749028201 scopus 로고    scopus 로고
    • A Suffix Tree Approach to Anti-Spam Email Filtering
    • B.M. Rajesh Pampapathi and M. Levene, "A Suffix Tree Approach to Anti-Spam Email Filtering," Machine Learning, vol. 65, 2006.
    • (2006) Machine Learning , vol.65
    • Rajesh Pampapathi, B.M.1    Levene, M.2
  • 24
    • 0001704377 scopus 로고
    • On-Line Construction of Suffix Trees
    • E. Ukkonen, "On-Line Construction of Suffix Trees," Algorithmica, vol. 14, no. 3, pp. 249-260, 1995.
    • (1995) Algorithmica , vol.14 , Issue.3 , pp. 249-260
    • Ukkonen, E.1
  • 25
    • 0001553729 scopus 로고    scopus 로고
    • From Ukkonen to McCreight and Weiner: A Unifying View of Linear-Time Suffix Tree Construction
    • R. Giegerich and S. Kurtz, "From Ukkonen to McCreight and Weiner: A Unifying View of Linear-Time Suffix Tree Construction," Algorithmica, vol. 19, no. 3, pp. 331-353, 1997.
    • (1997) Algorithmica , vol.19 , Issue.3 , pp. 331-353
    • Giegerich, R.1    Kurtz, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.