메뉴 건너뛰기




Volumn 5012 LNAI, Issue , 2008, Pages 877-883

Detecting near-duplicates in large-scale short text databases

Author keywords

Duplicate detection; Optimization; Short text; Term weighting

Indexed keywords

AD HOC NETWORKS; ALGORITHMS; COMPUTATIONAL COMPLEXITY; LINEAR PROGRAMMING; OPTIMIZATION; TEXT PROCESSING;

EID: 44649164477     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-68125-0_87     Document Type: Conference Paper
Times cited : (20)

References (14)
  • 2
    • 44649176793 scopus 로고    scopus 로고
    • Hu, J.X.: Message text clustering based on frequent patterns (In. Chinese). M.S. thesis, Institute of Computing Technology, Chinese Academy of Sciences. Beijing, China (2006)
    • Hu, J.X.: Message text clustering based on frequent patterns (In. Chinese). M.S. thesis, Institute of Computing Technology, Chinese Academy of Sciences. Beijing, China (2006)
  • 5
    • 44649183165 scopus 로고    scopus 로고
    • Lyon, C., Barrett, R., Malcolm, J.: A theoretical basis to the automated detection of copying between texts, and its practical implementation in the Ferret plagiarism and collusion detector. In: Plagiarism: Prevention, Practice and Policies Conference (June 2004)
    • Lyon, C., Barrett, R., Malcolm, J.: A theoretical basis to the automated detection of copying between texts, and its practical implementation in the Ferret plagiarism and collusion detector. In: Plagiarism: Prevention, Practice and Policies Conference (June 2004)
  • 11
    • 77956142012 scopus 로고    scopus 로고
    • An n-gram-based approach for detecting approximately duplicate database records
    • Tian, Z.P., Lu, H.J., Ji, W.Y., et al.: An n-gram-based approach for detecting approximately duplicate database records. International Journal on Digital Libraries 5(3), 325-331 (2001)
    • (2001) International Journal on Digital Libraries , vol.5 , Issue.3 , pp. 325-331
    • Tian, Z.P.1    Lu, H.J.2    Ji, W.Y.3
  • 13
  • 14
    • 45549117987 scopus 로고
    • Term weighting approaches in automatic text retrieval. Information Processing and Management
    • Salton, G., Buckley, C.: Term weighting approaches in automatic text retrieval. Information Processing and Management: an International Journal 24(5), 513-523 (1988)
    • (1988) an International Journal , vol.24 , Issue.5 , pp. 513-523
    • Salton, G.1    Buckley, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.