Volume , Issue , 2008, Pages 601-609

New issues in near-duplicate detection

Author keywords

[No Author keywords available]

Indexed keywords

CLASSIFICATION (OF INFORMATION); DATA HANDLING; DIGITAL STORAGE; MACHINE LEARNING;

EID: 84867666907     PISSN: 14318814     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/978-3-540-78246-9_71     Document Type: Conference Paper
Times cited: 13

References (23)
  • 1
    • 33646126481
    • A scalable system for identifying co-derivative documents
    • BERNSTEIN, Y. and ZOBEL, J. (2004): A scalable system for identifying co-derivative documents, Proc. of SPIRE '04.
    • (2004) Proc. of SPIRE '04
    • Bernstein, Y.1    Zobel, J.2
  • 3
    • 4944224800
    • Identifying and filtering near-duplicate documents
    • BRODER, A. (2000): Identifying and filtering near-duplicate documents, Proc. of CPM '00.
    • (2000) Proc. of CPM '00
    • Broder, A.1
  • 5
    • 0037844312
    • Similarity estimation techniques from rounding algorithms
    • CHARIKAR, M. (2002): Similarity Estimation Techniques from Rounding Algorithms, Proc. of STOC '02.
    • (2002) Proc. of STOC '02
    • Charikar, M.1
  • 7
    • 12244271239
    • Online duplicate document detection: Signature reliability in a dynamic retrieval environment
    • CONRAD, J., GUO, X. and SCHRIBER, C. (2003): Online duplicate document detection: signature reliability in a dynamic retrieval environment, Proc. of CIKM '03.
    • (2003) Proc. of CIKM '03
    • Conrad, J.1    Guo, X.2    Schriber, C.3
  • 8
    • 8644227073
    • Constructing a text corpus for inexact duplicate detection
    • CONRAD, J. and SCHRIBER, C. (2004): Constructing a text corpus for inexact duplicate detection, Proc. of SIGIR '04.
    • (2004) Proc. of SIGIR '04
    • Conrad, J.1    Schriber, C.2
  • 13
    • 33750296887
    • Finding near-duplicate web pages: A large-scale evaluation of algorithms
    • HENZINGER, M. (2006): Finding Near-Duplicate Web Pages: a Large-Scale Evaluation of Algorithms, Proc. of SIGIR '06.
    • (2006) Proc. of SIGIR '06
    • Henzinger, M.1
  • 14
    • 0037319544
    • Methods for identifying versioned and plagiarised documents
    • HOAD, T. and ZOBEL, J. (2003): Methods for Identifying Versioned and Plagiarised Documents, Jour. of ASIST, 54.
    • (2003) Jour. of ASIST , vol.54
    • Hoad, T.1    Zobel, J.2
  • 15
    • 0001907042
    • Approximate nearest neighbor-towards removing the curse of dimensionality
    • INDYK, P. and MOTWANI, R. (1998): Approximate Nearest Neighbor-Towards Removing the Curse of Dimensionality, Proc. of STOC '98.
    • (1998) Proc. of STOC '98
    • Indyk, P.1    Motwani, R.2
  • 16
    • 12244261882
    • Improved robustness of signature-based near-replica detection via lexicon randomization
    • KOŁCZ, A., CHOWDHURY, A. and ALSPECTOR, J. (2004): Improved robustness of signature-based near-replica detection via lexicon randomization, Proc. of KDD '04.
    • (2004) Proc. of KDD '04
    • Kołcz, A.1    Chowdhury, A.2    Alspector, J.3
  • 17
    • 85043988965
    • Finding similar files in a large file system
    • MANBER, U. (1994): Finding similar files in a large file system, Proc. of USENIX-TC '94.
    • (1994) Proc. of USENIX-TC '94
    • Manber, U.1
  • 19
    • 36448989077
    • Fuzzy-fingerprints for text-based information retrieval
    • STEIN, B. (2005): Fuzzy-Fingerprints for Text-based Information Retrieval, Proc. of I-KNOW '05.
    • (2005) Proc. of I-KNOW '05
    • Stein, B.1
  • 20
    • 36448954599
    • Principles of hash-based text retrieval
    • STEIN, B. (2007): Principles of Hash-based Text Retrieval, Proc. of SIGIR '07.
    • (2007) Proc. of SIGIR '07
    • Stein, B.1
  • 21
    • 0000681228
    • A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces
    • WEBER, R., SCHEK, H. and BLOTT, S. (1998): A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces, Proc. of VLDB '98.
    • (1998) Proc. of VLDB '98
    • Weber, R.1    Schek, H.2    Blott, S.3
  • 22
    • 84879583318
    • A systematic study of parameter correlations in large scale duplicate document detection
    • YE, S., WEN, J. and MA, W. (2006): A Systematic Study of Parameter Correlations in Large Scale Duplicate Document Detection, Proc. of PAKDD '06.
    • (2006) Proc. of PAKDD '06
    • Ye, S.1    Wen, J.2    Ma, W.3
  • 23
    • 84879585107
    • The case of the duplicate documents: Measurement, search, and science
    • ZOBEL, J. and BERNSTEIN, Y. (2006): The case of the duplicate documents: Measurement, search, and science, Proc. of APWeb '06.
    • (2006) Proc. of APWeb '06
    • Zobel, J.1    Bernstein, Y.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.