메뉴 건너뛰기




Volumn 20, Issue 2, 2002, Pages 171-191

Collection statistics for fast duplicate document detection

Author keywords

[No Author keywords available]

Indexed keywords

COLLECTION STATISTICS; DATA COLLECTION; WEB DOCUMENTS;

EID: 0013206133     PISSN: 10468188     EISSN: 10468188     Source Type: Journal    
DOI: 10.1145/506309.506311     Document Type: Article
Times cited : (199)

References (24)
  • 6
    • 0010248759 scopus 로고    scopus 로고
    • Efficiency considerations in very large information retrieval servers
    • FRIEDER, O., GROSSMAN, D., CHOWDHURY, A., AND FRIEDER, G. 2000. Efficiency considerations in very large information retrieval servers. J. Dig. Inf. 1, 5 (Apr).
    • (2000) J. Dig. Inf. , vol.1 , Issue.5 APR
    • Frieder, O.1    Grossman, D.2    Chowdhury, A.3    Frieder, G.4
  • 7
    • 0001186389 scopus 로고    scopus 로고
    • Accessibility and distribution of information on the web
    • GILES, L. AND LAWRENCE, S. 1999. Accessibility and distribution of information on the web. Nature 400, 107-109.
    • (1999) Nature , vol.400 , pp. 107-109
    • Giles, L.1    Lawrence, S.2
  • 10
    • 12244278769 scopus 로고
    • Discrimination of authorship using visualization
    • Pergamon Press
    • KJELL, B., WOODS, W., AND FRIEDER, O. 1994. Discrimination of authorship using visualization. Information Processing and Management. Pergamon Press 30, 1 (Jan), 141-150.
    • (1994) Information Processing and Management , vol.30 , Issue.1 JAN , pp. 141-150
    • Kjell, B.1    Woods, W.2    Frieder, O.3
  • 12
    • 0032478628 scopus 로고    scopus 로고
    • Searching the world wide web
    • LAWRENCE, S. AND GILES, C. L. 1998. Searching the World Wide Web. Science. 280, 5360, 98-100.
    • (1998) Science , vol.280 , Issue.5360 , pp. 98-100
    • Lawrence, S.1    Giles, C.L.2
  • 13
    • 84859686132 scopus 로고    scopus 로고
    • NIH, National Center for Complementary and Alternative Medicine (NCCAM), April 12, 2000
    • NIH. 2000. http://nccam.nih.gov/, The National Institutes of Health (NIH), National Center for Complementary and Alternative Medicine (NCCAM), April 12, 2000.
    • (2000)
  • 14
    • 0003629991 scopus 로고
    • NIST, U.S. Department of Commerce/National Institute of Standards and Technology, FIPS PUB 180-1 (April 17)
    • NIST. 1995. Secure Hash Standard, U.S. Department of Commerce/National Institute of Standards and Technology, FIPS PUB 180-1 (April 17).
    • (1995) Secure Hash Standard
  • 15
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • PORTER, M. 1980. An algorithm for suffix stripping. Program 14, 3, 130-137.
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.1
  • 18
    • 0016572913 scopus 로고
    • A vector-space model for information retrieval
    • SALTON, G., YANG, C. S., AND WONG, A. 1975. A vector-space model for information retrieval. C. ACM. ASIS. 18, 11, 613-620.
    • (1975) C. ACM. ASIS. , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Yang, C.S.2    Wong, A.3
  • 23
    • 84859697676 scopus 로고    scopus 로고
    • January 19
    • SMART FTP site: 2000. ftp://ftp.cs.cornell.edu/pub/smart/. January 19.
    • (2000)
  • 24
    • 33750419792 scopus 로고    scopus 로고
    • Ad-hoc retrieval using thresholds, WSTs for French monolingual retrieval, Document-at-a-Glance for high precision and triphone windows for spoken documents
    • Gathersburg, Maryland
    • SMEATON, A., KELLEDY, F., AND QUINN, G. 1997. Ad-hoc retrieval using thresholds, WSTs for French monolingual retrieval, Document-at-a-Glance for high precision and triphone windows for spoken documents. In Proceedings of the Sixth Text Retrieval Conference (TREC-6, Gathersburg, Maryland).
    • (1997) Proceedings of the Sixth Text Retrieval Conference (TREC-6)
    • Smeaton, A.1    Kelledy, F.2    Quinn, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.