|
Volumn 2006, Issue , 2006, Pages 284-291
|
Finding near-duplicate web pages: A large-scale evaluation of algorithms
a
EPFL
(Switzerland)
|
Author keywords
Content duplication; Near duplicate documents; Web pages
|
Indexed keywords
CONTENT DUPLICATION;
NEAR-DUPLICATE DOCUMENTS;
CONTENT DUPLICATIONS;
NEAR DUPLICATE DOCUMENTS;
PRECISIONS;
SHINGLING ALGORITHMS;
ALGORITHMS;
ELECTRONIC DOCUMENT EXCHANGE;
SEARCH ENGINES;
DATA STRUCTURES;
LARGE SCALE SYSTEMS;
RANDOM PROCESSES;
WEBSITES;
|
EID: 33750296887
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (365)
|
References (15)
|