|
Volume 20, Issue 1, 2010, Pages 152-187
|
An incremental clustering scheme for data de-duplication
|
Author keywords
Approximated similarity measures; Clustering; Mining methods and algorithms; De-duplication; Indexing methods and structures; Locality sensitive hashing; Min-wise independent permutations; Record classification
|
Indexed keywords
INDEXING METHODS;
LOCALITY SENSITIVE HASHING;
MIN-WISE INDEPENDENT PERMUTATIONS;
MINING METHODS AND ALGORITHMS;
SIMILARITY MEASURE;
CONTENT BASED RETRIEVAL;
MINING;
INDEXING (OF INFORMATION);
|
EID: 76749114248
PISSN: 13845810
EISSN: None
Source Type: Journal
DOI: 10.1007/s10618-009-0155-0
Document Type: Article
|
Times cited: 36
|
References (47)
|