메뉴 건너뛰기




Volumn 7, Issue 12, 2014, Pages 1059-1070

ClusterJoin: A similarity joins framework using Map-reduce

Author keywords

[No Author keywords available]

Indexed keywords

DATA DISTRIBUTION; DISTANCE FUNCTIONS; DYNAMIC LOAD BALANCING SCHEMES; EXPERIMENTAL EVALUATION; HIGH DIMENSIONAL DATA; METRIC DISTANCES; PROBABILISTIC GUARANTEES; SIMILARITY SCORES;

EID: 84905106551     PISSN: None     EISSN: 21508097     Source Type: Conference Proceeding    
DOI: 10.14778/2732977.2732981     Document Type: Conference Paper
Times cited : (82)

References (27)
  • 1
    • 84905122535 scopus 로고    scopus 로고
    • Linkedgeodata
    • Linkedgeodata: http://linkedgeodata.org.
  • 2
    • 84905116649 scopus 로고    scopus 로고
    • Openstreetmap
    • Openstreetmap: www.openstreetmap.org.
  • 12
    • 84905085044 scopus 로고    scopus 로고
    • Mapreduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In Communication of ACM, 2010.
    • (2010) Communication of ACM
    • Dean, J.1    Ghemawat, S.2
  • 13
    • 84905114638 scopus 로고    scopus 로고
    • Massjoin: A mapreduce-based algorithm for string similarity joins
    • D. Deng, G. Li, S. Hao, J. Wang, and J. Feng. Massjoin: A mapreduce-based algorithm for string similarity joins. In Proceedings of ICDE, 2013.
    • (2013) Proceedings of ICDE
    • Deng, D.1    Li, G.2    Hao, S.3    Wang, J.4    Feng, J.5
  • 16
    • 33750296887 scopus 로고    scopus 로고
    • Finding near-duplicate web pages: a large-scale evaluation of algorithms
    • M. R. Henzinger. Finding near-duplicate web pages: a large-scale evaluation of algorithms. In Proceedings of SIGIR, 2006.
    • (2006) Proceedings of SIGIR
    • Henzinger, M.R.1
  • 17
    • 0037319544 scopus 로고    scopus 로고
    • Methods for identifying versioned and plagiarized documents
    • T. C. Hoad and J. Zobel. Methods for identifying versioned and plagiarized documents. In JASIST 54(3), 2003.
    • (2003) JASIST , vol.54 , Issue.3
    • Hoad, T.C.1    Zobel, J.2
  • 18
    • 84863758126 scopus 로고    scopus 로고
    • V-smart-join: A scalable mapreduce framework for all-pair similarity joins of multisets and vectors
    • A. Metwally and C. Faloutsos. V-smart-join: A scalable mapreduce framework for all-pair similarity joins of multisets and vectors. In Proceedings of VLDB, 2012.
    • (2012) Proceedings of VLDB
    • Metwally, A.1    Faloutsos, C.2
  • 20
    • 84905110717 scopus 로고    scopus 로고
    • Spatial tessellations: Concepts and applications of voronoi diagrams
    • A. Okabe, B. Boots, K. Sugihara, and S. N. Chiu. Spatial tessellations: Concepts and applications of voronoi diagrams. 2009.
    • (2009)
    • Okabe, A.1    Boots, B.2    Sugihara, K.3    Chiu, S.N.4
  • 22
    • 84863752005 scopus 로고    scopus 로고
    • Bayesian locality sensitive hashing for fast similarity search
    • V. Satuluri and S. Parthasarathy. Bayesian locality sensitive hashing for fast similarity search. In Proceedings of VLDB, 2012.
    • (2012) Proceedings of VLDB
    • Satuluri, V.1    Parthasarathy, S.2
  • 23
  • 24
    • 77954744650 scopus 로고    scopus 로고
    • Efficient parallel set-similarity joins using mapreduce
    • R. Vernica, M. J. Carey, and C. Li. Efficient parallel set-similarity joins using mapreduce. In Proceedings of SIGMOD, 2010.
    • (2010) Proceedings of SIGMOD
    • Vernica, R.1    Carey, M.J.2    Li, C.3
  • 26
  • 27
    • 66249113620 scopus 로고    scopus 로고
    • Efficient similarity joins for near duplicate detection
    • C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In Proceedings of WWW, 2008.
    • (2008) Proceedings of WWW
    • Xiao, C.1    Wang, W.2    Lin, X.3    Yu, J.X.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.