메뉴 건너뛰기




Volumn 54, Issue 8, 2011, Pages 101-109

Theory and applications of b-bit minwise hashing

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL OVERHEADS; LARGE DATASETS; REAL LIFE DATASETS; SET SIMILARITY; SIMPLE ALGORITHM; STORAGE REQUIREMENTS;

EID: 79961058834     PISSN: 00010782     EISSN: 15577317     Source Type: Journal    
DOI: 10.1145/1978542.1978566     Document Type: Article
Times cited : (95)

References (25)
  • 1
    • 37549058056 scopus 로고    scopus 로고
    • Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions
    • Andoni, A., Indyk, P. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Commun. ACM 51 (2008), 117-122.
    • (2008) Commun ACM , vol.51 , pp. 117-122
    • Andoni, A.1    Indyk, P.2
  • 2
    • 0031346696 scopus 로고    scopus 로고
    • On the resemblance and containment of documents
    • Positano, Italy
    • Broder, A.Z. On the resemblance and containment of documents. In The Compression and Complexity of Sequences (Positano, Italy, 1997), 21-29.
    • (1997) The Compression and Complexity of Sequences , pp. 21-29
    • Broder, A.Z.1
  • 4
    • 0010362121 scopus 로고    scopus 로고
    • Syntactic clustering of the web
    • Santa Clara, CA
    • Broder, A.Z., Glassman, S.C., Manasse, M.S., Zweig, G. Syntactic clustering of the web. In WWW (Santa Clara, CA , 1997), 1157-1166.
    • (1997) WWW , pp. 1157-1166
    • Broder, A.Z.1    Glassman, S.C.2    Manasse, M.S.3    Zweig, G.4
  • 7
    • 84880492977 scopus 로고    scopus 로고
    • A large-scale study of the evolution of web pages
    • Budapest, Hungary
    • Fetterly, D., Manasse, M., Najork, M., Wiener, J.L. A large-scale study of the evolution of web pages. In WWW (Budapest, Hungary, 2003), 669-678.
    • (2003) WWW , pp. 669-678
    • Fetterly, D.1    Manasse, M.2    Najork, M.3    Wiener, J.L.4
  • 8
    • 77952264233 scopus 로고    scopus 로고
    • Efficient detection of large-scale redundancy in enterprise file systems
    • Forman, G., Eshghi, K., Suermondt, J. Efficient detection of large-scale redundancy in enterprise file systems. SIGOPS Oper. Syst. Rev. 43, 1 (2009), 84-91.
    • (2009) SIGOPS Oper. Syst. Rev. , vol.43 , Issue.1 , pp. 84-91
    • Forman, G.1    Eshghi, K.2    Suermondt, J.3
  • 10
    • 0040152802 scopus 로고    scopus 로고
    • And tunable similar set retrieval
    • Santa Barbara, CA
    • Gionis, A., Gunopulos, D., Koudas, N. Efficient and tunable similar set retrieval. In SIGMOD (Santa Barbara, CA , 2001), 247-258.
    • (2001) SIGMOD , pp. 247-258
    • Gionis, A.1    Gunopulos, D.2    Efficient, K.N.3
  • 11
    • 84893574327 scopus 로고
    • Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming
    • Goemans, M.X., Williamson, D.P. Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. J. ACM 42, 6 (1995), 1115-1145.
    • (1995) J ACM , vol.42 , Issue.6 , pp. 1115-1145
    • Goemans, M.X.1    Williamson, D.P.2
  • 12
    • 0344612511 scopus 로고    scopus 로고
    • A Small Approximately Min-Wise Independent Family of Hash Functions
    • DOI doi:10.1006/jagm.2000.1131
    • Indyk, P. A small approximately min-wise independent family of hash functions. J. Algorithms 38, 1 (2001), 84-90. (Pubitemid 33667201)
    • (2001) Journal of Algorithms , vol.38 , Issue.1 , pp. 84-90
    • Indyk, P.1
  • 13
    • 0031644241 scopus 로고    scopus 로고
    • Approximate nearest neighbors: Towards removing the curse of dimensionality
    • Dallas, TX
    • Indyk, P., Motwani, R. Approximate nearest neighbors: Towards removing the curse of dimensionality. In STOC (Dallas, TX, 1998), 604-613.
    • (1998) STOC , pp. 604-613
    • Indyk, P.1    Motwani, R.2
  • 14
    • 0038784483 scopus 로고    scopus 로고
    • On the sample size of k-restricted min-wise independent permutations and other k-wise distributions
    • San Diego, CA
    • Itoh, T., Takei, Y., Tarui, J. On the sample size of k-restricted min-wise independent permutations and other k-wise distributions. In STOC (San Diego, CA , 2003), 710-718.
    • (2003) STOC , pp. 710-718
    • Itoh, T.1    Takei, Y.2    Tarui, J.3
  • 15
    • 0031636499 scopus 로고    scopus 로고
    • Efficient search for approximate nearest neighbor in high dimensional spaces
    • Dallas, TX
    • Kushilevitz, E., Ostrovsky, R., Rabani, Y. Efficient search for approximate nearest neighbor in high dimensional spaces. In STOC (Dallas, TX, 1998), 614-623.
    • (1998) STOC , pp. 614-623
    • Kushilevitz, E.1    Ostrovsky, R.2    Rabani, Y.3
  • 16
    • 34748825544 scopus 로고    scopus 로고
    • A sketch algorithm for estimating two-way and multi-way associations
    • Preliminary results appeared in HLT /EM NLP
    • Li, P., Church, K.W. A sketch algorithm for estimating two-way and multi-way associations. Comput. Linguist. 33, 3 (2007), 305-354 (Preliminary results appeared in HLT /EM NLP 2005).
    • (2005) Comput. Linguist. , vol.33 , Issue.3 , pp. 305-354
    • Li, P.1    Church, K.W.2
  • 17
    • 79961072869 scopus 로고    scopus 로고
    • One sketch for all: Theory and applications of conditional random sampling
    • (Vancouver, British Columbia, Canada 2008) (Preliminary results appeared in NIPS
    • Li, P., Church, K.W., Hastie, T.J. One sketch for all: Theory and applications of conditional random sampling. In NIPS (Vancouver, British Columbia, Canada, 2008) (Preliminary results appeared in NIPS 2006).
    • (2006) NIPS
    • Li, P.1    Church, K.W.2    Hastie, T.J.3
  • 19
    • 85162008856 scopus 로고    scopus 로고
    • B-Bit minwise hashing for estimating three-way similarities
    • Vancouver, British Columbia, Canada
    • Li, P., König, A.C., Gui, W. b-Bit minwise hashing for estimating three-way similarities. In NIPS (Vancouver, British Columbia, Canada, 2010).
    • (2010) NIPS
    • Li, P.1    König, A.C.2    Gui, W.3
  • 20
    • 79961040653 scopus 로고    scopus 로고
    • B-Bit minwise hashing for large-scale linear SVM
    • Li, P., Moore, J., König, A.C. b-Bit minwise hashing for large-scale linear SVM. Technical report, 2011.http://www.stat.cornell.edu/?li/b- bithashing/HashingSVM.pdf
    • (2011) Technical Report
    • Li, P.1    Moore, J.2    König, A.C.3
  • 21
    • 70350645468 scopus 로고    scopus 로고
    • Applying Syntactic similarity algorithms for enterprise information management
    • Paris, France
    • Cherkasova, L., Eshghi, K., Morrey III, C.B., Tucek, J., Veitch, A. Applying Syntactic similarity algorithms for enterprise information management. In KDD (Paris, France, 2009), 1087-1096.
    • (2009) KDD , pp. 1087-1096
    • Cherkasova, L.1    Eshghi, K.2    Morrey Iii, C.B.3    Tucek, J.4    Veitch, A.5
  • 25


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.