Volume , Issue , 2012, Pages 1402-1411

Efficient Jaccard-based diversity analysis of large document collections

Author keywords

clustering; diversity; jaccard

Indexed keywords

BIBLIOGRAPHIC DATA; CENSUS DATA; CLUSTERING; CONSTANT TIME; DATA SETS; DIVERSITY; DIVERSITY ANALYSIS; DOCUMENT COLLECTION; JACCARD; KNOWLEDGE EVOLUTION; LARGE DOCUMENT CORPORA; LINEAR-TIME APPROXIMATION; PHOTO-SHARING SITES; PROBABILISTIC GUARANTEES; QUADRATIC COMPLEXITY; SIMILARITY COMPUTATION; TOPIC DIVERSITY; USER-GENERATED CONTENT; WEB REPOSITORIES

EID: 84871064447     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2396761.2398445     Document Type: Conference Paper
Times cited : (20)

References (30)
  • 1
    • 77950369345
    • Data clustering: 50 Years beyond k-means
    • Data clustering: 50 years beyond k-means. Pattern Recognition Letters, 31(8):651-666, 2010.
    • (2010) Pattern Recognition Letters, vol. 31, issue 8, pp. 651-666
  • 4
    • 84871090671
    • Min-wise independent permutations: Theory and practice
    • A. Z. Broder. Min-wise independent permutations: Theory and practice. ICALP '00, London, UK.
    • ICALP '00, London, UK
    • Broder, A.Z.
  • 6
    • 84871080456
    • Identifying and filtering near-duplicate documents
    • A. Z. Broder. Identifying and filtering near-duplicate documents. COM '00, London, UK, 2000.
    • COM '00, London, UK, 2000
    • Broder, A.Z.
  • 8
    • 85086351786
    • Similarity estimation techniques from rounding algorithms
    • M. Charikar. Similarity estimation techniques from rounding algorithms. STOC '02.
    • STOC '02
    • Charikar, M.
  • 9
    • 0034538249
    • An optimal algorithm for Monte Carlo estimation
    • P. Dagum, R. Karp, M. Luby, and S. Ross. An optimal algorithm for Monte Carlo estimation. SIAM J. Comput., 29:1484-1496, March 2000.
    • (2000) SIAM J. Comput., vol. 29, pp. 1484-1496
    • Dagum, P.; Karp, R.; Luby, M.; Ross, S.
  • 11
    • 4344634903
    • Ethnic and cultural diversity by country
    • J. D. Fearon. Ethnic and cultural diversity by country. Journal of Economic Growth, 8:195-222, 2003.
    • (2003) Journal of Economic Growth, vol. 8, pp. 195-222
    • Fearon, J.D.
  • 13
    • 0001907042
    • Approximate nearest neighbors: Towards removing the curse of dimensionality
    • P. Indyk and R. Motwani. Approximate nearest neighbors: towards removing the curse of dimensionality. STOC '98, Dallas, Texas, USA.
    • STOC '98, Dallas, Texas, USA
    • Indyk, P.; Motwani, R.
  • 18
    • 85055761970
    • Measuring population diversity
    • S. Lieberson. Measuring population diversity. American Sociological Review, 34(6):850-862, 1969.
    • (1969) American Sociological Review, vol. 34, issue 6, pp. 850-862
    • Lieberson, S.
  • 20
    • 0242698067
    • The learning-curve sampling method applied to model-based clustering
    • C. Meek, B. Thiesson, and D. Heckerman. The learning-curve sampling method applied to model-based clustering. J. Mach. Learn. Res., 2:397-418, March 2002.
    • (2002) J. Mach. Learn. Res., vol. 2, pp. 397-418
    • Meek, C.; Thiesson, B.; Heckerman, D.
  • 21
    • 80052136714
    • Incremental diversification for very large sets: A streaming-based approach
    • E. Minack, W. Siberski, and W. Nejdl. Incremental diversification for very large sets: a streaming-based approach. In SIGIR '11, Beijing, China.
    • SIGIR '11, Beijing, China
    • Minack, E.; Siberski, W.; Nejdl, W.
  • 22
    • 0004140530
    • Random sampling from databases
    • Olken. Random sampling from databases. Ph.D. Diss. (University of California at Berkeley), 1993.
    • (1993) Ph.D. Diss. (University of California at Berkeley)
    • Olken
  • 23
    • 77952311353
    • Text clustering for peer-to-peer networks with probabilistic guarantees
    • O. Papapetrou, W. Siberski, and N. Fuhr. Text clustering for peer-to-peer networks with probabilistic guarantees. LNCS, vol. 5993, pages 293-305. Springer Berlin / Heidelberg, 2010.
    • (2010) LNCS, vol. 5993, pp. 293-305
    • Papapetrou, O.; Siberski, W.; Fuhr, N.
  • 25
    • 77952880386
    • Diversity and network coherence as indicators of interdisciplinarity: Case studies in bionanoscience
    • I. Rafols and M. Meyer. Diversity and network coherence as indicators of interdisciplinarity: case studies in bionanoscience. Scientometrics, 82(2):263-287, 2010.
    • (2010) Scientometrics, vol. 82, issue 2, pp. 263-287
    • Rafols, I.; Meyer, M.
  • 27
    • 0001072449
    • Measurement of diversity
    • E. H. Simpson. Measurement of diversity. Nature, 163, 1949.
    • (1949) Nature, vol. 163
    • Simpson, E.H.
  • 28
    • 34547621279
    • A general framework for analysing diversity in science, technology and society
    • A. Stirling. A general framework for analysing diversity in science, technology and society. Journal of The Royal Society Interface, 4(15):707-719, 2007.
    • (2007) Journal of the Royal Society Interface, vol. 4, issue 15, pp. 707-719
    • Stirling, A.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.