메뉴 건너뛰기




Volumn , Issue , 2009, Pages 81-90

Efficient overlap and content reuse detection in blogs and online news articles.

Author keywords

Reuse detection; Weblogs

Indexed keywords

BLOGOSPHERES; CONTENT RE-USE; DETECTION RATES; DYNAMIC NATURE; INCREMENTAL PROCESSING; INFORMATION SOURCES; MEDIA OUTLETS; MULTIPLE ORDERS; ONLINE NEWS; PROCESSING TIME; WEBLOGS;

EID: 77955914175     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1526709.1526721     Document Type: Conference Paper
Times cited : (30)

References (40)
  • 1
    • 84865633903 scopus 로고    scopus 로고
    • David Sifry's Blog. http://www.sifry.com/alerts/.
  • 2
    • 84865633905 scopus 로고    scopus 로고
    • Google Blog Search. http://blogsearch.google.com/blogsearch.
  • 3
    • 84865646867 scopus 로고    scopus 로고
    • Google News. http://news.google.com.
  • 4
    • 84865660488 scopus 로고    scopus 로고
    • Google Book Search. http://books.google.com/.
  • 5
    • 84865660484 scopus 로고    scopus 로고
    • Yahoo News. http://news.yahoo.com.
  • 9
    • 85104914015 scopus 로고    scopus 로고
    • Efficient exact set-similarity joins
    • A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, 2006.
    • (2006) VLDB
    • Arasu, A.1    Ganti, V.2    Kaushik, R.3
  • 10
    • 35348849154 scopus 로고    scopus 로고
    • Scaling up all Pairs similarity search
    • R.J. Bayardo, Y. Ma, and R. Srikant. Scaling Up All Pairs Similarity Search. In WWW, 2007.
    • (2007) WWW
    • Bayardo, R.J.1    Ma, Y.2    Srikant, R.3
  • 11
    • 0037870443 scopus 로고    scopus 로고
    • The X-tree: An index structure for high-dimensional data
    • S. Berchtold, D.A. Keim, and H. Kriegei. The X-tree: An Index Structure for High-Dimensional Data. In VLDB, 1996.
    • (1996) VLDB
    • Berchtold, S.1    Keim, D.A.2    Kriegei, H.3
  • 15
    • 84976810280 scopus 로고
    • Copy detection mechanisms for digital documents
    • S. Brin, J. Davis, and H. Garcia-Molina. Copy detection mechanisms for digital documents. In SIGMOD, 1995.
    • (1995) SIGMOD
    • Brin, S.1    Davis, J.2    Garcia-Molina, H.3
  • 16
    • 0032664793 scopus 로고    scopus 로고
    • The hybrid tree: An index structure for high dimensional feature spaces
    • K. Chakrabarti, and S. Mehrotra. The Hybrid Tree: An Index Structure for High Dimensional Feature Spaces. In ICDE, 1999.
    • (1999) ICDE
    • Chakrabarti, K.1    Mehrotra, S.2
  • 17
    • 33749597967 scopus 로고    scopus 로고
    • A primitive operator for similarity joins in data cleaning
    • S. Chaudhuri, V. Ganti, and R. Kaushik. A Primitive Operator for Similarity Joins in Data Cleaning. In ICDE, 2006.
    • (2006) ICDE
    • Chaudhuri, S.1    Ganti, V.2    Kaushik, R.3
  • 19
    • 36849049806 scopus 로고    scopus 로고
    • Structural and temporal analysis of the blogosphere through community factorization
    • Y. Chi, S. Zhu, X. Song, J. Tatemura, and B.L. Tseng. Structural and temporal analysis of the blogosphere through community factorization. In SIGKDD, 2007.
    • (2007) SIGKDD
    • Chi, Y.1    Zhu, S.2    Song, X.3    Tatemura, J.4    Tseng, B.L.5
  • 21
    • 0013206133 scopus 로고    scopus 로고
    • Collection statistics for fast duplicate document detection
    • A. Chowdhury, O. Frieder, D. Grossman, M.C. McCabe. Collection statistics for fast duplicate document detection. ACM TOIS, v.20 n.2, p.171-191, 2002.
    • (2002) ACM Tois , vol.20 , Issue.2 , pp. 171-191
    • Chowdhury, A.1    Frieder, O.2    Grossman, D.3    McCabe, M.C.4
  • 22
    • 15044355327 scopus 로고    scopus 로고
    • Similarity search in high dimensions via hashing
    • A. Gionis, P. Indyk, and R. Motwani. Similarity Search in High Dimensions via Hashing. In VLDB, 1999.
    • (1999) VLDB
    • Gionis, A.1    Indyk, P.2    Motwani, R.3
  • 26
    • 0030646261 scopus 로고    scopus 로고
    • Locality-preserving hashing in multidimensional spaces
    • P. Indyk, R. Motwani, P. Raghavan and S. Vempala Locality-preserving hashing in multidimensional spaces. In STOC, 1997.
    • (1997) STOC
    • Indyk, P.1    Motwani, R.2    Raghavan, P.3    Vempala, S.4
  • 27
    • 0031162081 scopus 로고    scopus 로고
    • The SR-tree: An index structure for high-dimensional nearest neighbor queries
    • N. Katayama and S. Satoh. The SR-tree: an index structure for high-dimensional nearest neighbor queries. In SIGMOD, 1997.
    • (1997) SIGMOD
    • Katayama, N.1    Satoh, S.2
  • 28
    • 47749095961 scopus 로고    scopus 로고
    • CDIP: Collection-driven, yet individuality-preserving automated blog tagging
    • J.W. Kim, K.S. Candan, and J.Tatemura. CDIP: Collection-Driven, yet Individuality-Preserving Automated Blog Tagging. In ICSC, 2007.
    • (2007) ICSC
    • Kim, J.W.1    Candan, K.S.2    Tatemura, J.3
  • 30
    • 57349180452 scopus 로고    scopus 로고
    • Generating links by mining quotations
    • O. Kolak, and B.N. Schilit. Generating links by mining quotations. In HT, 2008.
    • (2008) HT
    • Kolak, O.1    Schilit, B.N.2
  • 32
    • 35348911985 scopus 로고    scopus 로고
    • Detecting near duplicates for web crawling
    • G.S. Manku, A. Jain and A.D.Sarma. Detecting Near Duplicates for Web Crawling. In WWW, 2007.
    • (2007) WWW
    • Manku, G.S.1    Jain, A.2    Sarma, A.D.3
  • 34
    • 1142267351 scopus 로고    scopus 로고
    • Winnowing: Local algorithms for document fingerprinting
    • S. Schleimer, D.S. Wilkerson, and A. Aiken. Winnowing: Local Algorithms for Document Fingerprinting. In SIGMOD, 2003.
    • (2003) SIGMOD
    • Schleimer, S.1    Wilkerson, D.S.2    Aiken, A.3
  • 35
    • 85088005959 scopus 로고    scopus 로고
    • Efficient set joins on similarity predicates
    • S. Sarawagi, and A. Kirpa. Efficient set joins on similarity predicates. In SIGMOD, 2004.
    • (2004) SIGMOD
    • Sarawagi, S.1    Kirpa, A.2
  • 38
    • 33750311279 scopus 로고    scopus 로고
    • Near-duplicate detection by instance-level constrained clustering
    • H. Yang, and J. Callan Near-duplicate detection by instance-level constrained clustering. In SIGIR, 2006.
    • (2006) SIGIR
    • Yang, H.1    Callan, J.2
  • 40
    • 66249113620 scopus 로고    scopus 로고
    • Efficient similarity joins for near duplicate detection
    • C. Xiao, W. Wang, X. Lin, and J.X. Yu. Efficient Similarity Joins for Near Duplicate Detection. In WWW, 2008.
    • (2008) WWW
    • Xiao, C.1    Wang, W.2    Lin, X.3    Yu, J.X.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.