메뉴 건너뛰기




Volumn , Issue , 2009, Pages 262-271

Finding text reuse on the web

Author keywords

Information flow; Text reuse; Web search

Indexed keywords

DETECTION TECHNIQUE; INFORMATION FLOW; INFORMATION FLOWS; LINK ANALYSIS; NOVEL TECHNIQUES; TEXT REUSE; TREC COLLECTION; WEB SEARCH; WEB SEARCHES;

EID: 70349155038     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1498759.1498835     Document Type: Conference Paper
Times cited : (72)

References (30)
  • 1
    • 33748871742 scopus 로고    scopus 로고
    • Tracking Information Epidemics in Blogspace
    • E. Adar and L. Adamic. Tracking Information Epidemics in Blogspace. In Proceedings of WI, pages 207-214, 2005.
    • (2005) Proceedings of WI , pp. 207-214
    • Adar, E.1    Adamic, L.2
  • 4
    • 57349156145 scopus 로고    scopus 로고
    • Genealogical trees on the web: A search engine user perspective
    • R. Baeza-Yates, Á. Pereira, and N. Ziviani. Genealogical trees on the web: a search engine user perspective. In Proceedings of WWW, 2008.
    • (2008) Proceedings of WWW
    • Baeza-Yates, R.1    Pereira, A.2    Ziviani, N.3
  • 6
    • 41849143005 scopus 로고    scopus 로고
    • Utilizing passage-based language models for document retrieval
    • M. Bendersky and O. Kurland. Utilizing passage-based language models for document retrieval. In Proceedings of ECIR, pages 162-174, 2008.
    • (2008) Proceedings of ECIR , pp. 162-174
    • Bendersky, M.1    Kurland, O.2
  • 7
    • 33646126481 scopus 로고    scopus 로고
    • A Scalable System for Identifying Co-derivative Documents
    • Y. Bernstein and J. Zobel. A Scalable System for Identifying Co-derivative Documents. In Proceedings of SPIRE, 2004.
    • (2004) Proceedings of SPIRE
    • Bernstein, Y.1    Zobel, J.2
  • 8
    • 0038589165 scopus 로고    scopus 로고
    • The anatomy of a large-scale hypertextual Web search engine
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117, 1998.
    • (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 9
    • 79956075292 scopus 로고    scopus 로고
    • Identifying and Filtering Near-Duplicate Documents
    • A. Broder. Identifying and Filtering Near-Duplicate Documents. In Proceedings of CPM, pages 1-10, 2000.
    • (2000) Proceedings of CPM , pp. 1-10
    • Broder, A.1
  • 10
    • 0036040277 scopus 로고    scopus 로고
    • Similarity estimation techniques from rounding algorithms
    • M. S. Charikar. Similarity estimation techniques from rounding algorithms. In Proceedings of STOC, pages 380-388, 2002.
    • (2002) Proceedings of STOC , pp. 380-388
    • Charikar, M.S.1
  • 11
    • 8644252773 scopus 로고    scopus 로고
    • Using temporal profiles of queries for precision prediction
    • F. Diaz and R. Jones. Using temporal profiles of queries for precision prediction. In Proceedings of SIGIR, pages 18-24, 2004.
    • (2004) Proceedings of SIGIR , pp. 18-24
    • Diaz, F.1    Jones, R.2
  • 12
    • 84885639910 scopus 로고    scopus 로고
    • Detecting phrase-level duplication on the world wide web
    • D. Fetterly, M. Manasse, and M. Najork. Detecting phrase-level duplication on the world wide web. In Proceedings of SIGIR, pages 170-177, 2005.
    • (2005) Proceedings of SIGIR , pp. 170-177
    • Fetterly, D.1    Manasse, M.2    Najork, M.3
  • 13
    • 84880498138 scopus 로고    scopus 로고
    • DOM-based content extraction of HTML documents
    • S. Gupta, G. Kaiser, D. Neistadt, and P. Grimm. DOM-based content extraction of HTML documents. In Proceedings of WWW, pages 207-214, 2003.
    • (2003) , pp. 207-214
    • Gupta, S.1    Kaiser, G.2    Neistadt, D.3    Grimm, P.4
  • 15
    • 33750296887 scopus 로고    scopus 로고
    • Finding near-duplicate web pages: A large-scale evaluation of algorithms
    • M. Henzinger. Finding near-duplicate web pages: a large-scale evaluation of algorithms. In Proceedings of SIGIR, pages 284-291, 2006.
    • (2006) Proceedings of SIGIR , pp. 284-291
    • Henzinger, M.1
  • 16
    • 4243148480 scopus 로고    scopus 로고
    • Authoritative sources in a hyperlinked environment
    • J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5):604-632, 1999.
    • (1999) Journal of the ACM (JACM) , vol.46 , Issue.5 , pp. 604-632
    • Kleinberg, J.1
  • 18
    • 35348861901 scopus 로고    scopus 로고
    • Web projections: Learning from contextual subgraphs of the web
    • J. Leskovec, S. Dumais, and E. Horvitz. Web projections: learning from contextual subgraphs of the web. In Proceedings of WWW, pages 471-480, 2007.
    • (2007) , pp. 471-480
    • Leskovec, J.1    Dumais, S.2    Horvitz, E.3
  • 19
  • 20
    • 84996678707 scopus 로고    scopus 로고
    • Information extraction: Distilling structured data from unstructured text
    • A. McCallum. Information extraction: distilling structured data from unstructured text. Queue, 3(9):48-57, 2005.
    • (2005) Queue , vol.3 , Issue.9 , pp. 48-57
    • McCallum, A.1
  • 21
    • 29244457315 scopus 로고    scopus 로고
    • Discovering evolutionary theme patterns from text: An exploration of temporal text mining
    • Q. Mei and C. Zhai. Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In Proceeding of KDD, pages 198-207, 2005.
    • (2005) Proceeding of KDD , pp. 198-207
    • Mei, Q.1    Zhai, C.2
  • 23
    • 84885662673 scopus 로고    scopus 로고
    • A Markov random field model for term dependencies
    • D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In Proceedings of SIGIR, pages 472-479, 2005.
    • (2005) In Proceedings of SIGIR , pp. 472-479
    • Metzler, D.1    Croft, W.B.2
  • 24
    • 37149036775 scopus 로고    scopus 로고
    • A translation model for sentence retrieval
    • V. Murdock and W. B. Croft. A translation model for sentence retrieval. In Proceedings of HLT/EMNLP, pages 684-691, 2005.
    • (2005) Proceedings of HLT/EMNLP , pp. 684-691
    • Murdock, V.1    Croft, W.B.2
  • 25
    • 0032268440 scopus 로고    scopus 로고
    • A language modeling approach to information retrieval
    • J. M. Ponte and B. W. Croft. A language modeling approach to information retrieval. In Proceedings of SIGIR, pages 275-281, 1998.
    • (1998) Proceedings of SIGIR , pp. 275-281
    • Ponte, J.M.1    Croft, B.W.2
  • 26
    • 84881219500 scopus 로고    scopus 로고
    • A maximum entropy approach to identifying sentence boundaries
    • J. C. Reynar and A. Ratnaparkhi. A maximum entropy approach to identifying sentence boundaries. In Proceedings of ANLP, pages 16-19, 1997.
    • (1997) Proceedings of ANLP , pp. 16-19
    • Reynar, J.C.1    Ratnaparkhi, A.2
  • 27
    • 33745199116 scopus 로고    scopus 로고
    • Milestones in Time: The Value of Landmarks in Retrieving Information from Personal Stores
    • M. Ringel, E. Cutrell, S. Dumais, and E. Horvitz. Milestones in Time: The Value of Landmarks in Retrieving Information from Personal Stores. In Proceedings of INTERACT, pages 184-191, 2003.
    • (2003) Proceedings of INTERACT , pp. 184-191
    • Ringel, M.1    Cutrell, E.2    Dumais, S.3    Horvitz, E.4
  • 29
    • 70349132329 scopus 로고    scopus 로고
    • N. Shivakumar and H. Garcia-Molina. SCAM: Copy detection mechanisms for digital documents. In Proceedings of Digital Libraries, 1995.
    • N. Shivakumar and H. Garcia-Molina. SCAM: Copy detection mechanisms for digital documents. In Proceedings of Digital Libraries, 1995.
  • 30
    • 8644264918 scopus 로고    scopus 로고
    • Timemines: Constructing timelines with statistical models of word usage
    • R. Swan and D. Jensen. Timemines: Constructing timelines with statistical models of word usage. In Proceedings of KDD, pages 73-80, 2000.
    • (2000) Proceedings of KDD , pp. 73-80
    • Swan, R.1    Jensen, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.