-
1
-
-
33748871742
-
Tracking Information Epidemics in Blogspace
-
E. Adar and L. Adamic. Tracking Information Epidemics in Blogspace. In Proceedings of WI, pages 207-214, 2005.
-
(2005)
Proceedings of WI
, pp. 207-214
-
-
Adar, E.1
Adamic, L.2
-
2
-
-
42949138243
-
Finding high-quality content in social media
-
E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Finding high-quality content in social media. In Proceedings of WSDM, pages 183-194, 2008.
-
(2008)
In Proceedings of WSDM
, pp. 183-194
-
-
Agichtein, E.1
Castillo, C.2
Donato, D.3
Gionis, A.4
Mishne, G.5
-
6
-
-
41849143005
-
Utilizing passage-based language models for document retrieval
-
M. Bendersky and O. Kurland. Utilizing passage-based language models for document retrieval. In Proceedings of ECIR, pages 162-174, 2008.
-
(2008)
Proceedings of ECIR
, pp. 162-174
-
-
Bendersky, M.1
Kurland, O.2
-
7
-
-
33646126481
-
A Scalable System for Identifying Co-derivative Documents
-
Y. Bernstein and J. Zobel. A Scalable System for Identifying Co-derivative Documents. In Proceedings of SPIRE, 2004.
-
(2004)
Proceedings of SPIRE
-
-
Bernstein, Y.1
Zobel, J.2
-
8
-
-
0038589165
-
The anatomy of a large-scale hypertextual Web search engine
-
S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117, 1998.
-
(1998)
Computer Networks and ISDN Systems
, vol.30
, Issue.1-7
, pp. 107-117
-
-
Brin, S.1
Page, L.2
-
9
-
-
79956075292
-
Identifying and Filtering Near-Duplicate Documents
-
A. Broder. Identifying and Filtering Near-Duplicate Documents. In Proceedings of CPM, pages 1-10, 2000.
-
(2000)
Proceedings of CPM
, pp. 1-10
-
-
Broder, A.1
-
10
-
-
0036040277
-
Similarity estimation techniques from rounding algorithms
-
M. S. Charikar. Similarity estimation techniques from rounding algorithms. In Proceedings of STOC, pages 380-388, 2002.
-
(2002)
Proceedings of STOC
, pp. 380-388
-
-
Charikar, M.S.1
-
11
-
-
8644252773
-
Using temporal profiles of queries for precision prediction
-
F. Diaz and R. Jones. Using temporal profiles of queries for precision prediction. In Proceedings of SIGIR, pages 18-24, 2004.
-
(2004)
Proceedings of SIGIR
, pp. 18-24
-
-
Diaz, F.1
Jones, R.2
-
12
-
-
84885639910
-
Detecting phrase-level duplication on the world wide web
-
D. Fetterly, M. Manasse, and M. Najork. Detecting phrase-level duplication on the world wide web. In Proceedings of SIGIR, pages 170-177, 2005.
-
(2005)
Proceedings of SIGIR
, pp. 170-177
-
-
Fetterly, D.1
Manasse, M.2
Najork, M.3
-
13
-
-
84880498138
-
DOM-based content extraction of HTML documents
-
S. Gupta, G. Kaiser, D. Neistadt, and P. Grimm. DOM-based content extraction of HTML documents. In Proceedings of WWW, pages 207-214, 2003.
-
(2003)
, pp. 207-214
-
-
Gupta, S.1
Kaiser, G.2
Neistadt, D.3
Grimm, P.4
-
15
-
-
33750296887
-
Finding near-duplicate web pages: A large-scale evaluation of algorithms
-
M. Henzinger. Finding near-duplicate web pages: a large-scale evaluation of algorithms. In Proceedings of SIGIR, pages 284-291, 2006.
-
(2006)
Proceedings of SIGIR
, pp. 284-291
-
-
Henzinger, M.1
-
16
-
-
4243148480
-
Authoritative sources in a hyperlinked environment
-
J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5):604-632, 1999.
-
(1999)
Journal of the ACM (JACM)
, vol.46
, Issue.5
, pp. 604-632
-
-
Kleinberg, J.1
-
18
-
-
35348861901
-
Web projections: Learning from contextual subgraphs of the web
-
J. Leskovec, S. Dumais, and E. Horvitz. Web projections: learning from contextual subgraphs of the web. In Proceedings of WWW, pages 471-480, 2007.
-
(2007)
, pp. 471-480
-
-
Leskovec, J.1
Dumais, S.2
Horvitz, E.3
-
20
-
-
84996678707
-
Information extraction: Distilling structured data from unstructured text
-
A. McCallum. Information extraction: distilling structured data from unstructured text. Queue, 3(9):48-57, 2005.
-
(2005)
Queue
, vol.3
, Issue.9
, pp. 48-57
-
-
McCallum, A.1
-
21
-
-
29244457315
-
Discovering evolutionary theme patterns from text: An exploration of temporal text mining
-
Q. Mei and C. Zhai. Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In Proceeding of KDD, pages 198-207, 2005.
-
(2005)
Proceeding of KDD
, pp. 198-207
-
-
Mei, Q.1
Zhai, C.2
-
22
-
-
33745797351
-
Similarity measures for tracking information flow
-
D. Metzler, Y. Bernstein, W. B. Croft, A. Moffat, and J. Zobel. Similarity measures for tracking information flow. In Proceedings of CIKM, 2005.
-
(2005)
Proceedings of CIKM
-
-
Metzler, D.1
Bernstein, Y.2
Croft, W.B.3
Moffat, A.4
Zobel, J.5
-
23
-
-
84885662673
-
A Markov random field model for term dependencies
-
D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In Proceedings of SIGIR, pages 472-479, 2005.
-
(2005)
In Proceedings of SIGIR
, pp. 472-479
-
-
Metzler, D.1
Croft, W.B.2
-
24
-
-
37149036775
-
A translation model for sentence retrieval
-
V. Murdock and W. B. Croft. A translation model for sentence retrieval. In Proceedings of HLT/EMNLP, pages 684-691, 2005.
-
(2005)
Proceedings of HLT/EMNLP
, pp. 684-691
-
-
Murdock, V.1
Croft, W.B.2
-
25
-
-
0032268440
-
A language modeling approach to information retrieval
-
J. M. Ponte and B. W. Croft. A language modeling approach to information retrieval. In Proceedings of SIGIR, pages 275-281, 1998.
-
(1998)
Proceedings of SIGIR
, pp. 275-281
-
-
Ponte, J.M.1
Croft, B.W.2
-
26
-
-
84881219500
-
A maximum entropy approach to identifying sentence boundaries
-
J. C. Reynar and A. Ratnaparkhi. A maximum entropy approach to identifying sentence boundaries. In Proceedings of ANLP, pages 16-19, 1997.
-
(1997)
Proceedings of ANLP
, pp. 16-19
-
-
Reynar, J.C.1
Ratnaparkhi, A.2
-
27
-
-
33745199116
-
Milestones in Time: The Value of Landmarks in Retrieving Information from Personal Stores
-
M. Ringel, E. Cutrell, S. Dumais, and E. Horvitz. Milestones in Time: The Value of Landmarks in Retrieving Information from Personal Stores. In Proceedings of INTERACT, pages 184-191, 2003.
-
(2003)
Proceedings of INTERACT
, pp. 184-191
-
-
Ringel, M.1
Cutrell, E.2
Dumais, S.3
Horvitz, E.4
-
29
-
-
70349132329
-
-
N. Shivakumar and H. Garcia-Molina. SCAM: Copy detection mechanisms for digital documents. In Proceedings of Digital Libraries, 1995.
-
N. Shivakumar and H. Garcia-Molina. SCAM: Copy detection mechanisms for digital documents. In Proceedings of Digital Libraries, 1995.
-
-
-
-
30
-
-
8644264918
-
Timemines: Constructing timelines with statistical models of word usage
-
R. Swan and D. Jensen. Timemines: Constructing timelines with statistical models of word usage. In Proceedings of KDD, pages 73-80, 2000.
-
(2000)
Proceedings of KDD
, pp. 73-80
-
-
Swan, R.1
Jensen, D.2
|