-
4
-
-
0038589165
-
The anatomy of a large-scale hypertextual Web search engine
-
April
-
Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual Web search engine. In Seventh International World Wide Web Conference, pages 107-111, April 1998.
-
(1998)
Seventh International World Wide Web Conference
, pp. 107-117
-
-
Brin, S.1
Page, L.2
-
6
-
-
0342652248
-
Crawling towards eternity
-
May
-
Mike Burner, Crawling towards Eternity. Web Techniques, 2(5), May 1997.
-
(1997)
Web Techniques
, vol.2
, Issue.5
-
-
Burner, M.1
-
10
-
-
0038209285
-
Crawling the web: Discovery and maintenance of large-scale web data
-
PhD thesis, Stanford University
-
Junghoo Cho. Crawling the Web: Discovery and Maintenance of Large-Scale Web Data. PhD thesis, Stanford University, 2001.
-
(2001)
-
-
Cho, J.1
-
15
-
-
33746687744
-
An autonomous, Web-based, multilingual corpus collection tool
-
Jim Cowie, Yevgeny Ludovik, and Ron Zacharski. An autonomous, Web-based, multilingual corpus collection tool. In International Conference on Natural Language Processing and Industrial Applications, 1998.
-
International Conference on Natural Language Processing and Industrial Applications, 1998
-
-
Cowie, J.1
Ludovik, Y.2
Zacharski, R.3
-
16
-
-
0034172483
-
Learning to construct knowledge bases from the World Wide Web
-
Mark Craven, Dan DiPasquo, Dayne Freitag, Andrew McCallum, Tom Mitchell, Kamal Nigan, and Séan Slattery. Learning to construct knowledge bases from the World Wide Web. Artificial Intelligence, 118(1-2):69-113, 2000.
-
(2000)
Artificial Intelligence
, vol.118
, Issue.1-2
, pp. 69-113
-
-
Craven, M.1
DiPasquo, D.2
Freitag, D.3
McCallum, A.4
Mitchell, T.5
Nigan, K.6
Slattery, S.7
-
18
-
-
70350672544
-
Focused crawling using context graphs
-
September
-
M. Diligenti, F. M. Coetzee, S. Lawrence, C. L. Giles, and jM. Gori. Focused crawling using context graphs. In 26th International Conference on Very Large Databases, pages 527-534, September 2000.
-
(2000)
26th International Conference on Very Large Databases
, pp. 527-534
-
-
Diligenti, M.1
Coetzee, F.M.2
Lawrence, S.3
Giles, C.L.4
Gori, M.5
-
21
-
-
33947178503
-
ProFusion: Intelligent fusion from multiple, distributed search engines
-
September
-
Susan Gauch, Guijun, Wang, and Mario Gomez. ProFusion: Intelligent fusion from multiple, distributed search engines. Journal of Universal Computer Science, 2(9), September 1996.
-
(1996)
Journal of Universal Computer Science
, vol.2
, Issue.9
-
-
Gauch, S.1
Guijun, W.2
Gomez, M.3
-
23
-
-
79951675059
-
Mercator: A scalable, extensible Web crawler
-
December
-
Allan Heydon and Marc Najork. Mercator: A scalable, extensible Web crawler. World Wide Web, 1(2):219-229, December 1999.
-
(1999)
World Wide Web
, vol.1
, Issue.2
, pp. 219-229
-
-
Heydon, A.1
Najork, M.2
-
24
-
-
0001842086
-
Performance limitations of the Java core libraries
-
June
-
Allan Heydon and Marc Najork. Performance limitations of the Java Core libraries. In ACM 1999 Java Grande Conference, pages 35-41, June 1999.
-
(1999)
ACM 1999 Java Grande Conference
, pp. 35-41
-
-
Heydon, A.1
Najork, M.2
-
25
-
-
0002409860
-
A probabilistic analysis of the Rocchio algorithm with TFIDF for text classification
-
Thorsten Joachims. A probabilistic analysis of the Rocchio algorithm with TFIDF for text classification. In 14th International Conference on Machine Learning, pages 143-151, 1997.
-
(1997)
14th International Conference on Machine Learning
, pp. 143-151
-
-
Joachims, T.1
-
27
-
-
4243148480
-
Authoritative sources in a hyperlinked environment
-
Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.
-
(1999)
Journal of the ACM
, vol.46
, Issue.5
, pp. 604-632
-
-
Kleinberg, J.M.1
-
31
-
-
0034794539
-
Evaluating topic-driven Web crawlers
-
September
-
Filippo Menczer, Gautan Pant, Padmini Srinivasan, and Miguel E. Ruiz. Evaluating topic-driven Web crawlers. In 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 241-249, September 2001.
-
(2001)
24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 241-249
-
-
Menczer, F.1
Pant, G.2
Srinivasan, P.3
Ruiz, M.E.4
-
34
-
-
0033886806
-
Text classification from labeled and unlabeled documents using EM
-
Kamal Nigam, Andrew Kachites McCallum, Sebastian Thrun, and Tom Mitchell. Text classification from labeled and unlabeled documents using EM. Machine Learning, 39(2/3):103-134, 2000.
-
(2000)
Machine Learning
, vol.39
, Issue.2-3
, pp. 103-134
-
-
Nigam, K.1
McCallum, A.K.2
Thrun, S.3
Mitchell, T.4
|