-
1
-
-
33748157553
-
-
P. Lyman, H. Varian, J. Dunn, A. Strygin, K. Swearingen, How much information? 2003. Available from: . Link checked on March 10, 2006.
-
-
-
-
2
-
-
33748204717
-
-
R. Zakon, Hobbes' internet timeline v7.0. Available from: . Link checked on March 10, 2006.
-
-
-
-
3
-
-
33748182037
-
-
Search engine sizes. Available from: . Link checked on March 10, 2006.
-
-
-
-
4
-
-
33748172967
-
-
Site position and coverage. Available from: . Link checked on March 10, 2006.
-
-
-
-
5
-
-
33748142139
-
-
S. Chakrabarti, M. van den Berg, B. Dom, Focused crawling: a new approach to topic-specific Web resource discovery, in: Proceedings of the 8th International WWW Conference, Toronto, Canada, 1999.
-
-
-
-
6
-
-
33748189874
-
-
D. Bergmark, C. Lagoze, A. Sbityakov, Focused crawls, tunneling, and digital libraries, in: Proceedings of the 6th European Conference on Digital Libraries, Rome, Italy, 2002.
-
-
-
-
7
-
-
33748162744
-
-
P.D. Bra, R. Post, Information retrieval in the World Wide Web: making client-base searching feasible, in: Proceedings of the 1st International WWW Conference, Geneva, Switzerland, 1994.
-
-
-
-
8
-
-
33748138286
-
-
M. Hersovici, M. Jacovi, Y. Maarek, D. Pelleg, M. Shtalhaim, S. Ur, The Shark-search algorithm-an application: tailored Web site mapping, in: Proceedings of the 7th International WWW Conference, Brisbane, Australia, 1998.
-
-
-
-
9
-
-
84874371227
-
-
C. Aggarwal, F. Al-Garawi, P. Yu, Intelligent crawling on the World Wide Web with arbitrary predicates, in: Proceedings of the 10th International WWW Conference, Hong Kong, 2001.
-
-
-
-
10
-
-
33748174705
-
-
S. Chakrabarti, K. Punera, M. Subramanyam, Accelerated focused crawling through online relevance feedback, in: Proceedings of the 11th International WWW Conference, Hawaii, USA, 1999.
-
-
-
-
11
-
-
33748130953
-
-
J. Cho, H. Garcia-Molina, L. Page, Efficient crawling through URL ordering, in: Proceedings of the 7th World Wide Web Conference, Brisbane, Australia, 1998.
-
-
-
-
12
-
-
33748206438
-
-
K. Stamatakis, V. Karkaletsis, G. Paliouras, J. Horlock, et al., Domain-specific Web site identification: the CROSSMARC focused Web crawler, in: Proceedings of the 2nd International Workshop on Web Document Analysis (WDA2003), Edinburgh, UK, 2003.
-
-
-
-
13
-
-
33748178502
-
-
J. Rennie, A. McCallum, Using reinforcement learning to spider the Web efficiently, in: Proceedings of the 16th International Conference on Machine Learning (ICML-99), Bled, Slovenia, 1999.
-
-
-
-
14
-
-
1942484949
-
-
J. Johnson, K. Tsioutsiouliklis, C.L. Giles, Evolving strategies for focused Web crawling, in: Proceedings of the 20th International Conference on Machine Learning (ICML-2003), Washington, DC, USA, 2003.
-
-
-
-
15
-
-
70350672544
-
-
M. Diligenti, F. Coetzee, S. Lawrence, C. Giles, M. Gori, Focused crawling using context graphs, in: Proceedings of the 26th International Conference on Very Large Databases (VLDB 2000), Cairo, Egypt, 2000.
-
-
-
-
16
-
-
0034794539
-
-
F. Menczer, G. Pant, P. Srinivasan, M. Ruiz, Evaluating topic-driven Web crawlers, in: Proceedings of the 24th Annual International ACM/SIGIR Conference, New Orleans, USA, 2001.
-
-
-
-
17
-
-
9744257884
-
Topical Web crawlers: evaluating adaptive algorithms
-
Menczer F., Pant G., and Srinivasan P. Topical Web crawlers: evaluating adaptive algorithms. ACM TOIT 4 4 (2004) 378-419
-
(2004)
ACM TOIT
, vol.4
, Issue.4
, pp. 378-419
-
-
Menczer, F.1
Pant, G.2
Srinivasan, P.3
-
18
-
-
17444365825
-
A general evaluation framework for topical crawlers
-
Srinivasan P., Menczer F., and Pant G. A general evaluation framework for topical crawlers. Information Retrieval 8 3 (2005) 417-447
-
(2005)
Information Retrieval
, vol.8
, Issue.3
, pp. 417-447
-
-
Srinivasan, P.1
Menczer, F.2
Pant, G.3
-
19
-
-
4944227235
-
-
G. Pant, K. Tsioutsiouliklis, J. Johnson, C. Giles, Panorama: extending digital libraries with topical crawlers, in: Proceedings of ACM/IEEE Joint Conference on Digital Libraries (JCDL 2004), Tucson, Arizona, June 2004, pp. 142-150.
-
-
-
-
20
-
-
33748136124
-
-
Algorithmic Solutions. Available from: . Link checked on March 10, 2006.
-
-
-
-
21
-
-
33748147026
-
-
M.W. Berry, LSI: Latent Semantic Indexing Web Site. Available from: . Link checked on March 10, 2006.
-
-
-
-
22
-
-
84989525001
-
Indexing by latent semantic analysis
-
Deerwester S.C., Dumais S.T., Landauer T.K., Furnas G.W., and Harshman R.A. Indexing by latent semantic analysis. Journal of the American Society of Information Science 41 6 (1990) 391-407
-
(1990)
Journal of the American Society of Information Science
, vol.41
, Issue.6
, pp. 391-407
-
-
Deerwester, S.C.1
Dumais, S.T.2
Landauer, T.K.3
Furnas, G.W.4
Harshman, R.A.5
-
24
-
-
33748154973
-
-
M.W. Berry et al., SVDPACKC: Version 1.0 User's Guide, Technical Report CS-93-194, University of Tennessee, Knoxville, TN, October 1993.
-
-
-
-
26
-
-
0024610919
-
A tutorial on hidden Markov model and selected applications in speech recognition
-
Rabiner L.R. A tutorial on hidden Markov model and selected applications in speech recognition. Proceedings of the IEEE 77 2 (1989) 257-285
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-285
-
-
Rabiner, L.R.1
-
27
-
-
33748199239
-
-
Google. Available from: . Link checked on March 10, 2006.
-
-
-
-
29
-
-
1542287488
-
-
D. Pinto, A. McCallum, X. Wei, W.B. Croft, Table extraction using conditional random fields, in: Proceedings of the 26th Annual International ACM SIGIR Conference, Toronto, Canada, 2003.
-
-
-
|