-
1
-
-
33846530701
-
Analyzing imbalance among homogeneous index servers in a web search system
-
DOI 10.1016/j.ipm.2006.09.002, PII S0306457306001452
-
BADUE, C. S., BAEZA-YATES, R., RIBEIRO-NETO, B., ZIVIANI, A., AND ZIVIANI, N. 2007. Analyzing imbalance among homogeneous index servers in a Web search system. Inform. Process. Manage. 43, 3, 592-608. (Pubitemid 46164824)
-
(2007)
Information Processing and Management
, vol.43
, Issue.3
, pp. 592-608
-
-
Badue, C.S.1
Baeza-Yates, R.2
Ribeiro-Neto, B.3
Ziviani, A.4
Ziviani, N.5
-
2
-
-
67650650786
-
Challenges in distributed information retrieval (invited paper)
-
IEEE CS Press
-
BAEZA-YATES, R., CASTILLO, C., JUNQUEIRA, F., PLACHOURAS, V., AND SILVESTRI, F. 2007a. Challenges in distributed information retrieval (invited paper). In Proceedings of International Conference on Data Engineering (ICDE). IEEE CS Press.
-
(2007)
Proceedings of International Conference on Data Engineering (ICDE)
-
-
Baeza-Yates, R.1
Castillo, C.2
Junqueira, F.3
Plachouras, V.4
Silvestri, F.5
-
3
-
-
36448931586
-
The impact of caching on search engines
-
DOI 10.1145/1277741.1277775, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
-
BAEZA-YATES, R., GIONIS, A., JUNQUEIRA, F., MURDOCK, V., PLACHOURAS, V., AND SILVESTRI, F. 2007b. The impact of caching on search engines. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). ACM, New York, NY, 183-190. (Pubitemid 350164960)
-
(2007)
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07
, pp. 183-190
-
-
Baeza-Yates, R.1
Gionis, A.2
Junqueira, F.3
Murdock, V.4
Plachouras, V.5
Silvestri, F.6
-
4
-
-
55149095218
-
Design trade-offs for search engine caching
-
BAEZA-YATES, R., GIONIS, A., JUNQUEIRA, F. P., MURDOCK, V., PLACHOURAS, V., AND SILVESTRI, F. 2008. Design trade-offs for search engine caching. ACM Trans. Web 2, 4, 1-28.
-
(2008)
ACM Trans. Web
, vol.2
, Issue.4
, pp. 1-28
-
-
Baeza-Yates, R.1
Gionis, A.2
Junqueira, F.P.3
Murdock, V.4
Plachouras, V.5
Silvestri, F.6
-
5
-
-
0037619265
-
Web search for a planet: The Google cluster architecture
-
BARROSO, L., DEAN, J., AND HÖLZE, U. 2003. Web search for a planet: The Google cluster architecture. IEEE Micro 22, 2.
-
(2003)
IEEE Micro
, vol.22
, Issue.2
-
-
Barroso, L.1
Dean, J.2
Hölze, U.3
-
6
-
-
8744301897
-
Hourly analysis of a very large topically categorized Web query log
-
ACM, New York, NY
-
BEITZEL, S. M., JENSEN, E. C., CHOWDHURY, A., GROSSMAN, D., AND FRIEDER, O. 2004. Hourly analysis of a very large topically categorized Web query log. In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). ACM, New York, NY, 321-328.
-
(2004)
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
, pp. 321-328
-
-
Beitzel, S.M.1
Jensen, E.C.2
Chowdhury, A.3
Grossman, D.4
Frieder, O.5
-
7
-
-
19944376183
-
The WebGraph framework I: Compression techniques
-
Thirteenth International World Wide Web Conference Proceedings, WWW2004
-
BOLDI, P. AND VIGNA, S. 2004. The webgraph framework I: compression techniques. In Proceedings of the 13th International Conference on World Wide Web (WWW). ACM Press, New York, NY, 595-602. (Pubitemid 40752793)
-
(2004)
Thirteenth International World Wide Web Conference Proceedings, WWW2004
, pp. 595-602
-
-
Boldi, P.1
Vigna, S.2
-
8
-
-
0038589165
-
The anatomy of a large-scale hypertextual Web search engine
-
Elsevier Science Publishers B. V., Amsterdam, The Netherlands
-
BRIN, S. AND PAGE, L. 1998. The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the Seventh International Conference on World Wide Web (WWW). Elsevier Science Publishers B. V., Amsterdam, The Netherlands, 107-117.
-
(1998)
Proceedings of the Seventh International Conference on World Wide Web (WWW)
, pp. 107-117
-
-
Brin, S.1
Page, L.2
-
9
-
-
0010362121
-
Syntactic clustering of the Web
-
Elsevier Science Publishers Ltd., Amsterdam, The Netherlands
-
BRODER, A. Z., GLASSMAN, S. C., MANASSE, M. S., AND ZWEIG, G. 1997. Syntactic clustering of the Web. In Selected Papers from the Sixth International Conference on World Wide Web. Elsevier Science Publishers Ltd., Amsterdam, The Netherlands, 1157-1166.
-
(1997)
Selected Papers from the Sixth International Conference on World Wide Web
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
10
-
-
0029193309
-
Searching distributed collections with inference networks
-
E. A. Fox, P. Ingwersen, and R. Fidel, Eds. ACM Press
-
CALLAN, J., LU, Z., AND CROFT, W. 1995. Searching distributed collections with inference networks. In Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. E. A. Fox, P. Ingwersen, and R. Fidel, Eds. ACM Press, 21-28.
-
(1995)
Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 21-28
-
-
Callan, J.1
Lu, Z.2
Croft, W.3
-
11
-
-
35448996113
-
Finding near neighbors through cluster pruning
-
DOI 10.1145/1265530.1265545, Proceedings of the Twenty-sixth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2007
-
CHIERICHETTI, F., PANCONESI, A., RAGHAVAN, P., SOZIO, M., TIBERI, A., AND UPFAL, E. 2007. Finding near neighbors through cluster pruning. In Proceedings of the Twenty-Sixth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS). ACM, New York, NY, 103-112. (Pubitemid 47620885)
-
(2007)
Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems
, pp. 103-112
-
-
Chierichetti, F.1
Panconesi, A.2
Raghavan, P.3
Sozio, M.4
Tiberi, A.5
Upfal, E.6
-
12
-
-
0013206133
-
Collection statistics for fast duplicate document detection
-
DOI 10.1145/506309.506311
-
CHOWDHURY, A., FRIEDER, O., GROSSMAN, D., AND MCCABE, M. C. 2002. Collection statistics for fast duplicate document detection. ACM Trans. Inform. Syst. 20, 2, 171-191. (Pubitemid 44642301)
-
(2002)
ACM Transactions on Information Systems
, vol.20
, Issue.2
, pp. 171-191
-
-
Chowdhury, A.1
Frieder, O.2
Grossman, D.3
McCabe, M.C.4
-
13
-
-
85030321143
-
Mapreduce: Simplified data processing on large clusters
-
USENIX Association, Berkeley, CA
-
DEAN, J. AND GHEMAWAT, S. 2004. Mapreduce: simplified data processing on large clusters. In Proceedings of the 6th Conference Symposium on Operating Systems Design and Implementation (OSDI). USENIX Association, Berkeley, CA, 10-10.
-
(2004)
Proceedings of the 6th Conference Symposium on Operating Systems Design and Implementation (OSDI)
, pp. 10-10
-
-
Dean, J.1
Ghemawat, S.2
-
15
-
-
33745196401
-
Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data
-
FAGNI, T., PEREGO, R., SILVESTRI, F., AND ORLANDO, S. 2006. Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data. ACM Trans. Inform. Syst. 24, 1, 51-78.
-
(2006)
ACM Trans. Inform. Syst.
, vol.24
, Issue.1
, pp. 51-78
-
-
Fagni, T.1
Perego, R.2
Silvestri, F.3
Orlando, S.4
-
16
-
-
84982442246
-
On the allocation of documents in multiprocessor information retrieval systems
-
ACM Press, New York, NY
-
FRIEDER, O. AND SIEGELMANN, H. T. 1991. On the allocation of documents in multiprocessor information retrieval systems. In Proceedings of the 14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). ACM Press, New York, NY, 230-239.
-
(1991)
Proceedings of the 14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
, pp. 230-239
-
-
Frieder, O.1
Siegelmann, H.T.2
-
17
-
-
70449353970
-
-
Google
-
Google. 2007. Google begins move to universal search. http://www.google.com/intl/en/press/pressrel/universalsearch-20070516.html.
-
(2007)
Google Begins Move to Universal Search
-
-
-
18
-
-
0001858763
-
Generalizing GlOSS to vector-space databases and broker hierarchies
-
Morgan Kaufmann Publishers Inc., San Francisco, CA
-
GRAVANO, L. AND GARCIA-MOLINA, H. 1995. Generalizing GlOSS to vector-space databases and broker hierarchies. In Proceedings of the 21th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann Publishers Inc., San Francisco, CA, 78-89.
-
(1995)
Proceedings of the 21th International Conference on Very Large Data Bases (VLDB)
, pp. 78-89
-
-
Gravano, L.1
Garcia-Molina, H.2
-
19
-
-
34547460091
-
-
Techn. note number STAN-CS-TN-94-10, Stanford University
-
GRAVANO, L., GARCIA-MOLINA, H., AND TOMASIC, A. 1994. Precision and recall of GlOSS estimators for database discovery. Techn. note number STAN-CS-TN-94-10, Stanford University.
-
(1994)
Precision and Recall of GlOSS Estimators for Database Discovery
-
-
Gravano, L.1
Garcia-Molina, H.2
Tomasic, A.3
-
20
-
-
0037319544
-
Methods for identifying versioned and plagiarized documents
-
HOAD, T. C. AND ZOBEL, J. 2003. Methods for identifying versioned and plagiarized documents. J. Amer. Soc. Inform. Sci. Tech. 54, 3, 203-215.
-
(2003)
J. Amer. Soc. Inform. Sci. Tech.
, vol.54
, Issue.3
, pp. 203-215
-
-
Hoad, T.C.1
Zobel, J.2
-
22
-
-
23744485775
-
How are we searching the World Wide Web? A comparison of nine search engine transaction logs
-
DOI 10.1016/j.ipm.2004.10.007, PII S0306457304001396
-
JANSEN, B. AND SPINK, A. 2006. How are we searching the World Wide Web? A comparison of nine search engine transaction logs. Inform Proc. and Management 42, 248-263. (Pubitemid 41119088)
-
(2006)
Information Processing and Management
, vol.42
, Issue.1 SPEC. ISS
, pp. 248-263
-
-
Jansen, B.J.1
Spink, A.2
-
23
-
-
0001685668
-
Real life information retrieval: A study of user queries on the web
-
JANSEN, B. J., SPINK, A., BATEMAN, J., AND SARACEVIC, T. 1998. Real life information retrieval: a study of user queries on the Web. SIGIR Forum 32, 1, 5-17. (Pubitemid 128619000)
-
(1998)
SIGIR Forum (ACM Special Interest Group on Information Retrieval)
, vol.32
, Issue.1
, pp. 5-17
-
-
Jansen, B.J.1
Spink, A.2
Bateman, J.3
Saracevic, T.4
-
24
-
-
0028384238
-
Caching strategies to improve disk system performance
-
KAREDLA, R., LOVE, J. S., AND WHERRY, B. G. 1994. Caching strategies to improve disk system performance. Computer 27, 3, 38-46.
-
(1994)
Computer
, vol.27
, Issue.3
, pp. 38-46
-
-
Karedla, R.1
Love, J.S.2
Wherry, B.G.3
-
25
-
-
84951717275
-
Collection selection and results merging with topically organized U. S. patents and TREC data
-
ACM Press, New York, NY
-
LARKEY, L. S., CONNELL, M. E., AND CALLAN, J. 2000. Collection selection and results merging with topically organized U. S. patents and TREC data. In Proceedings of the 9th International Conference on Information and Knowledge Management (CIKM). ACM Press, New York, NY, 282-289.
-
(2000)
Proceedings of the 9th International Conference on Information and Knowledge Management (CIKM)
, pp. 282-289
-
-
Larkey, L.S.1
Connell, M.E.2
Callan, J.3
-
26
-
-
56049107379
-
Predictive caching and prefetching of query results in search engines
-
ACM, New York, NY
-
LEMPEL, R. AND MORAN, S. 2003. Predictive caching and prefetching of query results in search engines. In Proceedings of the 12th International Conference on World Wide Web (WWW). ACM, New York, NY, 19-28.
-
(2003)
Proceedings of the 12th International Conference on World Wide Web (WWW)
, pp. 19-28
-
-
Lempel, R.1
Moran, S.2
-
27
-
-
8644243122
-
Cluster-based retrieval using language models
-
ACM Press, New York, NY
-
LIU, X. AND CROFT, W. B. 2004. Cluster-based retrieval using language models. In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). ACM Press, New York, NY, 186-193.
-
(2004)
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
, pp. 186-193
-
-
Liu, X.1
Croft, W.B.2
-
28
-
-
0035253949
-
On caching search engine query results
-
DOI 10.1016/S0140-3664(00)00308-X
-
MARKATOS, E. P. 2001. On caching search engine query results. Comput. Comm. 24, 2, 137-143. (Pubitemid 32092795)
-
(2001)
Computer Communications
, vol.24
, Issue.2
, pp. 137-143
-
-
Markatos, E.P.1
-
32
-
-
57349102877
-
Query-sets: Using implicit feedback and query patterns to organize Web documents
-
ACM, New York, NY, USA
-
POBLETE, B. AND BAEZA-YATES, R. 2008. Query-sets: using implicit feedback and query patterns to organize Web documents. In Proceedings of the 17th International Conference on World Wide Web (WWW). ACM, New York, NY, USA, 41-50.
-
(2008)
Proceedings of the 17th International Conference on World Wide Web (WWW)
, pp. 41-50
-
-
Poblete, B.1
Baeza-Yates, R.2
-
34
-
-
34547626215
-
The query-vector document model
-
DOI 10.1145/1183614.1183777, Proceedings of the 15th ACM Conference on Information and Knowledge Management, CIKM 2006
-
PUPPIN, D. AND SILVESTRI, F. 2006. The query-vector document model. In Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM). ACM, New York, NY, 880-881. (Pubitemid 47203675)
-
(2006)
International Conference on Information and Knowledge Management, Proceedings
, pp. 880-881
-
-
Puppin, D.1
Silvestri, F.2
-
35
-
-
41849115046
-
Query-driven document partitioning and collection selection (invited paper)
-
ACM, New York, NY, USA
-
PUPPIN, D., SILVESTRI, F., AND LAFORENZA, D. 2006. Query-driven document partitioning and collection selection (invited paper). In Proceedings of the 1st International Conference on Scalable Information Systems (InfoScale). ACM, New York, NY, USA, 34.
-
(2006)
Proceedings of the 1st International Conference on Scalable Information Systems (InfoScale)
, pp. 34
-
-
Puppin, D.1
Silvestri, F.2
Laforenza, D.3
-
36
-
-
84940835729
-
Load-balancing and caching for collection selection architectures
-
ICST, Brussels, Belgium
-
PUPPIN, D., SILVESTRI, F., PEREGO, R., AND BAEZA-YATES, R. 2007. Load-balancing and caching for collection selection architectures. In Proceedings of the 2nd International Conference on Scalable Information Systems (InfoScale). ICST, Brussels, Belgium, 1-10.
-
(2007)
Proceedings of the 2nd International Conference on Scalable Information Systems (InfoScale)
, pp. 1-10
-
-
Puppin, D.1
Silvestri, F.2
Perego, R.3
Baeza-Yates, R.4
-
37
-
-
0029205263
-
On the reuse of past optimal queries
-
ACM, New York, NY
-
RAGHAVAN, V. V. AND SEVER, H. 1995. On the reuse of past optimal queries. In SIGIR'95: Proceedings of the 18th annual international ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). ACM, New York, NY, 344-350.
-
(1995)
SIGIR'95: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
, pp. 344-350
-
-
Raghavan, V.V.1
Sever, H.2
-
38
-
-
84948402438
-
The link database: Fast access to graphs of the Web
-
Los Alamitos, CA
-
RANDALL, K. H., STATA, R., WIENER, J. L., AND WICKREMESINGHE, R. G. 2002. The link database: Fast access to graphs of the Web. In Proceedings of the Data Compression Conference (DCC). IEEE Computer Society, Los Alamitos, CA, 122.
-
(2002)
Proceedings of the Data Compression Conference (DCC). IEEE Computer Society
, pp. 122
-
-
Randall, K.H.1
Stata, R.2
Wiener, J.L.3
Wickremesinghe, R.G.4
-
39
-
-
84966534942
-
Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval
-
Springer-Verlag, Berlin, Germany
-
ROBERTSON, S. E. AND WALKER, S. 1994. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). Springer-Verlag, Berlin, Germany, 232-241.
-
(1994)
Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
, pp. 232-241
-
-
Robertson, S.E.1
Walker, S.2
-
40
-
-
3042687811
-
Analysis of a very large Web search engine query log
-
SILVERSTEIN, C., HENZINGER, M., MARAIS, H., AND MORICZ, M. 1999. Analysis of a very large Web search engine query log. In ACM SIGIR Forum. 6-12.
-
(1999)
ACM SIGIR Forum
, pp. 6-12
-
-
Silverstein, C.1
Henzinger, M.2
Marais, H.3
Moricz, M.4
-
41
-
-
37149046010
-
Sorting out the document identifier assignment problem
-
Advances in Information Retrieval - 29th European Conference on IR Research, ECIR 2007, Proceedings
-
SILVESTRI, F. 2007. Sorting out the document identifier assignment problem. In Proceedings of the European Conference on IR Research (ECIR). G. Amati, C. Carpineto, and G. Romano, Eds. Lecture Notes in Computer Science, vol. 4425. Springer, 101-112. (Pubitemid 350259543)
-
(2007)
Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.4425
, pp. 101-112
-
-
Silvestri, F.1
-
42
-
-
0031177232
-
Data structures for efficient broker implementation
-
TOMASIC, A., GRAVANO, L., LUE, C., SCHWARZ, P., AND HAAS, L. 1997. Data structures for efficient broker implementation. ACM Trans. Inform. Syst. 15, 3, 223-253. (Pubitemid 127685705)
-
(1997)
ACM Transactions on Information Systems
, vol.15
, Issue.3
, pp. 223-253
-
-
Tomasic, A.1
Gravano, L.2
Lue, C.3
Schwarz, P.4
Haas, L.5
-
44
-
-
34250184471
-
A pipelined architecture for distributed text query evaluation
-
WEBBER, W., MOFFAT, A., ZOBEL, J., AND BAEZA-YATES, R. 2006. A pipelined architecture for distributed text query evaluation. Inform. Retrieval. 10, 3.
-
(2006)
Inform. Retrieval
, vol.10
, pp. 3
-
-
Webber, W.1
Moffat, A.2
Zobel, J.3
Baeza-Yates, R.4
|