-
1
-
-
34147094956
-
-
D. Hawking, N. Craswell, P.B. Thistlewaite, Overview of TREC-7 very large collection track, in: Proceedings of The Seventh TREC Conference, 1998, pp. 40-52.
-
-
-
-
2
-
-
34147158027
-
-
C. Clarke, N. Craswell, I. Soboroff. Terabyte track, http://www-nlpir.nist.gov/projects/terabyte/, 2003.
-
-
-
-
3
-
-
0033714666
-
-
J. Hirai, H. Garcia-Molina, A. Paepcke, S. Raghavan, WebBase: a repository of web pages, in: Proceedings of The Ninth International World Wide Web Conference, 2000, pp. 277-293.
-
-
-
-
4
-
-
34147164764
-
-
NTCIR Patent Retrieval Task, http://www.slis.tsukuba.ac.jp/~fujii/ntcir5/cfp-en.html, 2005.
-
-
-
-
5
-
-
0037619265
-
Web search for a planet: the Google cluster architecture
-
Barroso L.A., Dean J., and Hölzle U. Web search for a planet: the Google cluster architecture. IEEE Micro. 23 2 (2003) 22-28
-
(2003)
IEEE Micro.
, vol.23
, Issue.2
, pp. 22-28
-
-
Barroso, L.A.1
Dean, J.2
Hölzle, U.3
-
6
-
-
0038274863
-
Efficient single-pass index construction for text databases
-
Heinz S., and Zobel J. Efficient single-pass index construction for text databases. J. Am. Soc. Inform. Sci. Technol. 54 8 (2003) 713-729
-
(2003)
J. Am. Soc. Inform. Sci. Technol.
, vol.54
, Issue.8
, pp. 713-729
-
-
Heinz, S.1
Zobel, J.2
-
7
-
-
34147093931
-
-
R. Baeza-Yates, B. Ribeiro-Neto, Modern Information Retrieval, ACM Press, 1999.
-
-
-
-
8
-
-
29244432781
-
Efficient online index maintenance for contiguous inverted lists
-
Lester N., Zobel J., and Williams H. Efficient online index maintenance for contiguous inverted lists. Informat. Process. Manage. 42 (2006) 916-933
-
(2006)
Informat. Process. Manage.
, vol.42
, pp. 916-933
-
-
Lester, N.1
Zobel, J.2
Williams, H.3
-
10
-
-
34147099823
-
-
A. MacFarlane, S.E. Robertson, J.A. McCann, On concurrency control of inverted files, in: F.C. Johnson (Ed.), Proceedings of the 18th MCS IRSG Annual Colloquium on Information Retrieval Research, 26-27 March 1996, pp. 67-79.
-
-
-
-
11
-
-
33746091908
-
-
U. Manber, S. Wu, Glimpse: a tool to search through entire file systems, in: Proceedings of the USENIX Winter 1994 Technical Conference, 1994, pp. 23-32.
-
-
-
-
12
-
-
0342521304
-
Compression: a key for next-generation text retrieval systems
-
Ziviani N., de Moura E.S., Navarro G., and Baeza-Yates R. Compression: a key for next-generation text retrieval systems. IEEE Comput. 33 11 (2000) 37-44
-
(2000)
IEEE Comput.
, vol.33
, Issue.11
, pp. 37-44
-
-
Ziviani, N.1
de Moura, E.S.2
Navarro, G.3
Baeza-Yates, R.4
-
13
-
-
0016486577
-
Universal codeword sets and the representation of the integers
-
Elias P. Universal codeword sets and the representation of the integers. IEEE Trans. Inform. Theory 21 2 (1975) 194-203
-
(1975)
IEEE Trans. Inform. Theory
, vol.21
, Issue.2
, pp. 194-203
-
-
Elias, P.1
-
14
-
-
3843079609
-
Compressing inverted files
-
Trotman A. Compressing inverted files. Inform. Retriev. 6 1 (2003) 5-19
-
(2003)
Inform. Retriev.
, vol.6
, Issue.1
, pp. 5-19
-
-
Trotman, A.1
-
15
-
-
0032654288
-
Compressing integers for fast file access
-
Williams H., and Zobel J. Compressing integers for fast file access. Comput. J. 42 3 (1999) 193-201
-
(1999)
Comput. J.
, vol.42
, Issue.3
, pp. 193-201
-
-
Williams, H.1
Zobel, J.2
-
17
-
-
34147145114
-
Combining systems and databases: a search engine retrospective
-
Hellerstein J.M., and Stonebraker M. (Eds), MIT Press, Cmabridge, MA
-
Brewer E.A. Combining systems and databases: a search engine retrospective. In: Hellerstein J.M., and Stonebraker M. (Eds). Reading in Database Systems. Fourth Edition (2005), MIT Press, Cmabridge, MA
-
(2005)
Reading in Database Systems. Fourth Edition
-
-
Brewer, E.A.1
-
18
-
-
34147105703
-
-
A. Fox, E.A. Brewer, Harvest, yield, and scalable tolerant systems, in: Proceedings of the 16th SOSP, St. Malo, France, October 1997.
-
-
-
-
19
-
-
1542640153
-
Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services
-
Gilbert S., and Lynch N. Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services. Sigact News 33 2 (2002) 51-59
-
(2002)
Sigact News
, vol.33
, Issue.2
, pp. 51-59
-
-
Gilbert, S.1
Lynch, N.2
-
21
-
-
34147182450
-
-
C.L.A. Clarke, G. V. Cormack, Dynamic inverted indexes for a distributed full-text retrieval system, Technical Report MT-95-01, University of Waterloo, 1995.
-
-
-
-
22
-
-
78149380878
-
-
B. A. Ribeiro-Nero, J. P. Kitajima, G. Navarro, C. Santana, N. Ziviani, Parallel generation of inverted files for distributed text collections, in: Proceedings of the 18th International Conference of the Chilean Society of Computer Science, Chile, 1998, pp. 149-157.
-
-
-
-
23
-
-
84994692628
-
-
B. Ribeiro-Neto, E.S. Moura, M.S. Neubert, N. Ziviani, Efficient distributed algorithms to build inverted files, in: Proceedings of The 22nd Annual International ACM SIGIR conference on Research and development in information retrieval, Berkeley, 1999, pp. 105-112.
-
-
-
-
24
-
-
0038564328
-
Burst tries: a fast, efficient data structure for string keys
-
Heinz S., Zobel J., and Williams H.E. Burst tries: a fast, efficient data structure for string keys. ACM Trans. Inform. Syst. 20 2 (2002) 192-223
-
(2002)
ACM Trans. Inform. Syst.
, vol.20
, Issue.2
, pp. 192-223
-
-
Heinz, S.1
Zobel, J.2
Williams, H.E.3
-
25
-
-
0008585418
-
On methods of Chinese automatic word segmentation
-
Kit C., Liu Y., and Liang N. On methods of Chinese automatic word segmentation. J. Chin. Inform. Process. 3 1 (1989) 13-20
-
(1989)
J. Chin. Inform. Process.
, vol.3
, Issue.1
, pp. 13-20
-
-
Kit, C.1
Liu, Y.2
Liang, N.3
-
26
-
-
0038895148
-
Building a distributed full-text index for the Web
-
Melnik S., Raghavan S., Yang B., and Garcia-Molina H. Building a distributed full-text index for the Web. ACM Trans. Inform. Syst. 19 3 (2001) 217-247
-
(2001)
ACM Trans. Inform. Syst.
, vol.19
, Issue.3
, pp. 217-247
-
-
Melnik, S.1
Raghavan, S.2
Yang, B.3
Garcia-Molina, H.4
-
28
-
-
0003756969
-
-
Morgan Kaufmann Publishers, Los Alios, CA
-
Witten I., Moffat A., and Bell T.C. Managing Gigabytes: Compressing and Indexing Documents and Images (1999), Morgan Kaufmann Publishers, Los Alios, CA
-
(1999)
Managing Gigabytes: Compressing and Indexing Documents and Images
-
-
Witten, I.1
Moffat, A.2
Bell, T.C.3
-
29
-
-
0023385618
-
Description and performance analysis of signature file methods
-
Faloutsos C., and Christodoulakis S. Description and performance analysis of signature file methods. ACM Trans. Office Inform. Syst. 5 3 (1987) 237-257
-
(1987)
ACM Trans. Office Inform. Syst.
, vol.5
, Issue.3
, pp. 237-257
-
-
Faloutsos, C.1
Christodoulakis, S.2
-
30
-
-
85030158691
-
-
D. Cutting, J. Petersen, Optimizations for dynamic inverted index maintenance, in: Proceedings of the Thirteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1990, pp. 405-411.
-
-
-
-
31
-
-
0028447796
-
-
A. Tomasic, H. Garcia-Molina, K.A. Shoens, Incremental updates of inverted lists for text document retrieval, in: Proceedings of The ACM SIGMOD International Conference on Management of Data, 1994, pp. 289-300.
-
-
-
-
32
-
-
34147147765
-
-
E.W. Brown, J.P. Callan, W.B. Croft, Fast incremental indexing for full-text information retrieval, in: Proceedings of The 20th VLDB Conference, 1994, pp. 192-202.
-
-
-
-
33
-
-
34147184922
-
-
J. Zobel, A. Moffat, R. Sacks-Davis, Storage management for files of dynamic records, in: Proceedings of the Fourth Australian Database Conference, 1993, pp. 26-38.
-
-
-
-
34
-
-
7544242035
-
A statistics-based approach to incrementally update inverted files
-
Shieh W.-Y., and Chung C.-P. A statistics-based approach to incrementally update inverted files. Inform. Process. Manage. 41 2 (2005) 275-288
-
(2005)
Inform. Process. Manage.
, vol.41
, Issue.2
, pp. 275-288
-
-
Shieh, W.-Y.1
Chung, C.-P.2
-
36
-
-
84963800436
-
-
C. Badue, B.A. Ribeiro-Neto, R. Baeza-Yates, N. Ziviani, Distributed query processing using partitioned inverted files, in: Proceedings of the Eighth International Symposium on String Processing and Information Retrieval (SPIRE 2001), 2001, pp. 10-20.
-
-
-
-
37
-
-
84964514173
-
-
A. MacFarlane, J.A. McCann, S.E. Robertson, Parallel search using partitioned inverted files, in: Proceedings of the Seventh International Symposium on String Processing and Information Retrieval (SPIRE 2000), 2000, pp. 209-220.
-
-
-
-
38
-
-
0029193309
-
-
J.P. Callan, Z. Lu, W.B. Croft, Searching distributed collections with inference networks, in: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, 1995, pp. 21-28.
-
-
-
-
39
-
-
84976839683
-
-
C. Stanfill, Partitioned posting files: a parallel inverted file structure for information retrieval, in: Proceedings of the 13th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Brussels, 1990, pp. 413-428.
-
-
-
-
40
-
-
0029255005
-
Inverted file partitioning schemes in multiple disk systems
-
Jeong B.-S., and Omiecinski E. Inverted file partitioning schemes in multiple disk systems. IEEE Trans. Parallel Distribut. Syst. 6 2 (1995) 142-153
-
(1995)
IEEE Trans. Parallel Distribut. Syst.
, vol.6
, Issue.2
, pp. 142-153
-
-
Jeong, B.-S.1
Omiecinski, E.2
-
41
-
-
84902799141
-
-
S.-H. Chung, S.-C. Oh, K.R. Ryu, S.-H. Park, Parallel information retrieval on a distributed memory multiprocessor system, in: Proceedings of the International Conference on Algorithms and Architectures for Parallel Processing (ICAPP 97), 1997, pp. 163-176.
-
-
-
-
42
-
-
34147127191
-
-
P. Bailey, D. Hawking, A parallel architecture for query processing over a terabyte of text, Technical Report TR-CS-96-04, The Australia National University, June 1996.
-
-
-
-
43
-
-
84961839082
-
-
N. Goharian, T. El-Ghazawi, D. Grossman, Enterprise text processing: a sparse matrix approach, in: Proceedings of International Conference on Information Technology: Coding and Computing,, 2001, pp. 71-75.
-
-
-
-
44
-
-
0022877691
-
Parallel free-text search on the connection machine system
-
Stanfill C., and Kahle B. Parallel free-text search on the connection machine system. Commun. ACM 29 12 (1986) 1229-1239
-
(1986)
Commun. ACM
, vol.29
, Issue.12
, pp. 1229-1239
-
-
Stanfill, C.1
Kahle, B.2
-
45
-
-
34147121232
-
-
Z. Lu, K.S. McKinley, B. Cahoon, The hardware/software balancing act for information retrieval on symmetric multiprocessors, in: Proceedings of Euro-Par 98, 1998, pp. 521-527.
-
-
-
-
46
-
-
84970879656
-
-
S. Melnik, S. Raghavan, B. Yang, H. Garcia-Molina, Building a distributed full-text index for the Web, in: Proceedings of The 10th International Conference on World Wide Web, Hong Kong, 2001, pp. 396-406.
-
-
-
-
47
-
-
0037722069
-
The state of the art in locally distributed web servers
-
Cardellini V., Casalicchio E., Colajanni M., and Yu P.S. The state of the art in locally distributed web servers. ACM Comput. Surveys 34 2 (2002) 263-311
-
(2002)
ACM Comput. Surveys
, vol.34
, Issue.2
, pp. 263-311
-
-
Cardellini, V.1
Casalicchio, E.2
Colajanni, M.3
Yu, P.S.4
-
48
-
-
0035390859
-
Lessons from giant-scale services
-
Brewer E.A. Lessons from giant-scale services. IEEE Internet Comput. 5 4 (2001) 46-55
-
(2001)
IEEE Internet Comput.
, vol.5
, Issue.4
, pp. 46-55
-
-
Brewer, E.A.1
-
49
-
-
84880240041
-
Searching the web
-
Arasu A., Cho J., Garcia-Molina H., Paepcke A., and Raghavan S. Searching the web. ACM Trans. Internet Technol. 1 1 (2001) 2-43
-
(2001)
ACM Trans. Internet Technol.
, vol.1
, Issue.1
, pp. 2-43
-
-
Arasu, A.1
Cho, J.2
Garcia-Molina, H.3
Paepcke, A.4
Raghavan, S.5
-
50
-
-
84989565544
-
An analysis of performance and cost factors in searching large text databases using parallel search systems
-
Couvreur T.R., Benzel R.N., Miller S.F., Zeitler D.N., Lee D.L., Singhai M., Shivaratri N., and Wong W.Y.P. An analysis of performance and cost factors in searching large text databases using parallel search systems. J. Am. Soc. Inform. Sci. 45 7 (1994) 443-464
-
(1994)
J. Am. Soc. Inform. Sci.
, vol.45
, Issue.7
, pp. 443-464
-
-
Couvreur, T.R.1
Benzel, R.N.2
Miller, S.F.3
Zeitler, D.N.4
Lee, D.L.5
Singhai, M.6
Shivaratri, N.7
Wong, W.Y.P.8
-
51
-
-
0003214715
-
Evaluating the performance of distributed architectures for information retrieval using a variety of workloads
-
Cahoon B., McKinley K.S., and Lu Z. Evaluating the performance of distributed architectures for information retrieval using a variety of workloads. ACM Trans. Inform. Syst. 18 1 (2000) 1-43
-
(2000)
ACM Trans. Inform. Syst.
, vol.18
, Issue.1
, pp. 1-43
-
-
Cahoon, B.1
McKinley, K.S.2
Lu, Z.3
|