메뉴 건너뛰기




Volumn 32, Issue 5, 2007, Pages 733-754

Efficient in-memory extensible inverted file

Author keywords

Indexing; Information retrieval; Optimization

Indexed keywords

AUTOMATIC INDEXING; DATA REDUCTION; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS; PERSONAL COMPUTERS; RANDOM ACCESS STORAGE; STORAGE ALLOCATION (COMPUTER); WORD PROCESSING;

EID: 34147114386     PISSN: 03064379     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.is.2006.06.001     Document Type: Article
Times cited : (16)

References (51)
  • 1
    • 34147094956 scopus 로고    scopus 로고
    • D. Hawking, N. Craswell, P.B. Thistlewaite, Overview of TREC-7 very large collection track, in: Proceedings of The Seventh TREC Conference, 1998, pp. 40-52.
  • 2
    • 34147158027 scopus 로고    scopus 로고
    • C. Clarke, N. Craswell, I. Soboroff. Terabyte track, http://www-nlpir.nist.gov/projects/terabyte/, 2003.
  • 3
    • 0033714666 scopus 로고    scopus 로고
    • J. Hirai, H. Garcia-Molina, A. Paepcke, S. Raghavan, WebBase: a repository of web pages, in: Proceedings of The Ninth International World Wide Web Conference, 2000, pp. 277-293.
  • 4
    • 34147164764 scopus 로고    scopus 로고
    • NTCIR Patent Retrieval Task, http://www.slis.tsukuba.ac.jp/~fujii/ntcir5/cfp-en.html, 2005.
  • 5
    • 0037619265 scopus 로고    scopus 로고
    • Web search for a planet: the Google cluster architecture
    • Barroso L.A., Dean J., and Hölzle U. Web search for a planet: the Google cluster architecture. IEEE Micro. 23 2 (2003) 22-28
    • (2003) IEEE Micro. , vol.23 , Issue.2 , pp. 22-28
    • Barroso, L.A.1    Dean, J.2    Hölzle, U.3
  • 6
    • 0038274863 scopus 로고    scopus 로고
    • Efficient single-pass index construction for text databases
    • Heinz S., and Zobel J. Efficient single-pass index construction for text databases. J. Am. Soc. Inform. Sci. Technol. 54 8 (2003) 713-729
    • (2003) J. Am. Soc. Inform. Sci. Technol. , vol.54 , Issue.8 , pp. 713-729
    • Heinz, S.1    Zobel, J.2
  • 7
    • 34147093931 scopus 로고    scopus 로고
    • R. Baeza-Yates, B. Ribeiro-Neto, Modern Information Retrieval, ACM Press, 1999.
  • 8
    • 29244432781 scopus 로고    scopus 로고
    • Efficient online index maintenance for contiguous inverted lists
    • Lester N., Zobel J., and Williams H. Efficient online index maintenance for contiguous inverted lists. Informat. Process. Manage. 42 (2006) 916-933
    • (2006) Informat. Process. Manage. , vol.42 , pp. 916-933
    • Lester, N.1    Zobel, J.2    Williams, H.3
  • 10
    • 34147099823 scopus 로고    scopus 로고
    • A. MacFarlane, S.E. Robertson, J.A. McCann, On concurrency control of inverted files, in: F.C. Johnson (Ed.), Proceedings of the 18th MCS IRSG Annual Colloquium on Information Retrieval Research, 26-27 March 1996, pp. 67-79.
  • 11
    • 33746091908 scopus 로고    scopus 로고
    • U. Manber, S. Wu, Glimpse: a tool to search through entire file systems, in: Proceedings of the USENIX Winter 1994 Technical Conference, 1994, pp. 23-32.
  • 12
    • 0342521304 scopus 로고    scopus 로고
    • Compression: a key for next-generation text retrieval systems
    • Ziviani N., de Moura E.S., Navarro G., and Baeza-Yates R. Compression: a key for next-generation text retrieval systems. IEEE Comput. 33 11 (2000) 37-44
    • (2000) IEEE Comput. , vol.33 , Issue.11 , pp. 37-44
    • Ziviani, N.1    de Moura, E.S.2    Navarro, G.3    Baeza-Yates, R.4
  • 13
    • 0016486577 scopus 로고
    • Universal codeword sets and the representation of the integers
    • Elias P. Universal codeword sets and the representation of the integers. IEEE Trans. Inform. Theory 21 2 (1975) 194-203
    • (1975) IEEE Trans. Inform. Theory , vol.21 , Issue.2 , pp. 194-203
    • Elias, P.1
  • 14
    • 3843079609 scopus 로고    scopus 로고
    • Compressing inverted files
    • Trotman A. Compressing inverted files. Inform. Retriev. 6 1 (2003) 5-19
    • (2003) Inform. Retriev. , vol.6 , Issue.1 , pp. 5-19
    • Trotman, A.1
  • 15
    • 0032654288 scopus 로고    scopus 로고
    • Compressing integers for fast file access
    • Williams H., and Zobel J. Compressing integers for fast file access. Comput. J. 42 3 (1999) 193-201
    • (1999) Comput. J. , vol.42 , Issue.3 , pp. 193-201
    • Williams, H.1    Zobel, J.2
  • 16
  • 17
    • 34147145114 scopus 로고    scopus 로고
    • Combining systems and databases: a search engine retrospective
    • Hellerstein J.M., and Stonebraker M. (Eds), MIT Press, Cmabridge, MA
    • Brewer E.A. Combining systems and databases: a search engine retrospective. In: Hellerstein J.M., and Stonebraker M. (Eds). Reading in Database Systems. Fourth Edition (2005), MIT Press, Cmabridge, MA
    • (2005) Reading in Database Systems. Fourth Edition
    • Brewer, E.A.1
  • 18
    • 34147105703 scopus 로고    scopus 로고
    • A. Fox, E.A. Brewer, Harvest, yield, and scalable tolerant systems, in: Proceedings of the 16th SOSP, St. Malo, France, October 1997.
  • 19
    • 1542640153 scopus 로고    scopus 로고
    • Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services
    • Gilbert S., and Lynch N. Brewer's conjecture and the feasibility of consistent, available, partition-tolerant web services. Sigact News 33 2 (2002) 51-59
    • (2002) Sigact News , vol.33 , Issue.2 , pp. 51-59
    • Gilbert, S.1    Lynch, N.2
  • 21
    • 34147182450 scopus 로고    scopus 로고
    • C.L.A. Clarke, G. V. Cormack, Dynamic inverted indexes for a distributed full-text retrieval system, Technical Report MT-95-01, University of Waterloo, 1995.
  • 22
    • 78149380878 scopus 로고    scopus 로고
    • B. A. Ribeiro-Nero, J. P. Kitajima, G. Navarro, C. Santana, N. Ziviani, Parallel generation of inverted files for distributed text collections, in: Proceedings of the 18th International Conference of the Chilean Society of Computer Science, Chile, 1998, pp. 149-157.
  • 23
    • 84994692628 scopus 로고    scopus 로고
    • B. Ribeiro-Neto, E.S. Moura, M.S. Neubert, N. Ziviani, Efficient distributed algorithms to build inverted files, in: Proceedings of The 22nd Annual International ACM SIGIR conference on Research and development in information retrieval, Berkeley, 1999, pp. 105-112.
  • 24
    • 0038564328 scopus 로고    scopus 로고
    • Burst tries: a fast, efficient data structure for string keys
    • Heinz S., Zobel J., and Williams H.E. Burst tries: a fast, efficient data structure for string keys. ACM Trans. Inform. Syst. 20 2 (2002) 192-223
    • (2002) ACM Trans. Inform. Syst. , vol.20 , Issue.2 , pp. 192-223
    • Heinz, S.1    Zobel, J.2    Williams, H.E.3
  • 25
    • 0008585418 scopus 로고
    • On methods of Chinese automatic word segmentation
    • Kit C., Liu Y., and Liang N. On methods of Chinese automatic word segmentation. J. Chin. Inform. Process. 3 1 (1989) 13-20
    • (1989) J. Chin. Inform. Process. , vol.3 , Issue.1 , pp. 13-20
    • Kit, C.1    Liu, Y.2    Liang, N.3
  • 27
    • 0032268976 scopus 로고    scopus 로고
    • Inverted files versus signature files for text indexing
    • Zobel J., Moffat A., and Ramanohanarao K. Inverted files versus signature files for text indexing. ACM Trans. Database Syst. 23 4 (1998) 453-490
    • (1998) ACM Trans. Database Syst. , vol.23 , Issue.4 , pp. 453-490
    • Zobel, J.1    Moffat, A.2    Ramanohanarao, K.3
  • 29
    • 0023385618 scopus 로고
    • Description and performance analysis of signature file methods
    • Faloutsos C., and Christodoulakis S. Description and performance analysis of signature file methods. ACM Trans. Office Inform. Syst. 5 3 (1987) 237-257
    • (1987) ACM Trans. Office Inform. Syst. , vol.5 , Issue.3 , pp. 237-257
    • Faloutsos, C.1    Christodoulakis, S.2
  • 30
    • 85030158691 scopus 로고    scopus 로고
    • D. Cutting, J. Petersen, Optimizations for dynamic inverted index maintenance, in: Proceedings of the Thirteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1990, pp. 405-411.
  • 31
    • 0028447796 scopus 로고    scopus 로고
    • A. Tomasic, H. Garcia-Molina, K.A. Shoens, Incremental updates of inverted lists for text document retrieval, in: Proceedings of The ACM SIGMOD International Conference on Management of Data, 1994, pp. 289-300.
  • 32
    • 34147147765 scopus 로고    scopus 로고
    • E.W. Brown, J.P. Callan, W.B. Croft, Fast incremental indexing for full-text information retrieval, in: Proceedings of The 20th VLDB Conference, 1994, pp. 192-202.
  • 33
    • 34147184922 scopus 로고    scopus 로고
    • J. Zobel, A. Moffat, R. Sacks-Davis, Storage management for files of dynamic records, in: Proceedings of the Fourth Australian Database Conference, 1993, pp. 26-38.
  • 34
    • 7544242035 scopus 로고    scopus 로고
    • A statistics-based approach to incrementally update inverted files
    • Shieh W.-Y., and Chung C.-P. A statistics-based approach to incrementally update inverted files. Inform. Process. Manage. 41 2 (2005) 275-288
    • (2005) Inform. Process. Manage. , vol.41 , Issue.2 , pp. 275-288
    • Shieh, W.-Y.1    Chung, C.-P.2
  • 36
    • 84963800436 scopus 로고    scopus 로고
    • C. Badue, B.A. Ribeiro-Neto, R. Baeza-Yates, N. Ziviani, Distributed query processing using partitioned inverted files, in: Proceedings of the Eighth International Symposium on String Processing and Information Retrieval (SPIRE 2001), 2001, pp. 10-20.
  • 37
    • 84964514173 scopus 로고    scopus 로고
    • A. MacFarlane, J.A. McCann, S.E. Robertson, Parallel search using partitioned inverted files, in: Proceedings of the Seventh International Symposium on String Processing and Information Retrieval (SPIRE 2000), 2000, pp. 209-220.
  • 38
    • 0029193309 scopus 로고    scopus 로고
    • J.P. Callan, Z. Lu, W.B. Croft, Searching distributed collections with inference networks, in: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, 1995, pp. 21-28.
  • 39
    • 84976839683 scopus 로고    scopus 로고
    • C. Stanfill, Partitioned posting files: a parallel inverted file structure for information retrieval, in: Proceedings of the 13th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Brussels, 1990, pp. 413-428.
  • 40
    • 0029255005 scopus 로고
    • Inverted file partitioning schemes in multiple disk systems
    • Jeong B.-S., and Omiecinski E. Inverted file partitioning schemes in multiple disk systems. IEEE Trans. Parallel Distribut. Syst. 6 2 (1995) 142-153
    • (1995) IEEE Trans. Parallel Distribut. Syst. , vol.6 , Issue.2 , pp. 142-153
    • Jeong, B.-S.1    Omiecinski, E.2
  • 41
    • 84902799141 scopus 로고    scopus 로고
    • S.-H. Chung, S.-C. Oh, K.R. Ryu, S.-H. Park, Parallel information retrieval on a distributed memory multiprocessor system, in: Proceedings of the International Conference on Algorithms and Architectures for Parallel Processing (ICAPP 97), 1997, pp. 163-176.
  • 42
    • 34147127191 scopus 로고    scopus 로고
    • P. Bailey, D. Hawking, A parallel architecture for query processing over a terabyte of text, Technical Report TR-CS-96-04, The Australia National University, June 1996.
  • 43
    • 84961839082 scopus 로고    scopus 로고
    • N. Goharian, T. El-Ghazawi, D. Grossman, Enterprise text processing: a sparse matrix approach, in: Proceedings of International Conference on Information Technology: Coding and Computing,, 2001, pp. 71-75.
  • 44
    • 0022877691 scopus 로고
    • Parallel free-text search on the connection machine system
    • Stanfill C., and Kahle B. Parallel free-text search on the connection machine system. Commun. ACM 29 12 (1986) 1229-1239
    • (1986) Commun. ACM , vol.29 , Issue.12 , pp. 1229-1239
    • Stanfill, C.1    Kahle, B.2
  • 45
    • 34147121232 scopus 로고    scopus 로고
    • Z. Lu, K.S. McKinley, B. Cahoon, The hardware/software balancing act for information retrieval on symmetric multiprocessors, in: Proceedings of Euro-Par 98, 1998, pp. 521-527.
  • 46
    • 84970879656 scopus 로고    scopus 로고
    • S. Melnik, S. Raghavan, B. Yang, H. Garcia-Molina, Building a distributed full-text index for the Web, in: Proceedings of The 10th International Conference on World Wide Web, Hong Kong, 2001, pp. 396-406.
  • 48
    • 0035390859 scopus 로고    scopus 로고
    • Lessons from giant-scale services
    • Brewer E.A. Lessons from giant-scale services. IEEE Internet Comput. 5 4 (2001) 46-55
    • (2001) IEEE Internet Comput. , vol.5 , Issue.4 , pp. 46-55
    • Brewer, E.A.1
  • 51
    • 0003214715 scopus 로고    scopus 로고
    • Evaluating the performance of distributed architectures for information retrieval using a variety of workloads
    • Cahoon B., McKinley K.S., and Lu Z. Evaluating the performance of distributed architectures for information retrieval using a variety of workloads. ACM Trans. Inform. Syst. 18 1 (2000) 1-43
    • (2000) ACM Trans. Inform. Syst. , vol.18 , Issue.1 , pp. 1-43
    • Cahoon, B.1    McKinley, K.S.2    Lu, Z.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.