메뉴 건너뛰기




Volumn , Issue , 2008, Pages 409-418

Tinylex: Static n-gram index pruning with perfect recall

Author keywords

Algorithms; Experimentation

Indexed keywords

APPROXIMATE MATCHING; ERROR-RESILIENT; EXPERIMENTATION; FALSE POSITIVE; INCLUSION STRUCTURE; INVERTED INDICES; MAIN MEMORY; N-GRAMS; PRACTICAL PROBLEMS; QUERY STRING; QUERY TIME; SUB-STRINGS;

EID: 70349239392     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1458082.1458138     Document Type: Conference Paper
Times cited : (8)

References (26)
  • 1
    • 0027334710 scopus 로고
    • Trigrams as index elements in full text retrieval: Observations and experimental results
    • E. S. Adams and A. C. Meltzer. Trigrams as index elements in full text retrieval: Observations and experimental results. In ACM Conference on Computer Science, pages 433-439, 1993.
    • (1993) ACM Conference on Computer Science , pp. 433-439
    • Adams, E.S.1    Meltzer, A.C.2
  • 2
    • 22044441103 scopus 로고    scopus 로고
    • Inverted index compression using word-aligned binary codes
    • V. N. Anh and A. Moffat. Inverted index compression using word-aligned binary codes. Inf. Retr., 8(1):151-166, 2005.
    • (2005) Inf. Retr , vol.8 , Issue.1 , pp. 151-166
    • Anh, V.N.1    Moffat, A.2
  • 3
    • 0030671788 scopus 로고    scopus 로고
    • A corpus for the evaluation of lossless compression algorithms
    • Washington, DC, USA, IEEE Computer Society
    • R. Arnold and T. Bell. A corpus for the evaluation of lossless compression algorithms. In DCC '97: Proceedings of the Conference on Data Compression, page 201, Washington, DC, USA, 1997. IEEE Computer Society.
    • (1997) DCC '97: Proceedings of the Conference on Data Compression , pp. 201
    • Arnold, R.1    Bell, T.2
  • 5
    • 0014814325 scopus 로고
    • Space/time trade-offs in hash coding with allowable errors
    • B. H. Bloom. Space/time trade-offs in hash coding with allowable errors. Commun. ACM, 13(7):422-426, 1970.
    • (1970) Commun. ACM , vol.13 , Issue.7 , pp. 422-426
    • Bloom, B.H.1
  • 8
    • 85030014406 scopus 로고
    • One-time complete indexing of text: Theory and practice
    • R. J. D'Amore and C. P. Mah. One-time complete indexing of text: theory and practice. In SIGIR, pages 155-164, 1985.
    • (1985) SIGIR , pp. 155-164
    • D'Amore, R.J.1    Mah, C.P.2
  • 10
    • 0038217041 scopus 로고    scopus 로고
    • The distribution of n-grams
    • L. Egghe. The distribution of n-grams. Scientometrics, 47(2):237-252, 2000.
    • (2000) Scientometrics , vol.47 , Issue.2 , pp. 237-252
    • Egghe, L.1
  • 11
    • 2042437650 scopus 로고    scopus 로고
    • Initial sequencing and analysis of the human genome
    • International Human Genome Sequencing Consortium
    • International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature, 409(6822):860-921, 2001.
    • (2001) Nature , vol.409 , Issue.6822 , pp. 860-921
  • 13
    • 33745621089 scopus 로고    scopus 로고
    • n-Gram/2L: A space and time efficient two-level n-gram inverted index structure
    • M.-S. Kim, K.-Y. Whang, J.-G. Lee, and M.-J. Lee. n-Gram/2L: A space and time efficient two-level n-gram inverted index structure. In VLDB, pages 325-336, 2005.
    • (2005) VLDB , pp. 325-336
    • Kim, M.-S.1    Whang, K.-Y.2    Lee, J.-G.3    Lee, M.-J.4
  • 14
    • 3843104001 scopus 로고
    • Complete statistical indexing of text by overlapping word fragments
    • C. P. Mah and R. J. D'Amore. Complete statistical indexing of text by overlapping word fragments. SIGIR Forum, 17(3):6-16, 1982.
    • (1982) SIGIR Forum , vol.17 , Issue.3 , pp. 6-16
    • Mah, C.P.1    D'Amore, R.J.2
  • 17
    • 1542377482 scopus 로고    scopus 로고
    • Single n-gram stemming
    • J. Mayfield and P. McNamee. Single n-gram stemming. In SIGIR, pages 415-416, 2003.
    • (2003) SIGIR , pp. 415-416
    • Mayfield, J.1    McNamee, P.2
  • 18
    • 70349249052 scopus 로고    scopus 로고
    • Haircut: A system for multilingual text retrieval in java
    • P. McNamee, J. Mayfield, and C. Piatko. Haircut: a system for multilingual text retrieval in java. J. Comput. Small Coll., 17(3):8-22, 2002.
    • (2002) J. Comput. Small Coll , vol.17 , Issue.3 , pp. 8-22
    • McNamee, P.1    Mayfield, J.2    Piatko, C.3
  • 19
    • 0004990868 scopus 로고    scopus 로고
    • Binary interpolative coding for effective index compression
    • A. Moffat and L. Stuiver. Binary interpolative coding for effective index compression. Inf. Retr., 3(1):25-47, 2000.
    • (2000) Inf. Retr , vol.3 , Issue.1 , pp. 25-47
    • Moffat, A.1    Stuiver, L.2
  • 20
    • 0036470314 scopus 로고    scopus 로고
    • An efficient document retrieval method using n-gram indexing
    • Y. Ogawa and T. Matsuda. An efficient document retrieval method using n-gram indexing. Systems and Computers in Japan, 33(2):54-63, 2002.
    • (2002) Systems and Computers in Japan , vol.33 , Issue.2 , pp. 54-63
    • Ogawa, Y.1    Matsuda, T.2
  • 24
    • 0003129916 scopus 로고
    • Agrep: A fast approximate pattern-matching tool
    • San Francisco, California
    • S. Wu and U. Manber. Agrep: A fast approximate pattern-matching tool. In Proc. of the Winter 1992 USENIX Conference, pages 153-162, San Francisco, California, 1991.
    • (1991) Proc. of the Winter 1992 USENIX Conference , pp. 153-162
    • Wu, S.1    Manber, U.2
  • 25
    • 0038632285 scopus 로고    scopus 로고
    • Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
    • M. Yamamoto and K. W. Church. Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus. Computational Linguistics, 27(1):1-30, 2001.
    • (2001) Computational Linguistics , vol.27 , Issue.1 , pp. 1-30
    • Yamamoto, M.1    Church, K.W.2
  • 26
    • 0032268976 scopus 로고    scopus 로고
    • Inverted files versus signature files for text indexing
    • J. Zobel, A. Moffat, and K. Ramamohanarao. Inverted files versus signature files for text indexing. ACM Trans. Database Syst., 23(4):453-490, 1998.
    • (1998) ACM Trans. Database Syst , vol.23 , Issue.4 , pp. 453-490
    • Zobel, J.1    Moffat, A.2    Ramamohanarao, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.