메뉴 건너뛰기




Volumn 35, Issue 2, 2006, Pages 378-407

Compressed suffix arrays and suffix trees with applications to text indexing and string matching

Author keywords

Compressed data structures; Compression; Pattern matching; String searching; Suffix arrays; Suffix trees; Text indexing; Text retrieval

Indexed keywords

COMPRESSED DATA STRUCTURES; SUFFIX ARRAYS; TEXT INDEXING; TEXT RETRIEVAL;

EID: 33645796213     PISSN: 00975397     EISSN: None     Source Type: Journal    
DOI: 10.1137/S0097539702402354     Document Type: Article
Times cited : (336)

References (72)
  • 2
    • 85009167014 scopus 로고
    • Efficient two-dimensional compressed matching
    • J. A. Storer and M. Cohn, eds., IEEE Computer Society Press, Los Alamitos, CA
    • A. AMIR AND G. BENSON, Efficient two-dimensional compressed matching, in Proceedings of the IEEE Data Compression Conference, J. A. Storer and M. Cohn, eds., IEEE Computer Society Press, Los Alamitos, CA, 1992, pp. 279-288.
    • (1992) Proceedings of the IEEE Data Compression Conference , pp. 279-288
    • Amir, A.1    Benson, G.2
  • 3
    • 0030129729 scopus 로고    scopus 로고
    • Let sleeping files LIE: Pattern matching in Z-compressed files
    • A. AMIR, G. BENSON, AND M. FARACH, Let sleeping files LIE: Pattern matching in Z-compressed files, J. Comput. System Sci., 52 (1996), pp. 299-307.
    • (1996) J. Comput. System Sci. , vol.52 , pp. 299-307
    • Amir, A.1    Benson, G.2    Farach, M.3
  • 7
    • 0002139526 scopus 로고
    • The myriad virtues of suffix trees
    • Combinatorial Algorithms on Words, A. Apostolico and Z. Galil, eds., Springer-Verlag, Berlin
    • A. APOSTOLICO, The myriad virtues of suffix trees, in Combinatorial Algorithms on Words, A. Apostolico and Z. Galil, eds., NATO Advanced Science Institutes Series F: Computer and System Sciences 12, Springer-Verlag, Berlin, 1985, pp. 85-96.
    • (1985) NATO Advanced Science Institutes Series F: Computer and System Sciences , vol.12 , pp. 85-96
    • Apostolico, A.1
  • 9
    • 0019173421 scopus 로고
    • Efficient worst-case data structures for range searching
    • J. L. BENTLEY AND H. A. MAURER, Efficient worst-case data structures for range searching, Acta Inform., 13 (1980), pp. 155-168.
    • (1980) Acta Inform. , vol.13 , pp. 155-168
    • Bentley, J.L.1    Maurer, H.A.2
  • 11
    • 0023383102 scopus 로고
    • Complete inverted files for efficient text retrieval and analysis
    • A. BLUMER, J. BLUMER, D. HAUSSLER, R. MCCONNELL, AND A. EHRENFEUCHT, Complete inverted files for efficient text retrieval and analysis, J. ACM, 34 (1987), pp. 578-595.
    • (1987) J. ACM , vol.34 , pp. 578-595
    • Blumer, A.1    Blumer, J.2    Haussler, D.3    Mcconnell, R.4    Ehrenfeucht, A.5
  • 12
    • 84957700642 scopus 로고    scopus 로고
    • Approximate dictionary queries
    • Proceedings of the 7th Annual Symposium on Combinatorial Pattern Matching, D. S. Hirschberg and E. W. Myers, eds., Springer-Verlag, Berlin, New York
    • G. S. BRODAL AND L. GA̧SIENIEC, Approximate dictionary queries, in Proceedings of the 7th Annual Symposium on Combinatorial Pattern Matching, D. S. Hirschberg and E. W. Myers, eds., Lecture Notes in Comput. Sci. 1075, Springer-Verlag, Berlin, New York, 1996, pp. 65-74.
    • (1996) Lecture Notes in Comput. Sci. , vol.1075 , pp. 65-74
    • Brodal, G.S.1    Ga̧sieniec, L.2
  • 13
    • 0032613750 scopus 로고    scopus 로고
    • Membership in constant time and almost-minimum space
    • A. BRODNIK AND J. I. MUNRO, Membership in constant time and almost-minimum space, SIAM J. Comput., 28 (1999), pp. 1627-1640.
    • (1999) SIAM J. Comput. , vol.28 , pp. 1627-1640
    • Brodnik, A.1    Munro, J.I.2
  • 14
    • 0037967497 scopus 로고
    • A block sorting data compression algorithm
    • Digital Systems Research Center, Palo Alto, CA
    • M. BURROWS AND D. J. WHEELER, A Block Sorting Data Compression Algorithm, Tech. report, Digital Systems Research Center, Palo Alto, CA, 1994.
    • (1994) Tech. Report
    • Burrows, M.1    Wheeler, D.J.2
  • 15
    • 0004090301 scopus 로고    scopus 로고
    • Ph.D. thesis, Department of Computer Science, University of Waterloo, Waterloo, Ontario, Canada
    • D. CLARK, Compact Pat Trees, Ph.D. thesis, Department of Computer Science, University of Waterloo, Waterloo, Ontario, Canada, 1996.
    • (1996) Compact Pat Trees
    • Clark, D.1
  • 17
    • 0042022841 scopus 로고    scopus 로고
    • A time and space efficient data structure for string searching on large texts
    • L. COLUSSI AND A. DE COL, A time and space efficient data structure for string searching on large texts, Inform. Process. Lett., 58 (1996), pp. 217-222.
    • (1996) Inform. Process. Lett. , vol.58 , pp. 217-222
    • Colussi, L.1    De Col, A.2
  • 18
    • 0022865420 scopus 로고
    • Transducers and repetitions
    • M. CROCHEMORE, Transducers and repetitions, Theoret. Comput. Sci., 45 (1986), pp. 63-86.
    • (1986) Theoret. Comput. Sci. , vol.45 , pp. 63-86
    • Crochemore, M.1
  • 19
    • 71149088928 scopus 로고
    • Two-way string matching
    • M. CROCHEMORE AND D. PERRIN, Two-way string matching, J. ACM, 38 (1991), pp. 651-675.
    • (1991) J. ACM , vol.38 , pp. 651-675
    • Crochemore, M.1    Perrin, D.2
  • 23
    • 0016047467 scopus 로고
    • Efficient storage and retrieval by content and address of static files
    • P. ELIAS, Efficient storage and retrieval by content and address of static files, J. ACM, 21 (1974), pp. 246-260.
    • (1974) J. ACM , vol.21 , pp. 246-260
    • Elias, P.1
  • 24
    • 0000573761 scopus 로고    scopus 로고
    • String matching in Lempel-Ziv compressed strings
    • M. FARACH AND M. THORUP, String matching in Lempel-Ziv compressed strings, Algorithmica, 20 (1998), pp. 388-404.
    • (1998) Algorithmica , vol.20 , pp. 388-404
    • Farach, M.1    Thorup, M.2
  • 26
    • 0037624231 scopus 로고    scopus 로고
    • On the sorting-complexity of suffix tree construction
    • M. FARACH-COLTON, P. FERRAGINA, AND S. MUTHUKRISHNAN, On the sorting-complexity of suffix tree construction, J. ACM, 47 (2000), pp. 987-1011.
    • (2000) J. ACM , vol.47 , pp. 987-1011
    • Farach-Colton, M.1    Ferragina, P.2    Muthukrishnan, S.3
  • 27
    • 84947716322 scopus 로고    scopus 로고
    • Optimal logarithmic time randomized suffix tree construction
    • Automata, Languages, and Programming, 23rd International Colloquium, F. Meyer auf der Heide and B. Monien, eds., Springer-Verlag, Berlin, New York
    • M. FARACH-COLTON AND S. MUTHUKRISHNAN, Optimal logarithmic time randomized suffix tree construction, in Automata, Languages, and Programming, 23rd International Colloquium, F. Meyer auf der Heide and B. Monien, eds., Lecture Notes in Comput. Sci. 1099, Springer-Verlag, Berlin, New York, 1996, pp. 550-561.
    • (1996) Lecture Notes in Comput. Sci. , vol.1099 , pp. 550-561
    • Farach-Colton, M.1    Muthukrishnan, S.2
  • 28
    • 0000950910 scopus 로고    scopus 로고
    • The String B-tree: A new data structure for string search in external memory and its applications
    • P. FERRAGINA AND R. GROSSI, The String B-tree: a new data structure for string search in external memory and its applications, J. ACM, 46 (1999), pp. 236-280.
    • (1999) J. ACM , vol.46 , pp. 236-280
    • Ferragina, P.1    Grossi, R.2
  • 31
    • 0021467937 scopus 로고
    • Storing a sparse table with O(1) worst case access time
    • M. L. FREDMAN, J. KOMLÓS, AND E. SZEMERÉDI, Storing a sparse table with O(1) worst case access time, J. ACM, 31 (1984), pp. 538-544.
    • (1984) J. ACM , vol.31 , pp. 538-544
    • Fredman, M.L.1    Komlós, J.2    Szemerédi, E.3
  • 32
    • 33244495461 scopus 로고    scopus 로고
    • Automata, languages and programming
    • Proceedings of the 30th International Colloquium, Springer, Berlin, New York
    • A. GÁL AND P. B. MILTERSEN, Automata, Languages and Programming, in Proceedings of the 30th International Colloquium, Lecture Notes in Comput. Sci. 2719, Springer, Berlin, New York, 2003, pp. 332-344.
    • (2003) Lecture Notes in Comput. Sci. , vol.2719 , pp. 332-344
    • Gál, A.1    Miltersen, P.B.2
  • 33
    • 0020764168 scopus 로고
    • Time-space-optimal string matching
    • Z. GALIL AND J. SEIFERAS, Time-space-optimal string matching, J. Comput. System Sci., 26 (1983), pp. 280-294.
    • (1983) J. Comput. System Sci. , vol.26 , pp. 280-294
    • Galil, Z.1    Seiferas, J.2
  • 34
    • 84957045476 scopus 로고    scopus 로고
    • Efficient implementation of lazy suffix trees
    • Proceedings of the 3rd Workshop on Algorithm Engineering, J. S. Vitter and C. D. Zaroliagis, eds., Springer-Verlag, Berlin
    • R. GIEGERICH, S. KURTZ, AND J. STOYE, Efficient implementation of lazy suffix trees, in Proceedings of the 3rd Workshop on Algorithm Engineering, J. S. Vitter and C. D. Zaroliagis, eds., Lecture Notes in Comput. Sci. 1668, Springer-Verlag, Berlin, 1999, pp. 30-42.
    • (1999) Lecture Notes in Comput. Sci. , vol.1668 , pp. 30-42
    • Giegerich, R.1    Kurtz, S.2    Stoye, J.3
  • 37
    • 0033690274 scopus 로고    scopus 로고
    • Compressed suffix arrays and suffix trees with applications to text indexing and string matching (extended abstract)
    • Portland, OR
    • R. GROSSI AND J. S. VITTER, Compressed suffix arrays and suffix trees with applications to text indexing and string matching (extended abstract), in Proceedings of the 32nd Annual ACM Symposium on the Theory of Computing, Portland, OR, 2000, pp. 397-406.
    • (2000) Proceedings of the 32nd Annual ACM Symposium on the Theory of Computing , pp. 397-406
    • Grossi, R.1    Vitter, J.S.2
  • 41
    • 0003435032 scopus 로고
    • Suffix binary search trees
    • Computing Science Department, University of Glasgow, Glasgow, UK
    • R. W. IRVING, Suffix Binary Search Trees, Tech. report TR-1995-7, Computing Science Department, University of Glasgow, Glasgow, UK, 1995.
    • (1995) Tech. Report , vol.TR-1995-7
    • Irving, R.W.1
  • 43
    • 0004310601 scopus 로고
    • Succinct static data structures
    • Department of Computer Science, Carnegie-Mellon University, Pittsburgh, PA
    • G. JACOBSON, Succinct Static Data Structures, Tech. report CMU-CS-89-112, Department of Computer Science, Carnegie-Mellon University, Pittsburgh, PA, 1989.
    • (1989) Tech. Report , vol.CMU-CS-89-112
    • Jacobson, G.1
  • 44
    • 84957881825 scopus 로고
    • Suffix cactus: A cross between suffix tree and suffix array
    • Combinatorial Pattern Matching, Springer, Berlin, New York
    • J. KÄRKKÄINEN, Suffix cactus: A cross between suffix tree and suffix array, in Combinatorial Pattern Matching, Lecture Notes in Comput. Sci. 937, Springer, Berlin, New York, 1995, pp. 191-204.
    • (1995) Lecture Notes in Comput. Sci. , vol.937 , pp. 191-204
    • Kärkkäinen, J.1
  • 45
    • 0042209438 scopus 로고    scopus 로고
    • Lempel-Ziv index for q-grams
    • J. KÄRKKÄINEN AND E. SUTINEN, Lempel-Ziv index for q-grams, Algorithmica, 21 (1998), pp. 137-154.
    • (1998) Algorithmica , vol.21 , pp. 137-154
    • Kärkkäinen, J.1    Sutinen, E.2
  • 46
    • 0141565607 scopus 로고    scopus 로고
    • Lempel-Ziv parsing and sublinear-size index structures for string matching
    • N. Ziviani, R. Baeza-Yates, and K. Guimarães, eds., Carleton University Press, Ottawa, Ontario, Canada
    • J. KÄRKKÄINEN AND E. UKKONEN, Lempel-Ziv parsing and sublinear-size index structures for string matching, in Proceedings of the 3rd South American Workshop on String Processing, N. Ziviani, R. Baeza-Yates, and K. Guimarães, eds., Carleton University Press, Ottawa, Ontario, Canada, 1996, pp. 141-155.
    • (1996) Proceedings of the 3rd South American Workshop on String Processing , pp. 141-155
    • Kärkkäinen, J.1    Ukkonen, E.2
  • 47
    • 84948971495 scopus 로고    scopus 로고
    • Sparse suffix trees
    • Computing and Combinatorics, Springer, Berlin
    • J. KÄRKKÄINEN AND E. UKKONEN, Sparse suffix trees, in Computing and Combinatorics, Lecture Notes in Comput. Sci. 1090, Springer, Berlin, 1996, pp. 219-230.
    • (1996) Lecture Notes in Comput. Sci. , vol.1090 , pp. 219-230
    • Kärkkäinen, J.1    Ukkonen, E.2
  • 48
    • 0242337515 scopus 로고    scopus 로고
    • Sorting and searching, 2nd ed.
    • Addison-Wesley, Reading, MA
    • D. E. KNUTH, Sorting and Searching, 2nd ed., The Art of Computer Programming 3, Addison-Wesley, Reading, MA, 1998.
    • (1998) The Art of Computer Programming , vol.3
    • Knuth, D.E.1
  • 53
    • 0033227559 scopus 로고    scopus 로고
    • Reducing the space requirement of suffix trees
    • S. KURTZ, Reducing the Space Requirement of Suffix Trees, Software Practice and Experience, 29 (1999), pp. 1149-1171.
    • (1999) Software Practice and Experience , vol.29 , pp. 1149-1171
    • Kurtz, S.1
  • 54
    • 84873644260 scopus 로고    scopus 로고
    • Compact suffix array
    • Combinatorial Pattern Matching, Springer, Berlin, New York
    • V. MÄKINEN, Compact suffix array, in Combinatorial Pattern Matching, Lecture Notes in Comput. Sci. 1848, Springer, Berlin, New York, 2000, pp. 305-319.
    • (2000) Lecture Notes in Comput. Sci. , vol.1848 , pp. 305-319
    • Mäkinen, V.1
  • 55
    • 0027681165 scopus 로고
    • Suffix arrays: A new method for on-line string searches
    • U. MANBER AND G. MYERS, Suffix arrays: A new method for on-line string searches, SIAM J. Comput., 22 (1993), pp. 935-948.
    • (1993) SIAM J. Comput. , vol.22 , pp. 935-948
    • Manber, U.1    Myers, G.2
  • 57
    • 0016942292 scopus 로고
    • A space-economical suffix tree construction algorithm
    • E. M. MCCREIGHT, A space-economical suffix tree construction algorithm, J. ACM, 23 (1976), pp. 262-272.
    • (1976) J. ACM , vol.23 , pp. 262-272
    • Mccreight, E.M.1
  • 58
    • 0030260359 scopus 로고    scopus 로고
    • Self-indexing inverted files for fast text retrieval
    • A. MOFFAT AND J. ZOBEL, Self-indexing inverted files for fast text retrieval, ACM Trans. Inform. Systems, 14 (1996), pp. 349-379.
    • (1996) ACM Trans. Inform. Systems , vol.14 , pp. 349-379
    • Moffat, A.1    Zobel, J.2
  • 59
    • 38149018071 scopus 로고
    • PATRICIA - Practical algorithm to retrieve information coded in alphanumeric
    • D. R. MORRISON, PATRICIA - Practical algorithm to retrieve information coded In alphanumeric, J. ACM, 15 (1968), pp. 514-534.
    • (1968) J. ACM , vol.15 , pp. 514-534
    • Morrison, D.R.1
  • 60
    • 84947935857 scopus 로고    scopus 로고
    • Tables
    • Foundations of Software Technology and Theoretical Computer Science, Springer, Berlin
    • J. I. MUNRO, Tables, in Foundations of Software Technology and Theoretical Computer Science, Lecture Notes in Comput. Sci. 1180, Springer, Berlin, pp. 37-42.
    • Lecture Notes in Comput. Sci. , vol.1180 , pp. 37-42
    • Munro, J.I.1
  • 62
    • 0000674707 scopus 로고    scopus 로고
    • Space efficient suffix trees
    • J. I. MUNRO, V. RAMAN, AND S. S. RAO, Space efficient suffix trees, J. Algorithms. 39 (2001), pp. 205-222.
    • (2001) J. Algorithms. , vol.39 , pp. 205-222
    • Munro, J.I.1    Raman, V.2    Rao, S.S.3
  • 63
    • 0036248032 scopus 로고    scopus 로고
    • Low redundancy in static dictionaries with constant query time
    • R. PAGH, Low redundancy in static dictionaries with constant query time, SIAM J. Comput., 31 (2001), pp. 353-363.
    • (2001) SIAM J. Comput. , vol.31 , pp. 353-363
    • Pagh, R.1
  • 64
    • 84949755971 scopus 로고    scopus 로고
    • Compressed text databases with efficient query algorithms based on the compressed suffix array
    • Proceedings of ISAAC '00, Springer, Berlin, New York
    • K. SADAKANE, Compressed text databases with efficient query algorithms based on the compressed suffix array, in Proceedings of ISAAC '00, Lecture Notes in Comput. Sci. 1969, Springer, Berlin, New York, 2000, pp. 410-421.
    • (2000) Lecture Notes in Comput. Sci. , vol.1969 , pp. 410-421
    • Sadakane, K.1
  • 65
    • 84968866645 scopus 로고    scopus 로고
    • Succinct representations of lcp information and improvements in the compressed suffix arrays
    • SIAM, Philadelphia
    • K. SADAKANE, Succinct representations of lcp information and improvements in the compressed suffix arrays, in Proceedings of the 13th Annual ACM-SIAM Symposium on Discrete Algorithms, SIAM, Philadelphia, 2002, pp. 225-232.
    • (2002) Proceedings of the 13th Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 225-232
    • Sadakane, K.1
  • 67
    • 0001704377 scopus 로고
    • On-line construction of suffix trees
    • E. UKKONEN, On-line construction of suffix trees, Algorithmica, 14 (1995), pp. 249-260.
    • (1995) Algorithmica , vol.14 , pp. 249-260
    • Ukkonen, E.1
  • 70
    • 0012726646 scopus 로고    scopus 로고
    • Dictionary look-up with one error
    • A. C. YAO AND F. F. YAO, Dictionary look-up with one error, J. Algorithms, 25 (1997), pp. 194-202.
    • (1997) J. Algorithms , vol.25 , pp. 194-202
    • Yao, A.C.1    Yao, F.F.2
  • 71
    • 0342521304 scopus 로고    scopus 로고
    • Compression: A key for next-generation text retrieval systems
    • N. ZIVIANI, E. S. DE MOURA, G. NAVARRO, AND R. BAEZA-YATES, Compression: A key for next-generation text retrieval systems, IEEE Comput., 33 (2000), pp. 37-44.
    • (2000) IEEE Comput. , vol.33 , pp. 37-44
    • Ziviani, N.1    De Moura, E.S.2    Navarro, G.3    Baeza-Yates, R.4
  • 72


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.