메뉴 건너뛰기




Volumn 9, Issue 2, 2012, Pages 421-429

Efficient maximal repeat finding using the burrows-wheeler transform and wavelet tree

Author keywords

Burrows Wheeler transform; maximal repeats; Repeats; wavelet trees

Indexed keywords

AUXILIARY DATA STRUCTURES; BIOLOGICAL FUNCTIONS; BURROWS WHEELER TRANSFORM; DATA COMPRESSOR; DATA SETS; DESKTOP COMPUTER; EXTERNAL MEMORY; GENOMIC SEQUENCE; HUMAN GENOMES; INTERNAL MEMORY; MASSIVE DATA; MAXIMAL REPEATS; NATURAL LANGUAGE TEXT; OPEN-SOURCE SOFTWARES; ORDERS OF MAGNITUDE; PROTEIN DATA; REPEATS; REPETITIVE STRUCTURE; SPACE USAGE; SUFFIX ARRAYS; SUFFIX-TREES; TIMING PERFORMANCE; WAVELET TREE;

EID: 84856440630     PISSN: 15455963     EISSN: None     Source Type: Journal    
DOI: 10.1109/TCBB.2011.127     Document Type: Article
Times cited : (26)

References (42)
  • 1
    • 67649850577 scopus 로고    scopus 로고
    • Efficient computation of all perfect repeats in genomic sequences of up to half a gigabyte, with a case study on the human genome
    • V. Becher, A. Deymonnaz, and P.A. Heiber, "Efficient Computation of All Perfect Repeats in Genomic Sequences of Up To Half a Gigabyte, with a Case Study on the Human Genome," Bioinformatics, vol. 25, no. 14, pp. 1746-1753, 2009.
    • (2009) Bioinformatics , vol.25 , Issue.14 , pp. 1746-1753
    • Becher, V.1    Deymonnaz, A.2    Heiber, P.A.3
  • 3
    • 0033555906 scopus 로고    scopus 로고
    • Tandem repeats finder: A program to analyze DNA sequences
    • DOI 10.1093/nar/27.2.573
    • G. Benson, "Tandem Repeats Finder: A Program to Analyze DNA Sequences," Nucleic Acids Research, vol. 27, no. 2, pp. 573-580, 1999. (Pubitemid 29210025)
    • (1999) Nucleic Acids Research , vol.27 , Issue.2 , pp. 573-580
    • Benson, G.1
  • 5
    • 0035999989 scopus 로고    scopus 로고
    • Troll-tandem repeat occurrence locator
    • A.T. Castelo, W. Martins, and G.R. Gao, "Troll-Tandem Repeat Occurrence Locator," Bioinformatics, vol. 18, no. 4, pp. 634-636, 2002.
    • (2002) Bioinformatics , vol.18 , Issue.4 , pp. 634-636
    • Castelo, A.T.1    Martins, W.2    Gao, G.R.3
  • 7
    • 30544432152 scopus 로고    scopus 로고
    • Indexing compressed text
    • DOI 10.1145/1082036.1082039
    • P. Ferragina and G. Manzini, "Indexing Compressed Text," J. ACM, vol. 52, no. 4, pp. 552-581, 2005. (Pubitemid 43078371)
    • (2005) Journal of the ACM , vol.52 , Issue.4 , pp. 552-581
    • Ferragina, P.1    Manzini, G.2
  • 11
    • 33645796213 scopus 로고    scopus 로고
    • Compressed suffix arrays and suffix trees with applications to text indexing and string matching
    • R. Grossi and J.S. Vitter, "Compressed Suffix Arrays and Suffix Trees with Applications to Text Indexing and String Matching," SIAM J. Computing, vol. 35, no. 32, pp. 378-407, 2005.
    • (2005) SIAM J. Computing , vol.35 , Issue.32 , pp. 378-407
    • Grossi, R.1    Vitter, J.S.2
  • 13
    • 34547375123 scopus 로고    scopus 로고
    • A space and time efficient algorithm for constructing compressed suffix arrays
    • DOI 10.1007/s00453-006-1228-8
    • W.-K. Hon, T.-W. Lam, K. Sadakane, W.-K. Sung, and S.-M. Yiu, "A Space and Time Efficient Algorithm for Constructing Compressed Suffix Arrays," Algorithmica, vol. 48, no. 1, pp. 23-36, 2007. (Pubitemid 47136627)
    • (2007) Algorithmica (New York) , vol.48 , Issue.1 , pp. 23-36
    • Hon, W.-K.1    Lam, T.-W.2    Sadakane, K.3    Sung, W.-K.4    Yiu, S.-M.5
  • 14
    • 65949095627 scopus 로고    scopus 로고
    • Breaking a time-And-space barrier in constructing full-text indices
    • W.-K. Hon, K. Sadakane, and W.-K. Sung, "Breaking a Time-And-Space Barrier in Constructing Full-Text Indices," SIAM J. Computing, vol. 38, no. 6, pp. 2162-2178, 2009.
    • (2009) SIAM J. Computing , vol.38 , Issue.6 , pp. 2162-2178
    • Hon, W.-K.1    Sadakane, K.2    Sung, W.-K.3
  • 15
    • 35449006300 scopus 로고    scopus 로고
    • Fast BWT in small space by blockwise suffix sorting
    • DOI 10.1016/j.tcs.2007.07.018, PII S0304397507005245, The Burrows-Wheaker Transform
    • J. Kärkkäinen, "Fast BWT in Small Space by Blockwise Suffix Sorting," Theoretical Computer Science, vol. 387, no. 3, pp. 249-257, 2007. (Pubitemid 47633442)
    • (2007) Theoretical Computer Science , vol.387 , Issue.3 , pp. 249-257
    • Karkkainen, J.1
  • 17
    • 84956976999 scopus 로고    scopus 로고
    • Linear-time longest-common-prefix computation in suffix arrays and its applications
    • Combinatorial Pattern Matching
    • T. Kasai, G. Lee, H. Arimura, S. Arikawa, and K. Park, "Linear-Time Longest-Common-Prefix Computation in Suffix Arrays and Its Applications," Proc. Ann. Symp. Combinatorial Pattern Matching, pp. 181-192, 2001. (Pubitemid 33298873)
    • (2001) Lecture Notes in Computer Science , Issue.2089 , pp. 181-192
    • Kasai, T.1    Lee, G.2    Arimura, H.3    Arikawa, S.4    Park, K.5
  • 19
    • 0033227559 scopus 로고    scopus 로고
    • Reducing the space requirement of suffix trees
    • S. Kurtz, "Reducing the Space Requirements of Suffix Trees," Software-Practice and Experience, vol. 29, no. 13, pp. 1149-1171, 1999. (Pubitemid 129303954)
    • (1999) Software - Practice and Experience , vol.29 , Issue.13 , pp. 1149-1171
    • Kurtz, S.1
  • 21
    • 0033066718 scopus 로고    scopus 로고
    • REPuter: Fast computation of maximal repeats in complete genomes
    • DOI 10.1093/bioinformatics/15.5.426
    • S. Kurtz and C. Schleiermacher, "Reputer: Fast Computation of Maximal Repeats in Complete Genomes," Bioinformatics, vol. 15, no. 5, pp. 426-427, 1999. (Pubitemid 29312632)
    • (1999) Bioinformatics , vol.15 , Issue.5 , pp. 426-427
    • Kurtz, S.1    Schleiermacher, C.2
  • 22
    • 0037433046 scopus 로고    scopus 로고
    • FORRepeats: Detects repeats on entire chromosomes and between genomes
    • DOI 10.1093/bioinformatics/btf843
    • A. Lefebvre, T. Lecroq, H. Dauchel, and J. Alexandre, "FORRepeats: Detects Repeats on Entire Chromosomes and between Genomes," Bioinformatics, vol. 19, no. 3, pp. 319-326, 2003. (Pubitemid 36284931)
    • (2003) Bioinformatics , vol.19 , Issue.3 , pp. 319-326
    • Lefebvre, A.1    Lecroq, T.2    Dauchel, H.3    Alexandre, J.4
  • 23
    • 18844405663 scopus 로고    scopus 로고
    • Space-efficient whole genome comparisons with Burrows-Wheeler transforms
    • DOI 10.1089/cmb.2005.12.407
    • R. Lippert, "Space-Efficient Whole Genome Comparisons with Burrows-Wheeler Transforms," J. Computational Biology, vol. 12, no. 4, pp. 407-415, 2005. (Pubitemid 40696479)
    • (2005) Journal of Computational Biology , vol.12 , Issue.4 , pp. 407-415
    • Lippert, R.A.1
  • 24
    • 25644453578 scopus 로고    scopus 로고
    • A space-efficient construction of the Burrows-Wheeler transform for genomic data
    • DOI 10.1089/cmb.2005.12.943
    • R.A. Lippert, C.M. Mobarry, and B. Walenz, "A Space-Efficient Construction of the Burrows-Wheeler Transform for Genomic Data," J. Computational Biology, vol. 12, no. 7, pp. 943-951, 2005. (Pubitemid 41384390)
    • (2005) Journal of Computational Biology , vol.12 , Issue.7 , pp. 943-951
    • Lippert, R.A.1    Mobarry, C.M.2    Walenz, B.P.3
  • 25
    • 34248338640 scopus 로고    scopus 로고
    • Finding the region of pseudo-periodic tandem repeats in biological sequences
    • X. Liu and L. Wang, "Finding the Region of Pseudo-Periodic Tandem Repeats in Biological Sequences," Algorithms for Molecular Biology, vol. 1, no. 1, p. 2, 2006.
    • (2006) Algorithms for Molecular Biology , vol.1 , Issue.1 , pp. 2
    • Liu, X.1    Wang, L.2
  • 26
    • 17444443735 scopus 로고    scopus 로고
    • Compact suffix array: A space-efficient full-text index
    • Oct.
    • V. Mäkinen, "Compact Suffix Array: A Space-Efficient Full-Text Index," Fundamenta Informaticae, vol. 56, pp. 191-210, Oct. 2002.
    • (2002) Fundamenta Informaticae , vol.56 , pp. 191-210
    • Mäkinen, V.1
  • 27
    • 35048818017 scopus 로고    scopus 로고
    • Two space saving tricks for linear time lcp array computation
    • G. Manzini, "Two Space Saving Tricks for Linear Time lcp Array Computation," Proc. Scandinavian Workshop Algorithm Theory, pp. 372-383, 2004.
    • (2004) Proc. Scandinavian Workshop Algorithm Theory , pp. 372-383
    • Manzini, G.1
  • 29
    • 0021101102 scopus 로고
    • An efficient method for finding repeats in molecular sequences
    • H.M. Martinez, "An Efficient Method for Finding Repeats in Molecular Sequences," Nucleic Acids Research, vol. 11, no. 13, pp. 4629-4634, 1983.
    • (1983) Nucleic Acids Research , vol.11 , Issue.13 , pp. 4629-4634
    • Martinez, H.M.1
  • 30
    • 84856449902 scopus 로고
    • Human genetics: The molecular revolution
    • E.H. McConkey, Human Genetics: The Molecular Revolution. Jones and Bartlett, 1993.
    • (1993) Jones and Bartlett
    • McConkey, E.H.1
  • 31
    • 34548697512 scopus 로고    scopus 로고
    • Alphabet-independent linear-time construction of compressed suffix arrays using o (n log n)-bit working space
    • DOI 10.1016/j.tcs.2007.05.030, PII S030439750700477X
    • J.C. Na and K. Park, "Alphabet-Independent Linear-Time Construction of Compressed Suffix Arrays Using o(nlogn)-Bit Working Space," Theoretical Computer Science, vol. 385, nos. 1-3, pp. 127-136, 2007. (Pubitemid 47419161)
    • (2007) Theoretical Computer Science , vol.385 , Issue.1-3 , pp. 127-136
    • Na, J.C.1    Park, K.2
  • 35
    • 34548782284 scopus 로고    scopus 로고
    • Evolutionary insights from suffix array-based genome sequence analysis
    • Special Issue - Bioinformatics: from Molecules to Systems August 2007
    • A. Poddar, N. Chandra, M. Ganapathiraju, K. Sekar, J. Klein-Seetharaman, R. Judith, R. Reddy, and N. Balakrishnan, "Evolutionary Insights from Suffix Array-based Genome Sequence Analysis," J. Biosciences, vol. 32, no. 5, pp. 871-881, 2007. (Pubitemid 47435519)
    • (2007) Journal of Biosciences , vol.32 , Issue.5 , pp. 871-881
    • Poddar, A.1    Chandra, N.2    Ganapathiraju, M.3    Sekar, K.4    Klein-Seetharaman, J.5    Reddy, R.6    Balakrishnan, N.7
  • 36
    • 36448943298 scopus 로고    scopus 로고
    • Succinct indexable dictionaries with applications to encoding k-Ary trees, prefix sums and multisets
    • R. Raman, V. Raman, and S.S. Rao, "Succinct Indexable Dictionaries with Applications to Encoding k-Ary Trees, Prefix Sums and Multisets," ACM Trans. Algorithms, vol. 3, no. 4, p. 43, 2007.
    • (2007) ACM Trans. Algorithms , vol.3 , Issue.4 , pp. 43
    • Raman, R.1    Raman, V.2    Rao, S.S.3
  • 37
    • 84968866645 scopus 로고    scopus 로고
    • Succinct representations of lcp information and improvements in the compressed suffix arrays
    • K. Sadakane, "Succinct Representations of lcp Information and Improvements in the Compressed Suffix Arrays," Proc. Ann. ACM-SIAM Symp. Discrete Algorithms, pp. 225-232, 2002.
    • (2002) Proc. Ann. ACM-SIAM Symp. Discrete Algorithms , pp. 225-232
    • Sadakane, K.1
  • 38
    • 42449112779 scopus 로고    scopus 로고
    • Computational approaches and tools used in identification of dispersed repetitive DNA sequences
    • S. Saha, S. Bridges, Z.V. Magbanua, and D.G. Peterson, "Computational Approaches and Tools Used in Identification of Dispersed Repetitive DNA Sequences," Tropical Plant Biology, vol. 1, no. 1, pp. 85-96, 2008.
    • (2008) Tropical Plant Biology , vol.1 , Issue.1 , pp. 85-96
    • Saha, S.1    Bridges, S.2    Magbanua, Z.V.3    Peterson, D.G.4
  • 39
    • 42449106154 scopus 로고    scopus 로고
    • Empirical comparison of ab initio repeat finding programs
    • DOI 10.1093/nar/gkn064
    • S. Saha, S. Bridges, Z.V. Magbanua, and D.G. Peterson, "Empirical Comparison of ab Initio Repeat Finding Programs," Nucleic Acids Research, vol. 36, no. 7, pp. 2284-2294, 2008. (Pubitemid 351567001)
    • (2008) Nucleic Acids Research , vol.36 , Issue.7 , pp. 2284-2294
    • Saha, S.1    Bridges, S.2    Magbanua, Z.V.3    Peterson, D.G.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.