Volume 9, Issue 2, 2012, Pages 345-357

A new efficient data structure for storage and retrieval of multiple biosequences

Author keywords

biology and genetics; Data storage representations; reusable libraries; software engineering

Indexed keywords

ALPHABET SIZE; BIOLOGICAL SEQUENCES; BIOLOGY AND GENETICS; BIOSEQUENCES; CUSTOMIZABLE; DATA STORAGE REPRESENTATION; DIFFERENT DISTRIBUTIONS; EFFICIENT DATA STRUCTURES; GENERIC IMPLEMENTATION; GENOME ANALYSIS; HUMAN GENOMES; INTERNAL REPRESENTATION; OBJECT-ORIENTED INTERFACES; PORTABLE SOFTWARE; REUSABLE LIBRARY; SCRIPTING LANGUAGES; SEQUENCE LENGTHS; SEQUENTIAL ACCESS; SPACE AND TIME; STORAGE AND RETRIEVALS; SUB-STRINGS;

EID: 84856496740     PISSN: 15455963     EISSN: None     Source Type: Journal    
DOI: 10.1109/TCBB.2011.146     Document Type: Article
Times cited : (10)

References (42)
  • 1
    • 0038451355
    • Reduction of protein sequence complexity by residue grouping
    • T. Li, K. Fan, J. Wang, and W. Wang, "Reduction of Protein Sequence Complexity by Residue Grouping" Protein Eng., vol. 16, no. 5, pp. 323-330, 2003. (Pubitemid 36806923)
    • (2003) Protein Engineering , vol.16 , Issue.5 , pp. 323-330
    • Li, T.1    Fan, K.2    Wang, J.3    Wang, W.4
  • 2
    • 65649149743
    • Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment
    • E.L. Peterson, J. Kondev, J.A. Theriot, and R. Phillips, "Reduced Amino Acid Alphabets Exhibit an Improved Sensitivity and Selectivity in Fold Assignment" Bioinformatics, vol. 25, no. 11, pp. 1356-1362, 2009.
    • (2009) Bioinformatics , vol.25 , Issue.11 , pp. 1356-1362
    • Peterson, E.L.1    Kondev, J.2    Theriot, J.A.3    Phillips, R.4
  • 3
    • 1242320272
    • Local homology recognition and distance measures in linear time using compressed amino acid alphabets
    • DOI 10.1093/nar/gkh180
    • R.C. Edgar, "Local Homology Recognition and Distance Measures in Linear Time Using Compressed Amino Acid Alphabets" Nucleic Acids Research, vol. 32, no. 1, pp. 380-385, 2004. (Pubitemid 38228638)
    • (2004) Nucleic Acids Research , vol.32 , Issue.1 , pp. 380-385
    • Edgar, R.C.1
  • 4
    • 77955613051
    • Clustering of protein families into functional subtypes using relative complexity measure with reduced amino acid alphabets
    • article 428
    • A. Albayrak, H.H. Otu, and U.O. Sezerman, "Clustering of Protein Families into Functional Subtypes Using Relative Complexity Measure with Reduced Amino Acid Alphabets" BMC Bioinformatics, vol. 11, no. 1, article 428, 2010.
    • (2010) BMC Bioinformatics , vol.11 , Issue.1
    • Albayrak, A.1    Otu, H.H.2    Sezerman, U.O.3
  • 6
    • 0030663490
    • Compression of nucleotide databases for fast searching
    • H. Williams and J. Zobel, "Compression of Nucleotide Databases for Fast Searching" Computer Applications in the Biosciences, vol. 13, no. 5, pp. 549-554, 1997. (Pubitemid 27480215)
    • (1997) Computer Applications in the Biosciences , vol.13 , Issue.5 , pp. 549-554
    • Williams, H.1    Zobel, J.2
  • 8
    • 34548580699
    • Comparing compressed sequences for faster nucleotide BLAST searches
    • DOI 10.1109/TCBB.2007.1029
    • M. Cameron and H.E. Williams, "Comparing Compressed Sequences for Faster Nucleotide BLAST Searches" IEEE/ACM Trans Computational Biology and Bioinformatics, vol. 4, no. 3, pp. 349-364, July-Sept. 2007. (Pubitemid 47393455)
    • (2007) IEEE/ACM Transactions on Computational Biology and Bioinformatics , vol.4 , Issue.3 , pp. 349-364
    • Cameron, M.1    Williams, H.E.2
  • 9
    • 84856423763
    • "The NCBI C Toolkit" ftp://ftp.ncbi.nih.gov/toolbox/ncbi-tools, 2011.
    • (2011) The NCBI C Toolkit
  • 10
    • 0036226603
    • BLAT-The blast-like alignment tool
    • W.J. Kent, "BLAT-the BLAST-Like Alignment Tool" Genome Research, vol. 12, no. 4, pp. 656-664, 2002.
    • (2002) Genome Research , vol.12 , Issue.4 , pp. 656-664
    • Kent, W.J.1
  • 12
    • 39549090389
    • SeqAn an efficient, generic C++ library for sequence analysis
    • article 11
    • A. Döring, D. Weese, T. Rausch, and K. Reinert, "SeqAn an Efficient, Generic C++ Library for Sequence Analysis" BMC Bioinformatics, vol. 9, article 11, 2008.
    • (2008) BMC Bioinformatics , vol.9
    • Döring, A.1    Weese, D.2    Rausch, T.3    Reinert, K.4
  • 13
    • 77951226627
    • The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
    • P.J.A. Cock, C.J. Fields, N. Goto, M.L. Heuer, and P.M. Rice, "The Sanger FASTQ file Format for Sequences with Quality Scores, and the Solexa/Illumina FASTQ variants" Nucleic Acids Research, vol. 38, pp. 1767-1771, 2010.
    • (2010) Nucleic Acids Research , vol.38 , pp. 1767-1771
    • Cock, P.J.A.1    Fields, C.J.2    Goto, N.3    Heuer, M.L.4    Rice, P.M.5
  • 15
    • 77953909503
    • The genomedata format for storing large-scale functional genomics data
    • M.M. Hoffman, O.J. Buske, and W.S. Noble, "The Genomedata Format for Storing Large-Scale Functional Genomics Data" Bioinformatics, vol. 26, no. 11, pp. 1458-1459, 2010.
    • (2010) Bioinformatics , vol.26 , Issue.11 , pp. 1458-1459
    • Hoffman, M.M.1    Buske, O.J.2    Noble, W.S.3
  • 16
    • 77955886068
    • G-SQZ: Compact encoding of genomic sequence and quality data
    • W. Tembe, J. Lowey, and E. Suh, "G-SQZ: Compact Encoding of Genomic Sequence and Quality Data" Bioinformatics, vol. 26, no. 17, pp. 2192-2194, 2010.
    • (2010) Bioinformatics , vol.26 , Issue.17 , pp. 2192-2194
    • Tembe, W.1    Lowey, J.2    Suh, E.3
  • 17
    • 79952580139
    • Compression of DNA sequence reads in FASTQ format
    • S. Deorowicz and S. Grabowski, "Compression of DNA Sequence Reads in FASTQ Format" Bioinformatics, vol. 27, no. 6, pp. 860-862, 2011.
    • (2011) Bioinformatics , vol.27 , Issue.6 , pp. 860-862
    • Deorowicz, S.1    Grabowski, S.2
  • 18
    • 28744458859
    • Bioconductor: Open software development for computational biology and bioinformatics
    • R.C. Gentleman et al., "Bioconductor: Open Software Development for Computational Biology and Bioinformatics" Genome Biology, vol. 5, p. R80, 2004.
    • (2004) Genome Biology , vol.5
    • Gentleman, R.C.1
  • 19
    • 70349866687
    • ShortRead: A Bioconductor package for input, quality assessment and exploration of high-throughput sequence data
    • M. Morgan, S. Anders, M. Lawrence, P. Aboyoun, H. Pagès, and R. Gentleman, "ShortRead: A Bioconductor Package for Input, Quality Assessment and Exploration of High-Throughput Sequence Data" Bioinformatics, vol. 25, no. 19, pp. 2607-2608, 2009.
    • (2009) Bioinformatics , vol.25 , Issue.19 , pp. 2607-2608
    • Morgan, M.1    Anders, S.2    Lawrence, M.3    Aboyoun, P.4    Pagès, H.5    Gentleman, R.6
  • 20
    • 84856505869
    • "GenomeTools C API" http://genometools.org/libgenometools.html, 2011.
    • (2011) GenomeTools C API
  • 21
    • 75849146491
    • Efficient estimation of pairwise distances between genomes
    • M. Domazet-Loso and B. Haubold, "Efficient Estimation of Pairwise Distances between Genomes" Bioinformatics, vol. 25, pp. 3221-3227, 2009.
    • (2009) Bioinformatics , vol.25 , pp. 3221-3227
    • Domazet-Loso, M.1    Haubold, B.2
  • 24
    • 39749179047
    • LTRharvest an Efficient and Flexible Software for de novo Detection of LTR Retrotransposons
    • article 18
    • D. Ellinghaus, S. Kurtz, and U. Willhoeft, "LTRharvest, an Efficient and Flexible Software for de novo Detection of LTR Retrotransposons" BMC Bioinformatics, vol. 9, article 18, 2008.
    • (2008) BMC Bioinformatics , vol.9
    • Ellinghaus, D.1    Kurtz, S.2    Willhoeft, U.3
  • 25
    • 56549086632
    • A new method to compute k-mer frequencies and its application to annotate large repetitive plant genomes
    • article 517
    • S. Kurtz, A. Narechania, J.C. Stein, and D. Ware, "A New Method to Compute K-mer Frequencies and Its Application to Annotate Large Repetitive Plant Genomes" BMC Genomics, vol. 9, article 517, 2008.
    • (2008) BMC Genomics , vol.9
    • Kurtz, S.1    Narechania, A.2    Stein, J.C.3    Ware, D.4
  • 26
    • 75349090212
    • Fine-grained annotation and classification of de novo predicted LTR retrotransposons
    • S. Steinbiss, U. Willhoeft, G. Gremme, and S. Kurtz, "Fine-Grained Annotation and Classification of de novo predicted LTR retrotransposons" Nucleic Acids Research, vol. 37, no. 21, pp. 7002-7013, 2009.
    • (2009) Nucleic Acids Research , vol.37 , Issue.21 , pp. 7002-7013
    • Steinbiss, S.1    Willhoeft, U.2    Gremme, G.3    Kurtz, S.4
  • 27
    • 79952117335
    • MetaGenomeThreader: A software tool for predicting genes in DNA-sequences of metagenome projects
    • W. Streit and R. Daniel, eds. Springer
    • D.J. Schmitz-Hübsch and S. Kurtz, "MetaGenomeThreader: A Software Tool for Predicting Genes in DNA-Sequences of Metagenome Projects" Metagenomics: Methods and Protocols, ser. Methods in Molecular Biology, W. Streit and R. Daniel, eds. Springer, 2010.
    • (2010) Metagenomics: Methods and Protocols, ser. Methods in Molecular Biology
    • Schmitz-Hübsch, D.J.1    Kurtz, S.2
  • 30
    • 84856505866
    • "The ISC License" http://www.isc.org/software/license, 2011.
    • (2011) The ISC License
  • 31
    • 84856449773
    • "Cygwin" http://www.cygwin.com, 2011.
    • (2011) Cygwin
  • 34
    • 0036202921
    • PatternHunter: Faster and more sensitive homology search
    • B. Ma, J. Tromp, and M. Li, "PatternHunter: Faster and More Sensitive Homology Search" Bioinformatics, vol. 18, no. 3, pp. 440-445, 2002. (Pubitemid 34284945)
    • (2002) Bioinformatics , vol.18 , Issue.3 , pp. 440-445
    • Ma, B.1    Tromp, J.2    Li, M.3
  • 37
    • 79551588525
    • ABMapper: A suffix array-based tool for multi-location searching and splice-junction mapping
    • S.-K. Lou, B. Ni, L.-Y. Lo, S.K.-W. Tsui, T.-F. Chan, and K.-S. Leung, "ABMapper: A Suffix Array-Based Tool for Multi-Location Searching and Splice-Junction Mapping" Bioinformatics, vol. 27, no. 3, pp. 421-422, 2011.
    • (2011) Bioinformatics , vol.27 , Issue.3 , pp. 421-422
    • Lou, S.-K.1    Ni, B.2    Lo, L.-Y.3    Tsui, S.K.-W.4    Chan, T.-F.5    Leung, K.-S.6
  • 41
    • 34547418487
    • SWIG: An easy to use tool for integrating scripting languages with C and C++
    • D.M. Beazley, "SWIG: An Easy to Use Tool for Integrating Scripting Languages with C and C++" Proc. Fourth Conf. USENIX Tcl/Tk Workshop, 1996.
    • (1996) Proc. Fourth Conf. USENIX Tcl/Tk Workshop
    • Beazley, D.M.1
  • 42
    • 0021919480
    • Rapid and sensitive protein similarity searches
    • D.J. Lipman and W.R. Pearson, "Rapid and Sensitive Protein Similarity Searches" Science, vol. 227, no. 4693, pp. 1435-1441, 1985. (Pubitemid 15146986)
    • (1985) Science , vol.227 , Issue.4693 , pp. 1435-1441
    • Lipman, D.J.1    Pearson, W.R.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.