SCOPUS 정보 검색 플랫폼

BMC Bioinformatics

Volumn 9, Issue , 2008, Pages

Compressing DNA sequence databases with coil

(2) White, W Timothy J a Hendy, Michael D a

a MASSEY UNIVERSITY (New Zealand)

Author keywords

[No Author keywords available]

Indexed keywords

DATA-COMMUNICATION; DNA SEQUENCE DATABASE; EXPONENTIAL RATES; EXPRESSED SEQUENCE TAGS; HIGH COMPRESSION RATIO; NARROW DISTRIBUTION; PORTABLE SOFTWARE; SEQUENCE DATABASE;

DATABASE SYSTEMS; DIGITAL STORAGE; DNA SEQUENCES; GENES;

DATA COMPRESSION;

DNA;

ARTICLE; BIOINFORMATICS; COMPUTER MEMORY; COMPUTER PROGRAM; DNA SEQUENCE; EXPRESSED SEQUENCE TAG; GENBANK; INFORMATION PROCESSING; PROCESS DESIGN; SEQUENCE DATABASE; ANIMAL; ARTIFICIAL NEURAL NETWORK; DATA BASE; HUMAN; METHODOLOGY; MOLECULAR EVOLUTION; NUCLEIC ACID DATABASE; PHYLOGENY; POINT MUTATION; SPECIES DIFFERENCE;

ANIMALS; DATA COMPRESSION; DATABASE MANAGEMENT SYSTEMS; DATABASES, NUCLEIC ACID; EVOLUTION, MOLECULAR; EXPRESSED SEQUENCE TAGS; HUMANS; NEURAL NETWORKS (COMPUTER); PHYLOGENY; POINT MUTATION; SEQUENCE ANALYSIS, DNA; SPECIES SPECIFICITY;

EID: 45249110222 PISSN: None EISSN: 14712105 Source Type: Journal
DOI: 10.1186/1471-2105-9-242 Document Type: Article

Times cited : (12)

References (38)

1
- 84874668040
- ftp://ftp.ncbi.nih.gov/genbank/release.notes/gb159.release.notes
- NCBI NCBI-GenBank Flat File Release 159 Release Notes ftp://ftp.ncbi.nih.gov/genbank/release.notes/gb159.release.notes

2
- 84874662915
- GenBank passes the 100 gigabase mark
- Benson D and Wheeler D NCBI News
- NCBI News GenBank Passes the 100 Gigabase Mark NCBI News Benson D and Wheeler D http://www.ncbi.nlm.nih.gov/Web/Newsltr/V14N2/100gig.html
- NCBI News

3
- 0017493286
- Universal Algorithm for Sequential Data Compression
- Ziv J Lempel A Universal Algorithm for Sequential Data Compression IEEE Transactions on Information Theory 1977 23 337-343
- (1977) IEEE Transactions on Information Theory , vol.23 , pp. 337-343
- Ziv, J.¹ Lempel, A.²

4
- 84886762610
- Gailly J Adler M gzip (GNU zip) compression utility http://www.gnu.org/ software/gzip/
- Gailly, J.¹ Adler, M.²

5
- 0034578442
- Biological sequence compression algorithms: December 18-19; Tokyo
- Universal Academy Press Universal Academy Press
- Matsumoto T Sadakane K Imai H Biological sequence compression algorithms: December 18-19; Tokyo. Universal Academy Press 2000 43-52
- (2000) , pp. 43-52
- Matsumoto, T.¹ Sadakane, K.² Imai, H.³

6
- 84925291641
- Compression of DNA sequences: 30 March-2 April; Snowbird, Utah
- IEEE Computer Society Press Storer JA and Cohn M
- Grumbach S Tahi F Compression of DNA sequences: 30 March-2 April; Snowbird, Utah. IEEE Computer Society Press Storer JA and Cohn M 1993 340-350
- (1993) , pp. 340-350
- Grumbach, S.¹ Tahi, F.²

7
- 0000100455
- A New Challenge for Compression Algorithms - Genetic Sequences
- Grumbach S Tahi F A New Challenge for Compression Algorithms - Genetic Sequences Inf Process Manage 1994 30 875-886
- (1994) Inf Process Manage , vol.30 , pp. 875-886
- Grumbach, S.¹ Tahi, F.²

8
- 0034890615
- A compression algorithm for DNA sequences
- Chen X Kwong S Li M A compression algorithm for DNA sequences IEEE Engineering in Medicine and Biology Magazine 2001 20 61-66
- (2001) IEEE Engineering in Medicine and Biology Magazine , vol.20 , pp. 61-66
- Chen, X.¹ Kwong, S.² Li, M.³

9
- 0036947893
- DNACompress: Fast and effective DNA sequence compression
- 12490460
- Chen X Li M Ma B Tromp J DNACompress: Fast and effective DNA sequence compression Bioinformatics 2002 18 1696-1698 12490460
- (2002) Bioinformatics , vol.18 , pp. 1696-1698
- Chen, X.¹ Li, M.² Ma, B.³ Tromp, J.⁴

10
- 0035102453
- An information-based sequence distance and its application to whole mitochondrial genome phylogeny
- 11238070
- Li M Badger JH Chen X Kwong S Kearney P Zhang HY An information-based sequence distance and its application to whole mitochondrial genome phylogeny Bioinformatics 2001 17 149-154 11238070
- (2001) Bioinformatics , vol.17 , pp. 149-154
- Li, M.¹ Badger, J.H.² Chen, X.³ Kwong, S.⁴ Kearney, P.⁵ Zhang, H.Y.⁶

11
- 32544454688
- Application of compression-based distance measures to protein sequence classification: A methodological study
- 16317070
- Kocsor A Kertesz-Farkas A Kajan L Pongor S Application of compression-based distance measures to protein sequence classification: a methodological study Bioinformatics 2006 22 407-412 16317070
- (2006) Bioinformatics , vol.22 , pp. 407-412
- Kocsor, A.¹ Kertesz-Farkas, A.² Kajan, L.³ Pongor, S.⁴

12
- 0036202921
- PatternHunter: Faster and more sensitive homology search
- 11934743
- Ma B Tromp J Li M PatternHunter: Faster and more sensitive homology search Bioinformatics 2002 18 440-445 11934743
- (2002) Bioinformatics , vol.18 , pp. 440-445
- Ma, B.¹ Tromp, J.² Li, M.³

13
- 0028826043
- Compression of protein-sequence databases
- 8590180
- Strelets VB Lim HA Compression of Protein-Sequence Databases Comput Appl Biosci 1995 11 557-561 8590180
- (1995) Comput Appl Biosci , vol.11 , pp. 557-561
- Strelets, V.B.¹ Lim, H.A.²

14
- 0037246063
- The protein information resource
- 165487 12520019
- Wu CH Yeh LSL Huang HZ Arminski L Castro-Alvear J Chen YX Hu ZZ Kourtesis P Ledley RS Suzek BE Vinayaka CR Zhang J Barker WC The Protein Information Resource Nucleic Acids Res 2003 31 345-347 165487 12520019
- (2003) Nucleic Acids Res , vol.31 , pp. 345-347
- Wu, C.H.¹ Yeh, L.S.L.² Huang, H.Z.³ Arminski, L.⁴ Castro-Alvear, J.⁵ Chen, Y.X.⁶ Hu, Z.Z.⁷ Kourtesis, P.⁸ Ledley, R.S.⁹ Suzek, B.E.¹⁰ Vinayaka, C.R.¹¹ Zhang, J.¹² Barker, W.C.¹³

15
- 84874651569
- PKZIP
- Milwaukee, WI, USA, PKWARE, Inc. 1.1
- Katz P PKZIP Milwaukee, WI, USA, PKWARE, Inc. 1.1 1990 http://www.pkware.com/
- (1990)
- Katz, P.¹

16
- 0035072551
- Clustering of highly homologous sequences to reduce the size of large protein databases
- 11294794
- Li WZ Jaroszewski L Godzik A Clustering of highly homologous sequences to reduce the size of large protein databases Bioinformatics 2001 17 282-283 11294794
- (2001) Bioinformatics , vol.17 , pp. 282-283
- Li, W.Z.¹ Jaroszewski, L.² Godzik, A.³

17
- 0036169928
- Tolerating some redundancy significantly speeds up clustering of large protein databases
- 11836214
- Li WZ Jaroszewski L Godzik A Tolerating some redundancy significantly speeds up clustering of large protein databases Bioinformatics 2002 18 77-82 11836214
- (2002) Bioinformatics , vol.18 , pp. 77-82
- Li, W.Z.¹ Jaroszewski, L.² Godzik, A.³

18
- 33745634395
- Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences
- 16731699
- Li WZ Godzik A Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences Bioinformatics 2006 22 1658-1659 16731699
- (2006) Bioinformatics , vol.22 , pp. 1658-1659
- Li, W.Z.¹ Godzik, A.²

19
- 84886758809
- nrdb http://blast.wustl.edu/pub/nrdb/

20
- 0027968068
- Clustal-W - Improving the Sensitivity of Progressive Multiple Sequence Alignment through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice
- 308517 7984417
- Thompson JD Higgins DG Gibson TJ Clustal-W - Improving the Sensitivity of Progressive Multiple Sequence Alignment through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice Nucleic Acids Res 1994 22 4673-4680 308517 7984417
- (1994) Nucleic Acids Res , vol.22 , pp. 4673-4680
- Thompson, J.D.¹ Higgins, D.G.² Gibson, T.J.³

21
- 0003081588
- The Steiner problem in phylogeny is NP-complete
- Foulds LR Graham RL The Steiner problem in phylogeny is NP-complete Advances in Applied Mathematics 1982 3 43-49
- (1982) Advances in Applied Mathematics , vol.3 , pp. 43-49
- Foulds, L.R.¹ Graham, R.L.²

22
- 0012574730
- A minimum spanning tree algorithm with Inverse Ackermann type complexity
- Chazelle B A minimum spanning tree algorithm with Inverse Ackermann type complexity Journal of the ACM 2000 47 1028-1047
- (2000) Journal of the ACM , vol.47 , pp. 1028-1047
- Chazelle, B.¹

23
- 30544432152
- Indexing compressed text
- Ferragina P Manzini G Indexing compressed text J ACM 2005 52 552-581
- (2005) J ACM , vol.52 , pp. 552-581
- Ferragina, P.¹ Manzini, G.²

24
- 33750283805
- A compressed self-index using a Ziv-Lempel dictionary
- Berlin, SPRINGER-VERLAG BERLIN
- Russo LMS Oliveira AL A compressed self-index using a Ziv-Lempel dictionary String Processing and Information Retrieval, Proceedings Berlin, SPRINGER-VERLAG BERLIN Lecture Notes in Computer Science 2006 4209 163-180
- (2006) String Processing and Information Retrieval, Proceedings , vol.4209 , pp. 163-180
- Russo, L.M.S.¹ Oliveira, A.L.²

25
- 33846576963
- When indexing equals compression: Experiments with compressing suffix arrays and applications
- Foschini L Grossi R Gupta A Vitter JS When indexing equals compression: Experiments with compressing suffix arrays and applications ACM Trans Algorithms 2006 2 611-639
- (2006) ACM Trans Algorithms , vol.2 , pp. 611-639
- Foschini, L.¹ Grossi, R.² Gupta, A.³ Vitter, J.S.⁴

26
- 0021919480
- Rapid and sensitive protein similarity searches
- 2983426
- Lipman DJ Pearson WR Rapid and Sensitive Protein Similarity Searches Science 1985 227 1435-1441 2983426
- (1985) Science , vol.227 , pp. 1435-1441
- Lipman, D.J.¹ Pearson, W.R.²

27
- 77954935024
- bzip2 and libbzip2 - A program and library for data compression
- 1.0.3
- Seward J bzip2 and libbzip2 - A program and library for data compression 1.0.3 1997
- (1997)
- Seward, J.¹

28
- 0017492836
- A fast algorithm for computing longest common subsequences
- Hunt JW Szymanski TG A Fast Algorithm for Computing Longest Common Subsequences Communications of the ACM 1977 20 350-353
- (1977) Communications of the ACM , vol.20 , pp. 350-353
- Hunt, J.W.¹ Szymanski, T.G.²

29
- 0034764307
- SSAHA: A fast search method for large DNA databases
- 311141 11591649
- Ning ZM Cox AJ Mullikin JC SSAHA: A fast search method for large DNA databases Genome Res 2001 11 1725-1729 311141 11591649
- (2001) Genome Res , vol.11 , pp. 1725-1729
- Ning, Z.M.¹ Cox, A.J.² Mullikin, J.C.³

30
- 45249103790
- One-gapped q-gram filters for Levenshtein distance
- Berlin, SPRINGER-VERLAG BERLIN Lecture Notes in Computer Science
- Burkhardt S Karkkainen J One-gapped q-gram filters for Levenshtein distance Combinatorial Pattern Matching Berlin, SPRINGER-VERLAG BERLIN Lecture Notes in Computer Science 2002 2373 225-234
- (2002) Combinatorial Pattern Matching , vol.2373 , pp. 225-234
- Burkhardt, S.¹ Karkkainen, J.²

31
- 70350674995
- On the shortest spanning subtree of a graph and the traveling salesman problem
- Kruskal JB Jr. On the Shortest Spanning Subtree of a Graph and the Traveling Salesman Problem Proceedings of the American Mathematical Society 1956 7 48-50
- (1956) Proceedings of the American Mathematical Society , vol.7 , pp. 48-50
- Kruskal Jr., J.B.¹

32
- 84911584312
- Shortest connection networks and some generalizations
- Prim RC Shortest Connection Networks and Some Generalizations Bell System Technical Journal 1957 36 1389-1401
- (1957) Bell System Technical Journal , vol.36 , pp. 1389-1401
- Prim, R.C.¹

33
- 0003827295
- Algorithms from P to NP: Design and efficiency
- Redwood City, CA, Benjamin/Cummings
- Moret B Shapiro H Algorithms from P to NP: Design and Efficiency Redwood City, CA, Benjamin/Cummings 1991
- (1991)
- Moret, B.¹ Shapiro, H.²

34
- 0016495233
- Efficiency of a good but not linear set union algorithm
- Tarjan RE Efficiency of a Good but Not Linear Set Union Algorithm J ACM 1975 22 215-225
- (1975) J ACM , vol.22 , pp. 215-225
- Tarjan, R.E.¹

35
- 33745128489
- An O(ND) difference algorithm and its variations
- Myers EW An O(ND) Difference Algorithm and its Variations Algorithmica 1986 1 251-266
- (1986) Algorithmica , vol.1 , pp. 251-266
- Myers, E.W.¹

36
- 84886755999
- GenBank Sequence Database http://www.ncbi.nlm.nih.gov/Genbank/index.html

37
- 84948416199
- Shkarin D PPM: One Step to Practicality 2002 202-211
- (2002) PPM: One Step to Practicality , pp. 202-211
- Shkarin, D.¹

38
- 84886764769
- 7-Zip http://www.7-zip.org

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.