-
1
-
-
79951493627
-
On the future of genomic data
-
Kahn SD. On the future of genomic data. Science 2011; 331(6018): 728-729.
-
(2011)
Science
, vol.331
, Issue.6018
, pp. 728-729
-
-
Kahn, S.D.1
-
2
-
-
77952586314
-
The 1000 Genomes Project: New opportunities for research and social challenges
-
doi: 10. 1186/gm124
-
Via M, Gignoux C, Burchard EG. The 1000 Genomes Project: new opportunities for research and social challenges. Genome Med 2010; 2(1): 3. doi: 10. 1186/gm124.
-
(2010)
Genome Med
, vol.2
, Issue.1
, pp. 3
-
-
Via, M.1
Gignoux, C.2
Burchard, E.G.3
-
3
-
-
77951115122
-
International network of cancer genome projects
-
Hudson TJ, Anderson W, Artez A, et al. International network of cancer genome projects. Nature 2010; 464(7291): 993-998.
-
(2010)
Nature
, vol.464
, Issue.7291
, pp. 993-998
-
-
Hudson, T.J.1
Anderson, W.2
Artez, A.3
-
4
-
-
79951811877
-
Big data, but are we ready?
-
Trelles O, Prins P, Snir M, et al. Big data, but are we ready? Nat Rev Genet 2011; 12(3): 224.
-
(2011)
Nat Rev Genet
, vol.12
, Issue.3
, pp. 224
-
-
Trelles, O.1
Prins, P.2
Snir, M.3
-
6
-
-
77954526823
-
The case for cloud computing in genome informatics
-
Stein L. The case for cloud computing in genome informatics. Genome Biol 2010, 11(5): 207.
-
(2010)
Genome Biol
, vol.11
, Issue.5
, pp. 207
-
-
Stein, L.1
-
7
-
-
84890318823
-
Data management challenges in next generation sequencing
-
Wandelt S, Rheinländer A, Bux M, Thalheim L, Haldemann B, Leser U. Data management challenges in next generation sequencing. Datenbank-Spektrum 2012, 12(3): 161-171.
-
(2012)
Datenbank-Spektrum
, vol.12
, Issue.3
, pp. 161-171
-
-
Wandelt, S.1
Rheinländer, A.2
Bux, M.3
Thalheim, L.4
Haldemann, B.5
Leser, U.6
-
8
-
-
0000100455
-
A new challenge for compression algorithms: Genetic sequences
-
Grumbach S, Tahi F. A new challenge for compression algorithms: genetic sequences. Inform Process Manag 1994, 30(6): 875-886.
-
(1994)
Inform Process Manag
, vol.30
, Issue.6
, pp. 875-886
-
-
Grumbach, S.1
Tahi, F.2
-
12
-
-
0021405335
-
Data compression using adaptive coding and partial string matching
-
Cleary JG, Witten IH. Data compression using adaptive coding and partial string matching. IEEE T Comm 1984, 32: 396-402.
-
(1984)
IEEE T Comm
, vol.32
, pp. 396-402
-
-
Cleary, J.G.1
Witten, I.H.2
-
13
-
-
34547630480
-
A simple statistical algorithm for biological sequence compression
-
Cao MD, Dix TI, Allison L, Mears C. A simple statistical algorithm for biological sequence compression. In Proceedings of the 2007 Conference on Data Compression, DCC'07, pages 43-52.
-
(2007)
Proceedings of the Conference on Data Compression, DCC'07
, pp. 43-52
-
-
Cao, M.D.1
Dix, T.I.2
Allison, L.3
Mears, C.4
-
17
-
-
0017493286
-
A universal algorithm for sequential data compression
-
Ziv J, Lempel A. A universal algorithm for sequential data compression. IEEE T Inform Theory 1977, 23(3): 337-343.
-
(1977)
IEEE T Inform Theory
, vol.23
, Issue.3
, pp. 337-343
-
-
Ziv, J.1
Lempel, A.2
-
18
-
-
84930881609
-
Run-length encodings
-
Golomb SW. Run-length encodings. IEEE T Inform Theory 1966, 12: 399-401.
-
(1966)
IEEE T Inform Theory
, vol.12
, pp. 399-401
-
-
Golomb, S.W.1
-
19
-
-
79955714647
-
On the usefulness of fibonacci compression codes
-
Klein ST, Ben-Nissan MK. On the usefulness of fibonacci compression codes. Computer J 2010, 53(6): 701-716.
-
(2010)
Computer J
, vol.53
, Issue.6
, pp. 701-716
-
-
Klein, S.T.1
Ben-Nissan, M.K.2
-
20
-
-
84938015047
-
A method for the construction of minimumredundancy codes
-
Huffman DA. A method for the construction of minimumredundancy codes. Proceedings of the Institute of Radio Engineers 1952, 40(9): 1098-1101.
-
(1952)
Proceedings of the Institute of Radio Engineers
, vol.40
, Issue.9
, pp. 1098-1101
-
-
Huffman, D.A.1
-
21
-
-
0041619492
-
Is huffman coding dead?
-
Bookstein A, Klein ST. Is huffman coding dead? Computing 1993, 50: 279-296.
-
(1993)
Computing
, vol.50
, pp. 279-296
-
-
Bookstein, A.1
Klein, S.T.2
-
24
-
-
0023536787
-
Data compression using dynamic markov modelling
-
Cormack G, Horspool N. Data compression using dynamic markov modelling. Comput J 1987, 30: 541-550.
-
(1987)
Comput J
, vol.30
, pp. 541-550
-
-
Cormack, G.1
Horspool, N.2
-
26
-
-
79952580139
-
Compression of dna sequence reads in fastq format
-
Deorowicz S, Grabowski S. Compression of dna sequence reads in fastq format. Bioinformatics 2011, 27(6): 860-862.
-
(2011)
Bioinformatics
, vol.27
, Issue.6
, pp. 860-862
-
-
Deorowicz, S.1
Grabowski, S.2
-
27
-
-
79952395270
-
Cancer genomics: From discovery science to personalized medicine
-
Chin L, Andersen JN, Futreal PA. Cancer genomics: from discovery science to personalized medicine. Nat Med 2011, 17(3): 297-303.
-
(2011)
Nat Med
, vol.17
, Issue.3
, pp. 297-303
-
-
Chin, L.1
Andersen, J.N.2
Futreal, P.A.3
-
28
-
-
84868670481
-
Adaptive efficient compression of genomes
-
Wandelt S, Leser U. Adaptive efficient compression of genomes. Algorithm Mol Bio 2012, 7: 30.
-
(2012)
Algorithm Mol Bio
, vol.7
, pp. 30
-
-
Wandelt, S.1
Leser, U.2
-
29
-
-
84894514001
-
FRESCO: Referential Compression of Highly-Similar sequences
-
(to appear)
-
Wandelt S and Leser U. FRESCO: Referential Compression of Highly-Similar sequences. IEEE ACM T Comput Bi 2013, (to appear).
-
(2013)
IEEE ACM T Comput Bi
-
-
Wandelt, S.1
Leser, U.2
-
30
-
-
0037805644
-
Biotechnological prospects from metagenomics
-
Schloss P, Handelsman J. Biotechnological prospects from metagenomics. Curr Opin Biotechn 2003, 14(3): 303-310.
-
(2003)
Curr Opin Biotechn
, vol.14
, Issue.3
, pp. 303-310
-
-
Schloss, P.1
Handelsman, J.2
-
31
-
-
84890458860
-
Differential direct coding: A compression algorithm for nucleotide sequence data
-
Vey G. Differential direct coding: a compression algorithm for nucleotide sequence data. The Journal of Biological Databases and Curation 2009.
-
(2009)
The Journal of Biological Databases and Curation
-
-
Vey, G.1
-
33
-
-
84881510889
-
A biological sequence compression based on cross chromosomal similarities using variable length lut
-
Bharti RK, Verma A, Singh RK. A biological sequence compression based on cross chromosomal similarities using variable length lut. Intl J Biomet Bioinform 2011, 4: 217-223.
-
(2011)
Intl J Biomet Bioinform
, vol.4
, pp. 217-223
-
-
Bharti, R.K.1
Verma, A.2
Singh, R.K.3
-
36
-
-
84877943508
-
An efficient horizontal and vertical method for online dna sequence compression
-
Mishra KN, Aaggarwal A, Abdelhadi E, et al. An efficient horizontal and vertical method for online dna sequence compression. Intl J Comput Appl 2010, 3(1): 39-46.
-
(2010)
Intl J Comput Appl
, vol.3
, Issue.1
, pp. 39-46
-
-
Mishra, K.N.1
Aaggarwal, A.2
Abdelhadi, E.3
-
37
-
-
79959701435
-
Dnabit compress-genome compression algorithm
-
Rajeswari P, Apparao A. Dnabit compress-genome compression algorithm. Bioinformation 2011, 5(8): 350-60.
-
(2011)
Bioinformation
, vol.5
, Issue.8
, pp. 350-360
-
-
Rajeswari, P.1
Apparao, A.2
-
38
-
-
81455132689
-
Iterative dictionary construction for compression of large dna data sets
-
Kuruppu S, Beresford-Smith B, Conway T, et al. Iterative dictionary construction for compression of large dna data sets. IEEE ACM T Comput Bi 2012, 9(1): 137-149.
-
(2012)
IEEE ACM T Comput Bi
, vol.9
, Issue.1
, pp. 137-149
-
-
Kuruppu, S.1
Beresford-Smith, B.2
Conway, T.3
-
41
-
-
34547630480
-
A simple statistical algorithm for biological sequence compression
-
Cao MD, Dix TI, Allison L, et al. A simple statistical algorithm for biological sequence compression. In Proceedings of the 2007 Conference on Data Compression, DCC'07, pages 43-52.
-
(2007)
Proceedings of the Conference on Data Compression, DCC'07
, pp. 43-52
-
-
Cao, M.D.1
Dix, T.I.2
Allison, L.3
-
42
-
-
80052957011
-
Compressing the human genome using exclusively markov models
-
Pratas D, Pinho AJ. Compressing the human genome using exclusively markov models. Adv Intel Soft Comput 2011; 213-220.
-
(2011)
Adv Intel Soft Comput
, pp. 213-220
-
-
Pratas, D.1
Pinho, A.J.2
-
43
-
-
61449196761
-
-
chapter 14, Springer
-
Venugopal KR, Srinivasa KG, Patnaik L. Probabilistic Approach for DNA Compression, chapter 14, pages 279-289. Springer, 2009.
-
(2009)
Probabilistic Approach for DNA Compression
, pp. 279-289
-
-
Venugopal, K.R.1
Srinivasa, K.G.2
Patnaik, L.3
-
46
-
-
67649855126
-
Data structures and compression algorithms for genomic sequence data
-
Brandon MC, Wallace DC, Baldi P. Data structures and compression algorithms for genomic sequence data. Bioinformatics 2009, 25(14): 1731-1738.
-
(2009)
Bioinformatics
, vol.25
, Issue.14
, pp. 1731-1738
-
-
Brandon, M.C.1
Wallace, D.C.2
Baldi, P.3
-
47
-
-
58349097721
-
Human genomes as email attachments
-
Christley S, Lu Y, Li C, et al. Human genomes as email attachments. Bioinformatics 2009, 25(2): 274-275.
-
(2009)
Bioinformatics
, vol.25
, Issue.2
, pp. 274-275
-
-
Christley, S.1
Lu, Y.2
Li, C.3
-
48
-
-
79954595666
-
A novel compression tool for efficient storage of genome resequencing data
-
Wang C, Zhang D. A novel compression tool for efficient storage of genome resequencing data. Nucleic Acids Res 2011, 39(7): e45.
-
(2011)
Nucleic Acids Res
, vol.39
, Issue.7
-
-
Wang, C.1
Zhang, D.2
-
52
-
-
84873187741
-
Green: A tool for efficient compression of genome resequencing data
-
Pinho AJ, Pratas D, Garcia SP. Green: a tool for efficient compression of genome resequencing data. Nucleic Acids Res 2011.
-
(2011)
Nucleic Acids Res
-
-
Pinho, A.J.1
Pratas, D.2
Garcia, S.P.3
-
53
-
-
80053956723
-
Dna data compression based on the whole genome sequence
-
Kim JD, Kim JH. Dna data compression based on the whole genome sequence. J Convergence Inform Technol 2009, 4(3): 82-85.
-
(2009)
J Convergence Inform Technol
, vol.4
, Issue.3
, pp. 82-85
-
-
Kim, J.D.1
Kim, J.H.2
-
60
-
-
43149107930
-
Quality scores and SNP detection in sequencing-by-synthesis systems
-
BrockmanW, AlvarezP, Young S, et al. Quality scores and SNP detection in sequencing-by-synthesis systems. Genome Res 2008, 18(5): 763-770.
-
(2008)
Genome Res
, vol.18
, Issue.5
, pp. 763-770
-
-
Brockman, W.1
Alvarez, P.2
Young, S.3
-
62
-
-
45649084526
-
Data compression and genomes: A two-dimensional life domain map
-
Menconi G, Benci V, Buiatti M. Data compression and genomes: a two-dimensional life domain map. J Theor Biol 2008, 253(2): 281-288.
-
(2008)
J Theor Biol
, vol.253
, Issue.2
, pp. 281-288
-
-
Menconi, G.1
Benci, V.2
Buiatti, M.3
-
63
-
-
80054701916
-
Dna sequence compression using adaptive particle swarm optimization-based memetic algorithm
-
Zhu Z, Zhou J, Ji Z, et al. Dna sequence compression using adaptive particle swarm optimization-based memetic algorithm. IEEE T Evolut Comput 2011, 15(5): 643-658.
-
(2011)
IEEE T Evolut Comput
, vol.15
, Issue.5
, pp. 643-658
-
-
Zhu, Z.1
Zhou, J.2
Ji, Z.3
-
64
-
-
84862938045
-
No-reference compression of genomic data stored in fastq format
-
Bhola V, Bopardikar A, Narayanan R, et al. No-reference compression of genomic data stored in fastq format. In Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine, BIBM'11, pages 147-150.
-
(2011)
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, BIBM'11
, pp. 147-150
-
-
Bhola, V.1
Bopardikar, A.2
Narayanan, R.3
-
65
-
-
84869232795
-
Transformations for the compression of fastq quality scores of next generation sequencing data
-
Wan R, Anh VN, Asai K. Transformations for the compression of fastq quality scores of next generation sequencing data. Bioinformatics 2011.
-
(2011)
Bioinformatics
-
-
Wan, R.1
Anh, V.N.2
Asai, K.3
-
66
-
-
77955886068
-
G-sqz: Compact encoding of genomic sequence and quality data
-
Tembe W, Lowey J, SuhE. G-sqz: compact encoding of genomic sequence and quality data. Bioinformatics 2010, 26(17): 2192-2194.
-
(2010)
Bioinformatics
, vol.26
, Issue.17
, pp. 2192-2194
-
-
Tembe, W.1
Lowey, J.2
Suh, E.3
-
67
-
-
84873027492
-
Integrating human genome database into electronic health record with sequence alignment and compression mechanism
-
Chen WH, Lu YW, Lai FP, et al. Integrating human genome database into electronic health record with sequence alignment and compression mechanism. J Med Syst 2011, 36(3): 2587-2597.
-
(2011)
J Med Syst
, vol.36
, Issue.3
, pp. 2587-2597
-
-
Chen, W.H.1
Lu, Y.W.2
Lai, F.P.3
-
68
-
-
77957765256
-
Data structures and compression algorithms for high throughput sequencing technologies
-
Daily K, Rigor R, Christley S, et al. Data structures and compression algorithms for high throughput sequencing technologies. BMC Bioinformatics 2010, 11(1): 514+.
-
(2010)
BMC Bioinformatics
, vol.11
, Issue.1
, pp. 514
-
-
Daily, K.1
Rigor, R.2
Christley, S.3
-
69
-
-
78650275807
-
Compressing genomic sequence fragments using slimgene
-
Kozanitis C, Saunders C, Kruglyak S, et al. Compressing genomic sequence fragments using slimgene. In Proceedings of the 14th Annual International Conference on Research in Computational Molecular Biology, RECOMB'10, pages 310-324.
-
Proceedings of the 14th Annual International Conference on Research in Computational Molecular Biology, RECOMB'10
, pp. 310-324
-
-
Kozanitis, C.1
Saunders, C.2
Kruglyak, S.3
-
70
-
-
79955554401
-
Efficient storage of high throughput dna sequencing data using reference-based compression
-
Fritz MH, Leinonen R, Cochrane G, et al. Efficient storage of high throughput dna sequencing data using reference-based compression. Genome Res 2011, 21(5): 734-740.
-
(2011)
Genome Res
, vol.21
, Issue.5
, pp. 734-740
-
-
Fritz, M.H.1
Leinonen, R.2
Cochrane, G.3
-
71
-
-
84891054664
-
RCSI: Scalable similarity search in thousand(s) of genomes
-
Wandelt S, Starlinger J, Bux M, et al. RCSI: Scalable similarity search in thousand(s) of genomes. In Proceedings of the VLDB Endowment 2013 (PVLDB), Vol. 6, No 13.
-
(2013)
Proceedings of the VLDB Endowment (PVLDB)
, vol.6
, Issue.13
-
-
Wandelt, S.1
Starlinger, J.2
Bux, M.3
|