Volume 6, Issue 6, 2011, Pages

On the representability of complete genomes by multiple competing finite-context (Markov) models

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; DATA ANALYSIS; DNA DETERMINATION; DNA SEQUENCE; GENOME; GENOME ANALYSIS; MARKOV MODEL; MOLECULAR MODEL; NUCLEOTIDE SEQUENCE; PROBABILITY; STATISTICAL ANALYSIS; STATISTICAL MODEL; ANIMAL; BIOLOGY; GENETICS; HUMAN; METHODOLOGY

EID: 79959722141     PISSN: None     EISSN: 19326203     Source Type: Journal    
DOI: 10.1371/journal.pone.0021588     Document Type: Article
Times cited : (51)

References (47)
  • 5
    • 34547630306
    • DNA sequence compression using the normalized maximum likelihood model for discrete regression
    • Snowbird, Utah
    • Tabus I, Korodi G, Rissanen J, (2003) DNA sequence compression using the normalized maximum likelihood model for discrete regression. In: Proc. of the Data Compression Conf., DCC-2003. Snowbird, Utah. pp. 253-262.
    • (2003) Proc. of the Data Compression Conf., DCC-2003 , pp. 253-262
    • Tabus, I.1    Korodi, G.2    Rissanen, J.3
  • 7
    • 13844281512
    • An efficient normalized maximum likelihood algorithm for DNA sequence compression
    • Korodi G, Tabus I, (2005) An efficient normalized maximum likelihood algorithm for DNA sequence compression. ACM Trans on Information Systems 23: 3-34.
    • (2005) ACM Trans on Information Systems , vol.23 , pp. 3-34
    • Korodi, G.1    Tabus, I.2
  • 8
    • 26444479436
    • DNA compression challenge revisited
    • Jeju Island, Korea, Springer-Verlag, LNCS
    • Behzadi B, Le Fessant F, (2005) DNA compression challenge revisited. In: Combinatorial Pattern Matching: Proc. of CPM-2005, Jeju Island, Korea. Springer-Verlag, volume 3537 of LNCS, pp. 190-200.
    • (2005) Combinatorial Pattern Matching: Proc. of CPM-2005 , vol.3537 , pp. 190-200
    • Behzadi, B.1    Le Fessant, F.2
  • 9
    • 34547635395
    • Normalized maximum likelihood model of order-1 for the compression of DNA sequences
    • Snowbird, Utah
    • Korodi G, Tabus I, (2007) Normalized maximum likelihood model of order-1 for the compression of DNA sequences. In: Proc. of the Data Compression Conf., DCC-2007. Snowbird, Utah. pp. 33-42.
    • (2007) Proc. of the Data Compression Conf., DCC-2007 , pp. 33-42
    • Korodi, G.1    Tabus, I.2
  • 11
    • 67649170975
    • Textual data compression in computational biology: a synopsis
    • Giancarlo R, Scaturro D, Utro F, (2009) Textual data compression in computational biology: a synopsis. Bioinformatics 25: 1575-1586.
    • (2009) Bioinformatics , vol.25 , pp. 1575-1586
    • Giancarlo, R.1    Scaturro, D.2    Utro, F.3
  • 12
    • 0017493286
    • A universal algorithm for sequential data compression
    • Ziv J, Lempel A, (1977) A universal algorithm for sequential data compression. IEEE Trans on Information Theory 23: 337-343.
    • (1977) IEEE Trans on Information Theory , vol.23 , pp. 337-343
    • Ziv, J.1    Lempel, A.2
  • 13
    • 0002916795
    • Statistical patterns in primary structures of the functional regions of the genome in Escherichia coli: I. Frequency characteristics
    • Borodovsky MY, Sprizhitsky YA, Golovanov EI, Aleksandrov AA, (1986) Statistical patterns in primary structures of the functional regions of the genome in Escherichia coli: I. Frequency characteristics. Molecular Biology 20: 823-833.
    • (1986) Molecular Biology , vol.20 , pp. 823-833
    • Borodovsky, M.Y.1    Sprizhitsky, Y.A.2    Golovanov, E.I.3    Aleksandrov, A.A.4
  • 14
    • 0002918043
    • Statistical patterns in primary structures of the functional regions of the genome in Escherichia coli: II. Nonuniform Markov models
    • Borodovsky MY, Sprizhitsky YA, Golovanov EI, Aleksandrov AA, (1986) Statistical patterns in primary structures of the functional regions of the genome in Escherichia coli: II. Nonuniform Markov models. Molecular Biology 20: 833-840.
    • (1986) Molecular Biology , vol.20 , pp. 833-840
    • Borodovsky, M.Y.1    Sprizhitsky, Y.A.2    Golovanov, E.I.3    Aleksandrov, A.A.4
  • 15
    • 0024584990
    • Codon preference and primary sequence structure in protein-coding regions
    • Tavaré S, Song B, (1989) Codon preference and primary sequence structure in protein-coding regions. Bulletin of Mathematical Biology 51: 95-115.
    • (1989) Bulletin of Mathematical Biology , vol.51 , pp. 95-115
    • Tavaré, S.1    Song, B.2
  • 16
    • 0000241874
    • GENMARK: Parallel gene recognition for both DNA strands
    • Borodovsky MY, McIninch J, (1993) GENMARK: Parallel gene recognition for both DNA strands. Computers & Chemistry 17: 123-133.
    • (1993) Computers & Chemistry , vol.17 , pp. 123-133
    • Borodovsky, M.Y.1    McIninch, J.2
  • 19
    • 2942527473
    • Gene prediction with a hidden Markov model and a new intron submodel
    • Stanke M, Waack S, (2003) Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics 19: ii215-ii225.
    • (2003) Bioinformatics , vol.19 , pp. ii215-ii225
    • Stanke, M.1    Waack, S.2
  • 22
    • 78651326786
    • FragGeneScan: predicting genes in short and error-prone reads
    • Rho M, Tang H, Ye Y, (2010) FragGeneScan: predicting genes in short and error-prone reads. Nucleic Acids Research.
    • (2010) Nucleic Acids Research
    • Rho, M.1    Tang, H.2    Ye, Y.3
  • 23
    • 0042622244
    • SIC: a tool to detect short inverted segments in a biological sequence
    • Robelin D, Richard H, Prum B, (2003) SIC: a tool to detect short inverted segments in a biological sequence. Nucleic Acids Research 31: 3669-3671.
    • (2003) Nucleic Acids Research , vol.31 , pp. 3669-3671
    • Robelin, D.1    Richard, H.2    Prum, B.3
  • 24
    • 0041620241
    • SPA: simple web tool to assess statistical significance of DNA patterns
    • Richard H, Nuel G, (2003) SPA: simple web tool to assess statistical significance of DNA patterns. Nucleic Acids Research 31: 3679-3681.
    • (2003) Nucleic Acids Research , vol.31 , pp. 3679-3681
    • Richard, H.1    Nuel, G.2
  • 26
    • 0018015137
    • Modeling by shortest data description
    • Rissanen J, (1978) Modeling by shortest data description. Automatica 14: 465-471.
    • (1978) Automatica , vol.14 , pp. 465-471
    • Rissanen, J.1
  • 28
    • 45149113022
    • Comparative analysis of long DNA sequences by per element information content using different contexts
    • Dix TI, Powell DR, Allison L, Bernal J, Jaeger S, et al. (2007) Comparative analysis of long DNA sequences by per element information content using different contexts. BMC Bioinformatics 8: S10.
    • (2007) BMC Bioinformatics , vol.8
    • Dix, T.I.1    Powell, D.R.2    Allison, L.3    Bernal, J.4    Jaeger, S.5
  • 29
    • 34547753523
    • Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment
    • Ferragina P, Giancarlo R, Greco V, Manzini G, Valiente G, (2007) Compression-based classification of biological sequences and structures via the universal similarity metric: experimental assessment. BMC Bioinformatics 8: 252.
    • (2007) BMC Bioinformatics , vol.8 , pp. 252
    • Ferragina, P.1    Giancarlo, R.2    Greco, V.3    Manzini, G.4    Valiente, G.5
  • 30
    • 78650087192
    • A genome alignment algorithm based on compression
    • Cao MD, Dix TI, Allison L, (2010) A genome alignment algorithm based on compression. BMC Bioinformatics 11: 599.
    • (2010) BMC Bioinformatics , vol.11 , pp. 599
    • Cao, M.D.1    Dix, T.I.2    Allison, L.3
  • 31
    • 0025116731
    • Minimum message length encoding and the comparison of macromolecules
    • Allison L, Yee CN, (1990) Minimum message length encoding and the comparison of macromolecules. Bulletin of Mathematical Biology 52: 431-431.
    • (1990) Bulletin of Mathematical Biology , vol.52 , pp. 431
    • Allison, L.1    Yee, C.N.2
  • 32
    • 0000213329
    • A maximum entropy principle for the distribution of local complexity in naturally occurring nucleotide sequences
    • Salamon P, Konopka AK, (1992) A maximum entropy principle for the distribution of local complexity in naturally occurring nucleotide sequences. Computers & Chemistry 16: 117-124.
    • (1992) Computers & Chemistry , vol.16 , pp. 117-124
    • Salamon, P.1    Konopka, A.K.2
  • 33
    • 0027194328
    • Discovering simple DNA sequences by the algorithmic significance method
    • Milosavljević A, Jurka J, (1993) Discovering simple DNA sequences by the algorithmic significance method. Computer Applications in the Biosciences 9: 407-411.
    • (1993) Computer Applications in the Biosciences , vol.9 , pp. 407-411
    • Milosavljević, A.1    Jurka, J.2
  • 40
    • 0003410624
    • London, Macmillan and Co., 3rd (1st 1866, 2nd 1876) edition
    • Venn J, (1888) The logic of chance. London: Macmillan and Co., 3rd (1st 1866, 2nd 1876) edition.
    • (1888) The Logic of Chance
    • Venn, J.1
  • 41
    • 79959690615
    • [Reprinted in Trans of the Faculty of Actuaries, 8 (1920) pp 180-181]
    • Hardy GF, (1889) Letter. Insurance Record [Reprinted in Trans of the Faculty of Actuaries, 8 (1920) pp 180-181].
    • (1889) Letter. Insurance Record
    • Hardy, G.F.1
  • 43
    • 0343846852
    • Probability: the deductive and inductive problems
    • Johnson WE, (1932) Probability: the deductive and inductive problems. Mind XLI: 409-423.
    • (1932) Mind , vol.XLI , pp. 409-423
    • Johnson, W.E.1
  • 44
    • 0001063049
    • W. E. Johnson's "sufficientness" postulate
    • Zabell SL, (1982) W. E. Johnson's "sufficientness" postulate. The Annals of Statistics 10: 1091-1099.
    • (1982) The Annals of Statistics , vol.10 , pp. 1091-1099
    • Zabell, S.L.1
  • 45
    • 0000861078
    • The rule of succession
    • Zabell SL, (1989) The rule of succession. Erkenntnis 31: 283-321.
    • (1989) Erkenntnis , vol.31 , pp. 283-321
    • Zabell, S.L.1
  • 47
    • 0029906607
    • Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology
    • Sjölander K, Karplus K, Brown M, Hughey R, Krogh A, et al. (1996) Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology. Bioinformatics 12: 327-345.
    • (1996) Bioinformatics , vol.12 , pp. 327-345
    • Sjölander, K.1    Karplus, K.2    Brown, M.3    Hughey, R.4    Krogh, A.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.