Volume 5, Issue 1, 2006, Pages

Numerical solutions for patterns statistics on Markov chains

Author keywords

Benchmark; Compound Poisson approximations; Exact; Gaussian approximations; Large deviations

Indexed keywords

ALGORITHM; COMPUTER PROGRAM; MATHEMATICAL COMPUTING; MATHEMATICAL MODEL; METHODOLOGY; NORMAL DISTRIBUTION; POISSON DISTRIBUTION; PROBABILITY; QUANTITATIVE STUDY; REVIEW; STATISTICAL ANALYSIS; ARTICLE; BACILLUS SUBTILIS; BACTERIAL GENOME; BIOLOGY; CHROMOSOME MAP; COMPARATIVE STUDY; COMPUTER SIMULATION; GENETICS; HUMAN IMMUNODEFICIENCY VIRUS 1; REPRODUCIBILITY; STATISTICAL MODEL; VIRUS GENOME

EID: 33750243061     PISSN: 15446115     EISSN: 15446115     Source Type: Journal    
DOI: 10.2202/1544-6115.1219     Document Type: Review
Times cited : (19)

References (50)
  • 1
    • 84972496740
    • Poisson approximation and the Chen-Stein method
    • R. Arratia, L. Goldstein, and L. Gordon. Poisson approximation and the Chen-Stein method. Stat. Sci., 5(4):403-434, 1990.
    • (1990) Stat. Sci. , vol.5 , Issue.4 , pp. 403-434
    • Arratia, R.1    Goldstein, L.2    Gordon, L.3
  • 2
    • 0031603909
    • Calculating the exact probability of language-like patterns in biomolecular sequences
    • AAAI Press, editor
    • K. Atteson. Calculating the exact probability of language-like patterns in biomolecular sequences. In AAAI Press, editor, Sixth International Conference on Intelligent Systems for Molecular Biology, pages 17-24, 1998.
    • (1998) Sixth International Conference on Intelligent Systems for Molecular Biology , pp. 17-24
    • Atteson, K.1
  • 3
    • 0000549988
    • Compound Poisson approximation for nonnegative random variables via Stein method
    • A.D. Barbour, L. H. Y. Chen, and W. L. Loh. Compound Poisson approximation for nonnegative random variables via Stein method. Ann. Probab, 20:1504-1527, 1992.
    • (1992) Ann. Probab , vol.20 , pp. 1504-1527
    • Barbour, A.D.1    Chen, L.H.Y.2    Loh, W.L.3
  • 4
    • 0033858202
    • Patterns of Variant Polyadenylation Signal Usage in Human Genes
    • E. Beaudoing, S. Freier, J. R. Wyatt, J.-M. Claverie, and D. Gautheret. Patterns of Variant Polyadenylation Signal Usage in Human Genes. Genome Res., 10(7):1001-1010, 2000.
    • (2000) Genome Res. , vol.10 , Issue.7 , pp. 1001-1010
    • Beaudoing, E.1    Freier, S.2    Wyatt, J.R.3    Claverie, J.-M.4    Gautheret, D.5
  • 5
    • 0000241874
    • Genmark: Parallel gene recognition for both DNA strands
    • M. Borodovsky and J. D. McIninch. Genmark: parallel gene recognition for both DNA strands. Computers & Chemistry, 17(2):123-133, 1993.
    • (1993) Computers & Chemistry , vol.17 , Issue.2 , pp. 123-133
    • Borodovsky, M.1    McIninch, J.D.2
  • 6
    • 26444585274
    • Modèles de Markov parsimonieux: Sélection de modèle et estimation
    • Montreal
    • P.-Y. Bourguignon and D. Robelin. Modèles de Markov parsimonieux: sélection de modèle et estimation. In Proceedings of JOBIM Congress, Montreal, 2004.
    • (2004) Proceedings of JOBIM Congress
    • Bourguignon, P.-Y.1    Robelin, D.2
  • 7
    • 0032413411
    • Predicting Gene Regulatory Elements in Silico on a Genomic Scale
    • A. Brazma, I. Jonassen, J. Vilo, and E. Ukkonen. Predicting Gene Regulatory Elements in Silico on a Genomic Scale. Genome Res., 8(11):1202-1215, 1998.
    • (1998) Genome Res. , vol.8 , Issue.11 , pp. 1202-1215
    • Brazma, A.1    Jonassen, I.2    Vilo, J.3    Ukkonen, E.4
  • 9
    • 0039530516
    • Variable Length Markov chains
    • P. Bühlmann and A. J. Wyner. Variable Length Markov chains. Annals of Statistics, 27(2):480-513, 1999.
    • (1999) Annals of Statistics , vol.27 , Issue.2 , pp. 480-513
    • Bühlmann, P.1    Wyner, A.J.2
  • 10
    • 0001766090
    • A limit theorem on the number of overlapping appearances of a pattern in a sequence of independent trials
    • O. Chrysaphinou and S. Papastavridis. A limit theorem on the number of overlapping appearances of a pattern in a sequence of independent trials. Probab. Theory Relat. Fields, 79(1):129-143, 1988.
    • (1988) Probab. Theory Relat. Fields , vol.79 , Issue.1 , pp. 129-143
    • Chrysaphinou, O.1    Papastavridis, S.2
  • 11
    • 0001169815
    • Expected frequencies of DNA patterns using Whittle's formula
    • R. Cowan. Expected frequencies of DNA patterns using Whittle's formula. J. Appl. Prob., 28:886-892, 1991.
    • (1991) J. Appl. Prob. , vol.28 , pp. 886-892
    • Cowan, R.1
  • 15
    • 0033374593
    • Characteristics of Chi distribution on different bacterial genomes
    • M. El Karoui, V. Biaudet, S. Schbath, and A. Gruss. Characteristics of Chi distribution on different bacterial genomes. Res. Microbiol., 150:579-587, 1999.
    • (1999) Res. Microbiol. , vol.150 , pp. 579-587
    • El Karoui, M.1    Biaudet, V.2    Schbath, S.3    Gruss, A.4
  • 16
    • 21444442040
    • Distribution theory of runs and patterns associated with a sequence of multi-state trials
    • J. C. Fu. Distribution theory of runs and patterns associated with a sequence of multi-state trials. Statistica Sinica, 6(4):957-974, 1996.
    • (1996) Statistica Sinica , vol.6 , Issue.4 , pp. 957-974
    • Fu, J.C.1
  • 17
    • 84950460234
    • Distribution theory of runs: A Markov chain approach
    • J. C. Fu and M. V. Koutras. Distribution theory of runs: A Markov chain approach. J. Am. Statist. Assoc., 89(427):1050-1058, 1994.
    • (1994) J. Am. Statist. Assoc. , vol.89 , Issue.427 , pp. 1050-1058
    • Fu, J.C.1    Koutras, M.V.2
  • 18
    • 0030740993
    • Avoidance of palindromic words in bacterial and archaeal genomes: A close connection with restriction enzymes
    • [published erratum appears in Nucleic Acids Res. 25(24), 5135-6]
    • M. S. Gelfand and E. V. Koonin. Avoidance of palindromic words in bacterial and archaeal genomes: a close connection with restriction enzymes [published erratum appears in Nucleic Acids Res. 25(24), 5135-6]. Nucl. Acids. Res., 25(12):2430-2439, 1997.
    • (1997) Nucl. Acids. Res. , vol.25 , Issue.12 , pp. 2430-2439
    • Gelfand, M.S.1    Koonin, E.V.2
  • 20
    • 0035999971
    • Distribution patterns of over-represented k-mers in non-coding yeast DNA
    • S. Hampson, D. Kibler, and P. Baldi. Distribution patterns of over-represented k-mers in non-coding yeast DNA. Bioinformatics, 18(4):513-528, 2002.
    • (2002) Bioinformatics , vol.18 , Issue.4 , pp. 513-528
    • Hampson, S.1    Kibler, D.2    Baldi, P.3
  • 22
    • 0026600016
    • Statistical analyses of counts and distributions of restriction sites in DNA sequences
    • S. Karlin, C. Burge, and A. M. Campbell. Statistical analyses of counts and distributions of restriction sites in DNA sequences. Nucl. Acids. Res., 20(6):1363-1370, 1992.
    • (1992) Nucl. Acids. Res. , vol.20 , Issue.6 , pp. 1363-1370
    • Karlin, S.1    Burge, C.2    Campbell, A.M.3
  • 23
    • 0026778661
    • First and second moment of counts of words in random text generated by Markov chains
    • J. Kleffe and M. Borodovsky. First and second moment of counts of words in random text generated by Markov chains. Comp. Applic. Biosci., 8:433-441, 1992.
    • (1992) Comp. Applic. Biosci. , vol.8 , pp. 433-441
    • Kleffe, J.1    Borodovsky, M.2
  • 27
    • 0034808556
    • Fast approximate motif statistics
    • P. Nicodème. Fast approximate motif statistics. J. Comp. Biol., 8(3):235-248, 2001.
    • (2001) J. Comp. Biol. , vol.8 , Issue.3 , pp. 235-248
    • Nicodème, P.1
  • 28
    • 12844258978
    • LD-SPatt: Large Deviations Statistics for Patterns on Markov Chains
    • G. Nuel. LD-SPatt: Large Deviations Statistics for Patterns on Markov Chains. J. Comput. Biol., 11(6):1023-1033, 2004.
    • (2004) J. Comput. Biol. , vol.11 , Issue.6 , pp. 1023-1033
    • Nuel, G.1
  • 29
    • 21444446007
    • S-SPatt: Simple statistics for patterns on Markov chains
    • G. Nuel. S-SPatt: simple statistics for patterns on Markov chains. Bioinformatics, 21(13):3051-3052, 2005.
    • (2005) Bioinformatics , vol.21 , Issue.13 , pp. 3051-3052
    • Nuel, G.1
  • 30
    • 39449108456
    • Cumulative distribution function of a geometric Poisson distribution
    • In press, preprint available at
    • G. Nuel. Cumulative distribution function of a geometric Poisson distribution. J. Stat. Comp. and Sim., 2006a. In press, preprint available at http://stat.genopole.cnrs.fr/~gnuel.
    • (2006) J. Stat. Comp. and Sim.
    • Nuel, G.1
  • 31
    • 33750255281
    • Pattern statistics on Markov chains and sensitivity to parameter estimation
    • In revision, preprint available at
    • G. Nuel. Pattern statistics on Markov chains and sensitivity to parameter estimation. Algo. Mol. Biol., 2006b. In revision, preprint available at http://stat.genopole.cnrs.fr/~gnuel.
    • (2006) Algo. Mol. Biol.
    • Nuel, G.1
  • 32
    • 0024514063
    • Linguistic of nucleotide sequences: The significance of deviation from mean statistical characteristics and prediction of frequencies of occurrence of words
    • P. A. Pevzner, M. Y. Borodovski, and A. A. Mironov. Linguistic of nucleotide sequences: the significance of deviation from mean statistical characteristics and prediction of frequencies of occurrence of words. J. Biomol. Struct. Dyn, 6:1013-1026, 1989.
    • (1989) J. Biomol. Struct. Dyn , vol.6 , pp. 1013-1026
    • Pevzner, P.A.1    Borodovski, M.Y.2    Mironov, A.A.3
  • 34
    • 0344808332
    • Finding words with unexpected frequencies in DNA sequences
    • B. Prum, F. Rodolphe, and E. de Turckheim. Finding words with unexpected frequencies in DNA sequences. J. R. Statist. Soc. B., 11:190-192, 1995.
    • (1995) J. R. Statist. Soc. B , vol.11 , pp. 190-192
    • Prum, B.1    Rodolphe, F.2    De Turckheim, E.3
  • 35
    • 0024610919
    • Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
    • L. R. Rabiner. Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. In Proceedings of the IEEE, volume 77, pages 254-286, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , pp. 254-286
    • Rabiner, L.R.1
  • 36
    • 0031902984
    • Compound Poisson and Poisson process approximations for occurrences of multiple words in Markov chains
    • G. Reinert and S. Schbath. Compound Poisson and Poisson process approximations for occurrences of multiple words in Markov chains. J. Comp. Biol., 5:223-254, 1998.
    • (1998) J. Comp. Biol. , vol.5 , pp. 223-254
    • Reinert, G.1    Schbath, S.2
  • 37
  • 38
    • 0034125366
    • Probabilistic and Statistical Properties of Words: An Overview
    • G. Reinert, S. Schbath, and M. S. Waterman. Probabilistic and Statistical Properties of Words: An Overview. J. Comp. Biol., 7(1/2):1-46, 2000.
    • (2000) J. Comp. Biol. , vol.7 , Issue.1-2 , pp. 1-46
    • Reinert, G.1    Schbath, S.2    Waterman, M.S.3
  • 39
    • 0004105719
    • chapter Probabilistic and Statistical Properties of Finite Words in Finite Sequences. Cambridge University Press
    • G. Reinert, S. Schbath, and M. S. Waterman. Lothaire: Applied Combinatorics on Words, chapter Probabilistic and Statistical Properties of Finite Words in Finite Sequences. Cambridge University Press, 2005.
    • (2005) Lothaire: Applied Combinatorics on Words
    • Reinert, G.1    Schbath, S.2    Waterman, M.S.3
  • 40
    • 0036404856
    • A compound Poisson model for word occurrences in DNA sequences
    • S. Robin. A compound Poisson model for word occurrences in DNA sequences. J. Roy. Stat. Soc. Ser. C, 51:437-451, 2002.
    • (2002) J. Roy. Stat. Soc. Ser. C , vol.51 , pp. 437-451
    • Robin, S.1
  • 42
    • 0033238297
    • Exact distribution of word occurrences in a random sequence of letters
    • S. Robin and J. J. Daudin. Exact distribution of word occurrences in a random sequence of letters. J. App. Prob., 36:179-193, 1999.
    • (1999) J. App. Prob. , vol.36 , pp. 179-193
    • Robin, S.1    Daudin, J.J.2
  • 43
    • 0034786443
    • Numerical comparison of several approximations of the word count distribution in random sequences
    • S. Robin and S. Schbath. Numerical comparison of several approximations of the word count distribution in random sequences. J. Comp. Biol., 8:349-359, 2001.
    • (2001) J. Comp. Biol. , vol.8 , pp. 349-359
    • Robin, S.1    Schbath, S.2
  • 44
    • 0000794292
    • A unified approach to word occurrence probabilities
    • M. Régnier. A unified approach to word occurrence probabilities. Discrete applied mathematics, 104(1):259-280, 2000.
    • (2000) Discrete Applied Mathematics , vol.104 , Issue.1 , pp. 259-280
    • Régnier, M.1
  • 45
    • 0001194726
    • On pattern frequency occurrences in a Markovian sequence
    • M. Régnier and W. Szpankowski. On pattern frequency occurrences in a Markovian sequence. Algorithmica, 22(4):631-649, 1998.
    • (1998) Algorithmica , vol.22 , Issue.4 , pp. 631-649
    • Régnier, M.1    Szpankowski, W.2
  • 46
    • 84996141000
    • Compound Poisson approximation of word counts in DNA sequences
    • S. Schbath. Compound Poisson approximation of word counts in DNA sequences. ESAIM, Probab. Stat., 1:1-16, 1995.
    • (1995) ESAIM, Probab. Stat. , vol.1 , pp. 1-16
    • Schbath, S.1
  • 47
    • 0030839806
    • An efficient statistic to detect over- and under- represented words in DNA sequences
    • S. Schbath. An efficient statistic to detect over- and under- represented words in DNA sequences. J. Comp. Biol., 4:189-192, 1997.
    • (1997) J. Comp. Biol. , vol.4 , pp. 189-192
    • Schbath, S.1
  • 48
    • 0032483307
    • Extracting Regulatory Sites from the Upstream Region of Yeast Genes by Computational Analysis of Oligonucleotide Frequencies
    • J. van Helden, B. André, and J. Collado-Vides. Extracting Regulatory Sites from the Upstream Region of Yeast Genes by Computational Analysis of Oligonucleotide Frequencies. J. Mol. Biol., 281:827-842, 1998.
    • (1998) J. Mol. Biol. , vol.281 , pp. 827-842
    • Van Helden, J.1    André, B.2    Collado-Vides, J.3
  • 49
    • 0034651804
    • Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals
    • J. van Helden, M. li del Olmo, and J. E. Perez-Ortin. Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals. Nucl. Acids. Res., 28(4):1000-1010, 2000.
    • (2000) Nucl. Acids. Res. , vol.28 , Issue.4 , pp. 1000-1010
    • Van Helden, J.1    Li Del Olmo, M.2    Perez-Ortin, J.E.3
  • 50
    • 0001285075
    • Some distribution and moment formulæ for the Markov chain
    • P. Whittle. Some distribution and moment formulæ for the Markov chain. J. R. Statist. Soc. B., 17:235-242, 1955.
    • (1955) J. R. Statist. Soc. B. , vol.17 , pp. 235-242
    • Whittle, P.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.