메뉴 건너뛰기




Volumn 56, Issue 1-2, 2008, Pages 51-92

Multiple pattern matching: A Markov chain approach

Author keywords

Automata; Exceptional words; Functionals of Markov chains; Generating functions; Markov chain embedding technique; Modular pattern; Motifs; Pattern matching; Probability; Random strings; RNA; Transfer matrix methods

Indexed keywords

RNA;

EID: 36448949063     PISSN: 03036812     EISSN: 14321416     Source Type: Journal    
DOI: 10.1007/s00285-007-0109-3     Document Type: Article
Times cited : (32)

References (77)
  • 1
    • 0016518897 scopus 로고
    • Efficient string matching: An aid to bibliographic search
    • 6
    • Aho A.V. and Corasick M.J. (1975). Efficient string matching: an aid to bibliographic search. Commun. ACM 18(6): 333-340
    • (1975) Commun. ACM , vol.18 , pp. 333-340
    • Aho, A.V.1    Corasick, M.J.2
  • 2
    • 33645337896 scopus 로고    scopus 로고
    • Waiting time distributions of competing patterns in higher-order Markovian sequences
    • 4
    • Aston J.A.D. and Martin D.E.K. (2005). Waiting time distributions of competing patterns in higher-order Markovian sequences. J. Appl. Prob. 42(4): 977-988
    • (2005) J. Appl. Prob. , vol.42 , pp. 977-988
    • Aston, J.A.D.1    Martin, D.E.K.2
  • 3
    • 0001616513 scopus 로고
    • Markov renewal processes, counters and repeated sequences in Markov chains
    • Biggins J.D. and Cannings C. (1987). Markov renewal processes, counters and repeated sequences in Markov chains. Adv. Appl. Prob. 19: 521-545
    • (1987) Adv. Appl. Prob. , vol.19 , pp. 521-545
    • Biggins, J.D.1    Cannings, C.2
  • 4
    • 0033555906 scopus 로고    scopus 로고
    • Tandem repeats finder: A program to analyze DNA sequences
    • Benson, G.: Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 573-580 (1999)
    • (1999) Nucleic Acids Res , pp. 573-580
    • Benson, G.1
  • 6
    • 38249001373 scopus 로고
    • The distribution of subword counts is usually normal
    • 4
    • Bender E.A. and Kochman F. (1993). The distribution of subword counts is usually normal. Eur. J. Comb. 14(4): 265-275
    • (1993) Eur. J. Comb. , vol.14 , pp. 265-275
    • Bender, E.A.1    Kochman, F.2
  • 8
    • 0034730161 scopus 로고    scopus 로고
    • Building a dictionary for genomes: Identification of presumptive regulatory sites by statistical analysis
    • 18
    • Bussemaker H.J., Li H. and Siggia E.D. (2000). Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis. Proc. Natl. Acad. Sci. USA 97(18): 10096-10100
    • (2000) Proc. Natl. Acad. Sci. USA , vol.97 , pp. 10096-10100
    • Bussemaker, H.J.1    Li, H.2    Siggia, E.D.3
  • 12
    • 0001371942 scopus 로고
    • Renewal theory for several patterns
    • Breen S., Waterman M.S. and Zhang N. (1985). Renewal theory for several patterns. J. Appl. Prob. 22: 228-234
    • (1985) J. Appl. Prob. , vol.22 , pp. 228-234
    • Breen, S.1    Waterman, M.S.2    Zhang, N.3
  • 13
    • 0000658795 scopus 로고    scopus 로고
    • Dynamical sources in information theory: A general analysis of trie structures
    • 1
    • Clément J., Flajolet P. and Vallée B. (2001). Dynamical sources in information theory: a general analysis of trie structures. Algorithmica 29(1): 307-369
    • (2001) Algorithmica , vol.29 , pp. 307-369
    • Clément, J.1    Flajolet, P.2    Vallée, B.3
  • 16
    • 0019856947 scopus 로고
    • In vitro splicing of the ribosomal RNA precursor of Tetrahymena: Involvement of a guanosine nucleotide in the excision of the intervening sequence
    • 3 Pt 2
    • Cech T.R., Zaug A.J. and Grabowski P.J. (1981). In vitro splicing of the ribosomal RNA precursor of Tetrahymena: involvement of a guanosine nucleotide in the excision of the intervening sequence. Cell 27(3 Pt 2): 487-496
    • (1981) Cell , vol.27 , pp. 487-496
    • Cech, T.R.1    Zaug, A.J.2    Grabowski, P.J.3
  • 19
    • 0028272315 scopus 로고
    • RNA sequence analysis using covariance models
    • 11
    • Eddy S.R. and Durbin R. (1994). RNA sequence analysis using covariance models. Nucleic Acids Res. 22(11): 2079-2088
    • (1994) Nucleic Acids Res. , vol.22 , pp. 2079-2088
    • Eddy, S.R.1    Durbin, R.2
  • 20
    • 0033861030 scopus 로고    scopus 로고
    • Distribution of hammerhead and hammerhead-like RNA motifs through the GenBank
    • 7
    • Ferbeyre G., Bourdeau V., Pageau M., Miramontes P. and Cedergren R. (2000). Distribution of hammerhead and hammerhead-like RNA motifs through the GenBank. Genome Res. 10(7): 1011-1019
    • (2000) Genome Res. , vol.10 , pp. 1011-1019
    • Ferbeyre, G.1    Bourdeau, V.2    Pageau, M.3    Miramontes, P.4    Cedergren, R.5
  • 21
    • 0035998642 scopus 로고    scopus 로고
    • On probability generating functions for waiting time distributions of compound patterns in a sequence of multistate trials
    • 1
    • Fu J.C. and Chang Y.M. (2002). On probability generating functions for waiting time distributions of compound patterns in a sequence of multistate trials. J. Appl. Prob. 39(1): 70-80
    • (2002) J. Appl. Prob. , vol.39 , pp. 70-80
    • Fu, J.C.1    Chang, Y.M.2
  • 22
    • 0242676838 scopus 로고    scopus 로고
    • On ordered series and later waiting time distributions in a sequence of Markov dependent multistate trials
    • 3
    • Fu J.C. and Chang Y.M. (2003). On ordered series and later waiting time distributions in a sequence of Markov dependent multistate trials. J. Appl. Prob. 40(3): 623-642
    • (2003) J. Appl. Prob. , vol.40 , pp. 623-642
    • Fu, J.C.1    Chang, Y.M.2
  • 24
    • 0019797407 scopus 로고
    • Evolutionary trees from DNA sequences: A maximum likelihood approach
    • 6
    • Felsenstein J. (1981). Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17(6): 368-376
    • (1981) J. Mol. Evol. , vol.17 , pp. 368-376
    • Felsenstein, J.1
  • 25
    • 84950460234 scopus 로고
    • Distribution theory of runs: A Markov chain approach
    • 427
    • Fu J.C. and Koutras M.V. (1994). Distribution theory of runs: a Markov chain approach. J. Am. Statist. Assoc. 89(427): 1050-1058
    • (1994) J. Am. Statist. Assoc. , vol.89 , pp. 1050-1058
    • Fu, J.C.1    Koutras, M.V.2
  • 30
    • 0040629669 scopus 로고    scopus 로고
    • On patterns in sequences of random events
    • Gani J. and Irle A. (1999). On patterns in sequences of random events. Mh. Math. 127: 295-309
    • (1999) Mh. Math. , vol.127 , pp. 295-309
    • Gani, J.1    Irle, A.2
  • 33
    • 0013695809 scopus 로고
    • The occurrence of sequence patterns in repeated experiments and hitting times in a Markov chain
    • 1
    • Gerber H.U. and Li S.-Y.R. (1981). The occurrence of sequence patterns in repeated experiments and hitting times in a Markov chain. Stoch. Process. Appl. 11(1): 101-108
    • (1981) Stoch. Process. Appl. , vol.11 , pp. 101-108
    • Gerber, H.U.1    Li, S.-Y.R.2
  • 34
    • 0018006616 scopus 로고
    • Maximal prefix-synchronized codes
    • 2
    • Guibas L.J. and Odlyzko A.M. (1978). Maximal prefix-synchronized codes. SIAM J. Appl. Math. 35(2): 401-418
    • (1978) SIAM J. Appl. Math. , vol.35 , pp. 401-418
    • Guibas, L.J.1    Odlyzko, A.M.2
  • 36
    • 34250201608 scopus 로고
    • String overlaps, pattern matching, and nontransitive games
    • 2
    • Guibas L.J. and Odlyzko A.M. (1981). String overlaps, pattern matching and nontransitive games. J. Comb. Theory Ser. A 30(2): 183-208
    • (1981) J. Comb. Theory Ser. A , vol.30 , pp. 183-208
    • Guibas, L.J.1    Odlyzko, A.M.2
  • 37
    • 0021013526 scopus 로고
    • The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme
    • 3 Pt 2
    • Guerrier-Takada C., Gardiner K., Marsh T., Pace N. and Altman S. (1983). The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme. Cell 35(3 Pt 2): 849-857
    • (1983) Cell , vol.35 , pp. 849-857
    • Guerrier-Takada, C.1    Gardiner, K.2    Marsh, T.3    Pace, N.4    Altman, S.5
  • 38
  • 39
    • 0038587359 scopus 로고    scopus 로고
    • Sooner and later waiting time problems for patterns in Markov dependent trials
    • 1
    • Han Q. and Hirano K. (2003). Sooner and later waiting time problems for patterns in Markov dependent trials. J. Appl. Prob. 40(1): 73-86
    • (2003) J. Appl. Prob. , vol.40 , pp. 73-86
    • Han, Q.1    Hirano, K.2
  • 41
    • 33645087459 scopus 로고    scopus 로고
    • On the Markov chain central limit theorem
    • Jones G.L. (2004). On the Markov chain central limit theorem. Probab. Surv. 1: 299-320
    • (2004) Probab. Surv. , vol.1 , pp. 299-320
    • Jones, G.L.1
  • 42
    • 27144468672 scopus 로고    scopus 로고
    • Abundance of correctly folded RNA motifs in sequence space, calculated on computational grids
    • 18
    • Knight R., De Sterck H., Markel R., Smit S., Oshmyansky A. and Yarus M. (2005). Abundance of correctly folded RNA motifs in sequence space, calculated on computational grids. Nucleic Acids Res. 33(18): 5924-5935
    • (2005) Nucleic Acids Res. , vol.33 , pp. 5924-5935
    • Knight, R.1    De Sterck, H.2    Markel, R.3    Smit, S.4    Oshmyansky, A.5    Yarus, M.6
  • 43
    • 2942544566 scopus 로고    scopus 로고
    • RSEARCH: Finding homologs of single structured RNA sequences
    • Klein R.J. and Eddy S.R. (2003). RSEARCH: finding homologs of single structured RNA sequences. BMC Bioinform. 4: 44
    • (2003) BMC Bioinform. , vol.4 , pp. 44
    • Klein, R.J.1    Eddy, S.R.2
  • 44
    • 0001464973 scopus 로고
    • Estimation of evolutionary distances between homologous nucleotide sequences
    • 1
    • Kimura M. (1981). Estimation of evolutionary distances between homologous nucleotide sequences. Proc. Natl. Acad. Sci. USA 78(1): 454-458
    • (1981) Proc. Natl. Acad. Sci. USA , vol.78 , pp. 454-458
    • Kimura, M.1
  • 47
    • 0037319334 scopus 로고    scopus 로고
    • Finding specific RNA motifs: Function in a zeptomole world?
    • 2
    • Knight R. and Yarus M. (2003). Finding specific RNA motifs: function in a zeptomole world?. RNA 9(2): 218-230
    • (2003) RNA , vol.9 , pp. 218-230
    • Knight, R.1    Yarus, M.2
  • 48
    • 11844278458 scopus 로고    scopus 로고
    • Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets
    • Jan Letter
    • Lewis, B.P., Burge, C.B., Bartel, D.P.: Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120(1), 15-20, Jan 2005. Letter
    • (2005) Cell , vol.120 , Issue.1 , pp. 15-20
    • Lewis, B.P.1    Burge, C.B.2    Bartel, D.P.3
  • 49
    • 0001530766 scopus 로고
    • A martingale approach to the study of occurrence of sequence patterns in repeated experiments
    • 6
    • Li S.-Y.R. (1980). A martingale approach to the study of occurrence of sequence patterns in repeated experiments. Ann. Probab. 8(6): 1171-1176
    • (1980) Ann. Probab. , vol.8 , pp. 1171-1176
    • Li, S.-Y.R.1
  • 52
    • 24644490196 scopus 로고    scopus 로고
    • Elucidation of the small RNA component of the transcriptome
    • 5740
    • Lu C., Tej S.S., Luo S., Haudenschild C.D., Meyers B.C. and Green P.J. (2005). Elucidation of the small RNA component of the transcriptome. Science 309(5740): 1567-1569
    • (2005) Science , vol.309 , pp. 1567-1569
    • Lu, C.1    Tej, S.S.2    Luo, S.3    Haudenschild, C.D.4    Meyers, B.C.5    Green, P.J.6
  • 53
    • 28944435949 scopus 로고    scopus 로고
    • Distribution of the number of successes in success runs of length at least k in higher-order Markovian sequences
    • 4
    • Martin D. (2005). Distribution of the number of successes in success runs of length at least k in higher-order Markovian sequences. Methodol. Comput. Appl. Probab. 7(4): 543-554
    • (2005) Methodol. Comput. Appl. Probab. , vol.7 , pp. 543-554
    • Martin, D.1
  • 54
    • 17544391424 scopus 로고    scopus 로고
    • Regexpcount, a symbolic package for counting problems on regular expressions and words
    • 1-2
    • Nicodème P. (2003). Regexpcount, a symbolic package for counting problems on regular expressions and words. Fundamenta Informaticae 56(1-2): 71-88
    • (2003) Fundamenta Informaticae , vol.56 , pp. 71-88
    • Nicodème, P.1
  • 56
    • 33644675086 scopus 로고    scopus 로고
    • Waiting times for patterns and a method of gambling teams
    • 2
    • Pozdnyakov V.I. and Kulldorff M. (2006). Waiting times for patterns and a method of gambling teams. Am. Math. Month. 113(2): 134-143
    • (2006) Am. Math. Month. , vol.113 , pp. 134-143
    • Pozdnyakov, V.I.1    Kulldorff, M.2
  • 57
    • 10044285953 scopus 로고    scopus 로고
    • Searching for multiple words in a Markov sequence
    • 4
    • Park Y. and Spouge J.L. (2004). Searching for multiple words in a Markov sequence. INFORMS J. Comput. 16(4): 341-347
    • (2004) INFORMS J. Comput. , vol.16 , pp. 341-347
    • Park, Y.1    Spouge, J.L.2
  • 58
    • 0000180851 scopus 로고    scopus 로고
    • Exact distribution of the distances between any occurrences of a set of words
    • 4
    • Robin S.S. and Daudin J.J. (2001). Exact distribution of the distances between any occurrences of a set of words. Ann. Inst. Statist. Math. 53(4): 895-905
    • (2001) Ann. Inst. Statist. Math. , vol.53 , pp. 895-905
    • Robin, S.S.1    Daudin, J.J.2
  • 59
    • 28444456845 scopus 로고    scopus 로고
    • Rare events and conditional events on random strings
    • 2
    • Régnier M. and Denise A. (2004). Rare events and conditional events on random strings. DMTCS 6(2): 191-214
    • (2004) DMTCS , vol.6 , pp. 191-214
    • Régnier, M.1    Denise, A.2
  • 60
    • 0034046133 scopus 로고    scopus 로고
    • The language of RNA: A formal grammar that includes pseudoknots
    • 4
    • Rivas E. and Eddy S.R. (2000). The language of RNA: a formal grammar that includes pseudoknots. Bioinformatics 16(4): 334-340
    • (2000) Bioinformatics , vol.16 , pp. 334-340
    • Rivas, E.1    Eddy, S.R.2
  • 61
    • 0000794292 scopus 로고    scopus 로고
    • A unified approach to word occurrences probabilities
    • 1. Special issue on Computational Biology
    • Régnier M. (2000). A unified approach to word occurrences probabilities. Discrete Appl. Math. 104(1): 259-280
    • (2000) Discrete Appl. Math. , vol.104 , pp. 259-280
    • Régnier, M.1
  • 64
    • 0001194726 scopus 로고    scopus 로고
    • On pattern frequency occurrences in a Markovian sequence
    • 4
    • Régnier M. and Szpankowski W. (1998). On pattern frequency occurrences in a Markovian sequence. Algorithmica 22(4): 631-649
    • (1998) Algorithmica , vol.22 , pp. 631-649
    • Régnier, M.1    Szpankowski, W.2
  • 65
    • 0035498871 scopus 로고    scopus 로고
    • In vitro evolution suggests multiple origins for the hammerhead ribozyme
    • 6859
    • Salehi-Ashtiani K. and Szostak J.W. (2001). In vitro evolution suggests multiple origins for the hammerhead ribozyme. Nature 414(6859): 82-84
    • (2001) Nature , vol.414 , pp. 82-84
    • Salehi-Ashtiani, K.1    Szostak, J.W.2
  • 68
    • 33747077160 scopus 로고    scopus 로고
    • Building biological complexity with limited genes
    • Singh R., Robida M.D. and Karimpour S. (2006). Building biological complexity with limited genes. Curr. Genom. 7: 97-114
    • (2006) Curr. Genom. , vol.7 , pp. 97-114
    • Singh, R.1    Robida, M.D.2    Karimpour, S.3
  • 69
    • 0031259915 scopus 로고    scopus 로고
    • Accessing rare activities from random RNA sequences: The importance of the length of molecules in the starting pool
    • 10
    • Sabeti P.C., Unrau P.J. and Bartel D.P. (1997). Accessing rare activities from random RNA sequences: the importance of the length of molecules in the starting pool. Chem. Biol. 4(10): 767-774
    • (1997) Chem. Biol. , vol.4 , pp. 767-774
    • Sabeti, P.C.1    Unrau, P.J.2    Bartel, D.P.3
  • 70
    • 0034705119 scopus 로고    scopus 로고
    • Structural diversity of self-cleaving ribozymes
    • 11
    • Tang J. and Breaker R.R. (2000). Structural diversity of self-cleaving ribozymes. Proc. Natl. Acad. Sci. USA 97(11): 5784-5789
    • (2000) Proc. Natl. Acad. Sci. USA , vol.97 , pp. 5784-5789
    • Tang, J.1    Breaker, R.R.2
  • 71
    • 0242269901 scopus 로고    scopus 로고
    • Dynamical sources in information theory: Fundamental intervals and word prefixes
    • 1
    • Vallée B. (2001). Dynamical sources in information theory: fundamental intervals and word prefixes. Algorithmica 29(1): 262-306
    • (2001) Algorithmica , vol.29 , pp. 262-306
    • Vallée, B.1
  • 74
    • 0030970582 scopus 로고    scopus 로고
    • 23S rRNA similarity from selection for peptidyl transferase mimicry
    • 22
    • Welch M., Majerfeld I. and Yarus M. (1997). 23S rRNA similarity from selection for peptidyl transferase mimicry. Biochemistry 36(22): 6614-6623
    • (1997) Biochemistry , vol.36 , pp. 6614-6623
    • Welch, M.1    Majerfeld, I.2    Yarus, M.3
  • 75
    • 0037206833 scopus 로고    scopus 로고
    • Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression
    • 6910
    • Winkler W., Nahvi A. and Breaker R.R. (2002). Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression. Nature 419(6910): 952-956
    • (2002) Nature , vol.419 , pp. 952-956
    • Winkler, W.1    Nahvi, A.2    Breaker, R.R.3
  • 76
    • 22244445209 scopus 로고    scopus 로고
    • Origins of the genetic code: The escaped triplet theory
    • Yarus M., Caporaso J.G. and Knight R. (2005). Origins of the genetic code: the escaped triplet theory. Annu. Rev. Biochem. 74: 179-198
    • (2005) Annu. Rev. Biochem. , vol.74 , pp. 179-198
    • Yarus, M.1    Caporaso, J.G.2    Knight, R.3
  • 77
    • 0033781246 scopus 로고    scopus 로고
    • Peptidyl transferase: Ancient and exiguous
    • 10
    • Yarus M. and Welch M. (2000). Peptidyl transferase: ancient and exiguous. Chem. Biol. 7(10): 187-190
    • (2000) Chem. Biol. , vol.7 , pp. 187-190
    • Yarus, M.1    Welch, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.