메뉴 건너뛰기




Volumn 22, Issue 1, 2006, Pages 21-28

A generic motif discovery algorithm for sequential data

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHM; AMINO ACID SEQUENCE; ARTICLE; BIOINFORMATICS; CLUSTER ANALYSIS; CONTROLLED STUDY; DNA SEQUENCE; INFORMATION PROCESSING; NOMENCLATURE; PRIORITY JOURNAL; PROTEIN EXPRESSION; PROTEIN MOTIF; PROTEIN STRUCTURE; SEQUENCE ANALYSIS; SEQUENCE HOMOLOGY; STRUCTURE ANALYSIS;

EID: 30344460682     PISSN: 13674803     EISSN: 13674811     Source Type: Journal    
DOI: 10.1093/bioinformatics/bti745     Document Type: Article
Times cited : (63)

References (49)
  • 1
    • 0029796428 scopus 로고    scopus 로고
    • SARFing the PDB
    • Alexandrov,N.N. (1996) SARFing the PDB. Protein Eng., 9, 727-732.
    • (1996) Protein Eng. , vol.9 , pp. 727-732
    • Alexandrov, N.N.1
  • 2
    • 0030017889 scopus 로고    scopus 로고
    • Analysis of topological and nontopological structural similarities in the PDB: New examples with old structures
    • Alexandrov,N.N. and Fischer,D (1996) Analysis of topological and nontopological structural similarities in the PDB: New examples with old structures. Proteins, 25, 354-365.
    • (1996) Proteins , vol.25 , pp. 354-365
    • Alexandrov, N.N.1    Fischer, D.2
  • 3
    • 0030801002 scopus 로고    scopus 로고
    • Gapped BLAST and PSI-BLAST: A new generation of protein database search programs
    • Altschul,S.F. et al. (1997) Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res., 25, 3389-3402.
    • (1997) Nucleic Acids Res. , vol.25 , pp. 3389-3402
    • Altschul, S.F.1
  • 4
    • 0032408864 scopus 로고    scopus 로고
    • The HD domain defines a new superfamily of metal-dependent phosphohydrolases
    • Aravind,L and Koonin,E.V. (1998) The HD domain defines a new superfamily of metal-dependent phosphohydrolases. Trends Biochem Sci., 23, 469-472.
    • (1998) Trends Biochem Sci. , vol.23 , pp. 469-472
    • Aravind, L.1    Koonin, E.V.2
  • 5
    • 84890517975 scopus 로고
    • Least-squares fitting of two 3-d point sets
    • Arun,K. S. et al. (1987) Least-squares fitting of two 3-d point sets. IEEE Trans. Pattern Anal. Mach. Intell., 9, 698-700.
    • (1987) IEEE Trans. Pattern Anal. Mach. Intell. , vol.9 , pp. 698-700
    • Arun, K.S.1
  • 6
    • 0028685490 scopus 로고
    • Fitting a mixture model by expectation maximization to discover motifs in biopolymers
    • Bailey,T.L. and Elkan,C. (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol., 2, 28-36.
    • (1994) Proc. Int. Conf. Intell. Syst. Mol. Biol. , vol.2 , pp. 28-36
    • Bailey, T.L.1    Elkan, C.2
  • 7
    • 0033963089 scopus 로고    scopus 로고
    • The ENZYME database in 2000
    • Bairoch,A (2000) The ENZYME database in 2000. Nucleic Acids Res., 28, 304-305.
    • (2000) Nucleic Acids Res. , vol.28 , pp. 304-305
    • Bairoch, A.1
  • 8
    • 0033957834 scopus 로고    scopus 로고
    • The Swiss-Prot protein sequence database and its supplement TrEMBL in 2000
    • Bairoch,A. and Apweiler,R. (2000) The Swiss-Prot protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res., 28, 45-48.
    • (2000) Nucleic Acids Res. , vol.28 , pp. 45-48
    • Bairoch, A.1    Apweiler, R.2
  • 9
    • 9144257886 scopus 로고    scopus 로고
    • The Pfam protein families database
    • (Database issue)
    • Bateman,A et al. (2004) The Pfam protein families database. Nucleic Acids Res., 32 (Database issue), D138-D141.
    • (2004) Nucleic Acids Res. , vol.32
    • Bateman, A.1
  • 11
    • 2142738304 scopus 로고    scopus 로고
    • WebLogo: A sequence logo generator
    • Crooks,G.E. et al. (2004) WebLogo: A sequence logo generator. Genome Res., 14, 1188-1190.
    • (2004) Genome Res. , vol.14 , pp. 1188-1190
    • Crooks, G.E.1
  • 12
    • 0034753838 scopus 로고    scopus 로고
    • Identification of homology in protein structure classification
    • Dietmann,S and Holm,L. (2001) Identification of homology in protein structure classification. Nat. Struct. Biol., 8, 953-957.
    • (2001) Nat. Struct. Biol. , vol.8 , pp. 953-957
    • Dietmann, S.1    Holm, L.2
  • 13
    • 0031743421 scopus 로고    scopus 로고
    • Profile hidden Markov models
    • Eddy,S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755-763.
    • (1998) Bioinformatics , vol.14 , pp. 755-763
    • Eddy, S.R.1
  • 14
    • 0034531045 scopus 로고    scopus 로고
    • Structure comparison and structure patterns
    • Eidhammer,I. et al. (2000) Structure comparison and structure patterns. J. Comput. Biol., 7, 685-716.
    • (2000) J. Comput. Biol. , vol.7 , pp. 685-716
    • Eidhammer, I.1
  • 15
    • 0344376570 scopus 로고    scopus 로고
    • Finding composite regulatory patterns in DNA sequences
    • Eskin,E. and Pevzner,P.A. (2002) Finding composite regulatory patterns in DNA sequences. Bioinformatics, 18 ((Suppl. 1)), 354-363.
    • (2002) Bioinformatics , vol.18 , Issue.SUPPL. 1 , pp. 354-363
    • Eskin, E.1    Pevzner, P.A.2
  • 17
    • 0026458378 scopus 로고
    • Amino acid substitution matrices from protein blocks
    • Henikoff,s. and Henikoff,J. G. (1992) Amino acid substitution matrices from protein blocks. Proc. Natl Acad. Sci. USA, 89, 10915-10919.
    • (1992) Proc. Natl Acad. Sci. USA , vol.89 , pp. 10915-10919
    • Henikoff, S.1    Henikoff, J.G.2
  • 18
    • 0028825276 scopus 로고
    • Automated construction and graphical presentation of protein blocks from unaligned sequences
    • Henikoff,S. et al. (1995) Automated construction and graphical presentation of protein blocks from unaligned sequences. Gene, 163, GC17-GC26.
    • (1995) Gene , vol.163
    • Henikoff, S.1
  • 19
    • 0032826179 scopus 로고    scopus 로고
    • Identifying DNA and protein patterns with statistically significant alignments of multiple sequences
    • Hertz,G.Z. and Stormo,G.D. (1999) Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics,15, 563-577.
    • (1999) Bioinformatics , vol.15 , pp. 563-577
    • Hertz, G.Z.1    Stormo, G.D.2
  • 20
    • 0032944238 scopus 로고    scopus 로고
    • The PROSITE database, its status in 1999
    • Hofmann,K. et al. (1999) The PROSITE database, its status in 1999. Nucleic Acids Res., 27, 215-219.
    • (1999) Nucleic Acids Res. , vol.27 , pp. 215-219
    • Hofmann, K.1
  • 21
    • 0027440362 scopus 로고
    • Protein structure comparison by alignment of distance matrices
    • Holm,L. and Sander,C. (1993) Protein structure comparison by alignment of distance matrices. J. Mol. Biol., 233, 123-138.
    • (1993) J. Mol. Biol. , vol.233 , pp. 123-138
    • Holm, L.1    Sander, C.2
  • 23
    • 0027057526 scopus 로고
    • A database of protein structure families with common folding motifs
    • Holm,L. et al. (1992) A database of protein structure families with common folding motifs. Protein Sci., 1, 1691-1698.
    • (1992) Protein Sci. , vol.1 , pp. 1691-1698
    • Holm, L.1
  • 24
    • 84975553202 scopus 로고
    • Closed-form solution of absolute orientation using unit quaternions
    • Horn,B.K.P. (1987) Closed-form solution of absolute orientation using unit quaternions. J. Optical Soc. America A, 4, 629-642.
    • (1987) J. Optical Soc. America A , vol.4 , pp. 629-642
    • Horn, B.K.P.1
  • 25
    • 0037339462 scopus 로고    scopus 로고
    • Protein fragment clustering and canonical local shapes
    • Hunter,C.G. and Subramaniam,S. (2003) Protein fragment clustering and canonical local shapes. Proteins, 50, 580-588.
    • (2003) Proteins , vol.50 , pp. 580-588
    • Hunter, C.G.1    Subramaniam, S.2
  • 26
    • 0029159799 scopus 로고
    • Finding flexible patterns in unaligned protein sequences
    • Jonassen,I et al. (1995) Finding flexible patterns in unaligned protein sequences. Protein Sci, 4, 1587-1595.
    • (1995) Protein Sci , vol.4 , pp. 1587-1595
    • Jonassen, I.1
  • 27
    • 0036184678 scopus 로고    scopus 로고
    • Structure motif discovery and mining the PDB
    • Jonassen,I. et al. (2002) Structure motif discovery and mining the PDB: Bioinformatics, 18, 362-367.
    • (2002) Bioinformatics , vol.18 , pp. 362-367
    • Jonassen, I.1
  • 28
    • 0036772332 scopus 로고    scopus 로고
    • Finding motifs in the twilight zone
    • Keich,U. and Pevzner,P.A. (2002) Finding motifs in the twilight zone. Bioinformatics, 18, 1374-1381.
    • (2002) Bioinformatics , vol.18 , pp. 1374-1381
    • Keich, U.1    Pevzner, P.A.2
  • 29
    • 13444279981 scopus 로고    scopus 로고
    • Comprehensive evaluation of protein structure alignment methods: Scoring by geometric measures
    • Kolodny,R. et al. (2005) Comprehensive evaluation of protein structure alignment methods: Scoring by geometric measures. J. Mol. Biol., 346, 1173-88.
    • (2005) J. Mol. Biol. , vol.346 , pp. 1173-1188
    • Kolodny, R.1
  • 30
    • 0027912333 scopus 로고
    • Detecting subtle sequence signals: A Gibbs sampling strategy for multiple alignment
    • Lawrence,C.E. et al. (1993) Detecting subtle sequence signals: A Gibbs sampling strategy for multiple alignment. Science, 262, 208-214.
    • (1993) Science , vol.262 , pp. 208-214
    • Lawrence, C.E.1
  • 31
    • 0031570668 scopus 로고    scopus 로고
    • MAD analysis of FHIT, a putative human tumor suppressor from the HIT protein family
    • Lima,C.C. et al. (1997) MAD analysis of FHIT, a putative human tumor suppressor from the HIT protein family. Structure, 5, 763-774.
    • (1997) Structure , vol.5 , pp. 763-774
    • Lima, C.C.1
  • 32
    • 0028838717 scopus 로고
    • Threading a database of protein cores
    • Madej,T et al. (1995) Threading a database of protein cores. Proteins, 23, 356-369.
    • (1995) Proteins , vol.23 , pp. 356-369
    • Madej, T.1
  • 33
    • 30344469630 scopus 로고    scopus 로고
    • Pattern discovery allowing wild-cards, substitution matrices, and multiple score functions
    • Algorithms in Bioinformatics: Third International Workshop, WABI 2003, Budapest, Hungary, Springer-Verlag, Berlin
    • Mancheron,A and Rusu,I. (2003) Pattern discovery allowing wild-cards, substitution matrices, and multiple score functions. In Algorithms in Bioinformatics, Proceedings of the Lecture Notes in Bioinformatics. Algorithms in Bioinformatics: Third International Workshop, WABI 2003, Budapest, Hungary, Springer-Verlag, Berlin, pp. 124-138.
    • (2003) Algorithms in Bioinformatics, Proceedings of the Lecture Notes in Bioinformatics , pp. 124-138
    • Mancheron, A.1    Rusu, I.2
  • 34
    • 0037252683 scopus 로고    scopus 로고
    • CDD: A curated Entrez database of conserved domain alignments
    • Marchler-Bauer,A. et al. (2003) CDD: A curated Entrez database of conserved domain alignments. Nucleic Acids Res., 31, 383-387.
    • (2003) Nucleic Acids Res. , vol.31 , pp. 383-387
    • Marchler-Bauer, A.1
  • 35
    • 0037249551 scopus 로고    scopus 로고
    • RNABase: An annotated database of RNA structures
    • Murthy,V.L. and Rose,G.D. (2003) RNABase: An annotated database of RNA structures. Nucleic Acids Res., 31, 502-504.
    • (2003) Nucleic Acids Res. , vol.31 , pp. 502-504
    • Murthy, V.L.1    Rose, G.D.2
  • 36
    • 0029948001 scopus 로고    scopus 로고
    • SSAP: Sequential structure alignment program for protein structure comparison
    • Orengo,C.A. and Taylor,W.R. (1996) SSAP: Sequential structure alignment program for protein structure comparison. Methods Enzymol, 266, 617-635.
    • (1996) Methods Enzymol , vol.266 , pp. 617-635
    • Orengo, C.A.1    Taylor, W.R.2
  • 37
    • 0036839162 scopus 로고    scopus 로고
    • MAMMOTH (matching molecular models obtained from theory): An automated method for model comparison
    • Ortiz,A.R. et al. (2002) MAMMOTH (matching molecular models obtained from theory): An automated method for model comparison. Protein Sci., 11, 2606-2621.
    • (2002) Protein Sci. , vol.11 , pp. 2606-2621
    • Ortiz, A.R.1
  • 39
    • 30344460947 scopus 로고    scopus 로고
    • private communication
    • Pevzner,P. and Sze,S.H. (2001) private communication.
    • (2001)
    • Pevzner, P.1    Sze, S.H.2
  • 40
    • 2342476143 scopus 로고    scopus 로고
    • Finding subtle motifs by branching from sample strings
    • Price,A et al. (2003) Finding subtle motifs by branching from sample strings. Bioinformatics, 19 (Suppl. 2), II149-II155.
    • (2003) Bioinformatics , vol.19 , Issue.SUPPL. 2
    • Price, A.1
  • 41
    • 0032719352 scopus 로고    scopus 로고
    • Dictionary building via unsupervised hierarchical motif discovery in the sequence space of natural proteins
    • Rigoutsos,I. et al. (1999) Dictionary building via unsupervised hierarchical motif discovery in the sequence space of natural proteins. Proteins, 37, 264-77.
    • (1999) Proteins , vol.37 , pp. 264-277
    • Rigoutsos, I.1
  • 42
    • 0031684427 scopus 로고    scopus 로고
    • Combinatorial pattern discovery in biological-sequences: The TEIRESIAS algorithm
    • Rigoutsos,I. and Floratos,A. (1998) Combinatorial pattern discovery in biological-sequences: The TEIRESIAS algorithm. Bioinformatics, 14, 55-67.
    • (1998) Bioinformatics , vol.14 , pp. 55-67
    • Rigoutsos, I.1    Floratos, A.2
  • 43
    • 9144270496 scopus 로고    scopus 로고
    • RegulonDB (version 4.0): Transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12
    • (Database issue)
    • Salgado,H. et al. (2004) RegulonDB (version 4.0): Transcriptional regulation, operon organization and growth conditions in Escherichia coli K-12. Nucleic Acids Res., 32 (Database issue), D303-D306.
    • (2004) Nucleic Acids Res. , vol.32
    • Salgado, H.1
  • 44
    • 30344464649 scopus 로고    scopus 로고
    • An extension and novel solution to the motif challenge problem
    • Styczynski,M. et al. (2004) An extension and novel solution to the motif challenge problem. Genome Informatics, 15, 63-71.
    • (2004) Genome Informatics , vol.15 , pp. 63-71
    • Styczynski, M.1
  • 45
    • 30344482497 scopus 로고
    • An Optimal Algorithm for finding all the cliques
    • Tomita,E. et al. (1989) An Optimal Algorithm for finding all the cliques. SIG Algorithms, 12, 91-98.
    • (1989) SIG Algorithms , vol.12 , pp. 91-98
    • Tomita, E.1
  • 46
    • 21144439147 scopus 로고    scopus 로고
    • Assessing computational tools for the discovery of transcription factor binding sites
    • Tompa,M. et al. (2005) Assessing computational tools for the discovery of transcription factor binding sites. Nat. Biotechnol., 23, 137-144.
    • (2005) Nat. Biotechnol. , vol.23 , pp. 137-144
    • Tompa, M.1
  • 47
    • 0029810819 scopus 로고    scopus 로고
    • The structure of nucleotidylated histidine-166 of galactose-1-phosphate uridylyltransferase provides insight into phosphoryl group transfer
    • Wedekind,J.E. et al. (1996) The structure of nucleotidylated histidine-166 of galactose-1-phosphate uridylyltransferase provides insight into phosphoryl group transfer. Biochemistry, 35, 11560-11569.
    • (1996) Biochemistry , vol.35 , pp. 11560-11569
    • Wedekind, J.E.1
  • 49
    • 0033718951 scopus 로고    scopus 로고
    • Scalable algorithms for association mining
    • Zaki,M.J. (2000) Scalable algorithms for association mining. Knowledge Data Eng., 12, 372-390.
    • (2000) Knowledge Data Eng. , vol.12 , pp. 372-390
    • Zaki, N.M.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.