Volume 10, Issue 5, 2009, Pages 498-508

Towards accurate human promoter recognition: A review of currently used sequence features and classification methods

Author keywords

Feature extraction; Genome annotation; Human promoter recognition; Model selection

Indexed keywords

COMPLEMENTARY DNA; MESSENGER RNA;

EID: 69249146630     PISSN: 14675463     EISSN: 14774054     Source Type: Journal    
DOI: 10.1093/bib/bbp027     Document Type: Review
Times cited : (47)

References (56)
  • 2
    • 39049168212
    • Generic eukaryotic core promoter prediction using structural features of DNA
    • Abeel T, Saeys Y, Bonnet E, et al. Generic eukaryotic core promoter prediction using structural features of DNA. Genome Res 2008;18:310-23.
    • (2008) Genome Res , vol.18 , pp. 310-323
    • Abeel, T.1    Saeys, Y.2    Bonnet, E.3
  • 3
    • 38449108806
    • Computational analyses of eukaryotic promoters
    • Zhang MQ. Computational analyses of eukaryotic promoters. BMC Bioinform 2007;8(Suppl. 6):S3.
    • (2007) BMC Bioinform , vol.8 , Issue.SUPPL. 6
    • Zhang, M.Q.1
  • 4
    • 33748659202
    • Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment
    • Bajic VB, Brent MR, Brown RH, et al. Performance assessment of promoter predictions on ENCODE regions in the EGASP experiment. Genome Biol 2006; 7(Suppl. 1:S3):1-13.
  • 5
    • 8344288229
    • Promoter prediction analysis on the whole human genome
    • Bajic VB, Tan SL, Suzuki Y, et al. Promoter prediction analysis on the whole human genome. Nat Biotechnol 2004;22:1467-73.
    • (2004) Nat Biotechnol , vol.22 , pp. 1467-1473
    • Bajic, V.B.1    Tan, S.L.2    Suzuki, Y.3
  • 6
    • 0037869270
    • The state of the art of mammalian promoter recognition
    • Werner T. The state of the art of mammalian promoter recognition. Brief Bioinform 2003;4:22-30.
    • (2003) Brief Bioinform , vol.4 , pp. 22-30
    • Werner, T.1
  • 7
    • 0035252126
    • Identification and analysis of eukaryotic promoters: Recent computational approaches
    • Ohler U, Niemann H. Identification and analysis of eukaryotic promoters: recent computational approaches. Trends Genet 2001;17:56-60.
    • (2001) Trends Genet , vol.17 , pp. 56-60
    • Ohler, U.1    Niemann, H.2
  • 8
    • 0033563512
    • The biology of eukaryotic promoter prediction - a review
    • Pedersen AG, Baldi P, Chauvin Y, et al. The biology of eukaryotic promoter prediction - a review. Comput Chem 1999;23:191-207.
    • (1999) Comput Chem , vol.23 , pp. 191-207
    • Pedersen, A.G.1    Baldi, P.2    Chauvin, Y.3
  • 9
    • 0043269205
    • The RNA polymerase II core promoter
    • Smale ST, Kadonaga JT. The RNA polymerase II core promoter. Annu Rev Biochem 2003;72:449-79.
    • (2003) Annu Rev Biochem , vol.72 , pp. 449-479
    • Smale, S.T.1    Kadonaga, J.T.2
  • 10
    • 17444371719
    • Synergy of human Pol II core promoter elements revealed by statistical sequence analysis
    • Gershenzon NI, Ioshikhes IP. Synergy of human Pol II core promoter elements revealed by statistical sequence analysis. Bioinformatics 2005;21:1295-300.
    • (2005) Bioinformatics , vol.21 , pp. 1295-1300
    • Gershenzon, N.I.1    Ioshikhes, I.P.2
  • 11
    • 0037133565
    • Comprehensive analysis of CpG islands in human chromosomes 21 and 22
    • Takai D, Jones PA. Comprehensive analysis of CpG islands in human chromosomes 21 and 22. Proc Natl Acad Sci USA 2002;99:3740-5.
    • (2002) Proc Natl Acad Sci USA , vol.99 , pp. 3740-3745
    • Takai, D.1    Jones, P.A.2
  • 12
    • 0034737325
    • Highly specific localization of promoter regions in large genomic sequences by PromoterInspector: A novel context analysis approach
    • Scherf M, Klingenhoff A, Werner T. Highly specific localization of promoter regions in large genomic sequences by PromoterInspector: A novel context analysis approach. J Mol Biol 2000;297:599-606.
    • (2000) J Mol Biol , vol.297 , pp. 599-606
    • Scherf, M.1    Klingenhoff, A.2    Werner, T.3
  • 13
    • 35548987996
    • Sequence-dependent DNA deformability studied using molecular dynamics simulations
    • Fujii S, Kono H, Takenaka S, et al. Sequence-dependent DNA deformability studied using molecular dynamics simulations. Nucleic Acids Res 2007;35:6063-74.
    • (2007) Nucleic Acids Res , vol.35 , pp. 6063-6074
    • Fujii, S.1    Kono, H.2    Takenaka, S.3
  • 14
    • 34250896139
    • Position and distance specificity are important determinants of cis-regulatory motifs in addition to evolutionary conservation
    • Vardhanabhuti S, Wang J, Hannenhalli S. Position and distance specificity are important determinants of cis-regulatory motifs in addition to evolutionary conservation. Nucleic Acids Res 2007; 35:3203-13.
    • (2007) Nucleic Acids Res , vol.35 , pp. 3203-3213
    • Vardhanabhuti, S.1    Wang, J.2    Hannenhalli, S.3
  • 15
    • 43349098748
    • The biological function of some human transcription factor binding motifs varies with position relative to the transcription start site
    • Tharakaraman K, Bodenreider O, Landsman D, et al. The biological function of some human transcription factor binding motifs varies with position relative to the transcription start site. Nucleic Acids Res 2008;36:2777-86.
    • (2008) Nucleic Acids Res , vol.36 , pp. 2777-2786
    • Tharakaraman, K.1    Bodenreider, O.2    Landsman, D.3
  • 16
    • 22444448187
    • A highly distinctive mechanical property found in the majority of human promoters and its transcriptional relevance
    • Fukue Y, Sumida N, Tanase J, et al. A highly distinctive mechanical property found in the majority of human promoters and its transcriptional relevance. Nucleic Acids Res 2005;33:3821-7.
    • (2005) Nucleic Acids Res , vol.33 , pp. 3821-3827
    • Fukue, Y.1    Sumida, N.2    Tanase, J.3
  • 17
    • 31944432339
    • A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters
    • Saxonov S, Berg P, Brutlag DL. A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proc Natl Acad Sci USA 2006;103:1412-7.
    • (2006) Proc Natl Acad Sci USA , vol.103 , pp. 1412-1417
    • Saxonov, S.1    Berg, P.2    Brutlag, D.L.3
  • 18
    • 38049156077
    • MetaProm: A neural network based meta-predictor for alternative human promoter prediction
    • Wang J, Ungar LH, Tseng H, et al. MetaProm: A neural network based meta-predictor for alternative human promoter prediction. BMC Genomics 2007;8:374.
    • (2007) BMC Genomics , vol.8 , pp. 374
    • Wang, J.1    Ungar, L.H.2    Tseng, H.3
  • 19
    • 33744805985
    • Genome-wide analysis of mammalian promoter architecture and evolution
    • Carninci P, Sandelin A, Lenhard B, et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nat Genet 2006; 38:626-35.
    • (2006) Nat Genet , vol.38 , pp. 626-635
    • Carninci, P.1    Sandelin, A.2    Lenhard, B.3
  • 20
    • 33745744510
    • A mammalian promoter model links cis elements to genetic networks
    • Wang J, Hannenhalli S. A mammalian promoter model links cis elements to genetic networks. Biochem Biophys Res Commun 2006;347:166-77.
    • (2006) Biochem Biophys Res Commun , vol.347 , pp. 166-177
    • Wang, J.1    Hannenhalli, S.2
  • 21
    • 59949103192
    • High-resolution human core-promoter prediction with CoreBoost_HM
    • Wang X, Xuan Z, Zhao X, et al. High-resolution human core-promoter prediction with CoreBoost_HM. Genome Res 2009;19:266-75.
    • (2009) Genome Res , vol.19 , pp. 266-275
    • Wang, X.1    Xuan, Z.2    Zhao, X.3
  • 22
    • 34447525636
    • Boosting with stumps for predicting transcription start sites
    • Zhao X, Xuan Z, Zhang MQ. Boosting with stumps for predicting transcription start sites. Genome Biol 2007;8:R17.
    • (2007) Genome Biol , vol.8
    • Zhao, X.1    Xuan, Z.2    Zhang, M.Q.3
  • 24
    • 34247110449
    • Eukaryotic promoter prediction based on relative entropy and positional information
    • Wu S, Xie X, Liew AW, et al. Eukaryotic promoter prediction based on relative entropy and positional information. Phys Rev E Stat Nonlin Soft Matter Phys 2007; 75:041908.
    • (2007) Phys Rev E Stat Nonlin Soft Matter Phys , vol.75 , pp. 041908
    • Wu, S.1    Xie, X.2    Liew, A.W.3
  • 25
    • 40649114197
    • DNA sequence and structural properties as predictors of human and mouse promoters
    • Akan P, Deloukas P. DNA sequence and structural properties as predictors of human and mouse promoters. Gene 2008;410:165-76.
    • (2008) Gene , vol.410 , pp. 165-176
    • Akan, P.1    Deloukas, P.2
  • 26
    • 0032575756
    • DNA structure in human RNA polymerase II promoters
    • Pedersen AG, Baldi P, Chauvin Y, et al. DNA structure in human RNA polymerase II promoters. J Mol Biol 1998; 281:663-73.
    • (1998) J Mol Biol , vol.281 , pp. 663-673
    • Pedersen, A.G.1    Baldi, P.2    Chauvin, Y.3
  • 28
    • 58149143162
    • Structural properties of replication origins in yeast DNA sequences
    • Cao XQ, Zeng J, Yan H. Structural properties of replication origins in yeast DNA sequences. Phys Biol 2008;5:36012.
    • (2008) Phys Biol , vol.5 , pp. 36012
    • Cao, X.Q.1    Zeng, J.2    Yan, H.3
  • 29
    • 0035224747
    • Joint modeling of DNA sequence and physical properties to improve eukaryotic promoter recognition
    • Ohler U, Niemann H, Liao G, et al. Joint modeling of DNA sequence and physical properties to improve eukaryotic promoter recognition. Bioinformatics 2001;17(Suppl. 1): S199-206.
    • (2001) Bioinformatics , vol.17 , Issue.SUPPL. 1
    • Ohler, U.1    Niemann, H.2    Liao, G.3
  • 30
    • 77955471136
    • SCS: Signal, context and structure features for genome-wide human promoter recognition
    • doi:10.1109/TCBB.2008.95
    • Zeng J, Zhao X-Y, Cao X-Q, et al. SCS: Signal, context and structure features for genome-wide human promoter recognition. IEEE/ ACM Trans Comput Biol Bioinform 2009, doi:10.1109/TCBB.2008.95.
    • (2009) IEEE/ ACM Trans Comput Biol Bioinform
    • Zeng, J.1    Zhao, X.-Y.2    Cao, X.-Q.3
  • 31
    • 40449135396
    • Determining promoter location based on DNA structure first-principles calculations
    • Goni JR, Perez A, Torrents D, et al. Determining promoter location based on DNA structure first-principles calculations. Genome Biol 2007;8:R263.
    • (2007) Genome Biol , vol.8
    • Goni, J.R.1    Perez, A.2    Torrents, D.3
  • 32
    • 46249123636
    • ProSOM: Core promoter prediction based on unsupervised clustering of DNA physical profiles
    • Abeel T, Saeys Y, Rouze P, et al. ProSOM: Core promoter prediction based on unsupervised clustering of DNA physical profiles. Bioinformatics 2008;24:i24-31.
    • (2008) Bioinformatics , vol.24 , pp. i24-i31
    • Abeel, T.1    Saeys, Y.2    Rouze, P.3
  • 33
    • 0034614487
    • Sequence-dependent DNA structure: Tetranucleotide conformational maps
    • Packer MJ, Dauncey MP, Hunter CA. Sequence-dependent DNA structure: Tetranucleotide conformational maps. J Mol Biol 2000;295:85-103.
    • (2000) J Mol Biol , vol.295 , pp. 85-103
    • Packer, M.J.1    Dauncey, M.P.2    Hunter, C.A.3
  • 34
    • 34547624303
    • Genome-wide maps of chromatin state in pluripotent and lineage-committed cells
    • Mikkelsen TS, Ku M, Jaffe DB, et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 2007; 448:553-560.
    • (2007) Nature , vol.448 , pp. 553-560
    • Mikkelsen, T.S.1    Ku, M.2    Jaffe, D.B.3
  • 35
    • 15844412591
    • Improving promoter prediction for the NNPP2.2 algorithm: A case study using Escherichia coli DNA sequences
    • Burden S, Lin YX, Zhang R. Improving promoter prediction for the NNPP2.2 algorithm: A case study using Escherichia coli DNA sequences. Bioinformatics 2005; 21:601-7.
    • (2005) Bioinformatics , vol.21 , pp. 601-607
    • Burden, S.1    Lin, Y.X.2    Zhang, R.3
  • 36
    • 0035735265
    • Computational identification of promoters and first exons in the human genome
    • Davuluri RV, Grosse I, Zhang MQ. Computational identification of promoters and first exons in the human genome. Nat Genet 2001; 29:412-7.
    • (2001) Nat Genet , vol.29 , pp. 412-417
    • Davuluri, R.V.1    Grosse, I.2    Zhang, M.Q.3
  • 37
    • 0035999991
    • CpGProD: Identifying CpG islands associated with transcription start sites in large genomic mammalian sequences
    • Ponger L, Mouchiroud D. CpGProD: Identifying CpG islands associated with transcription start sites in large genomic mammalian sequences. Bioinformatics 2002;18: 631-3.
    • (2002) Bioinformatics , vol.18 , pp. 631-633
    • Ponger, L.1    Mouchiroud, D.2
  • 38
    • 0036123111
    • Computational detection and location of transcription start sites in mammalian genomic DNA
    • Down TA, Hubbard TJ. Computational detection and location of transcription start sites in mammalian genomic DNA. Genome Res 2002;12:458-61.
    • (2002) Genome Res , vol.12 , pp. 458-461
    • Down, T.A.1    Hubbard, T.J.2
  • 39
    • 33748653762
    • Automatic annotation of eukaryotic genes, pseudogenes and promoters
    • Solovyev V, Kosarev P, Seledsov I, et al. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol 2006;7(Suppl. 1):S10.1-12.
    • (2006) Genome Biol , vol.7 , Issue.SUPPL. 1
    • Solovyev, V.1    Kosarev, P.2    Seledsov, I.3
  • 40
    • 69249088554
    • PCA-HPR: A principle component analysis model for human promoter recognition
    • Li X, Zeng J, Yan H. PCA-HPR: A principle component analysis model for human promoter recognition. Bioinformation 2008;2:373-8.
    • (2008) Bioinformation , vol.2 , pp. 373-378
    • Li, X.1    Zeng, J.2    Yan, H.3
  • 41
    • 0033025047
    • Promoter2.0: For the recognition of PolII promoter sequences
    • Knudsen S. Promoter2.0: For the recognition of PolII promoter sequences. Bioinformatics 1999;15:356-61.
    • (1999) Bioinformatics , vol.15 , pp. 356-361
    • Knudsen, S.1
  • 42
    • 14844346192
    • Human pol II promoter prediction: Time series descriptors and machine learning
    • Gangal R, Sharma P. Human pol II promoter prediction: Time series descriptors and machine learning. Nucleic Acids Res 2005;33:1332-6.
    • (2005) Nucleic Acids Res , vol.33 , pp. 1332-1336
    • Gangal, R.1    Sharma, P.2
  • 43
    • 0036166744
    • Dragon Promoter Finder: Recognition of vertebrate RNA polymerase II promoters
    • Bajic VB, Seah SH, Chong A, et al. Dragon Promoter Finder: recognition of vertebrate RNA polymerase II promoters. Bioinformatics 2002;18:198-9.
    • (2002) Bioinformatics , vol.18 , pp. 198-199
    • Bajic, V.B.1    Seah, S.H.2    Chong, A.3
  • 44
    • 0042525750
    • Dragon gene start finder: An advanced system for finding approximate locations of the start of gene transcriptional units
    • Bajic VB, Seah SH. Dragon gene start finder: An advanced system for finding approximate locations of the start of gene transcriptional units. Genome Res 2003;13: 1923-9.
    • (2003) Genome Res , vol.13 , pp. 1923-1929
    • Bajic, V.B.1    Seah, S.H.2
  • 45
    • 85142129102
    • ARTS: Accurate recognition of transcription starts in human
    • Sonnenburg S, Zien A, Ratsch G. ARTS: Accurate recognition of transcription starts in human. Bioinformatics 2006;22: e472-80.
  • 46
    • 33751019180
    • PromoterExplorer: An effective promoter identification method based on the AdaBoost algorithm
    • Xie X, Wu S, Lam KM, et al. PromoterExplorer: An effective promoter identification method based on the AdaBoost algorithm. Bioinformatics 2006;22:2722-8.
    • (2006) Bioinformatics , vol.22 , pp. 2722-2728
    • Xie, X.1    Wu, S.2    Lam, K.M.3
  • 47
    • 39149083713
    • EnsemPro: An ensemble approach to predicting transcription start sites in human genomic DNA sequences
    • Won HH, Kim MJ, Kim S, et al. EnsemPro: An ensemble approach to predicting transcription start sites in human genomic DNA sequences. Genomics 2008;91:259-66.
    • (2008) Genomics , vol.91 , pp. 259-266
    • Won, H.H.1    Kim, M.J.2    Kim, S.3
  • 48
    • 33644873945
    • EPD in its twentieth year: Towards complete promoter coverage of selected model organisms
    • Schmid CD, Perier R, Praz V, et al. EPD in its twentieth year: towards complete promoter coverage of selected model organisms. Nucleic Acids Res 2006;34:D82-5.
    • (2006) Nucleic Acids Res , vol.34
    • Schmid, C.D.1    Perier, R.2    Praz, V.3
  • 50
    • 38549094621
    • DBTSS: Database of transcription start sites, progress report 2008
    • Wakaguri H, Yamashita R, Suzuki Y, et al. DBTSS: Database of transcription start sites, progress report 2008. Nucleic Acids Res 2008;36:D97-101.
    • (2008) Nucleic Acids Res , vol.36
    • Wakaguri, H.1    Yamashita, R.2    Suzuki, Y.3
  • 51
    • 33846057724
    • NCBI reference sequences (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins
    • Pruitt KD, Tatusova T, Maglott DR. NCBI reference sequences (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 2007;35:D61-5.
    • (2007) Nucleic Acids Res , vol.35
    • Pruitt, K.D.1    Tatusova, T.2    Maglott, D.R.3
  • 52
    • 0033977982
    • EID: The exon-intron database - an exhaustive database of protein-coding intron-containing genes
    • Saxonov S, Daizadeh I, Fedorov A, et al. EID: The exon-intron database - an exhaustive database of protein-coding intron-containing genes. Nucleic Acids Res 2000;28:185-190.
    • (2000) Nucleic Acids Res , vol.28 , pp. 185-190
    • Saxonov, S.1    Daizadeh, I.2    Fedorov, A.3
  • 53
    • 13444301473
    • UTRdb and UTRsite: A collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs
    • Mignone F, Grillo G, Licciulli F, et al. UTRdb and UTRsite: A collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs. Nucleic Acids Res 2005;33:D141-6.
    • (2005) Nucleic Acids Res , vol.33
    • Mignone, F.1    Grillo, G.2    Licciulli, F.3
  • 54
    • 38449092401
    • Prediction of transcription start sites based on feature selection using AMOSA
    • Wang X, Bandyopadhyay S, Xuan Z, et al. Prediction of transcription start sites based on feature selection using AMOSA. Comput Syst Bioinformatics Conf 2007;6:183-93.
    • (2007) Comput Syst Bioinformatics Conf , vol.6 , pp. 183-193
    • Wang, X.1    Bandyopadhyay, S.2    Xuan, Z.3
  • 56
    • 33644876957
    • TiProD: The tissue-specific promoter database
    • Chen X, Wu JM, Hornischer K, et al. TiProD: The tissue-specific promoter database. Nucleic Acids Res 2006;34: D104-7.
    • (2006) Nucleic Acids Res , vol.34
    • Chen, X.1    Wu, J.M.2    Hornischer, K.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.