



Volume 7, Issue , 2016, Pages

Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow

Author keywords

[No Author keywords available]

Indexed keywords

PROTEOME; PROTEIN;

EID: 84973345496     PISSN: None     EISSN: 20411723     Source Type: Journal    
DOI: 10.1038/ncomms11778     Document Type: Article
Times cited : (60)

References (57)
  • 1
    • 33748645500
    • GENCODE: Producing a reference annotation for ENCODE
    • Harrow, J. et al. GENCODE: producing a reference annotation for ENCODE. Genome Biol. 7(Suppl 1): S41-S49 (2006).
    • (2006) Genome Biol. , vol.7 , pp. S41-S49
    • Harrow, J.1
  • 2
    • 60749118171
    • Identifying protein-coding genes in genomic sequences
    • Harrow, J. et al. Identifying protein-coding genes in genomic sequences. Genome Biol. 10, 201 (2009).
    • (2009) Genome Biol. , vol.10 , pp. 201
    • Harrow, J.1
  • 3
    • 79959446962
    • PhyloCSF: A comparative genomics method to distinguish protein coding and non-coding regions
    • Lin, M. F., Jungreis, I. & Kellis, M. PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions. Bioinformatics 27, i275-i282 (2011).
    • (2011) Bioinformatics , vol.27 , pp. i275-i282
    • Lin, M.F.1    Jungreis, I.2    Kellis, M.3
  • 4
    • 84875701652
    • Immunofluorescence and fluorescent-protein tagging show high correlation for protein localization in mammalian cells
    • Stadler, C. et al. Immunofluorescence and fluorescent-protein tagging show high correlation for protein localization in mammalian cells. Nat. Methods 10, 315-323 (2013).
    • (2013) Nat. Methods , vol.10 , pp. 315-323
    • Stadler, C.1
  • 5
    • 84891792618
    • Proteogenomic analysis of human chromosome 9-encoded genes from human samples and lung cancer tissues
    • Ahn, J. M. et al. Proteogenomic analysis of human chromosome 9-encoded genes from human samples and lung cancer tissues. J. Proteome Res. 13, 137-146 (2014).
    • (2014) J. Proteome Res. , vol.13 , pp. 137-146
    • Ahn, J.M.1
  • 6
    • 84874380007
    • Whole human genome proteogenomic mapping for ENCODE cell line data: Identifying protein-coding regions
    • Khatun, J. et al. Whole human genome proteogenomic mapping for ENCODE cell line data: identifying protein-coding regions. BMC Genomics 14, 141 (2013).
    • (2013) BMC Genomics , vol.14 , pp. 141
    • Khatun, J.1
  • 7
    • 84941099453
    • Advanced proteogenomic analysis reveals multiple peptide mutations and complex immunoglobulin peptides in colon cancer
    • Woo, S. et al. Advanced proteogenomic analysis reveals multiple peptide mutations and complex immunoglobulin peptides in colon cancer. J. Proteome Res. 14, 3555-3567 (2015).
    • (2015) J. Proteome Res. , vol.14 , pp. 3555-3567
    • Woo, S.1
  • 8
    • 84911444179
    • Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes
    • Ezkurdia, I. et al. Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes. Hum. Mol. Genet. 23, 5866-5878 (2014).
    • (2014) Hum. Mol. Genet. , vol.23 , pp. 5866-5878
    • Ezkurdia, I.1
  • 9
    • 33846884410
    • Improving gene annotation using peptide mass spectrometry
    • Tanner, S. et al. Improving gene annotation using peptide mass spectrometry. Genome Res. 17, 231-239 (2007).
    • (2007) Genome Res. , vol.17 , pp. 231-239
    • Tanner, S.1
  • 10
    • 79955555530
    • Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and 'resurrected' pseudogenes in the mouse genome
    • Brosch, M. et al. Shotgun proteomics aids discovery of novel protein-coding genes, alternative splicing, and 'resurrected' pseudogenes in the mouse genome. Genome Res. 21, 756-767 (2011).
    • (2011) Genome Res. , vol.21 , pp. 756-767
    • Brosch, M.1
  • 12
    • 84879389449
    • High performance computational analysis of large-scale proteome data sets to assess incremental contribution to coverage of the human genome
    • Neuhauser, N. et al. High performance computational analysis of large-scale proteome data sets to assess incremental contribution to coverage of the human genome. J. Proteome Res. 12, 2858-2868 (2013).
    • (2013) J. Proteome Res. , vol.12 , pp. 2858-2868
    • Neuhauser, N.1
  • 13
    • 84941057782
    • Metrics for the human proteome project 2015: Progress on the human proteome and guidelines for high-confidence protein identification
    • Omenn, G. S. et al. Metrics for the human proteome project 2015: progress on the human proteome and guidelines for high-confidence protein identification. J. Proteome Res. 14, 3452-3460 (2015).
    • (2015) J. Proteome Res. , vol.14 , pp. 3452-3460
    • Omenn, G.S.1
  • 14
    • 84901599553
    • A draft map of the human proteome
    • Kim, M. S. et al. A draft map of the human proteome. Nature 509, 575-581 (2014).
    • (2014) Nature , vol.509 , pp. 575-581
    • Kim, M.S.1
  • 15
    • 84901611036
    • Mass-spectrometry-based draft of the human proteome
    • Wilhelm, M. et al. Mass-spectrometry-based draft of the human proteome. Nature 509, 582-587 (2014).
    • (2014) Nature , vol.509 , pp. 582-587
    • Wilhelm, M.1
  • 16
    • 84958215121
    • Proteogenomics
    • Pandey, A. & Pevzner, P. A. Proteogenomics. Proteomics 14, 2631-2632 (2014).
    • (2014) Proteomics , vol.14 , pp. 2631-2632
    • Pandey, A.1    Pevzner, P.A.2
  • 17
    • 84908540871
    • Proteogenomics: Concepts, applications and computational strategies
    • Nesvizhskii, A. I. Proteogenomics: concepts, applications and computational strategies. Nat. Methods 11, 1114-1125 (2014).
    • (2014) Nat. Methods , vol.11 , pp. 1114-1125
    • Nesvizhskii, A.I.1
  • 18
    • 84868310579
    • Addressing statistical biases in nucleotide-derived protein databases for proteogenomic search strategies
    • Blakeley, P., Overton, I. M. & Hubbard, S. J. Addressing statistical biases in nucleotide-derived protein databases for proteogenomic search strategies. J. Proteome Res. 11, 5221-5234 (2012).
    • (2012) J. Proteome Res. , vol.11 , pp. 5221-5234
    • Blakeley, P.1    Overton, I.M.2    Hubbard, S.J.3
  • 19
    • 84947491071
    • A note on the false discovery rate of novel peptides in proteogenomics
    • Zhang, K. et al. A note on the false discovery rate of novel peptides in proteogenomics. Bioinformatics 31, 3249-3253 (2015).
    • (2015) Bioinformatics , vol.31 , pp. 3249-3253
    • Zhang, K.1
  • 20
    • 84940547273
    • A scalable approach for protein false discovery rate estimation in large proteomic data sets
    • Savitski, M. M., Wilhelm, M., Hahne, H., Kuster, B. & Bantscheff, M. A scalable approach for protein false discovery rate estimation in large proteomic data sets. Mol. Cell. Proteomics 14, 2394-2404 (2015).
    • (2015) Mol. Cell. Proteomics , vol.14 , pp. 2394-2404
    • Savitski, M.M.1    Wilhelm, M.2    Hahne, H.3    Kuster, B.4    Bantscheff, M.5
  • 21
    • 79959960525
    • Target-decoy approach and false discovery rate: When things may go wrong
    • Gupta, N., Bandeira, N., Keich, U. & Pevzner, P. A. Target-decoy approach and false discovery rate: when things may go wrong. J. Am. Soc. Mass Spectrom. 22, 1111-1120 (2011).
    • (2011) J. Am. Soc. Mass Spectrom. , vol.22 , pp. 1111-1120
    • Gupta, N.1    Bandeira, N.2    Keich, U.3    Pevzner, P.A.4
  • 22
    • 35748972060
    • Semi-supervised learning for peptide identification from shotgun proteomics data sets
    • Kall, L., Canterbury, J. D., Weston, J., Noble, W. S. & MacCoss, M. J. Semi-supervised learning for peptide identification from shotgun proteomics data sets. Nat. Methods 4, 923-925 (2007).
    • (2007) Nat. Methods , vol.4 , pp. 923-925
    • Kall, L.1    Canterbury, J.D.2    Weston, J.3    Noble, W.S.4    MacCoss, M.J.5
  • 23
    • 84901773142
    • Non-model organisms, a species endangered by proteogenomics
    • Armengaud, J. et al. Non-model organisms, a species endangered by proteogenomics. J. Proteomics 105, 5-18 (2014).
    • (2014) J. Proteomics , vol.105 , pp. 5-18
    • Armengaud, J.1
  • 24
    • 84913559600
    • Proteogenomics in microbiology: Taking the right turn at the junction of genomics and proteomics
    • Kucharova, V. & Wiker, H. G. Proteogenomics in microbiology: taking the right turn at the junction of genomics and proteomics. Proteomics 14, 2660-2675 (2014).
    • (2014) Proteomics , vol.14 , pp. 2660-2675
    • Kucharova, V.1    Wiker, H.G.2
  • 25
    • 84894619287
    • HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics
    • Branca, R. M. et al. HiRIEF LC-MS enables deep proteome coverage and unbiased proteogenomics. Nat. Methods 11, 59-62 (2014).
    • (2014) Nat. Methods , vol.11 , pp. 59-62
    • Branca, R.M.1
  • 26
    • 84941039788
    • Tissue-based proteogenomics reveals that human testis endows plentiful missing proteins
    • Zhang, Y. et al. Tissue-based proteogenomics reveals that human testis endows plentiful missing proteins. J. Proteome Res. 14, 3583-3594 (2015).
    • (2015) J. Proteome Res. , vol.14 , pp. 3583-3594
    • Zhang, Y.1
  • 27
    • 84920269464
    • Proteomics. Tissue-based map of the human proteome
    • Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
    • (2015) Science , vol.347 , pp. 1260419
    • Uhlen, M.1
  • 28
    • 77449108855
    • The PeptideAtlas project
    • Deutsch, E. W. The PeptideAtlas project. Methods Mol. Biol. 604, 285-296 (2010).
    • (2010) Methods Mol. Biol. , vol.604 , pp. 285-296
    • Deutsch, E.W.1
  • 30
    • 80555139654
    • CAGE (cap analysis of gene expression): A protocol for the detection of promoter and transcriptional networks
    • Takahashi, H., Kato, S., Murata, M. & Carninci, P. CAGE (cap analysis of gene expression): a protocol for the detection of promoter and transcriptional networks. Methods Mol. Biol. 786, 181-200 (2012).
    • (2012) Methods Mol. Biol. , vol.786 , pp. 181-200
    • Takahashi, H.1    Kato, S.2    Murata, M.3    Carninci, P.4
  • 31
    • 84861903786
    • A quantitative atlas of polyadenylation in five mammals
    • Derti, A. et al. A quantitative atlas of polyadenylation in five mammals. Genome Res. 22, 1173-1183 (2012).
    • (2012) Genome Res. , vol.22 , pp. 1173-1183
    • Derti, A.1
  • 32
    • 78650034777
    • Towards a knowledge-based Human Protein Atlas
    • Uhlen, M. et al. Towards a knowledge-based Human Protein Atlas. Nat. Biotechnol. 28, 1248-1250 (2010).
    • (2010) Nat. Biotechnol. , vol.28 , pp. 1248-1250
    • Uhlen, M.1
  • 33
    • 84874762979
    • The PRoteomics IDEntifications (PRIDE) database and associated tools: Status in 2013
    • Vizcaino, J. A. et al. The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 41, D1063-D1069 (2013).
    • (2013) Nucleic Acids Res. , vol.41 , pp. D1063-D1069
    • Vizcaino, J.A.1
  • 34
    • 33644876912
    • The PeptideAtlas project
    • Desiere, F. et al. The PeptideAtlas project. Nucleic Acids Res. 34, D655-D658 (2006).
    • (2006) Nucleic Acids Res. , vol.34 , pp. D655-D658
    • Desiere, F.1
  • 35
    • 84867345063
    • A cross-platform toolkit for mass spectrometry and proteomics
    • Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918-920 (2012).
    • (2012) Nat. Biotechnol. , vol.30 , pp. 918-920
    • Chambers, M.C.1
  • 36
    • 33846706581
    • TOPP - The OpenMS proteomics pipeline
    • Kohlbacher, O. et al. TOPP - the OpenMS proteomics pipeline. Bioinformatics 23, e191-e197 (2007).
    • (2007) Bioinformatics , vol.23 , pp. e191-e197
    • Kohlbacher, O.1
  • 37
    • 84865760395
    • GENCODE: The reference human genome annotation for the ENCODE Project
    • Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760-1774 (2012).
    • (2012) Genome Res. , vol.22 , pp. 1760-1774
    • Harrow, J.1
  • 38
    • 84946069451
    • UniProt: A hub for protein information
    • UniProt Consortium. UniProt: a hub for protein information. Nucleic Acids Res. 43, D204-D212 (2015).
    • (2015) Nucleic Acids Res. , vol.43 , pp. D204-D212
    • UniProt Consortium1
  • 40
    • 77952148742
    • Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs
    • Guttman, M. et al. Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat. Biotechnol. 28, 503-510 (2010).
    • (2010) Nat. Biotechnol. , vol.28 , pp. 503-510
    • Guttman, M.1
  • 41
    • 84865757142
    • Landscape of transcription in human cells
    • Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101-108 (2012).
    • (2012) Nature , vol.489 , pp. 101-108
    • Djebali, S.1
  • 42
    • 0034201441
    • EMBOSS: The European molecular biology open software suite
    • Rice, P., Longden, I. & Bleasby, A. EMBOSS: the European molecular biology open software suite. Trends Genet. 16, 276-277 (2000).
    • (2000) Trends Genet. , vol.16 , pp. 276-277
    • Rice, P.1    Longden, I.2    Bleasby, A.3
  • 43
    • 84923247212
    • MS-GF+ makes progress towards a universal database search tool for proteomics
    • Kim, S. & Pevzner, P. A. MS-GF+ makes progress towards a universal database search tool for proteomics. Nat. Commun. 5, 5277 (2014).
    • (2014) Nat. Commun. , vol.5 , pp. 5277
    • Kim, S.1    Pevzner, P.A.2
  • 44
    • 67049118923
    • Accurate and sensitive peptide identification with Mascot Percolator
    • Brosch, M., Yu, L., Hubbard, T. & Choudhary, J. Accurate and sensitive peptide identification with Mascot Percolator. J. Proteome Res. 8, 3176-3181 (2009).
    • (2009) J. Proteome Res. , vol.8 , pp. 3176-3181
    • Brosch, M.1    Yu, L.2    Hubbard, T.3    Choudhary, J.4
  • 45
    • 84864815807
    • Enhanced peptide identification by electron transfer dissociation using an improved Mascot Percolator
    • Wright, J. C. et al. Enhanced peptide identification by electron transfer dissociation using an improved Mascot Percolator. Mol. Cell. Proteomics 11, 478-491 (2012).
    • (2012) Mol. Cell. Proteomics , vol.11 , pp. 478-491
    • Wright, J.C.1
  • 46
    • 84893830089
    • Fast and accurate database searches with MS-GF+Percolator
    • Granholm, V. et al. Fast and accurate database searches with MS-GF+Percolator. J. Proteome Res. 13, 890-897 (2014).
    • (2014) J. Proteome Res. , vol.13 , pp. 890-897
    • Granholm, V.1
  • 47
    • 67650345724
    • Improvements to the percolator algorithm for peptide identification from shotgun proteomics data sets
    • Spivak, M., Weston, J., Bottou, L., Kall, L. & Noble, W. S. Improvements to the percolator algorithm for peptide identification from shotgun proteomics data sets. J. Proteome Res. 8, 3737-3745 (2009).
    • (2009) J. Proteome Res. , vol.8 , pp. 3737-3745
    • Spivak, M.1    Weston, J.2    Bottou, L.3    Kall, L.4    Noble, W.S.5
  • 48
    • 74049108922
    • BLAST+: Architecture and applications
    • Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 421 (2009).
    • (2009) BMC Bioinformatics , vol.10 , pp. 421
    • Camacho, C.1
  • 49
    • 84891767394
    • RefSeq: An update on mammalian reference sequences
    • Pruitt, K. D. et al. RefSeq: an update on mammalian reference sequences. Nucleic Acids Res. 42, D756-D763 (2014).
    • (2014) Nucleic Acids Res. , vol.42 , pp. D756-D763
    • Pruitt, K.D.1
  • 50
    • 84862187489
    • neXtProt: A knowledge platform for human proteins
    • Lane, L. et al. neXtProt: a knowledge platform for human proteins. Nucleic Acids Res. 40, D76-D83 (2012).
    • (2012) Nucleic Acids Res. , vol.40 , pp. D76-D83
    • Lane, L.1
  • 51
    • 84876996918
    • TopHat2: Accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions
    • Kim, D. et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 14, R36 (2013).
    • (2013) Genome Biol. , vol.14 , pp. R36
    • Kim, D.1
  • 52
    • 84928987900
    • HTSeq - A Python framework to work with high-throughput sequencing data
    • Anders, S., Pyl, P. T. & Huber, W. HTSeq - a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166-169 (2015).
    • (2015) Bioinformatics , vol.31 , pp. 166-169
    • Anders, S.1    Pyl, P.T.2    Huber, W.3
  • 53
    • 84891768365
    • Ensembl 2014
    • Flicek, P. et al. Ensembl 2014. Nucleic Acids Res. 42, D749-D755 (2014).
    • (2014) Nucleic Acids Res. , vol.42 , pp. D749-D755
    • Flicek, P.1
  • 54
    • 84946027595
    • Database resources of the National Center for Biotechnology Information
    • NCBI Resource Coordinators. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 43, D6-17 (2015).
    • (2015) Nucleic Acids Res. , vol.43 , pp. D6-17
    • NCBI Resource Coordinators1
  • 55
    • 84897406127
    • A promoter-level mammalian expression atlas
    • FANTOM Consortium et al. A promoter-level mammalian expression atlas. Nature 507, 462-470 (2014).
    • (2014) Nature , vol.507 , pp. 462-470
    • FANTOM Consortium et al1
  • 56
    • 84893276590
    • Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics
    • Fagerberg, L. et al. Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Mol. Cell. Proteomics 13, 397-406 (2014).
    • (2014) Mol. Cell. Proteomics , vol.13 , pp. 397-406
    • Fagerberg, L.1
  • 57
    • 84929705123
    • Principles of long non-coding RNA evolution derived from direct comparison of transcriptomes in 17 species
    • Hezroni, H. et al. Principles of long non-coding RNA evolution derived from direct comparison of transcriptomes in 17 species. Cell Rep. 11, 1110-1122 (2015).
    • (2015) Cell Rep. , vol.11 , pp. 1110-1122
    • Hezroni, H.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.