메뉴 건너뛰기




Volumn 28, Issue 18, 2012, Pages

An approach to describing and analysing bulk biological annotation quality: A case study using UniProtKB

Author keywords

[No Author keywords available]

Indexed keywords

PROTEIN;

EID: 84866463515     PISSN: 13674803     EISSN: 14602059     Source Type: Journal    
DOI: 10.1093/bioinformatics/bts372     Document Type: Article
Times cited : (16)

References (38)
  • 1
    • 1042273235 scopus 로고    scopus 로고
    • Zipf's law and the internet
    • Adamic,L.A. and Huberman,B.A. (2002) Zipf's law and the internet. Glottometrics, 3, 143-150.
    • (2002) Glottometrics , vol.3 , pp. 143-150
    • Adamic, L.A.1    Huberman, B.A.2
  • 2
    • 34748833491 scopus 로고    scopus 로고
    • Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach
    • Andorf,C. et al. (2007) Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach. BMC Bioinformatics, 8, 284+.
    • (2007) BMC Bioinformatics , vol.8
    • Andorf, C.1
  • 3
    • 0031864543 scopus 로고    scopus 로고
    • The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998
    • Bairoch,A. and Apweiler,R. (1998) The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998. Nucleic Acids Res., 26, 38-42.
    • (1998) Nucleic Acids Res , vol.26 , pp. 38-42
    • Bairoch, A.1    Apweiler, R.2
  • 4
    • 0033957834 scopus 로고    scopus 로고
    • The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000
    • Bairoch,A. and Apweiler,R. (2000) The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res., 28, 45-48.
    • (2000) Nucleic Acids Res , vol.28 , pp. 45-48
    • Bairoch, A.1    Apweiler, R.2
  • 5
    • 0000195489 scopus 로고    scopus 로고
    • Quantitative linguistics and complex system studies
    • Balasubrahmanyan,V.K. and Naranan, S. (1996) Quantitative linguistics and complex system studies. J. Quant. Linguisti., 3, 177-228.
    • (1996) J. Quant. Linguisti , vol.3 , pp. 177-228
    • Balasubrahmanyan, V.K.1    Naranan, S.2
  • 6
    • 34547844156 scopus 로고    scopus 로고
    • Manual curation is not sufficient for annotation of genomic databases
    • Baumgartner,W.A. et al. (2007) Manual curation is not sufficient for annotation of genomic databases. Bioinformatics, 23, i41-148.
    • (2007) Bioinformatics , vol.23
    • Baumgartner, W.A.1
  • 7
    • 0037255072 scopus 로고    scopus 로고
    • The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
    • Boeckmann,B. et al. (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res., 31, 365-370.
    • (2003) Nucleic Acids Res. , vol.31 , pp. 365-370
    • Boeckmann, B.1
  • 8
    • 79955956411 scopus 로고    scopus 로고
    • 2nd Edition (Dover Phoenix Editions). Dover Publications
    • Brillouin,L. et al. (2004) Science and Information Theory. 2nd Edition (Dover Phoenix Editions). Dover Publications.
    • (2004) Science and Information Theory
    • Brillouin, L.1
  • 9
    • 39149103734 scopus 로고    scopus 로고
    • Gene ontology annotation quality analysis in model eukaryotes
    • Buza,T.J. et al. (2008) Gene ontology annotation quality analysis in model eukaryotes. Nucleic Acids Research, 36, e12.
    • (2008) Nucleic Acids Research , vol.36
    • Buza, T.J.1
  • 10
    • 33947311528 scopus 로고    scopus 로고
    • An evaluation of GO annotation retrieval for BioCreAtIvE and GOA
    • Camon,E.B. et al. (2005) An evaluation of GO annotation retrieval for BioCreAtIvE and GOA. BMC Bioinformatics, 6 (Suppl. 1), S17.
    • (2005) BMC Bioinformatics , vol.6 , Issue.SUPPL. 1
    • Camon, E.B.1
  • 11
    • 0013122906 scopus 로고    scopus 로고
    • Two regimes in the frequency of words and the origins of complex lexicons: Zipf's law revisited
    • Cancho,R.F. and Solé,R.V. (2001) Two regimes in the frequency of words and the origins of complex lexicons: Zipf's law revisited. J. Quant. Linguist., 8, 165-173.
    • (2001) J. Quant. Linguist , vol.8 , pp. 165-173
    • Cancho, R.F.1    Solé, R.V.2
  • 12
    • 65549085067 scopus 로고    scopus 로고
    • Power-law distributions in empirical data
    • Clauset,A. et al. (2009) Power-law distributions in empirical data. SIAM Rev., 51, 661+.
    • (2009) SIAM Rev , vol.51
    • Clauset, A.1
  • 13
    • 2442707676 scopus 로고    scopus 로고
    • The Ensembl automatic gene annotation system
    • Curwen,V. et al. (2004) The Ensembl automatic gene annotation system. Genome Res., 14, 942-950.
    • (2004) Genome Res. , vol.14 , pp. 942-950
    • Curwen, V.1
  • 14
    • 84860501618 scopus 로고    scopus 로고
    • A procedure for assessing GO annotation consistency
    • i136-i143
    • Dolan,M.E. et al. (2005) A procedure for assessing GO annotation consistency. Bioinformatics, 21 (Suppl. 1), i136-i143.
    • (2005) Bioinformatics , vol.21
    • Dolan, M.E.1
  • 15
    • 17744388988 scopus 로고    scopus 로고
    • The variation of Zipf's law in human language
    • Ferrer,R. (2005) The variation of Zipf's law in human language. Eur. Phys. J. B, 44, 249-257.
    • (2005) Eur. Phys. J. B. , vol.44 , pp. 249-257
    • Ferrer, R.1
  • 16
    • 8344232338 scopus 로고    scopus 로고
    • Decoding least effort and scaling in signal frequency distributions
    • Ferrericancho,R. (2005) Decoding least effort and scaling in signal frequency distributions. Physica A, Stat. Mech. Appl., 345, 275-284.
    • (2005) Physica A, Stat. Mech. Appl , vol.345 , pp. 275-284
    • Ferrericancho, R.1
  • 17
    • 0344229953 scopus 로고
    • A new readability yardstick
    • Flesch,R. (1948) A new readability yardstick. J. Appl. Psychol., 32, 221-233.
    • (1948) J. Appl. Psychol , vol.32 , pp. 221-233
    • Flesch, R.1
  • 18
    • 12244283680 scopus 로고    scopus 로고
    • Modeling the percolation of annotation errors in a database of protein sequences
    • Gilks,W.R. et al. (2002) Modeling the percolation of annotation errors in a database of protein sequences. Bioinformatics, 18, 1641-1649.
    • (2002) Bioinformatics , vol.18 , pp. 1641-1649
    • Gilks, W.R.1
  • 19
    • 84866460435 scopus 로고    scopus 로고
    • GO Consortium Guide to GO evidence codes
    • GO Consortium (2011) Guide to GO evidence codes.
    • (2011)
  • 21
    • 85048998731 scopus 로고    scopus 로고
    • Zipf and type-token rules for the English, Spanish, Irish and Latin languages
    • Ha,L.Q. et al. (2006) Zipf and type-token rules for the English, Spanish, Irish and Latin languages. Web J. Formal Comput. Congni. Linguist., 8, 1-12.
    • (2006) Web J. Formal Comput. Congni. Linguist , vol.8 , pp. 1-12
    • Ha, L.Q.1
  • 22
    • 84877833156 scopus 로고    scopus 로고
    • Multiple gold standards address bias in functional network integration
    • University of Newcastle Upon Tyne
    • James,K. et al. (2011) Multiple gold standards address bias in functional network integration. Technical Report 1302. University of Newcastle Upon Tyne.
    • (2011) Technical Report 1302
    • James, K.1
  • 23
    • 34250790725 scopus 로고    scopus 로고
    • Estimating the annotation error rate of curated GO database sequence annotations
    • Jones,C. et al. (2007) Estimating the annotation error rate of curated GO database sequence annotations. BMC Bioinformatics, 8, 170+.
    • (2007) BMC Bioinformatics , vol.8
    • Jones, C.1
  • 24
    • 84861878887 scopus 로고    scopus 로고
    • The language of gene ontology: a zipf's law analysis
    • Kalankesh,L. et al. (2012) The language of gene ontology: a zipf's law analysis. BMC Bioinformatics, 13, 127+.
    • (2012) BMC Bioinformatics , vol.13
    • Kalankesh, L.1
  • 25
    • 0001736594 scopus 로고
    • SMOG grading-a new readability formula
    • Laughlin,H.M. (1969) SMOG grading-a new readability formula. J. Reading, 12, 639-46.
    • (1969) J. Reading , vol.12 , pp. 639-46
    • Laughlin, H.M.1
  • 26
    • 0037480738 scopus 로고    scopus 로고
    • Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation
    • Lord,P.W. et al. (2003) Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation. Bioinformatics, 19, 1275-1283.
    • (2003) Bioinformatics , vol.19 , pp. 1275-1283
    • Lord, P.W.1
  • 27
    • 79960976768 scopus 로고    scopus 로고
    • UniProt knowledgebase: a hub of integrated protein data
    • Uniprot Consortium
    • Magrane,M. and Uniprot Consortium (2011) UniProt knowledgebase: a hub of integrated protein data. Database, 2011, bar009.
    • (2011) Database
    • Magrane, M.1
  • 28
    • 11844292002 scopus 로고    scopus 로고
    • Inference of protein function from protein structure
    • Pal,D. and Eisenberg,D. (2005) Inference of protein function from protein structure. Structure, 13, 121-130.
    • (2005) Structure , vol.13 , pp. 121-130
    • Pal, D.1    Eisenberg, D.2
  • 29
    • 34248714355 scopus 로고    scopus 로고
    • Statistical parameters in pathological text
    • Piotrowska,W. and Piotrowska,X. (2004) Statistical parameters in pathological text. J. Quant. Linguist., 11, 133-140.
    • (2004) J. Quant. Linguist , vol.11 , pp. 133-140
    • Piotrowska, W.1    Piotrowska, X.2
  • 30
    • 78651305142 scopus 로고    scopus 로고
    • COMBREX: a project to accelerate the functional annotation of prokaryotic genomes
    • Roberts,R.J. et al. (2011) COMBREX: a project to accelerate the functional annotation of prokaryotic genomes. Nucleic Acids Res., 39 (Suppl. 1), D11-D14.
    • (2011) Nucleic Acids Res , vol.39 , Issue.SUPPL. 1
    • Roberts, R.J.1
  • 31
    • 65449147869 scopus 로고    scopus 로고
    • The use of Gene Ontology evidence codes in preventing classifier assessment bias
    • Rogers,M.F. and Ben-Hur,A. (2009) The use of Gene Ontology evidence codes in preventing classifier assessment bias. Bioinformatics, 25, 1173-1177.
    • (2009) Bioinformatics , vol.25 , pp. 1173-1177
    • Rogers, M.F.1    Ben-Hur, A.2
  • 32
    • 74549221383 scopus 로고    scopus 로고
    • Annotation error in public databases: misannotation of molecular function in enzyme superfamilies
    • Schnoes,A.M. et al. (2009) Annotation error in public databases: misannotation of molecular function in enzyme superfamilies. PLoS Comput. Biol., 5, e1000605+.
    • (2009) PLoS Comput. Biol , vol.5
    • Schnoes, A.M.1
  • 33
    • 65549163086 scopus 로고    scopus 로고
    • Modeling statistical properties of written text
    • Serrano,M.A. et al. (2009) Modeling statistical properties of written text. PLoS ONE, 4, e5372+.
    • (2009) PLoS ONE , vol.4
    • Serrano, M.A.1
  • 34
    • 84860480259 scopus 로고    scopus 로고
    • Application of ontologies in bioinformatics
    • Staab,S. and Studer,R. (eds) International Handbooks Information System, Springer, Berlin, Heidelberg
    • Stevens,R. and Lord,P. (2009) Application of ontologies in bioinformatics. In Staab,S. and Studer,R. (eds) Handbook on Ontologies, International Handbooks Information System, Springer, Berlin, Heidelberg, pp. 735-756.
    • (2009) Handbook on Ontologies , pp. 735-756
    • Stevens, R.1    Lord, P.2
  • 35
    • 75849153303 scopus 로고    scopus 로고
    • The universal protein resource (uniprot) in 2010
    • UniProt Consortium
    • UniProt Consortium (2010) The universal protein resource (uniprot) in 2010. Nucleic Acids Res., 38 (Suppl. 1), D142-D148.
    • (2010) Nucleic Acids Res. , vol.38 , Issue.SUPPL. 1
  • 36
    • 84877839331 scopus 로고    scopus 로고
    • UniProt knowledgebase user manual
    • UniProt Consortium
    • UniProt Consortium (2011) UniProt knowledgebase user manual. http://www.geneontology.org/GO.evidence.shtml
    • (2011)
  • 37
    • 4344568000 scopus 로고    scopus 로고
    • Genome update: annotation quality in sequenced microbial genomes
    • Ussery,D.W. and Hallin,P.F. (2004) Genome update: annotation quality in sequenced microbial genomes. Microbiology, 150 (Pt 7), 2015-2017.
    • (2004) Microbiology , vol.150 , Issue.PART 7 , pp. 2015-2017
    • Ussery, D.W.1    Hallin, P.F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.