Volume 16, Issue 2, 2009, Pages 247-255

BioTagger-GM: A Gene/Protein Name Recognition System

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; COMPUTER SYSTEM; GENE IDENTIFICATION; LINGUISTICS; MACHINE LEARNING; MEDICAL TECHNOLOGY; PROTEIN ANALYSIS;

EID: 60549093731     PISSN: 10675027     EISSN: None     Source Type: Journal    
DOI: 10.1197/jamia.M2844     Document Type: Article
Times cited : (53)

References (52)
  • 1
    • 0001413266
    • Toward routine automatic pathway discovery from on-line scientific text abstracts
    • Ng S.K., and Wong M. Toward routine automatic pathway discovery from on-line scientific text abstracts. Genome Inform Ser Workshop Genome Inform 10 (1999) 104-112
    • (1999) Genome Inform Ser Workshop Genome Inform , vol.10 , pp. 104-112
    • Ng, S.K.1    Wong, M.2
  • 2
    • 0002284942
    • Identifying the interaction between genes and gene products based on frequently seen verbs in Medline abstracts
    • Sekimizu T., Park H.S., and Tsujii J. Identifying the interaction between genes and gene products based on frequently seen verbs in Medline abstracts. Genome Inform Ser Workshop Genome Inform 9 (1998) 62-71
    • (1998) Genome Inform Ser Workshop Genome Inform , vol.9 , pp. 62-71
    • Sekimizu, T.1    Park, H.S.2    Tsujii, J.3
  • 3
    • 12144290446
    • GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data
    • Rzhetsky A., Iossifov I., Koike T., et al. GeneWays: a system for extracting, analyzing, visualizing, and integrating molecular pathway data. J Biomed Inform 37 (2004) 43-53
    • (2004) J Biomed Inform , vol.37 , pp. 43-53
    • Rzhetsky, A.1    Iossifov, I.2    Koike, T.3
  • 4
    • 3242781571
    • The Genomics of a Signaling Pathway: A KDD Cup Challenge Task
    • Craven M. The Genomics of a Signaling Pathway: A KDD Cup Challenge Task. SIGKDD Explorations 4 (2003) 97-98
    • (2003) SIGKDD Explorations , vol.4 , pp. 97-98
    • Craven, M.1
  • 6
    • 0033657546
    • EDGAR: extraction of drugs, genes and relations from the biomedical literature
    • Rindflesch T.C., Tanabe L., Weinstein J.N., and Hunter L. EDGAR: extraction of drugs, genes and relations from the biomedical literature. Pac Symp Biocomput (2000) 517-528
    • (2000) Pac Symp Biocomput , pp. 517-528
    • Rindflesch, T.C.1    Tanabe, L.2    Weinstein, J.N.3    Hunter, L.4
  • 7
    • 0034564486
    • A pragmatic information extraction strategy for gathering data on genetic interactions
    • Proux D., Rechenmann F., and Julliard L. A pragmatic information extraction strategy for gathering data on genetic interactions. Int Conf Intell Syst Mol Biol 8 (2000) 279-285
    • (2000) Int Conf Intell Syst Mol Biol , vol.8 , pp. 279-285
    • Proux, D.1    Rechenmann, F.2    Julliard, L.3
  • 8
    • 0033643234
    • Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures
    • Humphreys K., Demetriou G., and Gaizauskas R. Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures. Pac Symp Biocomput (2000) 505-516
    • (2000) Pac Symp Biocomput , pp. 505-516
    • Humphreys, K.1    Demetriou, G.2    Gaizauskas, R.3
  • 10
    • 0035228039
    • PIES, a protein interaction extraction system
    • Wong L. PIES, a protein interaction extraction system. Pac Symp Biocomput (2001) 520-531
    • (2001) Pac Symp Biocomput , pp. 520-531
    • Wong, L.1
  • 11
    • 0031633368
    • Toward information extraction: identifying protein names from biological papers
    • Fukuda K., Tamura A., Tsunoda T., and Takagi T. Toward information extraction: identifying protein names from biological papers. Pac Symp Biocomput (1998) 707-718
    • (1998) Pac Symp Biocomput , pp. 707-718
    • Fukuda, K.1    Tamura, A.2    Tsunoda, T.3    Takagi, T.4
  • 14
    • 0043130572
    • Playing biology's name game: identifying protein names in scientific text
    • Hanisch D., Fluck J., Mevissen H.T., and Zimmer R. Playing biology's name game: identifying protein names in scientific text. Pac Symp Biocomput (2003) 403-414
    • (2003) Pac Symp Biocomput , pp. 403-414
    • Hanisch, D.1    Fluck, J.2    Mevissen, H.T.3    Zimmer, R.4
  • 15
    • 1042269470
    • GAPSCORE: finding gene and protein names one word at a time
    • Chang J.T., Schutze H., and Altman R.B. GAPSCORE: finding gene and protein names one word at a time. Bioinformatics 20 (2004) 216-225
    • (2004) Bioinformatics , vol.20 , pp. 216-225
    • Chang, J.T.1    Schutze, H.2    Altman, R.B.3
  • 16
    • 8444232801
    • Improving the performance of dictionary-based approaches in protein name recognition
    • Tsuruoka Y., and Tsujii J. Improving the performance of dictionary-based approaches in protein name recognition. J Biomed Inform 37 (2004) 461-470
    • (2004) J Biomed Inform , vol.37 , pp. 461-470
    • Tsuruoka, Y.1    Tsujii, J.2
  • 17
    • 35748966977
    • Learning string similarity measures for gene/protein name dictionary look-up using logistic regression
    • Tsuruoka Y., McNaught J., Tsujii J., and Ananiadou S. Learning string similarity measures for gene/protein name dictionary look-up using logistic regression. Bioinformatics 23 (2007) 2768-2774
    • (2007) Bioinformatics , vol.23 , pp. 2768-2774
    • Tsuruoka, Y.1    McNaught, J.2    Tsujii, J.3    Ananiadou, S.4
  • 18
    • 16344370215
    • Automatic extraction of gene/protein biological functions from biomedical text
    • Koike A., Niwa Y., and Takagi T. Automatic extraction of gene/protein biological functions from biomedical text. Bioinformatics 21 (2005) 1227-1236
    • (2005) Bioinformatics , vol.21 , pp. 1227-1236
    • Koike, A.1    Niwa, Y.2    Takagi, T.3
  • 19
    • 29144501213
    • High-recall protein entity recognition using a dictionary
    • Kou Z., Cohen W.W., and Murphy R.F. High-recall protein entity recognition using a dictionary. Bioinformatics 21 Suppl 1 (2005) i266-i273
    • (2005) Bioinformatics , vol.21 , Issue.SUPPL. 1
    • Kou, Z.1    Cohen, W.W.2    Murphy, R.F.3
  • 20
    • 2342474548
    • A simple and practical dictionary-based approach for identification of proteins in Medline abstracts
    • Egorov S., Yuryev A., and Daraselia N. A simple and practical dictionary-based approach for identification of proteins in Medline abstracts. J Am Med Inform Assoc 11 (2004) 174-178
    • (2004) J Am Med Inform Assoc , vol.11 , pp. 174-178
    • Egorov, S.1    Yuryev, A.2    Daraselia, N.3
  • 21
    • 3242881967
    • NLProt: extracting protein names and sequences from papers
    • (Web Server issue)
    • Mika S., and Rost B. NLProt: extracting protein names and sequences from papers. Nucleic Acids Res 32 (2004) W634-W637 (Web Server issue)
    • (2004) Nucleic Acids Res , vol.32
    • Mika, S.1    Rost, B.2
  • 22
    • 33947359082
    • BioCreAtIvE task 1A: gene mention finding evaluation
    • Yeh A., Morgan A., Colosimo M., and Hirschman L. BioCreAtIvE task 1A: gene mention finding evaluation. BMC Bioinformatics 6 Suppl 1 (2005) S2
    • (2005) BMC Bioinformatics , vol.6 , Issue.SUPPL. 1
    • Yeh, A.1    Morgan, A.2    Colosimo, M.3    Hirschman, L.4
  • 24
    • 51049088874
    • Overview of BioCreative II gene mention recognition
    • Smith L., Tanabe L.K., Ando R.J., et al. Overview of BioCreative II gene mention recognition. Genome Biol 9 Suppl 2 (2008) S2
    • (2008) Genome Biol , vol.9 , Issue.SUPPL. 2
    • Smith, L.1    Tanabe, L.K.2    Ando, R.J.3
  • 28
    • 37249060650
    • On the Diversity-Performance Relationship for Majority Voting in Classifier Ensembles
    • Springer-Verlag, New York 7th International Workshop on Multiple Classifier Systems (MCS2007). Springer-Verlag, 2007
    • Chung Y.S., Hsu D.F., and Tang C.Y. On the Diversity-Performance Relationship for Majority Voting in Classifier Ensembles. Proceedings of the Seventh International Workshop on Multiple Classifier Systems, Lecture Notes in Computer Science 4472 (2007), Springer-Verlag, New York 407-420 7th International Workshop on Multiple Classifier Systems (MCS2007). Springer-Verlag, 2007
    • (2007) Proceedings of the Seventh International Workshop on Multiple Classifier Systems, Lecture Notes in Computer Science , vol.4472 , pp. 407-420
    • Chung, Y.S.1    Hsu, D.F.2    Tang, C.Y.3
  • 29
    • 46249124738
    • Rich Feature Set, Unification of Bidirectional Parsing and Dictionary Filtering for High-F-Score Gene Mention Tagging
    • Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain
    • Kuo C.-J., Chang Y.-M., Huang H.-S., et al. Rich Feature Set, Unification of Bidirectional Parsing and Dictionary Filtering for High-F-Score Gene Mention Tagging. Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain 257-271
    • (2007) Proceedings of the Second BioCreative Challenge Evaluation Workshop , pp. 257-271
    • Kuo, C.-J.1    Chang, Y.-M.2    Huang, H.-S.3
  • 30
    • 46249127773
    • BioCreative II Gene Mention Tagging System at IBM Watson
    • Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain
    • Ando R.K. BioCreative II Gene Mention Tagging System at IBM Watson. Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain 257-271
    • (2007) Proceedings of the Second BioCreative Challenge Evaluation Workshop , pp. 257-271
    • Ando, R.K.1
  • 32
    • 51049096051
    • Gene Mention and Gene Normalization Based on Machine Learning and Online Resources
    • Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain
    • Liu H., Torii M., Hu Z.Z., and Wu C.H. Gene Mention and Gene Normalization Based on Machine Learning and Online Resources. Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain 257-271
    • (2007) Proceedings of the Second BioCreative Challenge Evaluation Workshop , pp. 257-271
    • Liu, H.1    Torii, M.2    Hu, Z.Z.3    Wu, C.H.4
  • 33
    • 30344472426
    • BioThesaurus: a web-based thesaurus of protein and gene names
    • Liu H., Hu Z.Z., Zhang J., and Wu C. BioThesaurus: a web-based thesaurus of protein and gene names. Bioinformatics 22 (2006) 103-105
    • (2006) Bioinformatics , vol.22 , pp. 103-105
    • Liu, H.1    Hu, Z.Z.2    Zhang, J.3    Wu, C.4
  • 34
  • 35
    • 0842330093
    • The iProClass integrated database for protein functional analysis
    • Wu C.H., Huang H., Nikolskaya A., Hu Z., and Barker W.C. The iProClass integrated database for protein functional analysis. Comput Biol Chem 28 (2004) 87-96
    • (2004) Comput Biol Chem , vol.28 , pp. 87-96
    • Wu, C.H.1    Huang, H.2    Nikolskaya, A.3    Hu, Z.4    Barker, W.C.5
  • 36
    • 0345863927
    • The Unified Medical Language System (UMLS): integrating biomedical terminology
    • Bodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res 32 (2004) D267-D270
    • (2004) Nucleic Acids Res , vol.32
    • Bodenreider, O.1
  • 40
    • 33646436650
    • Conference of Human Language Technology and North American Chapter of Association of Computational Linguistics
    • Sha F., and Pereira F. Shallow Parsing with Conditional Random Fields. Conference of Human Language Technology and North American Chapter of Association of Computational Linguistics (2003)
    • (2003) Shallow Parsing with Conditional Random Fields
    • Sha, F.1    Pereira, F.2
  • 41
    • 33947305118
    • Identifying gene and protein mentions in text using conditional random fields
    • McDonald R., and Pereira F. Identifying gene and protein mentions in text using conditional random fields. BMC Bioinformatics 6 Suppl 1 (2005) S6
    • (2005) BMC Bioinformatics , vol.6 , Issue.SUPPL. 1 , pp. S6
    • McDonald, R.1    Pereira, F.2
  • 42
    • 25144520247
    • ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text
    • Settles B. ABNER: an open source tool for automatically tagging genes, proteins and other entity names in text. Bioinformatics 21 (2005) 3191-3192
    • (2005) Bioinformatics , vol.21 , pp. 3191-3192
    • Settles, B.1
  • 43
    • 33947250174
    • GENETAG: a tagged corpus for gene/protein named entity recognition
    • Tanabe L., Xie N., Thom L.H., Matten W., and Wilbur W.J. GENETAG: a tagged corpus for gene/protein named entity recognition. BMC Bioinformatics 6 Suppl 1 (2005) S3
    • (2005) BMC Bioinformatics , vol.6 , Issue.SUPPL. 1
    • Tanabe, L.1    Xie, N.2    Thom, L.H.3    Matten, W.4    Wilbur, W.J.5
  • 44
    • 46249091349
    • High-Recall Gene Mention Recognition by Unification of Multiple Backward Parsing Models
    • Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain
    • Huang H.-S., Lin Y.-S., Lin K.-T., et al. High-Recall Gene Mention Recognition by Unification of Multiple Backward Parsing Models. Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain 257-271
    • (2007) Proceedings of the Second BioCreative Challenge Evaluation Workshop , pp. 257-271
    • Huang, H.-S.1    Lin, Y.-S.2    Lin, K.-T.3
  • 45
    • 60549102576
    • Alias-i. 2007. LingPipe 3.1.2, computer program, Accessed: November 1, 2007
    • Alias-i. 2007. LingPipe 3.1.2. http://alias-i.com/lingpipe/ [computer program]. Accessed: November 1, 2007.
  • 46
    • 0013157252
    • Ninth Conference of the European Chapter of the Association for Computational Linguistics
    • Sang E.F.T.K., and Veenstra J. Representing text chunks. Ninth Conference of the European Chapter of the Association for Computational Linguistics (1999) 173-179
    • (1999) Representing text chunks , pp. 173-179
    • Sang, E.F.T.K.1    Veenstra, J.2
  • 47
    • 34248847962
    • A Method for Disambiguating Word Senses in a Large Corpus
    • Gale W., Church K., and Yarowsky D. A Method for Disambiguating Word Senses in a Large Corpus. Computers and the Humanities 26 (1992) 415-439
    • (1992) Computers and the Humanities , vol.26 , pp. 415-439
    • Gale, W.1    Church, K.2    Yarowsky, D.3
  • 48
    • 0041627757
    • A simple algorithm for identifying abbreviation definitions in biomedical text
    • Schwartz A.S., and Hearst M.A. A simple algorithm for identifying abbreviation definitions in biomedical text. Pac Symp Biocomput (2003) 451-462
    • (2003) Pac Symp Biocomput , pp. 451-462
    • Schwartz, A.S.1    Hearst, M.A.2
  • 49
    • 41649102022
    • An integrated approach to concept recognition in biomedical text
    • Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain
    • Baumgartner Jr. W.A., Lu Z., Johnson H.L., et al. An integrated approach to concept recognition in biomedical text. Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), Centro Nacional de Investigaciones Oncologicas (CNIO), Madrid, Spain 257-271
    • (2007) Proceedings of the Second BioCreative Challenge Evaluation Workshop , pp. 257-271
    • Baumgartner Jr., W.A.1    Lu, Z.2    Johnson, H.L.3
  • 51
    • 40549140499
    • BANNER: An executable survey of advances in biomedical named entity recognition
    • Leaman R., and Gonzalez G. BANNER: An executable survey of advances in biomedical named entity recognition. Pac Symp Biocomput (2008) 652-663
    • (2008) Pac Symp Biocomput , pp. 652-663
    • Leaman, R.1    Gonzalez, G.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.