메뉴 건너뛰기




Volumn 65, Issue 2, 2015, Pages 145-153

Boosting drug named entity recognition using an aggregate classifier

Author keywords

Drug named entity recognition; Genetic programming evolved string similarity patterns; Gold standard vs. silver standard annotations; Named entity annotation sparsity; Named entity recogniser aggregation

Indexed keywords

ARTIFICIAL INTELLIGENCE; CLASSIFICATION (OF INFORMATION); GENETIC ALGORITHMS; GENETIC PROGRAMMING; LEARNING SYSTEMS; NATURAL LANGUAGE PROCESSING SYSTEMS; PHARMACOKINETICS; SILVER; SUPERVISED LEARNING; VOTING MACHINES;

EID: 84943586410     PISSN: 09333657     EISSN: 18732860     Source Type: Journal    
DOI: 10.1016/j.artmed.2015.05.007     Document Type: Article
Times cited : (45)

References (62)
  • 1
    • 17244380380 scopus 로고    scopus 로고
    • A survey of current work in biomedical text mining
    • Cohen A.M., Hersh W.R. A survey of current work in biomedical text mining. Brief Bioinform 2005, 6(1):57-71.
    • (2005) Brief Bioinform , vol.6 , Issue.1 , pp. 57-71
    • Cohen, A.M.1    Hersh, W.R.2
  • 3
    • 84907191783 scopus 로고    scopus 로고
    • How FDA Reviews Proposed Drug Names
    • [online, accessed 15.04.15].
    • U.S. Food and Drug Administration. How FDA Reviews Proposed Drug Names. [online, accessed 15.04.15]. http://www.fda.gov/downloads/Drugs/DrugSafety/MedicationErrors/ucm080867.pdf.
  • 4
    • 38549151817 scopus 로고    scopus 로고
    • DrugBank: a knowledgebase for drugs, drug actions and drug targets
    • Wishart D.S., Knox C., Guo A.C., Cheng D., Shrivastava S., Tzur D., et al. DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucl Acids Res 2008, 36(Suppl. 1):D901-D906.
    • (2008) Nucl Acids Res , vol.36 , pp. D901-D906
    • Wishart, D.S.1    Knox, C.2    Guo, A.C.3    Cheng, D.4    Shrivastava, S.5    Tzur, D.6
  • 5
    • 84893481418 scopus 로고    scopus 로고
    • An integrated pharmacokinetics ontology and semantically annotated corpus for text mining
    • American Medical Informatics Association (AMIA), Bethesda, MD, USA
    • Subhadarshini A., Karnik S., Wang Z., Philips S., Duke J., Quinney S., et al. An integrated pharmacokinetics ontology and semantically annotated corpus for text mining. Proceedings of Summit on Translational Bioinformatics 2012, American Medical Informatics Association (AMIA), Bethesda, MD, USA.
    • (2012) Proceedings of Summit on Translational Bioinformatics
    • Subhadarshini, A.1    Karnik, S.2    Wang, Z.3    Philips, S.4    Duke, J.5    Quinney, S.6
  • 9
    • 9444254243 scopus 로고    scopus 로고
    • Introduction to the CoNLL-2002 shared task: language-independent named entity recognition
    • Association for Computational Linguistics, Stroudsburg, PA, USA
    • Tjong Kim Sang E.F. Introduction to the CoNLL-2002 shared task: language-independent named entity recognition. Proceedings of the 6th Conference on Natural Language Learning, vol. 20, COLING-02 2002, 1-4. Association for Computational Linguistics, Stroudsburg, PA, USA.
    • (2002) Proceedings of the 6th Conference on Natural Language Learning, COLING-02 , vol.20 , pp. 1-4
    • Tjong Kim Sang, E.F.1
  • 11
    • 70350051042 scopus 로고    scopus 로고
    • Named entity recognition in biomedical literature: a comparison of support vector machines and conditional random fields
    • Springer, Berlin & Heidelberg, Germany, J. Filipe, J. Cordeiro, J. Cardoso (Eds.)
    • Liu F., Chen Y., Manderick B. Named entity recognition in biomedical literature: a comparison of support vector machines and conditional random fields. ICEIS (Selected Papers), vol. 12 of Lecture Notes in Business Information Processing 2007, 137-147. Springer, Berlin & Heidelberg, Germany. J. Filipe, J. Cordeiro, J. Cardoso (Eds.).
    • (2007) ICEIS (Selected Papers), of Lecture Notes in Business Information Processing , vol.12 , pp. 137-147
    • Liu, F.1    Chen, Y.2    Manderick, B.3
  • 12
    • 84870336493 scopus 로고    scopus 로고
    • A systematic review of named entity recognition in biomedical texts
    • Goulart R.R.V., de Lima V.L.S., Xavier C.C. A systematic review of named entity recognition in biomedical texts. J Braz Comput Soc 2011, 17(2):103-116.
    • (2011) J Braz Comput Soc , vol.17 , Issue.2 , pp. 103-116
    • Goulart, R.R.V.1    de Lima, V.L.S.2    Xavier, C.C.3
  • 13
    • 80053404897 scopus 로고    scopus 로고
    • Recognizing medication related entities in hospital discharge summaries using support vector machine
    • Association for Computational Linguistics, Stroudsburg, PA, USA, C.-R. Huang, D. Jurafsky (Eds.)
    • Doan S., Xu H. Recognizing medication related entities in hospital discharge summaries using support vector machine. Proceedings of the 23rd International Conference on Computational Linguistics: Posters COLING'10 2010, 259-266. Association for Computational Linguistics, Stroudsburg, PA, USA. C.-R. Huang, D. Jurafsky (Eds.).
    • (2010) Proceedings of the 23rd International Conference on Computational Linguistics: Posters COLING'10 , pp. 259-266
    • Doan, S.1    Xu, H.2
  • 19
  • 20
    • 62549162626 scopus 로고    scopus 로고
    • High-performance gene name normalization with GeNo
    • Wermter J., Tomanek K., Hahn U. High-performance gene name normalization with GeNo. Bioinformatics 2009, 25(6):815-821.
    • (2009) Bioinformatics , vol.25 , Issue.6 , pp. 815-821
    • Wermter, J.1    Tomanek, K.2    Hahn, U.3
  • 21
    • 77955287813 scopus 로고    scopus 로고
    • An overview of MetaMap: historical perspective and recent advances
    • Aronson A.R., Lang F.-M. An overview of MetaMap: historical perspective and recent advances. J Am Med Inform Assoc 2010, 17(3):229-236.
    • (2010) J Am Med Inform Assoc , vol.17 , Issue.3 , pp. 229-236
    • Aronson, A.R.1    Lang, F.-M.2
  • 23
    • 84874259397 scopus 로고    scopus 로고
    • Assessment of NER solutions against the first and second CALBC silver standard corpus
    • CEUR-WS.org, N. Collier, U. Hahn, D. Rebholz-Schuhmann, F. Rinaldi, S. Pyysalo (Eds.)
    • Rebholz-Schuhmann D., Jimeno-Yepes A., Li C., Kafkas S., Lewin I., Kang N., et al. Assessment of NER solutions against the first and second CALBC silver standard corpus. Semantic Mining in Biomedicine vol. 714 of CEUR Workshop Proceedings 2010, CEUR-WS.org. N. Collier, U. Hahn, D. Rebholz-Schuhmann, F. Rinaldi, S. Pyysalo (Eds.).
    • (2010) Semantic Mining in Biomedicine of CEUR Workshop Proceedings , vol.714
    • Rebholz-Schuhmann, D.1    Jimeno-Yepes, A.2    Li, C.3    Kafkas, S.4    Lewin, I.5    Kang, N.6
  • 24
    • 35748966977 scopus 로고    scopus 로고
    • Learning string similarity measures for gene/protein name dictionary look-up using logistic regression
    • Tsuruoka Y., McNaught J., Tsujii J., Ananiadou S. Learning string similarity measures for gene/protein name dictionary look-up using logistic regression. Bioinformatics 2007, 23(20):2768-2774.
    • (2007) Bioinformatics , vol.23 , Issue.20 , pp. 2768-2774
    • Tsuruoka, Y.1    McNaught, J.2    Tsujii, J.3    Ananiadou, S.4
  • 25
    • 34547852239 scopus 로고    scopus 로고
    • Identification of new drug classification terms in textual resources
    • Kolărik C., Hofmann-Apitius M., Zimmermann M., Fluck J. Identification of new drug classification terms in textual resources. Bioinformatics 2007, 23(13):i264-i272.
    • (2007) Bioinformatics , vol.23 , Issue.13 , pp. i264-i272
    • Kolărik, C.1    Hofmann-Apitius, M.2    Zimmermann, M.3    Fluck, J.4
  • 26
  • 27
    • 36749097784 scopus 로고    scopus 로고
    • Automated acquisition of disease drug knowledge from biomedical and clinical documents: an initial study
    • Chen E.S., Hripcsak G., Xu H., Markatou M., Friedman C. Automated acquisition of disease drug knowledge from biomedical and clinical documents: an initial study. J Am Med Inform Assoc 2008, 15(February):87-98.
    • (2008) J Am Med Inform Assoc , vol.15 , Issue.February , pp. 87-98
    • Chen, E.S.1    Hripcsak, G.2    Xu, H.3    Markatou, M.4    Friedman, C.5
  • 28
    • 37249067007 scopus 로고    scopus 로고
    • Automated extraction of information from the literature on chemical-CYP3A4 interactions
    • Feng C., Yamashita F., Hashida M. Automated extraction of information from the literature on chemical-CYP3A4 interactions. J Chem Inform Model 2007, 47(6):2449-2455.
    • (2007) J Chem Inform Model , vol.47 , Issue.6 , pp. 2449-2455
    • Feng, C.1    Yamashita, F.2    Hashida, M.3
  • 30
    • 0000110017 scopus 로고    scopus 로고
    • EDGAR: extraction of drugs, genes and relations from the biomedical literature
    • Available from: [accessed 15.04.15].
    • Rindflesch T., Tanabe L., Weinstein J.N., Hunter L. EDGAR: extraction of drugs, genes and relations from the biomedical literature. Pacific Symposium of Biocomputing 2000, 514-525. Available from: psb.stanford.edu/psb-online/proceedings/psb00/ [accessed 15.04.15].
    • (2000) Pacific Symposium of Biocomputing , pp. 514-525
    • Rindflesch, T.1    Tanabe, L.2    Weinstein, J.N.3    Hunter, L.4
  • 31
    • 85121124155 scopus 로고    scopus 로고
    • Automatic acquisition of huge training data for bio-medical named entity recognition
    • Association for Computational Linguistics, Stroudsburg, PA, USA, K.B. Cohen, D. Demner-Fushman, S. Ananiadou, J. Pestian, J. Tsujii, B. Webber (Eds.)
    • Usami Y., Cho H.-C., Okazaki N., Tsujii J. Automatic acquisition of huge training data for bio-medical named entity recognition. Proceedings of BioNLP 2011 Workshop BioNLP 2011 2011, Association for Computational Linguistics, Stroudsburg, PA, USA. K.B. Cohen, D. Demner-Fushman, S. Ananiadou, J. Pestian, J. Tsujii, B. Webber (Eds.).
    • (2011) Proceedings of BioNLP 2011 Workshop BioNLP 2011
    • Usami, Y.1    Cho, H.-C.2    Okazaki, N.3    Tsujii, J.4
  • 32
    • 84872974254 scopus 로고    scopus 로고
    • Bootstrapping and evaluating named entity recognition in the biomedical domain
    • Association for Computational Linguistics, New York, NY, K. Verspoor, K.B. Cohen, B. Goertzel, I. Mani (Eds.)
    • Vlachos A., Gasperin C. Bootstrapping and evaluating named entity recognition in the biomedical domain. Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology LNLBioNLP 2006 2006, 138-145. Association for Computational Linguistics, New York, NY. K. Verspoor, K.B. Cohen, B. Goertzel, I. Mani (Eds.).
    • (2006) Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology LNLBioNLP 2006 , pp. 138-145
    • Vlachos, A.1    Gasperin, C.2
  • 34
    • 84893473667 scopus 로고    scopus 로고
    • Weakly labeled corpora as silver standard for drug-drug and protein-protein interaction
    • European Language Resources Association (ELRA), Instanbul, Turkey, Available from: [accessed 15.04.15], S. Ananiadou, K. Cohen, D. Demner-Fushman, P. Thompson (Eds.)
    • Thomas P., Bobic T., Leser U., Hofmann-Apitius M., Klinger R. Weakly labeled corpora as silver standard for drug-drug and protein-protein interaction. Proceedings of the Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM) on Language Resources and Evaluation Conference (LREC) 2012, May, 63-70. European Language Resources Association (ELRA), Instanbul, Turkey, Available from: www.lrec-conf.org/proceedings/lrec2012/workshops/14.BioTxtM-Proceedings.pdf#page=70 [accessed 15.04.15]. S. Ananiadou, K. Cohen, D. Demner-Fushman, P. Thompson (Eds.).
    • (2012) Proceedings of the Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM) on Language Resources and Evaluation Conference (LREC) , pp. 63-70
    • Thomas, P.1    Bobic, T.2    Leser, U.3    Hofmann-Apitius, M.4    Klinger, R.5
  • 39
    • 84899066812 scopus 로고    scopus 로고
    • Accelerating the annotation of sparse named entities by dynamic sentence selection
    • Association for Computational Linguistics, Columbus, OH, USA, D. Demner-Fushman, S. Ananiadou, K.B. Cohen, J. Pestian, J. Tsujii, B. Webber (Eds.)
    • Tsuruoka Y., Tsujii J., Ananiadou S. Accelerating the annotation of sparse named entities by dynamic sentence selection. Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing BioNLP 2008 2008, June, 30-37. Association for Computational Linguistics, Columbus, OH, USA. D. Demner-Fushman, S. Ananiadou, K.B. Cohen, J. Pestian, J. Tsujii, B. Webber (Eds.).
    • (2008) Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing BioNLP 2008 , pp. 30-37
    • Tsuruoka, Y.1    Tsujii, J.2    Ananiadou, S.3
  • 40
    • 0002652285 scopus 로고    scopus 로고
    • A maximum entropy approach to natural language processing
    • Berger A.L., Pietra V.J.D., Pietra S.A.D. A maximum entropy approach to natural language processing. Comput Linguist 1996 Mar, 22:39-71.
    • (1996) Comput Linguist , vol.22 , pp. 39-71
    • Berger, A.L.1    Pietra, V.J.D.2    Pietra, S.A.D.3
  • 44
    • 79959655291 scopus 로고    scopus 로고
    • Using a shallow linguistic kernel for drug-drug interaction extraction
    • Segura-Bedmar I., Martínez P., De Pablo-Sánchez C. Using a shallow linguistic kernel for drug-drug interaction extraction. J Biomed Inform 2011, October, 44:789-804.
    • (2011) J Biomed Inform , vol.44 , pp. 789-804
    • Segura-Bedmar, I.1    Martínez, P.2    De Pablo-Sánchez, C.3
  • 45
    • 77952829690 scopus 로고    scopus 로고
    • Building a high quality sense inventory for improved abbreviation disambiguation
    • Okazaki N., Ananiadou S., Tsujii J. Building a high quality sense inventory for improved abbreviation disambiguation. Bioinformatics 2010, 26(9):1246-1253.
    • (2010) Bioinformatics , vol.26 , Issue.9 , pp. 1246-1253
    • Okazaki, N.1    Ananiadou, S.2    Tsujii, J.3
  • 47
    • 0001882615 scopus 로고
    • Self-organized language modeling for speech recognition
    • Morgan Kaufmann, San Francisco, CA, USA, A. Waibel, K.-F. Lee (Eds.)
    • Jelinek F., Merialdo B., Roukos S., Strauss M.J. Self-organized language modeling for speech recognition. Readings in Speech Recognition 1990, 450-506. Morgan Kaufmann, San Francisco, CA, USA. A. Waibel, K.-F. Lee (Eds.).
    • (1990) Readings in Speech Recognition , pp. 450-506
    • Jelinek, F.1    Merialdo, B.2    Roukos, S.3    Strauss, M.J.4
  • 48
    • 0023312404 scopus 로고
    • Estimation of probabilities from sparse data for the language model component of a speech recognizer
    • Katz S.M. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Trans Acoust Speech Signal Process 1987, ASSP-35(March):400-401.
    • (1987) IEEE Trans Acoust Speech Signal Process , vol.ASSP-35 , Issue.MARCH , pp. 400-401
    • Katz, S.M.1
  • 49
    • 0009421202 scopus 로고
    • Prepositional phrase attachment through a backed-off model
    • Association for Computational Linguistics, Cambridge, MA, USA, D. Yarowsky, K. Church (Eds.)
    • Collins M., Brooks J. Prepositional phrase attachment through a backed-off model. Proceedings of the Third Workshop on Very Large Corpora 1995, 27-38. Association for Computational Linguistics, Cambridge, MA, USA. D. Yarowsky, K. Church (Eds.).
    • (1995) Proceedings of the Third Workshop on Very Large Corpora , pp. 27-38
    • Collins, M.1    Brooks, J.2
  • 51
    • 0032632354 scopus 로고    scopus 로고
    • An algorithm that learns what's in a name
    • Bikel D.M., Schwartz R., Weischedel R.M. An algorithm that learns what's in a name. Mach Learn 1999, 34(February):211-231.
    • (1999) Mach Learn , vol.34 , Issue.FEBRUARY , pp. 211-231
    • Bikel, D.M.1    Schwartz, R.2    Weischedel, R.M.3
  • 55
    • 84861140628 scopus 로고    scopus 로고
    • Modelling compositional data using dirichlet regression models
    • Hijazi R.A., Jernigan R.W. Modelling compositional data using dirichlet regression models. J Appl Probab Stat 2009, 4(1):77-91.
    • (2009) J Appl Probab Stat , vol.4 , Issue.1 , pp. 77-91
    • Hijazi, R.A.1    Jernigan, R.W.2
  • 57
    • 22044453925 scopus 로고    scopus 로고
    • The combination of text classifiers using reliability indicators
    • Bennett P.N., Dumais S.T., Horvitz E. The combination of text classifiers using reliability indicators. Inform Retr 2005, 8(January):67-100.
    • (2005) Inform Retr , vol.8 , Issue.JANUARY , pp. 67-100
    • Bennett, P.N.1    Dumais, S.T.2    Horvitz, E.3
  • 59
    • 0003012849 scopus 로고    scopus 로고
    • Combining multiple learning strategies for effective cross-validation
    • The International Machine Learning Society, Madison, WI, USA, J. Furnkranz, T. Joachims (Eds.)
    • Yang Y., Ault T., Pierce T. Combining multiple learning strategies for effective cross-validation. International Conference on Machine Learning 2000, 1167-1174. The International Machine Learning Society, Madison, WI, USA. J. Furnkranz, T. Joachims (Eds.).
    • (2000) International Conference on Machine Learning , pp. 1167-1174
    • Yang, Y.1    Ault, T.2    Pierce, T.3
  • 60
    • 38349036472 scopus 로고    scopus 로고
    • Combining rough decisions for intelligent text mining using Dempster's rule
    • Bi Y., Mcclean S., Anderson T. Combining rough decisions for intelligent text mining using Dempster's rule. Artif Intell Rev 2006, 26(November):191-209.
    • (2006) Artif Intell Rev , vol.26 , Issue.NOVEMBER , pp. 191-209
    • Bi, Y.1    Mcclean, S.2    Anderson, T.3
  • 61
    • 84885612850 scopus 로고    scopus 로고
    • Boosting performance of bio-entity recognition by combining results from multiple systems
    • ACM Press, New York, NY, USA
    • Si L. Boosting performance of bio-entity recognition by combining results from multiple systems. Proceedings of the 5th International Workshop on Bioinformatics, BIOKDD 2505 2005, 76-83. ACM Press, New York, NY, USA.
    • (2005) Proceedings of the 5th International Workshop on Bioinformatics, BIOKDD 2505 , pp. 76-83
    • Si, L.1
  • 62
    • 65549161963 scopus 로고    scopus 로고
    • Published via lulu.com and freely available at: With contributions by J.R. Koza [accessed 15.04.15]
    • Poli R., Langdon W.B., McPhee N.F. A Field Guide to Genetic Programming 2008, Published via lulu.com and freely available at: www.gp-field-guide.org.uk, With contributions by J.R. Koza [accessed 15.04.15].
    • (2008) A Field Guide to Genetic Programming
    • Poli, R.1    Langdon, W.B.2    McPhee, N.F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.