메뉴 건너뛰기




Volumn 64, Issue , 2016, Pages 1-9

Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases

Author keywords

Biomedical named entity recognition; Fuzzy matching; Machine learning; Stacked ensemble; Text mining

Indexed keywords

ARTIFICIAL INTELLIGENCE; CHARACTER RECOGNITION; DATA MINING; LEARNING SYSTEMS; RANDOM PROCESSES; SEMANTICS; TEXT PROCESSING;

EID: 84988640362     PISSN: 15320464     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jbi.2016.09.009     Document Type: Article
Times cited : (59)

References (46)
  • 2
    • 17244380380 scopus 로고    scopus 로고
    • A survey of current work in biomedical text mining
    • [2] Cohen, A.M., Hersh, W.R., A survey of current work in biomedical text mining. Briefings Bioinform. 6:1 (2005), 57–71.
    • (2005) Briefings Bioinform. , vol.6 , Issue.1 , pp. 57-71
    • Cohen, A.M.1    Hersh, W.R.2
  • 3
    • 27744524253 scopus 로고    scopus 로고
    • A maximum entropy approach to biomedical named entity recognition
    • [3] Lin, Y.F., Tsai, T.H., Chou, W.C., Wu, K.P., Sung, T.Y., Hsu, W.L., A maximum entropy approach to biomedical named entity recognition. BIOKDD, 2004, 56–61.
    • (2004) BIOKDD , pp. 56-61
    • Lin, Y.F.1    Tsai, T.H.2    Chou, W.C.3    Wu, K.P.4    Sung, T.Y.5    Hsu, W.L.6
  • 5
    • 25444533246 scopus 로고    scopus 로고
    • Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
    • [5] J. Lafferty, A. McCallum, F.C. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001.
    • (2001)
    • Lafferty, J.1    McCallum, A.2    Pereira, F.C.3
  • 6
    • 84907582594 scopus 로고    scopus 로고
    • Using empirically constructed lexical resources for named entity recognition
    • [6] Jonnalagadda, S., Cohen, T., Wu, S., Liu, H., Gonzalez, G., Using empirically constructed lexical resources for named entity recognition. Biomed. Inform. Insights, 6(Suppl. 1), 2013, 17.
    • (2013) Biomed. Inform. Insights , vol.6 , pp. 17
    • Jonnalagadda, S.1    Cohen, T.2    Wu, S.3    Liu, H.4    Gonzalez, G.5
  • 7
    • 0002670150 scopus 로고    scopus 로고
    • Extracting the names of genes and gene products with a hidden Markov model
    • Association for Computational Linguistics
    • [7] Collier, N., Nobata, C., Tsujii, J.I., Extracting the names of genes and gene products with a hidden Markov model. Proceedings of the 18th Conference on Computational Linguistics, vol. 1, 2000, Association for Computational Linguistics, 201–207.
    • (2000) Proceedings of the 18th Conference on Computational Linguistics , vol.1 , pp. 201-207
    • Collier, N.1    Nobata, C.2    Tsujii, J.I.3
  • 8
    • 0000747663 scopus 로고    scopus 로고
    • Maximum entropy Markov models for information extraction and segmentation
    • [8] McCallum, A., Freitag, D., Pereira, F.C., Maximum entropy Markov models for information extraction and segmentation. ICML, vol. 17, 2000, 591–598.
    • (2000) ICML , vol.17 , pp. 591-598
    • McCallum, A.1    Freitag, D.2    Pereira, F.C.3
  • 9
    • 77952310847 scopus 로고    scopus 로고
    • Moara: a Java library for extracting and normalizing gene and protein mentions
    • [9] Neves, M.L., Carazo, J.M., Pascual-Montano, A., Moara: a Java library for extracting and normalizing gene and protein mentions. BMC Bioinform., 11(1), 2010, 157.
    • (2010) BMC Bioinform. , vol.11 , Issue.1 , pp. 157
    • Neves, M.L.1    Carazo, J.M.2    Pascual-Montano, A.3
  • 10
    • 8444226720 scopus 로고    scopus 로고
    • Term identification in the biomedical literature
    • [10] Krauthammer, M., Nenadic, G., Term identification in the biomedical literature. J. Biomed. Inform. 37:6 (2004), 512–526.
    • (2004) J. Biomed. Inform. , vol.37 , Issue.6 , pp. 512-526
    • Krauthammer, M.1    Nenadic, G.2
  • 11
    • 84884469671 scopus 로고    scopus 로고
    • A modular framework for biomedical concept recognition
    • [11] Campos, D., Matos, S., Oliveira, J.L., A modular framework for biomedical concept recognition. BMC Bioinform., 14(1), 2013, 281.
    • (2013) BMC Bioinform. , vol.14 , Issue.1 , pp. 281
    • Campos, D.1    Matos, S.2    Oliveira, J.L.3
  • 12
    • 84925619870 scopus 로고    scopus 로고
    • TmChem: a high performance approach for chemical named entity recognition and normalization
    • [12] Leaman, R., Wei, C.H., Lu, Z., TmChem: a high performance approach for chemical named entity recognition and normalization. J. Cheminform., 7(Suppl. 1), 2015.
    • (2015) J. Cheminform. , vol.7
    • Leaman, R.1    Wei, C.H.2    Lu, Z.3
  • 13
    • 84926455019 scopus 로고    scopus 로고
    • Disease named entity recognition by machine learning using semantic type of metathesaurus
    • [13] Huang, Z., Hu, X., Disease named entity recognition by machine learning using semantic type of metathesaurus. Int. J. Mach. Learn. Comput. 3 (2013), 494–498.
    • (2013) Int. J. Mach. Learn. Comput. , vol.3 , pp. 494-498
    • Huang, Z.1    Hu, X.2
  • 14
    • 84943586410 scopus 로고    scopus 로고
    • Boosting drug named entity recognition using an aggregate classifier
    • [14] Korkontzelos, I., Piliouras, D., Dowsey, A.W., Ananiadou, S., Boosting drug named entity recognition using an aggregate classifier. Artif. Intell. Med. 65:2 (2015), 145–153.
    • (2015) Artif. Intell. Med. , vol.65 , Issue.2 , pp. 145-153
    • Korkontzelos, I.1    Piliouras, D.2    Dowsey, A.W.3    Ananiadou, S.4
  • 15
    • 84891401477 scopus 로고    scopus 로고
    • Biomedical named entity extraction: some issues of corpus compatibilities
    • [15] Ekbal, Asif, Saha, Sriparna, Sikdar, Utpal Kumar, Biomedical named entity extraction: some issues of corpus compatibilities. SpringerPlus, 2(1), 2013, 601.
    • (2013) SpringerPlus , vol.2 , Issue.1 , pp. 601
    • Ekbal, A.1    Saha, S.2    Sikdar, U.K.3
  • 16
    • 85007016947 scopus 로고    scopus 로고
    • Disease mention recognition using soft-margin SVM
    • [16] Li, Gang, Disease mention recognition using soft-margin SVM. Training 593 (2012), 5–148.
    • (2012) Training , vol.593 , pp. 5-148
    • Li, G.1
  • 18
    • 0022030599 scopus 로고
    • Efficient randomized pattern-matching algorithms
    • [18] Karp, R.M., Rabin, M.O., Efficient randomized pattern-matching algorithms. IBM J. Res. Dev. 31:2 (1987), 249–260.
    • (1987) IBM J. Res. Dev. , vol.31 , Issue.2 , pp. 249-260
    • Karp, R.M.1    Rabin, M.O.2
  • 19
    • 0017547820 scopus 로고
    • A fast string searching algorithm
    • [19] Boyer, R.S., Moore, J.S., A fast string searching algorithm. Commun. ACM 20:10 (1977), 762–772.
    • (1977) Commun. ACM , vol.20 , Issue.10 , pp. 762-772
    • Boyer, R.S.1    Moore, J.S.2
  • 20
    • 84895437465 scopus 로고    scopus 로고
    • NCBI disease corpus: a resource for disease name recognition and concept normalization
    • [20] Doğan, R.I., Leaman, R., Lu, Z., NCBI disease corpus: a resource for disease name recognition and concept normalization. J. Biomed. Inform. 47 (2014), 1–10.
    • (2014) J. Biomed. Inform. , vol.47 , pp. 1-10
    • Doğan, R.I.1    Leaman, R.2    Lu, Z.3
  • 23
    • 0033927616 scopus 로고    scopus 로고
    • Medical subject headings (MeSH)
    • [23] Lipscomb, C.E., Medical subject headings (MeSH). Bull. Med. Libr. Assoc., 88(3), 2000, 265.
    • (2000) Bull. Med. Libr. Assoc. , vol.88 , Issue.3 , pp. 265
    • Lipscomb, C.E.1
  • 25
    • 0345863927 scopus 로고    scopus 로고
    • The unified medical language system (UMLS): integrating biomedical terminology
    • [25] Bodenreider, O., The unified medical language system (UMLS): integrating biomedical terminology. Nucl. Acids Res. 32:Suppl. 1 (2004), D267–D270.
    • (2004) Nucl. Acids Res. , vol.32 , pp. D267-D270
    • Bodenreider, O.1
  • 27
    • 33744793443 scopus 로고    scopus 로고
    • Evaluation of the content coverage of SNOMED CT: ability of SNOMED clinical terms to represent clinical problem lists
    • Elsevier
    • [27] Elkin, P.L., Brown, S.H., Husser, C.S., Bauer, B.A., Wahner-Roedler, D., Rosenbloom, S.T., Speroff, T., Evaluation of the content coverage of SNOMED CT: ability of SNOMED clinical terms to represent clinical problem lists. Mayo Clinic Proceedings, vol. 81, no. 6, 2006, Elsevier, 741–748.
    • (2006) Mayo Clinic Proceedings , vol.vol. 81 no. 6 , pp. 741-748
    • Elkin, P.L.1    Brown, S.H.2    Husser, C.S.3    Bauer, B.A.4    Wahner-Roedler, D.5    Rosenbloom, S.T.6    Speroff, T.7
  • 28
    • 84862253166 scopus 로고    scopus 로고
    • MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database
    • [28] Davis, A.P., Wiegers, T.C., Rosenstein, M.C., Mattingly, C.J., MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database. Database, 2012, bar065.
    • (2012) Database , pp. bar065
    • Davis, A.P.1    Wiegers, T.C.2    Rosenstein, M.C.3    Mattingly, C.J.4
  • 29
    • 13444266370 scopus 로고    scopus 로고
    • Online Mendelian Inheritance in Man (OMIM) a knowledgebase of human genes and genetic disorders
    • [29] Hamosh, Ada, et al. Online Mendelian Inheritance in Man (OMIM) a knowledgebase of human genes and genetic disorders. Nucl. Acids Res. 33:Suppl. 1 (2005), D514–D517.
    • (2005) Nucl. Acids Res. , vol.33 , pp. D514-D517
    • Hamosh, A.1
  • 30
    • 47749122510 scopus 로고    scopus 로고
    • A survey of named entity recognition and classification
    • [30] Nadeau, David, Sekine, Satoshi, A survey of named entity recognition and classification. Lingvisticae Investigationes 30:1 (2007), 3–26.
    • (2007) Lingvisticae Investigationes , vol.30 , Issue.1 , pp. 3-26
    • Nadeau, D.1    Sekine, S.2
  • 31
    • 25444533246 scopus 로고    scopus 로고
    • Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
    • [31] John Lafferty, Andrew McCallum, Fernando C.N. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001.
    • (2001)
    • Lafferty, J.1    McCallum, A.2    Pereira, F.C.N.3
  • 32
    • 84946138625 scopus 로고    scopus 로고
    • Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations
    • [32] Munkhdalai, Tsendsuren, et al. Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations. J. Cheminform., 7(S-1), 2015, S9.
    • (2015) J. Cheminform. , vol.7 , Issue.S-1 , pp. S9
    • Munkhdalai, T.1
  • 34
    • 84857855190 scopus 로고    scopus 로고
    • Random search for hyper-parameter optimization
    • [34] Bergstra, James, Bengio, Yoshua, Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13:1 (2012), 281–305.
    • (2012) J. Mach. Learn. Res. , vol.13 , Issue.1 , pp. 281-305
    • Bergstra, J.1    Bengio, Y.2
  • 35
    • 46649086341 scopus 로고    scopus 로고
    • Exploiting the contextual cues for bio-entity name recognition in biomedical literature
    • [35] Yang, Z., Lin, H., Li, Y., Exploiting the contextual cues for bio-entity name recognition in biomedical literature. J. Biomed. Inform. 41:4 (2008), 580–587.
    • (2008) J. Biomed. Inform. , vol.41 , Issue.4 , pp. 580-587
    • Yang, Z.1    Lin, H.2    Li, Y.3
  • 36
    • 79960832442 scopus 로고    scopus 로고
    • Weighted vote-based classifier ensemble for named entity recognition: a genetic algorithm-based approach
    • [36] Ekbal, A., Saha, S., Weighted vote-based classifier ensemble for named entity recognition: a genetic algorithm-based approach. ACM Trans. Asian Lang. Inform. Process. (TALIP), 10(2), 2011, 9.
    • (2011) ACM Trans. Asian Lang. Inform. Process. (TALIP) , vol.10 , Issue.2 , pp. 9
    • Ekbal, A.1    Saha, S.2
  • 37
    • 52149087660 scopus 로고    scopus 로고
    • Identifying named entities in biomedical text based on stacked generalization
    • in: 7th World Congress on Intelligent Control and Automation. WCICA 2008, IEEE.
    • [37] H. Wang, T. Zhao, Identifying named entities in biomedical text based on stacked generalization, in: 7th World Congress on Intelligent Control and Automation. WCICA 2008, IEEE, 2008, pp. 160–164.
    • (2008) , pp. 160-164
    • Wang, H.1    Zhao, T.2
  • 38
    • 0018465664 scopus 로고
    • A composite classifier system design: concepts and methodology
    • [38] Dasarathy, B.V., Sheela, B.V., A composite classifier system design: concepts and methodology. Proc. IEEE 67:5 (1979), 708–713.
    • (1979) Proc. IEEE , vol.67 , Issue.5 , pp. 708-713
    • Dasarathy, B.V.1    Sheela, B.V.2
  • 39
    • 0036567392 scopus 로고    scopus 로고
    • Ensembling neural networks: many could be better than all
    • [39] Zhou, Z.H., Wu, J., Tang, W., Ensembling neural networks: many could be better than all. Artif. Intell. 137:1 (2002), 239–263.
    • (2002) Artif. Intell. , vol.137 , Issue.1 , pp. 239-263
    • Zhou, Z.H.1    Wu, J.2    Tang, W.3
  • 40
    • 33947307025 scopus 로고    scopus 로고
    • Recognition of protein/gene names from text using an ensemble of classifiers
    • [40] Zhou, G., Shen, D., Zhang, J., Su, J., Tan, S., Recognition of protein/gene names from text using an ensemble of classifiers. BMC Bioinform., 6(1), 2005, 1.
    • (2005) BMC Bioinform. , vol.6 , Issue.1 , pp. 1
    • Zhou, G.1    Shen, D.2    Zhang, J.3    Su, J.4    Tan, S.5
  • 41
    • 0026692226 scopus 로고
    • Stacked generalization
    • [41] Wolpert, D.H., Stacked generalization. Neural Networks 5:2 (1992), 241–259.
    • (1992) Neural Networks , vol.5 , Issue.2 , pp. 241-259
    • Wolpert, D.H.1
  • 42
    • 80455122674 scopus 로고
    • The Problem of Linguistic Approximation in System Analysis
    • [42] P.P. Bonissone, The Problem of Linguistic Approximation in System Analysis, 1979.
    • (1979)
    • Bonissone, P.P.1
  • 43
    • 0018493536 scopus 로고
    • A general approach to linguistic approximation
    • [43] Eshragh, F., Mamdani, E.H., A general approach to linguistic approximation. Int. J. Man Mach. Stud. 11:4 (1979), 501–519.
    • (1979) Int. J. Man Mach. Stud. , vol.11 , Issue.4 , pp. 501-519
    • Eshragh, F.1    Mamdani, E.H.2
  • 44
    • 0347556783 scopus 로고
    • Quantitative analysis with linguistic values
    • [44] Wenstøp, F., Quantitative analysis with linguistic values. Fuzzy Sets Syst. 4:2 (1980), 99–115.
    • (1980) Fuzzy Sets Syst. , vol.4 , Issue.2 , pp. 99-115
    • Wenstøp, F.1
  • 45
    • 0000654487 scopus 로고
    • Measures of similarity among fuzzy concepts: a comparative analysis
    • [45] Zwick, R., Carlstein, E., Budescu, D.V., Measures of similarity among fuzzy concepts: a comparative analysis. Int. J. Approx. Reason. 1:2 (1987), 221–242.
    • (1987) Int. J. Approx. Reason. , vol.1 , Issue.2 , pp. 221-242
    • Zwick, R.1    Carlstein, E.2    Budescu, D.V.3
  • 46
    • 84883572716 scopus 로고    scopus 로고
    • PubTator: a web-based text mining tool for assisting biocuration
    • [46] Wei, Chih-Hsuan, Kao, Hung-Yu, Lu, Zhiyong, PubTator: a web-based text mining tool for assisting biocuration. Nucl. Acids Res., 2013, gkt441.
    • (2013) Nucl. Acids Res. , pp. gkt441
    • Wei, C.-H.1    Kao, H.-Y.2    Lu, Z.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.