SCOPUS 정보 검색 플랫폼

Journal of Biomedical Informatics

Volumn 64, Issue , 2016, Pages 1-9

Stacked ensemble combined with fuzzy matching for biomedical named entity recognition of diseases

(4) Bhasuran, Balu a Murugesan, Gurusamy a Abdulkadhar, Sabenabanu a Natarajan, Jeyakumar a

a BHARATHIAR UNIVERSITY (India)

Author keywords

Biomedical named entity recognition; Fuzzy matching; Machine learning; Stacked ensemble; Text mining

Indexed keywords

ARTIFICIAL INTELLIGENCE; CHARACTER RECOGNITION; DATA MINING; LEARNING SYSTEMS; RANDOM PROCESSES; SEMANTICS; TEXT PROCESSING;

BIOMEDICAL NAMED ENTITY RECOGNITION; BIOMEDICAL TEXT MININGS; CLASSIFICATION METHODS; CONDITIONAL RANDOM FIELD; FUZZY MATCHING; STACKED ENSEMBLE; STACKED GENERALIZATION; TEXT MINING;

NATURAL LANGUAGE PROCESSING SYSTEMS;

ARTICLE; BIOMEDICAL NAMED ENTITY RECOGNITION; CLASSIFICATION; CONDITIONAL RANDOM FIELD; CONTROLLED STUDY; DATA MINING; FUZZY SYSTEM; MACHINE LEARNING; MEASUREMENT PRECISION; PRIORITY JOURNAL; STACKED ENSEMBLE; TEXT MINING; ALGORITHM; BIOLOGY; DISEASES; FUZZY LOGIC; GENE; HUMAN;

PROTEIN;

ALGORITHMS; CLASSIFICATION; COMPUTATIONAL BIOLOGY; DATA MINING; DISEASE; FUZZY LOGIC; GENES; HUMANS; PROTEINS;

EID: 84988640362 PISSN: 15320464 EISSN: None Source Type: Journal
DOI: 10.1016/j.jbi.2016.09.009 Document Type: Article

Times cited : (59)

References (46)

1
- 84875635420
- Biomedical text mining and its applications in cancer research
- [1] Zhu, F., Patumcharoenpol, P., Zhang, C., Yang, Y., Chan, J., Meechai, A., Shen, B., Biomedical text mining and its applications in cancer research. J. Biomed. Inform. 46:2 (2013), 200–211.
- (2013) J. Biomed. Inform. , vol.46 , Issue.2 , pp. 200-211
- Zhu, F.¹ Patumcharoenpol, P.² Zhang, C.³ Yang, Y.⁴ Chan, J.⁵ Meechai, A.⁶ Shen, B.⁷

2
- 17244380380
- A survey of current work in biomedical text mining
- [2] Cohen, A.M., Hersh, W.R., A survey of current work in biomedical text mining. Briefings Bioinform. 6:1 (2005), 57–71.
- (2005) Briefings Bioinform. , vol.6 , Issue.1 , pp. 57-71
- Cohen, A.M.¹ Hersh, W.R.²

3
- 27744524253
- A maximum entropy approach to biomedical named entity recognition
- [3] Lin, Y.F., Tsai, T.H., Chou, W.C., Wu, K.P., Sung, T.Y., Hsu, W.L., A maximum entropy approach to biomedical named entity recognition. BIOKDD, 2004, 56–61.
- (2004) BIOKDD , pp. 56-61
- Lin, Y.F.¹ Tsai, T.H.² Chou, W.C.³ Wu, K.P.⁴ Sung, T.Y.⁵ Hsu, W.L.⁶

4
- 44649165797
- Assessment of disease named entity recognition on a corpus of annotated sentences
- [4] Jimeno, A., Jimenez-Ruiz, E., Lee, V., Gaudan, S., Berlanga, R., Rebholz-Schuhmann, D., Assessment of disease named entity recognition on a corpus of annotated sentences. BMC Bioinform., 9(Suppl. 3), 2008, S3.
- (2008) BMC Bioinform. , vol.9 , pp. S3
- Jimeno, A.¹ Jimenez-Ruiz, E.² Lee, V.³ Gaudan, S.⁴ Berlanga, R.⁵ Rebholz-Schuhmann, D.⁶

5
- 25444533246
- Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
- [5] J. Lafferty, A. McCallum, F.C. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001.
- (2001)
- Lafferty, J.¹ McCallum, A.² Pereira, F.C.³

6
- 84907582594
- Using empirically constructed lexical resources for named entity recognition
- [6] Jonnalagadda, S., Cohen, T., Wu, S., Liu, H., Gonzalez, G., Using empirically constructed lexical resources for named entity recognition. Biomed. Inform. Insights, 6(Suppl. 1), 2013, 17.
- (2013) Biomed. Inform. Insights , vol.6 , pp. 17
- Jonnalagadda, S.¹ Cohen, T.² Wu, S.³ Liu, H.⁴ Gonzalez, G.⁵

7
- 0002670150
- Extracting the names of genes and gene products with a hidden Markov model
- Association for Computational Linguistics
- [7] Collier, N., Nobata, C., Tsujii, J.I., Extracting the names of genes and gene products with a hidden Markov model. Proceedings of the 18th Conference on Computational Linguistics, vol. 1, 2000, Association for Computational Linguistics, 201–207.
- (2000) Proceedings of the 18th Conference on Computational Linguistics , vol.1 , pp. 201-207
- Collier, N.¹ Nobata, C.² Tsujii, J.I.³

8
- 0000747663
- Maximum entropy Markov models for information extraction and segmentation
- [8] McCallum, A., Freitag, D., Pereira, F.C., Maximum entropy Markov models for information extraction and segmentation. ICML, vol. 17, 2000, 591–598.
- (2000) ICML , vol.17 , pp. 591-598
- McCallum, A.¹ Freitag, D.² Pereira, F.C.³

9
- 77952310847
- Moara: a Java library for extracting and normalizing gene and protein mentions
- [9] Neves, M.L., Carazo, J.M., Pascual-Montano, A., Moara: a Java library for extracting and normalizing gene and protein mentions. BMC Bioinform., 11(1), 2010, 157.
- (2010) BMC Bioinform. , vol.11 , Issue.1 , pp. 157
- Neves, M.L.¹ Carazo, J.M.² Pascual-Montano, A.³

10
- 8444226720
- Term identification in the biomedical literature
- [10] Krauthammer, M., Nenadic, G., Term identification in the biomedical literature. J. Biomed. Inform. 37:6 (2004), 512–526.
- (2004) J. Biomed. Inform. , vol.37 , Issue.6 , pp. 512-526
- Krauthammer, M.¹ Nenadic, G.²

11
- 84884469671
- A modular framework for biomedical concept recognition
- [11] Campos, D., Matos, S., Oliveira, J.L., A modular framework for biomedical concept recognition. BMC Bioinform., 14(1), 2013, 281.
- (2013) BMC Bioinform. , vol.14 , Issue.1 , pp. 281
- Campos, D.¹ Matos, S.² Oliveira, J.L.³

12
- 84925619870
- TmChem: a high performance approach for chemical named entity recognition and normalization
- [12] Leaman, R., Wei, C.H., Lu, Z., TmChem: a high performance approach for chemical named entity recognition and normalization. J. Cheminform., 7(Suppl. 1), 2015.
- (2015) J. Cheminform. , vol.7
- Leaman, R.¹ Wei, C.H.² Lu, Z.³

13
- 84926455019
- Disease named entity recognition by machine learning using semantic type of metathesaurus
- [13] Huang, Z., Hu, X., Disease named entity recognition by machine learning using semantic type of metathesaurus. Int. J. Mach. Learn. Comput. 3 (2013), 494–498.
- (2013) Int. J. Mach. Learn. Comput. , vol.3 , pp. 494-498
- Huang, Z.¹ Hu, X.²

14
- 84943586410
- Boosting drug named entity recognition using an aggregate classifier
- [14] Korkontzelos, I., Piliouras, D., Dowsey, A.W., Ananiadou, S., Boosting drug named entity recognition using an aggregate classifier. Artif. Intell. Med. 65:2 (2015), 145–153.
- (2015) Artif. Intell. Med. , vol.65 , Issue.2 , pp. 145-153
- Korkontzelos, I.¹ Piliouras, D.² Dowsey, A.W.³ Ananiadou, S.⁴

15
- 84891401477
- Biomedical named entity extraction: some issues of corpus compatibilities
- [15] Ekbal, Asif, Saha, Sriparna, Sikdar, Utpal Kumar, Biomedical named entity extraction: some issues of corpus compatibilities. SpringerPlus, 2(1), 2013, 601.
- (2013) SpringerPlus , vol.2 , Issue.1 , pp. 601
- Ekbal, A.¹ Saha, S.² Sikdar, U.K.³

16
- 85007016947
- Disease mention recognition using soft-margin SVM
- [16] Li, Gang, Disease mention recognition using soft-margin SVM. Training 593 (2012), 5–148.
- (2012) Training , vol.593 , pp. 5-148
- Li, G.¹

17
- 84875575941
- An improved corpus of disease mentions in PubMed citations
- Association for Computational Linguistics
- [17] Doğan, RezartaIslamaj, Lu, Zhiyong, An improved corpus of disease mentions in PubMed citations. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, 2012, Association for Computational Linguistics.
- (2012) Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
- Doğan, R.¹ Lu, Z.²

18
- 0022030599
- Efficient randomized pattern-matching algorithms
- [18] Karp, R.M., Rabin, M.O., Efficient randomized pattern-matching algorithms. IBM J. Res. Dev. 31:2 (1987), 249–260.
- (1987) IBM J. Res. Dev. , vol.31 , Issue.2 , pp. 249-260
- Karp, R.M.¹ Rabin, M.O.²

19
- 0017547820
- A fast string searching algorithm
- [19] Boyer, R.S., Moore, J.S., A fast string searching algorithm. Commun. ACM 20:10 (1977), 762–772.
- (1977) Commun. ACM , vol.20 , Issue.10 , pp. 762-772
- Boyer, R.S.¹ Moore, J.S.²

20
- 84895437465
- NCBI disease corpus: a resource for disease name recognition and concept normalization
- [20] Doğan, R.I., Leaman, R., Lu, Z., NCBI disease corpus: a resource for disease name recognition and concept normalization. J. Biomed. Inform. 47 (2014), 1–10.
- (2014) J. Biomed. Inform. , vol.47 , pp. 1-10
- Doğan, R.I.¹ Leaman, R.² Lu, Z.³

21
- 84949317410
- Overview of the BioCreative V chemical disease relation (CDR) task
- [21] Wei, C.H., Peng, Y., Leaman, R., Davis, A.P., Mattingly, C.J., Li, J., Wiegers, T.C., Lu, Z., Overview of the BioCreative V chemical disease relation (CDR) task. Proceedings of the Fifth BioCreative Challenge Evaluation Workshop, Sevilla, Spain, 2015.
- (2015) Proceedings of the Fifth BioCreative Challenge Evaluation Workshop, Sevilla, Spain
- Wei, C.H.¹ Peng, Y.² Leaman, R.³ Davis, A.P.⁴ Mattingly, C.J.⁵ Li, J.⁶ Wiegers, T.C.⁷ Lu, Z.⁸

22
- 85122620960
- Enriching the knowledge sources used in a maximum entropy part-of-speech tagger
- Association for Computational Linguistics
- [22] Toutanova, K., Manning, C.D., Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. Proceedings of the 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora: Held in Conjunction with the 38th Annual Meeting of the Association for Computational Linguistics, vol. 13, 2000, Association for Computational Linguistics, 63–70.
- (2000) Proceedings of the 2000 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora: Held in Conjunction with the 38th Annual Meeting of the Association for Computational Linguistics , vol.13 , pp. 63-70
- Toutanova, K.¹ Manning, C.D.²

23
- 0033927616
- Medical subject headings (MeSH)
- [23] Lipscomb, C.E., Medical subject headings (MeSH). Bull. Med. Libr. Assoc., 88(3), 2000, 265.
- (2000) Bull. Med. Libr. Assoc. , vol.88 , Issue.3 , pp. 265
- Lipscomb, C.E.¹

24
- 0036088113
- PharmGKB: the pharmacogenetics knowledge base
- [24] Hewett, M., Oliver, D.E., Rubin, D.L., Easton, K.L., Stuart, J.M., Altman, R.B., Klein, T.E., PharmGKB: the pharmacogenetics knowledge base. Nucl. Acids Res. 30:1 (2002), 163–165.
- (2002) Nucl. Acids Res. , vol.30 , Issue.1 , pp. 163-165
- Hewett, M.¹ Oliver, D.E.² Rubin, D.L.³ Easton, K.L.⁴ Stuart, J.M.⁵ Altman, R.B.⁶ Klein, T.E.⁷

25
- 0345863927
- The unified medical language system (UMLS): integrating biomedical terminology
- [25] Bodenreider, O., The unified medical language system (UMLS): integrating biomedical terminology. Nucl. Acids Res. 32:Suppl. 1 (2004), D267–D270.
- (2004) Nucl. Acids Res. , vol.32 , pp. D267-D270
- Bodenreider, O.¹

26
- 66349110163
- Annotating the human genome with Disease Ontology
- [26] Osborne, J.D., Flatow, J., Holko, M., Lin, S.M., Kibbe, W.A., Zhu, L.J., Danila, M.I., Feng, G., Chisholm, R.L., Annotating the human genome with Disease Ontology. BMC Genom., 10(Suppl. 1), 2009, S6.
- (2009) BMC Genom. , vol.10 , pp. S6
- Osborne, J.D.¹ Flatow, J.² Holko, M.³ Lin, S.M.⁴ Kibbe, W.A.⁵ Zhu, L.J.⁶ Danila, M.I.⁷ Feng, G.⁸ Chisholm, R.L.⁹

27
- 33744793443
- Evaluation of the content coverage of SNOMED CT: ability of SNOMED clinical terms to represent clinical problem lists
- Elsevier
- [27] Elkin, P.L., Brown, S.H., Husser, C.S., Bauer, B.A., Wahner-Roedler, D., Rosenbloom, S.T., Speroff, T., Evaluation of the content coverage of SNOMED CT: ability of SNOMED clinical terms to represent clinical problem lists. Mayo Clinic Proceedings, vol. 81, no. 6, 2006, Elsevier, 741–748.
- (2006) Mayo Clinic Proceedings , vol.vol. 81 no. 6 , pp. 741-748
- Elkin, P.L.¹ Brown, S.H.² Husser, C.S.³ Bauer, B.A.⁴ Wahner-Roedler, D.⁵ Rosenbloom, S.T.⁶ Speroff, T.⁷

28
- 84862253166
- MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database
- [28] Davis, A.P., Wiegers, T.C., Rosenstein, M.C., Mattingly, C.J., MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database. Database, 2012, bar065.
- (2012) Database , pp. bar065
- Davis, A.P.¹ Wiegers, T.C.² Rosenstein, M.C.³ Mattingly, C.J.⁴

29
- 13444266370
- Online Mendelian Inheritance in Man (OMIM) a knowledgebase of human genes and genetic disorders
- [29] Hamosh, Ada, et al. Online Mendelian Inheritance in Man (OMIM) a knowledgebase of human genes and genetic disorders. Nucl. Acids Res. 33:Suppl. 1 (2005), D514–D517.
- (2005) Nucl. Acids Res. , vol.33 , pp. D514-D517
- Hamosh, A.¹

30
- 47749122510
- A survey of named entity recognition and classification
- [30] Nadeau, David, Sekine, Satoshi, A survey of named entity recognition and classification. Lingvisticae Investigationes 30:1 (2007), 3–26.
- (2007) Lingvisticae Investigationes , vol.30 , Issue.1 , pp. 3-26
- Nadeau, D.¹ Sekine, S.²

31
- 25444533246
- Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
- [31] John Lafferty, Andrew McCallum, Fernando C.N. Pereira, Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data, 2001.
- (2001)
- Lafferty, J.¹ McCallum, A.² Pereira, F.C.N.³

32
- 84946138625
- Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations
- [32] Munkhdalai, Tsendsuren, et al. Incorporating domain knowledge in chemical and biomedical named entity recognition with word representations. J. Cheminform., 7(S-1), 2015, S9.
- (2015) J. Cheminform. , vol.7 , Issue.S-1 , pp. S9
- Munkhdalai, T.¹

33
- 84859972823
- Practical very large scale CRFs
- Association for Computational Linguistics
- [33] Lavergne, Thomas, Cappé, Olivier, Yvon, François, Practical very large scale CRFs. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, 2010, Association for Computational Linguistics.
- (2010) Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
- Lavergne, T.¹ Cappé, O.² Yvon, F.³

34
- 84857855190
- Random search for hyper-parameter optimization
- [34] Bergstra, James, Bengio, Yoshua, Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13:1 (2012), 281–305.
- (2012) J. Mach. Learn. Res. , vol.13 , Issue.1 , pp. 281-305
- Bergstra, J.¹ Bengio, Y.²

35
- 46649086341
- Exploiting the contextual cues for bio-entity name recognition in biomedical literature
- [35] Yang, Z., Lin, H., Li, Y., Exploiting the contextual cues for bio-entity name recognition in biomedical literature. J. Biomed. Inform. 41:4 (2008), 580–587.
- (2008) J. Biomed. Inform. , vol.41 , Issue.4 , pp. 580-587
- Yang, Z.¹ Lin, H.² Li, Y.³

36
- 79960832442
- Weighted vote-based classifier ensemble for named entity recognition: a genetic algorithm-based approach
- [36] Ekbal, A., Saha, S., Weighted vote-based classifier ensemble for named entity recognition: a genetic algorithm-based approach. ACM Trans. Asian Lang. Inform. Process. (TALIP), 10(2), 2011, 9.
- (2011) ACM Trans. Asian Lang. Inform. Process. (TALIP) , vol.10 , Issue.2 , pp. 9
- Ekbal, A.¹ Saha, S.²

37
- 52149087660
- Identifying named entities in biomedical text based on stacked generalization
- in: 7th World Congress on Intelligent Control and Automation. WCICA 2008, IEEE.
- [37] H. Wang, T. Zhao, Identifying named entities in biomedical text based on stacked generalization, in: 7th World Congress on Intelligent Control and Automation. WCICA 2008, IEEE, 2008, pp. 160–164.
- (2008) , pp. 160-164
- Wang, H.¹ Zhao, T.²

38
- 0018465664
- A composite classifier system design: concepts and methodology
- [38] Dasarathy, B.V., Sheela, B.V., A composite classifier system design: concepts and methodology. Proc. IEEE 67:5 (1979), 708–713.
- (1979) Proc. IEEE , vol.67 , Issue.5 , pp. 708-713
- Dasarathy, B.V.¹ Sheela, B.V.²

39
- 0036567392
- Ensembling neural networks: many could be better than all
- [39] Zhou, Z.H., Wu, J., Tang, W., Ensembling neural networks: many could be better than all. Artif. Intell. 137:1 (2002), 239–263.
- (2002) Artif. Intell. , vol.137 , Issue.1 , pp. 239-263
- Zhou, Z.H.¹ Wu, J.² Tang, W.³

40
- 33947307025
- Recognition of protein/gene names from text using an ensemble of classifiers
- [40] Zhou, G., Shen, D., Zhang, J., Su, J., Tan, S., Recognition of protein/gene names from text using an ensemble of classifiers. BMC Bioinform., 6(1), 2005, 1.
- (2005) BMC Bioinform. , vol.6 , Issue.1 , pp. 1
- Zhou, G.¹ Shen, D.² Zhang, J.³ Su, J.⁴ Tan, S.⁵

41
- 0026692226
- Stacked generalization
- [41] Wolpert, D.H., Stacked generalization. Neural Networks 5:2 (1992), 241–259.
- (1992) Neural Networks , vol.5 , Issue.2 , pp. 241-259
- Wolpert, D.H.¹

42
- 80455122674
- The Problem of Linguistic Approximation in System Analysis
- [42] P.P. Bonissone, The Problem of Linguistic Approximation in System Analysis, 1979.
- (1979)
- Bonissone, P.P.¹

43
- 0018493536
- A general approach to linguistic approximation
- [43] Eshragh, F., Mamdani, E.H., A general approach to linguistic approximation. Int. J. Man Mach. Stud. 11:4 (1979), 501–519.
- (1979) Int. J. Man Mach. Stud. , vol.11 , Issue.4 , pp. 501-519
- Eshragh, F.¹ Mamdani, E.H.²

44
- 0347556783
- Quantitative analysis with linguistic values
- [44] Wenstøp, F., Quantitative analysis with linguistic values. Fuzzy Sets Syst. 4:2 (1980), 99–115.
- (1980) Fuzzy Sets Syst. , vol.4 , Issue.2 , pp. 99-115
- Wenstøp, F.¹

45
- 0000654487
- Measures of similarity among fuzzy concepts: a comparative analysis
- [45] Zwick, R., Carlstein, E., Budescu, D.V., Measures of similarity among fuzzy concepts: a comparative analysis. Int. J. Approx. Reason. 1:2 (1987), 221–242.
- (1987) Int. J. Approx. Reason. , vol.1 , Issue.2 , pp. 221-242
- Zwick, R.¹ Carlstein, E.² Budescu, D.V.³

46
- 84883572716
- PubTator: a web-based text mining tool for assisting biocuration
- [46] Wei, Chih-Hsuan, Kao, Hung-Yu, Lu, Zhiyong, PubTator: a web-based text mining tool for assisting biocuration. Nucl. Acids Res., 2013, gkt441.
- (2013) Nucl. Acids Res. , pp. gkt441
- Wei, C.-H.¹ Kao, H.-Y.² Lu, Z.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.