Volume 47, 2014, Pages 1-10

NCBI disease corpus: A resource for disease name recognition and concept normalization

Author keywords

Corpus annotation; Disease name corpus; Disease name normalization; Disease name recognition; Named entity recognition

Indexed keywords

BENCHMARKING; DATA MINING; KNOWLEDGE BASED SYSTEMS; NATURAL LANGUAGE PROCESSING SYSTEMS; RESEARCH; TEXT PROCESSING; TOOLS

EID: 84895437465     PISSN: 15320464     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jbi.2013.12.006     Document Type: Article
Times cited: 812

References (45)
  • 2
    • 66149157263
    • Digital disease detection - harnessing the Web for public health surveillance
    • Brownstein J.S., Freifeld C.C., Madoff L.C. Digital disease detection - harnessing the Web for public health surveillance. N Engl J Med 2009, 360:2153-2157.
    • (2009) N Engl J Med , vol.360 , pp. 2153-2157
    • Brownstein, J.S.1    Freifeld, C.C.2    Madoff, L.C.3
  • 4
    • 75649117442
    • Analysis of biological processes and diseases using text mining approaches
    • Krallinger M., Leitner F., Valencia A. Analysis of biological processes and diseases using text mining approaches. Methods Mol Biol 2010, 593:341-382.
    • (2010) Methods Mol Biol , vol.593 , pp. 341-382
    • Krallinger, M.1    Leitner, F.2    Valencia, A.3
  • 5
    • 84863245800
    • Linking multiple disease-related resources through UMLS. In: Proceedings of the 2nd ACM SIGHIT international health informatics symposium;
    • Neveol A, Li J, Lu Z. Linking multiple disease-related resources through UMLS. In: Proceedings of the 2nd ACM SIGHIT international health informatics symposium; 2012. p. 767-72.
    • (2012) , pp. 767-772
    • Neveol, A.1    Li, J.2    Lu, Z.3
  • 6
    • 84895447788
    • Exploring two biomedical text genres for disease recognition. In: Proceedings of the ACL 2009 workshop on natural language processing in biomedicine (BioNLP 2009);
    • Neveol A, Kim W, Wilbur WJ, Lu Z. Exploring two biomedical text genres for disease recognition. In: Proceedings of the ACL 2009 workshop on natural language processing in biomedicine (BioNLP 2009); 2009. p. 144-52.
    • (2009) , pp. 144-152
    • Neveol, A.1    Kim, W.2    Wilbur, W.J.3    Lu, Z.4
  • 7
    • 79958124672
    • A context-blocks model for identifying clinical relationships in patient records
    • Islamaj Dogan R., Neveol A., Lu Z. A context-blocks model for identifying clinical relationships in patient records. BMC Bioinformatics 2011, 12(Suppl 3):S3.
    • (2011) BMC Bioinformatics , vol.12 , Issue.SUPPL 3
    • Islamaj Dogan, R.1    Neveol, A.2    Lu, Z.3
  • 10
    • 85121305390
    • Disease mention recognition with specific features. In: Proceedings of the ACL 2010 workshop on natural language processing in biomedicine (BioNLP 2010);
    • Chowdhury FM, Lavelli A. Disease mention recognition with specific features. In: Proceedings of the ACL 2010 workshop on natural language processing in biomedicine (BioNLP 2010); 2010. p. 83-90.
    • (2010) , pp. 83-90
    • Chowdhury, F.M.1    Lavelli, A.2
  • 11
    • 84895444518
    • Enabling recognition of diseases in biomedical text with machine learning: corpus and benchmark. In: Proceedings of the 2009 symposium on languages in biology and medicine. Jeju Island, South Korea
    • Leaman R, Miller C, Gonzalez G. Enabling recognition of diseases in biomedical text with machine learning: corpus and benchmark. In: Proceedings of the 2009 symposium on languages in biology and medicine. Jeju Island, South Korea; 2009. p. 82-9.
    • (2009) , pp. 82-89
    • Leaman, R.1    Miller, C.2    Gonzalez, G.3
  • 12
    • 0035752429
    • Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proc AMIA symp;
    • Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proc AMIA symp; 2001. p. 17-21.
    • (2001) , pp. 17-21
    • Aronson, A.R.1
  • 16
    • 33947321307
    • Overview of BioCreAtIvE task 1B: normalized gene lists
    • Hirschman L., Colosimo M., Morgan A., Yeh A. Overview of BioCreAtIvE task 1B: normalized gene lists. BMC Bioinformatics 2005, 6(Suppl 1):S11.
    • (2005) BMC Bioinformatics , vol.6 , Issue.SUPPL 1
    • Hirschman, L.1    Colosimo, M.2    Morgan, A.3    Yeh, A.4
  • 17
    • 38449120325
    • A reappraisal of sentence and token splitting for life sciences documents
    • Tomanek K., Wermter J., Hahn U. A reappraisal of sentence and token splitting for life sciences documents. Stud Health Technol Inform 2007, 129:524-528.
    • (2007) Stud Health Technol Inform , vol.129 , pp. 524-528
    • Tomanek, K.1    Wermter, J.2    Hahn, U.3
  • 18
    • 84895464025
    • Integrated annotation for biomedical information extraction. In: Proc of the human language technology conference and the annual meeting of the North American chapter of the association for computational linguistics (HLT/NAACL);
    • Kulick S, Bies A, Liberman M, Mandel M, McDonald R, Palmer M, et al. Integrated annotation for biomedical information extraction. In: Proc of the human language technology conference and the annual meeting of the North American chapter of the association for computational linguistics (HLT/NAACL); 2004.
    • (2004)
    • Kulick, S.1    Bies, A.2    Liberman, M.3    Mandel, M.4    McDonald, R.5    Palmer, M.6
  • 19
    • 33646016255
    • Parsing biomedical literature
    • Lease M., Charniak E. Parsing biomedical literature. Natural language processing - IJCNLP 2005 2005, 58-69. Springer, Berlin, Heidelberg. R. Dale, K.-F. Wong, J. Su, O. Kwong (Eds.).
    • (2005) Natural language processing - IJCNLP 2005 , pp. 58-69
    • Lease, M.1    Charniak, E.2
  • 21
    • 84895473273
    • An improved corpus of disease mentions in PubMed citations. In: Proceedings of the 2012 workshop on biomedical natural language processing;
    • Islamaj Dogan R, Lu Z. An improved corpus of disease mentions in PubMed citations. In: Proceedings of the 2012 workshop on biomedical natural language processing; 2012. p. 91-9.
    • (2012) , pp. 91-99
    • Islamaj Dogan, R.1    Lu, Z.2
  • 22
    • 84890064920
    • DNorm: disease name normalization with pairwise learning to rank. Bioinformatics
    • Leaman R, Islamaj Dogan R, Lu Z. DNorm: disease name normalization with pairwise learning to rank. Bioinformatics, 29, 2013, 2909-17.
    • (2013) , vol.29 , pp. 2909-2917
    • Leaman, R.1    Islamaj Dogan, R.2    Lu, Z.3
  • 24
    • 0345863927
    • The unified medical language system (UMLS): integrating biomedical terminology
    • Bodenreider O. The unified medical language system (UMLS): integrating biomedical terminology. Nucleic Acids Res 2004, 32. D267-70.
    • (2004) Nucleic Acids Res , vol.32
    • Bodenreider, O.1
  • 25
    • 33947250174
    • GENETAG: a tagged corpus for gene/protein named entity recognition
    • Tanabe L., Xie N., Thom L.H., Matten W., Wilbur W.J. GENETAG: a tagged corpus for gene/protein named entity recognition. BMC Bioinformatics 2005, 6(Suppl 1):S3.
    • (2005) BMC Bioinformatics , vol.6 , Issue.SUPPL 1
    • Tanabe, L.1    Xie, N.2    Thom, L.H.3    Matten, W.4    Wilbur, W.J.5
  • 26
    • 71749121869
    • Construction of an annotated corpus to support biomedical information extraction
    • Thompson P., Iqbal S.A., McNaught J., Ananiadou S. Construction of an annotated corpus to support biomedical information extraction. BMC Bioinformatics 2009, 10:349.
    • (2009) BMC Bioinformatics , vol.10 , pp. 349
    • Thompson, P.1    Iqbal, S.A.2    McNaught, J.3    Ananiadou, S.4
  • 27
    • 79952765434
    • Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction
    • Neveol A., Islamaj Dogan R., Lu Z. Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction. J Biomed Inform 2011, 44:310-318.
    • (2011) J Biomed Inform , vol.44 , pp. 310-318
    • Neveol, A.1    Islamaj Dogan, R.2    Lu, Z.3
  • 28
    • 84861889356
    • Anaphoric reference in clinical reports: characteristics of an annotated corpus
    • Chapman W.W., Savova G.K., Zheng J., Tharp M., Crowley R. Anaphoric reference in clinical reports: characteristics of an annotated corpus. J Biomed Inform 2012, 45:507-521.
    • (2012) J Biomed Inform , vol.45 , pp. 507-521
    • Chapman, W.W.1    Savova, G.K.2    Zheng, J.3    Tharp, M.4    Crowley, R.5
  • 29
    • 84865060319
    • A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools
    • Verspoor K., Cohen K.B., Lanfranchi A., Warner C., Johnson H.L., Roeder C., et al. A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools. BMC Bioinformatics 2012, 13:207.
    • (2012) BMC Bioinformatics , vol.13 , pp. 207
    • Verspoor, K.1    Cohen, K.B.2    Lanfranchi, A.3    Warner, C.4    Johnson, H.L.5    Roeder, C.6
  • 31
    • 4944229711
    • GENIA corpus - semantically annotated corpus for bio-textmining
    • Kim J.D., Ohta T., Tateisi Y., Tsujii J. GENIA corpus - semantically annotated corpus for bio-textmining. Bioinformatics 2003, 19(Suppl 1). i180-2.
    • (2003) Bioinformatics , vol.19 , Issue.SUPPL 1
    • Kim, J.D.1    Ohta, T.2    Tateisi, Y.3    Tsujii, J.4
  • 32
    • 77949431037
    • LINNAEUS: a species name identification system for biomedical literature
    • Gerner M., Nenadic G., Bergman C.M. LINNAEUS: a species name identification system for biomedical literature. BMC Bioinformatics 2010, 11:85.
    • (2010) BMC Bioinformatics , vol.11 , pp. 85
    • Gerner, M.1    Nenadic, G.2    Bergman, C.M.3
  • 34
    • 47749119407
    • Improving biomedical corpus annotation guidelines. In: Proceedings of the joint BioLink and 9th bio-ontologies meeting
    • Lu Z, Bada M, Ogren P, Cohen KB, Hunter L. Improving biomedical corpus annotation guidelines. In: Proceedings of the joint BioLink and 9th bio-ontologies meeting; 2006. p. 89-92.
    • (2006) , pp. 89-92
    • Lu, Z.1    Bada, M.2    Ogren, P.3    Cohen, K.B.4    Hunter, L.5
  • 35
    • 84895432096
    • Using rule-based natural language processing to improve disease normalization in biomedical text
    • Kang N., Singh B., Afzal Z., van Mulligen E.M., Kors J.A. Using rule-based natural language processing to improve disease normalization in biomedical text. J Am Med Inform Assoc 2012.
    • (2012) J Am Med Inform Assoc
    • Kang, N.1    Singh, B.2    Afzal, Z.3    van Mulligen, E.M.4    Kors, J.A.5
  • 36
    • 84876499465
    • Accelerating literature curation with text mining tools: a case study of using PubTator to curate genes in PubMed abstracts
    • Wei C.H., Harris R.B., Li D., Berardini T.Z., Huala E., Kao H.Y., et al. Accelerating literature curation with text mining tools: a case study of using PubTator to curate genes in PubMed abstracts. Database: J Biol Databases Curation 2012, bas041.
    • (2012) Database: J Biol Databases Curation
    • Wei, C.H.1    Harris, R.B.2    Li, D.3    Berardini, T.Z.4    Huala, E.5    Kao, H.Y.6
  • 37
    • 84871949206
    • Pre-annotating clinical notes and clinical trial announcements for gold standard corpus development: evaluating the impact on annotation speed and potential bias.
    • Lingren T, Deleger L, Zhai H, Meinzen-Derr J, Kaiser M, Stoutenborough L, et al. Pre-annotating clinical notes and clinical trial announcements for gold standard corpus development: evaluating the impact on annotation speed and potential bias. In: Proceedings of the second IEEE international conference on healthcare informatics, imaging and systems biology (HISB 2012), San Diego, CA; 2012. p. 108.
    • (2012) Proceedings of the second IEEE international conference on healthcare informatics , pp. 108
    • Lingren, T.1    Deleger, L.2    Zhai, H.3    Meinzen-Derr, J.4    Kaiser, M.5    Stoutenborough, L.6
  • 38
    • 84883572716
    • PubTator: a web-based text mining tool for assisting biocuration.
    • Wei CH, Kao HY, Lu Z. PubTator: a web-based text mining tool for assisting biocuration. Nucleic Acids Res. 2013, 41(Web Server issue): W518-22.
    • (2013) Nucleic Acids Res. , vol.41
    • Wei, C.H.1    Kao, H.Y.2    Lu, Z.3
  • 39
    • 18044385763
    • Agreement, the f-measure, and reliability in information retrieval
    • Hripcsak G., Rothschild A.S. Agreement, the f-measure, and reliability in information retrieval. J Am Med Inform Assoc 2005, 12:296-298.
    • (2005) J Am Med Inform Assoc , vol.12 , pp. 296-298
    • Hripcsak, G.1    Rothschild, A.S.2
  • 40
    • 40549140499
    • BANNER: an executable survey of advances in biomedical named entity recognition
    • Leaman R., Gonzalez G. BANNER: an executable survey of advances in biomedical named entity recognition. Pac Symp Biocomput 2008, 652-663.
    • (2008) Pac Symp Biocomput , pp. 652-663
    • Leaman, R.1    Gonzalez, G.2
  • 41
    • 84863537188
    • Unified medical language system term occurrences in clinical notes: a large-scale corpus analysis
    • Wu S.T., Liu H., Li D., Tao C., Musen M.A., Chute C.G., et al. Unified medical language system term occurrences in clinical notes: a large-scale corpus analysis. J Am Med Inform Assoc 2012, 19. e149-56.
    • (2012) J Am Med Inform Assoc , vol.19
    • Wu, S.T.1    Liu, H.2    Li, D.3    Tao, C.4    Musen, M.A.5    Chute, C.G.6
  • 42
    • 54949148775
    • Forty years of SNOMED: a literature review
    • Cornet R., de Keizer N. Forty years of SNOMED: a literature review. BMC Med Inform Decis Mak 2008, 8(Suppl 1):S2.
    • (2008) BMC Med Inform Decis Mak , vol.8 , Issue.SUPPL 1
    • Cornet, R.1    de Keizer, N.2
  • 44
    • 77953936166
    • The human phenotype ontology
    • Robinson P.N., Mundlos S. The human phenotype ontology. Clin Genet 2010, 77:525-534.
    • (2010) Clin Genet , vol.77 , pp. 525-534
    • Robinson, P.N.1    Mundlos, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.