메뉴 건너뛰기




Volumn 6, Issue 3, 2011, Pages

Clustering more than two million biomedical publications: Comparing the accuracies of nine text-based similarity approaches

Author keywords

[No Author keywords available]

Indexed keywords

ACCURACY; ANALYTIC METHOD; ARTICLE; CONTROLLED STUDY; INTERMETHOD COMPARISON; MEDICAL INFORMATION; MEDICAL LITERATURE; MEDLINE; PUBLICATION; VALIDATION PROCESS; CLUSTER ANALYSIS; COMPARATIVE STUDY; DOCUMENTATION; INFORMATION RETRIEVAL; MEDICAL RESEARCH; METHODOLOGY;

EID: 79952745199     PISSN: None     EISSN: 19326203     Source Type: Journal    
DOI: 10.1371/journal.pone.0018029     Document Type: Article
Times cited : (241)

References (47)
  • 3
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • Salton G, Buckley C, (1988) Term-weighting approaches in automatic text retrieval. Information Processing & Management 24: 513-523.
    • (1988) Information Processing & Management , vol.24 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 4
  • 6
    • 0022906994 scopus 로고
    • Implementing agglomerative hierarchic clustering algorithms for use in document retrieval
    • Voorhees EM, (1986) Implementing agglomerative hierarchic clustering algorithms for use in document retrieval. Information Processing & Management 22: 465-476.
    • (1986) Information Processing & Management , vol.22 , pp. 465-476
    • Voorhees, E.M.1
  • 7
    • 0030381274 scopus 로고    scopus 로고
    • Reexamining the cluster hypothesis: Scatter/gather on retrieval results
    • Hearst MA, Pedersen JO, (1996) Reexamining the cluster hypothesis: Scatter/gather on retrieval results. Proceedings of ACM SIGIR 1996 pp. 76-84.
    • (1996) Proceedings of ACM SIGIR 1996 , pp. 76-84
    • Hearst, M.A.1    Pedersen, J.O.2
  • 13
    • 14044254746 scopus 로고    scopus 로고
    • Textpresso: An ontology-based information retrieval and extraction system for biological literature
    • Müller HM, Kenny EE, Sternberg PW, (2004) Textpresso: An ontology-based information retrieval and extraction system for biological literature. PLoS Biology 2: e309.
    • (2004) PLoS Biology , vol.2
    • Müller, H.M.1    Kenny, E.E.2    Sternberg, P.W.3
  • 17
    • 33847691469 scopus 로고    scopus 로고
    • Biomedical knowledge navigation by literature clustering
    • Yamamoto Y, Takagi T, (2007) Biomedical knowledge navigation by literature clustering. Journal of Biomedical Informatics 40: 114-130.
    • (2007) Journal of Biomedical Informatics , vol.40 , pp. 114-130
    • Yamamoto, Y.1    Takagi, T.2
  • 18
    • 79952761867 scopus 로고    scopus 로고
    • Utilizing nonnegative matrix factorization for email classification problems
    • In: Berry MW, Kogan J, editors, West Sussex, John Wiley & Sons, Ltd
    • Janacek AGK, Gansterer WN, (2010) Utilizing nonnegative matrix factorization for email classification problems. In: Berry MW, Kogan J, editors. Text Mining: Applications and Theory West Sussex John Wiley & Sons, Ltd pp. 57-80.
    • (2010) Text Mining: Applications and Theory , pp. 57-80
    • Janacek, A.G.K.1    Gansterer, W.N.2
  • 19
    • 79952771802 scopus 로고    scopus 로고
    • Content-based spam email classification using machine-learning algorithms
    • In: Berry MW, Kogan J, editors, West Sussex, John Wiley & Sons, Ltd
    • Jiang EP, (2010) Content-based spam email classification using machine-learning algorithms. In: Berry MW, Kogan J, editors. Text Mining: Applications and Theory West Sussex John Wiley & Sons, Ltd pp. 37-56.
    • (2010) Text Mining: Applications and Theory , pp. 37-56
    • Jiang, E.P.1
  • 20
    • 58149466160 scopus 로고    scopus 로고
    • Document-document similarity approaches and science mapping: Experimental comparison of five approaches
    • Ahlgren P, Colliander C, (2009) Document-document similarity approaches and science mapping: Experimental comparison of five approaches. Journal of Informetrics 3: 49-63.
    • (2009) Journal of Informetrics , vol.3 , pp. 49-63
    • Ahlgren, P.1    Colliander, C.2
  • 21
    • 46749129689 scopus 로고    scopus 로고
    • Bibliographic coupling, common abstract stems and clustering: A comparison of two document-document similarity approaches in the context of science mapping
    • Ahlgren P, Jarneving B, (2008) Bibliographic coupling, common abstract stems and clustering: A comparison of two document-document similarity approaches in the context of science mapping. Scientometrics 76: 273-290.
    • (2008) Scientometrics , vol.76 , pp. 273-290
    • Ahlgren, P.1    Jarneving, B.2
  • 25
    • 33745610921 scopus 로고    scopus 로고
    • Combining contents and citations for scientific document classification
    • Cao MD, Gao X, (2005) Combining contents and citations for scientific document classification. AI 2005: Advances in artificial intelligence pp. 143-152.
    • (2005) AI 2005: Advances in Artificial Intelligence , pp. 143-152
    • Cao, M.D.1    Gao, X.2
  • 26
    • 78449276223 scopus 로고    scopus 로고
    • Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately?
    • Boyack KW, Klavans R, (2010) Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? Journal of the American Society for Information Science and Technology 61: 2389-2404.
    • (2010) Journal of the American Society for Information Science and Technology , vol.61 , pp. 2389-2404
    • Boyack, K.W.1    Klavans, R.2
  • 27
    • 0025952277 scopus 로고
    • Divergence measures based on Shannon entropy
    • Lin J, (1991) Divergence measures based on Shannon entropy. IEEE Transactions on Information Theory 37: 145-151.
    • (1991) IEEE Transactions on Information Theory , vol.37 , pp. 145-151
    • Lin, J.1
  • 29
    • 0032183760 scopus 로고    scopus 로고
    • A semidiscrete matrix decomposition for latent semantic indexing in information retrieval
    • Kolda TG, O'Leary DP, (1998) A semidiscrete matrix decomposition for latent semantic indexing in information retrieval. ACM Transactions on Information Systems 16: 322-346.
    • (1998) ACM Transactions on Information Systems , vol.16 , pp. 322-346
    • Kolda, T.G.1    O'Leary, D.P.2
  • 31
    • 0029546874 scopus 로고
    • Using linear algebra for intelligent information retrieval
    • Berry MW, Dumais ST, O'Brien GW, (1995) Using linear algebra for intelligent information retrieval. SIAM Review 37: 573-595.
    • (1995) SIAM Review , vol.37 , pp. 573-595
    • Berry, M.W.1    Dumais, S.T.2    O'Brien, G.W.3
  • 32
    • 44449173120 scopus 로고    scopus 로고
    • Semantically linking and browsing PubMed abstracts with gene ontology
    • Vanteru BC, Shaik JS, Yeasin M, (2008) Semantically linking and browsing PubMed abstracts with gene ontology. BMC Genomics 9: S10.
    • (2008) BMC Genomics , vol.9
    • Vanteru, B.C.1    Shaik, J.S.2    Yeasin, M.3
  • 34
    • 0012435995 scopus 로고    scopus 로고
    • A probabilistic model of information retrieval: Development and comparative experiments. Part 1
    • Sparck Jones K, Walker S, Robertson SE, (2000) A probabilistic model of information retrieval: Development and comparative experiments. Part 1. Information Processing & Management 36: 779-808.
    • (2000) Information Processing & Management , vol.36 , pp. 779-808
    • Sparck Jones, K.1    Walker, S.2    Robertson, S.E.3
  • 35
    • 0343169734 scopus 로고    scopus 로고
    • A probabilistic model of information retrieval: Development and comparative experiments. Part 2
    • Sparck Jones K, Walker S, Robertson SE, (2000) A probabilistic model of information retrieval: Development and comparative experiments. Part 2. Information Processing & Management 36: 809-840.
    • (2000) Information Processing & Management , vol.36 , pp. 809-840
    • Sparck Jones, K.1    Walker, S.2    Robertson, S.E.3
  • 37
    • 0003410794 scopus 로고    scopus 로고
    • SOM PAK: The Self-Organizing Map program package
    • Technical Report A31. Helsinki University of Technology, Laboratory of Computer and Information Science
    • Kohonen T, Hynninen J, Kangas J, Laaksonen J, (1996) SOM PAK: The Self-Organizing Map program package. Technical Report A31. Helsinki University of Technology, Laboratory of Computer and Information Science.
    • (1996)
    • Kohonen, T.1    Hynninen, J.2    Kangas, J.3    Laaksonen, J.4
  • 40
    • 0034818212 scopus 로고    scopus 로고
    • Unsupervised learning by probabilistic latent semantic analysis
    • Hofmann T, (2001) Unsupervised learning by probabilistic latent semantic analysis. Machine Learning 42: 177-196.
    • (2001) Machine Learning , vol.42 , pp. 177-196
    • Hofmann, T.1
  • 41
    • 38549166666 scopus 로고    scopus 로고
    • PubMed related articles: A probabilistic topic-based model for content similarity
    • Lin J, Wilbur WJ, (2007) PubMed related articles: A probabilistic topic-based model for content similarity. BMC Bioinformatics 8: 423.
    • (2007) BMC Bioinformatics , vol.8 , pp. 423
    • Lin, J.1    Wilbur, W.J.2
  • 47
    • 67349137179 scopus 로고    scopus 로고
    • Visual conceptualizations and models of science
    • Börner K, Scharnhorst A, (2009) Visual conceptualizations and models of science. Journal of Informetrics 3: 161-172.
    • (2009) Journal of Informetrics , vol.3 , pp. 161-172
    • Börner, K.1    Scharnhorst, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.