메뉴 건너뛰기




Volumn 16, Issue 13, 2015, Pages

A heuristic approach to determine an appropriate number of topics in topic modeling

Author keywords

Latent Dirichlet allocation (LDA); Perplexity; Rate of perplexity change (RPC); Topic number

Indexed keywords

ARTIFICIAL INTELLIGENCE; BIOINFORMATICS; DATA MINING; ITERATIVE METHODS; LEARNING SYSTEMS; NUMERICAL METHODS; STATISTICS;

EID: 84961612892     PISSN: None     EISSN: 14712105     Source Type: Journal    
DOI: 10.1186/1471-2105-16-S13-S8     Document Type: Article
Times cited : (286)

References (18)
  • 2
    • 0034818212 scopus 로고    scopus 로고
    • Unsupervised learning by probabilistic latent semantic analysis
    • Hofmann T: Unsupervised learning by probabilistic latent semantic analysis. Machine Learning. 2001, 42 (1-2): 177-196.
    • (2001) Machine Learning , vol.42 , Issue.1-2 , pp. 177-196
    • Hofmann, T.1
  • 6
    • 1542287501 scopus 로고    scopus 로고
    • Modeling annotated data. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
    • Blei DM, Jordan MI: Modeling annotated data. Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. 2003, 127-134.
    • (2003) , pp. 127-134
    • Blei, D.M.1    Jordan, M.I.2
  • 9
    • 79959388502 scopus 로고    scopus 로고
    • Multi-view methods for protein structure comparison using latent dirichlet allocation
    • Shivashankar S, Srivathsan S, Ravindran B, Tendulkar AV: Multi-view methods for protein structure comparison using latent dirichlet allocation. Bioinformatics. 2011, 27 (13): i61-i68. 10.1093/bioinformatics/btr249.
    • (2011) Bioinformatics , vol.27 , Issue.13 , pp. i61-i68
    • Shivashankar, S.1    Srivathsan, S.2    Ravindran, B.3    Tendulkar, A.V.4
  • 10
    • 84961589247 scopus 로고    scopus 로고
    • Topic modeling for cluster analysis of large biological and medical datasets
    • Zhao W, Zou W, Chen JJ: Topic modeling for cluster analysis of large biological and medical datasets. BMC Bioinformatics. 2014, 15 (Suppl 11): S11-10.1186/1471-2105-15-S11-S11.
    • (2014) BMC Bioinformatics , vol.15 , pp. S11
    • Zhao, W.1    Zou, W.2    Chen, J.J.3
  • 11
    • 77954179297 scopus 로고    scopus 로고
    • Quantifying the distribution of probes between subcellular locations using unsupervised pattern unmixing
    • Coelho LP, Peng T, Murphy RF: Quantifying the distribution of probes between subcellular locations using unsupervised pattern unmixing. Bioinformatics. 2010, 26 (12): i7-i12. 10.1093/bioinformatics/btq220.
    • (2010) Bioinformatics , vol.26 , Issue.12 , pp. i7-i12
    • Coelho, L.P.1    Peng, T.2    Murphy, R.F.3
  • 12
    • 84961612772 scopus 로고    scopus 로고
    • Antigenic formulae of the Salmonella serovars. Paris, France: WHO Collaborting Centre for Reference and Research on Salmonella
    • Grimont PA, Weill FX: Antigenic formulae of the Salmonella serovars. Paris, France: WHO Collaborting Centre for Reference and Research on Salmonella. 2007, 9
    • (2007) , pp. 9
    • Grimont, P.A.1    Weill, F.X.2
  • 14
    • 84961639416 scopus 로고    scopus 로고
    • MALLET: A Machine Learning for Language Toolkit
    • McCallun AK: MALLET: A Machine Learning for Language Toolkit. 2002, [ http://http://mallet.cs.umass.edu/ ]
    • (2002)
    • McCallun, A.K.1
  • 15
    • 3042666256 scopus 로고    scopus 로고
    • MUSCLE: multiple sequence alignment with high accuracy and high throughput
    • Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32 (5): 1792-1797. 10.1093/nar/gkh340.
    • (2004) Nucleic Acids Res , vol.32 , Issue.5 , pp. 1792-1797
    • Edgar, R.C.1
  • 16
    • 84940644968 scopus 로고
    • A Mathematical Theory of Communication
    • Shannon CE: A Mathematical Theory of Communication. At&T Tech J. 1948, 27 (3): 379-423.
    • (1948) At&T Tech J , vol.27 , Issue.3 , pp. 379-423
    • Shannon, C.E.1
  • 18
    • 0030606764 scopus 로고    scopus 로고
    • The change-point problem for dependent observations
    • Giraitis L, Leipus R, Surgailis D: The change-point problem for dependent observations. J Stat Plan Infer. 1996, 53 (3): 297-310. 10.1016/0378-3758(95)00148-4.
    • (1996) J Stat Plan Infer , vol.53 , Issue.3 , pp. 297-310
    • Giraitis, L.1    Leipus, R.2    Surgailis, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.