



Volume 9781461432234, 2012, Pages 129-161

Dimensionality reduction and topic modeling: From latent semantic indexing to latent Dirichlet allocation and beyond

Author keywords

Dimension reduction; Latent Dirichlet allocation; Latent semantic indexing; Topic modeling

Indexed keywords

INDEXING (OF INFORMATION); STATISTICS

EID: 84938562120     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1007/978-1-4614-3223-4_5     Document Type: Chapter
Times cited: 106

References (76)
  • 2
    • 84881083774
    • A framework for incorporating general domain knowledge into latent Dirichlet allocation using first-order logic
    • D. Andrzejewski, X. Zhu, M. Craven, and B. Recht. A framework for incorporating general domain knowledge into latent Dirichlet allocation using first-order logic. In IJCAI, 2011.
    • (2011) IJCAI
    • Andrzejewski, D.1    Zhu, X.2    Craven, M.3    Recht, B.4
  • 3
    • 77954725706
    • On smoothing and inference for topic models
    • A. Asuncion, M. Welling, P. Smyth, and Y. Teh. On smoothing and inference for topic models. In UAI, pages 27-34, 2009.
    • (2009) UAI , pp. 27-34
    • Asuncion, A.1    Welling, M.2    Smyth, P.3    Teh, Y.4
  • 5
    • 35548942978
    • Why spectral retrieval works
    • H. Bast and D. Majumdar. Why spectral retrieval works. In SIGIR, page 11, 2005.
    • (2005) SIGIR , pp. 11
    • Bast, H.1    Majumdar, D.2
  • 8
    • 0029546874
    • Using linear algebra for intelligent information retrieval
    • M. Berry, S. Dumais, and G. O'Brien. Using linear algebra for intelligent information retrieval. SIAM review, 37(4):573-595, 1995.
    • (1995) SIAM Review , vol.37 , Issue.4 , pp. 573-595
    • Berry, M.1    Dumais, S.2    O'Brien, G.3
  • 9
    • 33645025214
    • Hierarchical topic models and the nested Chinese restaurant process
    • D. Blei, T. Griffiths, M. Jordan, and J. Tenenbaum. Hierarchical topic models and the nested Chinese restaurant process. In NIPS, 2003.
    • (2003) NIPS
    • Blei, D.1    Griffiths, T.2    Jordan, M.3    Tenenbaum, J.4
  • 10
    • 33749242628
    • Dynamic topic models
    • D. Blei and J. Lafferty. Dynamic topic models. In ICML, pages 113-120, 2006.
    • (2006) ICML , pp. 113-120
    • Blei, D.1    Lafferty, J.2
  • 11
    • 52449116403
    • A correlated topic model of science
    • D. Blei and J. Lafferty. A correlated topic model of science. AAS, 1(1):17-35, 2007.
    • (2007) AAS , vol.1 , Issue.1 , pp. 17-35
    • Blei, D.1    Lafferty, J.2
  • 12
    • 74549185029
    • Supervised topic models
    • D. Blei and J. McAuliffe. Supervised topic models. In NIPS, 2007.
    • (2007) NIPS
    • Blei, D.1    McAuliffe, J.2
  • 14
    • 79952257141
    • Multilingual topic models for unaligned text
    • J. Boyd-Graber and D. Blei. Multilingual topic models for unaligned text. In UAI, pages 75-82, 2009.
    • (2009) UAI , pp. 75-82
    • Boyd-Graber, J.1    Blei, D.2
  • 15
    • 79957668514
    • Syntactic topic models
    • J. Boyd-Graber and D. Blei. Syntactic topic models. In NIPS, pages 185-192, 2009.
    • (2009) NIPS , pp. 185-192
    • Boyd-Graber, J.1    Blei, D.2
  • 17
    • 77951168602
    • Relational topic models for document networks
    • J. Chang and D. Blei. Relational topic models for document networks. In AIStats, 2009.
    • (2009) AIStats
    • Chang, J.1    Blei, D.2
  • 18
    • 84863381525
    • Reading tea leaves: How humans interpret topic models
    • J. Chang, J. Boyd-Graber, S. Gerrish, C. Wang, and D. Blei. Reading tea leaves: How humans interpret topic models. In NIPS, pages 288-296, 2009.
    • (2009) NIPS , pp. 288-296
    • Chang, J.1    Boyd-Graber, J.2    Gerrish, S.3    Wang, C.4    Blei, D.5
  • 20
    • 17444392177
    • The missing link-A probabilistic model of document content and hypertext connectivity
    • D. Cohn. The missing link-a probabilistic model of document content and hypertext connectivity. In NIPS, 2001.
    • (2001) NIPS
    • Cohn, D.1
  • 21
    • 0038589432
    • Learning to probabilistically identify authoritative documents
    • D. Cohn and H. Chang. Learning to probabilistically identify authoritative documents. In ICML, pages 167-174, 2001.
    • (2001) ICML , pp. 167-174
    • Cohn, D.1    Chang, H.2
  • 25
    • 80052674988
    • Probabilistic topic models with biased propagation on heterogeneous information networks
    • San Diego, ACM
    • H. Deng, J. Han, B. Zhao, Y. Yu, and C. Lin. Probabilistic Topic Models with Biased Propagation on Heterogeneous Information Networks. In KDD, pages 1271-1279, San Diego, 2011. ACM.
    • (2011) KDD , pp. 1271-1279
    • Deng, H.1    Han, J.2    Zhao, B.3    Yu, Y.4    Lin, C.5
  • 26
    • 85013386603
    • A similarity-based probability model for latent semantic indexing
    • C. Ding. A similarity-based probability model for latent semantic indexing. In SIGIR, pages 58-65, 1999.
    • (1999) SIGIR , pp. 58-65
    • Ding, C.1
  • 27
    • 71149085755
    • Accounting for burstiness in topic models
    • G. Doyle and C. Elkan. Accounting for burstiness in topic models. In ICML, 2009.
    • (2009) ICML
    • Doyle, G.1    Elkan, C.2
  • 28
    • 0026989461
    • Automating the assignment of submitted manuscripts to reviewers
    • S. Dumais and J. Nielsen. Automating the assignment of submitted manuscripts to reviewers. In SIGIR, pages 233-244, 1992.
    • (1992) SIGIR , pp. 233-244
    • Dumais, S.1    Nielsen, J.2
  • 29
    • 1542287487
    • Latent concepts and the number of orthogonal factors in latent semantic analysis
    • G. Dupret. Latent concepts and the number of orthogonal factors in latent semantic analysis. In SIGIR, pages 221-226, 2003.
    • (2003) SIGIR , pp. 221-226
    • Dupret, G.1
  • 30
    • 0004236492
    • 3rd ed. Johns Hopkins University Press, Baltimore, MD, USA
    • G. Golub and C. Van Loan. Matrix computations (3rd ed.). Johns Hopkins University Press, Baltimore, MD, USA, 1996.
    • (1996) Matrix Computations
    • Golub, G.1    Van Loan, C.2
  • 34
    • 72449129527
    • A latent topic model for linked documents
    • Z. Guo, S. Zhu, Y. Chi, Z. Zhang, and Y. Gong. A latent topic model for linked documents. In SIGIR, page 720, 2009.
    • (2009) SIGIR , pp. 720
    • Guo, Z.1    Zhu, S.2    Chi, Y.3    Zhang, Z.4    Gong, Y.5
  • 35
    • 85162005069
    • Online learning for latent Dirichlet allocation
    • M. Hoffman, D. Blei, and F. Bach. Online learning for latent Dirichlet allocation. In NIPS, pages 856-864, 2010.
    • (2010) NIPS , pp. 856-864
    • Hoffman, M.1    Blei, D.2    Bach, F.3
  • 36
    • 0001509519
    • Probabilistic latent semantic analysis
    • T. Hofmann. Probabilistic latent semantic analysis. In UAI, page 21, 1999.
    • (1999) UAI , pp. 21
    • Hofmann, T.1
  • 37
    • 85026972772
    • Probabilistic latent semantic indexing
    • T. Hofmann. Probabilistic latent semantic indexing. In SIGIR, pages 50-57, 1999.
    • (1999) SIGIR , pp. 50-57
    • Hofmann, T.1
  • 38
    • 0034795233
    • Iterative residual rescaling: An analysis and generalization of LSI
    • R. Kubota Ando and L. Lee. Iterative residual rescaling: An analysis and generalization of LSI. In SIGIR, pages 154-162, 2001.
    • (2001) SIGIR , pp. 154-162
    • Kubota Ando, R.1    Lee, L.2
  • 40
    • 85193166229
    • On the computational basis of learning and cognition: Arguments from LSA
    • T. Landauer. On the computational basis of learning and cognition: Arguments from LSA. Psychology of learning and motivation, (1):1-63, 2002.
    • (2002) Psychology of Learning and Motivation , Issue.1 , pp. 1-63
    • Landauer, T.1
  • 41
    • 80053184595
    • Nonparametric Bayes Pachinko allocation
    • W. Li, D. Blei, and A. McCallum. Nonparametric Bayes Pachinko allocation. In UAI, 2007.
    • (2007) UAI
    • Li, W.1    Blei, D.2    McCallum, A.3
  • 43
    • 79955694310
    • PLDA+: Parallel latent Dirichlet allocation with data placement and pipeline processing
    • May
    • Z. Liu, Y. Zhang, E. Y. Chang, and M. Sun. PLDA+: Parallel latent Dirichlet allocation with data placement and pipeline processing. ACM Trans. Intell. Syst. Technol., 2:26:1-26:18, May 2011.
    • (2011) ACM Trans. Intell. Syst. Technol. , vol.2 , pp. 26:1-26:18
    • Liu, Z.1    Zhang, Y.2    Chang, E.Y.3    Sun, M.4
  • 46
    • 57349152312
    • Topic modeling with network regularization
    • Q. Mei, D. Cai, D. Zhang, and C. Zhai. Topic modeling with network regularization. In WWW, page 101, 2008.
    • (2008) WWW , pp. 101
    • Mei, Q.1    Cai, D.2    Zhang, D.3    Zhai, C.4
  • 47
    • 36849026729
    • Automatic labeling of multinomial topic models
    • Q. Mei, X. Shen, and C. Zhai. Automatic labeling of multinomial topic models. In KDD, pages 490-499, 2007.
    • (2007) KDD , pp. 490-499
    • Mei, Q.1    Shen, X.2    Zhai, C.3
  • 48
    • 77951202623
    • Topic models conditioned on arbitrary features with Dirichlet-multinomial regression
    • D. Mimno and A. McCallum. Topic models conditioned on arbitrary features with Dirichlet-multinomial regression. In UAI, 2008.
    • (2008) UAI
    • Mimno, D.1    McCallum, A.2
  • 56
    • 80052119994
    • An architecture for parallel topic models
    • September
    • A. Smola and S. Narayanamurthy. An architecture for parallel topic models. Proc. VLDB Endow., 3:703-710, September 2010.
    • (2010) Proc. VLDB Endow. , vol.3 , pp. 703-710
    • Smola, A.1    Narayanamurthy, S.2
  • 57
    • 33749249312
    • Hierarchical Dirichlet processes
    • Y. Teh, M. Jordan, M. Beal, and D. Blei. Hierarchical Dirichlet processes. JASA, 101, 2006.
    • (2006) JASA , vol.101
    • Teh, Y.1    Jordan, M.2    Beal, M.3    Blei, D.4
  • 58
    • 57349120510
    • Modeling online reviews with multigrain topic models
    • I. Titov and R. McDonald. Modeling online reviews with multigrain topic models. In WWW, pages 111-120, 2008.
    • (2008) WWW , pp. 111-120
    • Titov, I.1    McDonald, R.2
  • 59
    • 84898422540
    • Probabilistic latent sequential motifs: Discovering temporal activity patterns in video scenes
    • J. Varadarajan, R. Emonet, and J. Odobez. Probabilistic latent sequential motifs: Discovering temporal activity patterns in video scenes. In BMVC 2010, volume 42, pages 177-196, 2010.
    • (2010) BMVC 2010 , vol.42 , pp. 177-196
    • Varadarajan, J.1    Emonet, R.2    Odobez, J.3
  • 60
    • 79952129745
    • Rethinking LDA: Why priors matter
    • H. Wallach, D. Mimno, and A. McCallum. Rethinking LDA: Why priors matter. In NIPS, pages 1973-1981, 2009.
    • (2009) NIPS , pp. 1973-1981
    • Wallach, H.1    Mimno, D.2    McCallum, A.3
  • 62
    • 33749245495
    • Topic modeling: Beyond bag-of-words
    • H. Wallach. Topic modeling: beyond bag-of-words. In ICML, 2006.
    • (2006) ICML
    • Wallach, H.1
  • 63
    • 80052122601
    • Regularized latent semantic indexing
    • Q. Wang, J. Xu, and H. Li. Regularized latent semantic indexing. In SIGIR, 2011.
    • (2011) SIGIR
    • Wang, Q.1    Xu, J.2    Li, H.3
  • 64
    • 80052128010
    • Temporal latent semantic analysis for collaboratively generated content: Preliminary results
    • Y. Wang and E. Agichtein. Temporal latent semantic analysis for collaboratively generated content: preliminary results. In SIGIR, pages 1145-1146, 2011.
    • (2011) SIGIR , pp. 1145-1146
    • Wang, Y.1    Agichtein, E.2
  • 65
    • 33750327222
    • LDA-based document models for adhoc retrieval
    • X. Wei and W. Bruce Croft. LDA-based document models for adhoc retrieval. In SIGIR, pages 178-185, 2006.
    • (2006) SIGIR , pp. 178-185
    • Wei, X.1    Bruce Croft, W.2
  • 66
    • 79955691941
    • Parallel inference for latent Dirichlet allocation on graphics processing units
    • F. Yan, N. Xu, and Y. Qi. Parallel inference for latent Dirichlet allocation on graphics processing units. In NIPS, pages 2134-2142, 2009.
    • (2009) NIPS , pp. 2134-2142
    • Yan, F.1    Xu, N.2    Qi, Y.3
  • 67
    • 80053144865
    • Hybrid generative/discriminative learning for automatic image annotation
    • S. Yang, J. Bian, and H. Zha. Hybrid generative/discriminative learning for automatic image annotation. In UAI, 2010.
    • (2010) UAI
    • Yang, S.1    Bian, J.2    Zha, H.3
  • 68
    • 85193188495
    • Bridging the language gap: Topic-level adaptation for cross-domain knowledge transfer
    • S. Yang, S. Crain, and H. Zha. Bridging the language gap: topic-level adaptation for cross-domain knowledge transfer. In AIStat, 2011.
    • (2011) AIStat
    • Yang, S.1    Crain, S.2    Zha, H.3
  • 69
    • 80052396145
    • Like like alike - Joint friendship and interest propagation in social networks
    • S. Yang, B. Long, A. Smola, N. Sadagopan, Z. Zheng, and H. Zha. Like like alike - joint friendship and interest propagation in social networks. In WWW, 2011.
    • (2011) WWW
    • Yang, S.1    Long, B.2    Smola, A.3    Sadagopan, N.4    Zheng, Z.5    Zha, H.6
  • 70
    • 78651277940
    • Language pyramid and multi-scale text analysis
    • S. Yang and H. Zha. Language pyramid and multi-scale text analysis. In CIKM, pages 639-648, 2010.
    • (2010) CIKM , pp. 639-648
    • Yang, S.1    Zha, H.2
  • 71
    • 84863338235
    • Dirichlet-Bernoulli alignment: A generative model for multi-class multi-label multi-instance corpora
    • S. Yang, H. Zha, and B. Hu. Dirichlet-Bernoulli alignment: A generative model for multi-class multi-label multi-instance corpora. In NIPS, 2009.
    • (2009) NIPS
    • Yang, S.1    Zha, H.2    Hu, B.3
  • 72
    • 70350681184
    • Efficient methods for topic model inference on streaming document collections
    • L. Yao, D. Mimno, and A. McCallum. Efficient methods for topic model inference on streaming document collections. In KDD, pages 937-946, 2009.
    • (2009) KDD , pp. 937-946
    • Yao, L.1    Mimno, D.2    McCallum, A.3
  • 74
    • 0033296577
    • On updating problems in latent semantic indexing
    • H. Zha and H. Simon. On updating problems in latent semantic indexing. SIAM Journal on Scientific Computing, 21(2):782, 1999.
    • (1999) SIAM Journal on Scientific Computing , vol.21 , Issue.2 , pp. 782
    • Zha, H.1    Simon, H.2
  • 75
    • 0033633266
    • On matrices with low-rank-plus-shift structures: Partial SVD and latent semantic indexing
    • H. Zha and Z. Zhang. On matrices with low-rank-plus-shift structures: Partial SVD and latent semantic indexing. SIAM Journal Matrix Analysis and Applications, 21:522-536, 1999.
    • (1999) SIAM Journal Matrix Analysis and Applications , vol.21 , pp. 522-536
    • Zha, H.1    Zhang, Z.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.