메뉴 건너뛰기




Volumn , Issue , 2011, Pages 267-276

Unified analysis of streaming news

Author keywords

Dirichlet processes; Online inference; Topic models

Indexed keywords

DATA SETS; DIRICHLET PROCESS; DYNAMIC DATA; HYBRID CLUSTERING; INFERENCE ALGORITHM; KEY ENTITY; MEMORY COST; NEWS ARTICLES; ONLINE INFERENCE; SEQUENTIAL MONTE CARLO; STORYLINES; TEMPORAL STRUCTURES; TOPIC MODEL; UNIFIED ANALYSIS; UNIFIED FRAMEWORK;

EID: 84862281878     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1963405.1963445     Document Type: Conference Paper
Times cited : (79)

References (34)
  • 1
    • 84873472412 scopus 로고    scopus 로고
    • The online infinite topic-cluster model: Storylines from streaming text
    • Under review
    • The online infinite topic-cluster model: Storylines from streaming text. AISTATS, 2011. Under review.
    • (2011) AISTATS
  • 3
    • 52649163923 scopus 로고    scopus 로고
    • Dynamic non-parametric mixture models and the recurrent chinese restaurant process: With applications to evolutionary clustering
    • A. Ahmed and E.P. Xing. Dynamic non-parametric mixture models and the recurrent chinese restaurant process: with applications to evolutionary clustering. In SDM, 2008.
    • (2008) SDM
    • Ahmed, A.1    Xing, E.P.2
  • 7
    • 85109214302 scopus 로고    scopus 로고
    • First story detection in TDT is hard
    • J. Allan, V. Lavrenko, and H. Jin. First story detection in TDT is hard. In CIKM, 374-381, 2000.
    • (2000) CIKM , pp. 374-381
    • Allan, J.1    Lavrenko, V.2    Jin, H.3
  • 8
    • 0000708831 scopus 로고
    • Mixtures of dirichlet processes with applications to bayesian nonparametric problems
    • C. E. Antoniak. Mixtures of dirichlet processes with applications to bayesian nonparametric problems. The Annals of Statistics, 2(6):1152-1174, 1974.
    • (1974) The Annals of Statistics , vol.2 , Issue.6 , pp. 1152-1174
    • Antoniak, C.E.1
  • 9
    • 70449126967 scopus 로고    scopus 로고
    • Topic models over text streams: A study of batch and online unsupervised learning
    • A. Banerjee and S. Basu. Topic models over text streams: A study of batch and online unsupervised learning. In Proceedings of SDM, 2007.
    • (2007) Proceedings of SDM
    • Banerjee, A.1    Basu, S.2
  • 10
    • 33749242628 scopus 로고    scopus 로고
    • Dynamic topic models
    • In W. W. Cohen and A. Moore, editors
    • D. Blei and J. Lafferty. Dynamic topic models. In W. W. Cohen and A. Moore, editors, ICML, 2006.
    • (2006) ICML
    • Blei, D.1    Lafferty, J.2
  • 11
    • 0141607824 scopus 로고    scopus 로고
    • Latent dirichlet allocation
    • D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. JMLR, 3:993-1022, 2003.
    • (2003) JMLR , vol.3 , pp. 993-1022
    • Blei, D.1    Ng, A.2    Jordan, M.3
  • 12
    • 77956196951 scopus 로고    scopus 로고
    • Online inference of topics with latent dirichlet allocation
    • K. R. Canini, L. Shi, and T. L. Griffiths. Online inference of topics with latent dirichlet allocation. In AISTATS, 2009.
    • (2009) AISTATS
    • Canini, K.R.1    Shi, L.2    Griffiths, T.L.3
  • 16
    • 71149085755 scopus 로고    scopus 로고
    • Accounting for burstiness in topic models
    • G. Doyle and C. Elkan. Accounting for burstiness in topic models. In ICML, 2009.
    • (2009) ICML
    • Doyle, G.1    Elkan, C.2
  • 17
    • 84950937290 scopus 로고
    • Bayesian density estimation and inference using mixtures
    • M. Escobar and M. West. Bayesian density estimation and inference using mixtures. JASA 90, 1995.
    • (1995) JASA , vol.90
    • Escobar, M.1    West, M.2
  • 18
    • 84859918687 scopus 로고    scopus 로고
    • Incorporating non-local information into information extraction systems by gibbs sampling
    • J. R. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by gibbs sampling. In ACL, pages 363-370, 2005.
    • (2005) ACL , pp. 363-370
    • Finkel, J.R.1    Grenager, T.2    Manning, C.3
  • 19
    • 15044355327 scopus 로고    scopus 로고
    • Similarity search in high dimensions via hashing
    • A. Gionis, P. Indyk, and R. Motwani. Similarity Search in High Dimensions via Hashing. In VLDB, 1999.
    • (1999) VLDB
    • Gionis, A.1    Indyk, P.2    Motwani, R.3
  • 20
    • 0003067623 scopus 로고    scopus 로고
    • Scalable techniques for clustering the web
    • T. Haveliwala, A. Gionis, and P. Indyk. Scalable Techniques for Clustering the Web. In WebDB, 2000.
    • (2000) WebDB
    • Haveliwala, T.1    Gionis, A.2    Indyk, P.3
  • 21
    • 8644246887 scopus 로고    scopus 로고
    • Text classification and named entities for new event detection
    • G. Kumaran and J. Allan. Text classification and named entities for new event detection. In SIGIR, 2004.
    • (2004) SIGIR
    • Kumaran, G.1    Allan, J.2
  • 22
    • 77951202623 scopus 로고    scopus 로고
    • Topic models conditioned on arbitrary features with dirichlet-multinomial regression
    • D. M. Mimno and A. McCallum. Topic models conditioned on arbitrary features with dirichlet-multinomial regression. In UAI, 2008.
    • (2008) UAI
    • Mimno, D.M.1    McCallum, A.2
  • 23
    • 0141596527 scopus 로고    scopus 로고
    • Estimating a Dirichlet distribution
    • T. P. Minka. Estimating a Dirichlet distribution. Technical report, MIT, 2003.
    • (2003) Technical Report, MIT
    • Minka, T.P.1
  • 24
    • 33749547161 scopus 로고    scopus 로고
    • Statistical entity-topic models
    • New York, NY, USA
    • D. Newman, C. Chemudugunta, and P. Smyth. Statistical entity-topic models. In KDD, pages 680-686, New York, NY, USA, 2006.
    • (2006) KDD , pp. 680-686
    • Newman, D.1    Chemudugunta, C.2    Smyth, P.3
  • 25
    • 80053272732 scopus 로고    scopus 로고
    • Streaming first story detection with application to twitter
    • S. Petrovic, M. Osborne and V. Lavrenko. Streaming First Story Detection with application to Twitter. In NAACL, 2010.
    • (2010) NAACL
    • Petrovic, S.1    Osborne, M.2    Lavrenko, V.3
  • 26
    • 84873473178 scopus 로고    scopus 로고
    • NIST. http://www.itl.nist.gov/iad/mig/tests/tdt/2004/workshop.html.
    • NIST
  • 27
    • 21844504293 scopus 로고
    • Exchangeable and partially exchangeable random partitions
    • J. Pitman. Exchangeable and partially exchangeable random partitions. Probability Theory, 102(2), 1995.
    • (1995) Probability Theory , vol.102 , Issue.2
    • Pitman, J.1
  • 30
    • 33749565782 scopus 로고    scopus 로고
    • Topics over time: A non-markov continuous-time model of topical trends
    • X. Wang and A. McCallum. Topics over time: A non-markov continuous-time model of topical trends. In KDD, 2006.
    • (2006) KDD
    • Wang, X.1    McCallum, A.2
  • 31
    • 70350681184 scopus 로고    scopus 로고
    • Efficient methods for topic model inference on streaming document collections
    • L. Yao, D. Mimno, and A. McCallum. Efficient methods for topic model inference on streaming document collections. In KDD, pages 937-946, 2009.
    • (2009) KDD , pp. 937-946
    • Yao, L.1    Mimno, D.2    McCallum, A.3
  • 32
    • 84862635248 scopus 로고    scopus 로고
    • Dirichlet enhanced latent semantic analysis
    • K. Yu, S. Yu, and V. Tresp. Dirichlet enhanced latent semantic analysis. In AISTATS, 2005.
    • (2005) AISTATS
    • Yu, K.1    Yu, S.2    Tresp, V.3
  • 33
    • 29144497253 scopus 로고    scopus 로고
    • A probabilistic model for online document clustering with application to novelty detection
    • J. Zhang, Y. Yang, and Z. Ghahramani. A probabilistic model for online document clustering with application to novelty detection. In NIPS, 2004.
    • (2004) NIPS
    • Zhang, J.1    Yang, Y.2    Ghahramani, Z.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.