-
1
-
-
84873472412
-
The online infinite topic-cluster model: Storylines from streaming text
-
Under review
-
The online infinite topic-cluster model: Storylines from streaming text. AISTATS, 2011. Under review.
-
(2011)
AISTATS
-
-
-
2
-
-
84873446053
-
-
CMU-ML-11-100
-
A. Ahmed, Q. Ho, C. Teo, J. Eisenstein, A. J. Smola, and E.P. Xing. The online infinite topic-cluster model: Storylines from streaming text. CMU-ML-11-100, 2011.
-
(2011)
The Online Infinite Topic-cluster Model: Storylines from Streaming Text
-
-
Ahmed, A.1
Ho, Q.2
Teo, C.3
Eisenstein, J.4
Smola, A.J.5
Xing, E.P.6
-
3
-
-
52649163923
-
Dynamic non-parametric mixture models and the recurrent Chinese restaurant process: With applications to evolutionary clustering
-
A. Ahmed and E.P. Xing. Dynamic non-parametric mixture models and the recurrent Chinese restaurant process: With applications to evolutionary clustering. In SDM, 2008.
-
(2008)
SDM
-
-
Ahmed, A.1
Xing, E.P.2
-
4
-
-
56349095491
-
-
N. Ailon, M. Charikar, and A. Newman. Aggregating inconsistent information: Ranking and clustering. Journal of the ACM, 55(5):1-27, 2008.
-
(2008)
Aggregating Inconsistent Information: Ranking and Clustering. Journal of the ACM
, vol.55
, Issue.5
, pp. 1-27
-
-
Ailon, N.1
Charikar, M.2
Newman, A.3
-
6
-
-
0001924742
-
Topic detection and tracking pilot study: Final report
-
J. Allan, J. Carbonell, G. Doddington, J. Yamron, and Y. Yang. Topic detection and tracking pilot study: Final report. In DARPA News Understanding Workshop, 1998.
-
(1998)
DARPA News Understanding Workshop
-
-
Allan, J.1
Carbonell, J.2
Doddington, G.3
Yamron, J.4
Yang, Y.5
-
7
-
-
85109214302
-
First story detection in TDT is hard
-
J. Allan, V. Lavrenko, and H. Jin. First story detection in TDT is hard. In CIKM, pages 374-381, 2000.
-
(2000)
CIKM
, pp. 374-381
-
-
Allan, J.1
Lavrenko, V.2
Jin, H.3
-
8
-
-
0000708831
-
Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems
-
C. E. Antoniak. Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. The Annals of Statistics, 2(6):1152-1174, 1974.
-
(1974)
The Annals of Statistics
, vol.2
, Issue.6
, pp. 1152-1174
-
-
Antoniak, C.E.1
-
9
-
-
70449126967
-
Topic models over text streams: A study of batch and online unsupervised learning
-
A. Banerjee and S. Basu. Topic models over text streams: A study of batch and online unsupervised learning. In Proceedings of SDM, 2007.
-
(2007)
Proceedings of SDM
-
-
Banerjee, A.1
Basu, S.2
-
10
-
-
33749242628
-
Dynamic topic models
-
In W. W. Cohen and A. Moore, editors
-
D. Blei and J. Lafferty. Dynamic topic models. In W. W. Cohen and A. Moore, editors, ICML, 2006.
-
(2006)
ICML
-
-
Blei, D.1
Lafferty, J.2
-
11
-
-
0141607824
-
Latent Dirichlet allocation
-
D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. JMLR, 3:993-1022, 2003.
-
(2003)
JMLR
, vol.3
, pp. 993-1022
-
-
Blei, D.1
Ng, A.2
Jordan, M.3
-
12
-
-
77956196951
-
Online inference of topics with latent Dirichlet allocation
-
K. R. Canini, L. Shi, and T. L. Griffiths. Online inference of topics with latent Dirichlet allocation. In AISTATS, 2009.
-
(2009)
AISTATS
-
-
Canini, K.R.1
Shi, L.2
Griffiths, T.L.3
-
13
-
-
84863381525
-
Reading tea leaves: How humans interpret topic models
-
J. Chang, J. Boyd-Graber, S. Gerrish, C. Wang, and D. Blei. Reading tea leaves: How humans interpret topic models. In Neural Information Processing Systems, 2009.
-
(2009)
Neural Information Processing Systems
-
-
Chang, J.1
Boyd-Graber, J.2
Gerrish, S.3
Wang, C.4
Blei, D.5
-
14
-
-
33745628977
-
UMass at TDT 2004
-
M. Connell, A. Feng, G. Kumaran, H. Raghavan, C. Shah, and J. Allan. UMass at TDT 2004. In TDT 2004 Workshop Proceedings, 2004.
-
(2004)
TDT 2004 Workshop Proceedings
-
-
Connell, M.1
Feng, A.2
Kumaran, G.3
Raghavan, H.4
Shah, C.5
Allan, J.6
-
16
-
-
71149085755
-
Accounting for burstiness in topic models
-
G. Doyle and C. Elkan. Accounting for burstiness in topic models. In ICML, 2009.
-
(2009)
ICML
-
-
Doyle, G.1
Elkan, C.2
-
17
-
-
84950937290
-
Bayesian density estimation and inference using mixtures
-
M. Escobar and M. West. Bayesian density estimation and inference using mixtures. JASA, 90, 1995.
-
(1995)
JASA
, vol.90
-
-
Escobar, M.1
West, M.2
-
18
-
-
84859918687
-
Incorporating non-local information into information extraction systems by Gibbs sampling
-
J. R. Finkel, T. Grenager, and C. Manning. Incorporating non-local information into information extraction systems by Gibbs sampling. In ACL, pages 363-370, 2005.
-
(2005)
ACL
, pp. 363-370
-
-
Finkel, J.R.1
Grenager, T.2
Manning, C.3
-
19
-
-
15044355327
-
Similarity search in high dimensions via hashing
-
A. Gionis, P. Indyk, and R. Motwani. Similarity Search in High Dimensions via Hashing. In VLDB, 1999.
-
(1999)
VLDB
-
-
Gionis, A.1
Indyk, P.2
Motwani, R.3
-
20
-
-
0003067623
-
Scalable techniques for clustering the web
-
T. Haveliwala, A. Gionis, and P. Indyk. Scalable Techniques for Clustering the Web. In WebDB, 2000.
-
(2000)
WebDB
-
-
Haveliwala, T.1
Gionis, A.2
Indyk, P.3
-
21
-
-
8644246887
-
Text classification and named entities for new event detection
-
G. Kumaran and J. Allan. Text classification and named entities for new event detection. In SIGIR, 2004.
-
(2004)
SIGIR
-
-
Kumaran, G.1
Allan, J.2
-
22
-
-
77951202623
-
Topic models conditioned on arbitrary features with Dirichlet-multinomial regression
-
D. M. Mimno and A. McCallum. Topic models conditioned on arbitrary features with Dirichlet-multinomial regression. In UAI, 2008.
-
(2008)
UAI
-
-
Mimno, D.M.1
McCallum, A.2
-
23
-
-
0141596527
-
Estimating a Dirichlet distribution
-
T. P. Minka. Estimating a Dirichlet distribution. Technical report, MIT, 2003.
-
(2003)
Technical Report, MIT
-
-
Minka, T.P.1
-
24
-
-
33749547161
-
Statistical entity-topic models
-
New York, NY, USA
-
D. Newman, C. Chemudugunta, and P. Smyth. Statistical entity-topic models. In KDD, pages 680-686, New York, NY, USA, 2006.
-
(2006)
KDD
, pp. 680-686
-
-
Newman, D.1
Chemudugunta, C.2
Smyth, P.3
-
25
-
-
80053272732
-
Streaming first story detection with application to Twitter
-
S. Petrovic, M. Osborne, and V. Lavrenko. Streaming first story detection with application to Twitter. In NAACL, 2010.
-
(2010)
NAACL
-
-
Petrovic, S.1
Osborne, M.2
Lavrenko, V.3
-
26
-
-
84873473178
-
-
NIST. http://www.itl.nist.gov/iad/mig/tests/tdt/2004/workshop.html.
-
NIST
-
-
-
27
-
-
21844504293
-
Exchangeable and partially exchangeable random partitions
-
J. Pitman. Exchangeable and partially exchangeable random partitions. Probability Theory, 102(2), 1995.
-
(1995)
Probability Theory
, vol.102
, Issue.2
-
-
Pitman, J.1
-
30
-
-
33749565782
-
Topics over time: A non-Markov continuous-time model of topical trends
-
X. Wang and A. McCallum. Topics over time: A non-Markov continuous-time model of topical trends. In KDD, 2006.
-
(2006)
KDD
-
-
Wang, X.1
McCallum, A.2
-
31
-
-
70350681184
-
Efficient methods for topic model inference on streaming document collections
-
L. Yao, D. Mimno, and A. McCallum. Efficient methods for topic model inference on streaming document collections. In KDD, pages 937-946, 2009.
-
(2009)
KDD
, pp. 937-946
-
-
Yao, L.1
Mimno, D.2
McCallum, A.3
-
32
-
-
84862635248
-
Dirichlet enhanced latent semantic analysis
-
K. Yu, S. Yu, and V. Tresp. Dirichlet enhanced latent semantic analysis. In AISTATS, 2005.
-
(2005)
AISTATS
-
-
Yu, K.1
Yu, S.2
Tresp, V.3
-
33
-
-
29144497253
-
A probabilistic model for online document clustering with application to novelty detection
-
J. Zhang, Y. Yang, and Z. Ghahramani. A probabilistic model for online document clustering with application to novelty detection. In NIPS, 2004.
-
(2004)
NIPS
-
-
Zhang, J.1
Yang, Y.2
Ghahramani, Z.3
-
34
-
-
80053428576
-
Resolving surface forms to Wikipedia topics
-
Y. Zhou, L. Nie, O. Rouhani-Kalleh, F. Vasile, and S. Gaffney. Resolving surface forms to Wikipedia topics. In COLING, pages 1335-1343, 2010.
-
(2010)
COLING
, pp. 1335-1343
-
-
Zhou, Y.1
Nie, L.2
Rouhani-Kalleh, O.3
Vasile, F.4
Gaffney, S.5