-
2
-
-
85146424500
-
Sentence alignment for monolingual comparable corpora
-
R. Barzilay and N. Elhadad. Sentence alignment for monolingual comparable corpora. In EMNLP, 2003.
-
(2003)
EMNLP
-
-
Barzilay, R.1
Elhadad, N.2
-
3
-
-
77951430107
-
Distributional word clusters vs. words for text categorization
-
R. Bekkerman, R. El-Yaniv, N. Tishby, and Y. Winter. Distributional word clusters vs. words for text categorization. JMLR, 3:1183-1208, 2003.
-
(2003)
JMLR
, vol.3
, pp. 1183-1208
-
-
Bekkerman, R.1
El-Yaniv, R.2
Tishby, N.3
Winter, Y.4
-
4
-
-
0141607824
-
Latent Dirichlet allocation
-
D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. JMLR, 3:993-1022, 2003.
-
(2003)
JMLR
, vol.3
, pp. 993-1022
-
-
Blei, D.1
Ng, A.2
Jordan, M.3
-
5
-
-
0002636321
-
N-gram-based text categorization
-
48113
-
W. Cavnar, J. Trenkle, et al. N-gram-based text categorization. Ann Arbor MI, 48113(2):161-175.
-
Ann Arbor MI
, Issue.2
, pp. 161-175
-
-
Cavnar, W.1
Trenkle, J.2
-
6
-
-
80053280511
-
Semantic role labeling for open information extraction
-
J. Christensen, S. Soderland, O. Etzioni, et al. Semantic role labeling for open information extraction. In ACL HLT, 2010.
-
(2010)
ACL HLT
-
-
Christensen, J.1
Soderland, S.2
Etzioni, O.3
-
7
-
-
80053558787
-
Natural language processing (almost) from scratch
-
R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, and P. Kuksa. Natural language processing (almost) from scratch. JMLR, 2011.
-
(2011)
JMLR
-
-
Collobert, R.1
Weston, J.2
Bottou, L.3
Karlen, M.4
Kavukcuoglu, K.5
Kuksa, P.6
-
9
-
-
50649119439
-
Non-metric affinity propagation for unsupervised image categorization
-
D. Dueck and B. J. Frey. Non-metric affinity propagation for unsupervised image categorization. In ICCV, 2007.
-
(2007)
ICCV
-
-
Dueck, D.1
Frey, B.J.2
-
10
-
-
85105809948
-
Inductive learning algorithms and representations for text categorization
-
S. Dumais, J. Platt, D. Heckerman, and M. Sahami. Inductive learning algorithms and representations for text categorization. In CIKM, 1998.
-
(1998)
CIKM
-
-
Dumais, S.1
Platt, J.2
Heckerman, D.3
Sahami, M.4
-
11
-
-
45749146270
-
A density-based algorithm for discovering clusters in large spatial databases with noise
-
M. Ester, H. Kriegel, J. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In SIGKDD, 1996.
-
(1996)
SIGKDD
-
-
Ester, M.1
Kriegel, H.2
Sander, J.3
Xu, X.4
-
14
-
-
0040076126
-
Automatic labeling of semantic roles
-
D. Gildea and D. Jurafsky. Automatic labeling of semantic roles. Computational Linguistics, 28(3):245-288, 2002.
-
(2002)
Computational Linguistics
, vol.28
, Issue.3
, pp. 245-288
-
-
Gildea, D.1
Jurafsky, D.2
-
15
-
-
8644230223
-
Locality preserving indexing for document representation
-
X. He, D. Cai, H. Liu, and W.-Y. Ma. Locality preserving indexing for document representation. In SIGIR, 2004.
-
(2004)
SIGIR
-
-
He, X.1
Cai, D.2
Liu, H.3
Ma, W.-Y.4
-
16
-
-
0034818212
-
Unsupervised learning by probabilistic latent semantic analysis
-
T. Hofmann. Unsupervised learning by probabilistic latent semantic analysis. Machine Learning, 42(1):177-196, 2001.
-
(2001)
Machine Learning
, vol.42
, Issue.1
, pp. 177-196
-
-
Hofmann, T.1
-
17
-
-
68949195307
-
Extracting structural paraphrases from aligned monolingual corpora
-
A. Ibrahim, B. Katz, and J. Lin. Extracting structural paraphrases from aligned monolingual corpora. In ACL, 2003.
-
(2003)
ACL
-
-
Ibrahim, A.1
Katz, B.2
Lin, J.3
-
18
-
-
0242625250
-
Simrank: A measure of structural-context similarity
-
G. Jeh and J. Widom. Simrank: a measure of structural-context similarity. In SIGKDD, 2002.
-
(2002)
SIGKDD
-
-
Jeh, G.1
Widom, J.2
-
19
-
-
8644246887
-
Text classification and named entities for new event detection
-
G. Kumaran and J. Allan. Text classification and named entities for new event detection. In SIGIR, 2004.
-
(2004)
SIGIR
-
-
Kumaran, G.1
Allan, J.2
-
21
-
-
85185398851
-
Phrase clustering for discriminative learning
-
D. Lin and X. Wu. Phrase clustering for discriminative learning. In ACL, 2009.
-
(2009)
ACL
-
-
Lin, D.1
Wu, X.2
-
23
-
-
33745771177
-
Document clustering using character n-grams: A comparative evaluation with term-based and word-based clustering
-
Y. Miao, V. Kešelj, and E. Milios. Document clustering using character n-grams: a comparative evaluation with term-based and word-based clustering. In CIKM, 2005.
-
(2005)
CIKM
-
-
Miao, Y.1
Kešelj, V.2
Milios, E.3
-
24
-
-
33750057514
-
Corpus-based and knowledge-based measures of text semantic similarity
-
R. Mihalcea, C. Corley, and C. Strapparava. Corpus-based and knowledge-based measures of text semantic similarity. In AAAI, 2006.
-
(2006)
AAAI
-
-
Mihalcea, R.1
Corley, C.2
Strapparava, C.3
-
26
-
-
35048879815
-
Complex linguistic features for text classification: A comprehensive study
-
A. Moschitti and R. Basili. Complex linguistic features for text classification: A comprehensive study. Advances in Information Retrieval, pages 181-196, 2004.
-
(2004)
Advances in Information Retrieval
, pp. 181-196
-
-
Moschitti, A.1
Basili, R.2
-
27
-
-
33645983416
-
The proposition bank: An annotated corpus of semantic roles
-
M. Palmer, D. Gildea, and P. Kingsbury. The proposition bank: An annotated corpus of semantic roles. Computational Linguistics, 31(1):71-106, 2005.
-
(2005)
Computational Linguistics
, vol.31
, Issue.1
, pp. 71-106
-
-
Palmer, M.1
Gildea, D.2
Kingsbury, P.3
-
28
-
-
0010836411
-
Feature engineering for text classification
-
S. Scott and S. Matwin. Feature engineering for text classification. In ICML, 1999.
-
(1999)
ICML
-
-
Scott, S.1
Matwin, S.2
-
29
-
-
0002442796
-
Machine learning in automated text categorization
-
F. Sebastiani. Machine learning in automated text categorization. CSUR, 34(1):1-47, 2002.
-
(2002)
CSUR
, vol.34
, Issue.1
, pp. 1-47
-
-
Sebastiani, F.1
-
30
-
-
79960232116
-
Using semantic roles to improve question answering
-
D. Shen and M. Lapata. Using semantic roles to improve question answering. In EMNLP, 2007.
-
(2007)
EMNLP
-
-
Shen, D.1
Lapata, M.2
-
31
-
-
84862271526
-
Intrumine: Mining intruders in untrustworthy data of cyber-physical systems
-
L.-A. Tang, Q. Gu, X. Yu, J. Han, T. La Porta, A. Leung, T. Abdelzaher, and L. Kaplan. Intrumine: mining intruders in untrustworthy data of cyber-physical systems. In SDM, 2012.
-
(2012)
SDM
-
-
Tang, L.-A.1
Gu, Q.2
Yu, X.3
Han, J.4
La Porta, T.5
Leung, A.6
Abdelzaher, T.7
Kaplan, L.8
-
32
-
-
84859893667
-
Multi-document summarization using sentence-based topic models
-
D. Wang, S. Zhu, T. Li, and Y. Gong. Multi-document summarization using sentence-based topic models. In ACL-IJCNLP, 2009.
-
(2009)
ACL-IJCNLP
-
-
Wang, D.1
Zhu, S.2
Li, T.3
Gong, Y.4
-
33
-
-
84863769781
-
A Bayesian approach to discovering truth from conflicting sources for data integration
-
B. Zhao, B. I. Rubinstein, J. Gemmell, and J. Han. A Bayesian approach to discovering truth from conflicting sources for data integration. VLDB, 5(6):550-561, 2012.
-
(2012)
VLDB
, vol.5
, Issue.6
, pp. 550-561
-
-
Zhao, B.1
Rubinstein, B.I.2
Gemmell, J.3
Han, J.4
-
34
-
-
74549181082
-
P-rank: A comprehensive structural similarity measure over information networks
-
P. Zhao, J. Han, and Y. Sun. P-rank: a comprehensive structural similarity measure over information networks. In CIKM, 2009.
-
(2009)
CIKM
-
-
Zhao, P.1
Han, J.2
Sun, Y.3