-
1
-
-
84858049690
-
-
Html TIDY project. http://tidy.sourceforge.net/.
-
-
-
-
2
-
-
84859197607
-
Webtables: Exploring the power of tables on the web
-
M. J. Cafarella, E. Wu, A. Halevy, Y. Zhang, and D. Z. Wang. Webtables: Exploring the power of tables on the web. PVLDB, 2008.
-
(2008)
PVLDB
-
-
Cafarella, M.J.1
Wu, E.2
Halevy, A.3
Zhang, Y.4
Wang, D.Z.5
-
4
-
-
77950891804
-
Coupled semi-supervised learning for information extraction
-
A. Carlson, J. Betteridge, R. C. Wang, E. R. Hruschka, Jr., and T. M. Mitchell. Coupled semi-supervised learning for information extraction. In WSDM, 2010.
-
(2010)
WSDM
-
-
Carlson, A.1
Betteridge, J.2
Wang, R.C.3
Hruschka Jr., E.R.4
Mitchell, T.M.5
-
5
-
-
0002546287
-
Efficient algorithms for agglomerative hierarchical clustering methods
-
W. H. E. Day and H. Edelsbrunner. Efficient algorithms for agglomerative hierarchical clustering methods. In Journal of Classification, 1984.
-
(1984)
Journal of Classification
-
-
Day, W.H.E.1
Edelsbrunner, H.2
-
6
-
-
33746085343
-
Web-scale information extraction in knowitall: (Preliminary results)
-
O. Etzioni, M. Cafarella, D. Downey, S. Kok, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yates. Web-scale information extraction in knowitall: (preliminary results). In WWW, 2004.
-
(2004)
WWW
-
-
Etzioni, O.1
Cafarella, M.2
Downey, D.3
Kok, S.4
Popescu, A.-M.5
Shaked, T.6
Soderland, S.7
Weld, D.S.8
Yates, A.9
-
7
-
-
17644423946
-
Unsupervised named-entity extraction from the web: An experimental study
-
O. Etzioni, M. Cafarella, D. Downey, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yate. Unsupervised named-entity extraction from the web: An experimental study. In AI, 2005.
-
(2005)
AI
-
-
Etzioni, O.1
Cafarella, M.2
Downey, D.3
Popescu, A.-M.4
Shaked, T.5
Soderland, S.6
Weld, D.S.7
Yate, A.8
-
8
-
-
35348900845
-
Towards domain-independent information extraction from web tables
-
W. Gatterbauer, P. Bohunsky, M. Herzog, B. Krüpl, and B. Pollak. Towards domain-independent information extraction from web tables. In WWW, 2007.
-
(2007)
WWW
-
-
Gatterbauer, W.1
Bohunsky, P.2
Herzog, M.3
Krüpl, B.4
Pollak, B.5
-
9
-
-
79952384867
-
Answering table augmentation queries from unstructured lists on the web
-
R. Gupta and S. Sarawagi. Answering table augmentation queries from unstructured lists on the web. In VLDB, 2009.
-
(2009)
VLDB
-
-
Gupta, R.1
Sarawagi, S.2
-
10
-
-
79952397061
-
Joint training for open-domain extraction on the web: Exploiting overlap when supervision is limited
-
R. Gupta and S. Sarawagi. Joint training for open-domain extraction on the web: exploiting overlap when supervision is limited. In WSDM, 2011.
-
(2011)
WSDM
-
-
Gupta, R.1
Sarawagi, S.2
-
11
-
-
0012990385
-
Automatic acquisition of hyponyms from large text corpora
-
M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In ACL, 1992.
-
(1992)
ACL
-
-
Hearst, M.A.1
-
12
-
-
84858049685
-
Using anchor text, spam filtering and wikipedia for web search and entity ranking
-
J. Kamps, R. Kaptein, and M. Koolen. Using anchor text, spam filtering and wikipedia for web search and entity ranking. TREC, 2010.
-
(2010)
TREC
-
-
Kamps, J.1
Kaptein, R.2
Koolen, M.3
-
13
-
-
80053269546
-
A semi-supervised method to learn and construct taxonomies using the web
-
Z. Kozareva and E. Hovy. A semi-supervised method to learn and construct taxonomies using the web. In EMNLP, 2010.
-
(2010)
EMNLP
-
-
Kozareva, Z.1
Hovy, E.2
-
14
-
-
79960022996
-
Annotating and searching web tables using entities, types and relationships
-
G. Limaye, S. Sarawagi, and S. Chakrabarti. Annotating and searching web tables using entities, types and relationships. PVLDB, 2010.
-
(2010)
PVLDB
-
-
Limaye, G.1
Sarawagi, S.2
Chakrabarti, S.3
-
15
-
-
0242496755
-
Concept discovery from text
-
D. Lin and P. Pantel. Concept discovery from text. In COLING, 2002.
-
(2002)
COLING
-
-
Lin, D.1
Pantel, P.2
-
17
-
-
85117710380
-
Automatically labeling semantic classes
-
P. Pantel and D. Ravichandran. Automatically labeling semantic classes. In HLT-NAACL, 2004.
-
(2004)
HLT-NAACL
-
-
Pantel, P.1
Ravichandran, D.2
-
18
-
-
82055172940
-
Towards the web of concepts: Extracting concepts from large datasets
-
A. Parameswaran, H. Garcia-Molina, and A. Rajaraman. Towards the web of concepts: Extracting concepts from large datasets. In VLDB, 2010.
-
(2010)
VLDB
-
-
Parameswaran, A.1
Garcia-Molina, H.2
Rajaraman, A.3
-
20
-
-
70350543748
-
What is this, anyway: Automatic hypernym discovery
-
A. Ritter, S. Soderland, and O. Etzioni. What is this, anyway: Automatic hypernym discovery. In AAAI, 2009.
-
(2009)
AAAI
-
-
Ritter, A.1
Soderland, S.2
Etzioni, O.3
-
21
-
-
84912540476
-
Acquiring hyponymy relations from web documents
-
K. Shinzato and K. Torisawa. Acquiring hyponymy relations from web documents. In HLT-NAACL, 2004.
-
(2004)
HLT-NAACL
-
-
Shinzato, K.1
Torisawa, K.2
-
22
-
-
34247123828
-
Learning syntactic patterns for automatic hypernym discovery
-
R. Snow, D. Jurafsky, and A. Y. Ng. Learning syntactic patterns for automatic hypernym discovery. In NIPS, 2004.
-
(2004)
NIPS
-
-
Snow, R.1
Jurafsky, D.2
Ng, A.Y.3
-
23
-
-
80053360508
-
Cheap and fast - But is it good? evaluating non-expert annotations for natural language tasks
-
R. Snow, B. O'Connor, D. Jurafsky, and A. Y. Ng. Cheap and fast - but is it good? evaluating non-expert annotations for natural language tasks. In EMNLP, 2008.
-
(2008)
EMNLP
-
-
Snow, R.1
O'Connor, B.2
Jurafsky, D.3
Ng, A.Y.4
-
24
-
-
80053380980
-
Weakly-supervised acquisition of labeled class instances using graph random walks
-
P. P. Talukdar, J. Reisinger, M. Paşca, D. Ravichandran, R. Bhagat, and F. Pereira. Weakly-supervised acquisition of labeled class instances using graph random walks. In EMNLP, 2008.
-
(2008)
EMNLP
-
-
Talukdar, P.P.1
Reisinger, J.2
Paşca, M.3
Ravichandran, D.4
Bhagat, R.5
Pereira, F.6
-
26
-
-
84858058168
-
Finding cars, goddesses and enzymes: Parametrizable acquisition of labeled instances for open-domain information extraction
-
B. Van Durme and M. Pasca. Finding cars, goddesses and enzymes: parametrizable acquisition of labeled instances for open-domain information extraction. In AAAI, 2008.
-
(2008)
AAAI
-
-
Van Durme, B.1
Pasca, M.2
-
27
-
-
80053408112
-
Automatic set instance extraction using the web
-
R. C. Wang and W. W. Cohen. Automatic set instance extraction using the web. In ACL, 2009.
-
(2009)
ACL
-
-
Wang, R.C.1
Cohen, W.W.2
-
28
-
-
80053426759
-
Character-level analysis of semi-structured documents for set expansion
-
R. C. Wang and W. W. Cohen. Character-level analysis of semi-structured documents for set expansion. In EMNLP, 2009.
-
(2009)
EMNLP
-
-
Wang, R.C.1
Cohen, W.W.2
-
30
-
-
84873560385
-
Textrunner: Open information extraction on the web
-
A. Yates, M. Cafarella, M. Banko, O. Etzioni, M. Broadhead, and S. Soderland. Textrunner: Open information extraction on the web. In NAACL, 2007.
-
(2007)
NAACL
-
-
Yates, A.1
Cafarella, M.2
Banko, M.3
Etzioni, O.4
Broadhead, M.5
Soderland, S.6
|