-
1
-
-
1142303684
-
Extracting structured data from web pages
-
A. Arasu and H. Garcia-Molina. Extracting structured data from web pages. In SIGMOD, pages 337-348, 2003.
-
(2003)
SIGMOD
, pp. 337-348
-
-
Arasu, A.1
Garcia-Molina, H.2
-
2
-
-
84880902141
-
Open information extraction from the web
-
M. Banko, M. J. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni. Open information extraction from the web. In IJCAI, pages 2670-2676, 2007.
-
(2007)
IJCAI
, pp. 2670-2676
-
-
Banko, M.1
Cafarella, M.J.2
Soderland, S.3
Broadhead, M.4
Etzioni, O.5
-
3
-
-
43349106811
-
Flint: Google-basing the web
-
L. Blanco, V. Crescenzi, P. Merialdo, and P. Papotti. Flint: Google-basing the web. In EDBT, pages 720-724, 2008.
-
(2008)
EDBT
, pp. 720-724
-
-
Blanco, L.1
Crescenzi, V.2
Merialdo, P.3
Papotti, P.4
-
4
-
-
84859197607
-
Webtables: exploring the power of tables on the web
-
M. J. Cafarella, A. Halevy, D. Z. Wang, E. Wu, and Y. Zhang. Webtables: exploring the power of tables on the web. VLDB, 1(1):538-549, 2008.
-
(2008)
VLDB
, vol.1
, Issue.1
, pp. 538-549
-
-
Cafarella, M.J.1
Halevy, A.2
Wang, D.Z.3
Wu, E.4
Zhang, Y.5
-
6
-
-
84944327150
-
Roadrunner: Towards automatic data extraction from large web sites
-
V. Crescenzi, G. Mecca, and P. Merialdo. Roadrunner: Towards automatic data extraction from large web sites. In VLDB, pages 109-118, 2001.
-
(2001)
VLDB
, pp. 109-118
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
7
-
-
70350625343
-
A web of concepts (keynote)
-
June
-
N. Dalvi, R. Kumar, B. Pang, R. Ramakrishnan, A. Tomkins, P. Bohannon, S. Keerthi, and S. Merugu. A web of concepts (keynote). In PODS, pages 1-12, June 2009.
-
(2009)
PODS
, pp. 1-12
-
-
Dalvi, N.1
Kumar, R.2
Pang, B.3
Ramakrishnan, R.4
Tomkins, A.5
Bohannon, P.6
Keerthi, S.7
Merugu, S.8
-
8
-
-
84861026711
-
Automatic wrappers for large scale web extraction
-
N. Dalvi, R. Kumar, and M. A. Soliman. Automatic wrappers for large scale web extraction. PVLDB, 4(4):219-230, 2011.
-
(2011)
PVLDB
, vol.4
, Issue.4
, pp. 219-230
-
-
Dalvi, N.1
Kumar, R.2
Soliman, M.A.3
-
9
-
-
85011016190
-
-
Building structured web community portals: A top-down, compositional, and incremental approach
-
P. DeRose, W. Shen, F. Chen, A. Doan, and R. Ramakrishnan. Building structured web community portals: A top-down, compositional, and incremental approach. In VLDB, pages 399-410, 2007.
-
(2007)
VLDB
, pp. 399-410
-
-
DeRose, P.1
Shen, W.2
Chen, F.3
Doan, A.4
Ramakrishnan, R.5
-
10
-
-
77954301186
-
Harvesting relational tables from lists on the web
-
H. Elmeleegy, J. Madhavan, and A. Y. Halevy. Harvesting relational tables from lists on the web. PVLDB, 2(1):1078-1089, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.1
, pp. 1078-1089
-
-
Elmeleegy, H.1
Madhavan, J.2
Halevy, A.Y.3
-
11
-
-
17644418833
-
Web-scale information extraction in Knowitall: (preliminary results)
-
O. Etzioni, M. Cafarella, D. Downey, S. Kok, A.-M. Popescu, T. Shaked, S. Soderland, D. S. Weld, and A. Yates. Web-scale information extraction in Knowitall: (preliminary results). In WWW, pages 100-110, 2004.
-
(2004)
WWW
, pp. 100-110
-
-
Etzioni, O.1
Cafarella, M.2
Downey, D.3
Kok, S.4
Popescu, A.-M.5
Shaked, T.6
Soderland, S.7
Weld, D.S.8
Yates, A.9
-
12
-
-
77950915077
-
-
Understanding deja reviewers
-
E. Gilbert and K. Karahalios. Understanding deja reviewers. In CSCW, pages 225-228, 2010.
-
(2010)
CSCW
, pp. 225-228
-
-
Gilbert, E.1
Karahalios, K.2
-
13
-
-
77950954197
-
Anatomy of the long tail: ordinary people with extraordinary tastes
-
S. Goel, A. Broder, E. Gabrilovich, and B. Pang. Anatomy of the long tail: ordinary people with extraordinary tastes. In WSDM, pages 201-210, 2010.
-
(2010)
WSDM
, pp. 201-210
-
-
Goel, S.1
Broder, A.2
Gabrilovich, E.3
Pang, B.4
-
14
-
-
85039691502
-
-
Google sets
-
Google sets: http://labs.google.com/sets.
-
-
-
-
15
-
-
84055203904
-
Exploiting content redundancy for web information extraction
-
PVLDB
-
P. Gulhane, R. Rastogi, S. H. Sengamedu, and A. Tengli. Exploiting content redundancy for web information extraction. PVLDB, 3(1):578-587, 2010.
-
(2010)
, vol.3
, Issue.1
, pp. 578-587
-
-
Gulhane, P.1
Rastogi, R.2
Sengamedu, S.H.3
Tengli, A.4
-
16
-
-
79952384867
-
Answering table augmentation queries from unstructured lists on the web
-
R. Gupta and S. Sarawagi. Answering table augmentation queries from unstructured lists on the web. In VLDB, pages 289-300, 2009.
-
(2009)
VLDB
, pp. 289-300
-
-
Gupta, R.1
Sarawagi, S.2
-
17
-
-
79952427109
-
Collective extraction from heterogeneous web lists
-
A. Machanavajjhala, A. S. Iyer, P. Bohannon, and S. Merugu. Collective extraction from heterogeneous web lists. In WSDM, pages 445-454, 2011.
-
(2011)
WSDM
, pp. 445-454
-
-
Machanavajjhala, A.1
Iyer, A.S.2
Bohannon, P.3
Merugu, S.4
-
19
-
-
63449110275
-
More like these: growing entity classes from seeds
-
L. Sarmento, V. Jijkoun, M. de Rijke, and E. Oliveira. "more like these": growing entity classes from seeds. In CIKM, pages 959-962, 2007.
-
(2007)
CIKM
, pp. 959-962
-
-
Sarmento, L.1
Jijkoun, V.2
de Rijke, M.3
Oliveira, E.4
-
20
-
-
67650153068
-
Automatic wrapper induction from hidden-web sources with domain knowledge
-
P. Senellart, A. Mittal, D. Muschick, R. Gilleron, and M. Tommasi. Automatic wrapper induction from hidden-web sources with domain knowledge. In WIDM, pages 9-16, 2008.
-
(2008)
WIDM
, pp. 9-16
-
-
Senellart, P.1
Mittal, A.2
Muschick, D.3
Gilleron, R.4
Tommasi, M.5
-
21
-
-
67049109676
-
Iterative set expansion of named entities using the web
-
R. C. Wang and W. Cohen. Iterative set expansion of named entities using the web. In ICDM, pages 1091-1096, 2008.
-
(2008)
ICDM
, pp. 1091-1096
-
-
Wang, R.C.1
Cohen, W.2
|