-
1
-
-
0033652943
-
-
Snowball: extracting relations from large plain-text collections. In DL '00
-
E. Agichtein and L. Gravano. Snowball: extracting relations from large plain-text collections. In DL '00, 2000.
-
(2000)
-
-
Agichtein, E.1
Gravano, L.2
-
2
-
-
85039640035
-
Automatically constructing semantic web services from online sources.
-
J. Ambite, S. Darbha, A. Goel, C. Knoblock, K. Lerman, R. Parundekar, and T. Russ. Automatically constructing semantic web services from online sources. In ISWC, 2009.
-
(2009)
ISWC
-
-
Ambite, J.1
Darbha, S.2
Goel, A.3
Knoblock, C.4
Lerman, K.5
Parundekar, R.6
Russ, T.7
-
3
-
-
1142303684
-
Extracting structured data from web pages.
-
A. Arasu and H. Garcia-Molina. Extracting structured data from web pages. In SIGMOD, 2003.
-
(2003)
SIGMOD
-
-
Arasu, A.1
Garcia-Molina, H.2
-
4
-
-
84880902141
-
Open information extraction from the web.
-
M. Banko, M. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni. Open information extraction from the web. In IJCAI, 2007.
-
(2007)
IJCAI
-
-
Banko, M.1
Cafarella, M.2
Soderland, S.3
Broadhead, M.4
Etzioni, O.5
-
5
-
-
84860485957
-
Generic schema matching, ten years later.
-
P. A. Bernstein, J. Madhavan, and E. Rahm. Generic schema matching, ten years later. PVLDB, 4(11), 2011.
-
(2011)
PVLDB
, vol.4
, Issue.11
-
-
Bernstein, P.A.1
Madhavan, J.2
Rahm, E.3
-
6
-
-
77951136761
-
Supporting the automatic construction of entity aware search engines.
-
L. Blanco, V. Crescenzi, P. Merialdo, and P. Papotti. Supporting the automatic construction of entity aware search engines. In WIDM, 2008.
-
(2008)
WIDM
-
-
Blanco, L.1
Crescenzi, V.2
Merialdo, P.3
Papotti, P.4
-
7
-
-
79955068748
-
Probabilistic models to reconcile complex data from inaccurate data sources.
-
L. Blanco, V. Crescenzi, P. Merialdo, and P. Papotti. Probabilistic models to reconcile complex data from inaccurate data sources. In CAiSE, 2010.
-
(2010)
CAiSE
-
-
Blanco, L.1
Crescenzi, V.2
Merialdo, P.3
Papotti, P.4
-
8
-
-
85039641957
-
-
Extraction and integration of partially overlapping web sources. Tech. rep., DIA -Roma Tre -TR201, Dec. 2012.
-
M. Bronzi, V. Crescenzi, P. Merialdo, and P. Papotti. Extraction and integration of partially overlapping web sources. Tech. rep., DIA -Roma Tre -TR201, Dec. 2012.
-
-
-
Bronzi, M.1
Crescenzi, V.2
Merialdo, P.3
Papotti, P.4
-
10
-
-
84859197607
-
Webtables: exploring the power of tables on the web.
-
M. J. Cafarella, A. Y. Halevy, D. Z. Wang, E. Wu, and Y. Zhang. Webtables: exploring the power of tables on the web. PVLDB, 1(1), 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
-
-
Cafarella, M.J.1
Halevy, A.Y.2
Wang, D.Z.3
Wu, E.4
Zhang, Y.5
-
11
-
-
38349193107
-
Learning semantic definitions of online information sources.
-
M. J. Carman and C. A. Knoblock. Learning semantic definitions of online information sources. J. Artif. Int. Res., 30(1):1-50, 2007.
-
(2007)
J. Artif. Int. Res.
, vol.30
, Issue.1
, pp. 1-50
-
-
Carman, M.J.1
Knoblock, C.A.2
-
12
-
-
33748336500
-
A survey of web information extraction systems.
-
C.-H. Chang, M. Kayed, M. R. Girgis, and K. F. Shaalan. A survey of web information extraction systems. IEEE Trans. Knowl. Data Eng., 18(10):1411-1428, 2006.
-
(2006)
IEEE Trans. Knowl. Data Eng.
, vol.18
, Issue.10
, pp. 1411-1428
-
-
Chang, C.-H.1
Kayed, M.2
Girgis, M.R.3
Shaalan, K.F.4
-
13
-
-
85011016482
-
Context aware wrapping: Synchronized data extraction.
-
S.-L. Chuang, K. C. Chang, and C. X. Zhai. Context aware wrapping: Synchronized data extraction. In VLDB, 2007.
-
(2007)
VLDB
-
-
Chuang, S.-L.1
Chang, K.C.2
Zhai, C.X.3
-
14
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks.
-
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWeb, 2003.
-
(2003)
IIWeb
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
18
-
-
84861026711
-
-
Automatic wrappers for large scale web extraction. PVLDB
-
N. N. Dalvi, R. Kumar, and M. A. Soliman. Automatic wrappers for large scale web extraction. PVLDB, 4(4), 2011.
-
(2011)
, vol.4
, Issue.4
-
-
Dalvi, N.N.1
Kumar, R.2
Soliman, M.A.3
-
19
-
-
84863761202
-
An analysis of structured data on the web.
-
N. N. Dalvi, A. Machanavajjhala, and B. Pang. An analysis of structured data on the web. PVLDB, 5(7), 2012.
-
(2012)
PVLDB
, vol.5
, Issue.7
-
-
Dalvi, N.N.1
Machanavajjhala, A.2
Pang, B.3
-
20
-
-
77954301186
-
Harvesting relational tables from lists on the web.
-
H. Elmeleegy, J. Madhavan, and A. Y. Halevy. Harvesting relational tables from lists on the web. PVLDB, 2(1), 2009.
-
(2009)
PVLDB
, vol.2
, Issue.1
-
-
Elmeleegy, H.1
Madhavan, J.2
Halevy, A.Y.3
-
21
-
-
84866839638
-
-
and Mausam. Open information extraction: The second generation. In IJCAI
-
O. Etzioni, A. Fader, J. Christensen, S. Soderland, and Mausam. Open information extraction: The second generation. In IJCAI, 2011.
-
(2011)
-
-
Etzioni, O.1
Fader, A.2
Christensen, J.3
Soderland, S.4
-
22
-
-
79957798352
-
Web-scale information extraction with vertex.
-
P. Gulhane, A. Madaan, R. R. Mehta, J. Ramamirtham, R. Rastogi, S. Satpal, S. Sengamedu, A. Tengli, and C. Tiwari. Web-scale information extraction with vertex. In ICDE, 2011.
-
(2011)
ICDE
-
-
Gulhane, P.1
Madaan, A.2
Mehta, R.R.3
Ramamirtham, J.4
Rastogi, R.5
Satpal, S.6
Sengamedu, S.7
Tengli, A.8
Tiwari, C.9
-
23
-
-
84055203904
-
Exploiting content redundancy for web information extraction.
-
P. Gulhane, R. Rastogi, S. H. Sengamedu, and A. Tengli. Exploiting content redundancy for web information extraction. PVLDB, 3(1), 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
-
-
Gulhane, P.1
Rastogi, R.2
Sengamedu, S.H.3
Tengli, A.4
-
24
-
-
80052120564
-
From one tree to a forest: a unified solution for structured web data extraction.
-
Q. Hao, R. Cai, Y. Pang, and L. Zhang. From one tree to a forest: a unified solution for structured web data extraction. In SIGIR, 2011.
-
(2011)
SIGIR
-
-
Hao, Q.1
Cai, R.2
Pang, Y.3
Zhang, L.4
-
25
-
-
75449088101
-
Uninterpreted schema matching with embedded value mapping under opaque column names and data values.
-
A. Jaiswal, D. Miller, and P. Mitra. Uninterpreted schema matching with embedded value mapping under opaque column names and data values. IEEE Trans. Knowl. Data Eng., 22(2):291-304, 2010.
-
(2010)
IEEE Trans. Knowl. Data Eng.
, vol.22
, Issue.2
, pp. 291-304
-
-
Jaiswal, A.1
Miller, D.2
Mitra, P.3
-
26
-
-
1142267348
-
On schema matching with opaque column names and data values.
-
J. Kang and J. F. Naughton. On schema matching with opaque column names and data values. In SIGMOD, 2003.
-
(2003)
SIGMOD
-
-
Kang, J.1
Naughton, J.F.2
-
27
-
-
84875119323
-
Truth finding on the deep web: Is the problem solved?
-
X. Li, X. L. Dong, K. Lyons, W. Meng, and D. Srivastava. Truth finding on the deep web: Is the problem solved? PVLDB, 6(2), 2013.
-
(2013)
PVLDB
, vol.6
, Issue.2
-
-
Li, X.1
Dong, X.L.2
Lyons, K.3
Meng, W.4
Srivastava, D.5
-
29
-
-
34548080780
-
Introduction to Information Retrieval
-
Cambridge University Press
-
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval, Cambridge University Press, 2008.
-
(2008)
-
-
Manning, C.D.1
Raghavan, P.2
Schütze, H.3
-
30
-
-
57149134507
-
Toward best-effort information extraction.
-
W. Shen, P. DeRose, R. McCann, A. Doan, and R. Ramakrishnan. Toward best-effort information extraction. In SIGMOD, 2008.
-
(2008)
SIGMOD
-
-
Shen, W.1
DeRose, P.2
McCann, R.3
Doan, A.4
Ramakrishnan, R.5
-
31
-
-
77649265164
-
Learning to adapt web information extraction knowledge and discovering new attributes via a bayesian approach.
-
T.-L. Wong and W. Lam. Learning to adapt web information extraction knowledge and discovering new attributes via a bayesian approach. IEEE Trans. Knowl. Data Eng., 22(4):523-536, 2010.
-
(2010)
IEEE Trans. Knowl. Data Eng.
, vol.22
, Issue.4
, pp. 523-536
-
-
Wong, T.-L.1
Lam, W.2
|