-
1
-
-
0037806547
-
-
A. Laender, B. Ribeiro-Neto, A. da Silva, and J. Teixeira. A Brief Survey of Web Data Extraction Tools. SIGMOD Record, 31(2), 2002.
-
-
-
-
-
2
-
-
84944318551
-
Visual Web Information Extraction with Lixto
-
R. Baumgartner, Sergio Flesca, and Georg Gottlob. Visual Web Information Extraction with Lixto. In Proc. Very Large Database Conference, Rome, Italy, pages 119-128, 2001.
-
(2001)
Proc. Very Large Database Conference, Rome, Italy
, pp. 119-128
-
-
Baumgartner, R.1
Flesca, S.2
Gottlob, G.3
-
6
-
-
12344333240
-
Automatic information extraction from large websites
-
V. Crescenzi and G. Mecca. Automatic information extraction from large websites. J. ACM, 51(5):731-779, 2004.
-
(2004)
J. ACM
, vol.51
, Issue.5
, pp. 731-779
-
-
Crescenzi, V.1
Mecca, G.2
-
7
-
-
84944327150
-
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
-
V. Crescenzi, G. Mecca, and Paolo Merialdo. RoadRunner: Towards Automatic Data Extraction from Large Web Sites. In Proc. Very Large Database Systems, Rome, Italy, pages 109-118, 2001.
-
(2001)
Proc. Very Large Database Systems, Rome, Italy
, pp. 109-118
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
11
-
-
34250679000
-
-
Documentum Enterprise Content Integration
-
Documentum Enterprise Content Integration. http://www.documentum.com/solutions/eci.
-
-
-
-
12
-
-
34250613974
-
-
Fetch technologies
-
Fetch technologies, http://www.fetch.com/.
-
-
-
-
13
-
-
0343421094
-
Database techniques for the world-wide web: A survey
-
D. Florescu, A. Y. Levy, and A. O. Mendelzon. Database techniques for the world-wide web: A survey. SIGMOD Record, 27(3):59-74, 1998.
-
(1998)
SIGMOD Record
, vol.27
, Issue.3
, pp. 59-74
-
-
Florescu, D.1
Levy, A.Y.2
Mendelzon, A.O.3
-
15
-
-
3142745227
-
The lixto data extraction project: Back and forth between theory and practice
-
New York, NY, USA, ACM Press
-
G. Gottlob, Ch. Koch, R. Baumgartner, M. Herzog, and S. Flesca. The lixto data extraction project: back and forth between theory and practice. In Proc. 23rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, pages 1-12, New York, NY, USA, 2004. ACM Press.
-
(2004)
Proc. 23rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems
, pp. 1-12
-
-
Gottlob, G.1
Koch, C.2
Baumgartner, R.3
Herzog, M.4
Flesca, S.5
-
16
-
-
30544447615
-
Thresher: Automating the unwrapping of semantic content from the world wide web
-
New York, NY, USA, ACM Press
-
A. Hogue and D. Karger. Thresher: automating the unwrapping of semantic content from the world wide web. In WWW'05: Proceedings of the 14th international conference on World Wide Web, pages 86-95, New York, NY, USA, 2005. ACM Press.
-
(2005)
WWW'05: Proceedings of the 14th international conference on World Wide Web
, pp. 86-95
-
-
Hogue, A.1
Karger, D.2
-
17
-
-
84943235581
-
Using Grammatical Inference to Automate Information Extraction from the Web
-
Springer
-
Th. W. Hong and K. L. Clark. Using Grammatical Inference to Automate Information Extraction from the Web. In Proc. 5th European Conf. PKDD, Germany, Freiburg, volume 2168, pages 216-228. Springer, 2001.
-
(2001)
Proc. 5th European Conf. PKDD, Germany, Freiburg
, vol.2168
, pp. 216-228
-
-
Hong, T.W.1
Clark, K.L.2
-
19
-
-
31744452336
-
Evaluating machine learning for information extraction
-
N. Ireson, F. Ciravegna, M.-E. Califf, A. Lavelli, D. Freitag, and N. Kushmerick. Evaluating machine learning for information extraction. In Proc. Int. Conf. Machine Learning, 2005.
-
(2005)
Proc. Int. Conf. Machine Learning
-
-
Ireson, N.1
Ciravegna, F.2
Califf, M.-E.3
Lavelli, A.4
Freitag, D.5
Kushmerick, N.6
-
20
-
-
34250668029
-
-
Itemfield. http://www.itemfield.com/.
-
Itemfield
-
-
-
22
-
-
35248883936
-
Toolkits for generating wrappers: A survey of software toolkits for automated data extraction from websites
-
NetObjectDays, NODe 2002, Erfurt, Germany
-
S. Kuhlins and R. Tredwell. Toolkits for generating wrappers: A survey of software toolkits for automated data extraction from websites. In NetObjectDays, NODe 2002, Erfurt, Germany, LNCS, volume 2591, pages 184-198, 2002.
-
(2002)
LNCS
, vol.2591
, pp. 184-198
-
-
Kuhlins, S.1
Tredwell, R.2
-
23
-
-
0032596649
-
Regression Testing for Wrapper Maintenance
-
N. Kushmerick. Regression Testing for Wrapper Maintenance. In Proc. AAAI, pages 74-79, 1999.
-
(1999)
Proc. AAAI
, pp. 74-79
-
-
Kushmerick, N.1
-
24
-
-
0034172374
-
Wrapper Induction: Efficiency and Expressiveness
-
N. Kushmerick. Wrapper Induction: Efficiency and Expressiveness. Artificial Intelligence, 118:15-68, 2000.
-
(2000)
Artificial Intelligence
, vol.118
, pp. 15-68
-
-
Kushmerick, N.1
-
26
-
-
0033893885
-
XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources
-
L. Liu, C. Pu, and W. Han. XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources. In Proc. Intern. Conf. Data Engineering, pages 611-621, 2000.
-
(2000)
Proc. Intern. Conf. Data Engineering
, pp. 611-621
-
-
Liu, L.1
Pu, C.2
Han, W.3
-
27
-
-
34250661514
-
-
Lixto software gmbh
-
Lixto software gmbh. http://www.lixto.com/.
-
-
-
-
28
-
-
0032684968
-
A Hierarchical Approach to Wrapper Induction
-
I. Muslea, S. Minton, and C. Knoblock. A Hierarchical Approach to Wrapper Induction. In Proc. the Third Intern. Conf. on Autonomous Agents Conference, Seattle, WA, pages 190-197, 1999.
-
(1999)
Proc. the Third Intern. Conf. on Autonomous Agents Conference, Seattle, WA
, pp. 190-197
-
-
Muslea, I.1
Minton, S.2
Knoblock, C.3
-
29
-
-
0027595230
-
Learning subsequential transducers for pattern recognition interpretation
-
J. Oncina, P. Garcia, and E. Vidal. Learning subsequential transducers for pattern recognition interpretation. IEEE Trans. on Pattern Analysis, 15:448-458, 1993.
-
(1993)
IEEE Trans. on Pattern Analysis
, vol.15
, pp. 448-458
-
-
Oncina, J.1
Garcia, P.2
Vidal, E.3
-
30
-
-
0008749998
-
Building light-weight wrappers for legacy Web data-sources using W4F
-
A. Sahuguet and F. Azavant. Building light-weight wrappers for legacy Web data-sources using W4F. The VLDB Journal, pages 738-741, 1999.
-
(1999)
The VLDB Journal
, pp. 738-741
-
-
Sahuguet, A.1
Azavant, F.2