[1] E. Agichtein and V. Ganti. Mining reference tables for automatic text segmentation. In KDD, pages 20-29, 2004.
[2] M. Alvarez, A. Pan, J. Raposo, F. Bellas, and F. Cacheda. Extracting lists of data records from semi-structured web pages. Data Knowl. Eng., 2008.
[4] V. Borkar, K. Deshmukh, and S. Sarawagi. Automatic segmentation of text into structured records. SIGMOD Rec., 30(2), 2001.
[5] S. Canisius and C. Sporleder. Bootstrapping information extraction from field books. In EMNLP, pages 827-836, 2007.
[7] S.-L. Chuang, K. C.-C. Chang, and C. Zhai. Context-aware wrapping: Synchronized data extraction. In VLDB, 2007.
[8] W. W. Cohen. Data integration using similarity joins and a word-based information representation language. ACM Trans. Inf. Syst., 18(3), 2000.
[9] V. Crescenzi, G. Mecca, and P. Merialdo. RoadRunner: Towards automatic data extraction from large web sites. In VLDB, 2001.
[10] P. DeRose, W. Shen, F. Chen, A. Doan, and R. Ramakrishnan. Building structured web community portals: A top-down, compositional, and incremental approach. In VLDB, pages 399-410, 2007.
[12] P. Gulhane, R. Rastogi, S. Sengamedu, and A. Tengli. Exploiting content redundancy for web information extraction. In VLDB, 2010.
[13] R. Gupta and S. Sarawagi. Answering table augmentation queries from unstructured lists on the web. In VLDB, 2009.
[15] I. R. Mansuri and S. Sarawagi. Integrating unstructured data into relational databases. In ICDE '06: Proceedings of the 22nd International Conference on Data Engineering, page 29, Washington, DC, USA, 2006.
[16] S. B. Needleman and C. D. Wunsch. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol., 1970.
[17] P. Papotti, V. Crescenzi, P. Merialdo, M. Bronzi, and L. Blanco. Redundancy-driven web data extraction and integration. In WebDB, 2010.
[18] A. Rajaraman. Kosmix: Exploring the deep web using taxonomies and categorization. IEEE Data Eng. Bull., 32(2):12-19, 2009.
[20] C. Sutton and A. McCallum. An introduction to conditional random fields for relational learning. In Introduction to Statistical Relational Learning, chapter 4. MIT Press, 2007.
[21] A. J. Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory, 13(2), 1967.
[22] Y. Zhai and B. Liu. Web data extraction based on partial tree alignment. In WWW. ACM, 2005.
[23] J. Zhu, Z. Nie, J. Wen, B. Zhang, and W. Ma. Simultaneous record detection and attribute labeling in web data extraction. In KDD, 2006.