-
1
-
-
5444246802
-
-
CiteSeer: Scientific Literature Digital Library. http://citeseer.nj.nec.com/.
-
-
-
-
3
-
-
5444224311
-
-
DBLP Computer Science Bibliography. http://dblp.uni-trier.de/.
-
-
-
-
5
-
-
0032092761
-
NoDoSE - A tool for semi-automatically extracting structured and semistructured data from text documents
-
Seattle, Washington, June
-
B. Adelberg. NoDoSE - a tool for semi-automatically extracting structured and semistructured data from text documents. In Proc. of the 1998 ACM SIGMOD Intl. Conf. on Management of Data, pages 283-294, Seattle, Washington, June 1998.
-
(1998)
Proc. of the 1998 ACM SIGMOD Intl. Conf. on Management of Data
, pp. 283-294
-
-
Adelberg, B.1
-
7
-
-
84938213121
-
Nymble: A high-performance learning name-finder
-
Washington, D.C.
-
D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel. Nymble: A high-performance learning name-finder. In Proc. of the 5th Conf. on Applied Natural Language Processing, pages 194-201, Washington, D.C., 1997.
-
(1997)
Proc. of the 5th Conf. on Applied Natural Language Processing
, pp. 194-201
-
-
Bikel, D.M.1
Miller, S.2
Schwartz, R.3
Weischedel, R.4
-
8
-
-
0034832365
-
Automatic segmentation of text into structured records
-
Santa Barbara, California, May
-
V. R. Borkar, K. Deshmukh, and S. Sarawagi. Automatic segmentation of text into structured records. In Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data, pages 175-186, Santa Barbara, California, May 2001.
-
(2001)
Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data
, pp. 175-186
-
-
Borkar, V.R.1
Deshmukh, K.2
Sarawagi, S.3
-
10
-
-
0034830138
-
OminiSearch: A method for searching dynamic content on the Web
-
Santa Barbara, California, May
-
D. Buttler, L. Liu, C. Pu, H. Paques, W. Han, and W. Tang. OminiSearch: A method for searching dynamic content on the Web. In Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data, Santa Barbara, California, May 2001.
-
(2001)
Proc. of the 2001 ACM SIGMOD Intl. Conf. on Management of Data
-
-
Buttler, D.1
Liu, L.2
Pu, C.3
Paques, H.4
Han, W.5
Tang, W.6
-
12
-
-
84944327150
-
RoadRunner: Towards automatic data extraction from large Web sites
-
Roma, Italy, September
-
V. Crescenzi, G. Mecca, and P. Merialdo. RoadRunner: Towards automatic data extraction from large Web sites. In Proc. of the 2001 Intl. Conf. on Very Large Data Bases, pages 109-118, Roma, Italy, September 2001.
-
(2001)
Proc. of the 2001 Intl. Conf. on Very Large Data Bases
, pp. 109-118
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
13
-
-
0346501095
-
Record-boundary discovery in Web documents
-
Philadelphia, Pennsylvania, June
-
D. Embley, S. Jiang, and Y. Ng. Record-boundary discovery in Web documents. In Proc. of the 1999 ACM SIGMOD Intl. Conf. on Management of Data, pages 467-478, Philadelphia, Pennsylvania, June 1999.
-
(1999)
Proc. of the 1999 ACM SIGMOD Intl. Conf. on Management of Data
, pp. 467-478
-
-
Embley, D.1
Jiang, S.2
Ng, Y.3
-
16
-
-
0035747560
-
Bootstrapping for example-based data extraction
-
Atlanta, Georgia, November
-
P. B. Golgher, A. S. da Silva, A. H. F. Laender, and B. A. Ribeiro-Neto. Bootstrapping for example-based data extraction. In Proc. of the 2001 Intl. Conf. on Information and Knowledge Management, pages 371-378, Atlanta, Georgia, November 2001.
-
(2001)
Proc. of the 2001 Intl. Conf. on Information and Knowledge Management
, pp. 371-378
-
-
Golgher, P.B.1
Da Silva, A.S.2
Laender, A.H.F.3
Ribeiro-Neto, B.A.4
-
17
-
-
84947582877
-
An example-based environment for wrapper generation
-
Salt Lake City, Utah
-
P. B. Golgher, A. H. F. Laender, A. S. da Silva, and B. A. Ribeiro-Neto. An example-based environment for wrapper generation. In Proc. of the 2nd Intl. Workshop on the World Wide Web and Conceptual Modeling, pages 152-164, Salt Lake City, Utah, 2000.
-
(2000)
Proc. of the 2nd Intl. Workshop on the World Wide Web and Conceptual Modeling
, pp. 152-164
-
-
Golgher, P.B.1
Laender, A.H.F.2
Da Silva, A.S.3
Ribeiro-Neto, B.A.4
-
18
-
-
84867731738
-
Extracting semistructured information from the Web
-
Tucson, Arizona, May
-
J. Hammer, H. Garcia-Molina, J. Cho, A. Crespo, and R. Aranha. Extracting semistructured information from the Web. In Proc. of the Workshop on Management of Semistructured Data, pages 18-25, Tucson, Arizona, May 1997.
-
(1997)
Proc. of the Workshop on Management of Semistructured Data
, pp. 18-25
-
-
Hammer, J.1
Garcia-Molina, H.2
Cho, J.3
Crespo, A.4
Aranha, R.5
-
19
-
-
0002985122
-
Wrapping Web data into XML
-
W. Han, D. Buttler, and C. Pu. Wrapping Web data into XML. SIGMOD Record, 30(3):33-38, 2001.
-
(2001)
SIGMOD Record
, vol.30
, Issue.3
, pp. 33-38
-
-
Han, W.1
Buttler, D.2
Pu, C.3
-
20
-
-
0015600423
-
The Viterbi algorithm
-
March
-
D. Forney Jr. The Viterbi algorithm. Proc. of the IEEE, 61(3), March 1973.
-
(1973)
Proc. of the IEEE
, vol.61
, Issue.3
-
-
Forney Jr., D.1
-
22
-
-
0002781191
-
Accurately and reliably extracting data from the Web: A machine learning approach
-
C. A. Knoblock, K. Lerman, S. Minton, and I. Muslea. Accurately and reliably extracting data from the Web: A machine learning approach. IEEE Data Engineering Bulletin, 23(4):33-41, 2000.
-
(2000)
IEEE Data Engineering Bulletin
, vol.23
, Issue.4
, pp. 33-41
-
-
Knoblock, C.A.1
Lerman, K.2
Minton, S.3
Muslea, I.4
-
23
-
-
0001776223
-
Wrapper induction for information extraction
-
Nagoya, Japan
-
N. Kushmerick, D. S. Weld, and R. B. Doorenbos. Wrapper induction for information extraction. In Proc. of the 1997 Intl. Joint Conf. on Artificial Intelligence, pages 729-737, Nagoya, Japan, 1997.
-
(1997)
Proc. of the 1997 Intl. Joint Conf. on Artificial Intelligence
, pp. 729-737
-
-
Kushmerick, N.1
Weld, D.S.2
Doorenbos, R.B.3
-
24
-
-
0034172374
-
Wrapper induction: Efficiency and expressiveness
-
Nicholas Kushmerick. Wrapper induction: Efficiency and expressiveness. Artificial Intelligence, 118(1-2):15-68, 2000.
-
(2000)
Artificial Intelligence
, vol.118
, Issue.1-2
, pp. 15-68
-
-
Kushmerick, N.1
-
25
-
-
0037806547
-
A brief survey of Web data extraction tools
-
A. H. F. Laender, B. A. Ribeiro-Neto, A. S. da Silva, and J. S. Teixeira. A brief survey of Web data extraction tools. SIGMOD Record, 31(2):84-93, 2002.
-
(2002)
SIGMOD Record
, vol.31
, Issue.2
, pp. 84-93
-
-
Laender, A.H.F.1
Ribeiro-Neto, B.A.2
Da Silva, A.S.3
Teixeira, J.S.4
-
27
-
-
5444251186
-
Top-down extraction of semi-structured data
-
Cancun, Mexico, September
-
B. A. Ribeiro-Neto, A. H. F. Laender, and A. S. da Silva. Top-down extraction of semi-structured data. In Proc. of the 6th Symp. on String Processing and Information Retrieval, pages 176-183, Cancun, Mexico, September 1999.
-
(1999)
Proc. of the 6th Symp. on String Processing and Information Retrieval
, pp. 176-183
-
-
Ribeiro-Neto, B.A.1
Laender, A.H.F.2
Da Silva, A.S.3
-
28
-
-
0032596539
-
Learning dictionaries for information extraction by multi-level bootstrapping
-
Orlando, Florida, July
-
E. Riloff and R. Jones. Learning dictionaries for information extraction by multi-level bootstrapping. In Proc. of the 1999 National Conf. on Artificial Intelligence, pages 474-479, Orlando, Florida, July 1999.
-
(1999)
Proc. of the 1999 National Conf. on Artificial Intelligence
, pp. 474-479
-
-
Riloff, E.1
Jones, R.2
-
29
-
-
84976654685
-
Fast text searching allowing errors
-
October
-
S. Wu and U. Manber. Fast text searching allowing errors. Communications of the ACM, 35(10):83-91, October 1992.
-
(1992)
Communications of the ACM
, vol.35
, Issue.10
, pp. 83-91
-
-
Wu, S.1
Manber, U.2