-
1
-
-
12244298488
-
Mining reference tables for automatic text segmentation
-
E. Agichtein and V. Ganti. Mining reference tables for automatic text segmentation. In Proc. of SIGKDD, 2004.
-
(2004)
Proc. of SIGKDD
-
-
Agichtein, E.1
Ganti, V.2
-
5
-
-
84944327150
-
RoadRunner: Towards automatic data extraction from large web sites
-
V. Crescenzi, G. Mecca, and P. Merialdo. RoadRunner: Towards automatic data extraction from large web sites. In Proc. of VLDB, 2001.
-
(2001)
Proc. of VLDB
-
-
Crescenzi, V.1
Mecca, G.2
Merialdo, P.3
-
6
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39:1-38, 1977.
-
(1977)
Journal of the Royal Statistical Society, Series B
, vol.39
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
8
-
-
0000996386
-
Maximum likelihood estimation from incomplete data
-
H. Hartley. Maximum likelihood estimation from incomplete data. Biometrics, 14:174-194, 1958.
-
(1958)
Biometrics
, vol.14
, pp. 174-194
-
-
Hartley, H.1
-
10
-
-
0037806547
-
A brief survey of web data extraction tools
-
A. Laender, B. Ribeiro-Neto, A. Silva, and J. Teixeira. A brief survey of web data extraction tools. SIGMOD Record, 31(2):84-93, June 2002.
-
(2002)
SIGMOD Record
, vol.31
, Issue.2
, pp. 84-93
-
-
Laender, A.1
Ribeiro-Neto, B.2
Silva, A.3
Teixeira, J.4
-
11
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
J. Lafferty, A. McCallum, and F. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. of ICML, 2001.
-
(2001)
Proc. of ICML
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.3
-
14
-
-
33749612526
-
Integrating unstructured data into relational databases
-
I. R. Mansuri and S. Sarawagi. Integrating unstructured data into relational databases. In Proc. of ICDE, 2006.
-
(2006)
Proc. of ICDE
-
-
Mansuri, I.R.1
Sarawagi, S.2
-
15
-
-
0036205389
-
Similarity flooding: A versatile graph matching algorithm and its application to schema matching
-
S. Melnik, H. Garcia-Molina, and E. Rahm. Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In Proc. of ICDE, 2002.
-
(2002)
Proc. of ICDE
-
-
Melnik, S.1
Garcia-Molina, H.2
Rahm, E.3
-
16
-
-
0002788893
-
A view of the EM algorithm that justifies incremental, sparse, and other variants
-
R. M. Neal and G. E. Hinton. A view of the EM algorithm that justifies incremental, sparse, and other variants. Learning in Graphical Models, pages 355-368, 1998.
-
(1998)
Learning in Graphical Models
, pp. 355-368
-
-
Neal, R.M.1
Hinton, G.E.2
-
17
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
L. R. Rabiner. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257-286, 1989.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
18
-
-
0035657983
-
A survey of approaches to automatic schema matching
-
E. Rahm and P. A. Bernstein. A survey of approaches to automatic schema matching. VLDB Journal, 10(4):334-350, 2001.
-
(2001)
VLDB Journal
, vol.10
, Issue.4
, pp. 334-350
-
-
Rahm, E.1
Bernstein, P.A.2
-
19
-
-
34047192804
-
Semi-Markov conditional random fields for information extraction
-
S. Sarawagi and W. W. Cohen. Semi-Markov conditional random fields for information extraction. In Proc. of NIPS, 2004.
-
(2004)
Proc. of NIPS
-
-
Sarawagi, S.1
Cohen, W.W.2
-
20
-
-
0032624184
-
Learning information extraction rules for semi-structured and free text
-
S. Soderland. Learning information extraction rules for semi-structured and free text. Machine Learning, 34(1-3):233-272, 1999.
-
(1999)
Machine Learning
, vol.34
, Issue.1-3
, pp. 233-272
-
-
Soderland, S.1
-
21
-
-
3142679542
-
An interactive clustering-based approach to integrating source query interfaces on the deep web
-
W. Wu, C. Yu, A. Doan, and W. Meng. An interactive clustering-based approach to integrating source query interfaces on the deep web. In Proc. of SIGMOD, 2004.
-
(2004)
Proc. of SIGMOD
-
-
Wu, W.1
Yu, C.2
Doan, A.3
Meng, W.4
-
22
-
-
33744821948
-
Web data extraction based on partial tree alignment
-
Y. Zhai and B. Liu. Web data extraction based on partial tree alignment. In Proc. of WWW, 2005.
-
(2005)
Proc. of WWW
-
-
Zhai, Y.1
Liu, B.2
-
23
-
-
33744511796
-
Fully automatic wrapper generation for search engines
-
H. Zhao, W. Meng, Z. Wu, V. Raghavan, and C. Yu. Fully automatic wrapper generation for search engines. In Proc. of WWW, 2005.
-
(2005)
Proc. of WWW
-
-
Zhao, H.1
Meng, W.2
Wu, Z.3
Raghavan, V.4
Yu, C.5
-
24
-
-
31844452562
-
2D conditional random fields for Web information extraction
-
J. Zhu, Z. Nie, J.-R. Wen, B. Zhang, and W.-Y. Ma. 2D conditional random fields for Web information extraction. In Proc. of ICML, 2005.
-
(2005)
Proc. of ICML
-
-
Zhu, J.1
Nie, Z.2
Wen, J.-R.3
Zhang, B.4
Ma, W.-Y.5