-
1
-
-
8644236286
-
Vips: A vision-based page segmentation algorithm
-
MSR-TR-2003-79, 2003
-
D. Cai, S. Yu, J.-R. Wen, and W.-Y. Ma. Vips: a vision-based page segmentation algorithm. Microsoft Technical Report, MSR-TR-2003-79, 2003.
-
Microsoft Technical Report
-
-
Cai, D.1
Yu, S.2
Wen, J.-R.3
Ma, W.-Y.4
-
2
-
-
0034172374
-
Wrapper induction: Efficiency and expressiveness
-
N. Kushmerick. Wrapper induction: Efficiency and expressiveness. Artif. Intell., 118:15-68, 2000.
-
(2000)
Artif. Intell
, vol.118
, pp. 15-68
-
-
Kushmerick, N.1
-
3
-
-
0018985316
-
A faster algorithm computing string edit distances
-
W. Masek and M. Paterson. A faster algorithm computing string edit distances. J. Computer and System Sciences, 20:18-31, 1980.
-
(1980)
J. Computer and System Sciences
, vol.20
, pp. 18-31
-
-
Masek, W.1
Paterson, M.2
-
4
-
-
4644340823
-
Automatic web news extraction using tree edit distance
-
D. Reis, P. Golgher, A. Silva, and A. Laender. Automatic web news extraction using tree edit distance. In World Wide Web-04, pages 502-511, 2004.
-
(2004)
World Wide Web-04
, pp. 502-511
-
-
Reis, D.1
Golgher, P.2
Silva, A.3
Laender, A.4
-
5
-
-
10044268780
-
Data extraction from web data sources
-
Washington, DC, USA, IEEE Computer Society
-
J. Robinson. Data extraction from web data sources. In DEXA '04: Proceedings of the Database and Expert Systems Applications, 15th International Workshop on (DEXA'04), pages 282-288, Washington, DC, USA, 2004. IEEE Computer Society.
-
(2004)
DEXA '04: Proceedings of the Database and Expert Systems Applications, 15th International Workshop on (DEXA'04)
, pp. 282-288
-
-
Robinson, J.1
-
6
-
-
23944442806
-
Post-supervised template induction for information extraction from lists and tables in dynamic web sources
-
Z. Shi, E. Milios, and N. Zincir-Heywood. Post-supervised template induction for information extraction from lists and tables in dynamic web sources. J. Intell. Inf. Syst., 25(1):69-93, 2005.
-
(2005)
J. Intell. Inf. Syst
, vol.25
, Issue.1
, pp. 69-93
-
-
Shi, Z.1
Milios, E.2
Zincir-Heywood, N.3
-
7
-
-
48149114833
-
A hybrid method for web data extraction
-
Washington, DC, USA, IEEE Computer Society
-
Y. Wang and L. Zhou. A hybrid method for web data extraction. In WI '03: Proceedings of the 2003 IEEE/WIC International Conference on Web Intelligence, page 417, Washington, DC, USA, 2003. IEEE Computer Society.
-
(2003)
WI '03: Proceedings of the 2003 IEEE/WIC International Conference on Web Intelligence
, pp. 417
-
-
Wang, Y.1
Zhou, L.2
-
8
-
-
70549092342
-
Html page analysis based on visual cues
-
Washington, DC, USA, IEEE Computer Society
-
Y. Yang and H. Zhang. Html page analysis based on visual cues. In ICDAR '01: Proceedings of the Sixth International Conference on Document Analysis and Recognition, page 859, Washington, DC, USA, 2001. IEEE Computer Society.
-
(2001)
ICDAR '01: Proceedings of the Sixth International Conference on Document Analysis and Recognition
, pp. 859
-
-
Yang, Y.1
Zhang, H.2
-
9
-
-
33750797710
-
Structured data extraction from the web based on partial tree alignment
-
Y. Zhai and B. Liu. Structured data extraction from the web based on partial tree alignment. IEEE Transactions on Knowledge and Data Engineering, 18(12):1614-1628, 2006.
-
(2006)
IEEE Transactions on Knowledge and Data Engineering
, vol.18
, Issue.12
, pp. 1614-1628
-
-
Zhai, Y.1
Liu, B.2