-
1
-
-
77953052174
-
Template detection via data mining and its applications
-
New York, NY, USA, ACM Press
-
Z. Bar-Yossef and S. Rajagopalan. Template detection via data mining and its applications. In Proc. 11th Int. Conf. on WWW, pages 580-591, New York, NY, USA, 2002. ACM Press.
-
(2002)
Proc. 11th Int. Conf. on WWW
, pp. 580-591
-
-
Bar-Yossef, Z.1
Rajagopalan, S.2
-
2
-
-
10944246083
-
On the complexity of schema inference from web pages in the presence of nullable data attributes
-
New York, NY, USA, ACM Press
-
G. Yang, I. V. Ramakrishnan, and M. Kifer. On the complexity of schema inference from web pages in the presence of nullable data attributes. In Proc. 12th Int. Conf. on Information and Knowledge Management, pages 224-231, New York, NY, USA, 2003. ACM Press.
-
(2003)
Proc. 12th Int. Conf. on Information and Knowledge Management
, pp. 224-231
-
-
Yang, G.1
Ramakrishnan, I.V.2
Kifer, M.3
-
3
-
-
0242456776
-
Discovering informative content blocks from web documents
-
New York, NY, USA, ACM Press
-
S.-H. Lin and J.-M. Ho. Discovering informative content blocks from web documents. In Proc. 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pages 588-593, New York, NY, USA, 2002. ACM Press.
-
(2002)
Proc. 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
, pp. 588-593
-
-
Lin, S.-H.1
Ho, J.-M.2
-
4
-
-
26844469211
-
Automatic extraction of informative blocks from webpages
-
New York, NY, USA, ACM Press
-
S. Debnath, P. Mitra, and C. L. Giles. Automatic extraction of informative blocks from webpages. In Proc. 2005 ACM Symp. on Applied Computing, pages 1722-1726, New York, NY, USA, 2005. ACM Press.
-
(2005)
Proc. 2005 ACM Symp. on Applied Computing
, pp. 1722-1726
-
-
Debnath, S.1
Mitra, P.2
Giles, C.L.3
-
5
-
-
77952370025
-
Eliminating noisy information in web pages for data mining
-
New York, NY, USA, ACM Press
-
L. Yi, B. Liu, and X. Li. Eliminating noisy information in web pages for data mining. In Proc. 9th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pages 296-305, New York, NY, USA, 2003. ACM Press.
-
(2003)
Proc. 9th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
, pp. 296-305
-
-
Yi, L.1
Liu, B.2
Li, X.3
-
6
-
-
4644340823
-
Automatic web news extraction using tree edit distance
-
New York, NY, USA, ACM Press
-
D. C. Reis, P. B. Golgher, A. S. Silva, and A. F. Laender. Automatic web news extraction using tree edit distance. In Proc. 13th Int. Conf. on WWW, pages 502-511, New York, NY, USA, 2004. ACM Press.
-
(2004)
Proc. 13th Int. Conf. on WWW
, pp. 502-511
-
-
Reis, D.C.1
Golgher, P.B.2
Silva, A.S.3
Laender, A.F.4
-
7
-
-
35348883378
-
Page-level template detection via isotonic smoothing
-
New York, NY, USA, ACM Press
-
D. Chakrabarti, R. Kumar, and K. Punera. Page-level template detection via isotonic smoothing. In Proc. 16th Int. Conf. on WWW, pages 61-70, New York, NY, USA, 2007. ACM Press.
-
(2007)
Proc. 16th Int. Conf. on WWW
, pp. 61-70
-
-
Chakrabarti, D.1
Kumar, R.2
Punera, K.3
-
8
-
-
77953053369
-
The volume and evolution of web page templates
-
New York, NY, USA, ACM Press
-
D. Gibson, K. Punera, and A. Tomkins. The volume and evolution of web page templates. In Special Interest Tracks and Posters, 14th Int. Conf. on WWW, pages 830-839, New York, NY, USA, 2005. ACM Press.
-
(2005)
Special Interest Tracks and Posters, 14th Int. Conf. on WWW
, pp. 830-839
-
-
Gibson, D.1
Punera, K.2
Tomkins, A.3
-
9
-
-
84880498138
-
DOM-based content extraction of HTML documents
-
New York, NY, USA, ACM Press
-
S. Gupta, G. Kaiser, D. Neistadt, and P. Grimm. DOM-based content extraction of HTML documents. In Proc. 12th Int. Conf. on WWW, pages 207-214, New York, NY, USA, 2003. ACM Press.
-
(2003)
Proc. 12th Int. Conf. on WWW
, pp. 207-214
-
-
Gupta, S.1
Kaiser, G.2
Neistadt, D.3
Grimm, P.4
-
10
-
-
26944496810
-
Identifying content blocks from web documents
-
Foundations of Intelligent Systems
-
S. Debnath, P. Mitra, and C. L. Giles. Identifying content blocks from web documents. In Foundations of Intelligent Systems, LNCS, pages 285-293, 2005.
-
(2005)
LNCS
, pp. 285-293
-
-
Debnath, S.1
Mitra, P.2
Giles, C.L.3
-
12
-
-
0036989234
-
QuASM: A system for question answering using semi-structured data
-
New York, NY, USA, ACM Press
-
D. Pinto, M. Branstein, R. Coleman, W. B. Croft, M. King, W. Li, and X. Wei. QuASM: a system for question answering using semi-structured data. In Proc. 2nd ACM/IEEE-CS Joint Conf. on Digital Libraries, pages 46-55, New York, NY, USA, 2002. ACM Press.
-
(2002)
Proc. 2nd ACM/IEEE-CS joint Conf. on Digital libraries
, pp. 46-55
-
-
Pinto, D.1
Branstein, M.2
Coleman, R.3
Croft, W.B.4
King, M.5
Li, W.6
Wei, X.7
-
14
-
-
12744279236
-
A short survey of document structure similarity algorithms
-
CSREA Press
-
D. Buttler. A short survey of document structure similarity algorithms. In Proc. Int. Conf. on Internet Computing, pages 3-9. CSREA Press, 2004.
-
(2004)
Proc. Int. Conf. on Internet Computing
, pp. 3-9
-
-
Buttler, D.1
-
15
-
-
0010362121
-
Syntactic clustering of the web
-
A. Z. Broder, S. C. Glassman, M. S. Manasse, and G. Zweig. Syntactic clustering of the web. Computer Networks, 29(8-13):1157-1166, 1997.
-
(1997)
Computer Networks
, vol.29
, Issue.8-13
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4