-
2
-
-
10944246083
-
On the complexity of schema inference from web pages in the presence of nullable data attributes
-
ACM Press, New York
-
Yang, G., Ramakrishnan, I.V., Kifer, M.: On the complexity of schema inference from web pages in the presence of nullable data attributes. In: CIKM 2003: Proceedings of the twelfth International Conference on Information and Knowledge Management, pp. 224-231. ACM Press, New York (2003)
-
(2003)
CIKM 2003: Proceedings of the twelfth International Conference on Information and Knowledge Management
, pp. 224-231
-
-
Yang, G.1
Ramakrishnan, I.V.2
Kifer, M.3
-
4
-
-
26844469211
-
-
SAC, ACM Press, New York 2005
-
Debnath, S., Mitra, P., Giles, C.L.: Automatic extraction of informative blocks from webpages. In: SAC 2005, pp. 1722-1726. ACM Press, New York (2005)
-
(2005)
Automatic extraction of informative blocks from webpages
, pp. 1722-1726
-
-
Debnath, S.1
Mitra, P.2
Giles, C.L.3
-
5
-
-
77952370025
-
Eliminating noisy information in web pages for data mining
-
ACM Press, New York
-
Yi, L., Liu, B., Li, X.: Eliminating noisy information in web pages for data mining. In: KDD 2003: Proceedings of the ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 296-305. ACM Press, New York (2003)
-
(2003)
KDD 2003: Proceedings of the ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 296-305
-
-
Yi, L.1
Liu, B.2
Li, X.3
-
6
-
-
4644340823
-
Automatic web news extraction using tree edit distance
-
ACM Press, New York , doi:10.1145/988672.988740
-
Reis, D.C., Golgher, P.B., Silva, A.S., Laender, A.F.: Automatic web news extraction using tree edit distance. In: WWW 2004: Proceedings of the 13th International Conference on World Wide Web, pp. 502-511. ACM Press, New York (2004), doi:10.1145/988672.988740
-
(2004)
WWW 2004: Proceedings of the 13th International Conference on World Wide Web
, pp. 502-511
-
-
Reis, D.C.1
Golgher, P.B.2
Silva, A.S.3
Laender, A.F.4
-
7
-
-
77953053369
-
The volume and evolution of web page templates
-
ACM Press, New York
-
Gibson, D., Punera, K., Tomkins, A.: The volume and evolution of web page templates. In: WWW 2005: Special Interest Tracks and Posters of the 14th International Conference on World Wide Web, pp. 830-839. ACM Press, New York (2005)
-
(2005)
WWW 2005: Special Interest Tracks and Posters of the 14th International Conference on World Wide Web
, pp. 830-839
-
-
Gibson, D.1
Punera, K.2
Tomkins, A.3
-
8
-
-
35348883378
-
Page-level template detection via isotonic smoothing
-
ACM Press, New York
-
Chakrabarti, D., Kumar, R., Punera, K.: Page-level template detection via isotonic smoothing. In: WWW 2007: Proceedings of the 16th International Conference on World Wide Web, pp. 61-70. ACM Press, New York (2007)
-
(2007)
WWW 2007: Proceedings of the 16th International Conference on World Wide Web
, pp. 61-70
-
-
Chakrabarti, D.1
Kumar, R.2
Punera, K.3
-
9
-
-
84957632308
-
-
Cruz, I.F., Borisov, S., Marks, M.A., Webbs, T.R.: Measuring structural similarity among web documents: preliminary results. In: Porto, V.W., Waagen, D. (eds.) EP 1998. LNCS, 1447, pp. 513 524. Springer, Heidelberg (1998)
-
Cruz, I.F., Borisov, S., Marks, M.A., Webbs, T.R.: Measuring structural similarity among web documents: preliminary results. In: Porto, V.W., Waagen, D. (eds.) EP 1998. LNCS, vol. 1447, pp. 513 524. Springer, Heidelberg (1998)
-
-
-
-
11
-
-
0010362121
-
Syntactic clustering of the web
-
Broder, A.Z., Glassman, S.C., Manasse, M.S., Zweig, G.: Syntactic clustering of the web. Computer Networks 29(8-13), 1157-1166 (1997)
-
(1997)
Computer Networks
, vol.29
, Issue.8-13
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
12
-
-
2442561063
-
A bag of paths model for measuring structural similarity in web documents
-
ACM Press, New York
-
Joshi, S., Agrawal, N., Krishnapuram, R., Negi, S.: A bag of paths model for measuring structural similarity in web documents. In: KDD 2003: Proceedings of the ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 577-582. ACM Press, New York (2003)
-
(2003)
KDD 2003: Proceedings of the ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 577-582
-
-
Joshi, S.1
Agrawal, N.2
Krishnapuram, R.3
Negi, S.4
-
13
-
-
34247399581
-
Fast and simple XML tree differencing by sequence alignment
-
ACM Press, New York
-
Lindholm, T., Kangasharju, J., Tarkoma, S.: Fast and simple XML tree differencing by sequence alignment. In: DocEng 2006: Proceedings of the 2006 ACM Symposium on Document Engineering, pp. 75-84. ACM Press, New York (2006)
-
(2006)
DocEng 2006: Proceedings of the 2006 ACM Symposium on Document Engineering
, pp. 75-84
-
-
Lindholm, T.1
Kangasharju, J.2
Tarkoma, S.3
-
14
-
-
78649256542
-
A DOM tree alignment model for mining parallel data from the web
-
Morristown, NJ, USA, Association for Computational Linguistics, pp
-
Shi, L., Niu, C., Zhou, M., Gao, J.: A DOM tree alignment model for mining parallel data from the web. In: ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, Morristown, NJ, USA, Association for Computational Linguistics, pp. 489-496 (2006)
-
(2006)
ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL
, pp. 489-496
-
-
Shi, L.1
Niu, C.2
Zhou, M.3
Gao, J.4
-
16
-
-
24944533365
-
Nonmetric multidimensional scaling: A numerical method
-
Kruskal, J.B.: Nonmetric multidimensional scaling: A numerical method. Psychometrika 29(2), 115-129 (1964)
-
(1964)
Psychometrika
, vol.29
, Issue.2
, pp. 115-129
-
-
Kruskal, J.B.1
-
17
-
-
84950632109
-
Objective criteria for the evaluation of clustering methods
-
Rand, W.M.: Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association 66(336), 846-850 (1971)
-
(1971)
Journal of the American Statistical Association
, vol.66
, Issue.336
, pp. 846-850
-
-
Rand, W.M.1
-
18
-
-
0002788820
-
Impact of similarity measures on web-page clustering
-
AAAI, pp
-
Strehl, A., Ghosh, J., Mooney, R.: Impact of similarity measures on web-page clustering. In: AAAI 2000: Proceedings of the 17th National Conference on Artificial Intelligence: Workshop of Artificial Intelligence for Web Search, AAAI, pp. 58-64 (2000)
-
(2000)
AAAI 2000: Proceedings of the 17th National Conference on Artificial Intelligence: Workshop of Artificial Intelligence for Web Search
, pp. 58-64
-
-
Strehl, A.1
Ghosh, J.2
Mooney, R.3
|