-
2
-
-
33947190914
-
Vertical navigation of layout adapted web documents
-
Beszteri, I., Vuorimaa, P.: Vertical navigation of layout adapted web documents. World Wide Web 10(1), 1-35 (2007).
-
(2007)
World Wide Web
, vol.10
, Issue.1
, pp. 1-35
-
-
Beszteri, I.1
Vuorimaa, P.2
-
3
-
-
0034791059
-
Enhanced topic distillation using text, markup tags, and hy-perlinks
-
Chakrabarti, S., Joshi, M., Tawde, V.: Enhanced topic distillation using text, markup tags, and hy-perlinks. In: Proceedings the of ACM Conference on Research and Development in Information Retrieval, pp. 208-216 (2001).
-
(2001)
Proceedings the Of ACM Conference On Research and Development In Information Retrieval
, pp. 208-216
-
-
Chakrabarti, S.1
Joshi, M.2
Tawde, V.3
-
4
-
-
35348883378
-
Page-level template detection via isotonic smoothing
-
ACM, New York, NY, USA
-
Chakrabarti, D., Kumar, R., Punera, K.: Page-level template detection via isotonic smoothing. In: WWW'07: Proceedings of the 16th International Conference on World Wide Web, pp. 61-70. ACM, New York, NY, USA (2007).
-
(2007)
WWW'07: Proceedings of The 16th International Conference On World Wide Web
, pp. 61-70
-
-
Chakrabarti, D.1
Kumar, R.2
Punera, K.3
-
5
-
-
0345861203
-
New algorithm for tree-to-tree correction problem
-
Chen, W.: New algorithm for tree-to-tree correction problem. J. Algorithms 40, 135-158 (2001).
-
(2001)
J. Algorithms
, vol.40
, pp. 135-158
-
-
Chen, W.1
-
6
-
-
33751046629
-
Template detection for large scale search engines
-
ACM, New York, NY, USA
-
Chen, L., Ye, S., Li, X.: Template detection for large scale search engines. In: SAC'06: Proceedings of the 2006 ACM Symposium on Applied Computing, pp. 1094-1098. ACM, New York, NY, USA (2006).
-
(2006)
SAC'06: Proceedings of The 2006 Acm Symposium On Applied Computing
, pp. 1094-1098
-
-
Chen, L.1
Ye, S.2
Li, X.3
-
7
-
-
29144484106
-
A methodology for clustering xml documents by structure
-
Dalamagas, T., Cheng, T., Winkel, K. J., Sellis, T. K.: A methodology for clustering xml documents by structure. Inf. Syst. 31(3), 187-228 (2006).
-
(2006)
Inf. Syst.
, vol.31
, Issue.3
, pp. 187-228
-
-
Dalamagas, T.1
Cheng, T.2
Winkel, K.J.3
Sellis, T.K.4
-
8
-
-
4644340823
-
Automatic web news extraction using tree edit distance
-
de Castro Reis, D., Golgher, P.B., da Silva, A.S., Laender, A.H.F.: Automatic web news extraction using tree edit distance. In: Proceedings of the International Conference on the World Wide Web, pp. 502-511 (2004).
-
(2004)
Proceedings of the International Conference On The World Wide Web
, pp. 502-511
-
-
de Castro Reis, D.1
Golgher, P.B.2
da Silva, A.S.3
Laender, A.H.F.4
-
9
-
-
26844469211
-
Automatic extraction of informative blocks from webpages
-
Debnath, S., Mitra, P., Giles, C.L.: Automatic extraction of informative blocks from webpages. In: ACM Symposium on Applied Computing, pp. 1722-1726 (2005).
-
(2005)
ACM Symposium On Applied Computing
, pp. 1722-1726
-
-
Debnath, S.1
Mitra, P.2
Giles, C.L.3
-
10
-
-
77953053369
-
The volume and evolution of web page templates
-
Gibson, D., Punera, K., Tomkins, A.: The volume and evolution of web page templates. In: Proceedings of the International Conference on the World Wide Web-Poster Session, pp. 830-839. (2005).
-
(2005)
Proceedings of the International Conference On The World Wide Web-poster Session
, pp. 830-839
-
-
Gibson, D.1
Punera, K.2
Tomkins, A.3
-
11
-
-
2442561063
-
A bag of paths model for measuring structural similarity in web documents
-
Washington, DC, USA
-
Joshi, S., Agrawal, N., Krishnapuram, R., Negi, S.: A bag of paths model for measuring structural similarity in web documents. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, pp. 577-582 (2003).
-
(2003)
Proceedings of The Ninth ACM Sigkdd International Conference On Knowledge Discovery and Data Mining
, pp. 577-582
-
-
Joshi, S.1
Agrawal, N.2
Krishnapuram, R.3
Negi, S.4
-
12
-
-
38349151563
-
A novelty-based clustering method for on-line documents
-
Khy, S., Ishikawa, Y., Kitagawa, H.: A novelty-based clustering method for on-line documents. World Wide Web 11(1), 1-37 (2008).
-
(2008)
World Wide Web
, vol.11
, Issue.1
, pp. 1-37
-
-
Khy, S.1
Ishikawa, Y.2
Kitagawa, H.3
-
13
-
-
0742268827
-
An efficient and scalable algorithm for clustering xml documents by structure
-
Lian, W., Cheung, D.W.L., Mamoulis, N., Yiu, S.M.: An efficient and scalable algorithm for clustering xml documents by structure. IEEE Trans. Knowl. Data Eng. 16(1), 82-96 (2004).
-
(2004)
Ieee Trans. Knowl. Data Eng.
, vol.16
, Issue.1
, pp. 82-96
-
-
Lian, W.1
Cheung, D.W.L.2
Mamoulis, N.3
Yiu, S.M.4
-
14
-
-
43349098737
-
Intelligent assistance in authoring dynamically generated web interfaces
-
Macías, J.A.: Intelligent assistance in authoring dynamically generated web interfaces. World Wide Web 11(2), 253-286 (2008).
-
(2008)
World Wide Web
, vol.11
, Issue.2
, pp. 253-286
-
-
Macías, J.A.1
-
15
-
-
0002359744
-
User interface directions for the web
-
Nielsen, J.: User interface directions for the web. Commun. ACM 42(1), 65-72 (1999).
-
(1999)
Commun. ACM
, vol.42
, Issue.1
, pp. 65-72
-
-
Nielsen, J.1
-
17
-
-
0001122858
-
The tree-to-tree editing problem
-
Selkow, S.M.: The tree-to-tree editing problem. Inf. Process. Lett. 6, 184-186 (1977).
-
(1977)
Inf. Process. Lett.
, vol.6
, pp. 184-186
-
-
Selkow, S.M.1
-
18
-
-
18744381159
-
Learning block importance models for web pages
-
Song, R., Liu, H., Wen, J.R., Ma, W.Y.: Learning block importance models for web pages. In: Proceedings of the International Conference on the World Wide Web, pp. 203-211 (2004).
-
(2004)
Proceedings of the International Conference On The World Wide Web
, pp. 203-211
-
-
Song, R.1
Liu, H.2
Wen, J.R.3
Ma, W.Y.4
-
19
-
-
0018491659
-
The tree-to-tree correction problem
-
Tai, K.C.: The tree-to-tree correction problem. J. ACM 26(3), 422-433 (1979).
-
(1979)
J. ACM
, vol.26
, Issue.3
, pp. 422-433
-
-
Tai, K.C.1
-
21
-
-
34547631600
-
A fast and robust method for web page template detection and removal
-
Arlington, VA, USA
-
Vieira, K., da Silva, A.S., Pinto, N., de Moura, E.S., Cavalcanti, J.M.B., Freire, J.: A fast and robust method for web page template detection and removal. In: Proceedings of the ACM International Conference on Information and Knowledge Management, Arlington, VA, USA, pp. 258-267 (2006).
-
(2006)
Proceedings of The ACM International Conference On Information and Knowledge Management
, pp. 258-267
-
-
Vieira, K.1
da Silva, A.S.2
Pinto, N.3
de Moura, E.S.4
Cavalcanti, J.M.B.5
Freire, J.6
-
22
-
-
0035196763
-
Finding similar consensus between trees: An algorithm and a distance hierarchy
-
Wang, J.T.L., Zhang, K.: Finding similar consensus between trees: an algorithm and a distance hierarchy. Pattern Recogn. 34, 127-137 (2001).
-
(2001)
Pattern Recogn
, vol.34
, pp. 127-137
-
-
Wang, J.T.L.1
Zhang, K.2
-
23
-
-
0026185673
-
Identifying syntactic differences between two programs
-
Yang, W.: Identifying syntactic differences between two programs. Softw. Pract. Exp. 21(7), 739-755 (1991).
-
(1991)
Softw. Pract. Exp.
, vol.21
, Issue.7
, pp. 739-755
-
-
Yang, W.1
-
24
-
-
77952370025
-
Eliminating noisy information in web pages for data mining
-
Yi, L., Liu, B., Li, X.: Eliminating noisy information in web pages for data mining. In: Proceedings of the International ACM Conference on Knowledge Discovery and Data Mining, pp. 296-305 (2003).
-
(2003)
Proceedings of The International ACM Conference On Knowledge Discovery and Data Mining
, pp. 296-305
-
-
Yi, L.1
Liu, B.2
Li, X.3
-
25
-
-
34247869740
-
Extracting web data using instance-based learning
-
Zhai, Y., Liu, B.: Extracting web data using instance-based learning. World Wide Web 10(2), 113-132 (2007).
-
(2007)
World Wide Web
, vol.10
, Issue.2
, pp. 113-132
-
-
Zhai, Y.1
Liu, B.2
-
26
-
-
0000307499
-
On the editing distance between unordered labeled trees
-
Zhang, K., Statman, R., Shasha, D.: On the editing distance between unordered labeled trees. Inf. Process. Lett. 42(3), 133-139 (1992).
-
(1992)
Inf. Process. Lett.
, vol.42
, Issue.3
, pp. 133-139
-
-
Zhang, K.1
Statman, R.2
Shasha, D.3
|