-
5
-
-
0034207121
-
Min-wise independent permutations
-
A.Z. Broder, M. Charikar, A.M. Frieze, and M. Mitzenmacher, "Min-Wise Independent Permutations," J. Computer and System Sciences, vol. 60, no. 3, pp. 630-659, 2000.
-
(2000)
J. Computer and System Sciences
, vol.60
, Issue.3
, pp. 630-659
-
-
Broder, A.Z.1
Charikar, M.2
Frieze, A.M.3
Mitzenmacher, M.4
-
7
-
-
0033688075
-
Selectivity estimation for boolean queries
-
Z. Chen, F. Korn, N. Koudas, and S. Muithukrishnan, "Selectivity Estimation for Boolean Queries," Proc. ACM SIGMOD-SIGACTSIGART Symp. Principles of Database Systems (PODS), 2000.
-
(2000)
Proc. ACM SIGMOD-SIGACTSIGART Symp. Principles of Database Systems (PODS)
-
-
Chen, Z.1
Korn, F.2
Koudas, N.3
Muithukrishnan, S.4
-
11
-
-
18844436436
-
Clustering Web pages based on their structure
-
DOI 10.1016/j.datak.2004.11.004, PII S0169023X04002137, Fifth ACM International Workshop on Web Information and Data Management (WIDM 2003)
-
V. Crescenzi, P. Merialdo, and P. Missier, "Clustering (Pubitemid 40683780)
-
(2005)
Data and Knowledge Engineering
, vol.54
, Issue.3
, pp. 279-299
-
-
Crescenzi, V.1
Merialdo, P.2
Missier, P.3
-
12
-
-
4644340823
-
Automatic web news extraction using tree edit distance
-
M. De Castro Reis, P.B. Golgher, A.S. Da Silva, and A.H.F. Laender, "Automatic Web News Extraction Using Tree Edit Distance," Proc. 13th Int'l Conf. World Wide Web (WWW), 2004.
-
(2004)
Proc. 13th Int'l Conf. World Wide Web (WWW)
-
-
De Castro Reis, M.1
Golgher, P.B.2
Da Silva, A.S.3
Laender, A.H.F.4
-
14
-
-
0000216094
-
Xtract: A system for extracting document type descriptors from xml documents
-
M.N. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim, "Xtract: A System for Extracting Document Type Descriptors from Xml Documents," Proc. ACM SIGMOD, 2000.
-
(2000)
Proc. ACM SIGMOD
-
-
Garofalakis, M.N.1
Gionis, A.2
Rastogi, R.3
Seshadri, S.4
Shim, K.5
-
16
-
-
3142742483
-
Using the structure of web sites for automatic segmentation of tables
-
K. Lerman, L. Getoor, S. Minton, and C. Knoblock, "Using the Structure of Web Sites for Automatic Segmentation of Tables," Proc. ACM SIGMOD, 2004.
-
(2004)
Proc. ACM SIGMOD
-
-
Lerman, K.1
Getoor, L.2
Minton, S.3
Knoblock, C.4
-
18
-
-
57149147732
-
Crd: Fast co-clustering on large data sets utilizing sampling-based matrix decomposition
-
F. Pan, X. Zhang, and W. Wang, "Crd: Fast Co-Clustering on Large Data Sets Utilizing Sampling-Based Matrix Decomposition," Proc. ACM SIGMOD, 2008.
-
(2008)
Proc. ACM SIGMOD
-
-
Pan, F.1
Zhang, X.2
Wang, W.3
-
20
-
-
0018015137
-
Modeling by shortest data description
-
J. Rissanen, "Modeling by Shortest Data Description," Automatica, vol. 14, pp. 465-471, 1978.
-
(1978)
Automatica
, vol.14
, pp. 465-471
-
-
Rissanen, J.1
-
21
-
-
0003250456
-
Stochastic complexity in statistical inquiry
-
J. Rissanen, Stochastic Complexity in Statistical Inquiry. World Scientific, 1989.
-
(1989)
World Scientific
-
-
Rissanen, J.1
-
22
-
-
34547631600
-
A fast and robust method for web page template detection and removal
-
K. Vieira, A.S. Da Silva, N. Pinto, E.S. De Moura, J.M.B. Cavalcanti, and J. Freire, "A Fast and Robust Method for Web Page Template Detection and Removal," Proc. 15th ACM Int'l Conf. Information and Knowledge Management (CIKM), 2006.
-
(2006)
Proc. 15th ACM Int'l Conf. Information and Knowledge Management (CIKM)
-
-
Vieira, K.1
Da Silva, A.S.2
Pinto, N.3
De Moura, E.S.4
Cavalcanti, J.M.B.5
Freire, J.6
-
24
-
-
33744899132
-
Fully automatic wrapper generation for search engines
-
H. Zhao, W. Meng, Z. Wu, V. Raghavan, and C. Yu, "Fully Automatic Wrapper Generation for Search Engines," Proc. 14th Int'l Conf. World Wide Web (WWW), 2005.
-
(2005)
Proc. 14th Int'l Conf. World Wide Web (WWW)
-
-
Zhao, H.1
Meng, W.2
Wu, Z.3
Raghavan, V.4
Yu, C.5
-
26
-
-
36849062139
-
Joint optimization of wrapper generation and template detection
-
S. Zheng, D. Wu, R. Song, and J.-R. Wen, "Joint Optimization of Wrapper Generation and Template Detection," Proc. ACM SIGKDD, 2007.
-
(2007)
Proc. ACM SIGKDD
-
-
Zheng, S.1
Wu, D.2
Song, R.3
Wen, J.-R.4
|