-
2
-
-
67649647516
-
Crawling the content hidden behind web forms
-
M. Alvarez, J. Raposo, A. Pan, F. Cacheda, F. Bellas, and V. Carneiro. Crawling the content hidden behind web forms. In ICCSA, 2007.
-
(2007)
ICCSA
-
-
Alvarez, M.1
Raposo, J.2
Pan, A.3
Cacheda, F.4
Bellas, F.5
Carneiro, V.6
-
4
-
-
1142303671
-
Dynamic sample selection for approximate query processing
-
B. Babcock, S. Chaudhuri, and G. Das. Dynamic sample selection for approximate query processing. In SIGMOD, 2003.
-
(2003)
SIGMOD
-
-
Babcock, B.1
Chaudhuri, S.2
Das, G.3
-
5
-
-
35348840330
-
Efficient search engine measurements
-
Z. Bar-Yossef and M. Gurevich. Efficient search engine measurements. In WWW, 2007.
-
(2007)
WWW
-
-
Bar-Yossef, Z.1
Gurevich, M.2
-
6
-
-
74549198813
-
Mining search engine query logs via suggestion sampling
-
Z. Bar-Yossef and M. Gurevich. Mining search engine query logs via suggestion sampling. In VLDB, 2008.
-
(2008)
VLDB
-
-
Bar-Yossef, Z.1
Gurevich, M.2
-
7
-
-
56349136928
-
Random sampling from a search engine's corpus
-
Z. Bar-Yossef and M. Gurevich. Random sampling from a search engine's corpus. Journal of the ACM, 55(5), 2008.
-
(2008)
Journal of the ACM
, vol.55
, pp. 5
-
-
Bar-Yossef, Z.1
Gurevich, M.2
-
8
-
-
27544439829
-
A technique for measuring the relative size and overlap of public web search engines
-
K. Bharat and A. Broder. A technique for measuring the relative size and overlap of public web search engines. In WWW, 1998.
-
(1998)
WWW
-
-
Bharat, K.1
Broder, A.2
-
9
-
-
34547629212
-
Estimating corpus size via queries
-
A. Broder, M. Fontura, V. Josifovski, R. Kumar, R. Motwani, S. U. Nabar, R. Panigrahy, A. Tomkins, and Y. Xu. Estimating corpus size via queries. In CIKM, 2006.
-
(2006)
CIKM
-
-
Broder, A.1
Fontura, M.2
Josifovski, V.3
Kumar, R.4
Motwani, R.5
Nabar, S.U.6
Panigrahy, R.7
Tomkins, A.8
Xu, Y.9
-
10
-
-
0036211203
-
Evaluating top-k queries over web-accessible databases
-
N. Bruno, L. Gravano, and A. Marian. Evaluating top-k queries over web-accessible databases. In ICDE, 2002.
-
(2002)
ICDE
-
-
Bruno, N.1
Gravano, L.2
Marian, A.3
-
11
-
-
0002104204
-
Query-based sampling of text databases
-
J. P. Callan and M. E. Connell. Query-based sampling of text databases. ACM TOIS, 19(2):97-130, 2001.
-
(2001)
ACM TOIS
, vol.19
, Issue.2
, pp. 97-130
-
-
Callan, J.P.1
Connell, M.E.2
-
12
-
-
0036372482
-
Minimal probing: Supporting expensive predicates for top-k queries
-
K. C.-C. Chang and S. won Hwang. Minimal probing: supporting expensive predicates for top-k queries. In SIGMOD, 2002.
-
(2002)
SIGMOD
-
-
Chang, K.C.-C.1
Won Hwang, S.2
-
13
-
-
35448942785
-
A random walk approach to sampling hidden databases
-
A. Dasgupta, G. Das, and H. Mannila. A random walk approach to sampling hidden databases. In SIGMOD, 2007.
-
(2007)
SIGMOD
-
-
Dasgupta, A.1
Das, G.2
Mannila, H.3
-
14
-
-
67649671784
-
Leveraging count information in sampling hidden databases
-
A. Dasgupta, N. Zhang, and G. Das. Leveraging count information in sampling hidden databases. In ICDE, 2009.
-
(2009)
ICDE
-
-
Dasgupta, A.1
Zhang, N.2
Das, G.3
-
15
-
-
70849084625
-
Privacy preservation of aggregates in hidden databases: Why and how?
-
A. Dasgupta, N. Zhang, G. Das, and S. Chaudhuri. Privacy preservation of aggregates in hidden databases: Why and how? In SIGMOD, 2009.
-
(2009)
SIGMOD
-
-
Dasgupta, A.1
Zhang, N.2
Das, G.3
Chaudhuri, S.4
-
16
-
-
0028447111
-
Quickly generating billion-record synthetic databases
-
J. Gray, P. Sundaresan, S. Englert, K. Baclawski, and P. J. Weinberger. Quickly generating billion-record synthetic databases. In SIGMOD, 1994.
-
(1994)
SIGMOD
-
-
Gray, J.1
Sundaresan, P.2
Englert, S.3
Baclawski, K.4
Weinberger, P.J.5
-
17
-
-
18744389475
-
A two-phase sampling technique for information extraction from hidden web databases
-
Y.-L. Hedley, M. Younas, A. E. James, and M. Sanderson. A two-phase sampling technique for information extraction from hidden web databases. In WIDM, 2004.
-
(2004)
WIDM
-
-
Hedley, Y.-L.1
Younas, M.2
James, A.E.3
Sanderson, M.4
-
18
-
-
33748195920
-
Sampling, information extraction and summarisation of hidden web databases
-
Y.-L. Hedley, M. Younas, A. E. James, and M. Sanderson. Sampling, information extraction and summarisation of hidden web databases. Data and Knowledge Engineering, 59(2):213-230, 2006.
-
(2006)
Data and Knowledge Engineering
, vol.59
, Issue.2
, pp. 213-230
-
-
Hedley, Y.-L.1
Younas, M.2
James, A.E.3
Sanderson, M.4
-
21
-
-
0037818401
-
Discovering the representative of a search engine
-
Y. C. Liu, K. and W. Meng. Discovering the representative of a search engine. In CIKM, 2002.
-
(2002)
CIKM
-
-
Liu, Y.C.1
Meng, K.W.2
-
22
-
-
70349243672
-
Efficient estimation of the size of text deep web data source
-
J. Lu. Efficient estimation of the size of text deep web data source. In CIKM, 2008.
-
(2008)
CIKM
-
-
Lu, J.1
-
23
-
-
27544458897
-
Downloading textual hidden web content through keyword queries
-
A. Ntoulas, P. Zerfos, and J. Cho. Downloading textual hidden web content through keyword queries. In JCDL, 2005.
-
(2005)
JCDL
-
-
Ntoulas, A.1
Zerfos, P.2
Cho, J.3
-
24
-
-
67649663838
-
Distributed search over the hidden web: Hierarchical database sampling and selection
-
L. G. Panagiotis G. Ipeirotis. Distributed search over the hidden web: Hierarchical database sampling and selection. In VLDB, 2002.
-
(2002)
VLDB
-
-
Panagiotis, L.G.1
Ipeirotis, G.2
-
26
-
-
0004007880
-
-
Wiley & Sons, New York
-
B. Ripley. Stochastic Simulation. Wiley & Sons, New York, 1987.
-
(1987)
Stochastic Simulation
-
-
-
28
-
-
33750285514
-
Capturing collection size for distributed non-cooperative retrieval
-
M. Shokouhi, J. Zobel, F. Scholer, and S. Tahaghoghi. Capturing collection size for distributed non-cooperative retrieval. In SIGIR, 2006.
-
(2006)
SIGIR
-
-
Shokouhi, M.1
Zobel, J.2
Scholer, F.3
Tahaghoghi, S.4
|