-
1
-
-
37349017556
-
Adaptive-sampling algorithms for answering aggregation queries on Web sites
-
DOI 10.1016/j.datak.2007.09.014, PII S0169023X07001814
-
Foto N. Afrati, Paraskevas V. Lekeas, and Chen Li. Adaptive-sampling algorithms for answering aggregation queries on web sites. Data Knowl. Eng., 64(2):462-490, 2008. (Pubitemid 350297498)
-
(2008)
Data and Knowledge Engineering
, vol.64
, Issue.2
, pp. 462-490
-
-
Afrati, F.N.1
Lekeas, P.V.2
Li, C.3
-
3
-
-
74549198813
-
Mining search engine query logs via suggestion sampling
-
Ziv Bar-Yossef and Maxim Gurevich. Mining search engine query logs via suggestion sampling. Proc. VLDB Endow., 1(1):54-65, 2008.
-
(2008)
Proc. VLDB Endow.
, vol.1
, Issue.1
, pp. 54-65
-
-
Ziv, B.-Y.1
Gurevich, M.2
-
4
-
-
23044527560
-
Detecting group differences: Mining contrast sets
-
Stephen D. Bay and Michael J. Pazzani. Detecting group differences: Mining contrast sets. Data Mining and Knowledge Discovery, 5(3):213-246, 2001.
-
(2001)
Data Mining and Knowledge Discovery
, vol.5
, Issue.3
, pp. 213-246
-
-
Bay, S.D.1
Pazzani, M.J.2
-
6
-
-
67049119060
-
A randomized approach for approximating the number of frequent sets
-
Washington, DC, USA IEEE Computer Society
-
Mario Boley and Henrik Grosskreutz. A randomized approach for approximating the number of frequent sets. In ICDM '08: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pages 43-52, Washington, DC, USA, 2008. IEEE Computer Society.
-
(2008)
ICDM '08: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
, pp. 43-52
-
-
Boley, M.1
Grosskreutz, H.2
-
7
-
-
70350414002
-
Optimization of multidomain queries on the web
-
D. Braga, S. Ceri, F. Daniel, and D. Martinenghi. Optimization of Multidomain Queries on the Web. VLDB Endowment, 1:562-673, 2008.
-
(2008)
VLDB Endowment
, vol.1
, pp. 562-673
-
-
Braga, D.1
Ceri, S.2
Daniel, F.3
Martinenghi, D.4
-
8
-
-
85011514513
-
Monte Carlo and quasi-Monte Carlo methods
-
R. E. Caflisch. Monte carlo and quasi-monte carlo methods. Acta Numerica, 7:1-49, 1998.
-
(1998)
Acta Numerica
, vol.7
, pp. 1-49
-
-
Caflisch, R.E.1
-
10
-
-
0242625264
-
A new two-phase sampling based algorithm for discovering association rules
-
New York, NY, USA ACM
-
Bin Chen, Peter Haas, and Peter Scheuermann. A new two-phase sampling based algorithm for discovering association rules. In KDD '02: Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 462-468, New York, NY, USA, 2002. ACM.
-
(2002)
KDD '02: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 462-468
-
-
Chen, B.1
Haas, P.2
Scheuermann, P.3
-
12
-
-
35448942785
-
A random walk approach to sampling hidden databases
-
DOI 10.1145/1247480.1247550, SIGMOD 2007: Proceedings of the ACM SIGMOD International Conference on Management of Data
-
Arjun Dasgupta, Gautam Das, and Heikki Mannila. A random walk approach to sampling hidden databases. In SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pages 629-640, New York, NY, USA, 2007. ACM. (Pubitemid 47630840)
-
(2007)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 629-640
-
-
Dasgupta, A.1
Das, G.2
Mannila, H.3
-
13
-
-
77954730150
-
Unbiased estimation of size and other aggregates over hidden web databases
-
New York, NY, USA ACM
-
Arjun Dasgupta, Xin Jin, Bradley Jewell, Nan Zhang, and Gautam Das. Unbiased estimation of size and other aggregates over hidden web databases. In SIGMOD '10: Proceedings of the 2010 international conference on Management of data, pages 855-866, New York, NY, USA, 2010. ACM.
-
(2010)
SIGMOD '10: Proceedings of the 2010 International Conference on Management of Data
, pp. 855-866
-
-
Dasgupta, A.1
Jin, X.2
Jewell, B.3
Zhang, N.4
Das, G.5
-
14
-
-
67649671784
-
Leveraging count information in sampling hidden databases
-
Washington, DC, USA IEEE Computer Society
-
Arjun Dasgupta, Nan Zhang, and Gautam Das. Leveraging count information in sampling hidden databases. In ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering, pages 329-340, Washington, DC, USA, 2009. IEEE Computer Society.
-
(2009)
ICDE '09: Proceedings of the 2009 IEEE International Conference on Data Engineering
, pp. 329-340
-
-
Dasgupta, A.1
Zhang, N.2
Das, G.3
-
15
-
-
70349243816
-
Mining influential attributes that capture class and group contrast behaviour
-
New York, NY, USA ACM
-
Loekito Elsa and Bailey James. Mining influential attributes that capture class and group contrast behaviour. In CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management, pages 971-980, New York, NY, USA, 2008. ACM.
-
(2008)
CIKM '08: Proceeding of the 17th ACM Conference on Information and Knowledge Management
, pp. 971-980
-
-
Elsa, L.1
James, B.2
-
17
-
-
33749648773
-
New sampling-based estimators for OLAP queries
-
DOI 10.1109/ICDE.2006.106, 1617386, Proceedings of the 22nd International Conference on Data Engineering, ICDE '06
-
Ruoming Jin, Leonid Glimcher, Chris Jermaine, and Gagan Agrawal. New sampling-based estimators for olap queries. In ICDE, page 18, 2006. (Pubitemid 44539810)
-
(2006)
Proceedings - International Conference on Data Engineering
, vol.2006
, pp. 18
-
-
Jin, R.1
Glimcher, L.2
Jermaine, C.3
Agrawal, G.4
-
18
-
-
52649114729
-
Robust stratified sampling plans for low selectivity queries
-
Shantanu Joshi and Christopher M. Jermaine. Robust stratified sampling plans for low selectivity queries. In ICDE, pages 199-208, 2008.
-
(2008)
ICDE
, pp. 199-208
-
-
Joshi, S.1
Jermaine, C.M.2
-
19
-
-
35348870148
-
-
Springer-Verlag New York, Inc., Secaucus, NJ, USA
-
Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications). Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2006.
-
(2006)
Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-centric Systems and Applications)
-
-
Liu, B.1
-
21
-
-
0001316388
-
Recursive stratified sampling for multidimensional Monte Carlo integration
-
William H. Press and Glennys R. Farrar. Recursive stratified sampling for multidimensional monte carlo integration. Comput. Phys., 4(2):190-195, 1990.
-
(1990)
Comput. Phys.
, vol.4
, Issue.2
, pp. 190-195
-
-
Press, W.H.1
Farrar, G.R.2
-
22
-
-
0002663971
-
Sampling large databases for association rules
-
Morgan Kaufmann
-
Hannu Toivonen. Sampling large databases for association rules. In The VLDB Journal, pages 134-145. Morgan Kaufmann, 1996.
-
(1996)
The VLDB Journal
, pp. 134-145
-
-
Toivonen, H.1
-
23
-
-
47649100510
-
Snpminer: A domain-specific deep web mining tool
-
Fan Wang, Gagan Agrawal, Ruoming Jin, and Helen Piontkivska. Snpminer: A domain-specific deep web mining tool. In Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering, pages 192-199, 2007.
-
(2007)
Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering
, pp. 192-199
-
-
Wang, F.1
Agrawal, G.2
Jin, R.3
Piontkivska, H.4
-
24
-
-
63449122094
-
Guessing the extreme values in a data set: A Bayesian method and its applications
-
Mingxi Wu and Chris Jermaine. Guessing the extreme values in a data set: a bayesian method and its applications. VLDB J., 18(2):571-597, 2009.
-
(2009)
VLDB J.
, vol.18
, Issue.2
, pp. 571-597
-
-
Wu, M.1
Jermaine, C.2
|