-
1
-
-
84881327635
-
-
Earl release website
-
Earl release website: http://yellowstone.cs.ucla.edu/wis/.
-
-
-
-
3
-
-
77952586749
-
-
-
Tyson Condie, Neil Conway, Peter Alvaro, Joseph M. Hellerstein, Khaled Elmeleegy, and Russell Sears. MapReduce Online. Technical Report UCB/EECS-2009-136, EECS Department, University of California, Berkeley, Oct 2009.
-
(2009)
MapReduce Online
-
-
Condie, T.
Conway, N.
Alvaro, P.
Hellerstein, J.M.
Elmeleegy, K.
Sears, R.
-
4
-
-
0002344794
-
Bootstrap methods: Another look at the jackknife
-
B. Efron. Bootstrap methods: Another look at the jackknife. Annals of Statistics, 7(1):1-26, 1979.
-
(1979)
Annals of Statistics
, vol.7
, Issue.1
, pp. 1-26
-
-
Efron, B.
-
6
-
-
84864210199
-
Extending map-reduce for efficient predicate-based sampling
-
Raman Grover and Michael Carey. Extending map-reduce for efficient predicate-based sampling. ICDE '12, 2012.
-
(2012)
ICDE '12
-
-
Grover, R.
Carey, M.
-
7
-
-
84863165439
-
The Elements of Statistical Learning
-
-
Trevor Hastie, Robert Tibshirani, and Jerome Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer Series in Statistics. Springer, 2nd edition, February 2009.
-
(2009)
Springer Series in Statistics
-
-
Hastie, T.
Tibshirani, R.
Friedman, J.
-
8
-
-
80053500227
-
Starfish: A self-tuning system for big data analytics
-
Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma Bilgen Cetin, and Shivnath Babu. Starfish: A self-tuning system for big data analytics. In CIDR, pages 261-272, 2011.
-
(2011)
CIDR
, pp. 261-272
-
-
Herodotou, H.
Lim, H.
Luo, G.
Borisov, N.
Dong, L.
Cetin, F.B.
Babu, S.
-
9
-
-
84873118945
-
Early accurate results for advanced analytics on MapReduce
-
Nikolay Laptev, Kai Zeng, and Carlo Zaniolo. Early accurate results for advanced analytics on MapReduce. PVLDB, 5(10):1028-1039, 2012.
-
(2012)
PVLDB
, vol.5
, Issue.10
, pp. 1028-1039
-
-
Laptev, N.
Zeng, K.
Zaniolo, C.
-
10
-
-
0038237368
-
Estimating dataset size requirements for classifying DNA microarray data
-
Sayan Mukherjee, Pablo Tamayo, Simon Rogers, Ryan M. Rifkin, Anna Engle, Colin Campbell, Todd R. Golub, and Jill P. Mesirov. Estimating dataset size requirements for classifying DNA microarray data. Journal of Computational Biology, pages 119-142, 2003.
-
(2003)
Journal of Computational Biology
, pp. 119-142
-
-
Mukherjee, S.
Tamayo, P.
Rogers, S.
Rifkin, R.M.
Engle, A.
Campbell, C.
Golub, T.R.
Mesirov, J.P.
-
11
-
-
85032328251
-
Random sampling from database files: A survey
-
Frank Olken and Doron Rotem. Random sampling from database files: A survey. In SSDBM, pages 92-111, 1990.
-
(1990)
SSDBM
, pp. 92-111
-
-
Olken, F.
Rotem, D.
-
12
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
-
Christopher Olston, Benjamin Reed, Utkarsh Srivastava, Ravi Kumar, and Andrew Tomkins. Pig Latin: A not-so-foreign language for data processing. In SIGMOD, pages 1099-1110, New York, NY, USA, 2008. ACM.
-
(2008)
SIGMOD
, pp. 1099-1110
-
-
Olston, C.
Reed, B.
Srivastava, U.
Kumar, R.
Tomkins, A.
-
13
-
-
84863769684
-
Online aggregation for large MapReduce jobs
-
Niketan Pansare, Vinayak R. Borkar, Chris Jermaine, and Tyson Condie. Online aggregation for large MapReduce jobs. PVLDB, 4(11):1135-1145, 2011.
-
(2011)
PVLDB
, vol.4
, Issue.11
, pp. 1135-1145
-
-
Pansare, N.
Borkar, V.R.
Jermaine, C.
Condie, T.
-
14
-
-
77951180196
-
Interpreting the data: Parallel analysis with Sawzall
-
Google Inc.
-
Rob Pike, Sean Dorward, Robert Griesemer, and Sean Quinlan (Google Inc.). Interpreting the data: Parallel analysis with Sawzall. Scientific Programming Journal, Special Issue on Grids and Worldwide Computing Programming Models and Infrastructure, pages 227-298.
-
Scientific Programming Journal, Special Issue on Grids and Worldwide Computing Programming Models and Infrastructure
, pp. 227-298
-
-
Pike, R.
Dorward, S.
Griesemer, R.
Quinlan, S.
-
15
-
-
83755163018
-
Detecting novel associations in large data sets
-
David N. Reshef, Yakir A. Reshef, Hilary K. Finucane, Sharon R. Grossman, Gilean McVean, Peter J. Turnbaugh, Eric S. Lander, Michael Mitzenmacher, and Pardis C. Sabeti. Detecting novel associations in large data sets. Science, 334(6062):1518-1524, 2011.
-
(2011)
Science
, vol.334
, Issue.6062
, pp. 1518-1524
-
-
Reshef, D.N.
Reshef, Y.A.
Finucane, H.K.
Grossman, S.R.
McVean, G.
Turnbaugh, P.J.
Lander, E.S.
Mitzenmacher, M.
Sabeti, P.C.
-
18
-
-
84868325513
-
Hive - A warehousing solution over a map-reduce framework
-
Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, and Raghotham Murthy. Hive - A warehousing solution over a map-reduce framework. In VLDB, pages 1626-1629, 2009.
-
(2009)
VLDB
, pp. 1626-1629
-
-
Thusoo, A.
Sen Sarma, J.
Jain, N.
Shao, Z.
Chakka, P.
Anthony, S.
Liu, H.
Wyckoff, P.
Murthy, R.
-
19
-
-
0001024505
-
On the uniform convergence of relative frequencies of events to their probabilities
-
V. N. Vapnik and A. Y. Chervonenkis. On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and Its Applications, 16(2):264-280, 1971.
-
(1971)
Theory of Probability and Its Applications
, vol.16
, Issue.2
, pp. 264-280
-
-
Vapnik, V.N.
Chervonenkis, A.Y.