-
1
-
-
84950645271
-
The predictive sample reuse method with applications
-
S. Geisser, "The predictive sample reuse method with applications," Journal of the American Statistical Association, vol. 70, no. 350, pp. 320-328, 1975.
-
(1975)
Journal of the American Statistical Association
, vol.70
, Issue.350
, pp. 320-328
-
-
Geisser, S.1
-
2
-
-
0031536511
-
Improvements on cross-validation: The 632+ bootstrap method
-
B. Efron and R. Tibshirani, "Improvements on cross-validation: the 632+ bootstrap method," Journal of the American Statistical Association, vol. 92, no. 438, pp. 548-560, 1997.
-
(1997)
Journal of the American Statistical Association
, vol.92
, Issue.438
, pp. 548-560
-
-
Efron, B.1
Tibshirani, R.2
-
3
-
-
0000354976
-
A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods
-
P. Burman, "A comparative study of ordinary cross-validation, v-fold cross-validation and the repeated learning-testing methods," Biometrika, vol. 76, no. 3, pp. 503-514, 1989.
-
(1989)
Biometrika
, vol.76
, Issue.3
, pp. 503-514
-
-
Burman, P.1
-
4
-
-
84862684679
-
Large-scale machine learning at twitter
-
J. Lin and A. Kolcz, "Large-scale machine learning at twitter," Sigmod'12, pp. 793-804.
-
Sigmod'12
, pp. 793-804
-
-
Lin, J.1
Kolcz, A.2
-
5
-
-
0030211964
-
Bagging predictors
-
L. Breiman, "Bagging predictors," Machine learning, vol. 24, no. 2, pp. 123-140, 1996.
-
(1996)
Machine Learning
, vol.24
, Issue.2
, pp. 123-140
-
-
Breiman, L.1
-
6
-
-
79957859069
-
SystemML: Declarative machine learning on MapReduce
-
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, and S. Vaithyanathan, "SystemML: Declarative machine learning on MapReduce," ICDE'11, pp. 231-242.
-
ICDE'11
, pp. 231-242
-
-
Ghoting, A.1
Krishnamurthy, R.2
Pednault, E.3
Reinwald, B.4
Sindhwani, V.5
Tatikonda, S.6
Tian, Y.7
Vaithyanathan, S.8
-
7
-
-
84894647945
-
Mli: An api for distributed machine learning
-
E. R. Sparks, A. Talwalkar, V. Smith, J. Kottalam, X. Pan, J. Gonzalez, M. J. Franklin, M. I. Jordan, and T. Kraska, "Mli: An api for distributed machine learning," ICDM'13.
-
ICDM'13
-
-
Sparks, E.R.1
Talwalkar, A.2
Smith, V.3
Kottalam, J.4
Pan, X.5
Gonzalez, J.6
Franklin, M.J.7
Jordan, M.I.8
Kraska, T.9
-
8
-
-
84870749286
-
-
"Apache Mahout." [Online]. Available: http://mahout.apache.org
-
Apache Mahout
-
-
-
9
-
-
84880526880
-
Cumulon: Optimizing statistical data analysis in the cloud
-
B. Huang, S. Babu, and J. Yang, "Cumulon: optimizing statistical data analysis in the cloud," Sigmod'13, pp. 1-12.
-
Sigmod'13
, pp. 1-12
-
-
Huang, B.1
Babu, S.2
Yang, J.3
-
10
-
-
77951152705
-
Pegasus: A peta-scale graph mining system implementation and observations
-
U. Kang, C. E. Tsourakakis, and C. Faloutsos, "Pegasus: A peta-scale graph mining system implementation and observations," ICDM'09, pp. 229-238.
-
ICDM'09
, pp. 229-238
-
-
Kang, U.1
Tsourakakis, C.E.2
Faloutsos, C.3
-
11
-
-
77955032649
-
Planet: Massively parallel learning of tree ensembles with mapreduce
-
B. Panda, J. S. Herbach, S. Basu, and R. J. Bayardo, "Planet: massively parallel learning of tree ensembles with mapreduce," PVLDB'09, pp. 1426-1437.
-
PVLDB'09
, pp. 1426-1437
-
-
Panda, B.1
Herbach, J.S.2
Basu, S.3
Bayardo, R.J.4
-
12
-
-
84896858630
-
Hybrid parallelization strategies for large-scale machine learning in systemml
-
M. Boehm, S. Tatikonda, B. Reinwald, P. Sen, D. Burdick, and S. Vaithyanathan, "Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML," PVLDB'14.
-
PVLDB'14
-
-
Boehm, M.1
Tatikonda, S.2
Reinwald, B.3
Sen, P.4
Burdick, D.5
Vaithyanathan, S.6
-
13
-
-
85084017339
-
Mlbase: A distributed machine-learning system
-
T. Kraska, A. Talwalkar, J. C. Duchi, R. Griffith, M. Franklin, and M. Jordan, "Mlbase: A distributed machine-learning system," CIDR'13.
-
CIDR'13
-
-
Kraska, T.1
Talwalkar, A.2
Duchi, J.C.3
Griffith, R.4
Franklin, M.5
Jordan, M.6
-
14
-
-
38249002658
-
The computer generation of multinomial random variates
-
C. S. Davis, "The computer generation of multinomial random variates," Computational statistics & data analysis, vol. 16, no. 2, pp. 205-217, 1993.
-
(1993)
Computational Statistics & Data Analysis
, vol.16
, Issue.2
, pp. 205-217
-
-
Davis, C.S.1
-
15
-
-
84880569459
-
Bigbench: Towards an industry standard benchmark for big data analytics
-
A. Ghazal, T. Rabl, M. Hu, F. Raab, M. Poess, A. Crolotte, and H.-A. Jacobsen, "Bigbench: Towards an industry standard benchmark for big data analytics," Sigmod'13, pp. 1197-1208.
-
Sigmod'13
, pp. 1197-1208
-
-
Ghazal, A.1
Rabl, T.2
Hu, M.3
Raab, F.4
Poess, M.5
Crolotte, A.6
Jacobsen, H.-A.7
-
16
-
-
84873130876
-
Myriad: Scalable and expressive data generation
-
A. Alexandrov, K. Tzoumas, and V. Markl, "Myriad: scalable and expressive data generation," PVLDB'12, pp. 1890-1893.
-
PVLDB'12
, pp. 1890-1893
-
-
Alexandrov, A.1
Tzoumas, K.2
Markl, V.3
-
17
-
-
84870452716
-
-
"Apache Hadoop." [Online]. Available: http://hadoop.apache.org
-
Apache Hadoop
-
-
-
18
-
-
84911993592
-
The Stratosphere platform for big data analytics
-
A. Alexandrov, R. Bergmann, S. Ewen, C. Freytag, F. Hueske, A. Heise, O. Kao, M. Leich, U. Leser, V. Markl, and others, "The Stratosphere platform for big data analytics," VLDB Journal'14.
-
VLDB Journal'14
-
-
Alexandrov, A.1
Bergmann, R.2
Ewen, S.3
Freytag, C.4
Hueske, F.5
Heise, A.6
Kao, O.7
Leich, M.8
Leser, U.9
Markl, V.10
-
19
-
-
85040175609
-
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing
-
M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I. Stoica, "Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing," NSDI'12.
-
NSDI'12
-
-
Zaharia, M.1
Chowdhury, M.2
Das, T.3
Dave, A.4
Ma, J.5
McCauley, M.6
Franklin, M.J.7
Shenker, S.8
Stoica, I.9
-
20
-
-
79957872898
-
Hyracks: A flexible and extensible foundation for data-intensive computing
-
V. Borkar, M. Carey, R. Grover, N. Onose, and R. Vernica, "Hyracks: A flexible and extensible foundation for data-intensive computing," ICDE'11, pp. 1151-1162.
-
ICDE'11
, pp. 1151-1162
-
-
Borkar, V.1
Carey, M.2
Grover, R.3
Onose, N.4
Vernica, R.5
-
21
-
-
85008044987
-
Matrix factorization techniques for recommender systems
-
Y. Koren, R. Bell, and C. Volinsky, "Matrix factorization techniques for recommender systems," Computer, vol. 42, pp. 30-37, 2009.
-
(2009)
Computer
, vol.42
, pp. 30-37
-
-
Koren, Y.1
Bell, R.2
Volinsky, C.3
-
23
-
-
51349151355
-
Analysis of incomplete multivariate data
-
J. L. Schafer, Analysis of incomplete multivariate data. CRC, 1997.
-
(1997)
CRC
-
-
Schafer, J.L.1
-
24
-
-
84868307166
-
Mad skills: New analysis practices for big data
-
J. Cohen, B. Dolan, M. Dunlap, J. M. Hellerstein, and C. Welton, "Mad skills: new analysis practices for big data," PVLDB'09, pp. 1481-1492.
-
PVLDB'09
, pp. 1481-1492
-
-
Cohen, J.1
Dolan, B.2
Dunlap, M.3
Hellerstein, J.M.4
Welton, C.5
-
25
-
-
84904296452
-
Major technical advancements in apache hive
-
Y. Huai, A. Chauhan, A. Gates, G. Hagleitner, E. Hanson, O. O'Malley, J. Pandey, Y. Yuan, R. Lee, and X. Zhang, "Major Technical Advancements in Apache Hive," Sigmod'14, pp. 1235-1246.
-
Sigmod'14
, pp. 1235-1246
-
-
Huai, Y.1
Chauhan, A.2
Gates, A.3
Hagleitner, G.4
Hanson, E.5
O'Malley, O.6
Pandey, J.7
Yuan, Y.8
Lee, R.9
Zhang, X.10
-
26
-
-
84907095419
-
R: A language and environment for statistical computing
-
RC Team
-
RC Team, "R: A language and environment for statistical computing," R foundation for Statistical Computing, 2005.
-
(2005)
R Foundation for Statistical Computing
-
-
-
27
-
-
80555140075
-
Scikit-learn: Machine learning in python
-
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg et al., "Scikit-learn: Machine learning in python," JMLR'11, vol. 12, pp. 2825-2830.
-
JMLR'11
, vol.12
, pp. 2825-2830
-
-
Pedregosa, F.1
Varoquaux, G.2
Gramfort, A.3
Michel, V.4
Thirion, B.5
Grisel, O.6
Blondel, M.7
Prettenhofer, P.8
Weiss, R.9
Dubourg, V.10
-
28
-
-
84863735533
-
Distributed Graphlab: A framework for machine learning and data mining in the cloud
-
Y. Low, D. Bickson, J. Gonzalez, C. Guestrin, A. Kyrola, and J. M. Hellerstein, "Distributed Graphlab: a framework for machine learning and data mining in the cloud," PVLDB'12, vol. 5, no. 8, pp. 716-727.
-
PVLDB'12
, vol.5
, Issue.8
, pp. 716-727
-
-
Low, Y.1
Bickson, D.2
Gonzalez, J.3
Guestrin, C.4
Kyrola, A.5
Hellerstein, J.M.6
-
29
-
-
84864210199
-
Extending map-reduce for efficient predicate-based sampling
-
R. Grover and M. Carey, "Extending map-reduce for efficient predicate-based sampling," ICDE'12, pp. 486-497.
-
ICDE'12
, pp. 486-497
-
-
Grover, R.1
Carey, M.2
-
30
-
-
84879511901
-
-
arXiv preprint arXiv:1112.5016
-
A. Kleiner, A. Talwalkar, P. Sarkar, and M. I. Jordan, "A scalable bootstrap for massive data," arXiv preprint arXiv:1112.5016, 2011.
-
(2011)
A Scalable Bootstrap for Massive Data
-
-
Kleiner, A.1
Talwalkar, A.2
Sarkar, P.3
Jordan, M.I.4