-
1
-
-
82155178381
-
-
Amazon ec2. http://aws.amazon.com/ec2/.
-
-
-
-
2
-
-
82155174872
-
-
Hadoop. http://hadoop.apache.org/.
-
-
-
-
3
-
-
82155169994
-
-
Priter project. http://code.google.com/p/priter/.
-
-
-
-
4
-
-
82155178379
-
-
Stanford dataset. http://snap.stanford.edu/data/.
-
-
-
-
5
-
-
57349151435
-
Video suggestion and discovery for youtube: Taking random walks through the view graph
-
S. Baluja, R. Seth, D. Sivakumar, Y. Jing, J. Yagnik, S. Kumar, D. Ravichandran, and M. Aly. Video suggestion and discovery for youtube: taking random walks through the view graph. In WWW '08, pages 895-904, 2008.
-
(2008)
WWW '08
, pp. 895-904
-
-
Baluja, S.1
Seth, R.2
Sivakumar, D.3
Jing, Y.4
Yagnik, J.5
Kumar, S.6
Ravichandran, D.7
Aly, M.8
-
6
-
-
0038589165
-
The anatomy of a large-scale hypertextual web search engine
-
S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. In WWW '98, pages 107-117, 1998.
-
(1998)
WWW '98
, pp. 107-117
-
-
Brin, S.1
Page, L.2
-
7
-
-
79956351190
-
Haloop: Eficient iterative data processing on large clusters
-
Y. Bu, B. Howe, M. Balazinska, and D. M. Ernst. Haloop: Eficient iterative data processing on large clusters. In VLDB '10, 2010.
-
(2010)
VLDB '10
-
-
Bu, Y.1
Howe, B.2
Balazinska, M.3
Ernst, D.M.4
-
8
-
-
85071319367
-
Bigtable: A distributed storage system for structured data
-
Berkeley, CA, USA. USENIX Association
-
F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: a distributed storage system for structured data. In Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7, OSDI '06, pages 15-15, Berkeley, CA, USA, 2006. USENIX Association.
-
(2006)
Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7, OSDI '06
, pp. 15-15
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.C.4
Wallach, D.A.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.E.9
-
9
-
-
56049109090
-
Map-reduce for machine learning on multicore
-
Chu, Cheng T., Kim, Sang K., Lin, Yi A., Yu, Yuanyuan, Bradski, Gary R., Ng, Andrew Y., and Olukotun, Kunle. Map-Reduce for Machine Learning on Multicore. In NIPS, pages 281-288, 2006.
-
(2006)
NIPS
, pp. 281-288
-
-
Chu, C.T.1
Kim, S.K.2
Lin, Y.A.3
Yu, Y.4
Bradski, G.R.5
Ng, A.Y.6
Olukotun, K.7
-
10
-
-
85030321143
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. In OSDI'04, pages 10-10, 2004.
-
(2004)
OSDI'04
, pp. 10-10
-
-
Dean, J.1
Ghemawat, S.2
-
11
-
-
78650003594
-
Twister: A runtime for iterative mapreduce
-
J. Ekanayake, H. Li, B. Zhang, T. Gunarathne, S.-H. Bae, J. Qiu, and G. Fox. Twister: a runtime for iterative mapreduce. In MapReduce '10, pages 810-818, 2010.
-
(2010)
MapReduce '10
, pp. 810-818
-
-
Ekanayake, J.1
Li, H.2
Zhang, B.3
Gunarathne, T.4
Bae, S.-H.5
Qiu, J.6
Fox, G.7
-
12
-
-
77954920597
-
Comet: Batched stream processing for data intensive distributed computing
-
B. He, M. Yang, Z. Guo, R. Chen, B. Su, W. Lin, and L. Zhou. Comet: batched stream processing for data intensive distributed computing. In SoCC '10, pages 63-74, 2010.
-
(2010)
SoCC '10
, pp. 63-74
-
-
He, B.1
Yang, M.2
Guo, Z.3
Chen, R.4
Su, B.5
Lin, W.6
Zhou, L.7
-
13
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: distributed data-parallel programs from sequential building blocks. In EuroSys '07, pages 59-72.
-
EuroSys '07
, pp. 59-72
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
14
-
-
77951152705
-
Pegasus: A peta-scale graph mining system implementation and observations
-
U. Kang, C. Tsourakakis, and C. Faloutsos. Pegasus: A peta-scale graph mining system implementation and observations. In ICDM '09, pages 229-238, 2009.
-
(2009)
ICDM '09
, pp. 229-238
-
-
Kang, U.1
Tsourakakis, C.2
Faloutsos, C.3
-
15
-
-
0002827622
-
A new status index derived from sociometric analysis
-
L. Katz. A new status index derived from sociometric analysis. Psychometrika, 1953.
-
(1953)
Psychometrika
-
-
Katz, L.1
-
17
-
-
77954926935
-
Stateful bulk processing for incremental analytics
-
D. Logothetis, C. Olston, B. Reed, K. C. Webb, and K. Yocum. Stateful bulk processing for incremental analytics. In SoCC '10, pages 51-62, 2010.
-
(2010)
SoCC '10
, pp. 51-62
-
-
Logothetis, D.1
Olston, C.2
Reed, B.3
Webb, K.C.4
Yocum, K.5
-
18
-
-
81455158564
-
-
CoRR, abs/1006.4990
-
Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin, and J. M. Hellerstein. Graphlab: A new framework for parallel machine learning. CoRR, abs/1006.4990, 2010.
-
(2010)
Graphlab: A New Framework for Parallel Machine Learning
-
-
Low, Y.1
Gonzalez, J.2
Kyrola, A.3
Bickson, D.4
Guestrin, C.5
Hellerstein, J.M.6
-
19
-
-
77954723629
-
Pregel: A system for large-scale graph processing
-
G. Malewicz, M. H. Austern, A. J. Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD '10, pages 135-146, 2010.
-
(2010)
SIGMOD '10
, pp. 135-146
-
-
Malewicz, G.1
Austern, M.H.2
Bik, A.J.3
Dehnert, J.C.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
20
-
-
85049119901
-
Ciel: A universal execution engine for distributed data-ow computing
-
D. G. Murray, M. Schwarzkopf, C. Smowton, S. Smith, A. Madhavapeddy, and S. Hand. Ciel: A universal execution engine for distributed data-ow computing. In NSDI'11, 2011.
-
(2011)
NSDI'11
-
-
Murray, D.G.1
Schwarzkopf, M.2
Smowton, C.3
Smith, S.4
Madhavapeddy, A.5
Hand, S.6
-
21
-
-
55349148888
-
Pig latin: A not-so-foreign language for data processing
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig latin: a not-so-foreign language for data processing. In SIGMOD '08, pages 1099-1110, 2008.
-
(2008)
SIGMOD '08
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
22
-
-
70350512695
-
A comparison of approaches to large-scale data analysis
-
A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. DeWitt, S. Madden, and M. Stonebraker. A comparison of approaches to large-scale data analysis. In SIGMOD '09, pages 165-178, 2009.
-
(2009)
SIGMOD '09
, pp. 165-178
-
-
Pavlo, A.1
Paulson, E.2
Rasin, A.3
Abadi, D.J.4
Dewitt, D.J.5
Madden, S.6
Stonebraker, M.7
-
24
-
-
82155188108
-
Piccolo: Building fast, distributed programs with partitioned tables
-
R. Power and J. Li. Piccolo: Building fast, distributed programs with partitioned tables. In OSDI'10, 2010.
-
(2010)
OSDI'10
-
-
Power, R.1
Li, J.2
-
25
-
-
0036993190
-
Unsupervised document classification using sequential information maximization
-
N. Slonim, N. Friedman, and N. Tishby. Unsupervised document classification using sequential information maximization. In SIGIR '02, pages 129-136, 2002.
-
(2002)
SIGIR '02
, pp. 129-136
-
-
Slonim, N.1
Friedman, N.2
Tishby, N.3
-
26
-
-
79951738724
-
Scalable proximity estimation and link prediction in online social networks
-
H. H. Song, T. W. Cho, V. Dave, Y. Zhang, and L. Qiu. Scalable proximity estimation and link prediction in online social networks. In IMC '09, pages 322-335, 2009.
-
(2009)
IMC '09
, pp. 322-335
-
-
Song, H.H.1
Cho, T.W.2
Dave, V.3
Zhang, Y.4
Qiu, L.5
-
27
-
-
84868325513
-
Hive: A warehousing solution over a map-reduce framework
-
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckofi, and R. Murthy. Hive: a warehousing solution over a map-reduce framework. In VLDB '09, pages 1626-1629, 2009.
-
(2009)
VLDB '09
, pp. 1626-1629
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckofi, P.8
Murthy, R.9
-
28
-
-
70349153347
-
User interactions in social networks and their implications
-
C. Wilson, B. Boe, A. Sala, K. P. Puttaswamy, and B. Y. Zhao. User interactions in social networks and their implications. In EuroSys '09, pages 205-218, 2009.
-
(2009)
EuroSys '09
, pp. 205-218
-
-
Wilson, C.1
Boe, B.2
Sala, A.3
Puttaswamy, K.P.4
Zhao, B.Y.5
-
29
-
-
85076882757
-
Dryadlinq: A system for general-purpose distributed data-parallel computing using a high-level language
-
Y. Yu, M. Isard, D. Fetterly, M. Budiu, U. Erlingsson, P. K. Gunda, and J. Currey. Dryadlinq: a system for general-purpose distributed data-parallel computing using a high-level language. In OSDI '08, pages 1-14, 2008.
-
(2008)
OSDI '08
, pp. 1-14
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, U.5
Gunda, P.K.6
Currey, J.7
-
30
-
-
85085251984
-
Spark: Cluster computing with working sets
-
M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica. Spark: cluster computing with working sets. In HotCloud'10, pages 10-10, 2010.
-
(2010)
HotCloud'10
, pp. 10-10
-
-
Zaharia, M.1
Chowdhury, M.2
Franklin, M.J.3
Shenker, S.4
Stoica, I.5
-
31
-
-
85076883048
-
Improving mapreduce performance in heterogeneous environments
-
M. Zaharia, A. Konwinski, A. D. Joseph, R. H. Katz, and I. Stoica. Improving mapreduce performance in heterogeneous environments. In OSDI '08, pages 29-42, 2008.
-
(2008)
OSDI '08
, pp. 29-42
-
-
Zaharia, M.1
Konwinski, A.2
Joseph, A.D.3
Katz, R.H.4
Stoica, I.5
-
32
-
-
85015476469
-
Imapreduce: A distributed computing framework for iterative computation
-
Y. Zhang, Q. Gao, L. Gao, and C. Wang. imapreduce: A distributed computing framework for iterative computation. In DataCloud '11, 2011.
-
(2011)
DataCloud '11
-
-
Zhang, Y.1
Gao, Q.2
Gao, L.3
Wang, C.4
-
33
-
-
77949497025
-
Solving the apparent diversity-accuracy dilemma of recommender systems
-
March
-
T. Zhou, Z. Kuscsik, J.-G. Liu, M. Medo, J. R. Wakeling, and Y.-C. Zhang. Solving the apparent diversity-accuracy dilemma of recommender systems. Proceedings of the National Academy of Sciences, 107(10):4511-4515, March 2010.
-
(2010)
Proceedings of the National Academy of Sciences
, vol.107
, Issue.10
, pp. 4511-4515
-
-
Zhou, T.1
Kuscsik, Z.2
Liu, J.-G.3
Medo, M.4
Wakeling, J.R.5
Zhang, Y.-C.6
|