Volume 23, Issue 6, 2014, Pages 939-964

The Stratosphere platform for big data analytics

Author keywords

Big data; Data cleansing; Distributed systems; Graph processing; Parallel databases; Query optimization; Query processing; Text mining

Indexed keywords

ADVANCED ANALYTICS; BIG DATA; DATA ANALYTICS; DATA HANDLING; DATA WAREHOUSES; DISTRIBUTED DATABASE SYSTEMS; IN SITU PROCESSING; OPEN SOURCE SOFTWARE; OPEN SYSTEMS; PARALLEL PROCESSING SYSTEMS; QUERY LANGUAGES; QUERY PROCESSING; TEXT MINING

EID: 84911993592     PISSN: 1066-8888     EISSN: 0949-877X     Source Type: Journal
DOI: 10.1007/s00778-014-0357-y     Document Type: Article
Times cited: 391

References (74)
  • 4
    • 84871108045
    • Apache Giraph. http://incubator.apache.org/giraph/
    • Apache Giraph
  • 5
  • 6
    • 84871119172
    • Apache Hive. http://sortbenchmark.org/
    • Apache Hive
  • 7
    • 84871119172
    • Apache Hive. http://sortbenchmark.org/
    • Apache Hive
  • 8
    • 77954948422
    • Nephele/pacts: a programming model and execution framework for web-scale analytical processing
    • Battré, D., Ewen, S., Hueske, F., Kao, O., Markl, V., Warneke, D.: Nephele/pacts: a programming model and execution framework for web-scale analytical processing. In: SoCC, pp. 119–130 (2010)
    • (2010) SoCC , pp. 119-130
    • Battré, D.1    Ewen, S.2    Hueske, F.3    Kao, O.4    Markl, V.5    Warneke, D.6
  • 9
    • 80053168613
    • Evaluation of network topology inference in opaque compute clouds through end-to-end measurements
    • Battré, D., Frejnik, N., Goel, S., Kao, O., Warneke, D.: Evaluation of network topology inference in opaque compute clouds through end-to-end measurements. In: IEEE CLOUD, pp. 17–24 (2011)
    • (2011) IEEE CLOUD , pp. 17-24
    • Battré, D.1    Frejnik, N.2    Goel, S.3    Kao, O.4    Warneke, D.5
  • 10
    • 79961143579
    • Inferring network topologies in infrastructure as a service cloud
    • Battré, D., Frejnik, N., Goel, S., Kao, O., Warneke, D.: Inferring network topologies in infrastructure as a service cloud. In: CCGRID, pp. 604–605 (2011)
    • (2011) CCGRID , pp. 604-605
    • Battré, D.1    Frejnik, N.2    Goel, S.3    Kao, O.4    Warneke, D.5
  • 14
    • 84912046052
    • Large-scale social media analytics on stratosphere
    • Boden, C., Karnstedt, M., Fernandez, M., Markl, V.: Large-scale social media analytics on stratosphere. In: WWW (2013)
    • (2013) WWW
    • Boden, C.1    Karnstedt, M.2    Fernandez, M.3    Markl, V.4
  • 15
    • 79957872898
    • Hyracks: a flexible and extensible foundation for data-intensive computing
    • Borkar, V.R., Carey, M.J., Grover, R., Onose, N., Vernica, R.: Hyracks: a flexible and extensible foundation for data-intensive computing. In: ICDE, pp. 1151–1162 (2011)
    • (2011) ICDE , pp. 1151-1162
    • Borkar, V.R.1    Carey, M.J.2    Grover, R.3    Onose, N.4    Vernica, R.5
  • 17
    • 84890768200
    • Measuring user influence in twitter: the million follower fallacy
    • Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in twitter: the million follower fallacy. In: ICWSM (2010)
    • (2010) ICWSM
    • Cha, M.1    Haddadi, H.2    Benevenuto, F.3    Gummadi, P.K.4
  • 20
    • 0008753064
    • Including group-by in query optimization
    • Chaudhuri, S., Shim, K.: Including group-by in query optimization. In: VLDB, pp. 354–366 (1994)
    • (1994) VLDB , pp. 354-366
    • Chaudhuri, S.1    Shim, K.2
  • 21
    • 67651111624
    • Graph twiddling in a mapreduce world
    • Cohen, J.: Graph twiddling in a mapreduce world. Comput. Sci. Eng. 11(4), 29–41 (2009)
    • (2009) Comput. Sci. Eng. , vol.11 , Issue.4 , pp. 29-41
    • Cohen, J.1
  • 22
    • 85030321143
    • Mapreduce: simplified data processing on large clusters
    • Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. In: OSDI, pp. 137–150 (2004)
    • (2004) OSDI , pp. 137-150
    • Dean, J.1    Ghemawat, S.2
  • 24
    • 0042078549
    • A survey of rollback-recovery protocols in message-passing systems
    • Elnozahy, E.N.M., Alvisi, L., Wang, Y.M., Johnson, D.B.: A survey of rollback-recovery protocols in message-passing systems. ACM Comput. Surv. 34(3), 375–408 (2002)
    • (2002) ACM Comput. Surv. , vol.34 , Issue.3 , pp. 375-408
    • Elnozahy, E.N.M.1    Alvisi, L.2    Wang, Y.M.3    Johnson, D.B.4
  • 25
    • 84880522701
    • Iterative parallel data processing with stratosphere: an inside look
    • Ewen, S., Schelter, S., Tzoumas, K., Warneke, D., Markl, V.: Iterative parallel data processing with stratosphere: an inside look. In: SIGMOD (2013)
    • (2013) SIGMOD
    • Ewen, S.1    Schelter, S.2    Tzoumas, K.3    Warneke, D.4    Markl, V.5
  • 26
    • 84870501280
    • Spinning fast iterative data flows
    • Ewen, S., Tzoumas, K., Kaufmann, M., Markl, V.: Spinning fast iterative data flows. PVLDB 5(11), 1268–1279 (2012)
    • (2012) PVLDB , vol.5 , Issue.11 , pp. 1268-1279
    • Ewen, S.1    Tzoumas, K.2    Kaufmann, M.3    Markl, V.4
  • 27
    • 84863507276
    • An optimization framework for map-reduce queries
    • Fegaras, L., Li, C., Gupta, U.: An optimization framework for map-reduce queries. In: EDBT, pp. 26–37 (2012)
    • (2012) EDBT , pp. 26-37
    • Fegaras, L.1    Li, C.2    Gupta, U.3
  • 28
    • 0022821510
    • An overview of the system software of a parallel relational database machine grace
    • Fushimi, S., Kitsuregawa, M., Tanaka, H.: An overview of the system software of a parallel relational database machine grace. In: VLDB, pp. 209–219 (1986)
    • (1986) VLDB , pp. 209-219
    • Fushimi, S.1    Kitsuregawa, M.2    Tanaka, H.3
  • 30
    • 0002196955
    • Hash joins and hash teams in microsoft sql server
    • Graefe, G., Bunker, R., Cooper, S.: Hash joins and hash teams in microsoft sql server. In: VLDB, pp. 86–97 (1998)
    • (1998) VLDB , pp. 86-97
    • Graefe, G.1    Bunker, R.2    Cooper, S.3
  • 31
    • 33749342609
    • Implementing sorting in database systems
    • Graefe, G.: Implementing sorting in database systems. ACM Comput. Surv. 38(3), Article ID 10 (2006)
    • (2006) ACM Comput. Surv. , vol.38 , Issue.3 , Article ID 10
    • Graefe, G.1
  • 32
    • 79954553074
    • Parallel query execution algorithms
    • Graefe, G.: Parallel query execution algorithms. In: Encyclopedia of Database Systems, pp. 2030–2035 (2009)
    • (2009) Encyclopedia of Database Systems , pp. 2030-2035
    • Graefe, G.1
  • 33
    • 0028381846
    • Volcano—an extensible and parallel query evaluation system
    • Graefe, G.: Volcano—an extensible and parallel query evaluation system. IEEE Trans. Knowl. Data Eng. 6(1), 120–135 (1994)
    • (1994) IEEE Trans. Knowl. Data Eng. , vol.6 , Issue.1 , pp. 120-135
    • Graefe, G.1
  • 34
    • 84871175805
    • Spotting code optimizations in data-parallel pipelines through periscope
    • Guo, Z., Fan, X., Chen, R., Zhang, J., Zhou, H., McDirmid, S., Liu, C., Lin, W., Zhou, J., Zhou, L.: Spotting code optimizations in data-parallel pipelines through periscope. In: OSDI, pp. 121–133 (2012)
    • (2012) OSDI , pp. 121-133
    • Guo, Z.1    Fan, X.2    Chen, R.3    Zhang, J.4    Zhou, H.5    McDirmid, S.6    Liu, C.7    Lin, W.8    Zhou, J.9    Zhou, L.10
  • 38
    • 84862020063
    • Integrating open government data with stratosphere for more transparency
    • Heise, A., Naumann, F.: Integrating open government data with stratosphere for more transparency. Web Semant.: Sci. Serv. Agents World Wide Web 14, 45–56 (2012)
    • (2012) Web Semant.: Sci. Serv. Agents World Wide Web , vol.14 , pp. 45-56
    • Heise, A.1    Naumann, F.2
  • 39
    • 84895198393
    • Ephemeral materialization points in stratosphere data management on the cloud
    • Höger, M., Kao, O., Richter, P., Warneke, D.: Ephemeral materialization points in stratosphere data management on the cloud. Adv. Parallel Comput. 23, 163–181 (2013)
    • (2013) Adv. Parallel Comput. , vol.23 , pp. 163-181
    • Höger, M.1    Kao, O.2    Richter, P.3    Warneke, D.4
  • 40
    • 83455172022
    • Evaluating adaptive compression to mitigate the effects of shared i/o in clouds
    • Hovestadt, M., Kao, O., Kliem, A., Warneke, D.: Evaluating adaptive compression to mitigate the effects of shared i/o in clouds. In: IPDPS Workshops, pp. 1042–1051 (2011)
    • (2011) IPDPS Workshops , pp. 1042-1051
    • Hovestadt, M.1    Kao, O.2    Kliem, A.3    Warneke, D.4
  • 41
    • 84912010820
    • Enabling operator reordering in data flow programs through static code analysis
    • Hueske, F., Krettek, A., Tzoumas, K.: Enabling operator reordering in data flow programs through static code analysis. CoRR abs/1301.4200 (2013)
    • (2013) CoRR , abs/1301.4200
    • Hueske, F.1    Krettek, A.2    Tzoumas, K.3
  • 44
    • 34548041192
    • Dryad: distributed data-parallel programs from sequential building blocks
    • Isard, M., Budiu, M., Yu, Y., Birrell, A., Fetterly, D.: Dryad: distributed data-parallel programs from sequential building blocks. In: EuroSys, pp. 59–72 (2007)
    • (2007) EuroSys , pp. 59-72
    • Isard, M.1    Budiu, M.2    Yu, Y.3    Birrell, A.4    Fetterly, D.5
  • 45
    • 84863535860
    • Automatic optimization for mapreduce programs
    • Jahani, E., Cafarella, M.J., Ré, C.: Automatic optimization for mapreduce programs. PVLDB 4(6), 385–396 (2011)
    • (2011) PVLDB , vol.4 , Issue.6 , pp. 385-396
    • Jahani, E.1    Cafarella, M.J.2    Ré, C.3
  • 49
    • 77951152705
    • Pegasus: a peta-scale graph mining system
    • Kang, U., Tsourakakis, C.E., Faloutsos, C.: Pegasus: a peta-scale graph mining system. In: ICDM, pp. 229–238 (2009)
    • (2009) ICDM , pp. 229-238
    • Kang, U.1    Tsourakakis, C.E.2    Faloutsos, C.3
  • 50
    • 0019574432
    • On optimistic methods for concurrency control
    • Kung, H.T., Robinson, J.T.: On optimistic methods for concurrency control. ACM Trans. Database Syst. 6(2), 213–226 (1981)
    • (1981) ACM Trans. Database Syst. , vol.6 , Issue.2 , pp. 213-226
    • Kung, H.T.1    Robinson, J.T.2
  • 52
    • 84873155673
    • Stubby: a transformation-based optimizer for mapreduce workflows
    • Lim, H., Herodotou, H., Babu, S.: Stubby: a transformation-based optimizer for mapreduce workflows. PVLDB 5(11), 1196–1207 (2012)
    • (2012) PVLDB , vol.5 , Issue.11 , pp. 1196-1207
    • Lim, H.1    Herodotou, H.2    Babu, S.3
  • 53
    • 84863735533
    • Distributed graphlab: a framework for machine learning in the cloud
    • Low, Y., Gonzalez, J., Kyrola, A., Bickson, D., Guestrin, C., Hellerstein, J.M.: Distributed graphlab: a framework for machine learning in the cloud. PVLDB 5(8), 716–727 (2012)
    • (2012) PVLDB , vol.5 , Issue.8 , pp. 716-727
    • Low, Y.1    Gonzalez, J.2    Kyrola, A.3    Bickson, D.4    Guestrin, C.5    Hellerstein, J.M.6
  • 56
    • 84873171138
    • Rex: recursive, delta-based data-centric computation
    • Mihaylov, S.R., Ives, Z.G., Guha, S.: Rex: recursive, delta-based data-centric computation. PVLDB 5(11), 1280–1291 (2012)
    • (2012) PVLDB , vol.5 , Issue.11 , pp. 1280-1291
    • Mihaylov, S.R.1    Ives, Z.G.2    Guha, S.3
  • 58
    • 30344452311
    • Interpreting the data: parallel analysis with sawzall
    • Pike, R., Dorward, S., Griesemer, R., Quinlan, S.: Interpreting the data: parallel analysis with sawzall. Sci. Program. 13(4), 277–298 (2005)
    • (2005) Sci. Program. , vol.13 , Issue.4 , pp. 277-298
    • Pike, R.1    Dorward, S.2    Griesemer, R.3    Quinlan, S.4
  • 61
    • 84864252206
    • Exploiting common subexpressions for cloud query processing
    • Silva, Y.N., Larson, P.A., Zhou, J.: Exploiting common subexpressions for cloud query processing. In: ICDE, pp. 1337–1348 (2012)
    • (2012) ICDE , pp. 1337-1348
    • Silva, Y.N.1    Larson, P.A.2    Zhou, J.3
  • 65
    • 0025467711
    • A bridging model for parallel computation
    • Valiant, L.G.: A bridging model for parallel computation. Commun. ACM 33(8), 103–111 (1990)
    • (1990) Commun. ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 67
    • 74049096198
    • Nephele: efficient parallel data processing in the cloud
    • Warneke, D., Kao, O.: Nephele: efficient parallel data processing in the cloud. In: SC-MTAGS (2009)
    • (2009) SC-MTAGS
    • Warneke, D.1    Kao, O.2
  • 68
    • 79955523308
    • Exploiting dynamic resource allocation for efficient parallel data processing in the cloud
    • Warneke, D., Kao, O.: Exploiting dynamic resource allocation for efficient parallel data processing in the cloud. IEEE Trans. Parallel Distrib. Syst. 22(6), 985–997 (2011)
    • (2011) IEEE Trans. Parallel Distrib. Syst. , vol.22 , Issue.6 , pp. 985-997
    • Warneke, D.1    Kao, O.2
  • 69
    • 85076882757
    • Dryadlinq: a system for general-purpose distributed data-parallel computing using a high-level language
    • Yu, Y., Isard, M., Fetterly, D., Budiu, M., Erlingsson, Ú., Gunda, P.K., Currey, J.: Dryadlinq: a system for general-purpose distributed data-parallel computing using a high-level language. In: OSDI, pp. 1–14 (2008)
    • (2008) OSDI , pp. 1-14
    • Yu, Y.1    Isard, M.2    Fetterly, D.3    Budiu, M.4    Erlingsson, Ú.5    Gunda, P.K.6    Currey, J.7
  • 71
    • 85076643377
    • Optimizing data shuffling in data-parallel computation by understanding user-defined functions
    • Zhang, J., Zhou, H., Chen, R., Fan, X., Guo, Z., Lin, H., Li, J.Y., Lin, W., Zhou, J., Zhou, L.: Optimizing data shuffling in data-parallel computation by understanding user-defined functions. In: NSDI (2012)
    • (2012) NSDI
    • Zhang, J.1    Zhou, H.2    Chen, R.3    Fan, X.4    Guo, Z.5    Lin, H.6    Li, J.Y.7    Lin, W.8    Zhou, J.9    Zhou, L.10
  • 72
    • 84862647304
    • Advanced partitioning techniques for massively distributed computation
    • Zhou, J., Bruno, N., Lin, W.: Advanced partitioning techniques for massively distributed computation. In: SIGMOD Conference, pp. 13–24 (2012)
    • (2012) SIGMOD Conference , pp. 13-24
    • Zhou, J.1    Bruno, N.2    Lin, W.3
  • 73
    • 77952771965
    • Incorporating partitioning and parallel plans into the scope optimizer
    • Zhou, J., Larson, P.Å., Chaiken, R.: Incorporating partitioning and parallel plans into the scope optimizer. In: ICDE, pp. 1060–1071 (2010)
    • (2010) ICDE , pp. 1060-1071
    • Zhou, J.1    Larson, P.Å.2    Chaiken, R.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.