SCOPUS 정보 검색 플랫폼

Proceedings of the ACM SIGMOD International Conference on Management of Data

Volumn , Issue , 2013, Pages 13-24

Shark: SQL and rich analytics at scale

(6) Xin, Reynold S a Rosen, Josh a Zaharia, Matei a Franklin, Michael J a Shenker, Scott a Stoica, Ion a

a UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Data Warehouse; Databases; Hadoop; Machine Learning; Shark; Spark

Indexed keywords

COLUMN-ORIENTED; DATA ANALYSIS SYSTEM; DISTRIBUTED MEMORY; EXECUTION ENGINE; FAULT TOLERANCE PROPERTY; HADOOP; LEARNING PROGRAMS; SHARK;

DATA WAREHOUSES; DATABASE SYSTEMS; DIGITAL STORAGE; ELECTRIC SPARKS; FAULT TOLERANCE; LEARNING SYSTEMS; QUERY LANGUAGES;

ENGINES;

EID: 84880533620 PISSN: 07308078 EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2463676.2465288 Document Type: Conference Paper

Times cited : (333)

References (39)

1
- 84880551718
- https://github.com/cloudera/impala.

2
- 84880556207
- http://hadoop.apache.org/.

3
- 84880568960
- http://aws.amazon.com/elasticmapreduce/.

4
- 79957809015
- Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads
- A. Abouzeid et al. Hadoopdb: an architectural hybrid of mapreduce and dbms technologies for analytical workloads. VLDB, 2009.
- (2009) VLDB
- Abouzeid, A.¹

5
- 84891599477
- Re-optimizing data-parallel computing
- S. Agarwal et al. Re-optimizing data-parallel computing. In NSDI'12.
- NSDI'12
- Agarwal, S.¹

6
- 84919827070
- Pacman: Coordinated memory caching for parallel jobs
- G. Ananthanarayanan et al. Pacman: Coordinated memory caching for parallel jobs. In NSDI, 2012.
- (2012) NSDI
- Ananthanarayanan, G.¹

7
- 0039253775
- Eddies: Continuously adaptive query processing
- R. Avnur and J. M. Hellerstein. Eddies: continuously adaptive query processing. In SIGMOD, 2000.
- (2000) SIGMOD
- Avnur, R.¹ Hellerstein, J.M.²

8
- 77954942463
- Towards automatic optimization of mapreduce programs
- S. Babu. Towards automatic optimization of mapreduce programs. In SoCC'10.
- SoCC'10
- Babu, S.¹

9
- 79958269648
- Asterix: Towards a scalable, semistructured data platform for evolving-world models
- A. Behm et al. Asterix: towards a scalable, semistructured data platform for evolving-world models. Distributed and Parallel Databases, 29(3):185-216, 2011.
- (2011) Distributed and Parallel Databases , vol.29 , Issue.3 , pp. 185-216
- Behm, A.¹

10
- 79957872898
- Hyracks: A flexible and extensible foundation for data-intensive computing
- V. Borkar et al. Hyracks: A flexible and extensible foundation for data-intensive computing. In ICDE'11.
- ICDE'11
- Borkar, V.¹

11
- 79956351190
- HaLoop: Efficient iterative data processing on large clusters
- Y. Bu et al. HaLoop: efficient iterative data processing on large clusters. Proc. VLDB Endow., 2010.
- (2010) Proc. VLDB Endow.
- Bu, Y.¹

12
- 84860560293
- Scope: Easy and efficient parallel processing of massive data sets
- R. Chaiken et al. Scope: easy and efficient parallel processing of massive data sets. VLDB, 2008.
- (2008) VLDB
- Chaiken, R.¹

13
- 84862684677
- Tenzing a sql implementation on the mapreduce framework
- B. Chattopadhyay, et al Tenzing a sql implementation on the mapreduce framework. PVLDB, 4(12):1318-1327, 2011.
- (2011) PVLDB , vol.4 , Issue.12 , pp. 1318-1327
- Chattopadhyay, B.¹

14
- 79957812355
- Cheetah: A high performance, custom data warehouse on top of mapreduce
- S. Chen. Cheetah: a high performance, custom data warehouse on top of mapreduce. VLDB, 2010.
- (2010) VLDB
- Chen, S.¹

15
- 56049109090
- Map-reduce for machine learning on multicore
- C. Chu et al. Map-reduce for machine learning on multicore. Advances in neural information processing systems, 19:281, 2007.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 281
- Chu, C.¹

16
- 84868307166
- Mad skills: New analysis practices for big data
- J. Cohen, B. Dolan, M. Dunlap, J. Hellerstein, and C. Welton. Mad skills: new analysis practices for big data. VLDB, 2009.
- (2009) VLDB
- Cohen, J.¹ Dolan, B.² Dunlap, M.³ Hellerstein, J.⁴ Welton, C.⁵

17
- 85030321143
- MapReduce: Simplified data processing on large clusters
- J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In OSDI, 2004.
- (2004) OSDI
- Dean, J.¹ Ghemawat, S.²

18
- 84862644049
- Towards a unified architecture for in-rdbms analytics
- X. Feng et al. Towards a unified architecture for in-rdbms analytics. In SIGMOD, 2012.
- (2012) SIGMOD
- Feng, X.¹

19
- 80052587953
- Handling data skew in mapreduce
- B. Guffler et al. Handling data skew in mapreduce. In CLOSER'11.
- CLOSER'11
- Guffler, B.¹

20
- 84880566361
- Processing a trillion cells per mouse click
- A. Hall et al. Processing a trillion cells per mouse click. VLDB.
- VLDB
- Hall, A.¹

21
- 80053147709
- Mesos: A platform for fine-grained resource sharing in the data center
- B. Hindman et al. Mesos: A platform for fine-grained resource sharing in the data center. In NSDI'11.
- NSDI'11
- Hindman, B.¹

22
- 35448961922
- Dryad: Distributed data-parallel programs from sequential building blocks
- M. Isard et al. Dryad: distributed data-parallel programs from sequential building blocks. SIGOPS, 2007.
- (2007) SIGOPS
- Isard, M.¹

23
- 72249118633
- Quincy: Fair scheduling for distributed computing clusters
- M. Isard et al. Quincy: Fair scheduling for distributed computing clusters. In SOSP '09, 2009.
- (2009) SOSP '09
- Isard, M.¹

24
- 70849091519
- Distributed data-parallel computing using a high-level programming language
- M. Isard and Y. Yu. Distributed data-parallel computing using a high-level programming language. In SIGMOD, 2009.
- (2009) SIGMOD
- Isard, M.¹ Yu, Y.²

25
- 0032093823
- Efficient mid-query re-optimization of sub-optimal query execution plans
- N. Kabra and D. J. DeWitt. Efficient mid-query re-optimization of sub-optimal query execution plans. In SIGMOD, 1998.
- (1998) SIGMOD
- Kabra, N.¹ DeWitt, J.D.²

26
- 84862648481
- Skewtune: Mitigating skew in mapreduce applications
- Y. Kwon et al. Skewtune: mitigating skew in mapreduce applications. In SIGMOD '12, 2012.
- (2012) SIGMOD '12
- Kwon, Y.¹

27
- 84863735533
- Distributed graphlab: A framework for machine learning and data mining in the cloud
- Y. Low et al. Distributed graphlab: a framework for machine learning and data mining in the cloud. VLDB, 2012.
- (2012) VLDB
- Low, Y.¹

28
- 77954723629
- Pregel: A system for large-scale graph processing
- G. Malewicz et al. Pregel: a system for large-scale graph processing. In SIGMOD, 2010.
- (2010) SIGMOD
- Malewicz, G.¹

29
- 79958258284
- Dremel: Interactive analysis of web-scale datasets
- Sept
- S. Melnik et al. Dremel: interactive analysis of web-scale datasets. Proc. VLDB Endow., 3:330-339, Sept 2010.
- (2010) Proc. VLDB Endow. , vol.3 , pp. 330-339
- Melnik, S.¹

30
- 84989348963
- The case for tiny tasks in compute clusters
- K. Ousterhout et al. The case for tiny tasks in compute clusters. In HotOS'13.
- HotOS'13
- Ousterhout, K.¹

31
- 70350512695
- A comparison of approaches to large-scale data analysis
- A. Pavlo et al. A comparison of approaches to large-scale data analysis. In SIGMOD, 2009.
- (2009) SIGMOD
- Pavlo, A.¹

32
- 33745618477
- C-store: A column-oriented dbms
- M. Stonebraker et al. C-store: a column-oriented dbms. In VLDB'05.
- VLDB'05
- Stonebraker, M.¹

33
- 84880516136
- Mapreduce and parallel dbmss: Friends or foes?
- M. Stonebraker et al. Mapreduce and parallel dbmss: friends or foes? Commun. ACM.
- Commun. ACM
- Stonebraker, M.¹

34
- 77952775707
- Hive - A petabyte scale data warehouse using hadoop
- A. Thusoo et al. Hive - a petabyte scale data warehouse using hadoop. In ICDE, 2010.
- (2010) ICDE
- Thusoo, A.¹

35
- 84871009167
- TPC BENCHMARK H.
- Transaction Processing Performance Council. TPC BENCHMARK H.
- Transaction Processing Performance Council

36
- 0032093705
- Cost-based query scrambling for initial delays
- T. Urhan, M. J. Franklin, and L. Amsaleg. Cost-based query scrambling for initial delays. In SIGMOD, 1998.
- (1998) SIGMOD
- Urhan, T.¹ Franklin, M.J.² Amsaleg, L.³

37
- 77952779172
- Osprey: Implementing mapreduce-style fault tolerance in a shared-nothing distributed database
- C. Yang et al. Osprey: Implementing mapreduce-style fault tolerance in a shared-nothing distributed database. In ICDE, 2010.
- (2010) ICDE
- Yang, C.¹

38
- 77954636142
- Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling
- M. Zaharia et al. Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling. In EuroSys 10, 2010.
- (2010) EuroSys 10
- Zaharia, M.¹

39
- 85040175609
- Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing
- M. Zaharia et al. Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing. NSDI, 2012.
- (2012) NSDI
- Zaharia, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.