메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1102-1111

Towards scalable one-pass analytics using MapReduce

Author keywords

Data streams; MapReduce; Parallel data processing; Performance analysis

Indexed keywords

ANALYTICAL QUERIES; ARCHITECTURAL CHANGES; COMPUTATION MODEL; CURRENT LIMITATION; DATA SETS; DATA STREAM; DATA-INTENSIVE APPLICATION; DESIGN REQUIREMENTS; EMPIRICAL PERFORMANCE; EXISTING SYSTEMS; INITIAL DESIGN; INTEGRAL PART; LARGE DATASETS; MAP-REDUCE; ONE-PASS; ONLINE AGGREGATIONS; PARALLEL DATA PROCESSING; PERFORMANCE ANALYSIS; PROGRAMMING MODELS; STREAM PROCESSING;

EID: 83455229796     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2011.251     Document Type: Conference Paper
Times cited : (8)

References (36)
  • 1
    • 34547285007 scopus 로고    scopus 로고
    • The design of the borealis stream processing engine
    • D. J. Abadi, Y. Ahmad, et al. The design of the Borealis stream processing engine. In CIDR, 277-289, 2005.
    • (2005) CIDR , pp. 277-289
    • Abadi, D.J.1    Ahmad, Y.2
  • 2
    • 79957809015 scopus 로고    scopus 로고
    • Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads
    • A. Abouzeid, K. Bajda-Pawlikowski, et al. Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads. PVLDB, 2(1):922-933, 2009.
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 922-933
    • Abouzeid, A.1    Bajda-Pawlikowski, K.2
  • 3
    • 33745289988 scopus 로고    scopus 로고
    • The CQL continuous query language: Semantic foundations and query execution
    • A. Arasu, et al. The CQL continuous query language: semantic foundations and query execution. VLDB J., 15(2): 121-142, 2006.
    • (2006) VLDB J. , vol.15 , Issue.2 , pp. 121-142
    • Arasu, A.1
  • 4
    • 79957872898 scopus 로고    scopus 로고
    • Hyracks: A flexible and extensible foundation for data-intensive computing
    • V. Borkar, M. Carey, et al. Hyracks: a flexible and extensible foundation for data-intensive computing. In ICDE, 2011.
    • (2011) ICDE
    • Borkar, V.1    Carey, M.2
  • 5
    • 0345206134 scopus 로고    scopus 로고
    • TelegraphCQ: Continuous dataflow processing for an uncertain world
    • S. Chandrasekaran, O. Cooper, et al. TelegraphCQ: Continuous dataflow processing for an uncertain world. In CIDR, 2003.
    • (2003) CIDR
    • Chandrasekaran, S.1    Cooper, O.2
  • 6
    • 77954780115 scopus 로고    scopus 로고
    • Mapreduce online
    • T. Condie, N. Conway, et al. Mapreduce online. In NSDI, 21-21, 2010.
    • (2010) NSDI , pp. 21-21
    • Condie, T.1    Conway, N.2
  • 7
    • 85030321143 scopus 로고    scopus 로고
    • Mapreduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. In OSDI, 10-10, 2004.
    • (2004) OSDI , pp. 10-10
    • Dean, J.1    Ghemawat, S.2
  • 8
    • 0026870271 scopus 로고
    • Parallel database systems: The future of high performance database systems
    • D. DeWitt and J. Gray. Parallel database systems: the future of high performance database systems. Commun. ACM, 35(6):85-98, 1992.
    • (1992) Commun. ACM , vol.35 , Issue.6 , pp. 85-98
    • DeWitt, D.1    Gray, J.2
  • 9
    • 0022821549 scopus 로고
    • Gamma - A high performance dataflow database machine
    • D. J. DeWitt, R. H. Gerber, et al. Gamma - a high performance dataflow database machine. In VLDB, 228-237, 1986.
    • (1986) VLDB , pp. 228-237
    • DeWitt, D.J.1    Gerber, R.H.2
  • 10
    • 77954889500 scopus 로고    scopus 로고
    • Sql/mapreduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions
    • E. Friedman, P. Pawlowski, et al. Sql/mapreduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions. PVLDB, 2(2):1402-1413, 2009.
    • (2009) PVLDB , vol.2 , Issue.2 , pp. 1402-1413
    • Friedman, E.1    Pawlowski, P.2
  • 11
    • 55649105542 scopus 로고    scopus 로고
    • SPADE: The system S declarative stream processing engine
    • B. Gedik, H. Andrade, et al. SPADE: The System S declarative stream processing engine. In SIGMOD, 2008.
    • (2008) SIGMOD
    • Gedik, B.1    Andrade, H.2
  • 12
    • 80053488113 scopus 로고    scopus 로고
    • Nectar: Automatic management of data and computation in data centers
    • P. K. Gunda, L. Ravindranath, et al. Nectar: automatic management of data and computation in data centers. In OSDI, 2010.
    • (2010) OSDI
    • Gunda, P.K.1    Ravindranath, L.2
  • 13
    • 0346501106 scopus 로고    scopus 로고
    • Ripple joins for online aggregation
    • P. J. Haas and J. M. Hellerstein. Ripple joins for online aggregation. In SIGMOD, 287-298, 1999.
    • (1999) SIGMOD , pp. 287-298
    • Haas, P.J.1    Hellerstein, J.M.2
  • 14
    • 84875152530 scopus 로고    scopus 로고
    • Hive project. http://hadoop.apache.org/hive/.
    • Hive Project
  • 15
    • 34548041192 scopus 로고    scopus 로고
    • Dryad: Distributed data-parallel programs from sequential building blocks
    • M. Isard, M. Budiu, et al. Dryad: distributed data-parallel programs from sequential building blocks. In EuroSys, 59-72, 2007.
    • (2007) EuroSys , pp. 59-72
    • Isard, M.1    Budiu, M.2
  • 16
    • 29844455629 scopus 로고    scopus 로고
    • A disk-based join with probabilistic guarantees
    • C. Jermaine, A. Dobra, et al. A disk-based join with probabilistic guarantees. In SIGMOD, 563-574, 2005.
    • (2005) SIGMOD , pp. 563-574
    • Jermaine, C.1    Dobra, A.2
  • 17
    • 81055143288 scopus 로고    scopus 로고
    • The performance of MapReduce: An in-depth study
    • D. Jiang, B.C. Ooi, et al. The Performance of MapReduce: An In-depth Study. In VLDB, 472-483, 2010.
    • (2010) VLDB , pp. 472-483
    • Jiang, D.1    Ooi, B.C.2
  • 18
    • 77952270709 scopus 로고    scopus 로고
    • DEDUCE: At the intersection of MapReduce and stream processing
    • V. Kumar, H. Andrade, et al. DEDUCE: at the intersection of MapReduce and stream processing. In EDBT, 657-662, 2010.
    • (2010) EDBT , pp. 657-662
    • Kumar, V.1    Andrade, H.2
  • 19
    • 77954913432 scopus 로고    scopus 로고
    • Skew-resistant parallel processing of feature-extracting scientific user-defined functions
    • Y. Kwon, M. Balazinska, et al. Skew-resistant parallel processing of feature-extracting scientific user-defined functions. In SoCC, 2010.
    • (2010) SoCC
    • Kwon, Y.1    Balazinska, M.2
  • 21
    • 77954732118 scopus 로고    scopus 로고
    • Paratimer: A progress indicator for mapreduce dags
    • K. Morton, M. Balazinska, et al. Paratimer: a progress indicator for mapreduce dags. In SIGMOD, 507-518, 2010.
    • (2010) SIGMOD , pp. 507-518
    • Morton, K.1    Balazinska, M.2
  • 22
    • 79959990624 scopus 로고    scopus 로고
    • S4: Distributed stream computing platform
    • L. Neumeyer, B. Robbins, et al. S4: distributed stream computing platform. In KDCloud, 2010.
    • (2010) KDCloud
    • Neumeyer, L.1    Robbins, B.2
  • 23
    • 55349148888 scopus 로고    scopus 로고
    • Pig latin: A not-so-foreign language for data processing
    • C. Olston, B. Reed, et al. Pig latin: a not-so-foreign language for data processing. In SIGMOD, 1099-1110, 2008.
    • (2008) SIGMOD , pp. 1099-1110
    • Olston, C.1    Reed, B.2
  • 24
    • 70350512695 scopus 로고    scopus 로고
    • A comparison of approaches to large-scale data analysis
    • A. Pavlo, E. Paulson, et al. A comparison of approaches to large-scale data analysis. In SIGMOD, 165-178, 2009.
    • (2009) SIGMOD , pp. 165-178
    • Pavlo, A.1    Paulson, E.2
  • 25
    • 84976660052 scopus 로고
    • Join processing in database systems with large main memories
    • L. D. Shapiro. Join processing in database systems with large main memories. ACM Trans. Database Syst., 11(3):239-264, 1986.
    • (1986) ACM Trans. Database Syst. , vol.11 , Issue.3 , pp. 239-264
    • Shapiro, L.D.1
  • 26
    • 85040940552 scopus 로고    scopus 로고
    • A universal calculus for stream processing languages
    • R. Soulé, M. Hirzel, et al. A universal calculus for stream processing languages. In ESOP, 2010.
    • (2010) ESOP
    • Soulé, R.1    Hirzel, M.2
  • 27
    • 84959045524 scopus 로고    scopus 로고
    • Stream it: A language for streaming applications
    • W. Thies, M. Karczmarek, et al. Stream It: a language for streaming applications. In CC, 179-196, 2002.
    • (2002) CC , pp. 179-196
    • Thies, W.1    Karczmarek, M.2
  • 28
    • 31844447979 scopus 로고    scopus 로고
    • Teleport messaging for distributed stream programs
    • W. Thies, M. Karczmarek, et al. Teleport messaging for distributed stream programs. In PPoPP, 2005.
    • (2005) PPoPP
    • Thies, W.1    Karczmarek, M.2
  • 29
    • 78149262760 scopus 로고    scopus 로고
    • An empirical characterization of stream programs and its implications for language and compiler design
    • W. Thies and S. Amarasinghe. An empirical characterization of stream programs and its implications for language and compiler design. In PACT, 2010.
    • (2010) PACT
    • Thies, W.1    Amarasinghe, S.2
  • 30
    • 84868325513 scopus 로고    scopus 로고
    • Hive - A warehousing solution over a map-reduce framework
    • A. Thusoo, J. S. Sarma, et al. Hive - a warehousing solution over a map-reduce framework. PVLDB, 2(2):1626-1629, 2009.
    • (2009) PVLDB , vol.2 , Issue.2 , pp. 1626-1629
    • Thusoo, A.1    Sarma, J.S.2
  • 31
    • 77957862745 scopus 로고    scopus 로고
    • Tuple routing strategies for distributed eddies
    • F. Tian and D. J. DeWitt. Tuple routing strategies for distributed eddies. In VLDB, 333-344, 2003.
    • (2003) VLDB , pp. 333-344
    • Tian, F.1    DeWitt, D.J.2
  • 32
    • 70349110801 scopus 로고    scopus 로고
    • Scalable stream join processing with expensive predicates: Workload distribution and adaptation by time-slicing
    • S. Wang and E. Rundensteiner. Scalable stream join processing with expensive predicates: workload distribution and adaptation by time-slicing. In EDBT, 299-310, 2009.
    • (2009) EDBT , pp. 299-310
    • Wang, S.1    Rundensteiner, E.2
  • 34
    • 72249089011 scopus 로고    scopus 로고
    • Distributed aggregation for data-parallel computing: Interfaces and implementations
    • Y. Yu, P. K. Gunda, et al. Distributed aggregation for data-parallel computing: interfaces and implementations. In SOSP, 247-260, 2009.
    • (2009) SOSP , pp. 247-260
    • Yu, Y.1    Gunda, P.K.2
  • 35
    • 70350582319 scopus 로고    scopus 로고
    • Improving MapReduce performance in heterogeneous environments
    • M. Zaharia, A. Konwinski, et al. Improving MapReduce Performance in Heterogeneous Environments. In OSDI, 2008.
    • (2008) OSDI
    • Zaharia, M.1    Konwinski, A.2
  • 36
    • 80051885561 scopus 로고    scopus 로고
    • From a stream of relational queries to distributed stream processing
    • Q. Zou, H. Wang, et al. From a stream of relational queries to distributed stream processing. In VLDB, 2010.
    • (2010) VLDB
    • Zou, Q.1    Wang, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.