메뉴 건너뛰기




Volumn , Issue , 2012, Pages 100-109

A storage-centric analysis of MapReduce workloads: File popularity, temporal locality and arrival patterns

Author keywords

Access patterns; Big Data; HDFS; MapReduce

Indexed keywords

DATA HANDLING; DIGITAL STORAGE;

EID: 84873453654     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IISWC.2012.6402909     Document Type: Conference Paper
Times cited : (56)

References (19)
  • 1
    • 85030321143 scopus 로고    scopus 로고
    • Mapreduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat, "MapReduce: Simplified data processing on large clusters," in Proc. USENIX OSDI, 2004, pp. 137-150.
    • (2004) Proc. USENIX OSDI , pp. 137-150
    • Dean, J.1    Ghemawat, S.2
  • 2
    • 85060261029 scopus 로고    scopus 로고
    • "Apache Hadoop," Jun. 2011, http://hadoop.apache.org.
    • "Apache Hadoop," Jun. 2011, http://hadoop.apache.org.
  • 3
    • 85060261974 scopus 로고    scopus 로고
    • K. Shvachko, H. Kuang, S. Radia, and R. Chansler, "The Hadoop Distributed File System," in MSST2010.
    • K. Shvachko, H. Kuang, S. Radia, and R. Chansler, "The Hadoop Distributed File System," in MSST2010.
  • 4
    • 84874278017 scopus 로고    scopus 로고
    • C. Abad, H. Luu, N. Roberts, K. Lee, Y. Lu, and R. Campbell, "Metadata traces and workload models for evaluating Big storage systems," in Proc. IEEE UCC, 2012.
    • C. Abad, H. Luu, N. Roberts, K. Lee, Y. Lu, and R. Campbell, "Metadata traces and workload models for evaluating Big storage systems," in Proc. IEEE UCC, 2012.
  • 5
    • 65549085067 scopus 로고    scopus 로고
    • Power-law distributions in empirical data
    • Nov.
    • A. Clauset, C. R. Shalizi, and M. Newman, "Power-law distributions in empirical data," SIAM Rev., vol. 51, no. 4, Nov. 2009.
    • (2009) SIAM Rev. , vol.51 , Issue.4
    • Clauset, A.1    Shalizi, C.R.2    Newman, M.3
  • 6
    • 7244260720 scopus 로고    scopus 로고
    • Analysis of enterprise media server workloads: Access patterns, locality, content evolution, and rates of change
    • L. Cherkasova and M. Gupta, "Analysis of enterprise media server workloads: Access patterns, locality, content evolution, and rates of change," IEEE/ACM Trans. Netw., vol. 12, no. 5, 2004.
    • (2004) IEEE/ACM Trans. Netw. , vol.12 , Issue.5
    • Cherkasova, L.1    Gupta, M.2
  • 7
    • 79955970532 scopus 로고    scopus 로고
    • G. Ananthanarayanan, S. Agarwal, S. Kandula, A. Greenberg, I. Stoica, D. Harlan, and E. Harris, "Scarlett: Coping with skewed popularity content in MapReduce clusters," in Proc. EuroSys, 2011.
    • G. Ananthanarayanan, S. Agarwal, S. Kandula, A. Greenberg, I. Stoica, D. Harlan, and E. Harris, "Scarlett: Coping with skewed popularity content in MapReduce clusters," in Proc. EuroSys, 2011.
  • 8
    • 77950477592 scopus 로고    scopus 로고
    • Diskreduce: Raid for data-intensive scalable computing
    • B. Fan, W. Tantisiriroj, L. Xiao, and G. Gibson, "DiskReduce: RAID for data-intensive scalable computing," in Proc. PDSW, 2009, pp. 6-10.
    • (2009) Proc. PDSW , pp. 6-10
    • Fan, B.1    Tantisiriroj, W.2    Xiao, L.3    Gibson, G.4
  • 9
    • 80955123462 scopus 로고    scopus 로고
    • C. Abad, Y. Lu, and R. Campbell, "DARE: Adaptive data replication for efficient cluster scheduling," in Proc. CLUSTER, 2011.
    • C. Abad, Y. Lu, and R. Campbell, "DARE: Adaptive data replication for efficient cluster scheduling," in Proc. CLUSTER, 2011.
  • 11
    • 0031383380 scopus 로고    scopus 로고
    • Self-similarity in world wide web traffic: Evidence and possible causes
    • M. E. Crovella and A. Bestavros, "Self-similarity in World Wide Web traffic: Evidence and possible causes," IEEE/ACM Trans. on Netw., vol. 5, no. 6, 1997.
    • (1997) IEEE/ACM Trans. on Netw. , vol.5 , Issue.6
    • Crovella, M.E.1    Bestavros, A.2
  • 12
    • 0030384024 scopus 로고    scopus 로고
    • K. Park, G. Kim, and M. Crovella, "On the relationship between file sizes, transport protocols, and self-similar network traffic," in Proc. ICNP, 1996.
    • K. Park, G. Kim, and M. Crovella, "On the relationship between file sizes, transport protocols, and self-similar network traffic," in Proc. ICNP, 1996.
  • 13
    • 70549101150 scopus 로고    scopus 로고
    • Trace data characterization and fitting for markov modeling
    • G. Casale, E. Z. Zhang, and E. Smirni, "Trace data characterization and fitting for Markov modeling," Perform. Eval., vol. 67, no. 2, 2010.
    • (2010) Perform. Eval. , vol.67 , Issue.2
    • Casale, G.1    Zhang, E.Z.2    Smirni, E.3
  • 14
    • 0032670943 scopus 로고    scopus 로고
    • L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, "Web caching and Zipf-like distributions: Evidence and implications," in Proc. INFOCOM, 1999.
    • L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, "Web caching and Zipf-like distributions: Evidence and implications," in Proc. INFOCOM, 1999.
  • 15
    • 82655182943 scopus 로고    scopus 로고
    • Y. Chen, K. Srinivasan, G. Goodson, and R. Katz, "Design implications for enterprise storage systems via multi-dimensional trace analysis," in Proc. SOSP, 2011.
    • Y. Chen, K. Srinivasan, G. Goodson, and R. Katz, "Design implications for enterprise storage systems via multi-dimensional trace analysis," in Proc. SOSP, 2011.
  • 16
    • 34548785494 scopus 로고    scopus 로고
    • H. Li and L. Wolters, "Towards a better understanding of workload dynamics on data-intensive clusters and grids," in Proc. IPDPS, 2007.
    • H. Li and L. Wolters, "Towards a better understanding of workload dynamics on data-intensive clusters and grids," in Proc. IPDPS, 2007.
  • 17
    • 80053019024 scopus 로고    scopus 로고
    • Y. Chen, A. Ganapathi, R. Griffith, and R. Katz, "The case for evaluating MapReduce performance using workload suites," in Proc. MASCOTS, 2011.
    • Y. Chen, A. Ganapathi, R. Griffith, and R. Katz, "The case for evaluating MapReduce performance using workload suites," in Proc. MASCOTS, 2011.
  • 18
    • 84873134968 scopus 로고    scopus 로고
    • Y. Chen, S. Alspaugh, and R. Katz, "Interactive query processing in Big Data systems: A cross-industry study of MapReduce workloads," in Proc. VLDB, 2012.
    • Y. Chen, S. Alspaugh, and R. Katz, "Interactive query processing in Big Data systems: A cross-industry study of MapReduce workloads," in Proc. VLDB, 2012.
  • 19
    • 84884469245 scopus 로고    scopus 로고
    • S. Patil, G. Gibson, G. Ganger, J. Lopez, M. Polte, W. Tantisiroj, and L. Xiao, "In search of an API for scalable file systems: Under the table or above it?" in Proc. USENIX HotCloud, 2009.
    • S. Patil, G. Gibson, G. Ganger, J. Lopez, M. Polte, W. Tantisiroj, and L. Xiao, "In search of an API for scalable file systems: Under the table or above it?" in Proc. USENIX HotCloud, 2009.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.