2012, Pages 691-702

Miss-correlation folding: Encoding per-block miss correlations in compressed DRAM for data prefetching

Author keywords

cache; compress; data parallel; miss correlation; prefetch; spatial; temporal

Indexed keywords

CACHE; COMPRESS; DATA PARALLEL; PREFETCHES; SPATIAL; TEMPORAL

EID: 84866864300     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2012.68     Document Type: Conference Paper
Times cited: 2

References (54)
  • 3
    • 34547676257
    • Interactions between compression and prefetching in chip multiprocessors
    • A. R. Alameldeen, and D. A. Wood, "Interactions between compression and prefetching in chip multiprocessors," in 13th HPCA, 2007.
    • (2007) 13th HPCA
    • Alameldeen, A.R.1    Wood, D.A.2
  • 5
    • 0026917364
    • Reducing memory latency via non-blocking and prefetching caches
    • T.-F. Chen, and J.-L. Baer, "Reducing memory latency via non-blocking and prefetching caches," in 5th ASPLOS, 1992.
    • (1992) 5th ASPLOS
    • Chen, T.-F.1    Baer, J.-L.2
  • 6
    • 0036038136
    • Dynamic hot data stream prefetching for general-purpose programs
    • T. M. Chilimbi, and M. Hirzel, "Dynamic hot data stream prefetching for general-purpose programs," in PLDI, 2002.
    • (2002) PLDI
    • Chilimbi, T.M.1    Hirzel, M.2
  • 7
    • 47349132413
    • Low-Cost Epoch-Based Correlation Prefetching for Commercial Applications
    • Y. Chou, "Low-Cost Epoch-Based Correlation Prefetching for Commercial Applications," in 40th Micro, 2007.
    • (2007) 40th Micro
    • Chou, Y.1
  • 8
    • 0027621679
    • Practical prefetching via data compression
    • K. Curewitz, P. Krishnan, and J. Vitter, "Practical prefetching via data compression," ACM SIGMOD Record, vol. 22, no. 2, pp. 266, 1993.
    • (1993) ACM SIGMOD Record , vol.22 , Issue.2 , pp. 266
    • Curewitz, K.1    Krishnan, P.2    Vitter, J.3
  • 9
    • 70450233836
    • Stream chaining: Exploiting multiple levels of correlation in data prefetching
    • P. Diaz, and M. Cintra, "Stream chaining: Exploiting multiple levels of correlation in data prefetching," in 36th ISCA, 2009.
    • (2009) 36th ISCA
    • Diaz, P.1    Cintra, M.2
  • 10
    • 27544435752
    • A Robust Main-Memory Compression Scheme
    • M. Ekman, and P. Stenstrom, "A Robust Main-Memory Compression Scheme," in 32nd ISCA, 2005.
    • (2005) 32nd ISCA
    • Ekman, M.1    Stenstrom, P.2
  • 11
    • 36949027123
    • Last-touch correlated data streaming
    • M. Ferdman, and B. Falsafi, "Last-touch correlated data streaming," in ISPASS, 2007.
    • (2007) ISPASS
    • Ferdman, M.1    Falsafi, B.2
  • 13
    • 77956977035
    • Stride directed prefetching in scalar processors
    • J. Fu, J. H. Patel, and B. L. Janssens, "Stride directed prefetching in scalar processors," in 25th MICRO, 1992.
    • (1992) 25th MICRO
    • Fu, J.1    Patel, J.H.2    Janssens, B.L.3
  • 15
    • 0035187053
    • Exploring the design space of future CMPs
    • J. Huh, D. Burger, and S. W. Keckler, "Exploring the design space of future CMPs," in PACT, 2001.
    • (2001) PACT
    • Huh, J.1    Burger, D.2    Keckler, S.W.3
  • 16
    • 0030677583
    • Prefetching using Markov predictors
    • D. Joseph, and D. Grunwald, "Prefetching using Markov predictors," in 24th ISCA, 1997.
    • (1997) 24th ISCA
    • Joseph, D.1    Grunwald, D.2
  • 17
    • 0025429331
    • Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
    • N. P. Jouppi, "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers," in 17th ISCA, 1990.
    • (1990) 17th ISCA
    • Jouppi, N.P.1
  • 18
    • 0036287598
    • Going the distance for TLB prefetching: An application-driven study
    • G. B. Kandiraju, and A. Sivasubramaniam, "Going the distance for TLB prefetching: An application-driven study," in 29th ISCA, 2002.
    • (2002) 29th ISCA
    • Kandiraju, G.B.1    Sivasubramaniam, A.2
  • 19
    • 70450235471
    • Architecting phase change memory as a scalable DRAM alternative
    • B. C. Lee, E. Ipek, O. Mutlu, and D. Burger, "Architecting phase change memory as a scalable DRAM alternative," in 36th ISCA, 2009.
    • (2009) 36th ISCA
    • Lee, B.C.1    Ipek, E.2    Mutlu, O.3    Burger, D.4
  • 21
    • 77954995885
    • Debunking the 100x GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU
    • V. Lee, C. Kim, J. Chhugani, M. Deisher, D. Kim et al., "Debunking the 100x GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU," in 37th ISCA, 2010.
    • (2010) 37th ISCA
    • Lee, V.1    Kim, C.2    Chhugani, J.3    Deisher, M.4    Kim, D.5
  • 22
    • 79960875021
    • Emerging Applications for Multi/Many-Core Processor
    • V. Lee, Y. Chen, and P. Dubey, "Emerging Applications for Multi/Many-Core Processor," in 38th ISCA, 2011.
    • (2011) 38th ISCA
    • Lee, V.1    Chen, Y.2    Dubey, P.3
  • 24
    • 33748870886
    • Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset
    • M. Martin, D. Sorin, B. Beckmann, M. Marty, M. Xu et al., "Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset," ACM SIGARCH Computer Architecture News, vol. 33, no. 4, pp. 92-99, 2005.
    • (2005) ACM SIGARCH Computer Architecture News , vol.33 , Issue.4 , pp. 92-99
    • Martin, M.1    Sorin, D.2    Beckmann, B.3    Marty, M.4    Xu, M.5
  • 27
    • 2342644731
    • Data cache prefetching using a Global History Buffer
    • K. J. Nesbit, and J. E. Smith, "Data cache prefetching using a Global History Buffer," in 10th HPCA, 2004.
    • (2004) 10th HPCA
    • Nesbit, K.J.1    Smith, J.E.2
  • 28
    • 84866840089
    • OpenMP, http://www.openmp.org/.
  • 30
    • 70450273507
    • Scalable high performance main memory system using phase-change memory technology
    • M. K. Qureshi, V. Srinivasan, and J. A. Rivers, "Scalable high performance main memory system using phase-change memory technology," in 36th ISCA, 2009.
    • (2009) 36th ISCA
    • Qureshi, M.K.1    Srinivasan, V.2    Rivers, J.A.3
  • 33
    • 70450077484
    • Designing efficient sorting algorithms for manycore gpus
    • N. Satish, M. Harris, and M. Garland, "Designing efficient sorting algorithms for manycore gpus," in IPDPS, 2009.
    • (2009) IPDPS
    • Satish, N.1    Harris, M.2    Garland, M.3
  • 34
    • 33847108092
    • Coterminous locality and coterminous group data prefetching on chip-multiprocessors
    • X. Shi, Z. Yang, J.-K. Peir, L. Peng, Y.-K. Chen et al., "Coterminous locality and coterminous group data prefetching on chip-multiprocessors," in 20th IPDPS, 2006.
    • (2006) 20th IPDPS
    • Shi, X.1    Yang, Z.2    Peir, J.-K.3    Peng, L.4    Chen, Y.-K.5
  • 35
    • 0036296856
    • Using a user-level memory thread for correlation prefetching
    • Y. Solihin, J. Lee, and J. Torrellas, "Using a user-level memory thread for correlation prefetching," in 29th ISCA, 2002.
    • (2002) 29th ISCA
    • Solihin, Y.1    Lee, J.2    Torrellas, J.3
  • 38
    • 34547655822
    • Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers
    • S. Srinath, O. Mutlu, H. Kim, and Y. N. Patt, "Feedback directed prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers," in 13th HPCA, 2007.
    • (2007) 13th HPCA
    • Srinath, S.1    Mutlu, O.2    Kim, H.3    Patt, Y.N.4
  • 40
    • 84866851700
    • POSIX thread, https://computing.llnl.gov/tutorials/pthreads/.
    • POSIX Thread
  • 47
    • 0038364440
    • Frequent value locality and its applications
    • J. Yang, and R. Gupta, "Frequent value locality and its applications," ACM Trans. on Embedded Computing Systems, vol. 1, no. 1, pp. 79-105, 2002.
    • (2002) ACM Trans. on Embedded Computing Systems , vol.1 , Issue.1 , pp. 79-105
    • Yang, J.1    Gupta, R.2
  • 49
    • 36949014308
    • PTLsim: A Cycle Accurate Full System x86-64 Microarchitectural Simulator
    • M. T. Yourst, "PTLsim: A Cycle Accurate Full System x86-64 Microarchitectural Simulator," in ISPASS, 2007.
    • (2007) ISPASS
    • Yourst, M.T.1
  • 50
    • 0034443222
    • Frequent value locality and value-centric data cache design
    • Y. Zhang, J. Yang, and R. Gupta, "Frequent value locality and value-centric data cache design," in 9th ASPLOS, 2000.
    • (2000) 9th ASPLOS
    • Zhang, Y.1    Yang, J.2    Gupta, R.3
  • 52
    • 0012525243
    • Benchmark health considered harmful
    • C. Zilles, "Benchmark health considered harmful," ACM SIGARCH Computer Architecture News, vol. 29, no. 3, pp. 4-5, 2001.
    • (2001) ACM SIGARCH Computer Architecture News , vol.29 , Issue.3 , pp. 4-5
    • Zilles, C.1
  • 53
    • 0017493286
    • A universal algorithm for sequential data compression
    • J. Ziv, and A. Lempel, "A universal algorithm for sequential data compression," IEEE Transactions on Information Theory, vol. 23, no. 3, pp. 337-343, 1977.
    • (1977) IEEE Transactions on Information Theory , vol.23 , Issue.3 , pp. 337-343
    • Ziv, J.1    Lempel, A.2
  • 54
    • 0018019231
    • Compression of individual sequences via variable-rate coding
    • J. Ziv, and A. Lempel, "Compression of individual sequences via variable-rate coding," IEEE Transactions on Information Theory, vol. 24, no. 5, pp. 530-536, 1978.
    • (1978) IEEE Transactions on Information Theory , vol.24 , Issue.5 , pp. 530-536
    • Ziv, J.1    Lempel, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.