메뉴 건너뛰기




Volumn 2016-April, Issue , 2016, Pages 225-236

Modeling cache performance beyond LRU

Author keywords

[No Author keywords available]

Indexed keywords

BUFFER STORAGE; COMPUTER ARCHITECTURE; FORECASTING; QUALITY OF SERVICE; SUPERCOMPUTERS;

EID: 84965025544     PISSN: 15300897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HPCA.2016.7446067     Document Type: Conference Paper
Times cited : (49)

References (57)
  • 2
    • 0002991043 scopus 로고    scopus 로고
    • Selective cache ways: On-demand cache resource allocation
    • D. H. Albonesi, "Selective cache ways: On-demand cache resource allocation," in MICRO-32, 1999.
    • (1999) MICRO-32
    • Albonesi, D.H.1
  • 3
    • 84887440618 scopus 로고    scopus 로고
    • Jigsaw: Scalable software-defined caches
    • N. Beckmann and D. Sanchez, "Jigsaw: Scalable software-defined caches," in PACT-22, 2013.
    • (2013) PACT , vol.22
    • Beckmann, N.1    Sanchez, D.2
  • 5
    • 84934268669 scopus 로고    scopus 로고
    • Talus: A simple way to remove cliffs in cache performance
    • N. Beckmann and D. Sanchez, "Talus: A simple way to remove cliffs in cache performance," in HPCA-21, 2015.
    • (2015) HPCA , vol.21
    • Beckmann, N.1    Sanchez, D.2
  • 6
    • 84965055257 scopus 로고    scopus 로고
    • Cache calculus: Modeling caches through differential equations
    • N. Beckmann and D. Sanchez, "Cache calculus: Modeling caches through differential equations," Computer Architecture Letters, 2016.
    • (2016) Computer Architecture Letters
    • Beckmann, N.1    Sanchez, D.2
  • 7
    • 84934297423 scopus 로고    scopus 로고
    • Scaling distributed cache hierarchies through computation and data co-scheduling
    • N. Beckmann, P.-A. Tsai, and D. Sanchez, "Scaling distributed cache hierarchies through computation and data co-scheduling," in HPCA-21, 2015.
    • (2015) HPCA , vol.21
    • Beckmann, N.1    Tsai, P.-A.2    Sanchez, D.3
  • 8
    • 0003003638 scopus 로고
    • A study of replacement algorithms for a virtual-storage computer
    • L. A. Belady, "A study of replacement algorithms for a virtual-storage computer," IBM Sys. J., vol. 5, no. 2, 1966.
    • (1966) IBM Sys. J. , vol.5 , Issue.2
    • Belady, L.A.1
  • 9
    • 2642534571 scopus 로고    scopus 로고
    • StatCache: A probabilistic approach to efficient and accurate data locality analysis
    • E. Berg and E. Hagersten, "StatCache: A probabilistic approach to efficient and accurate data locality analysis," in ISPASS, 2004.
    • (2004) ISPASS
    • Berg, E.1    Hagersten, E.2
  • 10
    • 0013540431 scopus 로고
    • Universal classes of hash functions (extended abstract)
    • J. L. Carter and M. N. Wegman, "Universal classes of hash functions (extended abstract)," in Proc. STOC-9, 1977.
    • (1977) Proc. STOC , vol.9
    • Carter, J.L.1    Wegman, M.N.2
  • 11
    • 0033683314 scopus 로고    scopus 로고
    • Application-specific memory management for embedded systems using software-controlled caches
    • D. Chiou, P. Jain, L. Rudolph et al., "Application-specific memory management for embedded systems using software-controlled caches," in DAC-37, 2000.
    • (2000) DAC-37
    • Chiou, D.1    Jain, P.2    Rudolph, L.3
  • 12
    • 84881160871 scopus 로고    scopus 로고
    • A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness
    • H. Cook, M. Moreto, S. Bird et al., "A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness," in ISCA-40, 2013.
    • (2013) ISCA-40
    • Cook, H.1    Moreto, M.2    Bird, S.3
  • 13
    • 84946572419 scopus 로고    scopus 로고
    • Reuse distance-based probabilistic cache replacement
    • S. Das, T. M. Aamodt, and W. J. Dally, "Reuse distance-based probabilistic cache replacement," ACM TACO, vol. 12, no. 4, 2015.
    • (2015) ACM TACO , vol.12 , Issue.4
    • Das, S.1    Aamodt, T.M.2    Dally, W.J.3
  • 14
    • 84875649537 scopus 로고    scopus 로고
    • Paragon: Qos-aware scheduling for heterogeneous datacenters
    • C. Delimitrou and C. Kozyrakis, "Paragon: Qos-aware scheduling for heterogeneous datacenters," in ASPLOS-XVIII, 2013.
    • (2013) ASPLOS-XVIII
    • Delimitrou, C.1    Kozyrakis, C.2
  • 15
    • 0038716440 scopus 로고    scopus 로고
    • Predicting whole-program locality through reuse distance analysis
    • C. Ding and Y. Zhong, "Predicting whole-program locality through reuse distance analysis," in PLDI, 2003.
    • (2003) PLDI
    • Ding, C.1    Zhong, Y.2
  • 16
    • 84876561848 scopus 로고    scopus 로고
    • Improving cache management policies using dynamic reuse distances
    • N. Duong, D. Zhao, T. Kim et al., "Improving cache management policies using dynamic reuse distances," in MICRO-45, 2012.
    • (2012) MICRO-45
    • Duong, N.1    Zhao, D.2    Kim, T.3
  • 17
    • 77952570425 scopus 로고    scopus 로고
    • StatStack: Efficient modeling of LRU caches
    • D. Eklov and E. Hagersten, "StatStack: Efficient modeling of LRU caches," in ISPASS, 2010.
    • (2010) ISPASS
    • Eklov, D.1    Hagersten, E.2
  • 18
    • 47249094055 scopus 로고    scopus 로고
    • System-level performance metrics for multiprogram workloads
    • S. Eyerman and L. Eeckhout, "System-level performance metrics for multiprogram workloads," IEEE Micro, vol. 28, no. 3, 2008.
    • (2008) IEEE Micro , vol.28 , Issue.3
    • Eyerman, S.1    Eeckhout, L.2
  • 20
    • 47349085427 scopus 로고    scopus 로고
    • A framework for providing quality of service in chip multi-processors
    • F. Guo, Y. Solihin, L. Zhao et al., "A framework for providing quality of service in chip multi-processors," in MICRO-40, 2007.
    • (2007) MICRO-40
    • Guo, F.1    Solihin, Y.2    Zhao, L.3
  • 21
    • 84910129119 scopus 로고    scopus 로고
    • FIESTA: A sample-balanced multi-program workload methodology
    • A. Hilton, N. Eswaran, and A. Roth, "FIESTA: A sample-balanced multi-program workload methodology," in Proc. MoBS, 2009.
    • (2009) Proc. MoBS
    • Hilton, A.1    Eswaran, N.2    Roth, A.3
  • 22
    • 84875858098 scopus 로고    scopus 로고
    • Power management of the third generation Intel Core micro architecture formerly codenamed Ivy Bridge
    • S. Jahagirdar, V. George, I. Sodhi et al., "Power management of the third generation Intel Core micro architecture formerly codenamed Ivy Bridge," in Hot Chips, 2012.
    • (2012) Hot Chips
    • Jahagirdar, S.1    George, V.2    Sodhi, I.3
  • 23
    • 77954998134 scopus 로고    scopus 로고
    • High performance cache replacement using re-reference interval prediction
    • A. Jaleel, K. Theobald, S. Steely et al., "High performance cache replacement using re-reference interval prediction," in ISCA-37, 2010.
    • (2010) ISCA-37
    • Jaleel, A.1    Theobald, K.2    Steely, S.3
  • 24
    • 35448939594 scopus 로고    scopus 로고
    • Iterative methods to nonlinear equations
    • M. Javidi, "Iterative methods to nonlinear equations," Applied Mathematics and Computation, vol. 193, no. 2, 2007.
    • (2007) Applied Mathematics and Computation , vol.193 , Issue.2
    • Javidi, M.1
  • 25
    • 79953124483 scopus 로고    scopus 로고
    • Inter-core prefetching for multicore processors using migrating helper threads
    • M. Kamruzzaman, S. Swanson, and D. M. Tullsen, "Inter-core prefetching for multicore processors using migrating helper threads," in ASPLOS-XVI, 2011.
    • (2011) ASPLOS-XVI
    • Kamruzzaman, M.1    Swanson, S.2    Tullsen, D.M.3
  • 26
    • 84897791436 scopus 로고    scopus 로고
    • Ubik: Efficient cache sharing with strict QoS for latency-critical workloads
    • H. Kasture and D. Sanchez, "Ubik: Efficient cache sharing with strict QoS for latency-critical workloads," in ASPLOS-XIX, 2014.
    • (2014) ASPLOS-XIX
    • Kasture, H.1    Sanchez, D.2
  • 27
    • 52949085794 scopus 로고    scopus 로고
    • Cache replacement based on reuse-distance prediction
    • G. Keramidas, P. Petoumenos, and S. Kaxiras, "Cache replacement based on reuse-distance prediction," in ICCD, 2007.
    • (2007) ICCD
    • Keramidas, G.1    Petoumenos, P.2    Kaxiras, S.3
  • 28
    • 77954479306 scopus 로고    scopus 로고
    • Where replacement algorithms fail: A thorough analysis
    • G. Keramidas, P. Petoumenos, and S. Kaxiras, "Where replacement algorithms fail: A thorough analysis," in Proc. CF-7, 2010.
    • (2010) Proc. CF-7
    • Keramidas, G.1    Petoumenos, P.2    Kaxiras, S.3
  • 29
    • 79951697650 scopus 로고    scopus 로고
    • Sampling dead block prediction for last-level caches
    • S. M. Khan, Y. Tian, and D. A. Jimenez, "Sampling dead block prediction for last-level caches," in MICRO-43, 2010.
    • (2010) MICRO-43
    • Khan, S.M.1    Tian, Y.2    Jimenez, D.A.3
  • 30
  • 31
    • 2342640788 scopus 로고    scopus 로고
    • Using prime numbers for cache indexing to eliminate conflict misses
    • M. Kharbutli, K. Irwin, Y. Solihin et al., "Using prime numbers for cache indexing to eliminate conflict misses," in HPCA-10, 2004.
    • (2004) HPCA-10
    • Kharbutli, M.1    Irwin, K.2    Solihin, Y.3
  • 32
    • 77952125596 scopus 로고    scopus 로고
    • Westmere: A family of 32nm IA processors
    • N. Kurd, S. Bhamidipati, C. Mozak et al., "Westmere: A family of 32nm IA processors," in ISSCC, 2010.
    • (2010) ISSCC
    • Kurd, N.1    Bhamidipati, S.2    Mozak, C.3
  • 33
    • 79955893556 scopus 로고    scopus 로고
    • CloudCache: Expanding and shrinking private caches
    • H. Lee, S. Cho, and B. R. Childers, "CloudCache: Expanding and shrinking private caches," in HPCA-17, 2011.
    • (2011) HPCA-17
    • Lee, H.1    Cho, S.2    Childers, B.R.3
  • 35
    • 84858783719 scopus 로고    scopus 로고
    • Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations
    • J. Mars, L. Tang, R. Hundt et al., "Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations," in MICRO-44, 2011.
    • (2011) MICRO-44
    • Mars, J.1    Tang, L.2    Hundt, R.3
  • 36
    • 0014701246 scopus 로고
    • Evaluation techniques for storage hierarchies
    • R. L. Mattson, J. Gecsei, D. R. Slutz et al., "Evaluation techniques for storage hierarchies," IBM Sys. J., vol. 9, no. 2, 1970.
    • (1970) IBM Sys. J. , vol.9 , Issue.2
    • Mattson, R.L.1    Gecsei, J.2    Slutz, D.R.3
  • 38
    • 84975317561 scopus 로고    scopus 로고
    • Whirlpool: Improving dynamic cache management with static data classification
    • A. Mukkara, N. Beckmann, and D. Sanchez, "Whirlpool: Improving dynamic cache management with static data classification," in ASPLOSXXI, 2016.
    • (2016) ASPLOSXXI
    • Mukkara, A.1    Beckmann, N.2    Sanchez, D.3
  • 40
    • 84892536329 scopus 로고    scopus 로고
    • Imbalanced cache partitioning for balanced data-parallel programs
    • A. Pan and V. S. Pai, "Imbalanced cache partitioning for balanced data-parallel programs," in MICRO-46, 2013.
    • (2013) MICRO-46
    • Pan, A.1    Pai, V.S.2
  • 41
    • 34548042910 scopus 로고    scopus 로고
    • Utility-based cache partitioning: A lowoverhead, high-performance, runtime mechanism to partition shared caches
    • M. Qureshi and Y. Patt, "Utility-based cache partitioning: A lowoverhead, high-performance, runtime mechanism to partition shared caches," in MICRO-39, 2006.
    • (2006) MICRO-39
    • Qureshi, M.1    Patt, Y.2
  • 42
    • 35348920021 scopus 로고    scopus 로고
    • Adaptive insertion policies for high performance caching
    • M. Qureshi, A. Jaleel, Y. Patt et al., "Adaptive insertion policies for high performance caching," in ISCA-34, 2007.
    • (2007) ISCA-34
    • Qureshi, M.1    Jaleel, A.2    Patt, Y.3
  • 44
    • 79951696261 scopus 로고    scopus 로고
    • The ZCache: Decoupling ways and associativity
    • D. Sanchez and C. Kozyrakis, "The ZCache: Decoupling ways and associativity," in MICRO-43, 2010.
    • (2010) MICRO-43
    • Sanchez, D.1    Kozyrakis, C.2
  • 45
    • 80052521720 scopus 로고    scopus 로고
    • Vantage: Scalable and efficient finegrain cache partitioning
    • D. Sanchez and C. Kozyrakis, "Vantage: Scalable and efficient finegrain cache partitioning," in ISCA-38, 2011.
    • (2011) ISCA-38
    • Sanchez, D.1    Kozyrakis, C.2
  • 46
    • 84881154274 scopus 로고    scopus 로고
    • ZSim: Fast and accurate microarchitectural simulation of thousand-core systems
    • D. Sanchez and C. Kozyrakis, "ZSim: Fast and accurate microarchitectural simulation of thousand-core systems," in ISCA-40, 2013.
    • (2013) ISCA-40
    • Sanchez, D.1    Kozyrakis, C.2
  • 47
    • 84934319290 scopus 로고    scopus 로고
    • Reuse-based online models for caches
    • R. Sen and D. A. Wood, "Reuse-based online models for caches," in Proc. SIGMETRICS, 2013.
    • (2013) Proc. SIGMETRICS
    • Sen, R.1    Wood, D.A.2
  • 48
    • 0027307814 scopus 로고
    • A case for two-way skewed-associative caches
    • A. Seznec, "A case for two-way skewed-associative caches," in ISCA-20, 1993.
    • (1993) ISCA-20
    • Seznec, A.1
  • 49
    • 51349168284 scopus 로고    scopus 로고
    • UltraSPARC T2: A highlytreaded, power-efficient, SPARC SOC
    • M. Shah, J. Barren, J. Brooks et al., "UltraSPARC T2: A highlytreaded, power-efficient, SPARC SOC," in ISSCC, 2007.
    • (2007) ISSCC
    • Shah, M.1    Barren, J.2    Brooks, J.3
  • 50
    • 74049129459 scopus 로고    scopus 로고
    • A case for integrated processor-cache partitioning in chip multiprocessors
    • S. Srikantaiah, R. Das, A. K. Mishra et al., "A case for integrated processor-cache partitioning in chip multiprocessors," in SC09, 2009.
    • (2009) SC09
    • Srikantaiah, S.1    Das, R.2    Mishra, A.K.3
  • 51
    • 8344233359 scopus 로고    scopus 로고
    • Inter-reference gap distribution replacement: An improved replacement algorithm for set-associative caches
    • M. Takagi and K. Hiraki, "Inter-reference gap distribution replacement: an improved replacement algorithm for set-associative caches," in ICS'04, 2004.
    • (2004) ICS'04
    • Takagi, M.1    Hiraki, K.2
  • 54
    • 84863389330 scopus 로고    scopus 로고
    • SHiP: Signature-based hit predictor for high performance caching
    • C.-J. Wu, A. Jaleel, W. Hasenplaugh et al., "SHiP: Signature-based hit predictor for high performance caching," in MICRO-44, 2011.
    • (2011) MICRO-44
    • Wu, C.-J.1    Jaleel, A.2    Hasenplaugh, W.3
  • 55
    • 84881142740 scopus 로고    scopus 로고
    • Studying multicore processor scaling via reuse distance analysis
    • M.-J. Wu, M. Zhao, and D. Yeung, "Studying multicore processor scaling via reuse distance analysis," in ISCA-40, 2013.
    • (2013) ISCA-40
    • Wu, M.-J.1    Zhao, M.2    Yeung, D.3
  • 56
    • 84877690617 scopus 로고    scopus 로고
    • CPI2: CPU performance isolation for shared compute clusters
    • X. Zhang, E. Tune, R. Hagmann et al., "CPI2: CPU performance isolation for shared compute clusters," in EuroSys, 2013.
    • (2013) EuroSys
    • Zhang, X.1    Tune, E.2    Hagmann, R.3
  • 57
    • 12844271066 scopus 로고    scopus 로고
    • Dynamic tracking of page miss ratio curve for memory management
    • P. Zhou, V. Pandey, J. Sundaresan et al., "Dynamic tracking of page miss ratio curve for memory management," in ASPLOS-XI, 2004.
    • (2004) ASPLOS-XI
    • Zhou, P.1    Pandey, V.2    Sundaresan, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.