SCOPUS 정보 검색 플랫폼

Proceedings of the Annual International Symposium on Microarchitecture, MICRO

Volumn , Issue , 2010, Pages 187-198

The zcache: Decoupling ways and associativity

(2) Sanchez, Daniel a Kozyrakis, Christos a

a Stanford University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ASSOCIATIVE CACHE; ASSOCIATIVITY; CACHE DESIGN; CONFLICT MISS; CONVENTIONAL DESIGN; CRITICAL PATHS; ENERGY COST; LOOKUPS; MAIN MEMORY; MULTITHREADED; SET-ASSOCIATIVE CACHES;

BUFFER STORAGE; ENERGY EFFICIENCY; MULTIPROGRAMMING; PROBABILITY DISTRIBUTIONS;

DESIGN;

EID: 79951696261 PISSN: 10724451 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/MICRO.2010.20 Document Type: Conference Paper

Times cited : (155)

References (45)

1
- 0027192667
- Column-associative caches: A technique for reducing the miss rate of direct-mapped caches
- A. Agarwal and S. D. Pudar, "Column-associative caches: a technique for reducing the miss rate of direct-mapped caches," in Proc. of the 20th annual Intl. Symp. on Computer architecture, 1993.
- (1993) Proc. of the 20th Annual Intl. Symp. on Computer Architecture
- Agarwal, A.¹ Pudar, S.D.²

2
- 33947715600
- IPC considered harmful for multiprocessor workloads
- A. Alameldeen and D. Wood, "IPC considered harmful for multiprocessor workloads," IEEE Micro, vol. 26, no. 4, 2006.
- (2006) IEEE Micro , vol.26 , Issue.4
- Alameldeen, A.¹ Wood, D.²

3
- 47349112480
- Scavenger: A new last level cache architecture with global block priority
- A. Basu, N. Kirman, M. Kirman, M. Chaudhuri, and J. Martinez, "Scavenger: A new last level cache architecture with global block priority," in Proc. of the 40th annual IEEE/ACM Intl Symp. on Microarchitecture, 2007.
- (2007) Proc. of the 40th Annual IEEE/ACM Intl Symp. on Microarchitecture
- Basu, A.¹ Kirman, N.² Kirman, M.³ Chaudhuri, M.⁴ Martinez, J.⁵

4
- 0003003638
- A study of replacement algorithms for a virtualstorage computer
- L. A. Belady, "A study of replacement algorithms for a virtualstorage computer," IBM Syst. J., vol. 5, no. 2, 1966.
- (1966) IBM Syst. J. , vol.5 , Issue.2
- Belady, L.A.¹

5
- 63549095070
- The PARSEC benchmark suite: Characterization and architectural implications
- C. Bienia, S. Kumar, J. P. Singh, and K. Li, "The PARSEC benchmark suite: Characterization and architectural implications," in Proc. of the 17th Intl. Conf. on Parallel Architectures and Compilation Techniques, 2008.
- (2008) Proc. of the 17th Intl. Conf. on Parallel Architectures and Compilation Techniques
- Bienia, C.¹ Kumar, S.² Singh, J.P.³ Li, K.⁴

6
- 0014814325
- Space/time trade-offs in hash coding with allowable errors
- B. H. Bloom, "Space/time trade-offs in hash coding with allowable errors," Commun. ACM, vol. 13, no. 7, 1970.
- (1970) Commun. ACM , vol.13 , Issue.7
- Bloom, B.H.¹

7
- 0029189692
- Skewed associativity enhances performance predictability
- F. Bodin and A. Seznec, "Skewed associativity enhances performance predictability," in Proc. of the 22nd annual Intl. Symp. on Computer Architecture, 1995.
- (1995) Proc. of the 22nd Annual Intl. Symp. on Computer Architecture
- Bodin, F.¹ Seznec, A.²

8
- 33846703999
- Disintermediated active communication
- A. Bracy, K. Doshi, and Q. Jacobson, "Disintermediated active communication," Comput. Archit. Lett., vol. 5, no. 2, 2006.
- (2006) Comput. Archit. Lett. , vol.5 , Issue.2
- Bracy, A.¹ Doshi, K.² Jacobson, Q.³

9
- 79951691107
- Ph.D. dissertation, Michigan State University
- M. W. Brehob, "On the mathematics of caching," Ph.D. dissertation, Michigan State University, 2003.
- (2003) On the Mathematics of Caching
- Brehob, M.W.¹

10
- 0029710803
- Predictive sequential associative cache
- B. Calder, D. Grunwald, and J. Emer, "Predictive sequential associative cache," in Proc. of the 2nd IEEE Symp. on High- Performance Computer Architecture, 1996.
- (1996) Proc. of the 2nd IEEE Symp. on High- Performance Computer Architecture
- Calder, B.¹ Grunwald, D.² Emer, J.³

11
- 84963650728
- Universal classes of hash functions (extended abstract)
- J. L. Carter and M. N. Wegman, "Universal classes of hash functions (extended abstract)," in Proc. of the 9th annual ACM Symposium on Theory of Computing, 1977.
- (1977) Proc. of the 9th Annual ACM Symposium on Theory of Computing
- Carter, J.L.¹ Wegman, M.N.²

12
- 35348862407
- BulkSC: Bulk enforcement of sequential consistency
- L. Ceze, J. Tuck, P. Montesinos, and J. Torrellas, "BulkSC: bulk enforcement of sequential consistency," in Proc. of the 34th annual Intl. Symp. on Computer architecture, 2007.
- (2007) Proc. of the 34th Annual Intl. Symp. on Computer Architecture
- Ceze, L.¹ Tuck, J.² Montesinos, P.³ Torrellas, J.⁴

13
- 33845866604
- Bulk disambiguation of speculative threads in multiprocessors
- L. Ceze, J. Tuck, J. Torrellas, and C. Cascaval, "Bulk disambiguation of speculative threads in multiprocessors," in Proc. of the 33rd annual Intl. Symp. on Computer Architecture, 2006.
- (2006) Proc. of the 33rd Annual Intl. Symp. on Computer Architecture
- Ceze, L.¹ Tuck, J.² Torrellas, J.³ Cascaval, C.⁴

14
- 76749083861
- Pseudo-LIFO: The foundation of a new family of replacement policies for last-level caches
- M. Chaudhuri, "Pseudo-LIFO: the foundation of a new family of replacement policies for last-level caches," in Proc. of the 42nd annual IEEE/ACM Intl. Symp. on Microarchitecture, 2009.
- (2009) Proc. of the 42nd Annual IEEE/ACM Intl. Symp. on Microarchitecture
- Chaudhuri, M.¹

15
- 84944411840
- Distance associativity for high-performance energy-efficient non-uniform cache architectures
- Z. Chishti, M. D. Powell, and T. N. Vijaykumar, "Distance associativity for high-performance energy-efficient non-uniform cache architectures," in Proc. of the 36th Annual IEEE/ACM Intl. Symp. on Microarchitecture, 2003.
- (2003) Proc. of the 36th Annual IEEE/ACM Intl. Symp. on Microarchitecture
- Chishti, Z.¹ Powell, M.D.² Vijaykumar, T.N.³

16
- 57849167541
- An efficient hardware-based multi-hash scheme for high speed IP lookup
- S. Demetriades, M. Hanna, S. Cho, and R. Melhem, "An efficient hardware-based multi-hash scheme for high speed IP lookup," in Proc. of the 16th IEEE Symp. on High Performance Interconnects, 2008.
- (2008) Proc. of the 16th IEEE Symp. on High Performance Interconnects
- Demetriades, S.¹ Hanna, M.² Cho, S.³ Melhem, R.⁴

17
- 49549096253
- A sub-1W to 2W low-power IA processor for mobile internet devices and ultra-mobile PCs in 45nm hi-K metal gate CMOS
- G. Gerosa et al., "A sub-1W to 2W low-power IA processor for mobile internet devices and ultra-mobile PCs in 45nm hi-K metal gate CMOS," in IEEE Intl. Solid-State Circuits Conf., 2008.
- (2008) IEEE Intl. Solid-State Circuits Conf.
- Gerosa, G.¹

18
- 0033723498
- A fully associative softwaremanaged cache design
- E. G. Hallnor and S. K. Reinhardt, "A fully associative softwaremanaged cache design," in Proc. of the 27th annual Intl. Symp. on Computer Architecture, 2000.
- (2000) Proc. of the 27th Annual Intl. Symp. on Computer Architecture
- Hallnor, E.G.¹ Reinhardt, S.K.²

19
- 4644359934
- Transactional memory coherence and consistency
- L. Hammond, V. Wong, M. Chen, B. D. Carlstrom, J. D. Davis, B. Hertzberg, M. K. Prabhu, H. Wijaya, C. Kozyrakis, and K. Olukotun, "Transactional memory coherence and consistency," in Proc. of the 31st annual Intl. Symp. on Computer Architecture, 2004.
- (2004) Proc. of the 31st Annual Intl. Symp. on Computer Architecture
- Hammond, L.¹ Wong, V.² Chen, M.³ Carlstrom, B.D.⁴ Davis, J.D.⁵ Hertzberg, B.⁶ Prabhu, M.K.⁷ Wijaya, H.⁸ Kozyrakis, C.⁹ Olukotun, K.¹⁰

20
- 77954997113
- Hewlett-Packard, Tech. Rep.
- Hewlett-Packard, "Inside the Intel Itanium 2 processor," Tech. Rep., 2002.
- (2002) Inside the Intel Itanium 2 Processor

21
- 0024903997
- Evaluating associativity in cpu caches
- M. D. Hill and A. J. Smith, "Evaluating associativity in cpu caches," IEEE Trans. Comput., vol. 38, no. 12, 1989.
- (1989) IEEE Trans. Comput. , vol.38 , Issue.12
- Hill, M.D.¹ Smith, A.J.²

22
- 77952123736
- A 48-core IA-32 message-passing processor with DVFS in 45nm CMOS
- J. Howard et al., "A 48-core IA-32 message-passing processor with DVFS in 45nm CMOS," in IEEE Intl. Solid-State Circuits Conf., 2010.
- (2010) IEEE Intl. Solid-State Circuits Conf.
- Howard, J.¹

23
- 63549149925
- Adaptive insertion policies for managing shared caches
- A. Jaleel, W. Hasenplaugh, M. Qureshi, J. Sebot, S. Steely, Jr., and J. Emer, "Adaptive insertion policies for managing shared caches," in Proc. of the 17th intl. conf. on Parallel Architectures and Compilation Techniques, 2008.
- (2008) Proc. of the 17th Intl. Conf. on Parallel Architectures and Compilation Techniques
- Jaleel, A.¹ Hasenplaugh, W.² Qureshi, M.³ Sebot, J.⁴ Steely Jr., S.⁵ Emer, J.⁶

24
- 77954998134
- High performance cache replacement using re-reference interval prediction (RRIP)
- A. Jaleel, K. Theobald, S. C. S. Jr, and J. Emer, "High performance cache replacement using re-reference interval prediction (RRIP)," in Proc. of the 37th annual Intl. Symp. on Computer Architecture, 2010.
- (2010) Proc. of the 37th Annual Intl. Symp. on Computer Architecture
- Jaleel, A.¹ Theobald, K.² Jr, S.C.S.³ Emer, J.⁴

25
- 0025429331
- Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
- N. P. Jouppi, "Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers," in Proc. of the 17th annual Intl. Symp. on Computer Architecture, 1990.
- (1990) Proc. of the 17th Annual Intl. Symp. on Computer Architecture
- Jouppi, N.P.¹

26
- 2342640788
- Using prime numbers for cache indexing to eliminate conflict misses
- M. Kharbutli, K. Irwin, Y. Solihin, and J. Lee, "Using prime numbers for cache indexing to eliminate conflict misses," in Proc. of the 10th Intl. Symp. on High Performance Computer Architecture, 2004.
- (2004) Proc. of the 10th Intl. Symp. on High Performance Computer Architecture
- Kharbutli, M.¹ Irwin, K.² Solihin, Y.³ Lee, J.⁴

27
- 0036949388
- An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches
- C. Kim, D. Burger, and S. W. Keckler, "An adaptive, nonuniform cache structure for wire-delay dominated on-chip caches," in Proc. of the 10th intl. conf. on Architectural Support for Programming Languages and Operating Systems, 2002.
- (2002) Proc. of the 10th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems
- Kim, C.¹ Burger, D.² Keckler, S.W.³

28
- 84904279959
- Lockup-free instruction fetch/prefetch cache organization
- D. Kroft, "Lockup-free instruction fetch/prefetch cache organization," in Proc. of the 8th annual Intl. Symp. on Computer Architecture, 1981.
- (1981) Proc. of the 8th Annual Intl. Symp. on Computer Architecture
- Kroft, D.¹

29
- 77952125596
- Westmere: A family of 32nm IA processors
- N. Kurd et al., "Westmere: A family of 32nm IA processors," in IEEE Intl. Solid-State Circuits Conf., 2010.
- (2010) IEEE Intl. Solid-State Circuits Conf.
- Kurd, N.¹

30
- 76749146060
- McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
- S. Li, J. H. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi, "McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures," in Proc. of the 42nd annual IEEE/ACM Intl. Symp. on Microarchitecture, 2009.
- (2009) Proc. of the 42nd Annual IEEE/ACM Intl. Symp. on Microarchitecture
- Li, S.¹ Ahn, J.H.² Strong, R.D.³ Brockman, J.B.⁴ Tullsen, D.M.⁵ Jouppi, N.P.⁶

31
- 31944440969
- Pin: Building customized program analysis tools with dynamic instrumentation
- C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood, "Pin: building customized program analysis tools with dynamic instrumentation," in Proc. of the ACM SIGPLAN conf. on Programming Language Design and Implementation, 2005.
- (2005) Proc. of the ACM SIGPLAN Conf. on Programming Language Design and Implementation
- Luk, C.-K.¹ Cohn, R.² Muth, R.³ Patil, H.⁴ Klauser, A.⁵ Lowney, G.⁶ Wallace, S.⁷ Reddi, V.J.⁸ Hazelwood, K.⁹

32
- 77955316263
- Some open questions related to cuckoo hashing
- M. Mitzenmacher, "Some open questions related to cuckoo hashing," in Proc. of the 17th annual European Symp. on Algorithms, 2009.
- (2009) Proc. of the 17th Annual European Symp. on Algorithms
- Mitzenmacher, M.¹

33
- 47349084021
- Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0
- N. Muralimanohar, R. Balasubramonian, and N. Jouppi, "Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0," in Proc. of the 40th annual IEEE/ACM Intl. Symp. on Microarchitecture, 2007.
- (2007) Proc. of the 40th Annual IEEE/ACM Intl. Symp. on Microarchitecture
- Muralimanohar, N.¹ Balasubramonian, R.² Jouppi, N.³

34
- 70450253533
- ECMon: Exposing cache events for monitoring
- V. Nagarajan and R. Gupta, "ECMon: exposing cache events for monitoring," in Proc. of the 36th annual Intl. Symp. on Computer Architecture, 2009.
- (2009) Proc. of the 36th Annual Intl. Symp. on Computer Architecture
- Nagarajan, V.¹ Gupta, R.²

35
- 0037972071
- Cuckoo hashing
- R. Pagh and F. F. Rodler, "Cuckoo hashing," in Proc. of the 9th annual European Symp. on Algorithms, 2001.
- (2001) Proc. of the 9th Annual European Symp. on Algorithms
- Pagh, R.¹ Rodler, F.F.²

36
- 27644555246
- The v-way cache: Demand based associativity via global replacement
- M. K. Qureshi, D. Thompson, and Y. N. Patt, "The v-way cache: Demand based associativity via global replacement," in Proc. of the 32nd annual Intl. Symp. on Computer Architecture, 2005.
- (2005) Proc. of the 32nd Annual Intl. Symp. on Computer Architecture
- Qureshi, M.K.¹ Thompson, D.² Patt, Y.N.³

37
- 76749164849
- Adaptive line placement with the set balancing cache
- D. Rolán, B. B. Fraguela, and R. Doallo, "Adaptive line placement with the set balancing cache," in Proc. of the 42nd annual IEEE/ACM Intl. Symp. on Microarchitecture, 2009.
- (2009) Proc. of the 42nd Annual IEEE/ACM Intl. Symp. on Microarchitecture
- Rolán, D.¹ Fraguela, B.B.² Doallo, R.³

38
- 47349104267
- Implementing signatures for transactional memory
- D. Sanchez, L. Yen, M. D. Hill, and K. Sankaralingam, "Implementing signatures for transactional memory," in Proc. of the 40th annual IEEE/ACM Intl. Symp. on Microarchitecture, 2007.
- (2007) Proc. of the 40th Annual IEEE/ACM Intl. Symp. on Microarchitecture
- Sanchez, D.¹ Yen, L.² Hill, M.D.³ Sankaralingam, K.⁴

39
- 0027307814
- A case for two-way skewed-associative caches
- A. Seznec, "A case for two-way skewed-associative caches," in Proc. of the 20th annual Intl. Symp. on Computer Architecture, 1993.
- (1993) Proc. of the 20th Annual Intl. Symp. on Computer Architecture
- Seznec, A.¹

40
- 77952200539
- A 40nm 16-core 128-thread CMT SPARC SoC processor
- J. Shin et al., "A 40nm 16-core 128-thread CMT SPARC SoC processor," in IEEE Intl. Solid-State Circuits Conf., 2010.
- (2010) IEEE Intl. Solid-State Circuits Conf.
- Shin, J.¹

41
- 77954985964
- Sun Microsystems, Tech. Rep.
- Sun Microsystems, "UltraSPARC T2 supplement to the Ultra- SPARC architecture 2007," Tech. Rep., 2007.
- (2007) UltraSPARC T2 Supplement to the Ultra- SPARC Architecture 2007

42
- 71149094440
- The bulk multicore architecture for improved programmability
- J. Torrellas, L. Ceze, J. Tuck, C. Cascaval, P. Montesinos, W. Ahn, and M. Prvulovic, "The bulk multicore architecture for improved programmability," Commun. ACM, vol. 52, no. 12, 2009.
- (2009) Commun. ACM , vol.52 , Issue.12
- Torrellas, J.¹ Ceze, L.² Tuck, J.³ Cascaval, C.⁴ Montesinos, P.⁵ Ahn, W.⁶ Prvulovic, M.⁷

43
- 77952179543
- The implementation of POWER7: A highly parallel and scalable multi-core high-end server processor
- D. Wendel et al., "The implementation of POWER7: A highly parallel and scalable multi-core high-end server processor," in IEEE Intl. Solid-State Circuits Conf., 2010.
- (2010) IEEE Intl. Solid-State Circuits Conf.
- Wendel, D.¹

44
- 70450279102
- PIPP: Promotion/insertion pseudopartitioning of multi-core shared caches
- Y. Xie and G. H. Loh, "PIPP: promotion/insertion pseudopartitioning of multi-core shared caches," in Proc. of the 36th annual Intl. Symp. on Computer Architecture, 2009.
- (2009) Proc. of the 36th Annual Intl. Symp. on Computer Architecture
- Xie, Y.¹ Loh, G.H.²

45
- 0031232542
- Two fast and highassociativity cache schemes
- C. Zhang, X. Zhang, and Y. Yan, "Two fast and highassociativity cache schemes," IEEE Micro, vol. 17, no. 5, 1997.
- (1997) IEEE Micro , vol.17 , Issue.5
- Zhang, C.¹ Zhang, X.² Yan, Y.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.