SCOPUS Information Retrieval Platform

Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

2013, Pages 213-224

Jigsaw: Scalable software-defined caches

(2) Beckmann, Nathan (a); Sanchez, Daniel (a)

(a) Massachusetts Institute of Technology (United States)

Author keywords

cache; isolation; memory; NUCA; partitioning

Indexed keywords

ACCESS LATENCY; CACHE; CACHE PARTITIONING; FUNDAMENTAL LIMITATIONS; ISOLATION; LAST-LEVEL CACHES; NUCA; PARTITIONING

DATA STORAGE EQUIPMENT; PARALLEL ARCHITECTURES

INTERFERENCE SUPPRESSION

EID: 84887440618 PISSN: 1089795X EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/PACT.2013.6618818 Document Type: Conference Paper

Times cited: 76

References (46)

1
- 33947715600
- IPC considered harmful for multiprocessor workloads
- A. Alameldeen and D. Wood, "IPC considered harmful for multiprocessor workloads," IEEE Micro, vol. 26, no. 4, 2006.
- (2006) IEEE Micro , vol.26 , Issue.4
- Alameldeen, A.¹ Wood, D.²

2
- 34548008288
- ASR: Adaptive selective replication for CMP caches
- B. Beckmann, M. Marty, and D. Wood, "ASR: Adaptive selective replication for CMP caches," in Proc. MICRO-39, 2006.
- (2006) Proc. MICRO-39
- Beckmann, B.¹ Marty, M.² Wood, D.³

3
- 21644472427
- Managing wire delay in large chip-multiprocessor caches
- B. Beckmann and D. Wood, "Managing wire delay in large chip-multiprocessor caches," in Proc. MICRO-37, 2004.
- (2004) Proc. MICRO-37
- Beckmann, B.¹ Wood, D.²

4
- 84887442380
- Jigsaw: Scalable software-defined caches
- Tech. Rep.
- N. Beckmann and D. Sanchez, "Jigsaw: Scalable Software-Defined Caches," MIT CSAIL, Tech. Rep., 2013.
- (2013) MIT CSAIL
- Beckmann, N.¹ Sanchez, D.²

5
- 70449689185
- The PARSEC benchmark suite: Characterization and architectural implications
- C. Bienia et al., "The PARSEC benchmark suite: Characterization and architectural implications," in Proc. PACT-17, 2008.
- (2008) Proc. PACT-17
- Bienia, C.¹

6
- 84887501582
- PACORA: Performance aware convex optimization for resource allocation
- S. Bird and B. Smith, "PACORA: Performance aware convex optimization for resource allocation," in Proc. HotPar-3, 2011.
- (2011) Proc. HotPar-3
- Bird, S.¹ Smith, B.²

7
- 0013540431
- Universal classes of hash functions (extended abstract)
- J. L. Carter and M. N. Wegman, "Universal classes of hash functions (extended abstract)," in Proc. STOC-9, 1977.
- (1977) Proc. STOC-9
- Carter, J.L.¹ Wegman, M.N.²

8
- 33845903561
- Cooperative caching for chip multiprocessors
- J. Chang and G. Sohi, "Cooperative caching for chip multiprocessors," in Proc. ISCA-33, 2006.
- (2006) Proc. ISCA-33
- Chang, J.¹ Sohi, G.²

9
- 0033683314
- Application-specific memory management for embedded systems using software-controlled caches
- D. Chiou et al., "Application-specific memory management for embedded systems using software-controlled caches," in Proc. DAC-37, 2000.
- (2000) Proc. DAC-37
- Chiou, D.¹

10
- 27544432313
- Optimizing replication, communication, and capacity allocation in CMPs
- Z. Chishti, M. Powell, and T. Vijaykumar, "Optimizing replication, communication, and capacity allocation in CMPs," in ISCA-32, 2005.
- (2005) ISCA-32
- Chishti, Z.¹ Powell, M.² Vijaykumar, T.³

11
- 40349095122
- Managing distributed, shared L2 caches through OS-level page allocation
- S. Cho and L. Jin, "Managing distributed, shared L2 caches through OS-level page allocation," in Proc. MICRO-39, 2006.
- (2006) Proc. MICRO-39
- Cho, S.¹ Jin, L.²

12
- 34547670591
- An adaptive shared/private NUCA cache partitioning scheme for chip multiprocessors
- H. Dybdahl and P. Stenstrom, "An adaptive shared/private NUCA cache partitioning scheme for chip multiprocessors," in Proc. HPCA-13, 2007.
- (2007) Proc. HPCA-13
- Dybdahl, H.¹ Stenstrom, P.²

13
- 49549096253
- A sub-1W to 2W low-power processor for mobile internet devices and ultra-mobile PCs in 45nm hi-k metal gate CMOS
- G. Gerosa et al., "A sub-1W to 2W low-power processor for mobile internet devices and ultra-mobile PCs in 45nm hi-k metal gate CMOS," in ISSCC, 2008.
- (2008) ISSCC
- Gerosa, G.¹

14
- 70350601187
- Reactive NUCA: Near-optimal block placement and replication in distributed caches
- N. Hardavellas et al., "Reactive NUCA: near-optimal block placement and replication in distributed caches," in Proc. ISCA-36, 2009.
- (2009) Proc. ISCA-36
- Hardavellas, N.¹

15
- 63549130254
- Distributed cooperative caching
- E. Herrero, J. González, and R. Canal, "Distributed Cooperative Caching," in Proc. PACT-17, 2008.
- (2008) Proc. PACT-17
- Herrero, E.¹ González, J.² Canal, R.³

16
- 77955009892
- Elastic cooperative caching: An autonomous dynamically adaptive memory hierarchy for chip multiprocessors
- E. Herrero, J. González, and R. Canal, "Elastic cooperative caching: an autonomous dynamically adaptive memory hierarchy for chip multiprocessors," in Proc. ISCA-37, 2010.
- (2010) Proc. ISCA-37
- Herrero, E.¹ González, J.² Canal, R.³

17
- 84910129119
- FIESTA: A sample-balanced multiprogram workload methodology
- A. Hilton, N. Eswaran, and A. Roth, "FIESTA: A sample-balanced multiprogram workload methodology," in Proc. MoBS, 2009.
- (2009) Proc. MoBS
- Hilton, A.¹ Eswaran, N.² Roth, A.³

18
- 34548225417
- A NUCA substrate for flexible CMP cache sharing
- J. Huh et al., "A NUCA substrate for flexible CMP cache sharing," IEEE Trans. Par. Dist. Sys., vol. 18, no. 8, 2007.
- (2007) IEEE Trans. Par. Dist. Sys. , vol.18 , Issue.8
- Huh, J.¹

19
- 80051679338
- Last level cache (LLC) performance of data mining workloads on a CMP
- A. Jaleel, M. Mattina, and B. Jacob, "Last Level Cache (LLC) Performance of Data Mining Workloads On A CMP," in HPCA-12, 2006.
- (2006) HPCA-12
- Jaleel, A.¹ Mattina, M.² Jacob, B.³

20
- 0036949388
- An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches
- C. Kim, D. Burger, and S. Keckler, "An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches," in ASPLOS-10, 2002.
- (2002) ASPLOS-10
- Kim, C.¹ Burger, D.² Keckler, S.³

21
- 77952125596
- Westmere: A family of 32nm IA processors
- N. Kurd et al., "Westmere: A family of 32nm IA processors," in ISSCC, 2010.
- (2010) ISSCC
- Kurd, N.¹

22
- 79955893556
- CloudCache: Expanding and shrinking private caches
- H. Lee, S. Cho, and B. R. Childers, "CloudCache: Expanding and shrinking private caches," in Proc. HPCA-17, 2011.
- (2011) Proc. HPCA-17
- Lee, H.¹ Cho, S.² Childers, B.R.³

23
- 79953203158
- CoQoS: Coordinating QoS-aware shared resources in NoC-based SoCs
- B. Li et al., "CoQoS: Coordinating QoS-aware shared resources in NoC-based SoCs," J. Par. Dist. Comp., vol. 71, no. 5, 2011.
- (2011) J. Par. Dist. Comp. , vol.71 , Issue.5
- Li, B.¹

24
- 76749146060
- McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
- S. Li et al., "McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures," in MICRO-42, 2009.
- (2009) MICRO-42
- Li, S.¹

25
- 57749186047
- Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
- J. Lin et al., "Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems," in HPCA-14, 2008.
- (2008) HPCA-14
- Lin, J.¹

26
- 84876533349
- NOC-Out: Microarchitecting a scale-out processor
- P. Lotfi-Kamran, B. Grot, and B. Falsafi, "NOC-Out: Microarchitecting a Scale-Out Processor," in Proc. MICRO-45, 2012.
- (2012) Proc. MICRO-45
- Lotfi-Kamran, P.¹ Grot, B.² Falsafi, B.³

27
- 34547633495
- Pin: Building customized program analysis tools with dynamic instrumentation
- C.-K. Luk et al., "Pin: building customized program analysis tools with dynamic instrumentation," in Proc. PLDI, 2005.
- (2005) Proc. PLDI
- Luk, C.-K.¹

28
- 84864836765
- Probabilistic shared cache management (PriSM)
- R. Manikantan, K. Rajan, and R. Govindarajan, "Probabilistic shared cache management (PriSM)," in Proc. ISCA-39, 2012.
- (2012) Proc. ISCA-39
- Manikantan, R.¹ Rajan, K.² Govindarajan, R.³

29
- 35348900723
- Virtual hierarchies to support server consolidation
- M. Marty and M. Hill, "Virtual hierarchies to support server consolidation," in Proc. ISCA-34, 2007.
- (2007) Proc. ISCA-34
- Marty, M.¹ Hill, M.²

30
- 0023330010
- On-line construction of the convex hull of a simple polyline
- A. A. Melkman, "On-line construction of the convex hull of a simple polyline," Information Processing Letters, vol. 25, no. 1, 1987.
- (1987) Information Processing Letters , vol.25 , Issue.1
- Melkman, A.A.¹

31
- 77952573440
- ESP-NUCA: A low-cost adaptive nonuniform cache architecture
- J. Merino, V. Puente, and J. Gregorio, "ESP-NUCA: A low-cost adaptive nonuniform cache architecture," in Proc. HPCA-16, 2010.
- (2010) Proc. HPCA-16
- Merino, J.¹ Puente, V.² Gregorio, J.³

32
- 84934272528
- Micron
- Micron, "1.35V DDR3L power calculator (4Gb x16 chips)," 2013.
- (2013) 1.35V DDR3L Power Calculator (4Gb x16 Chips)

33
- 64949187933
- Adaptive spill-receive for robust high-performance caching in CMPs
- M. Qureshi, "Adaptive Spill-Receive for Robust High-Performance Caching in CMPs," in Proc. HPCA-15, 2009.
- (2009) Proc. HPCA-15
- Qureshi, M.¹

34
- 34548042910
- Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
- M. Qureshi and Y. Patt, "Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches," in Proc. MICRO-39, 2006.
- (2006) Proc. MICRO-39
- Qureshi, M.¹ Patt, Y.²

35
- 0033723131
- Reconfigurable caches and their application to media processing
- P. Ranganathan, S. Adve, and N. Jouppi, "Reconfigurable caches and their application to media processing," in Proc. ISCA-27, 2000.
- (2000) Proc. ISCA-27
- Ranganathan, P.¹ Adve, S.² Jouppi, N.³

36
- 79951696261
- The ZCache: Decoupling ways and associativity
- D. Sanchez and C. Kozyrakis, "The ZCache: Decoupling Ways and Associativity," in Proc. MICRO-43, 2010.
- (2010) Proc. MICRO-43
- Sanchez, D.¹ Kozyrakis, C.²

37
- 80052521720
- Vantage: Scalable and efficient fine-grain cache partitioning
- D. Sanchez and C. Kozyrakis, "Vantage: Scalable and Efficient Fine-Grain Cache Partitioning," in Proc. ISCA-38, 2011.
- (2011) Proc. ISCA-38
- Sanchez, D.¹ Kozyrakis, C.²

38
- 84881154274
- ZSim: Fast and accurate microarchitectural simulation of thousand-core systems
- D. Sanchez and C. Kozyrakis, "ZSim: Fast and Accurate Microarchitectural Simulation of Thousand-Core Systems," in Proc. ISCA-40, 2013.
- (2013) Proc. ISCA-40
- Sanchez, D.¹ Kozyrakis, C.²

39
- 0027307814
- A case for two-way skewed-associative caches
- A. Seznec, "A case for two-way skewed-associative caches," in ISCA-20, 1993.
- (1993) ISCA-20
- Seznec, A.¹

40
- 0034443570
- Symbiotic jobscheduling for a simultaneous multithreading processor
- A. Snavely and D. M. Tullsen, "Symbiotic jobscheduling for a simultaneous multithreading processor," in Proc. ASPLOS-9, 2000.
- (2000) Proc. ASPLOS-9
- Snavely, A.¹ Tullsen, D.M.²

41
- 76749118968
- SHARP control: Controlled shared cache management in chip multiprocessors
- S. Srikantaiah, M. Kandemir, and Q. Wang, "SHARP control: Controlled shared cache management in chip multiprocessors," in MICRO-42, 2009.
- (2009) MICRO-42
- Srikantaiah, S.¹ Kandemir, M.² Wang, Q.³

42
- 57749176037
- Managing shared L2 caches on multicore systems in software
- D. Tam et al., "Managing shared L2 caches on multicore systems in software," in WIOSCA, 2007.
- (2007) WIOSCA
- Tam, D.¹

43
- 40349093471
- Molecular caches: A caching structure for dynamic creation of app-specific heterogeneous cache regions
- K. Varadarajan et al., "Molecular caches: A caching structure for dynamic creation of app-specific heterogeneous cache regions," in MICRO-39, 2006.
- (2006) MICRO-39
- Varadarajan, K.¹

44
- 80052529677
- A comparison of capacity management schemes for shared CMP caches
- C. Wu and M. Martonosi, "A Comparison of Capacity Management Schemes for Shared CMP Caches," in WDDD-7, 2008.
- (2008) WDDD-7
- Wu, C.¹ Martonosi, M.²

45
- 70450279102
- PIPP: Promotion/insertion pseudo-partitioning of multicore shared caches
- Y. Xie and G. H. Loh, "PIPP: promotion/insertion pseudo-partitioning of multicore shared caches," in Proc. ISCA-36, 2009.
- (2009) Proc. ISCA-36
- Xie, Y.¹ Loh, G.H.²

46
- 27544495466
- Victim replication: Maximizing capacity while hiding wire delay in tiled chip multiprocessors
- M. Zhang and K. Asanovic, "Victim replication: Maximizing capacity while hiding wire delay in tiled chip multiprocessors," in ISCA-32, 2005.
- (2005) ISCA-32
- Zhang, M.¹ Asanovic, K.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.