메뉴 건너뛰기




Volumn , Issue , 2009, Pages 262-273

Optimizing communication and capacity in a 3D stacked reconfigurable cache hierarchy

Author keywords

Cache and memory hierarchy; Multi core processors; Non uniform cache architecture (NUCA); On chip networks; Page coloring; SRAM DRAM cache reconfiguration

Indexed keywords

BUFFER STORAGE; DYNAMIC RANDOM ACCESS STORAGE; MEMORY ARCHITECTURE; NETWORK ARCHITECTURE; STATIC RANDOM ACCESS STORAGE; THREE DIMENSIONAL INTEGRATED CIRCUITS;

EID: 64949203821     PISSN: 15300897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HPCA.2009.4798261     Document Type: Conference Paper
Times cited : (59)

References (49)
  • 1
    • 64949140362 scopus 로고    scopus 로고
    • Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches
    • Feb
    • M. Awasthi, K. Sudan, and R. Balasubramonian. Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches. In Proceedings of HPCA-15, Feb 2009.
    • (2009) Proceedings of HPCA-15
    • Awasthi, M.1    Sudan, K.2    Balasubramonian, R.3
  • 2
    • 0034461413 scopus 로고    scopus 로고
    • R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory Hierarchy Reconfiguration for Energy and Performance in General-Purpose Processor Architectures. In Proceedings of MICRO-33, pages 245?257, December 2000.
    • R. Balasubramonian, D. Albonesi, A. Buyuktosunoglu, and S. Dwarkadas. Memory Hierarchy Reconfiguration for Energy and Performance in General-Purpose Processor Architectures. In Proceedings of MICRO-33, pages 245?257, December 2000.
  • 4
    • 0038346226 scopus 로고    scopus 로고
    • Dynamically Managing the Communication-Parallelism Trade-Off in Future Clustered Processors
    • June
    • R. Balasubramonian, S. Dwarkadas, and D. Albonesi. Dynamically Managing the Communication-Parallelism Trade-Off in Future Clustered Processors. In Proceedings of ISCA-30, pages 275-286, June 2003.
    • (2003) Proceedings of ISCA-30 , pp. 275-286
    • Balasubramonian, R.1    Dwarkadas, S.2    Albonesi, D.3
  • 5
    • 34548008288 scopus 로고    scopus 로고
    • ASR: Adaptive Selective Replication for CMP Caches
    • December
    • B. Beckmann, M. Marty, and D. Wood. ASR: Adaptive Selective Replication for CMP Caches. In Proceedings of MICRO-39, December 2006.
    • (2006) Proceedings of MICRO-39
    • Beckmann, B.1    Marty, M.2    Wood, D.3
  • 7
    • 21644472427 scopus 로고    scopus 로고
    • Managing Wire Delay in Large Chip-Multiprocessor Caches
    • December
    • B. Beckmann and D.Wood. Managing Wire Delay in Large Chip-Multiprocessor Caches. In Proceedings of MICRO-37, December 2004.
    • (2004) Proceedings of MICRO-37
    • Beckmann, B.1    Wood, D.2
  • 8
    • 33845903561 scopus 로고    scopus 로고
    • Co-Operative Caching for Chip Multiprocessors
    • June
    • J. Chang and G. Sohi. Co-Operative Caching for Chip Multiprocessors. In Proceedings of ISCA-33, June 2006.
    • (2006) Proceedings of ISCA-33
    • Chang, J.1    Sohi, G.2
  • 9
    • 84944411840 scopus 로고    scopus 로고
    • Distance Associativity for High-Performance Energy-Efficient Non-Uniform Cache Architectures
    • December
    • Z. Chishti, M. Powell, and T. Vijaykumar. Distance Associativity for High-Performance Energy-Efficient Non-Uniform Cache Architectures. In Proceedings of MICRO-36, December 2003.
    • (2003) Proceedings of MICRO-36
    • Chishti, Z.1    Powell, M.2    Vijaykumar, T.3
  • 10
    • 27544432313 scopus 로고    scopus 로고
    • Optimizing Replication, Communication, and Capacity Allocation in CMPs
    • June
    • Z. Chishti, M. Powell, and T. Vijaykumar. Optimizing Replication, Communication, and Capacity Allocation in CMPs. In Proceedings of ISCA-32, June 2005.
    • (2005) Proceedings of ISCA-32
    • Chishti, Z.1    Powell, M.2    Vijaykumar, T.3
  • 11
    • 40349095122 scopus 로고    scopus 로고
    • Managing Distributed, Shared L2 Caches through OS-Level Page Allocation
    • December
    • S. Cho and L. Jin. Managing Distributed, Shared L2 Caches through OS-Level Page Allocation. In Proceedings of MICRO-39, December 2006.
    • (2006) Proceedings of MICRO-39
    • Cho, S.1    Jin, L.2
  • 12
    • 63549146674 scopus 로고    scopus 로고
    • Workshop on On- and Off-Chip Interconnection Networks for Multicore Systems (OCIN)
    • at
    • W. Dally. Workshop on On- and Off-Chip Interconnection Networks for Multicore Systems (OCIN), 2006. Workshop program and report at http://www.ece.ucdavis.edu/̃ocin06/.
    • (2006) Workshop program and report
    • Dally, W.1
  • 13
    • 0036292415 scopus 로고    scopus 로고
    • A. Dhodapkar and J. E. Smith. Managing Multi- Configurable Hardware via Dynamic Working Set Analysis. In Proceedings of ISCA-29, pages 233?244, May 2002.
    • A. Dhodapkar and J. E. Smith. Managing Multi- Configurable Hardware via Dynamic Working Set Analysis. In Proceedings of ISCA-29, pages 233?244, May 2002.
  • 14
    • 84968860873 scopus 로고    scopus 로고
    • Characterizing and Predicting Program Behavior and its Variability
    • September
    • E. Duesterwald, C. Cascaval, and S. Dwarkadas. Characterizing and Predicting Program Behavior and its Variability. In Proceedings of PACT-12, September 2003.
    • (2003) Proceedings of PACT-12
    • Duesterwald, E.1    Cascaval, C.2    Dwarkadas, S.3
  • 15
    • 34547670591 scopus 로고    scopus 로고
    • An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors
    • February
    • H. Dybdahl and P. Stenstrom. An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors. In Proceedings of HPCA-13, February 2007.
    • (2007) Proceedings of HPCA-13
    • Dybdahl, H.1    Stenstrom, P.2
  • 18
    • 32844471317 scopus 로고    scopus 로고
    • J. Huh, C. Kim, H. Shafi, L. Zhang, D. Burger, and S. Keckler. A NUCA Substrate for Flexible CMP Cache Sharing. In Proceedings of ICS-19, June 2005.
    • J. Huh, C. Kim, H. Shafi, L. Zhang, D. Burger, and S. Keckler. A NUCA Substrate for Flexible CMP Cache Sharing. In Proceedings of ICS-19, June 2005.
  • 20
    • 34547657571 scopus 로고    scopus 로고
    • A Domain-Specific On- Chip Network Design for Large Scale Cache Systems
    • February
    • Y. Jin, E. J. Kim, and K. H. Yum. A Domain-Specific On- Chip Network Design for Large Scale Cache Systems. In Proceedings of HPCA-13, February 2007.
    • (2007) Proceedings of HPCA-13
    • Jin, Y.1    Kim, E.J.2    Yum, K.H.3
  • 22
    • 0036949388 scopus 로고    scopus 로고
    • An Adaptive, Non-Uniform Cache Structure for Wire-Dominated On-Chip Caches
    • October
    • C. Kim, D. Burger, and S. Keckler. An Adaptive, Non-Uniform Cache Structure for Wire-Dominated On-Chip Caches. In Proceedings of ASPLOS-X, October 2002.
    • (2002) Proceedings of ASPLOS-X
    • Kim, C.1    Burger, D.2    Keckler, S.3
  • 25
  • 26
    • 52649125840 scopus 로고    scopus 로고
    • 3D-Stacked Memory Architectures for Multi-Core Processors
    • June
    • G. Loh. 3D-Stacked Memory Architectures for Multi-Core Processors. In Proceedings of ISCA-35, June 2008.
    • (2008) Proceedings of ISCA-35
    • Loh, G.1
  • 27
  • 29
    • 20344403770 scopus 로고    scopus 로고
    • Montecito: A Dual-Core, Dual-Thread Itanium Processor
    • March/April
    • C. McNairy and R. Bhatia. Montecito: A Dual-Core, Dual-Thread Itanium Processor. IEEE Micro, 25(2), March/April 2005.
    • (2005) IEEE Micro , vol.25 , Issue.2
    • McNairy, C.1    Bhatia, R.2
  • 33
    • 34548042910 scopus 로고    scopus 로고
    • Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches
    • December
    • M. Qureshi and Y. Patt. Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches. In Proceedings of MICRO-39, December 2006.
    • (2006) Proceedings of MICRO-39
    • Qureshi, M.1    Patt, Y.2
  • 34
    • 0033723131 scopus 로고    scopus 로고
    • Reconfigurable caches and their application to media processing
    • June
    • P. Ranganathan, S. Adve, and N. Jouppi. Reconfigurable caches and their application to media processing. Proceedings of ISCA-27, pages 214-224, June 2000.
    • (2000) Proceedings of ISCA-27 , pp. 214-224
    • Ranganathan, P.1    Adve, S.2    Jouppi, N.3
  • 36
    • 84868915709 scopus 로고    scopus 로고
    • Semiconductor Industry Association
    • Semiconductor Industry Association. International Technology Roadmap for Semiconductors 2005. http://www.itrs.net/Links/2005ITRS/Home2005.htm.
    • (2005)
  • 37
    • 0038684860 scopus 로고    scopus 로고
    • K. Skadron, M. R. Stan, W. Huang, S. Velusamy, and K. Sankaranarayanan. Temperature-Aware Microarchitecture. In Proceedings of ISCA-30, pages 2?13, 2003.
    • K. Skadron, M. R. Stan, W. Huang, S. Velusamy, and K. Sankaranarayanan. Temperature-Aware Microarchitecture. In Proceedings of ISCA-30, pages 2?13, 2003.
  • 38
    • 27544498313 scopus 로고    scopus 로고
    • Adaptive Mechanisms and Policies for Managing Cache Hierarchies in Chip Multiprocessors
    • June
    • E. Speight, H. Shafi, L. Zhang, and R. Rajamony. Adaptive Mechanisms and Policies for Managing Cache Hierarchies in Chip Multiprocessors. In Proceedings of ISCA-32, June 2005.
    • (2005) Proceedings of ISCA-32
    • Speight, E.1    Shafi, H.2    Zhang, L.3    Rajamony, R.4
  • 40
    • 52649139073 scopus 로고    scopus 로고
    • A Comprehensive Memory Modeling Tool and its Application to the Design and Analysis of Future Memory Hierarchies
    • June
    • S. Thoziyoor, J. H. Ahn, M. Monchiero, J. B. Brockman, and N. P. Jouppi. A Comprehensive Memory Modeling Tool and its Application to the Design and Analysis of Future Memory Hierarchies. In Proceedings of ISCA-35, June 2008.
    • (2008) Proceedings of ISCA-35
    • Thoziyoor, S.1    Ahn, J.H.2    Monchiero, M.3    Brockman, J.B.4    Jouppi, N.P.5
  • 41
    • 64949203223 scopus 로고    scopus 로고
    • A Third-Generation 65nm 16-Core 32-Thread Plus 32-Scout-Thread CMT
    • Febuary
    • M. Tremblay and S. Chaudhry. A Third-Generation 65nm 16-Core 32-Thread Plus 32-Scout-Thread CMT. In Proceedings of ISSCC, Febuary 2008.
    • (2008) Proceedings of ISSCC
    • Tremblay, M.1    Chaudhry, S.2
  • 44
    • 0037225560 scopus 로고    scopus 로고
    • A Power Model for Routers: Modeling Alpha 21364 and InfiniBand Routers
    • January
    • H. S. Wang, L. S. Peh, and S. Malik. A Power Model for Routers: Modeling Alpha 21364 and InfiniBand Routers. In IEEE Micro, Vol 24, No 1, January 2003.
    • (2003) IEEE Micro , vol.24 , Issue.1
    • Wang, H.S.1    Peh, L.S.2    Malik, S.3
  • 45
    • 84948976085 scopus 로고    scopus 로고
    • Orion: A Power-Performance Simulator for Interconnection Networks
    • November
    • H.-S. Wang, X. Zhu, L.-S. Peh, and S. Malik. Orion: A Power-Performance Simulator for Interconnection Networks. In Proceedings of MICRO-35, November 2002.
    • (2002) Proceedings of MICRO-35
    • Wang, H.-S.1    Zhu, X.2    Peh, L.-S.3    Malik, S.4
  • 47
    • 0038684781 scopus 로고    scopus 로고
    • A Highly Configurable Cache Architecture for Embedded Systems
    • June
    • C. Zhang, F. Vahid, and W. Najjar. A Highly Configurable Cache Architecture for Embedded Systems. In Proceedings of ISCA-30, June 2003.
    • (2003) Proceedings of ISCA-30
    • Zhang, C.1    Vahid, F.2    Najjar, W.3
  • 48
    • 27544495466 scopus 로고    scopus 로고
    • Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors
    • June
    • M. Zhang and K. Asanovic. Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors. In Proceedings of ISCA-32, June 2005.
    • (2005) Proceedings of ISCA-32
    • Zhang, M.1    Asanovic, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.