메뉴 건너뛰기




Volumn , Issue , 2010, Pages 319-330

Handling the problems and opportunities posed by multiple on-chip memory controllers

Author keywords

data placement; DRAM management; memory controller design

Indexed keywords

CONTROLLERS; DYNAMIC RANDOM ACCESS STORAGE; DYNAMICS; INTEGRATED CIRCUIT INTERCONNECTS; PARALLEL ARCHITECTURES;

EID: 78149234452     PISSN: 1089795X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1854273.1854314     Document Type: Conference Paper
Times cited : (101)

References (60)
  • 1
    • 78149244874 scopus 로고    scopus 로고
    • Homepage
    • Perfmon2 Project Homepage. http://perfmon2.sourceforge.net/.
  • 4
  • 6
    • 64949140362 scopus 로고    scopus 로고
    • Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches
    • M. Awasthi, K. Sudan, R. Balasubramonian, and J. Carter. Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches. In Proceedings of HPCA, 2009.
    • Proceedings of HPCA, 2009
    • Awasthi, M.1    Sudan, K.2    Balasubramonian, R.3    Carter, J.4
  • 11
    • 64949190009 scopus 로고    scopus 로고
    • PageNUCA: Selected Policies for Page-Grain Locality Management in Large Shared Chip-Multiprocessor Caches
    • M. Chaudhuri. PageNUCA: Selected Policies For Page-Grain Locality Management In Large Shared Chip-Multiprocessor Caches. In Proceedings of HPCA, 2009.
    • Proceedings of HPCA, 2009
    • Chaudhuri, M.1
  • 13
    • 40349095122 scopus 로고    scopus 로고
    • Managing Distributed, Shared L2 Caches through OS-Level Page Allocation
    • S. Cho and L. Jin. Managing Distributed, Shared L2 Caches through OS-Level Page Allocation. In Proceedings of MICRO, 2006.
    • Proceedings of MICRO, 2006
    • Cho, S.1    Jin, L.2
  • 15
    • 0034856730 scopus 로고    scopus 로고
    • Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance
    • V. Cuppu and B. Jacob. Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance. In Proceedings of ISCA, 2001.
    • Proceedings of ISCA, 2001
    • Cuppu, V.1    Jacob, B.2
  • 19
    • 34547670591 scopus 로고    scopus 로고
    • An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors
    • H. Dybdahl and P. Stenstrom. An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors. In Proceedings of HPCA, 2007.
    • Proceedings of HPCA, 2007
    • Dybdahl, H.1    Stenstrom, P.2
  • 24
    • 84976736383 scopus 로고
    • Page Placement Algorithms for Large Real-Indexed Caches
    • R. E. Kessler and M. D. Hill. Page Placement Algorithms for Large Real-Indexed Caches. ACM Trans. Comput. Syst., 10(4), 1992.
    • (1992) ACM Trans. Comput. Syst. , vol.10 , Issue.4
    • Kessler, R.E.1    Hill, M.D.2
  • 25
    • 40349103382 scopus 로고    scopus 로고
    • An Adaptive, Non-Uniform Cache Structure for Wire-Dominated On-Chip Caches
    • C. Kim, D. Burger, and S. Keckler. An Adaptive, Non-Uniform Cache Structure for Wire-Dominated On-Chip Caches. In Proceedings of ASPLOS, 2002.
    • Proceedings of ASPLOS, 2002
    • Kim, C.1    Burger, D.2    Keckler, S.3
  • 26
    • 77952558442 scopus 로고    scopus 로고
    • ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers
    • Y. Kim, D. Han, O. Mutlu, and M. Harchol-Balter. ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers. In Proceedings of HPCA, 2010.
    • Proceedings of HPCA, 2010
    • Kim, Y.1    Han, D.2    Mutlu, O.3    Harchol-Balter, M.4
  • 28
    • 0026107998 scopus 로고
    • Page Placement policies for NUMA multiprocessors
    • R. LaRowe and C. Ellis. Page Placement policies for NUMA multiprocessors. J. Parallel Distrib. Comput., 11(2), 1991.
    • (1991) J. Parallel Distrib. Comput. , vol.11 , Issue.2
    • LaRowe, R.1    Ellis, C.2
  • 29
    • 78149251349 scopus 로고    scopus 로고
    • Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor
    • R. LaRowe, J. Wilkes, and C. Ellis. Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor. In Proceedings of PPOPP, 1991.
    • Proceedings of PPOPP, 1991
    • LaRowe, R.1    Wilkes, J.2    Ellis, C.3
  • 32
    • 57749186047 scopus 로고    scopus 로고
    • Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems
    • J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems. In Proceedings of HPCA, 2008.
    • Proceedings of HPCA, 2008
    • Lin, J.1    Lu, Q.2    Ding, X.3    Zhang, Z.4    Zhang, X.5    Sadayappan, P.6
  • 34
    • 52649125840 scopus 로고    scopus 로고
    • 3D-Stacked Memory Architectures for Multi-Core Processors
    • G. Loh. 3D-Stacked Memory Architectures for Multi-Core Processors. In Proceedings of ISCA, 2008.
    • Proceedings of ISCA, 2008
    • Loh, G.1
  • 36
    • 77952562600 scopus 로고    scopus 로고
    • Memphis: Finding and fixing numa-related performance problems on multi-core platforms
    • C. McCurdy and J. Vetter. Memphis: Finding and fixing numa-related performance problems on multi-core platforms. In Proceedings of ISPASS, 2010.
    • Proceedings of ISPASS, 2010
    • McCurdy, C.1    Vetter, J.2
  • 38
    • 0035511103 scopus 로고    scopus 로고
    • Improving Performance of Large Physically Indexed Caches by Decoupling Memory Addresses from Cache Addresses
    • R. Min and Y. Hu. Improving Performance of Large Physically Indexed Caches by Decoupling Memory Addresses from Cache Addresses. IEEE Trans. Comput., 50(11), 2001.
    • (2001) IEEE Trans. Comput. , vol.50 , Issue.11
    • Min, R.1    Hu, Y.2
  • 40
    • 47349122373 scopus 로고    scopus 로고
    • Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors
    • O. Mutlu and T. Moscibroda. Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors. In Proceedings of MICRO, 2007.
    • Proceedings of MICRO, 2007
    • Mutlu, O.1    Moscibroda, T.2
  • 41
    • 52649119398 scopus 로고    scopus 로고
    • Parallelism-Aware Batch Scheduling: Enhancing Both Performance and Fairness of Shared DRAM Systems
    • O. Mutlu and T. Moscibroda. Parallelism-Aware Batch Scheduling: Enhancing Both Performance and Fairness of Shared DRAM Systems. In Proceedings of ISCA, 2008.
    • Proceedings of ISCA, 2008
    • Mutlu, O.1    Moscibroda, T.2
  • 42
    • 12844249966 scopus 로고    scopus 로고
    • Heat-and-Run: Leveraging SMT and CMP to Manage Power Density Through the Operating System
    • M. Powell, M. Gomaa, and T. Vijaykumar. Heat-and-Run: Leveraging SMT and CMP to Manage Power Density Through the Operating System. In Proceedings of ASPLOS, 2004.
    • Proceedings of ASPLOS, 2004
    • Powell, M.1    Gomaa, M.2    Vijaykumar, T.3
  • 43
    • 64949187933 scopus 로고    scopus 로고
    • Adaptive Spill-Receive for Robust High-Performance Caching in CMPs
    • M. K. Qureshi. Adaptive Spill-Receive for Robust High-Performance Caching in CMPs. In Proceedings of HPCA, 2009.
    • Proceedings of HPCA, 2009
    • Qureshi, M.K.1
  • 48
  • 49
    • 27544498313 scopus 로고    scopus 로고
    • Adaptive Mechanisms and Policies for Managing Cache Hierarchies in Chip Multiprocessors
    • E. Speight, H. Shafi, L. Zhang, and R. Rajamony. Adaptive Mechanisms and Policies for Managing Cache Hierarchies in Chip Multiprocessors. In Proceedings of ISCA, 2005.
    • Proceedings of ISCA, 2005
    • Speight, E.1    Shafi, H.2    Zhang, L.3    Rajamony, R.4
  • 51
    • 52649100126 scopus 로고    scopus 로고
    • Corona: System Implications of Emerging Nanophotonic Technology
    • D. Vantrease et al. Corona: System Implications of Emerging Nanophotonic Technology. In Proceedings of ISCA, 2008.
    • Proceedings of ISCA, 2008
    • Vantrease, D.1
  • 52
    • 17044405973 scopus 로고    scopus 로고
    • Operating system support for improving data locality on CC-NUMA compute servers
    • B. Verghese, S. Devine, A. Gupta, and M. Rosenblum. Operating system support for improving data locality on CC-NUMA compute servers. SIGPLAN Not., 31(9), 1996.
    • (1996) SIGPLAN Not. , vol.31 , Issue.9
    • Verghese, B.1    Devine, S.2    Gupta, A.3    Rosenblum, M.4
  • 55
    • 36849030305 scopus 로고    scopus 로고
    • On-Chip Interconnection Architecture of the Tile Processor
    • D. Wentzlaff et al. On-Chip Interconnection Architecture of the Tile Processor. In IEEE Micro, volume 22, 2007.
    • (2007) IEEE Micro , vol.22
    • Wentzlaff, D.1
  • 56
    • 27544495466 scopus 로고    scopus 로고
    • Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors
    • M. Zhang and K. Asanovic. Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors. In Proceedings of ISCA, 2005.
    • Proceedings of ISCA, 2005
    • Zhang, M.1    Asanovic, K.2
  • 57
    • 0034460897 scopus 로고    scopus 로고
    • A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conicts and Exploit Data Locality
    • Z. Zhang, Z. Zhu, and X. Zhand. A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conicts and Exploit Data Locality. In Proceedings of MICRO, 2000.
    • Proceedings of MICRO, 2000
    • Zhang, Z.1    Zhu, Z.2    Zhand, X.3
  • 58
    • 66749162556 scopus 로고    scopus 로고
    • Mini-Rank: Adaptive DRAM Architecture for Improving Memory Power E ciency
    • H. Zheng et al. Mini-Rank: Adaptive DRAM Architecture For Improving Memory Power E ciency. In Proceedings of MICRO, 2008.
    • Proceedings of MICRO, 2008
    • Zheng, H.1
  • 60
    • 28444470842 scopus 로고    scopus 로고
    • A Performance Comparison of DRAM Memory System Optimizations for SMT Processors
    • Z. Zhu and Z. Zhang. A Performance Comparison of DRAM Memory System Optimizations for SMT Processors. In Proceedings of HPCA, 2005
    • Proceedings of HPCA, 2005
    • Zhu, Z.1    Zhang, Z.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.