메뉴 건너뛰기




Volumn , Issue , 2010, Pages 219-230

Micro-pages: Increasing DRAM efficiency with locality-aware data placement

Author keywords

Data placement; DRAM row buffer management

Indexed keywords

ACCESS TIME; BUFFER MANAGEMENT; BUFFER UTILIZATION; CACHE BLOCKS; CELL ARRAY; CHIP-MULTIPROCESSOR; COLOCATIONS; DATA PLACEMENT; ENERGY CONSUMPTION; HARDWARE-ASSISTED; MULTI CORE; MULTI-CORE PROCESSOR; MULTI-CORE SYSTEMS; PAGE SIZES; PERFORMANCE IMPROVEMENTS; POWER CONSUMPTION; PROCESSOR SYSTEMS; SPATIAL AND TEMPORAL LOCALITY; WRITE-BACK;

EID: 77952283542     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1736020.1736045     Document Type: Conference Paper
Times cited : (104)

References (63)
  • 3
    • 72249085531 scopus 로고    scopus 로고
    • Available at
    • Java Server Benchmark, 2005. Available at http://www.spec.org/jbb2005/.
    • (2005) Java Server Benchmark
  • 6
    • 64949140362 scopus 로고    scopus 로고
    • Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches
    • M. Awasthi, K. Sudan, R. Balasubramonian, and J. Carter. Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches. In Proceedings of HPCA, 2009.
    • Proceedings of HPCA, 2009
    • Awasthi, M.1    Sudan, K.2    Balasubramonian, R.3    Carter, J.4
  • 13
    • 64949190009 scopus 로고    scopus 로고
    • PageNUCA: Selected Policies for Page-Grain Locality Management in Large Shared Chip-Multiprocessor Caches
    • M. Chaudhuri. PageNUCA: Selected Policies For Page-Grain Locality Management In Large Shared Chip-Multiprocessor Caches. In Proceedings of HPCA, 2009.
    • Proceedings of HPCA, 2009
    • Chaudhuri, M.1
  • 14
    • 40349095122 scopus 로고    scopus 로고
    • Managing Distributed, Shared L2 Caches through OS-Level Page Allocation
    • S. Cho and L. Jin. Managing Distributed, Shared L2 Caches through OS-Level Page Allocation. In Proceedings of MICRO, 2006.
    • Proceedings of MICRO, 2006
    • Cho, S.1    Jin, L.2
  • 16
    • 0031274906 scopus 로고    scopus 로고
    • Direct Rambus Technology: The New Main Memory Standard
    • R. Crisp. Direct Rambus Technology: The New Main Memory Standard. In Proceedings of MICRO, 1997.
    • Proceedings of MICRO, 1997
    • Crisp, R.1
  • 17
    • 0034856730 scopus 로고    scopus 로고
    • Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance
    • V. Cuppu and B. Jacob. Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance. In Proceedings of ISCA, 2001.
    • Proceedings of ISCA, 2001
    • Cuppu, V.1    Jacob, B.2
  • 30
    • 59249097212 scopus 로고    scopus 로고
    • JEDEC. JEDEC Solid State Technology Association, Virginia, USA
    • JEDEC. JESD79: Double Data Rate (DDR) SDRAM Specification. JEDEC Solid State Technology Association, Virginia, USA, 2003.
    • (2003) JESD79: Double Data Rate (DDR) SDRAM Specification
  • 31
    • 0025429331 scopus 로고
    • Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers
    • May
    • N. Jouppi. Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers. In Proceedings of ISCA-17, pages 364-373, May 1990.
    • (1990) Proceedings of ISCA-17 , pp. 364-373
    • Jouppi, N.1
  • 32
    • 84976736383 scopus 로고
    • Page Placement Algorithms for Large Real-Indexed Caches
    • R. E. Kessler and M. D. Hill. Page Placement Algorithms for Large Real-Indexed Caches. ACM Trans. Comput. Syst., 10(4), 1992.
    • (1992) ACM Trans. Comput. Syst. , vol.10 , Issue.4
    • Kessler, R.E.1    Hill, M.D.2
  • 35
    • 0026107998 scopus 로고
    • Page Placement policies for NUMA multiprocessors
    • R. LaRowe and C. Ellis. Page Placement policies for NUMA multiprocessors. J. Parallel Distrib. Comput., 11(2), 1991.
    • (1991) J. Parallel Distrib. Comput. , vol.11 , Issue.2
    • LaRowe, R.1    Ellis, C.2
  • 36
    • 78149251349 scopus 로고    scopus 로고
    • Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor
    • R. LaRowe, J. Wilkes, and C. Ellis. Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor. In Proceedings of PPOPP, 1991.
    • Proceedings of PPOPP, 1991
    • LaRowe, R.1    Wilkes, J.2    Ellis, C.3
  • 42
    • 0035511103 scopus 로고    scopus 로고
    • Improving Performance of Large Physically Indexed Caches by Decoupling Memory Addresses from Cache Addresses
    • R. Min and Y. Hu. Improving Performance of Large Physically Indexed Caches by Decoupling Memory Addresses from Cache Addresses. IEEE Trans. Comput., 50(11), 2001.
    • (2001) IEEE Trans. Comput. , vol.50 , Issue.11
    • Min, R.1    Hu, Y.2
  • 44
    • 47349122373 scopus 로고    scopus 로고
    • Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors
    • O. Mutlu and T. Moscibroda. Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors. In Proceedings of MICRO, 2007.
    • Proceedings of MICRO, 2007
    • Mutlu, O.1    Moscibroda, T.2
  • 45
    • 52649119398 scopus 로고    scopus 로고
    • Parallelism-Aware Batch Scheduling: Enhancing Both Performance and Fairness of Shared DRAM Systems
    • O. Mutlu and T. Moscibroda. Parallelism-Aware Batch Scheduling: Enhancing Both Performance and Fairness of Shared DRAM Systems. In Proceedings of ISCA, 2008.
    • Proceedings of ISCA, 2008
    • Mutlu, O.1    Moscibroda, T.2
  • 51
  • 53
    • 84978398777 scopus 로고    scopus 로고
    • Surpassing the TLB Performance of Superpages with Less Operating System Support
    • M. Talluri and M. D. Hill. Surpassing the TLB Performance of Superpages with Less Operating System Support. In Proceedings of ASPLOS-VI, 1994.
    • Proceedings of ASPLOS-VI, 1994
    • Talluri, M.1    Hill, M.D.2
  • 55
    • 17044405973 scopus 로고    scopus 로고
    • Operating system support for improving data locality on CC-NUMA compute servers
    • B. Verghese, S. Devine, A. Gupta, and M. Rosenblum. Operating system support for improving data locality on CC-NUMA compute servers. SIGPLAN Not., 31(9), 1996.
    • (1996) SIGPLAN Not. , vol.31 , Issue.9
    • Verghese, B.1    Devine, S.2    Gupta, A.3    Rosenblum, M.4
  • 59
    • 0034460897 scopus 로고    scopus 로고
    • A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conflicts and Exploit Data Locality
    • Z. Zhang, Z. Zhu, and X. Zhand. A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conflicts and Exploit Data Locality. In Proceedings of MICRO, 2000.
    • Proceedings of MICRO, 2000
    • Zhang, Z.1    Zhu, Z.2    Zhand, X.3
  • 61
    • 70450284743 scopus 로고    scopus 로고
    • Decoupled DIMM: Building High-Bandwidth Memory System from Low-Speed DRAM Devices
    • H. Zheng, J. Lin, Z. Zhang, and Z. Zhu. Decoupled DIMM: Building High-Bandwidth Memory System from Low-Speed DRAM Devices. In Proceedings of ISCA, 2009.
    • Proceedings of ISCA, 2009
    • Zheng, H.1    Lin, J.2    Zhang, Z.3    Zhu, Z.4
  • 62
    • 28444470842 scopus 로고    scopus 로고
    • A Performance Comparison of DRAM Memory System Optimizations for SMT Processors
    • Z. Zhu and Z. Zhang. A Performance Comparison of DRAM Memory System Optimizations for SMT Processors. In Proceedings of HPCA, 2005.
    • Proceedings of HPCA, 2005
    • Zhu, Z.1    Zhang, Z.2
  • 63
    • 84949752992 scopus 로고    scopus 로고
    • Fine-grain Priority Scheduling on Multi-channel Memory Systems
    • Z. Zhu, Z. Zhang, and X. Zhang. Fine-grain Priority Scheduling on Multi-channel Memory Systems. In Proceedings of HPCA, 2002.
    • Proceedings of HPCA, 2002
    • Zhu, Z.1    Zhang, Z.2    Zhang, X.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.