메뉴 건너뛰기




Volumn , Issue , 2013, Pages 185-197

RowClone: Fast and energy-efficient in-DRAM bulk data copy and initialization

Author keywords

bulk operations; DRAM; energy; in memory processing; memory bandwidth; page copy; page initialization; performance

Indexed keywords

BULK OPERATIONS; ENERGY; MEMORY BANDWIDTHS; PAGE INITIALIZATION; PERFORMANCE;

EID: 84892504664     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2540708.2540725     Document Type: Conference Paper
Times cited : (407)

References (60)
  • 5
    • 84892520544 scopus 로고    scopus 로고
    • Memory device having page copy mode
    • U.S. patent 5886944
    • J. Ahn. Memory device having page copy mode. U.S. patent 5886944, 1999.
    • (1999)
    • Ahn, J.1
  • 6
    • 74049098606 scopus 로고    scopus 로고
    • PLFS: A checkpoint filesystem for parallel applications
    • J. Bent et al. PLFS: A checkpoint filesystem for parallel applications. In SC, 2009.
    • (2009) SC
    • Bent, J.1
  • 8
    • 0001945085 scopus 로고    scopus 로고
    • Automatic I/O hint generation through speculative execution
    • F. Chang and G. A. Gibson. Automatic I/O hint generation through speculative execution. In OSDI, 1999.
    • (1999) OSDI
    • Chang, F.1    Gibson, G.A.2
  • 9
    • 84889030419 scopus 로고    scopus 로고
    • Shredding Your Garbage: Reducing data lifetime through secure deallocation
    • J. Chow et al. Shredding Your Garbage: Reducing data lifetime through secure deallocation. In USENIX SS, 2005.
    • (2005) USENIX SS
    • Chow, J.1
  • 10
    • 47349110547 scopus 로고    scopus 로고
    • Software-Based Online Detection of Hardware Defects: Mechanisms, architectural support, and evaluation
    • K. Constantinides et al. Software-Based Online Detection of Hardware Defects: Mechanisms, architectural support, and evaluation. In MICRO, 2007.
    • (2007) MICRO
    • Constantinides, K.1
  • 11
    • 66749179303 scopus 로고    scopus 로고
    • Online Design Bug Detection: RTL analysis, Wexible mechanisms, and evaluation
    • K. Constantinides et al. Online Design Bug Detection: RTL analysis, Wexible mechanisms, and evaluation. In MICRO, 2008.
    • (2008) MICRO
    • Constantinides, K.1
  • 12
    • 76749124429 scopus 로고    scopus 로고
    • Application-aware prioritization mechanisms for on-chip networks
    • Reetuparna Das et al. Application-aware prioritization mechanisms for on-chip networks. In MICRO, 2009.
    • (2009) MICRO
    • Das, R.1
  • 13
    • 84977840356 scopus 로고    scopus 로고
    • Eternal Sunshine of the Spotless Machine: Protecting privacy with ephemeral channels
    • A. M. Dunn et al. Eternal Sunshine of the Spotless Machine: Protecting privacy with ephemeral channels. In OSDI, 2012.
    • (2012) OSDI
    • Dunn, A.M.1
  • 14
    • 47249094055 scopus 로고    scopus 로고
    • System-level performance metrics for multiprogram workloads
    • S. Eyerman and L. Eeckhout. System-level performance metrics for multiprogram workloads. IEEE Micro, (3), 2008.
    • (2008) IEEE Micro , Issue.3
    • Eyerman, S.1    Eeckhout, L.2
  • 15
    • 84892501260 scopus 로고    scopus 로고
    • DRAM page copy method
    • U.S. patent 5625601
    • P. B. Gillingham and R. Torrance. DRAM page copy method. U.S. patent 5625601, 1997.
    • (1997)
    • Gillingham, P.B.1    Torrance, R.2
  • 16
    • 85077701725 scopus 로고    scopus 로고
    • Lest We Remember: Cold boot attacks on encryption keys
    • J. A. Halderman et al. Lest We Remember: Cold boot attacks on encryption keys. In USENIX SS, 2008.
    • (2008) USENIX SS
    • Halderman, J.A.1
  • 17
    • 36049035340 scopus 로고    scopus 로고
    • Protecting cryptographic keys from memory disclosure attacks
    • K. Harrison and S. Xu. Protecting cryptographic keys from memory disclosure attacks. In DSN, 2007.
    • (2007) DSN
    • Harrison, K.1    Xu, S.2
  • 22
    • 52649148744 scopus 로고    scopus 로고
    • Self Optimizing Memory Controllers: A reinforcement learning approach
    • E. Ipek et al. Self Optimizing Memory Controllers: A reinforcement learning approach. In ISCA, 2008.
    • (2008) ISCA
    • Ipek, E.1
  • 23
    • 79959904195 scopus 로고    scopus 로고
    • Automatic CPU-GPU communication management and optimization
    • T. B. Jablin et al. Automatic CPU-GPU communication management and optimization. In PLDI, 2011.
    • (2011) PLDI
    • Jablin, T.B.1
  • 24
    • 0036286994 scopus 로고    scopus 로고
    • Avoiding Initialization Misses to the Heap
    • L. A. Jarrod et al. Avoiding Initialization Misses to the Heap. In ISCA, 2002.
    • (2002) ISCA
    • Jarrod, L.A.1
  • 25
    • 84892556032 scopus 로고    scopus 로고
    • JEDEC. Server memory roadmap. http://www.jedec.org/sites/default/files/ Ricki-Dee-Williams.pdf.
    • Server Memory Roadmap
  • 27
    • 84892547632 scopus 로고    scopus 로고
    • JEDEC. DDR3 SDRAM, JESD79-3F, 2012
    • JEDEC. DDR3 SDRAM, JESD79-3F, 2012.
  • 28
    • 70449658281 scopus 로고    scopus 로고
    • Architecture support for improving bulk memory copying and initialization performance
    • X. Jiang et al. Architecture support for improving bulk memory copying and initialization performance. In PACT, 2009.
    • (2009) PACT
    • Jiang, X.1
  • 29
    • 77952558442 scopus 로고    scopus 로고
    • ATLAS: A scalable and high-performance scheduling algorithm for multiple memory controllers
    • Y. Kim et al. ATLAS: A scalable and high-performance scheduling algorithm for multiple memory controllers. In HPCA, 2010.
    • (2010) HPCA
    • Kim, Y.1
  • 30
    • 79951718838 scopus 로고    scopus 로고
    • Thread Cluster Memory Scheduling: Exploiting diUerences in memory access behavior
    • Y. Kim et al. Thread Cluster Memory Scheduling: Exploiting diUerences in memory access behavior. In MICRO, 2010.
    • (2010) MICRO
    • Kim, Y.1
  • 31
    • 84864850807 scopus 로고    scopus 로고
    • A case for exploiting subarray-level parallelism (SALP) in DRAM
    • Y. Kim et al. A case for exploiting subarray-level parallelism (SALP) in DRAM. In ISCA, 2012.
    • (2012) ISCA
    • Kim, Y.1
  • 32
    • 84904306144 scopus 로고
    • EXECUBE - A new architecture for scaleable MPPs
    • P. M. Kogge. EXECUBE - A new architecture for scaleable MPPs. In ICPP, 1994.
    • (1994) ICPP
    • Kogge, P.M.1
  • 33
    • 70349084447 scopus 로고    scopus 로고
    • SnowFlock: Rapid virtual machine cloning for cloud computing
    • H. A. Lagar-Cavilla et al. SnowFlock: Rapid virtual machine cloning for cloud computing. In EuroSys, 2009.
    • (2009) EuroSys
    • Lagar-Cavilla, H.A.1
  • 34
    • 84880276949 scopus 로고    scopus 로고
    • Tiered-Latency DRAM: A low-latency and low-cost DRAM architecture
    • D. Lee et al. Tiered-Latency DRAM: A low-latency and low-cost DRAM architecture. In HPCA, 2013.
    • (2013) HPCA
    • Lee, D.1
  • 35
    • 84962144701 scopus 로고    scopus 로고
    • Balancing thoughput and fairness in SMT processors
    • Kun Luo et al. Balancing thoughput and fairness in SMT processors. In ISPASS, 2001.
    • (2001) ISPASS
    • Luo, K.1
  • 37
    • 84892510662 scopus 로고
    • DRAMs having on-chip row copy circuits for use in testing and video imaging and method for operating same
    • U.S. patent 5440517
    • D. M. Morgan and M. A. Shore. DRAMs having on-chip row copy circuits for use in testing and video imaging and method for operating same. U.S. patent 5440517, 1995.
    • (1995)
    • Morgan, D.M.1    Shore, M.A.2
  • 38
    • 84892532422 scopus 로고    scopus 로고
    • Semiconductor memory device including copy circuit
    • U.S. patent 5854771
    • Kaori Mori. Semiconductor memory device including copy circuit. U.S. patent 5854771, 1998.
    • (1998)
    • Mori, K.1
  • 39
    • 84858771269 scopus 로고    scopus 로고
    • Reducing memory interference in multi-core systems via application-aware memory channel partitioning
    • S. P. Muralidhara et al. Reducing memory interference in multi-core systems via application-aware memory channel partitioning. In MICRO, 2011.
    • (2011) MICRO
    • Muralidhara, S.P.1
  • 40
    • 33644903196 scopus 로고    scopus 로고
    • Efficient Runahead Execution: Power-efficient memory latency tolerance
    • O. Mutlu et al. Efficient Runahead Execution: Power-efficient memory latency tolerance. IEEE Micro, 26(1), 2006.
    • (2006) IEEE Micro , vol.26 , Issue.1
    • Mutlu, O.1
  • 41
    • 0002532551 scopus 로고
    • Why aren't operating systems getting faster as fast as hardware
    • J. K. Ousterhout. Why aren't operating systems getting faster as fast as hardware. In USENIX SC, 1990.
    • (1990) USENIX SC
    • Ousterhout, J.K.1
  • 42
    • 0031096193 scopus 로고    scopus 로고
    • A case for Intelligent RAM
    • D. Patterson et al. A case for Intelligent RAM. IEEE Micro, 17(2), 1997.
    • (1997) IEEE Micro , vol.17 , Issue.2
    • Patterson, D.1
  • 44
    • 84883540577 scopus 로고
    • The impact of architectural trends on operating system performance
    • M. Rosenblum et al. The impact of architectural trends on operating system performance. In SOSP, 1995.
    • (1995) SOSP
    • Rosenblum, M.1
  • 46
    • 84959429714 scopus 로고    scopus 로고
    • DRAM scaling and bandwidth challenges
    • G. Sandhu. DRAM scaling and bandwidth challenges. In WETI, 2012.
    • (2012) WETI
    • Sandhu, G.1
  • 49
    • 34547655822 scopus 로고    scopus 로고
    • Feedback Directed Prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers
    • S. Srinath et al. Feedback Directed Prefetching: Improving the performance and bandwidth-efficiency of hardware prefetchers. In HPCA, 2007.
    • (2007) HPCA
    • Srinath, S.1
  • 50
    • 79959917961 scopus 로고    scopus 로고
    • Flashback: A lightweight extension for rollback and deterministic replay for software debugging
    • S. M. Srinivasan et al. Flashback: A lightweight extension for rollback and deterministic replay for software debugging. In USENIX ATC, 2004.
    • (2004) USENIX ATC
    • Srinivasan, S.M.1
  • 51
    • 84870455720 scopus 로고    scopus 로고
    • Standard Performance Evaluation Corporation. SPEC CPU2006. http://www.spec.org/cpu2006.
    • SPEC CPU2006
  • 52
    • 84880311561 scopus 로고    scopus 로고
    • MISE: Providing performance predictability and improving fairness in shared main memory systems
    • L. Subramanian et al. MISE: Providing performance predictability and improving fairness in shared main memory systems. In HPCA, 2013.
    • (2013) HPCA
    • Subramanian, L.1
  • 53
    • 77952283542 scopus 로고    scopus 로고
    • Micro-pages: Increasing DRAM efficiency with locality-aware data placement
    • K. Sudan et al. Micro-pages: Increasing DRAM efficiency with locality-aware data placement. In ASPLOS, 2010.
    • (2010) ASPLOS
    • Sudan, K.1
  • 54
    • 77954989143 scopus 로고    scopus 로고
    • Rethinking DRAM design and organization for energy-constrained multi-cores
    • A. N. Udipi et al. Rethinking DRAM design and organization for energy-constrained multi-cores. In ISCA, 2010.
    • (2010) ISCA
    • Udipi, A.N.1
  • 55
    • 20344397073 scopus 로고    scopus 로고
    • Memory resource management in VMware ESX server
    • C. A. Waldspurger. Memory resource management in VMware ESX server. In OSDI, 2002.
    • (2002) OSDI
    • Waldspurger, C.A.1
  • 56
    • 49749122679 scopus 로고    scopus 로고
    • Improving power and data efficiency with threaded memory modules
    • F.A. Ware and C. Hampel. Improving power and data efficiency with threaded memory modules. In ICCD, 2006.
    • (2006) ICCD
    • Ware, F.A.1    Hampel, C.2
  • 57
    • 79955974508 scopus 로고    scopus 로고
    • Operating system support for application-specific speculation
    • B. Wester et al. Operating system support for application-specific speculation. In EuroSys, 2011.
    • (2011) EuroSys
    • Wester, B.1
  • 58
    • 81455131754 scopus 로고    scopus 로고
    • Why Nothing Matters: The impact of zeroing
    • X. Yang et al. Why Nothing Matters: The impact of zeroing. In OOPSLA, 2011.
    • (2011) OOPSLA
    • Yang, X.1
  • 59
    • 33748543231 scopus 로고    scopus 로고
    • Hardware support for bulk data movement in server platforms
    • L. Zhao et al. Hardware support for bulk data movement in server platforms. In ICCD, 2005.
    • (2005) ICCD
    • Zhao, L.1
  • 60
    • 66749162556 scopus 로고    scopus 로고
    • Mini-rank: Adaptive DRAM architecture for improving memory power efficiency
    • H. Zheng et al. Mini-rank: Adaptive DRAM architecture for improving memory power efficiency. In MICRO, 2008.
    • (2008) MICRO
    • Zheng, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.