SCOPUS 정보 검색 플랫폼

Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

Volumn , Issue , 2010, Pages 319-330

Handling the problems and opportunities posed by multiple on-chip memory controllers

(5) Awasthi, Manu a Nellans, David W a Sudan, Kshitij a Balasubramonian, Rajeev a Davis, Al a

a Department of Electrical and Computer Engineering (United States)

Author keywords

data placement; DRAM management; memory controller design

Indexed keywords

CONTROLLERS; DYNAMIC RANDOM ACCESS STORAGE; DYNAMICS; INTEGRATED CIRCUIT INTERCONNECTS; PARALLEL ARCHITECTURES;

ALLOCATION STRATEGY; DATA PLACEMENT; DYNAMIC PAGE MIGRATION; INTERCONNECT FABRICS; MEMORY ACCESS LATENCY; MEMORY ADDRESS SPACE; MEMORY CONTROLLER; NON UNIFORM MEMORY ACCESS;

MEMORY ARCHITECTURE;

EID: 78149234452 PISSN: 1089795X EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1854273.1854314 Document Type: Conference Paper

Times cited : (101)

References (60)

1
- 78149244874
- Homepage
- Perfmon2 Project Homepage. http://perfmon2.sourceforge.net/.

2
- 84856296442
- Performance of the AMD Opteron LS21 for IBM BladeCenter. ftp://ftp.software.ibm.com/eserver/benchmarks/wp-ls21-081506.pdf.
- Performance of the AMD Opteron LS21 for IBM BladeCenter

3
- 77949708681
- Intel 845G/845GL/845GV Chipset Datasheet: Intel 82845G/82845GL/82845GV Graphics and Memory Controller Hub (GMCH). http://download.intel.com/design/ chipsets/datashts/29074602.pdf, 2002.
- (2002) Intel 845G/845GL/845GV Chipset Datasheet: Intel 82845G/82845GL/82845GV Graphics and Memory Controller Hub (GMCH)

4
- 78149247170
- Micron DDR3 SDRAM Part MT41J512M4. http://download.micron.com/pdf/ datasheets/dram/ddr3/2Gb-DDR3-SDRAM.pdf, 2006.
- (2006) Micron DDR3 SDRAM Part MT41J512M4

5
- 70450285523
- Achieving Predictable Performance through Better Memory Controller in Many-Core CMPs
- D. Abts, N. Jerger, J. Kim, D. Gibson, and M. Lipasti. Achieving Predictable Performance through Better Memory Controller in Many-Core CMPs. In Proceedings of ISCA, 2009.
- Proceedings of ISCA, 2009
- Abts, D.¹ Jerger, N.² Kim, J.³ Gibson, D.⁴ Lipasti, M.⁵

6
- 64949140362
- Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches
- M. Awasthi, K. Sudan, R. Balasubramonian, and J. Carter. Dynamic Hardware-Assisted Software-Controlled Page Placement to Manage Capacity Allocation and Sharing within Large Caches. In Proceedings of HPCA, 2009.
- Proceedings of HPCA, 2009
- Awasthi, M.¹ Sudan, K.² Balasubramonian, R.³ Carter, J.⁴

7
- 51549095074
- Technical report, Department of Computer Science, Princeton University
- C. Benia et al. The PARSEC Benchmark Suite: Characterization and Architectural Implications. Technical report, Department of Computer Science, Princeton University, 2008.
- (2008) The PARSEC Benchmark Suite: Characterization and Architectural Implications
- Benia, C.¹

8
- 77949709857
- Avoiding Conict Misses Dynamically in Large Direct-Mapped Caches
- B. Bershad, B. Chen, D. Lee, and T. Romer. Avoiding Conict Misses Dynamically in Large Direct-Mapped Caches. In Proceedings of ASPLOS, 1994.
- Proceedings of ASPLOS, 1994
- Bershad, B.¹ Chen, B.² Lee, D.³ Romer, T.⁴

9
- 84989342078
- Scheduling and Page Migration for Multiprocessor Compute Servers
- R. Chandra, S. Devine, B. Verghese, A. Gupta, and M. Rosenblum. Scheduling and Page Migration for Multiprocessor Compute Servers. In Proceedings of ASPLOS, 1994.
- Proceedings of ASPLOS, 1994
- Chandra, R.¹ Devine, S.² Verghese, B.³ Gupta, A.⁴ Rosenblum, M.⁵

10
- 33845903561
- Co-Operative Caching for Chip Multiprocessors
- J. Chang and G. Sohi. Co-Operative Caching for Chip Multiprocessors. In Proceedings of ISCA, 2006.
- Proceedings of ISCA, 2006
- Chang, J.¹ Sohi, G.²

11
- 64949190009
- PageNUCA: Selected Policies for Page-Grain Locality Management in Large Shared Chip-Multiprocessor Caches
- M. Chaudhuri. PageNUCA: Selected Policies For Page-Grain Locality Management In Large Shared Chip-Multiprocessor Caches. In Proceedings of HPCA, 2009.
- Proceedings of HPCA, 2009
- Chaudhuri, M.¹

12
- 27544432313
- Optimizing Replication, Communication, and Capacity Allocation in CMPs
- Z. Chishti, M. Powell, and T. Vijaykumar. Optimizing Replication, Communication, and Capacity Allocation in CMPs. In Proceedings of ISCA-32, June 2005.
- Proceedings of ISCA-32, June 2005
- Chishti, Z.¹ Powell, M.² Vijaykumar, T.³

13
- 40349095122
- Managing Distributed, Shared L2 Caches through OS-Level Page Allocation
- S. Cho and L. Jin. Managing Distributed, Shared L2 Caches through OS-Level Page Allocation. In Proceedings of MICRO, 2006.
- Proceedings of MICRO, 2006
- Cho, S.¹ Jin, L.²

14
- 3543054682
- Page Migration with Dynamic Space-Sharing Scheduling Policies: The case of SGI 02000
- J. Corbalan, X. Martorell, and J. Labarta. Page Migration with Dynamic Space-Sharing Scheduling Policies: The case of SGI 02000. International Journal of Parallel Programming, 32(4), 2004.
- (2004) International Journal of Parallel Programming , vol.32 , Issue.4
- Corbalan, J.¹ Martorell, X.² Labarta, J.³

15
- 0034856730
- Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance
- V. Cuppu and B. Jacob. Concurrency, Latency, or System Overhead: Which Has the Largest Impact on Uniprocessor DRAM-System Performance. In Proceedings of ISCA, 2001.
- Proceedings of ISCA, 2001
- Cuppu, V.¹ Jacob, B.²

16
- 0032687058
- A Performance Comparison of Contemporary DRAM Architectures
- V. Cuppu, B. Jacob, B. Davis, and T. Mudge. A Performance Comparison of Contemporary DRAM Architectures. In Proceedings of ISCA, 1999.
- Proceedings of ISCA, 1999
- Cuppu, V.¹ Jacob, B.² Davis, B.³ Mudge, T.⁴

17
- 78149254950
- W. Dally. Report from Workshop on On- and Off-Chip Interconnection Networks for Multicore Systems (OCIN), 2006. http://www.ece.ucdavis.edu/~ocin06/ .
- Report from Workshop on On- and Off-Chip Interconnection Networks for Multicore Systems (OCIN), 2006
- Dally, W.¹

18
- 33750829443
- MESA: Reducing Cache Conicts by Integrating Static and Run-Time Methods
- X. Ding, D. S. Nikopoulosi, S. Jiang, and X. Zhang. MESA: Reducing Cache Conicts by Integrating Static and Run-Time Methods. In Proceedings of ISPASS, 2006.
- Proceedings of ISPASS, 2006
- Ding, X.¹ Nikopoulosi, D.S.² Jiang, S.³ Zhang, X.⁴

19
- 34547670591
- An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors
- H. Dybdahl and P. Stenstrom. An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors. In Proceedings of HPCA, 2007.
- Proceedings of HPCA, 2007
- Dybdahl, H.¹ Stenstrom, P.²

20
- 0034875742
- Memory Controller Policies for DRAM Power Management
- X. Fan, H. Zeng, and C. Ellis. Memory Controller Policies for DRAM Power Management. In Proceedings of ISLPED, 2001.
- Proceedings of ISLPED, 2001
- Fan, X.¹ Zeng, H.² Ellis, C.³

21
- 70350601187
- Reactive NUCA: Near-Optimal Block Placement and Replication in Distributed Caches
- N. Hardavellas, M. Ferdman, B. Falsafi, and A. Ailamaki. Reactive NUCA: Near-Optimal Block Placement And Replication In Distributed Caches. In Proceedings of ISCA, 2009.
- Proceedings of ISCA, 2009
- Hardavellas, N.¹ Ferdman, M.² Falsafi, B.³ Ailamaki, A.⁴

22
- 52649148744
- Self Optimizing Memory Controllers: A Reinforcement Learning Approach
- E. Ipek, O. Mutlu, J. Martinez, and R. Caruana. Self Optimizing Memory Controllers: A Reinforcement Learning Approach. In Proceedings of ISCA, 2008.
- Proceedings of ISCA, 2008
- Ipek, E.¹ Mutlu, O.² Martinez, J.³ Caruana, R.⁴

23
- 0344362751
- ITRS. Edition
- ITRS. International Technology Roadmap for Semiconductors, 2007 Edition.
- (2007) International Technology Roadmap for Semiconductors

24
- 84976736383
- Page Placement Algorithms for Large Real-Indexed Caches
- R. E. Kessler and M. D. Hill. Page Placement Algorithms for Large Real-Indexed Caches. ACM Trans. Comput. Syst., 10(4), 1992.
- (1992) ACM Trans. Comput. Syst. , vol.10 , Issue.4
- Kessler, R.E.¹ Hill, M.D.²

25
- 40349103382
- An Adaptive, Non-Uniform Cache Structure for Wire-Dominated On-Chip Caches
- C. Kim, D. Burger, and S. Keckler. An Adaptive, Non-Uniform Cache Structure for Wire-Dominated On-Chip Caches. In Proceedings of ASPLOS, 2002.
- Proceedings of ASPLOS, 2002
- Kim, C.¹ Burger, D.² Keckler, S.³

26
- 77952558442
- ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers
- Y. Kim, D. Han, O. Mutlu, and M. Harchol-Balter. ATLAS: A Scalable and High-Performance Scheduling Algorithm for Multiple Memory Controllers. In Proceedings of HPCA, 2010.
- Proceedings of HPCA, 2010
- Kim, Y.¹ Han, D.² Mutlu, O.³ Harchol-Balter, M.⁴

27
- 64949143288
- Technical report
- R. LaRowe and C. Ellis. Experimental Comparison of Memory Management Policies for NUMA Multiprocessors. Technical report, 1990.
- (1990) Experimental Comparison of Memory Management Policies for NUMA Multiprocessors
- LaRowe, R.¹ Ellis, C.²

28
- 0026107998
- Page Placement policies for NUMA multiprocessors
- R. LaRowe and C. Ellis. Page Placement policies for NUMA multiprocessors. J. Parallel Distrib. Comput., 11(2), 1991.
- (1991) J. Parallel Distrib. Comput. , vol.11 , Issue.2
- LaRowe, R.¹ Ellis, C.²

29
- 78149251349
- Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor
- R. LaRowe, J. Wilkes, and C. Ellis. Exploiting Operating System Support for Dynamic Page Placement on a NUMA Shared Memory Multiprocessor. In Proceedings of PPOPP, 1991.
- Proceedings of PPOPP, 1991
- LaRowe, R.¹ Wilkes, J.² Ellis, C.³

30
- 0034442261
- Power Aware Page Allocation
- A. Lebeck, X. Fan, H. Zeng, and C. Ellis. Power Aware Page Allocation. In Proceedings of ASPLOS, 2000.
- Proceedings of ASPLOS, 2000
- Lebeck, A.¹ Fan, X.² Zeng, H.³ Ellis, C.⁴

31
- 66749189125
- Prefetch-Aware DRAM Controllers
- C. Lee, O. Mutlu, V. Narasiman, and Y. Patt. Prefetch-Aware DRAM Controllers. In Proceedings of MICRO, 2008.
- Proceedings of MICRO, 2008
- Lee, C.¹ Mutlu, O.² Narasiman, V.³ Patt, Y.⁴

32
- 57749186047
- Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems
- J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining Insights into Multicore Cache Partitioning: Bridging the Gap between Simulation and Real Systems. In Proceedings of HPCA, 2008.
- Proceedings of HPCA, 2008
- Lin, J.¹ Lu, Q.² Ding, X.³ Zhang, Z.⁴ Zhang, X.⁵ Sadayappan, P.⁶

33
- 0035510681
- Designing a Modern Memory Hierarchy with Hardware Prefetching
- W. Lin, S. Reinhardt, and D. Burger. Designing a Modern Memory Hierarchy with Hardware Prefetching. In Proceedings of IEEE Transactions on Computers, 2001.
- Proceedings of IEEE Transactions on Computers, 2001
- Lin, W.¹ Reinhardt, S.² Burger, D.³

34
- 52649125840
- 3D-Stacked Memory Architectures for Multi-Core Processors
- G. Loh. 3D-Stacked Memory Architectures for Multi-Core Processors. In Proceedings of ISCA, 2008.
- Proceedings of ISCA, 2008
- Loh, G.¹

35
- 0036469676
- Simics: A Full System Simulation Platform
- February
- P. Magnusson, M. Christensson, J. Eskilson, D. Forsgren, G. Hallberg, J. Hogberg, F. Larsson, A. Moestedt, and B. Werner. Simics: A Full System Simulation Platform. IEEE Computer, 35(2):50-58, February 2002.
- (2002) IEEE Computer , vol.35 , Issue.2 , pp. 50-58
- Magnusson, P.¹ Christensson, M.² Eskilson, J.³ Forsgren, D.⁴ Hallberg, G.⁵ Hogberg, J.⁶ Larsson, F.⁷ Moestedt, A.⁸ Werner, B.⁹

36
- 77952562600
- Memphis: Finding and fixing numa-related performance problems on multi-core platforms
- C. McCurdy and J. Vetter. Memphis: Finding and fixing numa-related performance problems on multi-core platforms. In Proceedings of ISPASS, 2010.
- Proceedings of ISPASS, 2010
- McCurdy, C.¹ Vetter, J.²

37
- 78149256447
- Micron Technology Inc.
- Micron Technology Inc. Micron DDR2 SDRAM Part MT47H128M8HQ-25, 2007.
- (2007) Micron DDR2 SDRAM Part MT47H128M8HQ-25

38
- 0035511103
- Improving Performance of Large Physically Indexed Caches by Decoupling Memory Addresses from Cache Addresses
- R. Min and Y. Hu. Improving Performance of Large Physically Indexed Caches by Decoupling Memory Addresses from Cache Addresses. IEEE Trans. Comput., 50(11), 2001.
- (2001) IEEE Trans. Comput. , vol.50 , Issue.11
- Min, R.¹ Hu, Y.²

39
- 47349084021
- Optimizing NUCA Organizations and Wiring Alternatives for Large Caches with CACTI 6.0
- N. Muralimanohar, R. Balasubramonian, and N. Jouppi. Optimizing NUCA Organizations and Wiring Alternatives for Large Caches with CACTI 6.0. In Proceedings of MICRO, 2007.
- Proceedings of MICRO, 2007
- Muralimanohar, N.¹ Balasubramonian, R.² Jouppi, N.³

40
- 47349122373
- Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors
- O. Mutlu and T. Moscibroda. Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors. In Proceedings of MICRO, 2007.
- Proceedings of MICRO, 2007
- Mutlu, O.¹ Moscibroda, T.²

41
- 52649119398
- Parallelism-Aware Batch Scheduling: Enhancing Both Performance and Fairness of Shared DRAM Systems
- O. Mutlu and T. Moscibroda. Parallelism-Aware Batch Scheduling: Enhancing Both Performance and Fairness of Shared DRAM Systems. In Proceedings of ISCA, 2008.
- Proceedings of ISCA, 2008
- Mutlu, O.¹ Moscibroda, T.²

42
- 12844249966
- Heat-and-Run: Leveraging SMT and CMP to Manage Power Density Through the Operating System
- M. Powell, M. Gomaa, and T. Vijaykumar. Heat-and-Run: Leveraging SMT and CMP to Manage Power Density Through the Operating System. In Proceedings of ASPLOS, 2004.
- Proceedings of ASPLOS, 2004
- Powell, M.¹ Gomaa, M.² Vijaykumar, T.³

43
- 64949187933
- Adaptive Spill-Receive for Robust High-Performance Caching in CMPs
- M. K. Qureshi. Adaptive Spill-Receive for Robust High-Performance Caching in CMPs. In Proceedings of HPCA, 2009.
- Proceedings of HPCA, 2009
- Qureshi, M.K.¹

44
- 34247108325
- Architectural Support for Operating System Driven CMP Cache Management
- N. Rafique, W. Lim, and M. Thottethodi. Architectural Support for Operating System Driven CMP Cache Management. In Proceedings of PACT, 2006.
- Proceedings of PACT, 2006
- Rafique, N.¹ Lim, W.² Thottethodi, M.³

45
- 0033691565
- Memory Access Scheduling
- S. Rixner, W. Dally, U. Kapasi, P. Mattson, and J. Owens. Memory Access Scheduling. In Proceedings of ISCA, 2000.
- Proceedings of ISCA, 2000
- Rixner, S.¹ Dally, W.² Kapasi, U.³ Mattson, P.⁴ Owens, J.⁵

46
- 70349934427
- V. Romanchenko. Quad-Core Opteron: Architecture and Roadmaps. http://www.digital-daily.com/cpu/quad core opteron.
- Quad-Core Opteron: Architecture and Roadmaps
- Romanchenko, V.¹

47
- 0032644674
- Reducing Cache Misses Using Hardware and Software Page Placement
- T. Sherwood, B. Calder, and J. Emer. Reducing Cache Misses Using Hardware and Software Page Placement. In Proceedings of SC, 1999.
- Proceedings of SC, 1999
- Sherwood, T.¹ Calder, B.² Emer, J.³

48
- 0042455211
- Symbiotic Jobscheduling with Priorities for a Simultaneous Multithreading Processor
- A. Snavely, D. Tullsen, and G. Voelker. Symbiotic Jobscheduling with Priorities for a Simultaneous Multithreading Processor. In Proceedings of SIGMETRICS, 2002.
- Proceedings of SIGMETRICS, 2002
- Snavely, A.¹ Tullsen, D.² Voelker, G.³

49
- 27544498313
- Adaptive Mechanisms and Policies for Managing Cache Hierarchies in Chip Multiprocessors
- E. Speight, H. Shafi, L. Zhang, and R. Rajamony. Adaptive Mechanisms and Policies for Managing Cache Hierarchies in Chip Multiprocessors. In Proceedings of ISCA, 2005.
- Proceedings of ISCA, 2005
- Speight, E.¹ Shafi, H.² Zhang, L.³ Rajamony, R.⁴

50
- 77954999878
- R. Swinburne. Intel Core i7 - Nehalem Architecture Dive. http://www.bit-tech.net/hardware/2008/11/03/intel-core-i7-nehalem-architecture- dive/.
- Intel Core i7 - Nehalem Architecture Dive
- Swinburne, R.¹

51
- 52649100126
- Corona: System Implications of Emerging Nanophotonic Technology
- D. Vantrease et al. Corona: System Implications of Emerging Nanophotonic Technology. In Proceedings of ISCA, 2008.
- Proceedings of ISCA, 2008
- Vantrease, D.¹

52
- 17044405973
- Operating system support for improving data locality on CC-NUMA compute servers
- B. Verghese, S. Devine, A. Gupta, and M. Rosenblum. Operating system support for improving data locality on CC-NUMA compute servers. SIGPLAN Not., 31(9), 1996.
- (1996) SIGPLAN Not. , vol.31 , Issue.9
- Verghese, B.¹ Devine, S.² Gupta, A.³ Rosenblum, M.⁴

53
- 34547490573
- VASA: A Simulator Infrastructure with Adjustable Fidelity
- D. Wallin, H. Zeffer, M. Karlsson, and E. Hagersten. VASA: A Simulator Infrastructure with Adjustable Fidelity. In Proceedings of IASTED International Conference on Parallel and Distributed Computing and Systems, 2005.
- Proceedings of IASTED International Conference on Parallel and Distributed Computing and Systems, 2005
- Wallin, D.¹ Zeffer, H.² Karlsson, M.³ Hagersten, E.⁴

54
- 35348861182
- DRAMsim: A Memory-System Simulator
- D. Wang et al. DRAMsim: A Memory-System Simulator. In SIGARCH Computer Architecture News, September 2005.
- SIGARCH Computer Architecture News, September 2005
- Wang, D.¹

55
- 36849030305
- On-Chip Interconnection Architecture of the Tile Processor
- D. Wentzlaff et al. On-Chip Interconnection Architecture of the Tile Processor. In IEEE Micro, volume 22, 2007.
- (2007) IEEE Micro , vol.22
- Wentzlaff, D.¹

56
- 27544495466
- Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors
- M. Zhang and K. Asanovic. Victim Replication: Maximizing Capacity while Hiding Wire Delay in Tiled Chip Multiprocessors. In Proceedings of ISCA, 2005.
- Proceedings of ISCA, 2005
- Zhang, M.¹ Asanovic, K.²

57
- 0034460897
- A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conicts and Exploit Data Locality
- Z. Zhang, Z. Zhu, and X. Zhand. A Permutation-Based Page Interleaving Scheme to Reduce Row-Buffer Conicts and Exploit Data Locality. In Proceedings of MICRO, 2000.
- Proceedings of MICRO, 2000
- Zhang, Z.¹ Zhu, Z.² Zhand, X.³

58
- 66749162556
- Mini-Rank: Adaptive DRAM Architecture for Improving Memory Power E ciency
- H. Zheng et al. Mini-Rank: Adaptive DRAM Architecture For Improving Memory Power E ciency. In Proceedings of MICRO, 2008.
- Proceedings of MICRO, 2008
- Zheng, H.¹

59
- 55949114476
- Thermal Management for 3D Processor via Task Scheduling
- X. Zhou, Y. Xu, Y. Du, Y. Zhang, and J. Yang. Thermal Management for 3D Processor via Task Scheduling. In Proceedings of ICPP, 2008.
- Proceedings of ICPP, 2008
- Zhou, X.¹ Xu, Y.² Du, Y.³ Zhang, Y.⁴ Yang, J.⁵

60
- 28444470842
- A Performance Comparison of DRAM Memory System Optimizations for SMT Processors
- Z. Zhu and Z. Zhang. A Performance Comparison of DRAM Memory System Optimizations for SMT Processors. In Proceedings of HPCA, 2005
- Proceedings of HPCA, 2005
- Zhu, Z.¹ Zhang, Z.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.