SCOPUS 정보 검색 플랫폼

Proceedings - International Symposium on High-Performance Computer Architecture

Volumn , Issue , 2010, Pages

ATLAS: A scalable and high-performance scheduling algorithm for multiple memory controllers

(4) Kim, Yoongu a Han, Dongsu a Mutlu, Onur a Harchol Balter, Mor a

a CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CHIP MULTIPROCESSORS; CONTROL ACCESS; CORE SYSTEMS; EFFICIENT SCHEDULING; HAZARD RATES; MAIN MEMORY; MEMORY CONTROLLER; MEMORY SCHEDULING; PERFORMANCE BENEFITS; PRIORITIZATION; SINGLE SERVER QUEUE; SYSTEM THROUGHPUT; WORK-LOAD DISTRIBUTION;

ACCESS CONTROL; ADAPTIVE CONTROL SYSTEMS; COMPUTER ARCHITECTURE; COMPUTERS; CONTROLLERS; GAME THEORY; MULTIPROGRAMMING; QUEUEING THEORY; THROUGHPUT;

SCHEDULING ALGORITHMS;

EID: 77952558442 PISSN: 15300897 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (363)

References (59)

1
- 70450285523
- Achieving predictable performance through better memory controller placement in many-core CMPs
- D. Abts, N. D. Enright Jerger, J. Kim, D. Gibson, and M. H. Lipasti. Achieving predictable performance through better memory controller placement in many-core CMPs. In ISCA-36, 2009.
- (2009) ISCA-36
- Abts, D.¹ Enright Jerger, N.D.² Kim, J.³ Gibson, D.⁴ Lipasti, M.H.⁵

2
- 77952571684
- Advanced Micro Devices. AMD's six-core Opteron processors. http://techreport.com/articles.x/17005, 2009.
- (2009) Advanced Micro Devices. AMD's Six-core Opteron Processors

3
- 57749201826
- Power-efficient DRAM speculation
- N. Aggarwal, J. F. Cantin, M. H. Lipasti, and J. E. Smith. Power-efficient DRAM speculation. In HPCA-14, 2008.
- (2008) HPCA-14
- Aggarwal, N.¹ Cantin, J.F.² Lipasti, M.H.³ Smith, J.E.⁴

4
- 33745956039
- Framework for instruction-level tracing and analysis of programs
- S. Bhansali, W.-K. Chen, S. de Jong, A. Edwards, R. Murray, M. Drinić, D. Mihočka, and J. Chau. Framework for instruction-level tracing and analysis of programs. In VEE, 2006.
- (2006) VEE
- Bhansali, S.¹ Chen, W.-K.² De Jong, S.³ Edwards, A.⁴ Murray, R.⁵ Drinić, M.⁶ Mihočka, D.⁷ Chau, J.⁸

5
- 44549088807
- Scheduling in practice
- March
- E. W. Biersack, B. Schroeder, and G. Urvoy-Keller. Scheduling in practice. Performance Evaluation Review, Special Issue on "New Perspectives in Scheduling", 34(4), March 2007.
- (2007) Performance Evaluation Review, Special Issue on "New Perspectives in Scheduling" , vol.34 , Issue.4
- Biersack, E.W.¹ Schroeder, B.² Urvoy-Keller, G.³

6
- 0036504582
- Intel 870: A building block for cost-effective, scalable servers
- F. Briggs, M. Cekleov, K. Creta, M. Khare, S. Kulick, A. Kumar, L. P. Looi, C. Natarajan, S. Radhakrishnan, and L. Rankin. Intel 870: A building block for cost-effective, scalable servers. IEEE Micro, 22(2):36-47, 2002.
- (2002) IEEE Micro , vol.22 , Issue.2 , pp. 36-47
- Briggs, F.¹ Cekleov, M.² Creta, K.³ Khare, M.⁴ Kulick, S.⁵ Kumar, A.⁶ Looi, L.P.⁷ Natarajan, C.⁸ Radhakrishnan, S.⁹ Rankin, L.¹⁰

7
- 77952561300
- Micro-architecture techniques in the Intel E8870 scalable memory controller
- F. Briggs, S. Chittor, and K. Cheng. Micro-architecture techniques in the Intel E8870 scalable memory controller. In WMPI-3, 2004.
- (2004) WMPI-3
- Briggs, F.¹ Chittor, S.² Cheng, K.³

8
- 34548238648
- The AMD Opteron northbridge architecture
- P. Conway and B. Hughes. The AMD Opteron northbridge architecture. IEEE Micro, 27(2):10-21, 2007.
- (2007) IEEE Micro , vol.27 , Issue.2 , pp. 10-21
- Conway, P.¹ Hughes, B.²

9
- 0031383380
- Self-similarity in World Wide Web traffic: Evidence and possible causes
- M. E. Crovella and A. Bestavros. Self-similarity in World Wide Web traffic: Evidence and possible causes. IEEE/ACM TON, 5(6):835-846, 1997.
- (1997) IEEE/ACM TON , vol.5 , Issue.6 , pp. 835-846
- Crovella, M.E.¹ Bestavros, A.²

10
- 0001939946
- Heavy-tailed probability distributions in the world wide web
- chapter 1, Chapman & Hall, New York
- M. E. Crovella, M. S. Taqqu, and A. Bestavros. Heavy-tailed probability distributions in the world wide web. In A Practical Guide To Heavy Tails, chapter 1, pages 1-23. Chapman & Hall, New York, 1998.
- (1998) A Practical Guide to Heavy Tails , pp. 1-23
- Crovella, M.E.¹ Taqqu, M.S.² Bestavros, A.³

11
- 0024889726
- Analysis and simulation of a fair queueing algorithm
- A. Demers, S. Keshav, and S. Shenker. Analysis and simulation of a fair queueing algorithm. In SIGCOMM, 1989.
- (1989) SIGCOMM
- Demers, A.¹ Keshav, S.² Shenker, S.³

12
- 35348903171
- Limiting the power consumption of main memory
- B. Diniz, D. O. G. Neto, W. Meira Jr., and R. Bianchini. Limiting the power consumption of main memory. In ISCA-34, 2007.
- (2007) ISCA-34
- Diniz, B.¹ Neto, D.O.G.² Meira Jr., W.³ Bianchini, R.⁴

13
- 47249094055
- System-level performance metrics for multiprogram workloads
- S. Eyerman and L. Eeckhout. System-level performance metrics for multiprogram workloads. IEEE Micro, 28(3):42-53, 2008.
- (2008) IEEE Micro , vol.28 , Issue.3 , pp. 42-53
- Eyerman, S.¹ Eeckhout, L.²

14
- 0022242041
- XOR-Schemes: A flexible data organization in parallel memories
- J. M. Frailong, W. Jalby, and J. Lenfant. XOR-Schemes: A flexible data organization in parallel memories. In ICPP, 1985.
- (1985) ICPP
- Frailong, J.M.¹ Jalby, W.² Lenfant, J.³

15
- 0037885374
- Task assignment with unknown duration
- March
- M. Harchol-Balter. Task assignment with unknown duration. J. ACM, 49(2):260-288, March 2002.
- (2002) J. ACM , vol.49 , Issue.2 , pp. 260-288
- Harchol-Balter, M.¹

16
- 85085698525
- Exploiting process lifetime distributions for dynamic load balancing
- M. Harchol-Balter and A. Downey. Exploiting process lifetime distributions for dynamic load balancing. In SIGMETRICS, 1996.
- (1996) SIGMETRICS
- Harchol-Balter, M.¹ Downey, A.²

17
- 0032785291
- Access order and effective bandwidth for streams on a direct rambus memory
- S. I. Hong, S. A. McKee, M. H. Salinas, R. H. Klenke, J. H. Aylor, and W. A. Wulf. Access order and effective bandwidth for streams on a direct rambus memory. In HPCA-5, 1999.
- (1999) HPCA-5
- Hong, S.I.¹ McKee, S.A.² Salinas, M.H.³ Klenke, R.H.⁴ Aylor, J.H.⁵ Wulf, W.A.⁶

18
- 21644455082
- Adaptive history-based memory schedulers
- I. Hur and C. Lin. Adaptive history-based memory schedulers. In MICRO-37, 2004.
- (2004) MICRO-37
- Hur, I.¹ Lin, C.²

19
- 57749175984
- A comprehensive approach to DRAM power management
- I. Hur and C. Lin. A comprehensive approach to DRAM power management. In HPCA-14, 2008.
- (2008) HPCA-14
- Hur, I.¹ Lin, C.²

20
- 84888279461
- IBM. PowerXCell 8i Processor. http://www.ibm.com/technology/resources/ technology-cell-pdf-PowerXCell-PB-7May2008-pub.pdf.
- PowerXCell 8i Processor

21
- 84872067428
- Intel. Intel Core i7 Processor. http://www.intel.com/products/processor/ corei7/specifications.htm.
- Intel Core i7 Processor

22
- 52649148744
- Self-optimizing memory controllers: A reinforcement learning approach
- E. Ipek, O. Mutlu, J. F. Martínez, and R. Caruana. Self-optimizing memory controllers: A reinforcement learning approach. In ISCA-35, 2008.
- (2008) ISCA-35
- Ipek, E.¹ Mutlu, O.² Martínez, J.F.³ Caruana, R.⁴

23
- 0003487810
- Available at September
- G. Irlam. Unix file size survey - 1993. Available at http:-//www.base. com/gordoni/ufs93.html, September 1994.
- (1994) Unix File Size Survey - 1993
- Irlam, G.¹

24
- 65549093343
- Update
- ITRS. International Technology Roadmap for Semiconductors, 2008 Update. http://www.itrs.net/Links/2008ITRS/Update/2008Tables FOCUS B.xls.
- (2008) International Technology Roadmap for Semiconductors

25
- 4644294548
- A day in the life of a data cache miss
- T. Karkhanis and J. E. Smith. A day in the life of a data cache miss. In WMPI-2, 2002.
- (2002) WMPI-2
- Karkhanis, T.¹ Smith, J.E.²

26
- 0034442261
- Power aware page allocation
- A. R. Lebeck, X. Fan, H. Zeng, and C. S. Ellis. Power aware page allocation. In ASPLOS-IX, 2000.
- (2000) ASPLOS-IX
- Lebeck, A.R.¹ Fan, X.² Zeng, H.³ Ellis, C.S.⁴

27
- 66749189125
- Prefetch-aware DRAM controllers
- C. J. Lee, O. Mutlu, V. Narasiman, and Y. N. Patt. Prefetch-aware DRAM controllers. In MICRO-41, 2008.
- (2008) MICRO-41
- Lee, C.J.¹ Mutlu, O.² Narasiman, V.³ Patt, Y.N.⁴

28
- 31944440969
- Pin: Building customized program analysis tools with dynamic instrumentation
- C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: Building customized program analysis tools with dynamic instrumentation. In PLDI, 2005.
- (2005) PLDI
- Luk, C.-K.¹ Cohn, R.² Muth, R.³ Patil, H.⁴ Klauser, A.⁵ Lowney, G.⁶ Wallace, S.⁷ Reddi, V.J.⁸ Hazelwood, K.⁹

29
- 84962144701
- Balancing thoughput and fairness in SMT processors
- K. Luo, J. Gummaraju, and M. Franklin. Balancing thoughput and fairness in SMT processors. In ISPASS, 2001.
- (2001) ISPASS
- Luo, K.¹ Gummaraju, J.² Franklin, M.³

30
- 0034314462
- Dynamic access ordering for streamed computations
- Nov.
- S. A. McKee, W. A. Wulf, J. H. Aylor, M. H. Salinas, R. H. Klenke, S. I. Hong, and D. A. B. Weikle. Dynamic access ordering for streamed computations. IEEE TC, 49(11):1255-1271, Nov. 2000.
- (2000) IEEE TC , vol.49 , Issue.11 , pp. 1255-1271
- McKee, S.A.¹ Wulf, W.A.² Aylor, J.H.³ Salinas, M.H.⁴ Klenke, R.H.⁵ Hong, S.I.⁶ Weikle, D.A.B.⁷

31
- 77952562767
- May
- Micron. 1Gb DDR2 SDRAM Component: MT47H128M8HQ-25. May 2007. http://download.micron.com/pdf/datasheets/dram/ddr2/1GbDDR2.pdf.
- (2007) 1Gb DDR2 SDRAM Component: MT47H128M8HQ-25

32
- 52649128991
- Memory performance attacks: Denial of memory service in multi-core systems
- T. Moscibroda and O. Mutlu. Memory performance attacks: Denial of memory service in multi-core systems. In USENIX SECURITY, 2007.
- (2007) USENIX Security
- Moscibroda, T.¹ Mutlu, O.²

33
- 57549112769
- Distributed order scheduling and its application to multi-core DRAM controllers
- T. Moscibroda and O. Mutlu. Distributed order scheduling and its application to multi-core DRAM controllers. In PODC, 2008.
- (2008) PODC
- Moscibroda, T.¹ Mutlu, O.²

34
- 33644903196
- Efficient runahead execution: Power-efficient memory latency tolerance
- O. Mutlu, H. Kim, and Y. N. Patt. Efficient runahead execution: Power-efficient memory latency tolerance. IEEE Micro, 26(1):10-20, 2006.
- (2006) IEEE Micro , vol.26 , Issue.1 , pp. 10-20
- Mutlu, O.¹ Kim, H.² Patt, Y.N.³

35
- 47349122373
- Stall-time fair memory access scheduling for chip multiprocessors
- O. Mutlu and T. Moscibroda. Stall-time fair memory access scheduling for chip multiprocessors. In MICRO-40, 2007.
- (2007) MICRO-40
- Mutlu, O.¹ Moscibroda, T.²

36
- 52649119398
- Parallelism-aware batch scheduling: Enhancing both performance and fairness of shared DRAM systems
- O. Mutlu and T. Moscibroda. Parallelism-aware batch scheduling: Enhancing both performance and fairness of shared DRAM systems. In ISCA-36, 2008.
- (2008) ISCA-36
- Mutlu, O.¹ Moscibroda, T.²

37
- 47349089021
- A study of performance impact of memory controller features in multi-processor server environment
- C. Natarajan, B. Christenson, and F. Briggs. A study of performance impact of memory controller features in multi-processor server environment. In WMPI-3, 2004.
- (2004) WMPI-3
- Natarajan, C.¹ Christenson, B.² Briggs, F.³

38
- 34548050337
- Fair queuing memory systems
- K. J. Nesbit, N. Aggarwal, J. Laudon, and J. E. Smith. Fair queuing memory systems. In MICRO-39, 2006.
- (2006) MICRO-39
- Nesbit, K.J.¹ Aggarwal, N.² Laudon, J.³ Smith, J.E.⁴

39
- 21644454187
- Pinpointing representative portions of large Intel Itanium programs with dynamic instrumentation
- H. Patil, R. Cohn, M. Charney, R. Kapoor, A. Sun, and A. Karunanidhi. Pinpointing representative portions of large Intel Itanium programs with dynamic instrumentation. In MICRO-37, 2004.
- (2004) MICRO-37
- Patil, H.¹ Cohn, R.² Charney, M.³ Kapoor, R.⁴ Sun, A.⁵ Karunanidhi, A.⁶

40
- 0029323403
- Wide-area traffic: The failure of Poisson modeling
- June
- V. Paxson and S. Floyd. Wide-area traffic: The failure of Poisson modeling. IEEE/ACM TON, pages 226-244, June 1995.
- (1995) IEEE/ACM TON , pp. 226-244
- Paxson, V.¹ Floyd, S.²

41
- 47849130815
- Effective management of DRAM bandwidth in multicore processors
- N. Rafique, W.-T. Lim, and M. Thottethodi. Effective management of DRAM bandwidth in multicore processors. In PACT-16, 2007.
- (2007) PACT-16
- Rafique, N.¹ Lim, W.-T.² Thottethodi, M.³

42
- 8344231178
- Analysis of LAS scheduling for job size distributions with high variance
- I. A. Rai, G. Urvoy-Keller, and E. W. Biersack. Analysis of LAS scheduling for job size distributions with high variance. In SIGMETRICS, 2003.
- (2003) SIGMETRICS
- Rai, I.A.¹ Urvoy-Keller, G.² Biersack, E.W.³

43
- 0026156613
- Pseudo-randomly interleaved memory
- B. R. Rau. Pseudo-randomly interleaved memory. In ISCA-18, 1991.
- (1991) ISCA-18
- Rau, B.R.¹

44
- 84971109320
- Scheduling multiclass single server queueing systems to stochastically maximize the number of successful departures
- R. Righter and J. Shanthikumar. Scheduling multiclass single server queueing systems to stochastically maximize the number of successful departures. Probability in the Engineering and Information Sciences, 3:967-978, 1989.
- (1989) Probability in the Engineering and Information Sciences , vol.3 , pp. 967-978
- Righter, R.¹ Shanthikumar, J.²

45
- 21644486223
- Memory controller optimizations for web servers
- S. Rixner. Memory controller optimizations for web servers. In MICRO-37, 2004.
- (2004) MICRO-37
- Rixner, S.¹

46
- 0033691565
- Memory access scheduling
- S. Rixner, W. J. Dally, U. J. Kapasi, P. Mattson, and J. D. Owens. Memory access scheduling. In ISCA-27, 2000.
- (2000) ISCA-27
- Rixner, S.¹ Dally, W.J.² Kapasi, U.J.³ Mattson, P.⁴ Owens, J.D.⁵

47
- 0000891048
- A proof of the optimality of the shortest remaining processing time discipline
- L. E. Schrage. A proof of the optimality of the shortest remaining processing time discipline. Operations Research, 16:678-690, 1968.
- (1968) Operations Research , vol.16 , pp. 678-690
- Schrage, L.E.¹

48
- 32844475712
- Evaluation of task assignment policies for supercomputing servers: The case for load unbalancing and fairness
- April
- B. Schroeder and M. Harchol-Balter. Evaluation of task assignment policies for supercomputing servers: The case for load unbalancing and fairness. Cluster Computing: The Journal of Networks, Software Tools, and Applications, 7(2):151-161, April 2004.
- (2004) Cluster Computing: The Journal of Networks, Software Tools, and Applications , vol.7 , Issue.2 , pp. 151-161
- Schroeder, B.¹ Harchol-Balter, M.²

49
- 0002357993
- Load-sensitive routing of long-lived IP flows
- A. Shaikh, J. Rexford, and K. G. Shin. Load-sensitive routing of long-lived IP flows. In SIGCOMM, 1999.
- (1999) SIGCOMM
- Shaikh, A.¹ Rexford, J.² Shin, K.G.³

50
- 34547692955
- A burst scheduling access reordering mechanism
- J. Shao and B. T. Davis. A burst scheduling access reordering mechanism. In HPCA-13, 2007.
- (2007) HPCA-13
- Shao, J.¹ Davis, B.T.²

51
- 0034443570
- Symbiotic jobscheduling for a simultaneous multithreading processor
- A. Snavely and D. M. Tullsen. Symbiotic jobscheduling for a simultaneous multithreading processor. In ASPLOS-IX, 2000.
- (2000) ASPLOS-IX
- Snavely, A.¹ Tullsen, D.M.²

52
- 37449003277
- Sun Microsystems
- Sun Microsystems. OpenSPARC T1 Microarchitecture Specification. http://opensparc-t1.sunsource.net/specs/OpenSPARCT1-Micro-Arch.pdf.
- OpenSPARC T1 Microarchitecture Specification

53
- 0026865523
- Increasing the number of strides for conflict-free vector access
- M. Valero, T. Lang, J. M. Llabería, M. Peiron, E. Ayguadé, and J. J. Navarra. Increasing the number of strides for conflict-free vector access. In ISCA-19, 1992.
- (1992) ISCA-19
- Valero, M.¹ Lang, T.² Llabería, J.M.³ Peiron, M.⁴ Ayguadé, E.⁵ Navarra, J.J.⁶

54
- 36849030305
- On-chip interconnection architecture of the tile processor
- D. Wentzlaff, P. Griffin, H. Hoffmann, L. Bao, B. Edwards, C. Ramey, M. Mattina, C.-C. Miao, J. F. Brown III, and A. Agarwal. On-chip interconnection architecture of the tile processor. IEEE Micro, 27(5):15-31, 2007.
- (2007) IEEE Micro , vol.27 , Issue.5 , pp. 15-31
- Wentzlaff, D.¹ Griffin, P.² Hoffmann, H.³ Bao, L.⁴ Edwards, B.⁵ Ramey, C.⁶ Mattina, M.⁷ Miao, C.-C.⁸ Brown III, J.F.⁹ Agarwal, A.¹⁰

55
- 0033688639
- Hardware-only stream prefetching and dynamic access ordering
- C. Zhang and S. A. McKee. Hardware-only stream prefetching and dynamic access ordering. In ICS, 2000.
- (2000) ICS
- Zhang, C.¹ McKee, S.A.²

56
- 0035510702
- The impulse memory controller
- Nov.
- L. Zhang, Z. Fang, M. Parker, B. K. Mathew, L. Schaelicke, J. B. Carter, W. C. Hsieh, and S. A. McKee. The impulse memory controller. IEEE TC, 50(11):1117-1132, Nov. 2001.
- (2001) IEEE TC , vol.50 , Issue.11 , pp. 1117-1132
- Zhang, L.¹ Fang, Z.² Parker, M.³ Mathew, B.K.⁴ Schaelicke, L.⁵ Carter, J.B.⁶ Hsieh, W.C.⁷ McKee, S.A.⁸

57
- 47349110818
- A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality
- Z. Zhang, Z. Zhu, and X. Zhang. A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality. In MICRO-33, 2000.
- (2000) MICRO-33
- Zhang, Z.¹ Zhu, Z.² Zhang, X.³

58
- 28444470842
- A performance comparison of DRAM memory system optimizations for SMT processors
- Z. Zhu and Z. Zhang. A performance comparison of DRAM memory system optimizations for SMT processors. In HPCA-11, 2005.
- (2005) HPCA-11
- Zhu, Z.¹ Zhang, Z.²

59
- 52649113530
- Controller for a synchronous DRAM that maximizes throughput by allowing memory requests and commands to be issued out of order
- U.S. Patent Number 5,630,096, May
- W. K. Zuravleff and T. Robinson. Controller for a synchronous DRAM that maximizes throughput by allowing memory requests and commands to be issued out of order. U.S. Patent Number 5,630,096, May 1997.
- (1997)
- Zuravleff, W.K.¹ Robinson, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.