SCOPUS 정보 검색 플랫폼

Proceedings - International Symposium on Code Generation and Optimization, CGO 2012

Volumn , Issue , 2012, Pages 1-12

Compiling for niceness: Mitigating contention for QoS in warehouse scale computers

(3) Tang, Lingjia a Mars, Jason a Soffa, Mary Lou a

a UNIVERSITY OF VIRGINIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CO-LOCATED; COLOCATIONS; COMPILATION TECHNIQUES; COMPUTATION POWER; DATA CENTERS; LOW PRIORITIES; MACHINE UTILIZATION; MEMORY ACCESS; MULTI-CORE MACHINES; MULTI-CORE PROCESSOR; MULTIPLE APPLICATIONS; OTHER APPLICATIONS; QOS REQUIREMENTS; SENSITIVE APPLICATION; SHARED MEMORIES; SYNTHETIC BENCHMARK;

INTERFERENCE SUPPRESSION; NETWORK COMPONENTS; OPTIMIZATION; PROGRAM COMPILERS; QUALITY OF SERVICE; WAREHOUSES;

BENCHMARKING;

EID: 84863448918 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2259016.2259018 Document Type: Conference Paper

Times cited : (59)

References (40)

1
- 70350754495
- Pam: A novel performance/power aware meta-scheduler for multi-core systems
- Nov.
- M. Banikazemi, D. Poff, and B. Abali. Pam: a novel performance/power aware meta-scheduler for multi-core systems. SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, Nov 2008.
- (2008) SC'08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing
- Banikazemi, M.¹ Poff, D.² Abali, B.³

2
- 0037619265
- Web search for a planet: The google cluster architecture
- L. Barroso, J. Dean, and U. Holzle. Web search for a planet: The google cluster architecture. IEEE Micro, 23(2):22-28, 2003.
- (2003) IEEE Micro , vol.23 , Issue.2 , pp. 22-28
- Barroso, L.¹ Dean, J.² Holzle, U.³

3
- 67649170859
- The datacenter as a computer: An introduction to the design of warehouse-scale machines
- L. Barroso and U. Ḧolzle. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis Lectures on Computer Architecture, 4(1):1-108, 2009.
- (2009) Synthesis Lectures on Computer Architecture , vol.4 , Issue.1 , pp. 1-108
- Barroso, L.¹ Ḧolzle, U.²

4
- 77954752931
- An approach to resource-aware co-scheduling for cmps
- Jun.
- M. Bhadauria and S. McKee. An approach to resource-aware co-scheduling for cmps. ICS 2010, Jun 2010.
- (2010) ICS 2010
- Bhadauria, M.¹ McKee, S.²

5
- 40349095122
- Managing distributed, shared l2 caches through os-level page allocation
- Dec.
- S. Cho and L. Jin. Managing distributed, shared l2 caches through os-level page allocation. MICRO 39, Dec 2006.
- (2006) MICRO , vol.39
- Cho, S.¹ Jin, L.²

6
- 77952285828
- Fairness via source throttling: A configurable and high-performance fairness substrate for multi-core memory systems
- Mar.
- E. Ebrahimi, C. Lee, O. Mutlu, and Y. Patt. Fairness via source throttling: a configurable and high-performance fairness substrate for multi-core memory systems. ASPLOS 2010, Mar 2010.
- (2010) ASPLOS 2010
- Ebrahimi, E.¹ Lee, C.² Mutlu, O.³ Patt, Y.⁴

7
- 47849108985
- Improving performance isolation on chip multiprocessors via an operating system scheduler
- Sep.
- A. Fedorova, M. Seltzer, and M. Smith. Improving performance isolation on chip multiprocessors via an operating system scheduler. PACT 2007, Sep 2007.
- (2007) PACT 2007
- Fedorova, A.¹ Seltzer, M.² Smith, M.³

8
- 47349085427
- A framework for providing quality of service in chip multi-processors
- F. Guo, Y. Solihin, L. Zhao, and R. Iyer. A framework for providing quality of service in chip multi-processors. MIRCO 2007, pages 343-355, 2007.
- (2007) MIRCO 2007 , pp. 343-355
- Guo, F.¹ Solihin, Y.² Zhao, L.³ Iyer, R.⁴

9
- 70449711364
- Rate-based qos techniques for cache/memory in cmp platforms
- Jun.
- A. Herdrich, R. Illikkal, R. Iyer, D. Newell, V. Chadha, and J. Moses. Rate-based qos techniques for cache/memory in cmp platforms. ICS'09: Proceedings of the 23rd international conference on Supercomputing, Jun 2009.
- (2009) ICS'09: Proceedings of the 23rd International Conference on Supercomputing
- Herdrich, A.¹ Illikkal, R.² Iyer, R.³ Newell, D.⁴ Chadha, V.⁵ Moses, J.⁶

10
- 79957469988
- Mao: An extensible micro-architectural optimizer
- Apr.
- R. Hundt, E. Raman, M. Thuresson, and N. Vachharajani. Mao: An extensible micro-architectural optimizer. In CGO 2011, pages 1 -10, Apr 2011.
- (2011) CGO 2011 , pp. 1-10
- Hundt, R.¹ Raman, E.² Thuresson, M.³ Vachharajani, N.⁴

11
- 67650595791
- Qos policies and architecture for cache/memory in cmp platforms
- Jun.
- R. Iyer, L. Zhao, F. Guo, R. Illikkal, S. Makineni, D. Newell, Y. Solihin, L. Hsu, and S. Reinhardt. Qos policies and architecture for cache/memory in cmp platforms. SIGMETRICS'07, Jun 2007.
- (2007) SIGMETRICS'07
- Iyer, R.¹ Zhao, L.² Guo, F.³ Illikkal, R.⁴ Makineni, S.⁵ Newell, D.⁶ Solihin, Y.⁷ Hsu, L.⁸ Reinhardt, S.⁹

12
- 63549085110
- Analysis and approximation of optimal co-scheduling on chip multiprocessors
- Oct.
- Y. Jiang, X. Shen, J. Chen, and R. Tripathi. Analysis and approximation of optimal co-scheduling on chip multiprocessors. PACT'08, Oct 2008.
- (2008) PACT'08
- Jiang, Y.¹ Shen, X.² Chen, J.³ Tripathi, R.⁴

13
- 77949597137
- Combining locality analysis with online proactive job co-scheduling in chip multiprocessors
- Y. Jiang, K. Tian, and X. Shen. Combining locality analysis with online proactive job co-scheduling in chip multiprocessors. HiPEAC 2010, pages 201-215, 2010.
- (2010) HiPEAC 2010 , pp. 201-215
- Jiang, Y.¹ Tian, K.² Shen, X.³

14
- 76749137634
- Optimizing shared cache behavior of chip multiprocessors
- M. Kandemir, S. Muralidhara, S. Narayanan, Y. Zhang, and O. Ozturk. Optimizing shared cache behavior of chip multiprocessors. Microarchitecture, 2009, pages 505-516, 2009.
- (2009) Microarchitecture, 2009 , pp. 505-516
- Kandemir, M.¹ Muralidhara, S.² Narayanan, S.³ Zhang, Y.⁴ Ozturk, O.⁵

15
- 77954696758
- Cache topology aware computation mapping for multicores
- Jun.
- M. Kandemir, T. Yemliha, S. Muralidhara, S. Srikantaiah, M. Irwin, and Y. Zhnag. Cache topology aware computation mapping for multicores. PLDI'10, Jun 2010.
- (2010) PLDI'10
- Kandemir, M.¹ Yemliha, T.² Muralidhara, S.³ Srikantaiah, S.⁴ Irwin, M.⁵ Zhnag, Y.⁶

16
- 10444238444
- Fair cache sharing and partitioning in a chip multiprocessor architecture
- Sep.
- S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. PACT 2004, Sep 2004.
- (2004) PACT 2004
- Kim, S.¹ Chandra, D.² Solihin, Y.³

17
- 47249103334
- Using os observations to improve performance in multicore systems
- R. Knauerhase, P. Brett, B. Hohlt, T. Li, and S. Hahn. Using os observations to improve performance in multicore systems. IEEE Micro, 28(3):54-66, 2008.
- (2008) IEEE Micro , vol.28 , Issue.3 , pp. 54-66
- Knauerhase, R.¹ Brett, P.² Hohlt, B.³ Li, T.⁴ Hahn, S.⁵

18
- 57749186047
- Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
- J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. HPCA 2008, pages 367-378, 2008.
- (2008) HPCA 2008 , pp. 367-378
- Lin, J.¹ Lu, Q.² Ding, X.³ Zhang, Z.⁴ Zhang, X.⁵ Sadayappan, P.⁶

19
- 77952569781
- Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance
- F. Liu, X. Jiang, and Y. Solihin. Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance. HPCA 2010, pages 1-12, 2010.
- (2010) HPCA 2010 , pp. 1-12
- Liu, F.¹ Jiang, X.² Solihin, Y.³

20
- 31944440969
- Pin: Building customized program analysis tools with dynamic instrumentation
- New York, NY, USA, ACM
- C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: building customized program analysis tools with dynamic instrumentation. PLDI'05, pages 190-200, New York, NY, USA, 2005. ACM.
- (2005) PLDI'05 , pp. 190-200
- Luk, C.-K.¹ Cohn, R.² Muth, R.³ Patil, H.⁴ Klauser, A.⁵ Lowney, G.⁶ Wallace, S.⁷ Reddi, V.J.⁸ Hazelwood, K.⁹

21
- 67650568324
- Scenario based optimization: A framework for statically enabling online optimizations
- Washington, DC, USA, IEEE Computer Society
- J. Mars and R. Hundt. Scenario based optimization: A framework for statically enabling online optimizations. CGO'09, pages 169-179, Washington, DC, USA, 2009. IEEE Computer Society.
- (2009) CGO'09 , pp. 169-179
- Mars, J.¹ Hundt, R.²

22
- 84858783719
- Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations
- New York, NY, USA, ACM
- J. Mars, L. Tang, R. Hundt, K. Skadron, and M. L. Soffa. Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In MICRO'11: Proceedings of The 44th Annual IEEE/ACM International Symposium on Microarchitecture, New York, NY, USA, 2011. ACM.
- (2011) MICRO'11: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
- Mars, J.¹ Tang, L.² Hundt, R.³ Skadron, K.⁴ Soffa, M.L.⁵

23
- 77954016468
- Contention aware execution: Online contention detection and response
- Apr.
- J. Mars, N. Vachharajani, R. Hundt, and M. Soffa. Contention aware execution: online contention detection and response. CGO'10, Apr 2010.
- (2010) CGO'10
- Mars, J.¹ Vachharajani, N.² Hundt, R.³ Soffa, M.⁴

24
- 67650783130
- Powernap: Eliminating server idle power
- New York, NY, USA, ACM
- D. Meisner, B. T. Gold, and T. F. Wenisch. Powernap: eliminating server idle power. ASPLOS'09, pages 205-216, New York, NY, USA, 2009. ACM.
- (2009) ASPLOS'09 , pp. 205-216
- Meisner, D.¹ Gold, B.T.² Wenisch, T.F.³

25
- 77954569425
- Q-clouds: Managing performance interference effects for qos-aware clouds
- Apr.
- R. Nathuji, A. Kansal, and A. Ghaffarkhah. Q-clouds: managing performance interference effects for qos-aware clouds. EuroSys'10, Apr 2010.
- (2010) EuroSys'10
- Nathuji, R.¹ Kansal, A.² Ghaffarkhah, A.³

26
- 34548050337
- Fair queuing memory systems
- K. Nesbit, N. Aggarwal, J. Laudon, and J. Smith. Fair queuing memory systems. MICRO 2006, pages 208 - 222, 2006.
- (2006) MICRO 2006 , pp. 208-222
- Nesbit, K.¹ Aggarwal, N.² Laudon, J.³ Smith, J.⁴

27
- 34548042910
- Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
- Dec.
- M. Qureshi and Y. Patt. Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches. MICRO 39: Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture, Dec 2006.
- (2006) MICRO 39: Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
- Qureshi, M.¹ Patt, Y.²

28
- 28244474239
- Enterprise IT trends and implications for architecture research
- Proceedings - 11th International Symposium on High-Performance Computer Architecture, HPCA-11 2005
- P. Ranganathan and N. Jouppi. Enterprise it trends and implications for architecture research. HPCA 2005, pages 253- 256, 2005. (Pubitemid 41731504)
- (2005) Proceedings - International Symposium on High-Performance Computer Architecture , pp. 253-256
- Ranganathan, P.¹ Jouppi, N.²

29
- 79957503818
- Automated locality optimization based on the reuse distance of string operations
- Apr.
- S. Rus, R. Ashok, and D. Li. Automated locality optimization based on the reuse distance of string operations. CGO'11, pages 181 -190, Apr 2011.
- (2011) CGO'11 , pp. 181-190
- Rus, S.¹ Ashok, R.² Li, D.³

30
- 78650832741
- Reducing cache pollution through detection and elimination of non-temporal memory accesses
- Nov
- A. Sandberg, D. Ekl̈ov, and E. Hagersten. Reducing cache pollution through detection and elimination of non-temporal memory accesses. SC 2010, Nov 2010.
- (2010) SC 2010
- Sandberg, A.¹ Ekl̈ov, D.² Hagersten, E.³

31
- 66749168716
- Reducing the harmful effects of last-level cache polluters with an os-level, software-only pollute buffer
- L. Soares, D. Tam, and M. Stumm. Reducing the harmful effects of last-level cache polluters with an os-level, software-only pollute buffer. Micro 2008, pages 258 - 269, 2008.
- (2008) Micro 2008 , pp. 258-269
- Soares, L.¹ Tam, D.² Stumm, M.³

32
- 67650091160
- A compiler-directed data prefetching scheme for chip multiprocessors
- Feb.
- S. Son, M. Kandemir, M. Karakoy, and D. Chakrabarti. A compiler-directed data prefetching scheme for chip multiprocessors. PPoPP 2009, Feb 2009.
- (2009) PPoPP 2009
- Son, S.¹ Kandemir, M.² Karakoy, M.³ Chakrabarti, D.⁴

33
- 70350634177
- Adaptive set pinning: Managing shared caches in chip multiprocessors
- Mar.
- S. Srikantaiah, M. Kandemir, and M. Irwin. Adaptive set pinning: managing shared caches in chip multiprocessors. ASPLOS XIII, Mar 2008.
- (2008) ASPLOS XIII
- Srikantaiah, S.¹ Kandemir, M.² Irwin, M.³

34
- 80052535250
- The impact of memory subsystem resource sharing on datacenter applications
- New York, NY, USA, ACM
- L. Tang, J. Mars, N. Vachharajani, R. Hundt, and M. L. Soffa. The impact of memory subsystem resource sharing on datacenter applications. ISCA'11, pages 283-294, New York, NY, USA, 2011. ACM.
- (2011) ISCA'11 , pp. 283-294
- Tang, L.¹ Mars, J.² Vachharajani, N.³ Hundt, R.⁴ Soffa, M.L.⁵

35
- 80053993064
- All-window profiling and composable models of cache sharing
- X. Xiang, B. Bao, T. Bai, C. Ding, and T. Chilimbi. All-window profiling and composable models of cache sharing. PPoPP'11, pages 91-102, 2011.
- (2011) PPoPP'11 , pp. 91-102
- Xiang, X.¹ Bao, B.² Bai, T.³ Ding, C.⁴ Chilimbi, T.⁵

36
- 78149262502
- On mitigating memory bandwidth contention through bandwidth-aware scheduling
- Sep.
- D. Xu, C. Wu, and P.-C. Yew. On mitigating memory bandwidth contention through bandwidth-aware scheduling. PACT 2010, Sep 2010.
- (2010) PACT 2010
- Xu, D.¹ Wu, C.² Yew, P.-C.³

37
- 77749340037
- Does cache sharing on modern cmp matter to the performance of contemporary multithreaded programs?
- E. Zhang, Y. Jiang, and X. Shen. Does cache sharing on modern cmp matter to the performance of contemporary multithreaded programs? PPoPP 2010, pages 203-212, 2010.
- (2010) PPoPP 2010 , pp. 203-212
- Zhang, E.¹ Jiang, Y.² Shen, X.³

38
- 85077083345
- Hardware execution throttling for multi-core resource management
- X. Zhang, S. Dwarkadas, and K. Shen. Hardware execution throttling for multi-core resource management. Proceedings of the 2009 conference on USENIX Annual technical conference, page 23, 2009.
- (2009) Proceedings of the 2009 Conference on USENIX Annual Technical Conference , pp. 23
- Zhang, X.¹ Dwarkadas, S.² Shen, K.³

39
- 80053541916
- Dynamic cache contention detection in multi-threaded applications
- Q. Zhao, D. Koh, S. Raza, D. Bruening, W. Wong, and S. Amarasinghe. Dynamic cache contention detection in multi-threaded applications. VEE 2011, pages 27-38, 2011.
- (2011) VEE 2011 , pp. 27-38
- Zhao, Q.¹ Koh, D.² Raza, S.³ Bruening, D.⁴ Wong, W.⁵ Amarasinghe, S.⁶

40
- 77952248898
- Addressing shared resource contention in multicore processors via scheduling
- Mar.
- S. Zhuravlev, S. Blagodurov, and A. Fedorova. Addressing shared resource contention in multicore processors via scheduling. ASPLOS 2010, Mar 2010.
- (2010) ASPLOS 2010
- Zhuravlev, S.¹ Blagodurov, S.² Fedorova, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.