-
2
-
-
0037619265
-
Web search for a planet: The google cluster architecture
-
L. Barroso, J. Dean, and U. Holzle. Web search for a planet: The google cluster architecture. IEEE Micro, 23(2):22-28, 2003.
-
(2003)
IEEE Micro
, vol.23
, Issue.2
, pp. 22-28
-
-
Barroso, L.1
Dean, J.2
Holzle, U.3
-
3
-
-
67649170859
-
The datacenter as a computer: An introduction to the design of warehouse-scale machines
-
L. Barroso and U. Ḧolzle. The datacenter as a computer: An introduction to the design of warehouse-scale machines. Synthesis Lectures on Computer Architecture, 4(1):1-108, 2009.
-
(2009)
Synthesis Lectures on Computer Architecture
, vol.4
, Issue.1
, pp. 1-108
-
-
Barroso, L.1
Ḧolzle, U.2
-
4
-
-
77954752931
-
An approach to resource-aware co-scheduling for cmps
-
Jun.
-
M. Bhadauria and S. McKee. An approach to resource-aware co-scheduling for cmps. ICS 2010, Jun 2010.
-
(2010)
ICS 2010
-
-
Bhadauria, M.1
McKee, S.2
-
5
-
-
40349095122
-
Managing distributed, shared l2 caches through os-level page allocation
-
Dec.
-
S. Cho and L. Jin. Managing distributed, shared l2 caches through os-level page allocation. MICRO 39, Dec 2006.
-
(2006)
MICRO
, vol.39
-
-
Cho, S.1
Jin, L.2
-
6
-
-
77952285828
-
Fairness via source throttling: A configurable and high-performance fairness substrate for multi-core memory systems
-
Mar.
-
E. Ebrahimi, C. Lee, O. Mutlu, and Y. Patt. Fairness via source throttling: a configurable and high-performance fairness substrate for multi-core memory systems. ASPLOS 2010, Mar 2010.
-
(2010)
ASPLOS 2010
-
-
Ebrahimi, E.1
Lee, C.2
Mutlu, O.3
Patt, Y.4
-
7
-
-
47849108985
-
Improving performance isolation on chip multiprocessors via an operating system scheduler
-
Sep.
-
A. Fedorova, M. Seltzer, and M. Smith. Improving performance isolation on chip multiprocessors via an operating system scheduler. PACT 2007, Sep 2007.
-
(2007)
PACT 2007
-
-
Fedorova, A.1
Seltzer, M.2
Smith, M.3
-
8
-
-
47349085427
-
A framework for providing quality of service in chip multi-processors
-
F. Guo, Y. Solihin, L. Zhao, and R. Iyer. A framework for providing quality of service in chip multi-processors. MIRCO 2007, pages 343-355, 2007.
-
(2007)
MIRCO 2007
, pp. 343-355
-
-
Guo, F.1
Solihin, Y.2
Zhao, L.3
Iyer, R.4
-
9
-
-
70449711364
-
Rate-based qos techniques for cache/memory in cmp platforms
-
Jun.
-
A. Herdrich, R. Illikkal, R. Iyer, D. Newell, V. Chadha, and J. Moses. Rate-based qos techniques for cache/memory in cmp platforms. ICS'09: Proceedings of the 23rd international conference on Supercomputing, Jun 2009.
-
(2009)
ICS'09: Proceedings of the 23rd International Conference on Supercomputing
-
-
Herdrich, A.1
Illikkal, R.2
Iyer, R.3
Newell, D.4
Chadha, V.5
Moses, J.6
-
10
-
-
79957469988
-
Mao: An extensible micro-architectural optimizer
-
Apr.
-
R. Hundt, E. Raman, M. Thuresson, and N. Vachharajani. Mao: An extensible micro-architectural optimizer. In CGO 2011, pages 1 -10, Apr 2011.
-
(2011)
CGO 2011
, pp. 1-10
-
-
Hundt, R.1
Raman, E.2
Thuresson, M.3
Vachharajani, N.4
-
11
-
-
67650595791
-
Qos policies and architecture for cache/memory in cmp platforms
-
Jun.
-
R. Iyer, L. Zhao, F. Guo, R. Illikkal, S. Makineni, D. Newell, Y. Solihin, L. Hsu, and S. Reinhardt. Qos policies and architecture for cache/memory in cmp platforms. SIGMETRICS'07, Jun 2007.
-
(2007)
SIGMETRICS'07
-
-
Iyer, R.1
Zhao, L.2
Guo, F.3
Illikkal, R.4
Makineni, S.5
Newell, D.6
Solihin, Y.7
Hsu, L.8
Reinhardt, S.9
-
12
-
-
63549085110
-
Analysis and approximation of optimal co-scheduling on chip multiprocessors
-
Oct.
-
Y. Jiang, X. Shen, J. Chen, and R. Tripathi. Analysis and approximation of optimal co-scheduling on chip multiprocessors. PACT'08, Oct 2008.
-
(2008)
PACT'08
-
-
Jiang, Y.1
Shen, X.2
Chen, J.3
Tripathi, R.4
-
13
-
-
77949597137
-
Combining locality analysis with online proactive job co-scheduling in chip multiprocessors
-
Y. Jiang, K. Tian, and X. Shen. Combining locality analysis with online proactive job co-scheduling in chip multiprocessors. HiPEAC 2010, pages 201-215, 2010.
-
(2010)
HiPEAC 2010
, pp. 201-215
-
-
Jiang, Y.1
Tian, K.2
Shen, X.3
-
14
-
-
76749137634
-
Optimizing shared cache behavior of chip multiprocessors
-
M. Kandemir, S. Muralidhara, S. Narayanan, Y. Zhang, and O. Ozturk. Optimizing shared cache behavior of chip multiprocessors. Microarchitecture, 2009, pages 505-516, 2009.
-
(2009)
Microarchitecture, 2009
, pp. 505-516
-
-
Kandemir, M.1
Muralidhara, S.2
Narayanan, S.3
Zhang, Y.4
Ozturk, O.5
-
15
-
-
77954696758
-
Cache topology aware computation mapping for multicores
-
Jun.
-
M. Kandemir, T. Yemliha, S. Muralidhara, S. Srikantaiah, M. Irwin, and Y. Zhnag. Cache topology aware computation mapping for multicores. PLDI'10, Jun 2010.
-
(2010)
PLDI'10
-
-
Kandemir, M.1
Yemliha, T.2
Muralidhara, S.3
Srikantaiah, S.4
Irwin, M.5
Zhnag, Y.6
-
16
-
-
10444238444
-
Fair cache sharing and partitioning in a chip multiprocessor architecture
-
Sep.
-
S. Kim, D. Chandra, and Y. Solihin. Fair cache sharing and partitioning in a chip multiprocessor architecture. PACT 2004, Sep 2004.
-
(2004)
PACT 2004
-
-
Kim, S.1
Chandra, D.2
Solihin, Y.3
-
17
-
-
47249103334
-
Using os observations to improve performance in multicore systems
-
R. Knauerhase, P. Brett, B. Hohlt, T. Li, and S. Hahn. Using os observations to improve performance in multicore systems. IEEE Micro, 28(3):54-66, 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.3
, pp. 54-66
-
-
Knauerhase, R.1
Brett, P.2
Hohlt, B.3
Li, T.4
Hahn, S.5
-
18
-
-
57749186047
-
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
-
J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan. Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. HPCA 2008, pages 367-378, 2008.
-
(2008)
HPCA 2008
, pp. 367-378
-
-
Lin, J.1
Lu, Q.2
Ding, X.3
Zhang, Z.4
Zhang, X.5
Sadayappan, P.6
-
19
-
-
77952569781
-
Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance
-
F. Liu, X. Jiang, and Y. Solihin. Understanding how off-chip memory bandwidth partitioning in chip multiprocessors affects system performance. HPCA 2010, pages 1-12, 2010.
-
(2010)
HPCA 2010
, pp. 1-12
-
-
Liu, F.1
Jiang, X.2
Solihin, Y.3
-
20
-
-
31944440969
-
Pin: Building customized program analysis tools with dynamic instrumentation
-
New York, NY, USA, ACM
-
C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: building customized program analysis tools with dynamic instrumentation. PLDI'05, pages 190-200, New York, NY, USA, 2005. ACM.
-
(2005)
PLDI'05
, pp. 190-200
-
-
Luk, C.-K.1
Cohn, R.2
Muth, R.3
Patil, H.4
Klauser, A.5
Lowney, G.6
Wallace, S.7
Reddi, V.J.8
Hazelwood, K.9
-
21
-
-
67650568324
-
Scenario based optimization: A framework for statically enabling online optimizations
-
Washington, DC, USA, IEEE Computer Society
-
J. Mars and R. Hundt. Scenario based optimization: A framework for statically enabling online optimizations. CGO'09, pages 169-179, Washington, DC, USA, 2009. IEEE Computer Society.
-
(2009)
CGO'09
, pp. 169-179
-
-
Mars, J.1
Hundt, R.2
-
22
-
-
84858783719
-
Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations
-
New York, NY, USA, ACM
-
J. Mars, L. Tang, R. Hundt, K. Skadron, and M. L. Soffa. Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations. In MICRO'11: Proceedings of The 44th Annual IEEE/ACM International Symposium on Microarchitecture, New York, NY, USA, 2011. ACM.
-
(2011)
MICRO'11: Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
-
-
Mars, J.1
Tang, L.2
Hundt, R.3
Skadron, K.4
Soffa, M.L.5
-
23
-
-
77954016468
-
Contention aware execution: Online contention detection and response
-
Apr.
-
J. Mars, N. Vachharajani, R. Hundt, and M. Soffa. Contention aware execution: online contention detection and response. CGO'10, Apr 2010.
-
(2010)
CGO'10
-
-
Mars, J.1
Vachharajani, N.2
Hundt, R.3
Soffa, M.4
-
24
-
-
67650783130
-
Powernap: Eliminating server idle power
-
New York, NY, USA, ACM
-
D. Meisner, B. T. Gold, and T. F. Wenisch. Powernap: eliminating server idle power. ASPLOS'09, pages 205-216, New York, NY, USA, 2009. ACM.
-
(2009)
ASPLOS'09
, pp. 205-216
-
-
Meisner, D.1
Gold, B.T.2
Wenisch, T.F.3
-
25
-
-
77954569425
-
Q-clouds: Managing performance interference effects for qos-aware clouds
-
Apr.
-
R. Nathuji, A. Kansal, and A. Ghaffarkhah. Q-clouds: managing performance interference effects for qos-aware clouds. EuroSys'10, Apr 2010.
-
(2010)
EuroSys'10
-
-
Nathuji, R.1
Kansal, A.2
Ghaffarkhah, A.3
-
26
-
-
34548050337
-
Fair queuing memory systems
-
K. Nesbit, N. Aggarwal, J. Laudon, and J. Smith. Fair queuing memory systems. MICRO 2006, pages 208 - 222, 2006.
-
(2006)
MICRO 2006
, pp. 208-222
-
-
Nesbit, K.1
Aggarwal, N.2
Laudon, J.3
Smith, J.4
-
29
-
-
79957503818
-
Automated locality optimization based on the reuse distance of string operations
-
Apr.
-
S. Rus, R. Ashok, and D. Li. Automated locality optimization based on the reuse distance of string operations. CGO'11, pages 181 -190, Apr 2011.
-
(2011)
CGO'11
, pp. 181-190
-
-
Rus, S.1
Ashok, R.2
Li, D.3
-
30
-
-
78650832741
-
Reducing cache pollution through detection and elimination of non-temporal memory accesses
-
Nov
-
A. Sandberg, D. Ekl̈ov, and E. Hagersten. Reducing cache pollution through detection and elimination of non-temporal memory accesses. SC 2010, Nov 2010.
-
(2010)
SC 2010
-
-
Sandberg, A.1
Ekl̈ov, D.2
Hagersten, E.3
-
31
-
-
66749168716
-
Reducing the harmful effects of last-level cache polluters with an os-level, software-only pollute buffer
-
L. Soares, D. Tam, and M. Stumm. Reducing the harmful effects of last-level cache polluters with an os-level, software-only pollute buffer. Micro 2008, pages 258 - 269, 2008.
-
(2008)
Micro 2008
, pp. 258-269
-
-
Soares, L.1
Tam, D.2
Stumm, M.3
-
32
-
-
67650091160
-
A compiler-directed data prefetching scheme for chip multiprocessors
-
Feb.
-
S. Son, M. Kandemir, M. Karakoy, and D. Chakrabarti. A compiler-directed data prefetching scheme for chip multiprocessors. PPoPP 2009, Feb 2009.
-
(2009)
PPoPP 2009
-
-
Son, S.1
Kandemir, M.2
Karakoy, M.3
Chakrabarti, D.4
-
33
-
-
70350634177
-
Adaptive set pinning: Managing shared caches in chip multiprocessors
-
Mar.
-
S. Srikantaiah, M. Kandemir, and M. Irwin. Adaptive set pinning: managing shared caches in chip multiprocessors. ASPLOS XIII, Mar 2008.
-
(2008)
ASPLOS XIII
-
-
Srikantaiah, S.1
Kandemir, M.2
Irwin, M.3
-
34
-
-
80052535250
-
The impact of memory subsystem resource sharing on datacenter applications
-
New York, NY, USA, ACM
-
L. Tang, J. Mars, N. Vachharajani, R. Hundt, and M. L. Soffa. The impact of memory subsystem resource sharing on datacenter applications. ISCA'11, pages 283-294, New York, NY, USA, 2011. ACM.
-
(2011)
ISCA'11
, pp. 283-294
-
-
Tang, L.1
Mars, J.2
Vachharajani, N.3
Hundt, R.4
Soffa, M.L.5
-
35
-
-
80053993064
-
All-window profiling and composable models of cache sharing
-
X. Xiang, B. Bao, T. Bai, C. Ding, and T. Chilimbi. All-window profiling and composable models of cache sharing. PPoPP'11, pages 91-102, 2011.
-
(2011)
PPoPP'11
, pp. 91-102
-
-
Xiang, X.1
Bao, B.2
Bai, T.3
Ding, C.4
Chilimbi, T.5
-
36
-
-
78149262502
-
On mitigating memory bandwidth contention through bandwidth-aware scheduling
-
Sep.
-
D. Xu, C. Wu, and P.-C. Yew. On mitigating memory bandwidth contention through bandwidth-aware scheduling. PACT 2010, Sep 2010.
-
(2010)
PACT 2010
-
-
Xu, D.1
Wu, C.2
Yew, P.-C.3
-
37
-
-
77749340037
-
Does cache sharing on modern cmp matter to the performance of contemporary multithreaded programs?
-
E. Zhang, Y. Jiang, and X. Shen. Does cache sharing on modern cmp matter to the performance of contemporary multithreaded programs? PPoPP 2010, pages 203-212, 2010.
-
(2010)
PPoPP 2010
, pp. 203-212
-
-
Zhang, E.1
Jiang, Y.2
Shen, X.3
-
39
-
-
80053541916
-
Dynamic cache contention detection in multi-threaded applications
-
Q. Zhao, D. Koh, S. Raza, D. Bruening, W. Wong, and S. Amarasinghe. Dynamic cache contention detection in multi-threaded applications. VEE 2011, pages 27-38, 2011.
-
(2011)
VEE 2011
, pp. 27-38
-
-
Zhao, Q.1
Koh, D.2
Raza, S.3
Bruening, D.4
Wong, W.5
Amarasinghe, S.6
-
40
-
-
77952248898
-
Addressing shared resource contention in multicore processors via scheduling
-
Mar.
-
S. Zhuravlev, S. Blagodurov, and A. Fedorova. Addressing shared resource contention in multicore processors via scheduling. ASPLOS 2010, Mar 2010.
-
(2010)
ASPLOS 2010
-
-
Zhuravlev, S.1
Blagodurov, S.2
Fedorova, A.3
|