-
1
-
-
84937702394
-
-
Online; accessed 23-May-2014
-
Miller, Rich, "The Billion Dollar Datacenter," http://www.datacenterknowledge.com/archives/2013/04/29/the-billion-dollar-data-centers/, 2013, Online; accessed 23-May-2014.
-
(2013)
The Billion Dollar Datacenter
-
-
Miller, R.1
-
4
-
-
84858783719
-
Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations
-
J. Mars, L. Tang, R. Hundt, K. Skadron, and M. L. Soffa, "Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations," in International Symposium on Microarchitecture (MICRO), 2011.
-
(2011)
International Symposium on Microarchitecture (MICRO)
-
-
Mars, J.1
Tang, L.2
Hundt, R.3
Skadron, K.4
Soffa, M.L.5
-
8
-
-
84890018410
-
-
Online; accessed 23-May-2014
-
"ARMv8 Instruction Set Overview," http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.genc010197a/index.html, 2011, Online; accessed 23-May-2014.
-
(2011)
ARMv8 Instruction Set Overview
-
-
-
9
-
-
84881190996
-
Bubble-flux: Precise online qos management for increased utilization in warehouse scale computers
-
H. Yang, A. Breslow, J. Mars, and L. Tang, "Bubble-flux: Precise online qos management for increased utilization in warehouse scale computers," in International Symposium on Computer Architecture (ISCA), 2013.
-
(2013)
International Symposium on Computer Architecture (ISCA)
-
-
Yang, H.1
Breslow, A.2
Mars, J.3
Tang, L.4
-
10
-
-
84875673650
-
Reqos: Reactive static/dynamic compilation for qos in warehouse scale computers
-
L. Tang, J. Mars, W. Wang, T. Dey, and M. L. Soffa, "Reqos: Reactive static/dynamic compilation for qos in warehouse scale computers," in Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2013.
-
(2013)
Architectural Support for Programming Languages and Operating Systems (ASPLOS)
-
-
Tang, L.1
Mars, J.2
Wang, W.3
Dey, T.4
Soffa, M.L.5
-
11
-
-
47249103334
-
Using os observations to improve performance in multicore systems
-
R. Knauerhase, P. Brett, B. Hohlt, T. Li, and S. Hahn, "Using os observations to improve performance in multicore systems," IEEE Micro, 2008.
-
(2008)
IEEE Micro
-
-
Knauerhase, R.1
Brett, P.2
Hohlt, B.3
Li, T.4
Hahn, S.5
-
14
-
-
57749186047
-
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems
-
J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan, "Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems," in High Performance Computer Architecture (HPCA), 2008.
-
(2008)
High Performance Computer Architecture (HPCA)
-
-
Lin, J.1
Lu, Q.2
Ding, X.3
Zhang, Z.4
Zhang, X.5
Sadayappan, P.6
-
16
-
-
67650091160
-
A compiler-directed data prefetching scheme for chip multiprocessors
-
S. W. Son, M. Kandemir, M. Karakoy, and D. Chakrabarti, "A compiler-directed data prefetching scheme for chip multiprocessors," in Principles and Practice of Parallel Programming (PPoPP), 2009.
-
(2009)
Principles and Practice of Parallel Programming (PPoPP)
-
-
Son, S.W.1
Kandemir, M.2
Karakoy, M.3
Chakrabarti, D.4
-
17
-
-
84863448918
-
Compiling for niceness: Mitigating contention for qos in warehouse scale computers
-
L. Tang, J. Mars, and M. L. Soffa, "Compiling for niceness: Mitigating contention for qos in warehouse scale computers," in Code Generation and Optimization (CGO), 2012.
-
(2012)
Code Generation and Optimization (CGO)
-
-
Tang, L.1
Mars, J.2
Soffa, M.L.3
-
18
-
-
79957503818
-
Automated locality optimization based on the reuse distance of string operations
-
S. Rus, R. Ashok, and D. X. Li, "Automated locality optimization based on the reuse distance of string operations," in Code Generation and Optimization (CGO), 2011.
-
(2011)
Code Generation and Optimization (CGO)
-
-
Rus, S.1
Ashok, R.2
Li, D.X.3
-
19
-
-
78650832741
-
Reducing cache pollution through detection and elimination of non-temporal memory accesses
-
A. Sandberg, D. Eklöv, and E. Hagersten, "Reducing cache pollution through detection and elimination of non-temporal memory accesses," in International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2010.
-
(2010)
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Sandberg, A.1
Eklöv, D.2
Hagersten, E.3
-
21
-
-
34547335786
-
Tainttrace: Efficient flow tracing with dynamic binary rewriting
-
W. Cheng, Q. Zhao, B. Yu, and S. Hiroshige, "Tainttrace: Efficient flow tracing with dynamic binary rewriting," in IEEE Symposium on Computers and Communications (ISCC), 2006.
-
(2006)
IEEE Symposium on Computers and Communications (ISCC)
-
-
Cheng, W.1
Zhao, Q.2
Yu, B.3
Hiroshige, S.4
-
24
-
-
67650020024
-
The performance of runtime data cache prefetching in a dynamic optimization system
-
J. Lu, H. Chen, R. Fu, W.-C. Hsu, B. Othmer, P.-C. Yew, and D.-Y. Chen, "The performance of runtime data cache prefetching in a dynamic optimization system," in International Symposium on Microarchitecture (MICRO), 2003.
-
(2003)
International Symposium on Microarchitecture (MICRO)
-
-
Lu, J.1
Chen, H.2
Fu, R.3
Hsu, W.-C.4
Othmer, B.5
Yew, P.-C.6
Chen, D.-Y.7
-
25
-
-
3042688082
-
Mojo: A dynamic optimization system
-
W.-K. Chen, S. Lerner, R. Chaiken, and D. M. Gillies, "Mojo: A dynamic optimization system," in Feedback-Directed and Dynamic Optimization (FDDO), 2000.
-
(2000)
Feedback-Directed and Dynamic Optimization (FDDO)
-
-
Chen, W.-K.1
Lerner, S.2
Chaiken, R.3
Gillies, D.M.4
-
27
-
-
84902239814
-
Dynamic and speculative polyhedral parallelization using compiler-generated skeletons
-
A. Jimborean, P. Clauss, J.-F. Dollinger, V. Loechner, and J. M. M. Caamano, "Dynamic and speculative polyhedral parallelization using compiler-generated skeletons," International Journal of Parallel Programming, 2014.
-
(2014)
International Journal of Parallel Programming
-
-
Jimborean, A.1
Clauss, P.2
Dollinger, J.-F.3
Loechner, V.4
Caamano, J.M.M.5
-
28
-
-
18044398549
-
Online impact analysis via dynamic compilation technology
-
B. Breech, A. Danalis, S. Shindo, and L. Pollock, "Online impact analysis via dynamic compilation technology," in International Conference on Software Maintenance (ICSM), 2004.
-
(2004)
International Conference on Software Maintenance (ICSM)
-
-
Breech, B.1
Danalis, A.2
Shindo, S.3
Pollock, L.4
-
29
-
-
77956067021
-
Google-wide profiling: A continuous profiling infrastructure for data centers
-
G. Ren, E. Tune, T. Moseley, Y. Shi, S. Rus, and R. Hundt, "Google-wide profiling: A continuous profiling infrastructure for data centers," IEEE Micro, 2010.
-
(2010)
IEEE Micro
-
-
Ren, G.1
Tune, E.2
Moseley, T.3
Shi, Y.4
Rus, S.5
Hundt, R.6
-
30
-
-
3042658703
-
Llvm: A compilation framework for lifelong program analysis & transformation
-
C. Lattner and V. Adve, "Llvm: A compilation framework for lifelong program analysis & transformation," in Code Generation and Optimization (CGO), 2004.
-
(2004)
Code Generation and Optimization (CGO)
-
-
Lattner, C.1
Adve, V.2
-
31
-
-
84858791438
-
Clearing the clouds: A study of emerging scale-out workloads on modern hardware
-
M. Ferdman, A. Adileh, O. Kocberber, S. Volos, M. Alisafaee, D. Jevdjic, C. Kaynak, A. D. Popescu, A. Ailamaki, and B. Falsafi, "Clearing the clouds: a study of emerging scale-out workloads on modern hardware," in Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2012.
-
(2012)
Architectural Support for Programming Languages and Operating Systems (ASPLOS)
-
-
Ferdman, M.1
Adileh, A.2
Kocberber, O.3
Volos, S.4
Alisafaee, M.5
Jevdjic, D.6
Kaynak, C.7
Popescu, A.D.8
Ailamaki, A.9
Falsafi, B.10
-
33
-
-
63549095070
-
The parsec benchmark suite: Characterization and architectural implications
-
C. Bienia, S. Kumar, J. P. Singh, and K. Li, "The parsec benchmark suite: Characterization and architectural implications," in Parallel Architectures and Compilation Techniques (PACT), 2008.
-
(2008)
Parallel Architectures and Compilation Techniques (PACT)
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
35
-
-
84937683341
-
Smite: Precise qos prediction on real system SMT processors to improve utilization in warehouse scale computers
-
Y. Zhang, M. A. Laurenzano, J. Mars, and L. Tang, "Smite: Precise qos prediction on real system smt processors to improve utilization in warehouse scale computers," in International Symposium on Microarchitecture (MICRO), 2014.
-
(2014)
International Symposium on Microarchitecture (MICRO)
-
-
Zhang, Y.1
Laurenzano, M.A.2
Mars, J.3
Tang, L.4
-
37
-
-
84881160871
-
A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness
-
H. Cook, M. Moreto, S. Bird, K. Dao, D. A. Patterson, and K. Asanovic, "A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness," in International Symposium on Computer Architecture (ISCA), 2013.
-
(2013)
International Symposium on Computer Architecture (ISCA)
-
-
Cook, H.1
Moreto, M.2
Bird, S.3
Dao, K.4
Patterson, D.A.5
Asanovic, K.6
-
38
-
-
34548042910
-
Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches
-
M. K. Qureshi and Y. N. Patt, "Utility-based cache partitioning: A low-overhead, high-performance, runtime mechanism to partition shared caches," in International Symposium on Microarchitecture (MICRO), 2006.
-
(2006)
International Symposium on Microarchitecture (MICRO)
-
-
Qureshi, M.K.1
Patt, Y.N.2
-
41
-
-
66749168716
-
Reducing the harmful effects of last-level cache polluters with an os-level, softwareonly pollute buffer
-
L. Soares, D. Tam, and M. Stumm, "Reducing the harmful effects of last-level cache polluters with an os-level, softwareonly pollute buffer," in International Symposium on Microarchitecture (MICRO), 2008.
-
(2008)
International Symposium on Microarchitecture (MICRO)
-
-
Soares, L.1
Tam, D.2
Stumm, M.3
-
42
-
-
57749176037
-
Managing shared l2 caches on multicore systems in software
-
D. Tam, R. Azimi, L. Soares, and M. Stumm, "Managing shared l2 caches on multicore systems in software," in Workshop on the Interaction between Operating Systems and Computer Architecture (WIOSCA), 2007.
-
(2007)
Workshop on the Interaction between Operating Systems and Computer Architecture (WIOSCA)
-
-
Tam, D.1
Azimi, R.2
Soares, L.3
Stumm, M.4
-
43
-
-
33845208599
-
A study of the performance potential for dynamic instruction hints selection
-
R. Fu, J. Lu, A. Zhai, and W.-C. Hsu, "A study of the performance potential for dynamic instruction hints selection," in Asia-Pacific Computer Systems Architecture Conference (ACSAC), 2006.
-
(2006)
Asia-Pacific Computer Systems Architecture Conference (ACSAC)
-
-
Fu, R.1
Lu, J.2
Zhai, A.3
Hsu, W.-C.4
-
45
-
-
84943385283
-
Retargetable and reconfigurable software dynamic translation
-
K. Scott, N. Kumar, S. Velusamy, B. Childers, J. W. Davidson, and M. L. Soffa, "Retargetable and reconfigurable software dynamic translation," in Code Generation and Optimization (CGO), 2003.
-
(2003)
Code Generation and Optimization (CGO)
-
-
Scott, K.1
Kumar, N.2
Velusamy, S.3
Childers, B.4
Davidson, J.W.5
Soffa, M.L.6
-
47
-
-
77952564111
-
Pebil: Efficient static binary instrumentation for linux
-
M. A. Laurenzano, M. M. Tikir, L. Carrington, and A. Snavely, "Pebil: Efficient static binary instrumentation for linux," in Performance Analysis of Systems and Software (ISPASS), 2010.
-
(2010)
Performance Analysis of Systems and Software (ISPASS)
-
-
Laurenzano, M.A.1
Tikir, M.M.2
Carrington, L.3
Snavely, A.4
-
48
-
-
84886067644
-
Bird: Binary interpretation using runtime disassembly
-
S. Nanda, W. Li, L.-C. Lam, and T.-C. Chiueh, "Bird: Binary interpretation using runtime disassembly," in Code Generation and Optimization (CGO), 2006.
-
(2006)
Code Generation and Optimization (CGO)
-
-
Nanda, S.1
Li, W.2
Lam, L.-C.3
Chiueh, T.-C.4
-
51
-
-
84883331358
-
The datacenter as a computer: An introduction to the design of warehousescale machines, 2nd edition
-
L. A. Barroso, J. Clidaras, and U. Hölzle, "The datacenter as a computer: an introduction to the design of warehousescale machines, 2nd edition," Synthesis Lectures on Computer Architecture, 2013.
-
(2013)
Synthesis Lectures on Computer Architecture
-
-
Barroso, L.A.1
Clidaras, J.2
Hölzle, U.3
-
52
-
-
84877690617
-
Cpi2: Cpu performance isolation for shared compute clusters
-
X. Zhang, E. Tune, R. Hagmann, R. Jnagal, V. Gokhale, and J. Wilkes, "Cpi2: Cpu performance isolation for shared compute clusters," in European Conference on Computer Systems (EuroSys), 2013.
-
(2013)
European Conference on Computer Systems (EuroSys)
-
-
Zhang, X.1
Tune, E.2
Hagmann, R.3
Jnagal, R.4
Gokhale, V.5
Wilkes, J.6
|