-
2
-
-
77952562358
-
Roofline: an insightful visual performance model for floating-point programs and multicore architectures\
-
EECS Department, University of California, Berkeley, Oct. Available from: [Accessed on 3 December 2012]
-
Williams SW, Waterman A, Patterson DA,. Roofline: an insightful visual performance model for floating-point programs and multicore architectures. Technical Report UCB/EECS-2008-134, EECS Department, University of California, Berkeley, Oct 2008. Available from: http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-134.html [Accessed on 3 December 2012].
-
(2008)
Technical Report UCB/EECS-2008-134
-
-
Williams, S.W.1
Waterman, A.2
Patterson, D.A.3
-
3
-
-
77955113636
-
Introducing a performance model for bandwidth-limited loop kernels
-
Wyrzykowski R. Dongarra J. Karczewski K. Wasniewski J. (eds). Springer: Berlin / Heidelberg
-
Treibig J, Hager G,. Introducing a performance model for bandwidth-limited loop kernels. In Parallel Processing and Applied Mathematics, Lecture Notes in Computer Science, Vol. 6067, Wyrzykowski R, Dongarra J, Karczewski K, Wasniewski J, (eds). Springer: Berlin / Heidelberg, 2010; 615-624. DOI: 10.1007/978-3-642-14390-8-64.
-
(2010)
Parallel Processing and Applied Mathematics, Lecture Notes in Computer Science
, vol.6067
, pp. 615-624
-
-
Treibig, J.1
Hager, G.2
-
4
-
-
67650784628
-
Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs
-
Suleman MA, Qureshi MK, Patt YN., Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on CMPs. SIGARCH-Computer Architecture News 2008; 36 (1): 277-286. DOI: 10.1145/1353534.1346317.
-
(2008)
SIGARCH - Computer Architecture News
, vol.36
, Issue.1
, pp. 277-286
-
-
Suleman, M.A.1
Qureshi, M.K.2
Patt, Y.N.3
-
5
-
-
0034543848
-
Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications
-
Hoisie A, Lubeck O, Wasserman HJ,. Performance and scalability analysis of teraflop-scale parallel architectures using multidimensional wavefront applications. International Journal of High Performance Computing Applications 2000; 14: 330-346. DOI: 10.1177/109434200001400405.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, pp. 330-346
-
-
Hoisie, A.1
Lubeck, O.2
Wasserman, H.J.3
-
6
-
-
0034263076
-
Pace - a toolset for the performance prediction of parallel and distributed systems
-
Nudd GR, Kerbyson DJ, Papaefstathiou E, Perry SC, Harper JS, Wilcox DV., Pace-a toolset for the performance prediction of parallel and distributed systems. International Journal of High Performance Computing Applications 2000; 14 (3): 228-251. DOI: 10.1177/109434200001400306.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
, pp. 228-251
-
-
Nudd, G.R.1
Kerbyson, D.J.2
Papaefstathiou, E.3
Perry, S.C.4
Harper, J.S.5
Wilcox, D.V.6
-
7
-
-
78149347218
-
Predictive performance and scalability modeling of a large-scale application
-
Supercomputing '01, ACM, New York, NY, USA
-
Kerbyson DJ, Alme HJ, Hoisie A, Petrini F, Wasserman HJ, Gittings M,. Predictive performance and scalability modeling of a large-scale application. Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM), Supercomputing '01, ACM, New York, NY, USA, 2001; 37-37, DOI: 10.1145/582034.582071.
-
(2001)
Proceedings of the 2001 ACM/IEEE conference on Supercomputing (CDROM)
, pp. 37
-
-
Kerbyson, D.J.1
Alme, H.J.2
Hoisie, A.3
Petrini, F.4
Wasserman, H.J.5
Gittings, M.6
-
9
-
-
63549144218
-
Multi-mode energy management for multi-tier server clusters
-
PACT '08. ACM, New York, NY, USA
-
Horvath T, Skadron K,. Multi-mode energy management for multi-tier server clusters. Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08. ACM, New York, NY, USA, 2008; 270-279, DOI: 10.1145/1454115.1454153.
-
(2008)
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
, pp. 270-279
-
-
Horvath, T.1
Skadron, K.2
-
10
-
-
84870895827
-
Strategies for energy efficient resource management of hybrid programming models
-
PrePrints
-
Li D, de Supinski BR, Schulz M, Nikolopoulos DS, Cameron KW,. Strategies for energy efficient resource management of hybrid programming models. IEEE Transactions on Parallel and Distributed Systems 2012; 99, (PrePrints). DOI: 10.1109/TPDS.2012.95.
-
(2012)
IEEE Transactions on Parallel and Distributed Systems
, vol.99
-
-
Li, D.1
De Supinski, B.R.2
Schulz, M.3
Nikolopoulos, D.S.4
Cameron, K.W.5
-
11
-
-
84859729360
-
Power-management architecture of the Intel microarchitecture code-named Sandy Bridge
-
Rotem E, Naveh A, Ananthakrishnan A, Rajwan D, Weissmann E,. Power-management architecture of the Intel microarchitecture code-named Sandy Bridge. IEEE Micro 2012; 32: 20-27. DOI: 10.1109/MM.2012.12.
-
(2012)
IEEE Micro
, vol.32
, pp. 20-27
-
-
Rotem, E.1
Naveh, A.2
Ananthakrishnan, A.3
Rajwan, D.4
Weissmann, E.5
-
12
-
-
84879408937
-
Measuring energy consumption for short code paths using RAPL
-
ACM: New York, NY, USA
-
Hähnel M, Döbel B, Völp M, Härtig H,. Measuring energy consumption for short code paths using RAPL. In ACM SIGMETRICS Performance Evaluation Review, Vol. 40(3). ACM: New York, NY, USA, 2012; 13-17, DOI: 10.1145/2425248.2425252.
-
(2012)
ACM SIGMETRICS Performance Evaluation Review
, vol.40
, Issue.3
, pp. 13-17
-
-
Hähnel, M.1
Döbel, B.2
Völp, M.3
Härtig, H.4
-
14
-
-
78649844813
-
LIKWID: a lightweight performance-oriented tool suite for x86 multicore environments
-
IEEE Computer Society, Los Alamitos, CA, USA
-
Treibig J, Hager G, Wellein G,. LIKWID: a lightweight performance-oriented tool suite for x86 multicore environments. PSTI2010, the First International Workshop on Parallel Software Tools and Tool Infrastructures, IEEE Computer Society, Los Alamitos, CA, USA, 2010; 207-216, DOI: 10.1109/ICPPW.2010.38.
-
(2010)
PSTI2010, the First International Workshop on Parallel Software Tools and Tool Infrastructures
, pp. 207-216
-
-
Treibig, J.1
Hager, G.2
Wellein, G.3
-
15
-
-
84874415705
-
-
Accessed on 3 December 2012
-
LIKWID performance tools. Available from: http://code.google.com/p/likwid [Accessed on 3 December 2012].
-
LIKWID performance tools
-
-
-
16
-
-
84885229762
-
likwid-bench: an extensible microbenchmarking platform for x86 multicore environments
-
Resch M. et al. (eds). Springer: Berlin Heidelberg, To appear
-
Treibig J, Hager G, Wellein G,. likwid-bench: an extensible microbenchmarking platform for x86 multicore environments. In Toolsfor High Performance Computing 2011, Resch M, et al. (eds). Springer: Berlin Heidelberg, 2012. To appear.
-
(2012)
Toolsfor High Performance Computing 2011
-
-
Treibig, J.1
Hager, G.2
Wellein, G.3
-
17
-
-
84877281508
-
Pushing the limits for medical image reconstruction on recent standard multicore processors
-
Treibig J, Hager G, Hofmann HG, Hornegger J, Wellein G,. Pushing the limits for medical image reconstruction on recent standard multicore processors. International Journal of High Performance Computing Applications 2013; 27 (2): 162-177. DOI: 10.1177/1094342012442424.
-
(2013)
International Journal of High Performance Computing Applications
, vol.27
, Issue.2
, pp. 162-177
-
-
Treibig, J.1
Hager, G.2
Hofmann, H.G.3
Hornegger, J.4
Wellein, G.5
-
18
-
-
84956639009
-
-
Accessed on 1 August 2012
-
Intel architecture code analyzer, version 1.1.3. Available from: http://software.intel.com/en-us/articles/intel-architecture-code-analyzer/ [Accessed on 1 August 2012].
-
Intel architecture code analyzer, version 1.1.3
-
-
-
20
-
-
0345025793
-
STREAM: sustainable memory bandwidth in high performance computers
-
University of Virginia, Charlottesville, VA 1991-2007 a continually updated technical report [Accessed on 1 August 2012]
-
McCalpin JD,. STREAM: sustainable memory bandwidth in high performance computers. Technical Report, University of Virginia, Charlottesville, VA 1991-2007. Available from: http://www.cs.virginia.edu/stream/, a continually updated technical report [Accessed on 1 August 2012].
-
Technical Report
-
-
McCalpin, J.D.1
-
21
-
-
84988301289
-
-
1st edn. CRC Press, Inc.: Boca Raton, FL, USA
-
Hager G, Wellein G,. Introduction to High Performance Computing for Scientists and Engineers, 1st edn. CRC Press, Inc.: Boca Raton, FL, USA, 2010.
-
(2010)
Introduction to High Performance Computing for Scientists and Engineers
-
-
Hager, G.1
Wellein, G.2
-
26
-
-
21144470454
-
Boundary conditions for lattice Boltzmann simulations
-
Ziegler D,. Boundary conditions for lattice Boltzmann simulations. Journal of Statistical Physics 1993; 71 (5/6): 1171-1177.
-
(1993)
Journal of Statistical Physics
, vol.71
, Issue.56
, pp. 1171-1177
-
-
Ziegler, D.1
-
27
-
-
33646809359
-
On the single processor performance of simple lattice Boltzmann kernels
-
Wellein G, Zeiser T, Hager G, Donath S,. On the single processor performance of simple lattice Boltzmann kernels. Computers & Fluids 2006; 35: 910-919.
-
(2006)
Computers & Fluids
, vol.35
, pp. 910-919
-
-
Wellein, G.1
Zeiser, T.2
Hager, G.3
Donath, S.4
-
28
-
-
73849092882
-
Benchmark analysis and application results for lattice Boltzmann simulations on NEC SX vector and Intel Nehalem systems
-
Zeiser T, Hager G, Wellein G,. Benchmark analysis and application results for lattice Boltzmann simulations on NEC SX vector and Intel Nehalem systems. Parallel Processing Letters 2009; 19 (4): 491-511.
-
(2009)
Parallel Processing Letters
, vol.19
, Issue.4
, pp. 491-511
-
-
Zeiser, T.1
Hager, G.2
Wellein, G.3
|