-
2
-
-
0003710740
-
-
second ed., vol. 1, MIT Press
-
M. Snir, S. Otto, S. Huss-Lederman, D. Walker, and J. Dongarra, MPI: The Complete Reference, second ed., vol. 1. MIT Press, 1998.
-
(1998)
MPI: The Complete Reference
-
-
Snir, M.1
Otto, S.2
Huss-Lederman, S.3
Walker, D.4
Dongarra, J.5
-
3
-
-
0037361074
-
Communication and optimization aspects of parallel programming models on hybrid architectures
-
R. Rabenseifner and G. Wellein, "Communication and Optimization Aspects of Parallel Programming Models on Hybrid Architectures," Int'l J. High Performance Computing Applications, vol. 17, no. 1, pp. 49-62, 2003.
-
(2003)
Int'l J. High Performance Computing Applications
, vol.17
, Issue.1
, pp. 49-62
-
-
Rabenseifner, R.1
Wellein, G.2
-
4
-
-
34248374123
-
Online power-performance adaptation of multithreaded programs using hardware event-based prediction
-
DOI 10.1145/1183401.1183426, Proceedings of the 20th Annual International Conference on Supercomputing, ICS 2006
-
M. Curtis-Maury, J. Dzierwa, C. D. Antonopoulos, and D. S. Nikolopoulos, "Online Power-Performance Adaptation of Multithreaded Programs Using Hardware Event-Based Prediction," Proc. 20th ACM Int'l Conf. Supercomputing, pp. 157-166, 2006. (Pubitemid 47168502)
-
(2006)
Proceedings of the International Conference on Supercomputing
, pp. 157-166
-
-
Curtis-Maury, M.1
Dzierwa, J.2
Antonopoulos, C.D.3
Nikolopoulos, D.S.4
-
5
-
-
77957764904
-
Feedback-driven threading: Power-efficient and high-performance execution of multi-threaded workloads on CMPs
-
DOI 10.1145/1346281.1346317, ASPLOS XIII - Thirteenth International Conference on Architectural Support for Programming Languages and Operating Systems
-
M. A. Suleman, M. K. Qureshi, and Y. N. Patt, "Feedback-Driven Threading: Power-Efficient and High-Performance Execution of Multithreaded Workloads on CMPs," Proc. 13th ACM Symp. Architectural Support for Programming Languages and Operating Systems, pp. 277-286, 2008. (Pubitemid 351585413)
-
(2008)
International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS
, pp. 277-286
-
-
Suleman, M.A.1
Qureshi, M.K.2
Patt, Y.N.3
-
6
-
-
51649112844
-
Prediction-based power-performance adaptation of multithreaded scientific codes
-
Oct
-
M. Curtis-Maury, F. Blagojevic, C. D. Antonopoulos, and D. S. Nikolopoulos, "Prediction-Based Power-Performance Adaptation of Multithreaded Scientific Codes," IEEE Trans. Parallel and Distributed Systems, vol. 19, no. 10, pp. 1396-1410, Oct. 2008.
-
(2008)
IEEE Trans. Parallel and Distributed Systems
, vol.19
, Issue.10
, pp. 1396-1410
-
-
Curtis-Maury, M.1
Blagojevic, F.2
Antonopoulos, C.D.3
Nikolopoulos, D.S.4
-
7
-
-
0036374185
-
Critical power slope: Understanding the runtime effects of frequency scaling
-
A. Miyoshi, C. Lefurgy, E. Van Hensbergen, R. Rajamony, and R. Rajkumar, "Critical Power Slope: Understanding the Runtime Effects of Frequency Scaling," Proc. 16th Ann. ACM Int'l Conf. Supercomputing, pp. 35-44, 2002. (Pubitemid 35039982)
-
(2002)
Proceedings of the International Conference on Supercomputing
, pp. 35-44
-
-
Miyoshi, A.1
Lefurgy, C.2
Van Hensbergen, E.3
Rajamony, R.4
Rajkumar, R.5
-
9
-
-
84944414165
-
Runtime power monitoring in high-end processors: Methodology and empirical data
-
C. Isci and M. Martonosi, "Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data," Proc. 36th Int'l Symp. Microarchitecture, pp. 93-104, 2003.
-
(2003)
Proc. 36th Int'l Symp. Microarchitecture
, pp. 93-104
-
-
Isci, C.1
Martonosi, M.2
-
10
-
-
33845438524
-
Just-in-time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs
-
V. Freeh, N. Kappiah, D. Lowenthal, and T. Bletsch, "Just-In-Time Dynamic Voltage Scaling: Exploiting Inter-node Slack to Save Energy in MPI Programs," Proc. Ann. ACM/IEEE Int'l Conf. Supercomputing (SC '05), 2005.
-
(2005)
Proc. Ann. ACM/IEEE Int'l Conf. Supercomputing (SC '05)
-
-
Freeh, V.1
Kappiah, N.2
Lowenthal, D.3
Bletsch, T.4
-
11
-
-
60649114675
-
Energy-oriented OpenMP parallel loop scheduling
-
Y. Dong, J. Chen, X. Yang, L. Deng, and X. Zhang, "Energy-Oriented OpenMP Parallel Loop Scheduling," Proc. Int'l Symp. Parallel and Distributed Processing with Applications, 2008.
-
(2008)
Proc. Int'l Symp. Parallel and Distributed Processing with Applications
-
-
Dong, Y.1
Chen, J.2
Yang, X.3
Deng, L.4
Zhang, X.5
-
13
-
-
63549125482
-
Prediction models for multi-dimensional power-performance optimization on many cores
-
M. Curtis-Maury, A. Shah, F. Blagojevic, D. S. Nikolopoulos, B. R. de Supinski, and M. Schulz, "Prediction Models for Multi-Dimensional Power-Performance Optimization on Many Cores," Proc. 17th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT), pp. 250-259, 2008.
-
(2008)
Proc. 17th Int'l Conf. Parallel Architectures and Compilation Techniques (PACT)
, pp. 250-259
-
-
Curtis-Maury, M.1
Shah, A.2
Blagojevic, F.3
Nikolopoulos, D.S.4
De Supinski, B.R.5
Schulz, M.6
-
14
-
-
33751054291
-
Minimizing execution time in MPI programs on an energy-constrained, power-scalable cluster
-
Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP'06
-
R. Springer, D. Lowenthal, B. Rountree, and V. Freeh, "Minimizing Execution Time in MPI Programs on an Energy-Constrained, Power-Scalable Cluster," Proc. 11th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), pp. 230-238, 2006. (Pubitemid 44758693)
-
(2006)
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
, vol.2006
, pp. 230-238
-
-
Springer, R.1
Lowenthal, D.K.2
Rountree, B.3
Freeh, V.W.4
-
15
-
-
56749165148
-
Bounding energy consumption in large-scale MPI programs
-
B. Rountree, D. Lowenthal, S. Funk, V. Freeh, B. R. de Supinski, and M. Schulz, "Bounding Energy Consumption in Large-Scale MPI Programs," Proc. ACM/IEEE Conf. Supercomputing (SC '07), 2007.
-
(2007)
Proc. ACM/IEEE Conf. Supercomputing (SC '07)
-
-
Rountree, B.1
Lowenthal, D.2
Funk, S.3
Freeh, V.4
De Supinski, B.R.5
Schulz, M.6
-
16
-
-
31844450952
-
Using multiple energy gears in MPI programs on a power-scalable cluster
-
DOI 10.1145/1065944.1065967, Proceedings of the 2005 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 05
-
V. Freeh and D. Lowenthal, "Using Multiple Energy Gears in MPI Programs on a Power-Scalable Cluster," Proc. 10th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP), pp. 164-173, 2005. (Pubitemid 43182843)
-
(2005)
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
, pp. 164-173
-
-
Freeh, V.W.1
Pan, F.2
Kappiah, N.3
Lowenthal, D.K.4
-
17
-
-
70449728146
-
Adagio: Making DVS practical for complex HPC applications
-
B. Rountree, D. K. Lowenthal, B. R. de Supinski, M. Schulz, V. W. Freeh, and T. Bletsch, "Adagio: Making DVS Practical for Complex HPC Applications," Proc. 23rd Int'l Conf. Supercomputing, pp. 460-469, 2009.
-
(2009)
Proc. 23rd Int'l Conf. Supercomputing
, pp. 460-469
-
-
Rountree, B.1
Lowenthal, D.K.2
De Supinski, B.R.3
Schulz, M.4
Freeh, V.W.5
Bletsch, T.6
-
18
-
-
84870920866
-
-
Lawrence Livermore Nat'l Laboratory
-
Lawrence Livermore Nat'l Laboratory, "ASC Sequoia Benchmarks," https://asc.llnl.gov/sequoia/benchmarks, 2012.
-
(2012)
ASC Sequoia Benchmarks
-
-
-
22
-
-
35048825254
-
Design and prototype of a performance tool interface for OpenMP
-
B. Mohr, A. D. Malony, S. Shende, and F. Wolf, "Design and Prototype of a Performance Tool Interface for OpenMP," Proc. Ann. Los Alamos Computer Science Inst. Symp. (LACSI), 2001.
-
(2001)
Proc. Ann. Los Alamos Computer Science Inst. Symp. (LACSI)
-
-
Mohr, B.1
Malony, A.D.2
Shende, S.3
Wolf, F.4
-
23
-
-
2442670256
-
-
NASA
-
NASA, "NAS Parallel Benchmarks," http://www.nas.nasa.gov/Resources/Software/npb.html, 2012.
-
(2012)
NAS Parallel Benchmarks
-
-
-
25
-
-
0036532956
-
BoomerAMG: A parallel algebraic multigrid solver and preconditioner
-
V. E. Henson and U. M. Yang, "BoomerAMG: A Parallel Algebraic Multigrid Solver and Preconditioner," Applied Numerical Math., vol. 41, pp. 155-177, 2000.
-
(2000)
Applied Numerical Math
, vol.41
, pp. 155-177
-
-
Henson, V.E.1
Yang, U.M.2
-
27
-
-
70450278773
-
Towards a holistic approach to auto-parallelization integrating profile-driven parallelism detection and machine-learning based mapping
-
G. Tournavitis, Z. Wang, B. Franke, and M. F. O'Boyle, "Towards a Holistic Approach to Auto-Parallelization Integrating Profile-Driven Parallelism Detection and Machine-Learning Based Mapping," Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation, pp. 177-187, 2009.
-
(2009)
Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation
, pp. 177-187
-
-
Tournavitis, G.1
Wang, Z.2
Franke, B.3
O'Boyle, M.F.4
-
28
-
-
34547182898
-
Reducing energy consumption of parallel sparse matrix applications through integrated link/CPU voltage scaling
-
DOI 10.1007/s11227-007-0113-9
-
S. W. Son, K. Malkowski, G. Chen, M. Kandemir, and P. Raghavan, "Reducing Energy Consumption of Parallel Sparse Matrix Applications through Integrated Link/CPU Voltage Scaling," J. Supercomputing, vol. 41, no. 3, pp. 179-213, 2007. (Pubitemid 47115199)
-
(2007)
Journal of Supercomputing
, vol.41
, Issue.3
, pp. 179-213
-
-
Son, S.W.1
Malkowski, K.2
Chen, G.3
Kandemir, M.4
Raghavan, P.5