-
1
-
-
84867435449
-
Balance principles for algorithm-architecture co-design
-
K. Czechowski, C. Battaglino, C. McClanahan, A. Chandramowlishwaran, and R. Vuduc, "Balance principles for algorithm-architecture co-design," in Proc. USENIX Wkshp. Hot Topics in Parallelism (HotPar), Berkeley, CA, USA, May 2011.
-
Proc. USENIX Wkshp. Hot Topics in Parallelism (HotPar), Berkeley, CA, USA, May 2011
-
-
Czechowski, K.1
Battaglino, C.2
McClanahan, C.3
Chandramowlishwaran, A.4
Vuduc, R.5
-
2
-
-
79961058189
-
What GPU computing means for high-end systems
-
July/August
-
R. Vuduc and K. Czechowski, "What GPU computing means for high-end systems," IEEE Micro, vol. 31, no. 4, pp. 74-78, July/August 2011.
-
(2011)
IEEE Micro
, vol.31
, Issue.4
, pp. 74-78
-
-
Vuduc, R.1
Czechowski, K.2
-
3
-
-
84864032930
-
On the communication complexity of 3D FFTs and its implications for exascale
-
K. Czechowski, C. McClanahan, C. Battaglino, K. Iyer, P.-K. Yeung, and R. Vuduc, "On the communication complexity of 3D FFTs and its implications for exascale," in Proc. ACM Int'l. Conf. Supercomputing (ICS), San Servolo Island, Venice, Italy, June 2012.
-
Proc. ACM Int'l. Conf. Supercomputing (ICS), San Servolo Island, Venice, Italy, June 2012
-
-
Czechowski, K.1
McClanahan, C.2
Battaglino, C.3
Iyer, K.4
Yeung, P.-K.5
Vuduc, R.6
-
4
-
-
48249118853
-
Amdahl's Law in the Multicore Era
-
Jul.
-
M. D. Hill and M. R. Marty, "Amdahl's Law in the Multicore Era," Computer, vol. 41, no. 7, pp. 33-38, Jul. 2008.
-
(2008)
Computer
, vol.41
, Issue.7
, pp. 33-38
-
-
Hill, M.D.1
Marty, M.R.2
-
5
-
-
64049097304
-
Extending Amdahl's Law for energy-efficient computing in the many-core era
-
Dec.
-
D. H. Woo and H.-H. S. Lee, "Extending Amdahl's Law for energy-efficient computing in the many-core era," IEEE Computer, vol. 41, no. 12, pp. 24-31, Dec. 2008.
-
(2008)
IEEE Computer
, vol.41
, Issue.12
, pp. 24-31
-
-
Woo, D.H.1
Lee, H.-H.S.2
-
9
-
-
0024662699
-
f1/2: A parameter to characterize memory and communication bottlenecks
-
May
-
R. W. Hockney and I. J. Curington, "f1/2: A parameter to characterize memory and communication bottlenecks," Parallel Computing, vol. 10, no. 3, pp. 277-286, May 1989.
-
(1989)
Parallel Computing
, vol.10
, Issue.3
, pp. 277-286
-
-
Hockney, R.W.1
Curington, I.J.2
-
10
-
-
84858657417
-
The hidden cost of low bandwidth communication
-
U. Vishkin, Ed. New York, NY, USA: ACM
-
G. E. Blelloch, B. M. Maggs, and G. L. Miller, "The hidden cost of low bandwidth communication," in Developing a Computer Science Agenda for High-Performance Computing, U. Vishkin, Ed. New York, NY, USA: ACM, 1994, pp. 22-25.
-
(1994)
Developing a Computer Science Agenda for High-Performance Computing
, pp. 22-25
-
-
Blelloch, G.E.1
Maggs, B.M.2
Miller, G.L.3
-
12
-
-
65949107549
-
Roofline: An insightful visual performance model for multicore architectures
-
Apr.
-
S. Williams, A. Waterman, and D. Patterson, "Roofline: An insightful visual performance model for multicore architectures," Communications of the ACM, vol. 52, no. 4, p. 65, Apr. 2009.
-
(2009)
Communications of the ACM
, vol.52
, Issue.4
, pp. 65
-
-
Williams, S.1
Waterman, A.2
Patterson, D.3
-
13
-
-
84971853043
-
I/O complexity: The red-blue pebble game
-
New York, New York, USA: ACM Press, May
-
H. Jia-Wei and H. T. Kung, "I/O complexity: The red-blue pebble game," in Proceedings of the thirteenth annual ACM symposium on Theory of computing - STOC '81. New York, New York, USA: ACM Press, May 1981, pp. 326-333.
-
(1981)
Proceedings of the Thirteenth Annual ACM Symposium on Theory of Computing - STOC '81
, pp. 326-333
-
-
Jia-Wei, H.1
Kung, H.T.2
-
14
-
-
80054875176
-
GPUs and the Future of Parallel Computing
-
Sep.
-
S. W. Keckler, W. J. Dally, B. Khailany, M. Garland, and D. Glasco, "GPUs and the Future of Parallel Computing," IEEE Micro, vol. 31, no. 5, pp. 7-17, Sep. 2011.
-
(2011)
IEEE Micro
, vol.31
, Issue.5
, pp. 7-17
-
-
Keckler, S.W.1
Dally, W.J.2
Khailany, B.3
Garland, M.4
Glasco, D.5
-
15
-
-
80053024794
-
Enhanced Race-To-Halt: A Leakage-Aware Energy Management Approach for Dynamic Priority Systems
-
IEEE, Jul.
-
M. A. Awan and S. M. Petters, "Enhanced Race-To-Halt: A Leakage-Aware Energy Management Approach for Dynamic Priority Systems," in 2011 23rd Euromicro Conference on Real-Time Systems. IEEE, Jul. 2011, pp. 92-101.
-
(2011)
2011 23rd Euromicro Conference on Real-Time Systems
, pp. 92-101
-
-
Awan, M.A.1
Petters, S.M.2
-
16
-
-
77952735662
-
-
RENaissance Computing Institute, University of North Caroliina, Chapel Hill, NC, USA, Tech. Rep.
-
D. Bedard, M. Y. Lim, R. Fowler, and A. Porterfield, "PowerMon 2: Fine-grained, integrated measurement," RENaissance Computing Institute, University of North Caroliina, Chapel Hill, NC, USA, Tech. Rep., 2009.
-
(2009)
PowerMon 2: Fine-grained, Integrated Measurement
-
-
Bedard, D.1
Lim, M.Y.2
Fowler, R.3
Porterfield, A.4
-
17
-
-
0000396658
-
A fast algorithm for particle simulations
-
Dec.
-
L. GREENGARD and V. ROKHLIN, "A fast algorithm for particle simulations," Journal of Computational Physics, vol. 73, no. 2, pp. 325-348, Dec. 1987.
-
(1987)
Journal of Computational Physics
, vol.73
, Issue.2
, pp. 325-348
-
-
Greengard, L.1
Rokhlin, V.2
-
18
-
-
84864128713
-
Towards a communication optimal fast multipole method and its implications for exascale
-
brief announcement
-
A. Chandramowlishwaran, J. W. Choi, K. Madduri, and R. Vuduc, "Towards a communication optimal fast multipole method and its implications for exascale," in Proc. ACM Symp. Parallel Algorithms and Architectures (SPAA), Pittsburgh, PA, USA, June 2012, brief announcement.
-
Proc. ACM Symp. Parallel Algorithms and Architectures (SPAA), Pittsburgh, PA, USA, June 2012
-
-
Chandramowlishwaran, A.1
Choi, J.W.2
Madduri, K.3
Vuduc, R.4
-
21
-
-
84884864329
-
-
University of California, Berkeley, CA, USA, Tech. Rep.
-
J. Demmel, A. Gearhart, O. Schwartz, and B. Lipschitz, "Perfect strong scaling using no additional energy," University of California, Berkeley, CA, USA, Tech. Rep., 2012.
-
(2012)
Perfect Strong Scaling Using No Additional Energy
-
-
Demmel, J.1
Gearhart, A.2
Schwartz, O.3
Lipschitz, B.4
-
22
-
-
85007350084
-
Energy-Time Trade-offs in VLSI Computations
-
Foundations of Software Technology and Theoretical Computer Science
-
A. Tyagi, "Energy-Time Trade-offs in VLSI Computations," in Foundations of Software Technology and Theoretical Computer Science, vol. LNCS 405, 1989, pp. 301-311.
-
(1989)
LNCS
, vol.405
, pp. 301-311
-
-
Tyagi, A.1
-
23
-
-
0035247631
-
Towards an energy complexity of computation
-
Feb.
-
A. J. Martin, "Towards an energy complexity of computation," Information Processing Letters, vol. 77, no. 2-4, pp. 181-187, Feb. 2001.
-
(2001)
Information Processing Letters
, vol.77
, Issue.2-4
, pp. 181-187
-
-
Martin, A.J.1
-
24
-
-
24944449083
-
Towards a model of energy complexity for algorithms
-
IEEE
-
R. Jain, D. Molnar, and Z. Ramzan, "Towards a model of energy complexity for algorithms," in IEEE Wireless Communications and Networking Conference, 2005. IEEE, 2005, pp. 1884-1890.
-
(2005)
IEEE Wireless Communications and Networking Conference, 2005
, pp. 1884-1890
-
-
Jain, R.1
Molnar, D.2
Ramzan, Z.3
-
26
-
-
77951494930
-
Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architectures
-
Vienna, Austria: IEEE, Sep.
-
V. A. Korthikanti and G. Agha, "Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architectures," in 2009 International Conference on Parallel Processing. Vienna, Austria: IEEE, Sep. 2009, pp. 212-219.
-
(2009)
2009 International Conference on Parallel Processing
, pp. 212-219
-
-
Korthikanti, V.A.1
Agha, G.2
-
27
-
-
84880264724
-
-
INRIA, Grenoble, France, Tech. Rep. October
-
G. Aupy, A. Benoit, and Y. Robert, "Energy-aware scheduling under reliability and makespan constraints," INRIA, Grenoble, France, Tech. Rep. October, 2011.
-
(2011)
Energy-aware Scheduling under Reliability and Makespan Constraints
-
-
Aupy, G.1
Benoit, A.2
Robert, Y.3
-
28
-
-
33845388509
-
Performance-constrained distributed dvs scheduling for scientific applications on power-aware clusters
-
Proceedings of the 2005 ACM/IEEE conference on Supercomputing, ser. Washington, DC, USA: IEEE Computer Society
-
R. Ge, X. Feng, and K. W. Cameron, "Performance-constrained distributed dvs scheduling for scientific applications on power-aware clusters," in Proceedings of the 2005 ACM/IEEE conference on Supercomputing, ser. SC '05. Washington, DC, USA: IEEE Computer Society, 2005, pp. 34-.
-
(2005)
SC '05
, pp. 34
-
-
Ge, R.1
Feng, X.2
Cameron, K.W.3
-
29
-
-
34248638757
-
Analyzing the energy-time trade-off in high-performance computing applications
-
DOI 10.1109/TPDS.2007.1026
-
V. W. Freeh, D. K. Lowenthal, F. Pan, N. Kappiah, R. Springer, B. L. Rountree, and M. E. Femal, "Analyzing the energy-time trade-off in high-performance computing applications," IEEE Trans. Parallel Distrib. Syst., vol. 18, no. 6, pp. 835-848, Jun. 2007. (Pubitemid 46767744)
-
(2007)
IEEE Transactions on Parallel and Distributed Systems
, vol.18
, Issue.6
, pp. 835-848
-
-
Freeh, V.W.1
Lowenthal, D.K.2
Pan, F.3
Kappiah, N.4
Springer, R.5
Rountree, B.L.6
Femal, M.E.7
-
30
-
-
79953078406
-
Power and performance characterization of computational kernels on the gpu
-
Y. Jiao, H. Lin, P. Balaji, and W. Feng, "Power and performance characterization of computational kernels on the gpu," in Green Computing and Communications (GreenCom), 2010 IEEE/ACM Int'l Conference on Int'l Conference on Cyber, Physical and Social Computing (CPSCom), dec. 2010, pp. 221-228.
-
Green Computing and Communications (GreenCom), 2010 IEEE/ACM Int'l Conference on Int'l Conference on Cyber, Physical and Social Computing (CPSCom), Dec. 2010
, pp. 221-228
-
-
Jiao, Y.1
Lin, H.2
Balaji, P.3
Feng, W.4
-
32
-
-
80955140979
-
An iso-energy-efficient approach to scalable system power-performance optimization
-
Proceedings of the 2011 IEEE International Conference on Cluster Computing, ser. Washington, DC, USA: IEEE Computer Society
-
S. Song, M. Grove, and K. W. Cameron, "An iso-energy-efficient approach to scalable system power-performance optimization," in Proceedings of the 2011 IEEE International Conference on Cluster Computing, ser. CLUSTER '11. Washington, DC, USA: IEEE Computer Society, 2011, pp. 262-271.
-
(2011)
CLUSTER '11
, pp. 262-271
-
-
Song, S.1
Grove, M.2
Cameron, K.W.3
-
33
-
-
84874590024
-
Energy footprint of advanced dense numerical linear algebra using tile algorithms on multicore architecture
-
J. Dongarra, H. Ltaief, P. Luszczek, and V. M. Weaver, "Energy footprint of advanced dense numerical linear algebra using tile algorithms on multicore architecture," in The 2nd International Conference on Cloud and Green Computing, Nov. 2012.
-
The 2nd International Conference on Cloud and Green Computing, Nov. 2012
-
-
Dongarra, J.1
Ltaief, H.2
Luszczek, P.3
Weaver, V.M.4
-
34
-
-
33746313271
-
Power and energy profiling of scientific applications on distributed systems
-
X. Feng, R. Ge, and K. Cameron, "Power and energy profiling of scientific applications on distributed systems," in Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS), april 2005, p. 34.
-
Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS), April 2005
, pp. 34
-
-
Feng, X.1
Ge, R.2
Cameron, K.3
-
35
-
-
84940216748
-
Power profiling of Cholesky and QR factorizations on distributed memory systems
-
G. Bosilca, H. Ltaief, and J. Dongarra, "Power profiling of Cholesky and QR factorizations on distributed memory systems," Computer Science - Research and Development, pp. 1-9, 2012.
-
(2012)
Computer Science - Research and Development
, pp. 1-9
-
-
Bosilca, G.1
Ltaief, H.2
Dongarra, J.3
-
36
-
-
77950629423
-
Powerpack: Energy profiling and analysis of high-performance systems and applications
-
May
-
R. Ge, X. Feng, S. Song, H.-C. Chang, D. Li, and K. Cameron, "Powerpack: Energy profiling and analysis of high-performance systems and applications," IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 21, no. 5, pp. 658-671, May 2010.
-
(2010)
IEEE Transactions on Parallel and Distributed Systems (TPDS)
, vol.21
, Issue.5
, pp. 658-671
-
-
Ge, R.1
Feng, X.2
Song, S.3
Chang, H.-C.4
Li, D.5
Cameron, K.6
-
37
-
-
84868119278
-
Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency
-
10.1007/s00450-011-0191-z
-
H. Ltaief, P. Luszczek, and J. Dongarra, "Profiling high performance dense linear algebra algorithms on multicore architectures for power and energy efficiency," Computer Science - Research and Development, pp. 1-11, 10.1007/s00450-011-0191-z.
-
Computer Science - Research and Development
, pp. 1-11
-
-
Ltaief, H.1
Luszczek, P.2
Dongarra, J.3
-
38
-
-
84863752817
-
Looking back and looking forward: Power, performance, and upheaval
-
Jul.
-
H. Esmaeilzadeh, T. Cao, X. Yang, S. M. Blackburn, and K. S. McKinley, "Looking back and looking forward: power, performance, and upheaval," Commun. ACM, vol. 55, no. 7, pp. 105-114, Jul. 2012.
-
(2012)
Commun. ACM
, vol.55
, Issue.7
, pp. 105-114
-
-
Esmaeilzadeh, H.1
Cao, T.2
Yang, X.3
Blackburn, S.M.4
McKinley, K.S.5
-
39
-
-
84859729360
-
Power-management architecture of the intel microarchitecture code-named sandy bridge
-
March-April
-
E. Rotem, A. Naveh, D. Rajwan, A. Ananthakrishnan, and E. Weissmann, "Power-management architecture of the intel microarchitecture code-named sandy bridge," IEEE Micro, vol. 32, no. 2, pp. 20-27, March-April 2012.
-
(2012)
IEEE Micro
, vol.32
, Issue.2
, pp. 20-27
-
-
Rotem, E.1
Naveh, A.2
Rajwan, D.3
Ananthakrishnan, A.4
Weissmann, E.5
-
41
-
-
70450231944
-
An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness
-
New York, NY, USA: ACM
-
S. Hong and H. Kim, "An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness," in Proceedings of the 36th annual International Symposium on Computer Architecture (ISCA). New York, NY, USA: ACM, 2009, pp. 152-163.
-
(2009)
Proceedings of the 36th Annual International Symposium on Computer Architecture (ISCA)
, pp. 152-163
-
-
Hong, S.1
Kim, H.2
-
42
-
-
77957561221
-
An adaptive performance modeling tool for GPU architectures
-
Jan.
-
S. S. Baghsorkhi, M. Delahaye, S. J. Patel, W. D. Gropp, and W. mei W. Hwu, "An adaptive performance modeling tool for GPU architectures," SIGPLAN Not., vol. 45, no. 5, pp. 105-114, Jan. 2010.
-
(2010)
SIGPLAN Not.
, vol.45
, Issue.5
, pp. 105-114
-
-
Baghsorkhi, S.S.1
Delahaye, M.2
Patel, S.J.3
Gropp, W.D.4
Hwu, W.M.W.5
-
43
-
-
79955921273
-
A quantitative performance analysis model for gpu architectures
-
Proceedings of the 2011 IEEE 17th International Symposium on High Performance Computer Architecture, ser. Washington, DC, USA: IEEE Computer Society
-
Y. Zhang and J. D. Owens, "A quantitative performance analysis model for gpu architectures," in Proceedings of the 2011 IEEE 17th International Symposium on High Performance Computer Architecture, ser. HPCA '11. Washington, DC, USA: IEEE Computer Society, 2011, pp. 382-393.
-
(2011)
HPCA '11
, pp. 382-393
-
-
Zhang, Y.1
Owens, J.D.2
-
44
-
-
84868123714
-
Power-aware predictive models of hybrid (MPI/OpenMP) scientific applications on multicore systems
-
10.1007/s00450-011-0190-0
-
C. Lively, X. Wu, V. Taylor, S. Moore, H.-C. Chang, C.-Y. Su, and K. Cameron, "Power-aware predictive models of hybrid (MPI/OpenMP) scientific applications on multicore systems," Computer Science - Research and Development, pp. 1-9, 2011, 10.1007/s00450-011-0190-0.
-
(2011)
Computer Science - Research and Development
, pp. 1-9
-
-
Lively, C.1
Wu, X.2
Taylor, V.3
Moore, S.4
Chang, H.-C.5
Su, C.-Y.6
Cameron, K.7
-
45
-
-
79953116601
-
Statistical power and performance modeling for optimizing the energy efficiency of scientific computing
-
Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing, ser. Washington, DC, USA: IEEE Computer Society
-
B. Subramaniam and W.-C. Feng, "Statistical power and performance modeling for optimizing the energy efficiency of scientific computing," in Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing, ser. GREENCOM-CPSCOM '10. Washington, DC, USA: IEEE Computer Society, 2010, pp. 139-146.
-
(2010)
GREENCOM-CPSCOM '10
, pp. 139-146
-
-
Subramaniam, B.1
Feng, W.-C.2
-
46
-
-
77954994853
-
An integrated GPU power and performance model
-
Jun.
-
S. Hong and H. Kim, "An integrated GPU power and performance model," SIGARCH Comput. Archit. News, vol. 38, no. 3, pp. 280-289, Jun. 2010.
-
(2010)
SIGARCH Comput. Archit. News
, vol.38
, Issue.3
, pp. 280-289
-
-
Hong, S.1
Kim, H.2
-
47
-
-
84873478155
-
Model-based, memory-centric performance and power optimization on numa multiprocessors
-
C. Su, D. Li, D. Nikolopoulos, K. Cameron, B. de Supinski, and E. Leon, "Model-based, memory-centric performance and power optimization on numa multiprocessors," in IEEE International Symposium on Workload Characterization, Nov. 2012.
-
IEEE International Symposium on Workload Characterization, Nov. 2012
-
-
Su, C.1
Li, D.2
Nikolopoulos, D.3
Cameron, K.4
De Supinski, B.5
Leon, E.6
-
48
-
-
0030243819
-
Energy dissipation in general purpose microprocessors
-
sep
-
R. Gonzalez and M. Horowitz, "Energy dissipation in general purpose microprocessors," IEEE J. Solid-State Circuits, vol. 31, no. 9, pp. 1277-1284, sep 1996.
-
(1996)
IEEE J. Solid-State Circuits
, vol.31
, Issue.9
, pp. 1277-1284
-
-
Gonzalez, R.1
Horowitz, M.2
-
49
-
-
84886383678
-
A new energy-aware performance metric
-
C. Bekas and A. Curioni, "A new energy-aware performance metric," in Proceedings of the International Conference on Energy-Aware High-Performance Computing (EnA-HPC), Hamburg, Germany, Sep. 2010.
-
Proceedings of the International Conference on Energy-Aware High-Performance Computing (EnA-HPC), Hamburg, Germany, Sep. 2010
-
-
Bekas, C.1
Curioni, A.2
-
51
-
-
84867423348
-
The Green Index: A metric for evaluating system-wide energy efficiency in HPC systems
-
B. Subramaniam andW.-C. Feng, "The Green Index: A metric for evaluating system-wide energy efficiency in HPC systems," in 8th IEEE Workshop on High-Performance, Power-Aware Computing (HPPAC), Shanghai, China, May 2012.
-
8th IEEE Workshop on High-Performance, Power-Aware Computing (HPPAC), Shanghai, China, May 2012
-
-
Subramaniam, B.1
Feng, W.-C.2
-
52
-
-
84884840394
-
-
Georgia Institute of Technology, School of Computational Science and Engineering, Atlanta, GA, USA, Tech. Rep. GT-CSE-12-01, December
-
J. W. Choi and R. Vuduc, "A roofline model of energy," Georgia Institute of Technology, School of Computational Science and Engineering, Atlanta, GA, USA, Tech. Rep. GT-CSE-12-01, December 2012.
-
(2012)
A Roofline Model of Energy
-
-
Choi, J.W.1
Vuduc, R.2
|