-
3
-
-
80051665004
-
Optimized HPL for AMD GPU and multi-core CPU usage
-
June
-
M. Bach, M. Kretz, V. Lindenstruth, and D. Rohr. Optimized HPL for AMD GPU and multi-core CPU usage. Comput. Sci., 26(3-4):153-164, June 2011.
-
(2011)
Comput. Sci.
, vol.26
, Issue.3-4
, pp. 153-164
-
-
Bach, M.1
Kretz, M.2
Lindenstruth, V.3
Rohr, D.4
-
4
-
-
38049058008
-
The impact of multicore on math software
-
A. Buttari, J. Dongarra, J. Kurzak, J. Langou, P. Luszczek, and S. Tomov. The impact of multicore on math software. In Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, PARA'06, 2007.
-
Proceedings of the 8th International Conference on Applied Parallel Computing: State of the Art in Scientific Computing, PARA'06, 2007
-
-
Buttari, A.1
Dongarra, J.2
Kurzak, J.3
Langou, J.4
Luszczek, P.5
Tomov, S.6
-
5
-
-
80051670160
-
Designing and dynamically load balancing hybrid LU for multi/manycore
-
June
-
M. Deisher, M. Smelyanskiy, B. Nickerson, V. W. Lee, M. Chuvelev, and P. Dubey. Designing and dynamically load balancing hybrid LU for multi/manycore. Comput. Sci., 26(3-4), June 2011.
-
(2011)
Comput. Sci.
, vol.26
, Issue.3-4
-
-
Deisher, M.1
Smelyanskiy, M.2
Nickerson, B.3
Lee, V.W.4
Chuvelev, M.5
Dubey, P.6
-
7
-
-
77954021354
-
Linpack evaluation on a supercomputer with heterogeneous accelerators
-
IEEE
-
T. Endo, A. Nukada, S. Matsuoka, and N. Maruyama. Linpack evaluation on a supercomputer with heterogeneous accelerators. In IPDPS, pages 1-8. IEEE, 2010.
-
(2010)
IPDPS
, pp. 1-8
-
-
Endo, T.1
Nukada, A.2
Matsuoka, S.3
Maruyama, N.4
-
10
-
-
44249094647
-
Anatomy of high-performance matrix multiplication
-
May
-
K. Goto and R. A. v. d. Geijn. Anatomy of high-performance matrix multiplication. ACM Trans. Math. Softw., 34(3):12:1-12:25, May 2008.
-
(2008)
ACM Trans. Math. Softw.
, vol.34
, Issue.3
-
-
Goto, K.1
Geijn, R.A.V.D.2
-
11
-
-
48849089104
-
High-performance implementation of the level-3 blas
-
July
-
K. Goto and R. Van De Geijn. High-performance implementation of the level-3 blas. ACM Trans. Math. Softw., 35(1):4:1-4:14, July 2008.
-
(2008)
ACM Trans. Math. Softw.
, vol.35
, Issue.1
-
-
Goto, K.1
Van De Geijn, R.2
-
16
-
-
84864149777
-
A scalable framework for heterogeneous gpu-based clusters
-
New York, NY, USA, ACM
-
F. Song and J. Dongarra. A scalable framework for heterogeneous gpu-based clusters. In Proceedinbgs of the 24th ACM symposium on Parallelism in algorithms and architectures, SPAA '12, pages 91-100, New York, NY, USA, 2012. ACM.
-
(2012)
Proceedinbgs of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA '12
, pp. 91-100
-
-
Song, F.1
Dongarra, J.2
-
19
-
-
84881285062
-
-
June 2012 release
-
www.top500.org. TOP500 list, June 2012 release. 2012.
-
(2012)
TOP500 List
-
-
-
20
-
-
78649488776
-
Adaptive optimization for petascale heterogeneous cpu/gpu computing
-
sept.
-
C. Yang, F. Wang, Y. Du, J. Chen, J. Liu, H. Yi, and K. Lu. Adaptive optimization for petascale heterogeneous cpu/gpu computing. In Cluster Computing (CLUSTER), 2010 IEEE International Conference on, pages 19-28, sept. 2010.
-
(2010)
Cluster Computing (CLUSTER), 2010 IEEE International Conference on
, pp. 19-28
-
-
Yang, C.1
Wang, F.2
Du, Y.3
Chen, J.4
Liu, J.5
Yi, H.6
Lu, K.7
-
21
-
-
20744459570
-
Is search really necessary to generate high-performance BLAS?
-
special issue on "Program Generation, Optimization, and Adaptation"
-
K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proceedings of the IEEE, 93(2), 2005. special issue on "Program Generation, Optimization, and Adaptation".
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
-
22
-
-
70450228489
-
The libflame library for dense matrix computations
-
Nov.
-
F. G. V. Zee, E. Chan, R. A. v. d. Geijn, E. S. Quintana- Orti, and G. Quintana-Orti. The libflame library for dense matrix computations. IEEE Des. Test, 11(6):56-63, Nov. 2009.
-
(2009)
IEEE Des. Test
, vol.11
, Issue.6
, pp. 56-63
-
-
Zee, F.G.V.1
Chan, E.2
Geijn, R.A.V.D.3
Quintana-Orti, E.S.4
Quintana-Orti, G.5
|