-
4
-
-
80053214717
-
-
H. Anzt, T. Hahn, V. Heuveline, and B. Rocker, "GPU Accelerated Scientific Computing: Evaluation of the NVIDIA Fermi Architecture; Elementary Kernels and Linear Solvers," EMCL Preprint Series, 2010. [Online]. Available: http://www.emcl.kit.edu/preprints/emcl-preprint-2010-04.pdf
-
(2010)
GPU Accelerated Scientific Computing: Evaluation of the NVIDIA Fermi Architecture; Elementary Kernels and Linear Solvers
-
-
Anzt, H.1
Hahn, T.2
Heuveline, V.3
Rocker, B.4
-
7
-
-
70350583245
-
Accelerating scientific computations with mixed precision algorithms
-
M. Baboulin, A. Buttari, J. J. Dongarra, J. Langou, J. Langou, P. Luszczek, J. Kurzak, and S. Tomov, "Accelerating scientific computations with mixed precision algorithms," Computer Physics Communications, vol. 180, no. 12, pp. 2526-2533, 2009.
-
(2009)
Computer Physics Communications
, vol.180
, Issue.12
, pp. 2526-2533
-
-
Baboulin, M.1
Buttari, A.2
Dongarra, J.J.3
Langou, J.4
Langou, J.5
Luszczek, P.6
Kurzak, J.7
Tomov, S.8
-
8
-
-
35548933706
-
Mixed precision iterative refinement techniques for the solution of dense linear systems
-
A. Buttari, J. J. Dongarra, J. Langou, J. Langou, P. Luszczek, and J. Kurzak, "Mixed precision iterative refinement techniques for the solution of dense linear systems," Int. J. of High Performance Computing & Applications, vol. 21, no. 4, pp. 457-486, 2007.
-
(2007)
Int. J. of High Performance Computing & Applications
, vol.21
, Issue.4
, pp. 457-486
-
-
Buttari, A.1
Dongarra, J.J.2
Langou, J.3
Langou, J.4
Luszczek, P.5
Kurzak, J.6
-
9
-
-
51849144655
-
Solving dense linear systems on graphics processors
-
S. Barrachina, M. Castillo, F. D. Igual, R. Mayo, and E. S. Quintana-Ortí, "Solving dense linear systems on graphics processors," in Proceedings of the 14th international Euro-Par conference on Parallel Processing, ser. Lecture Notes in Computer Science, 5168, E. Luque, T. Margalef, and D. Benítez, Eds. Springer, 2008, pp. 739-748.
-
(2008)
Proceedings of the 14th International Euro-Par Conference on Parallel Processing
, pp. 739-748
-
-
Barrachina, S.1
Castillo, M.2
Igual, F.D.3
Mayo, R.4
Quintana-Ortí, E.S.5
-
10
-
-
73349092728
-
Exploiting the capabilities of modern GPUs for dense matrix computations
-
S. Barrachina, M. Castillo, F. D. Igual, R. Mayo, E. S. Quintana-Ortí, and G. Quintana-Ortí, "Exploiting the capabilities of modern GPUs for dense matrix computations," Concurrency and Computation: Practice and Experience, vol. 21, no. 18, pp. 2457-2477, 2009.
-
(2009)
Concurrency and Computation: Practice and Experience
, vol.21
, Issue.18
, pp. 2457-2477
-
-
Barrachina, S.1
Castillo, M.2
Igual, F.D.3
Mayo, R.4
Quintana-Ortí, E.S.5
Quintana-Ortí, G.6
-
11
-
-
33947588604
-
Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
-
D. Göddeke, R. Strzodka, and S. Turek, "Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations," Int. J. of Parallel, Emergent and Distributed Systems, vol. 22, no. 4, pp. 221-256, 2007.
-
(2007)
Int. J. of Parallel, Emergent and Distributed Systems
, vol.22
, Issue.4
, pp. 221-256
-
-
Göddeke, D.1
Strzodka, R.2
Turek, S.3
-
12
-
-
84855221145
-
Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations (part 2: Double precision GPUs)
-
D. Göddeke and R. Strzodka, "Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations (part 2: Double precision GPUs)," Fakultät für Mathematik, TU Dortmund, Tech. Rep., July 2008, Ergebnisberichte des Instituts für Angewandte Mathematik, Nummer 370.
-
(2008)
TU Dortmund, Tech. Rep.
, no. 370
-
-
Göddeke, D.1
Strzodka, R.2
-
13
-
-
77958512320
-
Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms
-
H. Anzt, B. Rocker, and V. Heuveline, "Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms," Computer Science - Research and Development, vol. 25, no. 3, pp. 141-149, 2010.
-
(2010)
Computer Science - Research and Development
, vol.25
, Issue.3
, pp. 141-149
-
-
Anzt, H.1
Rocker, B.2
Heuveline, V.3
-
14
-
-
77958509771
-
A new energy aware performance metric
-
C. Bekas and A. Curioni, "A new energy aware performance metric," Computer Science - Research and Development, vol. 25, no. 3, pp. 187-195, 2010.
-
(2010)
Computer Science - Research and Development
, vol.25
, Issue.3
, pp. 187-195
-
-
Bekas, C.1
Curioni, A.2
-
15
-
-
1842829625
-
-
Y. Saad, Iterative Methods for Sparse Linear Systems. Philadelphia, PA, USA: Society for Industrial and Applied Mathematics, 2003.
-
(2003)
Iterative Methods for Sparse Linear Systems
-
-
Saad, Y.1
-
16
-
-
0000048673
-
GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems
-
Y. Saad and M. H. Schultz, "GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems," SIAM J. Sci. Stat. Comput., vol. 7, pp. 856-869, July 1986. [Online]. Available: http://portal.acm.org/citation.cfm?id=14063.14074
-
(1986)
SIAM J. Sci. Stat. Comput.
, vol.7
, pp. 856-869
-
-
Saad, Y.1
Schultz, M.H.2
-
17
-
-
84937419529
-
-
* OS," Document Number: 314774-005US, October 2007, Intel Corporation.
-
(2007)
* OS
-
-
-
18
-
-
0003893794
-
-
Z. Bai, J. Demmel, J. Dongarra, A. Ruhe, and H. van der Vorst, Templates for the Solution of Algebraic Eigenvalue Problems. Philadelphia: SIAM, 2000.
-
(2000)
Templates for the Solution of Algebraic Eigenvalue Problems
-
-
Bai, Z.1
Demmel, J.2
Dongarra, J.3
Ruhe, A.4
Van Der Vorst, H.5
-
19
-
-
83455248313
-
-
"Intel C++ Compiler Options," Intel Corporation, document Number: 307776-002US.
-
Intel C++ Compiler Options
-
-
-
22
-
-
36949040798
-
Analysis of dynamic voltage/frequency scaling in chip-multiprocessors
-
S. Herbert and D. Marculescu, "Analysis of dynamic voltage/frequency scaling in chip-multiprocessors," in Proceedings of the 2007 international symposium on Low power electronics and design, ser. ISLPED '07. New York, NY, USA: ACM, 2007, pp. 38-43.
-
(2007)
Proceedings of the 2007 International Symposium on Low Power Electronics and Design
, pp. 38-43
-
-
Herbert, S.1
Marculescu, D.2
-
23
-
-
0036036849
-
Real-time dynamic voltage scaling for low-power embedded operating systems
-
P. Pillai and K. G. Shin, "Real-time dynamic voltage scaling for low-power embedded operating systems," SIGOPS Oper. Syst. Rev., vol. 35, pp. 89-102, October 2001. [Online]. Available: http://doi.acm.org/10.1145/502059.502044
-
(2001)
SIGOPS Oper. Syst. Rev.
, vol.35
, pp. 89-102
-
-
Pillai, P.1
Shin, K.G.2
-
24
-
-
34248638757
-
Analyzing the energy-time trade-off in high-performance computing applications
-
V. W. Freeh, D. K. Lowenthal, F. Pan, N. Kappiah, R. Springer, B. L. Rountree, and M. E. Femal, "Analyzing the energy-time trade-off in high-performance computing applications," IEEE Trans. Parallel Distrib. Syst., vol. 18, pp. 835-848, June 2007. [Online]. Available: http://portal.acm.org/citation.cfm?id=1263127.1263246
-
(2007)
IEEE Trans. Parallel Distrib. Syst.
, vol.18
, pp. 835-848
-
-
Freeh, V.W.1
Lowenthal, D.K.2
Pan, F.3
Kappiah, N.4
Springer, R.5
Rountree, B.L.6
Femal, M.E.7