-
1
-
-
70350591011
-
-
Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Matthew G. Knepley, Lois Curfman McInnes, Barry F. Smith, Hong Zhang, PETSc Web page, http://www.mcs.anl.gov/petsc, 2001
-
Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Matthew G. Knepley, Lois Curfman McInnes, Barry F. Smith, Hong Zhang, PETSc Web page, http://www.mcs.anl.gov/petsc, 2001
-
-
-
-
4
-
-
70350605445
-
-
Alfredo Buttari, Jack J. Dongarra, Jakub Kurzak, PLASMA Web page, http://icl.cs.utk.edu/plasma, 2009
-
Alfredo Buttari, Jack J. Dongarra, Jakub Kurzak, PLASMA Web page, http://icl.cs.utk.edu/plasma, 2009
-
-
-
-
5
-
-
38049058008
-
The impact of multicore on math software
-
Proceedings of PARA 2006, Applied Parallel Computing. State of the Art in Scientific Computing, Springer
-
Buttari A., Dongarra J.J., Kurzak J., Langou J., Luszczek P., and Tomov S. The impact of multicore on math software. Proceedings of PARA 2006, Applied Parallel Computing. State of the Art in Scientific Computing. Lecture Notes in Computer Science vol. 4699 (2006), Springer 1-10
-
(2006)
Lecture Notes in Computer Science
, vol.4699
, pp. 1-10
-
-
Buttari, A.1
Dongarra, J.J.2
Kurzak, J.3
Langou, J.4
Luszczek, P.5
Tomov, S.6
-
6
-
-
70350578630
-
-
Alfredo Buttari, Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra, George Bosilca, SCOP3: A rough guide to scientific computing on the PlayStation 3, Technical report, Innovative Computing Laboratory, University of Tennessee Knoxville, 2007. UT-CS-07-595
-
Alfredo Buttari, Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra, George Bosilca, SCOP3: A rough guide to scientific computing on the PlayStation 3, Technical report, Innovative Computing Laboratory, University of Tennessee Knoxville, 2007. UT-CS-07-595
-
-
-
-
7
-
-
70350616155
-
-
Phillip Colella, Thom H. Dunning Jr., William D. Gropp, David E. Keyes, A science-based case for large-scale simulation, Technical report, Office of Science, US Department of Energy, http://www.pnl.gov/scales, July 2003
-
Phillip Colella, Thom H. Dunning Jr., William D. Gropp, David E. Keyes, A science-based case for large-scale simulation, Technical report, Office of Science, US Department of Energy, http://www.pnl.gov/scales, July 2003
-
-
-
-
8
-
-
20444470676
-
Numerical solution of the two-dimensional shallow water equations by the application of relaxation methods
-
Delis A.I., and Katsaounis T.D. Numerical solution of the two-dimensional shallow water equations by the application of relaxation methods. Applied Mathematical Modelling 29 8 (2005) 754-783
-
(2005)
Applied Mathematical Modelling
, vol.29
, Issue.8
, pp. 754-783
-
-
Delis, A.I.1
Katsaounis, T.D.2
-
9
-
-
26444516623
-
Fixed and adaptive cache aware algorithms for multigrid methods
-
Multigrid Methods VI. Dick E., Riemslagh K., and Vierendeels J. (Eds), Springer
-
Douglas C.C., Hu J., Karl W., Kowarschik M., Rüde U., and Weiß C. Fixed and adaptive cache aware algorithms for multigrid methods. In: Dick E., Riemslagh K., and Vierendeels J. (Eds). Multigrid Methods VI. Lecture Notes in Computational Science and Engineering vol. 14 (2000), Springer 87-93
-
(2000)
Lecture Notes in Computational Science and Engineering
, vol.14
, pp. 87-93
-
-
Douglas, C.C.1
Hu, J.2
Karl, W.3
Kowarschik, M.4
Rüde, U.5
Weiß, C.6
-
10
-
-
0002349926
-
Cache optimization for structured and unstructured grid multigrid
-
Douglas C.C., Hu J., Kowarschik M., Rüde U., and Weiß C. Cache optimization for structured and unstructured grid multigrid. Electronic Transactions on Numerical Analysis 10 (2000) 21-40
-
(2000)
Electronic Transactions on Numerical Analysis
, vol.10
, pp. 21-40
-
-
Douglas, C.C.1
Hu, J.2
Kowarschik, M.3
Rüde, U.4
Weiß, C.5
-
11
-
-
70350582737
-
A note on cache memory methods for multigrid in three dimensions
-
Douglas C.C., and Thorne D.T. A note on cache memory methods for multigrid in three dimensions. Contemporary Mathematics 306 (2002) 167-177
-
(2002)
Contemporary Mathematics
, vol.306
, pp. 167-177
-
-
Douglas, C.C.1
Thorne, D.T.2
-
12
-
-
34548207355
-
-
Kayvon Fatahalian, Timothy J. Knight, Mike Houston, Mattan Erez, Daniel R. Horn, Larkhoon Leem, Ji Young Park, Manman Ren, Alex Aiken, William J. Dally, Pat Hanrahan, Sequoia: Programming the memory hierarchy, in: SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, November 2006
-
Kayvon Fatahalian, Timothy J. Knight, Mike Houston, Mattan Erez, Daniel R. Horn, Larkhoon Leem, Ji Young Park, Manman Ren, Alex Aiken, William J. Dally, Pat Hanrahan, Sequoia: Programming the memory hierarchy, in: SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, November 2006
-
-
-
-
13
-
-
70350604398
-
-
Dominik Göddeke, Robert Strzodka, Performance and accuracy of hardware-oriented native, emulated- and mixed-precision solvers in FEM simulations (Part 2: Double precision GPUs), Technical report, Fakultät für Mathematik, Technische Universität Dortmund, 2008 (Invited talk at NVISION 2008 - The World of Visual Computing, nummer 370)
-
Dominik Göddeke, Robert Strzodka, Performance and accuracy of hardware-oriented native, emulated- and mixed-precision solvers in FEM simulations (Part 2: Double precision GPUs), Technical report, Fakultät für Mathematik, Technische Universität Dortmund, 2008 (Invited talk at NVISION 2008 - The World of Visual Computing, nummer 370)
-
-
-
-
14
-
-
33947588604
-
Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
-
Göddeke D., Strzodka R., and Turek S. Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. International Journal of Parallel, Emergent and Distributed Systems 22 4 (2007) 221-256
-
(2007)
International Journal of Parallel, Emergent and Distributed Systems
, vol.22
, Issue.4
, pp. 221-256
-
-
Göddeke, D.1
Strzodka, R.2
Turek, S.3
-
15
-
-
70350584775
-
-
Kazushige Goto, GotoBLAS, http://www.tacc.utexas.edu/resources/software/#blas
-
GotoBLAS
-
-
Goto, K.1
-
16
-
-
34548020763
-
Representation-transparent matrix algorithms with scalable performance
-
Peter Gottschling, David S. Wise, Michael D. Adams, Representation-transparent matrix algorithms with scalable performance, in: ICS '07: Proceedings of the 21st Annual International Conference on Supercomputing, 2007, pp. 116-125
-
(2007)
ICS '07: Proceedings of the 21st Annual International Conference on Supercomputing
, pp. 116-125
-
-
Gottschling, P.1
Wise, D.S.2
Adams, M.D.3
-
17
-
-
20044393250
-
An overview of the Trilinos project
-
Heroux M.A., Bartlett R.A., Howle V.E., Hoekstra R.J., Hu J.J., Kolda T.G., Lehoucq R.B., Long K.R., Pawlowski R.P., Phipps E.T., Salinger A.G., Thornquist H.K., Tuminaro R.S., Willenbring J.M., Williams A., and Stanley K.S. An overview of the Trilinos project. ACM Transactions on Mathematical Software 31 3 (2005) 397-423. http://trilinos.sandia.gov/
-
(2005)
ACM Transactions on Mathematical Software
, vol.31
, Issue.3
, pp. 397-423
-
-
Heroux, M.A.1
Bartlett, R.A.2
Howle, V.E.3
Hoekstra, R.J.4
Hu, J.J.5
Kolda, T.G.6
Lehoucq, R.B.7
Long, K.R.8
Pawlowski, R.P.9
Phipps, E.T.10
Salinger, A.G.11
Thornquist, H.K.12
Tuminaro, R.S.13
Willenbring, J.M.14
Williams, A.15
Stanley, K.S.16
-
18
-
-
70350602376
-
-
IBM Corporation, SPE Runtime Management Library, http://www-01.ibm.com/chips/techlib/techlib.nsf/pages/main, 2007
-
(2007)
SPE Runtime Management Library
-
-
-
19
-
-
25844503119
-
Introduction to the Cell multiprocessor
-
Kahle J.A., Day M.N., Hofstee H.P., Johns C.R., Maeurer T.R., and Shippy D. Introduction to the Cell multiprocessor. IBM Journal of Research and Development 45 4/5 (2005) 589-604. http://www.research.ibm.com/journal/rd/494/kahle.html
-
(2005)
IBM Journal of Research and Development
, vol.45
, Issue.4-5
, pp. 589-604
-
-
Kahle, J.A.1
Day, M.N.2
Hofstee, H.P.3
Johns, C.R.4
Maeurer, T.R.5
Shippy, D.6
-
20
-
-
14044257293
-
Terascale implicit methods for partial differential equations
-
Recent Advances in Numerical Methods for Partial Differential Equations and Applications. Feng X., and Schulze T.P. (Eds), American Mathematical Society
-
Keyes D.E. Terascale implicit methods for partial differential equations. In: Feng X., and Schulze T.P. (Eds). Recent Advances in Numerical Methods for Partial Differential Equations and Applications. Contemporary Mathematics vol. 306 (January 2002), American Mathematical Society 29-84
-
(2002)
Contemporary Mathematics
, vol.306
, pp. 29-84
-
-
Keyes, D.E.1
-
21
-
-
34548206782
-
Tools and techniques for performance - exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems)
-
Julie Langou, Julien Langou, Piotr Luszczek, Jakub Kurzak, Alfredo Buttari, Jack J. Dongarra, Tools and techniques for performance - exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems), in: SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 2006, p. 113
-
(2006)
SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
, pp. 113
-
-
Langou, J.1
Langou, J.2
Luszczek, P.3
Kurzak, J.4
Buttari, A.5
Dongarra, J.J.6
-
22
-
-
44849137198
-
NVIDIA Tesla: A unified graphics and computing architecture
-
Lindholm E., Nickolls J., Oberman S., and Montrym J. NVIDIA Tesla: A unified graphics and computing architecture. IEEE Micro 28 2 (2008) 39-55
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
23
-
-
70350580611
-
-
NVIDIA Corporation, NVIDIA CUDA Compute Unified Device Architecture Programming Guide (Version 2.0), http://www.nvidia.com/cuda, 2008
-
NVIDIA Corporation, NVIDIA CUDA Compute Unified Device Architecture Programming Guide (Version 2.0), http://www.nvidia.com/cuda, 2008
-
-
-
-
24
-
-
49049088756
-
GPU computing
-
Owens J.D., Houston M., Luebke D., Green S., Stone J.E., and Phillips J.C. GPU computing. Proceedings of the IEEE 96 5 (2008) 879-899
-
(2008)
Proceedings of the IEEE
, vol.96
, Issue.5
, pp. 879-899
-
-
Owens, J.D.1
Houston, M.2
Luebke, D.3
Green, S.4
Stone, J.E.5
Phillips, J.C.6
-
25
-
-
33947588048
-
A survey of general-purpose computation on graphics hardware
-
Owens J.D., Luebke D., Govindaraju N., Harris M., Krüger J., Lefohn A.E., and Purcell T.J. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 26 1 (2007) 80-113
-
(2007)
Computer Graphics Forum
, vol.26
, Issue.1
, pp. 80-113
-
-
Owens, J.D.1
Luebke, D.2
Govindaraju, N.3
Harris, M.4
Krüger, J.5
Lefohn, A.E.6
Purcell, T.J.7
-
27
-
-
27344435504
-
-
Dac C. Pham, Shigehiro Asano, Mark Bolliger, Michael N. Day, H. Peter Hofstee, Charles R. Johns, James A. Kahle, Atsushi Kameyama, John Keaty, Yoshio Masubuchi, Mack Riley, David Shippy, Daniel L. Stasiak, Masakazu Suzuoki, M. Wang, James Warnock, Steve Weitzel, Dieter Wendel, Takeshi Yamazaki, Kazuaki Yazawa, The design and implementation of a first-generation CELL processor, in: Solid-State Circuits Conference, ISSCC 2005, Digest of Technical Papers, 1, February 2005, pp. 184-592
-
Dac C. Pham, Shigehiro Asano, Mark Bolliger, Michael N. Day, H. Peter Hofstee, Charles R. Johns, James A. Kahle, Atsushi Kameyama, John Keaty, Yoshio Masubuchi, Mack Riley, David Shippy, Daniel L. Stasiak, Masakazu Suzuoki, M. Wang, James Warnock, Steve Weitzel, Dieter Wendel, Takeshi Yamazaki, Kazuaki Yazawa, The design and implementation of a first-generation CELL processor, in: Solid-State Circuits Conference, ISSCC 2005, Digest of Technical Papers, vol. 1, February 2005, pp. 184-592
-
-
-
-
28
-
-
84946717199
-
-
Sony Corporation, IBM Corporation, Cell BE processor and blade systems, http://www.ibm.com/developerworks/power/cell
-
Sony Corporation, Toshiba Corporation, IBM Corporation, Cell BE processor and blade systems, http://www-03.ibm.com/technology/splash/qs20/, http://www.ibm.com/developerworks/power/cell
-
Toshiba Corporation
-
-
-
29
-
-
26444596160
-
Hardware-oriented numerics and concepts for PDE software
-
Turek S., Becker C., and Kilian S. Hardware-oriented numerics and concepts for PDE software. Future Generation Computer Systems 22 1-2 (2004) 217-238
-
(2004)
Future Generation Computer Systems
, vol.22
, Issue.1-2
, pp. 217-238
-
-
Turek, S.1
Becker, C.2
Kilian, S.3
-
30
-
-
34247349114
-
-
Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine Yelick, The potential of the Cell processor for scientific computing, in: CF '06: Proceedings of the ACM International Conference on Computing Frontiers, May 2006, pp. 9-20
-
Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine Yelick, The potential of the Cell processor for scientific computing, in: CF '06: Proceedings of the ACM International Conference on Computing Frontiers, May 2006, pp. 9-20
-
-
-
|