-
1
-
-
50249141640
-
-
24 July 2007
-
http://top500.org [24 July 2007].
-
-
-
-
2
-
-
27344435504
-
The design and implementation of a first-generation CELL processor
-
Pham D, Asano S, Bolliger M, Day MN, Hofstee HP, Johns C, Kahle J, Kameyama A, Keaty J, Masubuchi Y, Riley M, Shippy D, Stasiak D, Suzuoki M, Wang M, Warnock J, Weitzel S, Wendel D, Yamazaki T, Yazawa K. The design and implementation of a first-generation CELL processor. IEEE International Solid-State Circuits Conference 2005; 184-185.
-
(2005)
IEEE International Solid-State Circuits Conference
, pp. 184-185
-
-
Pham, D.1
Asano, S.2
Bolliger, M.3
Day, M.N.4
Hofstee, H.P.5
Johns, C.6
Kahle, J.7
Kameyama, A.8
Keaty, J.9
Masubuchi, Y.10
Riley, M.11
Shippy, D.12
Stasiak, D.13
Suzuoki, M.14
Wang, M.15
Warnock, J.16
Weitzel, S.17
Wendel, D.18
Yamazaki, T.19
Yazawa, K.20
-
3
-
-
50249133005
-
-
Teraflops research chip, 24 July 2007
-
Teraflops research chip. http://www.intel.com/research/platform/terascale/teraflops.htm [24 July 2007].
-
-
-
-
4
-
-
0003706460
-
-
3rd edn, SIAM: Philadelphia
-
Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J, Du Croz J, Greenbaum A, Hammarling S, McKenney A, Sorensen D. LAPACK Users' Guide (3rd edn). SIAM: Philadelphia, 1999.
-
(1999)
LAPACK Users' Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.4
Demmel, J.5
Dongarra, J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.11
-
5
-
-
0030564728
-
ScaLAPACK: A portable linear algebra library for distributed memory computers - Design issues and performance
-
Also as LAPACK Working Note #95
-
Choi J, Demmel J, Dhillon I, Dongarra J, Ostrouchov S, Petitet A, Stanley K, Walker D, Whaley RC. ScaLAPACK: A portable linear algebra library for distributed memory computers - Design issues and performance. Computer Physics Communications 1996; 97:1-15. (Also as LAPACK Working Note #95).
-
(1996)
Computer Physics Communications
, vol.97
, pp. 1-15
-
-
Choi, J.1
Demmel, J.2
Dhillon, I.3
Dongarra, J.4
Ostrouchov, S.5
Petitet, A.6
Stanley, K.7
Walker, D.8
Whaley, R.C.9
-
6
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
1-2:3-25
-
Whaley RC, Petitet A, Dongarra J. Automated empirical optimization of software and the ATLAS project. Parallel Computing 2001; 27(1-2):3-25.
-
(2001)
Parallel Computing
, vol.27
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.3
-
7
-
-
34548762396
-
High-performance implementation of the level-3 BLAS
-
Technical Report TR-2006-23, Department of Computer Sciences, The University of Texas at Austin, FLAME Working Note 20
-
Goto K, van de Geijn R. High-performance implementation of the level-3 BLAS. Technical Report TR-2006-23, Department of Computer Sciences, The University of Texas at Austin, 2006. FLAME Working Note 20.
-
(2006)
-
-
Goto, K.1
van de Geijn, R.2
-
8
-
-
50249110532
-
-
24 July 2007
-
http://www.intel.com/cd/software/products/asmo-na/eng/307757.htm [24 July 2007].
-
-
-
-
9
-
-
50249118960
-
-
24 July 2007
-
http://developer.amd.com/acml.jsp [24 July 2007].
-
-
-
-
10
-
-
50249118105
-
-
International Organization for Standardization. Information Technology - Portable Operating System Interface (POSIX) - Part 1: System Application Program Interface (API) [C Language], ISO: Adr, 1996; 743. http://www.iso.ch/cate/d24426.html [24 July 2007].
-
-
-
-
11
-
-
0002806690
-
OpenMP: An industry-standard API for shared-memory programming
-
Dagum L, Menon R. OpenMP: An industry-standard API for shared-memory programming. IEEE Computational Science and Engineering 1998; 5(1):46-55.
-
(1998)
IEEE Computational Science and Engineering
, vol.5
, Issue.1
, pp. 46-55
-
-
Dagum, L.1
Menon, R.2
-
12
-
-
84947808952
-
-
Choi J, Dongarra J, Ostrouchov S, Petitet A, Walker DW, Clinton Whaley R. A proposal for a set of parallel basic linear algebra subprograms. PARA '95: Proceedings of the Second International Workshop on Applied Parallel Computing, Computations in Physics, Chemistry and Engineering Science, London, U.K., 1996. Springer: Berlin, 1996; 107-114.
-
-
-
-
13
-
-
50249129153
-
-
Message Passing Interface Forum. MPI: A message-passing interface standard. The International Journal of Supercomputer Applications and High Performance Computing 1994; 8:165-414.
-
-
-
-
14
-
-
38049058008
-
The impact of multicore on math software
-
Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing Para06, Umeå, Sweden
-
Buttari A, Dongarra J, Kurzak J, Langou J, Luszczek P, Tomov S. The impact of multicore on math software. Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing (Para06). Springer's Lecture Notes in Computer Science 4699, Umeå, Sweden, 2007; 1-10.
-
(2007)
Springer's Lecture Notes in Computer Science
, vol.4699
, pp. 1-10
-
-
Buttari, A.1
Dongarra, J.2
Kurzak, J.3
Langou, J.4
Luszczek, P.5
Tomov, S.6
-
15
-
-
35248843628
-
Supermatrix out-of-order scheduling of matrix operations for SMP and multicore architectures
-
New York, NY, U.S.A, ACM: New York
-
Chan E, Quintana-Orti ES, Quintana-Orti G, van de Geijn R. Supermatrix out-of-order scheduling of matrix operations for SMP and multicore architectures. SPAA '07: Proceedings of the 19th Annual ACM Symposium on Parallel Algorithms and Architectures, New York, NY, U.S.A., 2007. ACM: New York, 2007; 116-125.
-
(2007)
SPAA '07: Proceedings of the 19th Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 116-125
-
-
Chan, E.1
Quintana-Orti, E.S.2
Quintana-Orti, G.3
van de Geijn, R.4
-
16
-
-
38049005629
-
Implementing linear algebra routines on multicore processors with pipelining and a look ahead
-
Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing Para06, Umeå, Sweden
-
Kurzak J, Dongarra J. Implementing linear algebra routines on multicore processors with pipelining and a look ahead. Proceedings of Workshop on State-of-the-art in Scientific and Parallel Computing (Para06). Springer's Lecture Notes in Computer Science 4699, Umeå, Sweden, 2007; 147-156.
-
(2007)
Springer's Lecture Notes in Computer Science
, vol.4699
, pp. 147-156
-
-
Kurzak, J.1
Dongarra, J.2
-
17
-
-
50249166476
-
Solving systems of linear equations on the CELL processor using Cholesky factorization
-
Technical Report UT-CS-07-596, Innovative Computing Laboratory, University of Tennessee, Knoxville, April
-
Kurzak J, Buttari A, Dongarra J. Solving systems of linear equations on the CELL processor using Cholesky factorization. Technical Report UT-CS-07-596, Innovative Computing Laboratory, University of Tennessee, Knoxville, April 2007.
-
(2007)
-
-
Kurzak, J.1
Buttari, A.2
Dongarra, J.3
-
18
-
-
0034224207
-
Applying recursion to serial and parallel QR factorization leads to better performance
-
Elmroth E, Gustavson FG. Applying recursion to serial and parallel QR factorization leads to better performance. IBM Journal of Research and Development 2000; 44(4):605-624.
-
(2000)
IBM Journal of Research and Development
, vol.44
, Issue.4
, pp. 605-624
-
-
Elmroth, E.1
Gustavson, F.G.2
-
19
-
-
0004236492
-
-
3rd edn, Johns Hopkins University Press: Baltimore, MD, 1996
-
Golub G, Van Loan C. Matrix Computations (3rd edn). Johns Hopkins University Press: Baltimore, MD, 1996.
-
Matrix Computations
-
-
Golub, G.1
Van Loan, C.2
-
20
-
-
0004094905
-
-
1st edn, SIAM: Philadelphia, PA
-
Stewart GW. Matrix Algorithms (1st edn), vol. 1. SIAM: Philadelphia, PA, 1998.
-
(1998)
Matrix Algorithms
, vol.1
-
-
Stewart, G.W.1
-
23
-
-
45449092245
-
FORTRAN subroutines for out-of-core solutions of large complex linear systems
-
Technical Report CR-159142, NASA, November
-
Yip EL. FORTRAN subroutines for out-of-core solutions of large complex linear systems. Technical Report CR-159142, NASA, November 1979.
-
(1979)
-
-
Yip, E.L.1
-
25
-
-
45449110534
-
Updating an LU factorization with pivoting
-
Technical Report TR-2006-42, Department of Computer Sciences, The University of Texas at Austin, FLAME Working Note 21
-
Quintana-Orti E, van de Geijn R. Updating an LU factorization with pivoting. Technical Report TR-2006-42, Department of Computer Sciences, The University of Texas at Austin, 2006. FLAME Working Note 21.
-
(2006)
-
-
Quintana-Orti, E.1
van de Geijn, R.2
-
26
-
-
0029358998
-
A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-Hessenberg form
-
Berry MW, Dongarra JJ, Kim Y. A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-Hessenberg form. Parallel Computing 1995; 21(8):1189-1211.
-
(1995)
Parallel Computing
, vol.21
, Issue.8
, pp. 1189-1211
-
-
Berry, M.W.1
Dongarra, J.J.2
Kim, Y.3
-
28
-
-
50249182748
-
-
SMP Superscalar (SMPSs) User's Manual, July 2007. www.bsc.es/media/1002.pdf [24 July 2007].
-
-
-