-
1
-
-
0025402476
-
A set of level 3 basic linear algebra subprograms
-
Dongarra, J.J., Du Croz, J., Duff, I.S., Hammarling, S.: A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Softw. 16, 1-17 (1990)
-
(1990)
ACM Trans. Math. Softw
, vol.16
, pp. 1-17
-
-
Dongarra, J.J.1
Du Croz, J.2
Duff, I.S.3
Hammarling, S.4
-
6
-
-
0025402476
-
A set of Level 3 Basic Linear Algebra Subprograms
-
Dongarra, J.J., Du Croz, J., Duff, I., Hammarling, S.: A set of Level 3 Basic Linear Algebra Subprograms. ACM Trans. Math. Softw. 16(1), 1-17 (1990)
-
(1990)
ACM Trans. Math. Softw
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
Du Croz, J.2
Duff, I.3
Hammarling, S.4
-
7
-
-
0003706460
-
-
2nd edn. SIAM, Philadelphia
-
Anderson, E., Bai, Z., Bischof, C., Demmel, J., Dongarra, J., Du Croz, J., Greenbaum, A., Hammarling, S., McKenney, A., Ostrouchov, S., Sorensen, D.: LAPACK Users' Guide, 2nd edn. SIAM, Philadelphia (1995)
-
(1995)
LAPACK Users' Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Demmel, J.4
Dongarra, J.5
Du Croz, J.6
Greenbaum, A.7
Hammarling, S.8
McKenney, A.9
Ostrouchov, S.10
Sorensen, D.11
-
8
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Whaley, R.C., Petitet, A., Dongarra, J.J.: Automated empirical optimization of software and the ATLAS project. Parallel Computing 27(1-2), 3-35 (2001)
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
9
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Bilmes, J., Asanovic, K., Chin, C.W., Demmel, J.: Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology. In: International Conference on Supercomputing, pp. 340-347 (1997)
-
(1997)
International Conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.W.3
Demmel, J.4
-
10
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
Proceedings of SciDAC 2005. Journal of Physics: Conference Series. Institute of Physics Publishing (June 2005)
-
Vuduc, R., Demmel, J., Yelick, K.: OSKI: A library of automatically tuned sparse matrix kernels. In: Proceedings of SciDAC 2005. Journal of Physics: Conference Series, vol. 16, pp. 521-530. Institute of Physics Publishing (June 2005)
-
(2005)
Journal of Physics: Conference Series
, vol.16
, pp. 521-530
-
-
Vuduc, R.1
Demmel, J.2
Yelick, K.3
-
11
-
-
68849109682
-
-
Fowler, R., Jin, G., Mellor-Crummey, J.: Increasing temporal locality with skewing and recursive blocking. In: Proceedings of SC 2001: High-Performance Computing and Networking (November 2001)
-
-
-
-
12
-
-
34548765138
-
POET: Parameterized optimizations for empirical tuning
-
IEEE, Los Alamitos
-
Yi, Q., Seymour, K., You, H., Vuduc, R., Quinlan, D.: POET: Parameterized optimizations for empirical tuning. In: Proceedings of the Parallel and Distributed Processing Symposium, 2007, pp. 1-8. IEEE, Los Alamitos (2007)
-
(2007)
Proceedings of the Parallel and Distributed Processing Symposium
, pp. 1-8
-
-
Yi, Q.1
Seymour, K.2
You, H.3
Vuduc, R.4
Quinlan, D.5
-
13
-
-
57349139452
-
Pluto: A practical and fully automatic polyhedral program optimization system
-
Tucson, AZ (June 2008)
-
Bondhugula, U., Hartono, A., Ramanujam, J., Sadayappan, P.: Pluto: A practical and fully automatic polyhedral program optimization system. In: Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language Design and Implementation (PLDI 2008), Tucson, AZ (June 2008)
-
(2008)
Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language Design and Implementation (PLDI 2008)
-
-
Bondhugula, U.1
Hartono, A.2
Ramanujam, J.3
Sadayappan, P.4
-
15
-
-
34548762396
-
High-performance implementation of the level-3 BLAS
-
Technical Report TR-2006-23, The University of Texas at Austin, Department of Computer Sciences
-
Goto, K., van de Geijn, R.: High-performance implementation of the level-3 BLAS. Technical Report TR-2006-23, The University of Texas at Austin, Department of Computer Sciences (2006)
-
(2006)
-
-
Goto, K.1
van de Geijn, R.2
-
16
-
-
0035276480
-
High-performance parallel implicit CFD
-
Gropp, W.D., Kaushik, D.K., Keyes, D.E., Smith, B.F.: High-performance parallel implicit CFD. Parallel Computing 27, 337-362 (2001)
-
(2001)
Parallel Computing
, vol.27
, pp. 337-362
-
-
Gropp, W.D.1
Kaushik, D.K.2
Keyes, D.E.3
Smith, B.F.4
-
17
-
-
1842829625
-
-
Society for Industrial and Applied Mathematics, Philadelphia, PA, USA
-
Saad, Y.: Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA (2003)
-
(2003)
Iterative Methods for Sparse Linear Systems
-
-
Saad, Y.1
-
18
-
-
51049111121
-
Build to order linear algebra kernels
-
IEEE, Los Alamitos
-
Jessup, E., Karlin, I., Siek, J.: Build to order linear algebra kernels. In: Proceedings of the IEEE International Symposium on Parallel and Distributed Processing (IPDPS), pp. 1-8. IEEE, Los Alamitos (2008)
-
(2008)
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing (IPDPS)
, pp. 1-8
-
-
Jessup, E.1
Karlin, I.2
Siek, J.3
-
19
-
-
68849123940
-
-
Norris, B., Hartono, A., Gropp, W.: Annotations for productivity and performance portability. In: Petascale Computing: Algorithms and Applications. Computational Science, pp. 443-462. Chapman & Hall/CRC Press, Taylor and Francis Group (2007)
-
-
-
-
20
-
-
70449793159
-
Annotation-based empirical performance tuning using Orio
-
Rome, Italy, IEEE, Los Alamitos
-
Hartono, A., Norris, B., Sadayappan, P.: Annotation-based empirical performance tuning using Orio. In: Proceedings of the 23rd IEEE International Parallel & Distributed Processing Symposium, Rome, Italy, IEEE, Los Alamitos (2009)
-
(2009)
Proceedings of the 23rd IEEE International Parallel & Distributed Processing Symposium
-
-
Hartono, A.1
Norris, B.2
Sadayappan, P.3
-
21
-
-
0003584577
-
-
2nd edn. Prentice Hall, Inc, Englewood Cliffs
-
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach, 2nd edn. Prentice Hall, Inc., Englewood Cliffs (2003)
-
(2003)
Artificial Intelligence: A Modern Approach
-
-
Russell, S.1
Norvig, P.2