-
1
-
-
79959443441
-
-
NAS Parallel Benchmarks. Website
-
NAS Parallel Benchmarks. Website. http://www.nas.nasa.gov/Software/NPB/.
-
-
-
-
2
-
-
0019567795
-
ON THE PERFORMANCE ENHANCEMENT OF PAGING SYSTEMS THROUGH PROGRAM ANALYSIS AND TRANSFORMATIONS
-
W. Abu-Sufah, D. J. Kuck, and D. H. Lawrie. On the Performance Enhancement of Paging Systems Through Program Analysis and Transformations. IEEE Trans. Comput., 30(5):341-356, 1981. (Pubitemid 11506029)
-
(1981)
IEEE Transactions on Computers
, vol.C30
, Issue.5
, pp. 341-356
-
-
Abu-Sufah, W.1
Kuck, D.J.2
Lawrie, D.H.3
-
3
-
-
0003706460
-
-
Society for Industrial and Applied Mathematics, Philadelphia, PA, third edition
-
E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen. LAPACK Users' Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, third edition, 1999.
-
(1999)
LAPACK Users' Guide.
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.4
Demmel, J.5
Dongarra, J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.11
-
4
-
-
33746070421
-
Shared memory programming for large scale machines
-
108-117
-
C. Barton, C. Casçaval, G. Almási, Y Zheng, M. Farreras, S. Chattete, and J. N. Amaral. Shared Memory Programming for Large Scale Machines. In PLDI '06: Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 108-117,2006.
-
(2006)
PLDI '06: Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation
-
-
Barton, C.1
Casçaval, C.2
Almási, G.3
Zheng, Y.4
Farreras, M.5
Chattete, S.6
Amaral, J.N.7
-
5
-
-
17644412337
-
The science of deriving dense linear algebra algorithms
-
Mar.
-
P. Bientinesi, J. A. Gunnels, M. E. Myers, E. S. Quintana-Orti, and R. A. van de Geijn. The Science of Deriving Dense Linear Algebra Algorithms. ACM Tram. Math. Softw., 31(1):l-26, Mar. 2005.
-
(2005)
ACM Tram. Math. Softw.
, vol.31
, Issue.1
-
-
Bientinesi, P.1
Gunnels, J.A.2
Myers, M.E.3
Quintana-Orti, E.S.4
Van De Geijn, R.A.5
-
6
-
-
17644370328
-
Representing linear algebra algorithms in code: The FLAME application program interfaces
-
DOI 10.1145/1055531.1055533
-
P. Bientinesi, E. S. Quintana-Orti, and R. A. van de Geijn. Representing linear algebra algorithms in code: the FLAME application program interfaces. ACM Trani. Math. Softw., 31(1):27-59, 2005. (Pubitemid 40557861)
-
(2005)
ACM Transactions on Mathematical Software
, vol.31
, Issue.1
, pp. 27-59
-
-
Bientinesi, P.1
Quintana-Orti, E.S.2
Van De, G.R.A.3
-
8
-
-
33751022080
-
Programming for parallelism and locality with hierarchically tiled arrays
-
G. Bikshandi, J. Guo, D. Hoeflinger, G. Almasi, B. B. Fraguela, M. J. Garzarán, D. Padua, and C. von Praun. Programming for Parallelism and Locality with Hierarchically Tiled Arrays. In PPoPP '06: Proc. of the ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, pages 48-57, 2006.
-
(2006)
PPoPP '06: Proc. of the ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming
, pp. 48-57
-
-
Bikshandi, G.1
Guo, J.2
Hoeflinger, D.3
Almasi, G.4
Fraguela, B.B.5
Garzarán, M.J.6
Padua, D.7
Von Praun, C.8
-
9
-
-
79959476556
-
Design and use of htalib - A library for Hierarchically Tiled Arrays
-
G. Bikshandi, J. Guo, C. von Praun, G. Tanase, B. B. Fraguela, M. J. Garzarán, D. Padua, and L. Rauchwerger. Design and use of htalib - a library for Hierarchically Tiled Arrays. In Proc. of the Intl. Workshop on Languages and Compilers for Parallel Computing, 2006.
-
(2006)
Proc. of the Intl. Workshop on Languages and Compilers for Parallel Computing
-
-
Bikshandi, G.1
Guo, J.2
Von Praun, C.3
Tanase, G.4
Fraguela, B.B.5
Garzarán, M.J.6
Padua, D.7
Rauchwerger, L.8
-
10
-
-
0003510632
-
Introduction to UPC and language specification
-
IDA Center for Computing Sciences
-
W Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks, and K. Warren. Introduction to UPC and Language Specification. Technical Report CCS-TR-99-157, IDA Center for Computing Sciences, 1999.
-
(1999)
Technical Report CCS-TR-99-157
-
-
Carlson, W.1
Draper, J.2
Culler, D.3
Yelick, K.4
Brooks, E.5
Warren, K.6
-
11
-
-
0012593025
-
-
Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
-
R. Chandra, L. Dagum, D. Kohr, D. Maydan, J. McDonald, and R. Menon. Parallel programming in OpenMP. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2001.
-
(2001)
Parallel Programming in OpenMP.
-
-
Chandra, R.1
Dagum, L.2
Kohr, D.3
Maydan, D.4
McDonald, J.5
Menon, R.6
-
12
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
K. Fatahalian, D. R. Horn, T J. Knight, L. Leem, M. Houston, J. Y Park, M. Erez, M. Ren, A. Aiken, W J. Dally, and P. Hanrahan. Sequoia: programming the memory hierarchy. In Supercomputing '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, page 83, 2006.
-
(2006)
Supercomputing '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
, pp. 83
-
-
Fatahalian, K.1
Horn, D.R.2
Knight, T.J.3
Leem, L.4
Houston, M.5
Park, J.Y.6
Erez, M.7
Ren, M.8
Aiken, A.9
Dally, W.J.10
Hanrahan, P.11
-
13
-
-
0006445049
-
Solving problems on concurrent processors
-
Prentice-Hall, Inc.
-
G. C. Fox, M. A. Johnson, G. A. Lyzenga, S. W Otto, J. K. Salmon, and D. W. Walker. Solving Problems on Concurrent Processors. Vol. 1: General Techniques and Regular Problems. Prentice-Hall, Inc., 1988.
-
(1988)
Vol. 1: General Techniques and Regular Problems
-
-
Fox, G.C.1
Johnson, M.A.2
Lyzenga, G.A.3
Otto, S.W.4
Salmon, J.K.5
Walker, D.W.6
-
14
-
-
0033350255
-
Cacheoblivious algorithms
-
M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran. Cacheoblivious algorithms. In FOCS '99: Proceedings of the 40th Annual Symposium on Foundation? of Computer Science, page 285, 1999.
-
(1999)
FOCS '99: Proceedings of the 40th Annual Symposium on Foundation? of Computer Science
, pp. 285
-
-
Frigo, M.1
Leiserson, C.E.2
Prokop, H.3
Ramachandran, S.4
-
16
-
-
0037230301
-
High-performance linear algebra algorithms using new generalized data structures for matrices
-
F. G. Gustavson. High-performance linear algebra algorithms using new generalized data structures for matrices. IBM. Res. Dev., 47(1):31-55, 2003.
-
(2003)
IBM. Res. Dev.
, vol.47
, Issue.1
, pp. 31-55
-
-
Gustavson, F.G.1
-
19
-
-
84976813879
-
Compiling fortran D for MIMD distributed-memory machines
-
S. Hiranandani, K. Kennedy, and C-W. Tseng. Compiling Fortran D for MIMD Distributed-memory Machines. Commun. ACM, 35(8):66-80, 1992.
-
(1992)
Commun. ACM
, vol.35
, Issue.8
, pp. 66-80
-
-
Hiranandani, S.1
Kennedy, K.2
Tseng, C.-W.3
-
23
-
-
84945709131
-
Organizing matrices and matrix operations for paged memory systems
-
A. C McKellar and J. E. G. Coffman. Organizing Matrices and Matrix Operations for Paged Memory Systems. Communications of the ACM, 12(3): 153-165, 1969.
-
(1969)
Communications of the ACM
, vol.12
, Issue.3
, pp. 153-165
-
-
McKellar, A.C.1
Coffman, J.E.G.2
-
24
-
-
0002081678
-
Co-array fortran for parallel programming
-
R. W. Numrich and J. Reid. Co-array Fortran for Parallel Programming. SIGPLANFortran Forum, 17(2):1-31, 1998.
-
(1998)
SIGPLANFortran Forum
, vol.17
, Issue.2
, pp. 1-31
-
-
Numrich, R.W.1
Reid, J.2
-
25
-
-
10844267383
-
Formal derivation of algorithms: The triangular sylvester equation
-
E. S. Quintana-Orti and R. A. van de Geijn. Formal Derivation of Algorithms: The Triangular Sylvester Equation. ACM Tram. Math. Softw., 29(2):218-243, 2003.
-
(2003)
ACM Tram. Math. Softw.
, vol.29
, Issue.2
, pp. 218-243
-
-
Quintana-Orti, E.S.1
Van De Geijn, R.A.2
-
28
-
-
0002693795
-
POOMA: A framework for scientific simulations of paralllel architectures
-
MIT Press
-
J. V. W Reynders, P. J. Hinker, J. C. Cummings, S. R. Atlas, S. Banerjee, W F. Humphrey, S. R. Karmesin, K. Keahey, M. Srikant, and M. D. Tholburn. POOMA: A Framework for Scientific Simulations of Paralllel Architectures. In Parallel Programming in C+ +, pages 547-588. MIT Press, 1996.
-
(1996)
Parallel Programming in C+ +
, pp. 547-588
-
-
Reynders, J.V.W.1
Hinker, P.J.2
Cummings, J.C.3
Atlas, S.R.4
Banerjee, S.5
Humphrey, W.F.6
Karmesin, S.R.7
Keahey, K.8
Srikant, M.9
Tholburn, M.D.10
-
29
-
-
0031496750
-
Locality of reference in lu decomposition with partial pivoting
-
S. Toledo. Locality of Reference in LU Decomposition with Partial Pivoting. SIAM Journal on Matrix Analysis and Application?, 18(4): 1065-1081, 1997.
-
(1997)
SIAM Journal on Matrix Analysis and Application?
, vol.18
, Issue.4
, pp. 1065-1081
-
-
Toledo, S.1
-
30
-
-
0347875299
-
-
Technical Report TR542, Department of Computer Science, Indiana University
-
T. Veldhuizen. Techniques for scientific C++. Technical Report TR542, Department of Computer Science, Indiana University, 2000.
-
(2000)
Techniques for Scientific C++
-
-
Veldhuizen, T.1
-
31
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
DOI 10.1016/S0167-8191(00)00087-9
-
R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Sofware and the ATLAS Project. Parallel Computing, 27(1-2):3-35, 2001. (Pubitemid 32264775)
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Clint, W.R.1
Petitet, A.2
Dongarra, J.J.3
|