-
1
-
-
0003429675
-
-
Intertext Publications/McGraw-Hill Inc.
-
J.C. Adams, W.S. Brainerd, J.T. Martin, B.T. Smith, and J.L. Wagener, Fortran 90 Handbook. Intertext Publications/McGraw-Hill Inc. 1992.
-
(1992)
Fortran 90 Handbook
-
-
Adams, J.C.1
Brainerd, W.S.2
Martin, J.T.3
Smith, B.T.4
Wagener, J.L.5
-
3
-
-
0025447908
-
Improving register allocation for subscripted variables
-
June
-
D. Callahan, S. Carr, and K. Kennedy, "Improving Register Allocation for Subscripted Variables," Proc. ACM SIGPLAN 1990 Conf. Programming Language Design and Implementation, pp. 53-65, June 1990.
-
(1990)
Proc. ACM SIGPLAN 1990 Conf. Programming Language Design and Implementation
, pp. 53-65
-
-
Callahan, D.1
Carr, S.2
Kennedy, K.3
-
4
-
-
84976831704
-
Compiler optimizations for improving data locality
-
Oct.
-
S. Carr, K.S. McKinley, and C.-W. Tseng, "Compiler Optimizations for Improving Data Locality," Proc. Sixth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 252-262, Oct. 1994.
-
(1994)
Proc. Sixth Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 252-262
-
-
Carr, S.1
McKinley, K.S.2
Tseng, C.-W.3
-
5
-
-
0029235623
-
Hierarchical tiling for improved superscalar performance
-
Apr.
-
L. Carter, J. Ferrante, and S.F. Hummel, "Hierarchical Tiling for Improved Superscalar Performance," Proc. Nineth Int'l Symp. Parallel Processing, pp. 239-245, Apr. 1995.
-
(1995)
Proc. Nineth Int'l Symp. Parallel Processing
, pp. 239-245
-
-
Carter, L.1
Ferrante, J.2
Hummel, S.F.3
-
6
-
-
0032659795
-
Recursive array layouts and fast parallel matrix multiplication
-
June
-
S. Chatterjee, A.R. Lebeck, P.K. Patnala, and M. Thottethodi, "Recursive Array Layouts and Fast Parallel Matrix Multiplication," Proc. Eleventh Ann. ACM Symp. Parallel Algorithms and Architectures, pp. 222-231, June 1999.
-
(1999)
Proc. Eleventh Ann. ACM Symp. Parallel Algorithms and Architectures
, pp. 222-231
-
-
Chatterjee, S.1
Lebeck, A.R.2
Patnala, P.K.3
Thottethodi, M.4
-
7
-
-
0032652980
-
Nonlinear array layouts for hierarchical memory systems
-
June
-
S. Chatterjee, V.V. Jain, A.R. Lebeck, S. Mundhra, and M. Thottehodi, "Nonlinear Array Layouts for Hierarchical Memory Systems," Proc. 1999 ACM Int'l Conf. Supercomputing, pp. 444-453, June 1999.
-
(1999)
Proc. 1999 ACM Int'l Conf. Supercomputing
, pp. 444-453
-
-
Chatterjee, S.1
Jain, V.V.2
Lebeck, A.R.3
Mundhra, S.4
Thottehodi, M.5
-
8
-
-
0003795618
-
Unifying data and control transformations for distributed shared memory machines
-
Technical Report TR-542, Dept. of Computer-Science, Univ. of Rochester, Nov.
-
M. Cierniak and W. Li, "Unifying Data and Control Transformations for Distributed Shared Memory Machines," Technical Report TR-542, Dept. of Computer-Science, Univ. of Rochester, Nov. 1994.
-
(1994)
-
-
Cierniak, M.1
Li, W.2
-
11
-
-
84882715552
-
Cache misses prediction for high performance sparse algorithms
-
Sept.
-
B.B. Fraguela, R. Doallo, and E.L. Zapata, "Cache Misses Prediction for High Performance Sparse Algorithms," Proc. Fourth Int'l Euro-Par Conf. (Euro-Par '98), pp. 224-233, Sept. 1998.
-
(1998)
Proc. Fourth Int'l Euro-Par Conf. (Euro-Par '98)
, pp. 224-233
-
-
Fraguela, B.B.1
Doallo, R.2
Zapata, E.L.3
-
12
-
-
0011916940
-
Cache probabilistic modeling for basic sparse algebra kernels involving matrices with a non-uniform distribution
-
June
-
B.B. Fraguela, R. Doallo, and E.L. Zapata, "Cache Probabilistic Modeling for Basic Sparse Algebra Kernels Involving Matrices with a Non-Uniform Distribution," Proc. 24th IEEE Euromicro Conf., pp. 345-348, June 1998.
-
(1998)
Proc. 24th IEEE Euromicro Conf.
, pp. 345-348
-
-
Fraguela, B.B.1
Doallo, R.2
Zapata, E.L.3
-
13
-
-
0032089580
-
Modeling set associative caches behaviour for irregular computations
-
June
-
B.B. Fraguela, R. Doallo, and E.L. Zapata, "Modeling Set Associative Caches Behaviour for Irregular Computations," ACM Int'l Conf. Measurement and Modeling of Computer Systems (SIGMETRICS '98), pp. 192-201, June 1998.
-
(1998)
ACM Int'l Conf. Measurement and Modeling of Computer Systems (SIGMETRICS '98)
, pp. 192-201
-
-
Fraguela, B.B.1
Doallo, R.2
Zapata, E.L.3
-
17
-
-
0033075413
-
Improving cache locality by a combination of loop and data transformations
-
Feb.
-
M. Kandemir, J. Ramanujam, and A. Choudhary, "Improving Cache Locality by a Combination of Loop and Data Transformations," IEEE Trans. Computers, Feb. 1999.
-
(1999)
IEEE Trans. Computers
-
-
Kandemir, M.1
Ramanujam, J.2
Choudhary, A.3
-
18
-
-
0030662867
-
A compiler algorithm for optimizing locality in loop nests
-
July
-
M. Kandemir, J. Ramanujam, and A. Chaoudhary, "A Compiler Algorithm for Optimizing Locality in Loop Nests," Proc. 1997 ACM Int'l Conf. Supercomputing, pp. 269-276, July 1997.
-
(1997)
Proc. 1997 ACM Int'l Conf. Supercomputing
, pp. 269-276
-
-
Kandemir, M.1
Ramanujam, J.2
Chaoudhary, A.3
-
21
-
-
10844257146
-
A relation approach to the compilation of sparse matrix programs
-
Aug.
-
V. Kotlyar, K. Pingali, and P. Stodhill, "A Relation Approach to the Compilation of Sparse Matrix Programs," Euro Par, Aug. 1997.
-
(1997)
Euro Par
-
-
Kotlyar, V.1
Pingali, K.2
Stodhill, P.3
-
23
-
-
84989868541
-
A tensor product formulation of strassen's matrix multiplication algorithm with memory reduction
-
Apr.
-
B. Kumar, C-H. Huang, R.W. Johnson, and P. Sadayappan, "A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction," Proc. Seventh Int'l Parallel Processing Symp., pp. 582-588, Apr. 1993.
-
(1993)
Proc. Seventh Int'l Parallel Processing Symp.
, pp. 582-588
-
-
Kumar, B.1
Huang, C.-H.2
Johnson, R.W.3
Sadayappan, P.4
-
24
-
-
0026137116
-
The cache performance and optimizations of blocked algorithms
-
Apr.
-
M.S. Lam, E.E. Rothberg, and M.E. Wolf, "The Cache Performance and Optimizations of Blocked Algorithms," Proc. Fourth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 63-74, Apr. 1991.
-
(1991)
Proc. Fourth Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 63-74
-
-
Lam, M.S.1
Rothberg, E.E.2
Wolf, M.E.3
-
26
-
-
0011977499
-
Efficient representation for multi-dimensional matrix operations
-
Mar.
-
J.-S. Liu, J.-Y. Lin, and Y.-C. Chung, "Efficient Representation for Multi-Dimensional Matrix Operations," Proc. Workshop Compiler Techniques for High Performance Computing (CTHPC), pp. 133-142, Mar. 2000.
-
(2000)
Proc. Workshop Compiler Techniques for High Performance Computing (CTHPC)
, pp. 133-142
-
-
Liu, J.-S.1
Lin, J.-Y.2
Chung, Y.-C.3
-
27
-
-
84907042187
-
Efficient parallel algorithms for multi-dimensional matrix operations
-
Dec.
-
J.-S. Liu, J.-Y. Lin, and Y.-C. Chung, "Efficient Parallel Algorithms for Multi-Dimensional Matrix Operations," Proc. IEEE Int'l Symp. Parallel Architectures, Algorithms and Networks (I-SPAN), pp.224-229, Dec. 2000.
-
(2000)
Proc. IEEE Int'l Symp. Parallel Architectures, Algorithms and Networks (I-SPAN)
, pp. 224-229
-
-
Liu, J.-S.1
Lin, J.-Y.2
Chung, Y.-C.3
-
33
-
-
0030295713
-
Parallelization techniques for sparse matrix applications
-
M. Ujaldon, E.L. Zapata, S.D. Sharma, and J. Saltz, "Parallelization Techniques for Sparse Matrix Applications," J. Parallel and Distribution Computing, 1996.
-
(1996)
J. Parallel and Distribution Computing
-
-
Ujaldon, M.1
Zapata, E.L.2
Sharma, S.D.3
Saltz, J.4
-
35
-
-
85028864961
-
Run-time optimization of sparse matrix-vector multiplication on SIMD machines
-
July
-
L.H. Ziantz, C.C. Ozturan, and B.K. Szymanski, "Run-Time Optimization of Sparse Matrix-Vector Multiplication on SIMD Machines," Proc. Int'l Conf. Parallel Architectures and Languages, pp. 313-322, July 1994.
-
(1994)
Proc. Int'l Conf. Parallel Architectures and Languages
, pp. 313-322
-
-
Ziantz, L.H.1
Ozturan, C.C.2
Szymanski, B.K.3
|