-
1
-
-
0012951882
-
-
PhD dissertation, Dept. of Computer Science, Rice Univ., Sept.
-
S. Carr, "Memory-Hierarchy Management," PhD dissertation, Dept. of Computer Science, Rice Univ., Sept. 1992.
-
(1992)
Memory-hierarchy Management
-
-
Carr, S.1
-
3
-
-
84976831704
-
Compiler Optimizations for Improving Data Locality
-
Oct.
-
S. Carr, K. McKinley, and C.-W. Tseng, "Compiler Optimizations for Improving Data Locality," Proc. Sixth Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 252-262, Oct. 1994.
-
(1994)
Proc. Sixth Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 252-262
-
-
Carr, S.1
McKinley, K.2
Tseng, C.-W.3
-
4
-
-
0008568517
-
Multilevel Orthogonal Blocking for Dense Linear Algebra Computations
-
Fall
-
J.J. Navarro, M. Valero, J. Llabería, and T. Lang, "Multilevel Orthogonal Blocking for Dense Linear Algebra Computations," IEEE Computer Soc. TC on Computer Architecture Newsletter, pp. 10-14, Fall 1993.
-
(1993)
IEEE Computer Soc. TC on Computer Architecture Newsletter
, pp. 10-14
-
-
Navarro, J.J.1
Valero, M.2
Llabería, J.3
Lang, T.4
-
5
-
-
84976827033
-
A Data Locality Optimizing Algorithm
-
June
-
M.E. Wolf and M.S. Lam, "A Data Locality Optimizing Algorithm, " Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation, vol. 26, no. 6, pp. 30-44, June 1991.
-
(1991)
Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation
, vol.26
, Issue.6
, pp. 30-44
-
-
Wolf, M.E.1
Lam, M.S.2
-
6
-
-
84941860740
-
A New Loop Transformation Techniques for Massive Parallelism
-
Yale Univ., Computer Science Dept., Apr.
-
L.-C. Lu and M. Chen, "A New Loop Transformation Techniques for Massive Parallelism," Yale Univ., Computer Science Dept., Technical Report TR-833, Apr. 1990.
-
(1990)
Technical Report
, vol.TR-833
-
-
Lu, L.-C.1
Chen, M.2
-
7
-
-
0026232450
-
A Loop Transformation Theory and an Algorithm to Maximize Parallelism
-
Oct.
-
M.E. Wolf and M.S. Lam, "A Loop Transformation Theory and an Algorithm to Maximize Parallelism," IEEE Trans. Parallel and Distributed Systems, vol. 2, no. 4, pp. 452-471, Oct. 1991.
-
(1991)
IEEE Trans. Parallel and Distributed Systems
, vol.2
, Issue.4
, pp. 452-471
-
-
Wolf, M.E.1
Lam, M.S.2
-
8
-
-
0029235623
-
Hierarchical Tiling for Improved Superscalar Performance
-
Apr.
-
L. Carter, J. Ferrante, and S.F. Hummel, "Hierarchical Tiling for Improved Superscalar Performance," Proc. Ninth Int'l Symp. Parallel Processing, pp. 239-245, Apr. 1995.
-
(1995)
Proc. Ninth Int'l Symp. Parallel Processing
, pp. 239-245
-
-
Carter, L.1
Ferrante, J.2
Hummel, S.F.3
-
9
-
-
0031605328
-
Performance Evaluation of Tiling for the Register Level
-
Jan./Feb.
-
M. Jiménez, J.M. Llabería, and A. Fernández, "Performance Evaluation of Tiling for the Register Level," Proc. Fourth Int'l Symp. High-Performance Computer Architecture, pp. 254-265, Jan./Feb. 1998.
-
(1998)
Proc. Fourth Int'l Symp. High-performance Computer Architecture
, pp. 254-265
-
-
Jiménez, M.1
Llabería, J.M.2
Fernández, A.3
-
10
-
-
0033324646
-
On the Performance of Hand versus Automatically Optimized Numerical Codes
-
Jan.
-
M. Jiménez, J.M. Llabería, and A. Fernández, "On the Performance of Hand versus Automatically Optimized Numerical Codes," Proc. Sixth Int'l Symp. High-Performance Computer Architecture, pp. 183-194, Jan. 1999.
-
(1999)
Proc. Sixth Int'l Symp. High-performance Computer Architecture
, pp. 183-194
-
-
Jiménez, M.1
Llabería, J.M.2
Fernández, A.3
-
11
-
-
0032662838
-
An Experimental Evaluation of Tiling and Shackling for Memory Hierarchy Management
-
June
-
I. Kodukula, K. Pingali, R. Cox, and D. Maydan, "An Experimental Evaluation of Tiling and Shackling for Memory Hierarchy Management," Proc. Int'l Conf. Supercomputing, pp. 482-491, June 1999.
-
(1999)
Proc. Int'l Conf. Supercomputing
, pp. 482-491
-
-
Kodukula, I.1
Pingali, K.2
Cox, R.3
Maydan, D.4
-
12
-
-
0025402476
-
A Set of Level 3 Basic Linear Algebra Subprograms
-
Mar.
-
J.J. Dongarra, J.D. Croz., S. Hammarling, and I. Duff, "A Set of Level 3 Basic Linear Algebra Subprograms," Trans. Math. Software, vol. 16, no. 1, pp. 1-17, Mar. 1990.
-
(1990)
Trans. Math. Software
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.J.1
Croz, J.D.2
Hammarling, S.3
Duff, I.4
-
13
-
-
0003783762
-
-
PhD thesis, Dept. of Computer Architecture, Universitat Politècnica de Catalunya, May
-
M. Jiménez, "Multilevel Tiling for Non-Rectangular Iteration Spaces," PhD thesis, Dept. of Computer Architecture, Universitat Politècnica de Catalunya, http://www.ac.upc.es/pub/reports/DAC/1999/UPC-DAC-1999-16.ps, May 1999.
-
(1999)
Multilevel Tiling for Non-rectangular Iteration Spaces
-
-
Jiménez, M.1
-
14
-
-
0038895757
-
Register Tiling in Nonrectangular Iteration Spaces
-
July
-
M. Jiménez, J.M. Llabería, and A. Fernández, "Register Tiling in Nonrectangular Iteration Spaces," ACM Trans. Programming Languages and Systems, vol. 24, no. 4, pp. 409-453, July 2002.
-
(2002)
ACM Trans. Programming Languages and Systems
, vol.24
, Issue.4
, pp. 409-453
-
-
Jiménez, M.1
Llabería, J.M.2
Fernández, A.3
-
16
-
-
0038835469
-
Implementation of Fourier-Motzkin Elimination
-
Leiden Univ., Dept. of Mathematics and Computer Science
-
A. Bik and H. Wijshoff, "Implementation of Fourier-Motzkin Elimination," Leiden Univ., Dept. of Mathematics and Computer Science, Technical Report TR-94-42, 1994.
-
(1994)
Technical Report
, vol.TR-94-42
-
-
Bik, A.1
Wijshoff, H.2
-
17
-
-
0242706049
-
-
PhD thesis, Dept. of Computer Science, Univ. of Illinois, Urbana-Champaign, Feb.
-
R.H. Kuhn, "Optimization and Interconnection Complexity for: Parallel Processors, Single-Stage Networks, and Decision Trees," PhD thesis, Dept. of Computer Science, Univ. of Illinois, Urbana-Champaign, Feb. 1980.
-
(1980)
Optimization and Interconnection Complexity for: Parallel Processors, Single-stage Networks, and Decision Trees
-
-
Kuhn, R.H.1
-
18
-
-
0029354475
-
Loop Transformation Using Nonunimodular Matrices
-
Aug.
-
A. Fernández, J.M. Llabería, and M. Valero-García, "Loop Transformation Using Nonunimodular Matrices," IEEE Trans. Parallel and Distributed Systems, vol. 6, no. 8, pp. 832-840, Aug. 1995.
-
(1995)
IEEE Trans. Parallel and Distributed Systems
, vol.6
, Issue.8
, pp. 832-840
-
-
Fernández, A.1
Llabería, J.M.2
Valero-García, M.3
-
21
-
-
4243843055
-
Access Normalization: Loop Restructuring for NUMA Compilers
-
Cornell Univ., Computer Science Dept., Apr.
-
W. Li and K. Pingali, "Access Normalization: Loop Restructuring for NUMA Compilers," Cornell Univ., Computer Science Dept., Technical Report TR92-1278, Apr. 1992.
-
(1992)
Technical Report
, vol.TR92-1278
-
-
Li, W.1
Pingali, K.2
-
22
-
-
0029518016
-
Beyond Unimodular Transformations
-
J. Ramanujam, "Beyond Unimodular Transformations," J. Supercomputing, vol. 9, no. 4, pp. 365-389, 1995.
-
(1995)
J. Supercomputing
, vol.9
, Issue.4
, pp. 365-389
-
-
Ramanujam, J.1
-
23
-
-
84976766536
-
Scanning Polyhedra with DO Loops
-
Apr.
-
C. Ancourt and F. Irigoin, "Scanning Polyhedra with DO Loops," Proc. Third ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, vol. 26, no. 7, pp. 39-50, Apr. 1991.
-
(1991)
Proc. Third ACM SIGPLAN Symp. Principles and Practice of Parallel Programming
, vol.26
, Issue.7
, pp. 39-50
-
-
Ancourt, C.1
Irigoin, F.2
-
24
-
-
0242453444
-
A Proposal of Level 3 Interface for Band and Skyline Matrix Factorization Subroutine
-
July
-
H. Samukawa, "A Proposal of Level 3 Interface for Band and Skyline Matrix Factorization Subroutine," Proc. Int'l Conf. Supercomputing, pp. 397-406, July 1993.
-
(1993)
Proc. Int'l Conf. Supercomputing
, pp. 397-406
-
-
Samukawa, H.1
-
25
-
-
0030685988
-
Data-Centric Multi-Level Blocking
-
June
-
I. Kodukula, N. Ahmed, and K. Pingali, "Data-Centric Multi-Level Blocking," Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation, vol. 32, no. 5, pp. 346-357, June 1997.
-
(1997)
Proc. ACM SIGPLAN Conf. Programming Language Design and Implementation
, vol.32
, Issue.5
, pp. 346-357
-
-
Kodukula, I.1
Ahmed, N.2
Pingali, K.3
-
26
-
-
84976676720
-
A Practical Algorithm for Exact Array Dependence Analysis
-
Aug.
-
W. Pugh, "A Practical Algorithm for Exact Array Dependence Analysis," Comm. ACM, vol. 35, no. 8, pp. 102-114, Aug. 1992.
-
(1992)
Comm. ACM
, vol.35
, Issue.8
, pp. 102-114
-
-
Pugh, W.1
|