-
1
-
-
0038823316
-
A case-study in performance programming: Seismic migration
-
G. Almasi, B. Alpern, L. Berman, L. Carter, and D. Hale, "A Case-Study in Performance Programming: Seismic Migration," Proc. Symp. High Performance Computing, Sept. 1991.
-
Proc. Symp. High Performance Computing, Sept. 1991
-
-
Almasi, G.1
Alpern, B.2
Berman, L.3
Carter, L.4
Hale, D.5
-
3
-
-
70350749986
-
Optimal orthogonal tiling
-
Sept.
-
R. Andonov, S. Rajopadhye, and N. Yanev, "Optimal Orthogonal Tiling," Proc. Europar '98, pp. 480-490, Sept. 1998.
-
(1998)
Proc. Europar '98
, pp. 480-490
-
-
Andonov, R.1
Rajopadhye, S.2
Yanev, N.3
-
5
-
-
4243923745
-
Matrix multiply benchmarks
-
technical report, Center for Scientific Computing, Dept. of Math., Univ. of Utah; This report is updated frequently
-
N.H.F. Beebe, "Matrix Multiply Benchmarks," technical report, Center for Scientific Computing, Dept. of Math., Univ. of Utah, 1990, This report is updated frequently.
-
(1990)
-
-
Beebe, N.H.F.1
-
6
-
-
0030661485
-
Optimizing matrix multiply using PhiPAC: A portable, high performance, ANSI C coding methodology
-
July
-
J. Bilmes, K. Asanovic, C.-W. Chin, and J. Demmel, "Optimizing Matrix Multiply Using PhiPAC: A Portable, High Performance, ANSI C Coding Methodology," Proc. 11th Int'l Conf. Supercomputing (ICS '97), pp. 340-347, July 1997.
-
(1997)
Proc. 11th Int'l Conf. Supercomputing (ICS '97)
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
7
-
-
0028482686
-
(Pen)-ultimate tiling?
-
P. Boulet, A. Darte, T. Risset, and Y. Robert, "(Pen)-Ultimate Tiling?" INTEGRATION, the Very Large Scale Intergration J., vol. 17, pp. 33-51, 1994.
-
(1994)
INTEGRATION, the Very Large Scale Intergration J.
, vol.17
, pp. 33-51
-
-
Boulet, P.1
Darte, A.2
Risset, T.3
Robert, Y.4
-
8
-
-
0000493064
-
Estimating interlock and improving balance for pipelined machines
-
Aug.
-
D. Callahan, J. Cocke, and K. Kennedy, "Estimating Interlock and Improving Balance for Pipelined Machines," J. Parallel and Distributed Computing, vol. 5, no. 4, pp. 334-358, Aug. 1988.
-
(1988)
J. Parallel and Distributed Computing
, vol.5
, Issue.4
, pp. 334-358
-
-
Callahan, D.1
Cocke, J.2
Kennedy, K.3
-
10
-
-
84964748976
-
Compiler blockability of numerical algorithms
-
Nov.
-
S. Carr and K. Kennedy, "Compiler Blockability of Numerical Algorithms," J. Supercomputing, pp. 114-124, Nov. 1992.
-
(1992)
J. Supercomputing
, pp. 114-124
-
-
Carr, S.1
Kennedy, K.2
-
15
-
-
0004116989
-
-
MIT Press and McGraw-Hill
-
T.H. Cormen, C.E. Leiserson, and R.L. Rivest, Introduction to Algorithms, sixth ed., MIT Press and McGraw-Hill, 1992.
-
(1992)
Introduction to Algorithms, Sixth Ed.
-
-
Cormen, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
-
16
-
-
0030287932
-
LogP: A practical model of parallel computation
-
Nov.
-
D. Culler, R. Karp, D. Patterson, A. Sahay, E. Santos, K.E. Schauser, R. Subramonian, and T. von Eicken, "LogP: A Practical Model of Parallel Computation," Comm. ACM, vol. 39, no. 11, pp. 78-85, Nov. 1996.
-
(1996)
Comm. ACM
, vol.39
, Issue.11
, pp. 78-85
-
-
Culler, D.1
Karp, R.2
Patterson, D.3
Sahay, A.4
Santos, E.5
Schauser, K.E.6
Subramonian, R.7
Von Eicken, T.8
-
17
-
-
0002352131
-
Linear scheduling is nearly optimal
-
A. Darte, L. Khachiyan, and Y. Robert, "Linear Scheduling is Nearly Optimal," Parallel Processing Letters, vol. 1, no. 2, pp. 73-81, 1991.
-
(1991)
Parallel Processing Letters
, vol.1
, Issue.2
, pp. 73-81
-
-
Darte, A.1
Khachiyan, L.2
Robert, Y.3
-
18
-
-
0031335231
-
Determining the idle time of a tiling: New results
-
F. Desprez, J. Dongarra, F. Rastello, and Y. Robert, "Determining the Idle Time of a Tiling: New Results," Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '97), Nov. 1997.
-
Proc. Int'l Conf. Parallel Architectures and Compilation Techniques (PACT '97), Nov. 1997
-
-
Desprez, F.1
Dongarra, J.2
Rastello, F.3
Robert, Y.4
-
19
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
S.V. Rajopadhye, F. Quiller, and D. Wilde, "Generation of Efficient Nested Loops from Polyhedra," Int'l J. Parallel Programming, vol. 28, no. 5, pp. 469-498, 2000.
-
(2000)
Int'l J. Parallel Programming
, vol.28
, Issue.5
, pp. 469-498
-
-
Rajopadhye, S.V.1
Quiller, F.2
Wilde, D.3
-
20
-
-
0023385308
-
The program dependence graph and its use in optimization
-
July
-
J. Ferrante, K.J. Ottenstein, and J.D. Warren, "The Program Dependence Graph and Its Use in Optimization," ACM Trans. Programming Languages and Systems, vol. 9, no. 3, pp. 319-349, July 1987.
-
(1987)
ACM Trans. Programming Languages and Systems
, vol.9
, Issue.3
, pp. 319-349
-
-
Ferrante, J.1
Ottenstein, K.J.2
Warren, J.D.3
-
21
-
-
0003638028
-
Predicting performance for tiled perfectly nested loops
-
PhD thesis, Univ. of California, San Diego, Dept. of Computer Science and Eng., Dec.
-
K. Högstedt, "Predicting Performance for Tiled Perfectly Nested Loops," PhD thesis, Univ. of California, San Diego, Dept. of Computer Science and Eng., Dec. 1999.
-
(1999)
-
-
Högstedt, K.1
-
27
-
-
0030685988
-
Data-centric multilevel blocking
-
I. Kodukula, N. Ahmed, and K. Pingali, "Data-Centric Multilevel Blocking," Proc. SIGPLAN, Conf. Programming Language Design and Implementation, pp. 346-357, 1997.
-
(1997)
Proc. SIGPLAN, Conf. Programming Language Design and Implementation
, pp. 346-357
-
-
Kodukula, I.1
Ahmed, N.2
Pingali, K.3
-
28
-
-
0032308685
-
Quantifying the multilevel nature of tiling interactions
-
N. Mitchell, K. Högstedt, L. Carter, and J. Ferrante, "Quantifying the Multilevel Nature of Tiling Interactions," Int'l J. Parallel Programming, vol.26, no. 6, pp. 641-670, 1998.
-
(1998)
Int'l J. Parallel Programming
, vol.26
, Issue.6
, pp. 641-670
-
-
Mitchell, N.1
Högstedt, K.2
Carter, L.3
Ferrante, J.4
-
29
-
-
57649182551
-
Quantifying the multilevel nature of tiling interactions
-
N. Mitchell, L. Carter, J. Ferrante, and K. Högstedt, "Quantifying the Multilevel Nature of Tiling Interactions," Proc. Workshop Languages and Compilers for Parallel Computing, 1997.
-
Proc. Workshop Languages and Compilers for Parallel Computing, 1997
-
-
Mitchell, N.1
Carter, L.2
Ferrante, J.3
Högstedt, K.4
-
31
-
-
0038485313
-
Optimizing memory usage in the polyhedral model
-
F. Quiller and S. V. Rajopadhye, "Optimizing Memory Usage in the Polyhedral Model," ACM Trans. Programming Languages and Systems (TOPLAS), vol. 22, no. 5, pp. 773-815, 2000.
-
(2000)
ACM Trans. Programming Languages and Systems (TOPLAS)
, vol.22
, Issue.5
, pp. 773-815
-
-
Quiller, F.1
Rajopadhye, S.V.2
-
32
-
-
0002238004
-
Tiling multidimensional iteration spaces for nonshared memory machines
-
Nov.
-
J. Ramanujam and P. Sadayappan, "Tiling Multidimensional Iteration Spaces for Nonshared Memory Machines," Supercomputing, Nov. 1991.
-
(1991)
Supercomputing
-
-
Ramanujam, J.1
Sadayappan, P.2
-
33
-
-
0023384075
-
Stencils and problem partitionings: Their influence on the performance of multiple processor systems
-
July
-
D.A. Reed, L.M. Adams, and M.L. Patrick, "Stencils and Problem Partitionings: Their Influence on the Performance of Multiple Processor Systems," IEEE Trans. Computers, vol. 36, no. 7, pp. 845-858, July 1987.
-
(1987)
IEEE Trans. Computers
, vol.36
, Issue.7
, pp. 845-858
-
-
Reed, D.A.1
Adams, L.M.2
Patrick, M.L.3
-
34
-
-
0031140581
-
Automatic selection of high-order transformations in the IBM XL FORTRAN compilers
-
V. Sarkar, "Automatic Selection of High-Order Transformations in the IBM XL FORTRAN Compilers," IBM J. Research and Development, vol. 41, no. 3, pp. 233-264, 1997.
-
(1997)
IBM J. Research and Development
, vol.41
, Issue.3
, pp. 233-264
-
-
Sarkar, V.1
-
36
-
-
0037808951
-
-
Standord SUIF Compiler System
-
Standord SUIF Compiler System, http://suif.stanford.edu/, 2002.
-
(2002)
-
-
-
37
-
-
0038485309
-
-
Sweep3D Benchmark
-
Sweep3D Benchmark, www.llnl.gov/asci.benchmarks/asci/limtited/sweep3d/asci_sweep3d.html, 1995.
-
(1995)
-
-
-
39
-
-
0003553286
-
Improving locality and parallelism in nested loops
-
Phd thesis, Stanford Univ., Computer Systems Laboratory, Aug.
-
M.E. Wolf, "Improving Locality and Parallelism in Nested Loops," Phd thesis, Stanford Univ., Computer Systems Laboratory, Aug. 1992.
-
(1992)
-
-
Wolf, M.E.1
-
41
-
-
0026232450
-
A loop transformation theory and an algorithm to maximize parallelism
-
M.E. Wolf and M.S. Lam, "A Loop Transformation Theory and an Algorithm to Maximize Parallelism," IEEE Trans. Parallel and Distributed Systems, vol. 2, no. 4, pp. 452-471, 1991.
-
(1991)
IEEE Trans. Parallel and Distributed Systems
, vol.2
, Issue.4
, pp. 452-471
-
-
Wolf, M.E.1
Lam, M.S.2
-
44
-
-
0024935630
-
More iteration space tiling
-
M.J. Wolfe, "More Iteration Space Tiling," Supercomputing, pp. 655-664, 1989.
-
(1989)
Supercomputing
, pp. 655-664
-
-
Wolfe, M.J.1
-
47
-
-
0032315190
-
Reuse-driven tiling for improving data locality
-
J. Xue and C.-H. Huang, "Reuse-Driven Tiling for Improving Data Locality," Int'l J. Parallel Programming, vol. 26, no. 6, pp. 671-696, 1998.
-
(1998)
Int'l J. Parallel Programming
, vol.26
, Issue.6
, pp. 671-696
-
-
Xue, J.1
Huang, C.-H.2
|