-
1
-
-
0029373981
-
Automatic partitioning of parallel loops and data arrays for distributed shared-memory multiprocessor
-
Sept.
-
A. Agarwal, D. Kranz, and V. Natarajan, "Automatic Partitioning of Parallel Loops and Data Arrays for Distributed Shared-Memory Multiprocessor," IEEE Trans. Parallel and Distributed Systems, vol. 6, no. 9, pp. 943-962, Sept. 1995.
-
(1995)
IEEE Trans. Parallel and Distributed Systems
, vol.6
, Issue.9
, pp. 943-962
-
-
Agarwal, A.1
Kranz, D.2
Natarajan, V.3
-
2
-
-
0034826519
-
Optimal semi-oblique tiling
-
July
-
R. Andonov, S. Balev, S. Rajopadhye, and N. Yanev, "Optimal Semi-Oblique Tiling," Proc. 13th ACM Symp. Parallel Algorithms and Architectures, pp. 153-164, July 2001.
-
(2001)
Proc. 13th ACM Symp. Parallel Algorithms and Architectures
, pp. 153-164
-
-
Andonov, R.1
Balev, S.2
Rajopadhye, S.3
Yanev, N.4
-
3
-
-
0030379461
-
Two-dimensional orthogonal tiling: From theory to practice
-
Dec.
-
R. Andonov, H. Bourzoufi, and S. Rajopadhye, "Two-Dimensional Orthogonal Tiling: From Theory to Practice," Proc. Int'l Conf. High Performance Computing, pp. 225-231, Dec. 1996.
-
(1996)
Proc. Int'l Conf. High Performance Computing
, pp. 225-231
-
-
Andonov, R.1
Bourzoufi, H.2
Rajopadhye, S.3
-
4
-
-
0142034937
-
First steps towards optimal oblique tiling of two-dimensional iterations
-
R. Andonov, P.-Y. Calland, S. Niar, S. Rajopadhye, and N. Yanev, "First Steps Towards Optimal Oblique Tiling of Two-Dimensional Iterations," Proc. Workshop Compilers for Parallel Computers, Jan. 2000.
-
Proc. Workshop Compilers for Parallel Computers, Jan. 2000
-
-
Andonov, R.1
Calland, P.-Y.2
Niar, S.3
Rajopadhye, S.4
Yanev, N.5
-
6
-
-
70350749986
-
Optimal orthogonal tiling
-
R. Andonov, S. Rajopadhye, and N. Yanev, "Optimal Orthogonal Tiling," Euro-Par'98 Parallel Processing, Lecture Notes in Computer Science, 1470, pp. 480-490, 1998.
-
(1998)
Euro-Par'98 Parallel Processing, Lecture Notes in Computer Science, 1470
, pp. 480-490
-
-
Andonov, R.1
Rajopadhye, S.2
Yanev, N.3
-
7
-
-
0028482686
-
(Pen)-ultimate tiling?
-
P. Boulet, A. Darte, T. Risset, and Y. Robert, "(Pen)-Ultimate Tiling?" Integration, the VLSI J., vol. 17, pp. 33-51, 1994.
-
(1994)
Integration, the VLSI J.
, vol.17
, pp. 33-51
-
-
Boulet, P.1
Darte, A.2
Risset, T.3
Robert, Y.4
-
8
-
-
0029218339
-
Precise tiling for uniform loop nests
-
P. Cappello, C. Mongenet, G.-R. Perrin, P. Quinton, and Y. Robert, eds.; July
-
P.-Y. Calland and T. Risset, "Precise Tiling for Uniform Loop Nests," Application Specific Array Processors, P. Cappello, C. Mongenet, G.-R. Perrin, P. Quinton, and Y. Robert, eds., pp. 330-337, July 1995.
-
(1995)
Application Specific Array Processors
, pp. 330-337
-
-
Calland, P.-Y.1
Risset, T.2
-
10
-
-
0032028841
-
Determining the idle time of a tiling: New results
-
F. Desprez, J. Dongarra, F. Rastello, and Y. Robert, "Determining the Idle Time of a Tiling: New Results," J. Information Science and Eng., vol. 14, pp. 167-190, 1998.
-
(1998)
J. Information Science and Eng.
, vol.14
, pp. 167-190
-
-
Desprez, F.1
Dongarra, J.2
Rastello, F.3
Robert, Y.4
-
11
-
-
0021760002
-
Fast optimal alignment
-
J. Fickett, "Fast Optimal Alignment," Nucleic Acids Research, vol. 12, no. 1, pp. 175-179, 1984.
-
(1984)
Nucleic Acids Research
, vol.12
, Issue.1
, pp. 175-179
-
-
Fickett, J.1
-
12
-
-
0002662446
-
Evaluating compiler optimizations for Fortran D
-
S. Hiranandani, K. Kennedy, and C.-W. Tseng, "Evaluating Compiler Optimizations for Fortran D," J. Parallel and Distributed Computing, vol. 21, pp. 27-45, 1994.
-
(1994)
J. Parallel and Distributed Computing
, vol.21
, pp. 27-45
-
-
Hiranandani, S.1
Kennedy, K.2
Tseng, C.-W.3
-
13
-
-
0032069399
-
On supernode transformation with minimized total running time
-
May
-
E. Hodzic and W. Shang, "On Supernode Transformation with Minimized Total Running Time," IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 5, pp. 417-428, May 1998.
-
(1998)
IEEE Trans. Parallel and Distributed Systems
, vol.9
, Issue.5
, pp. 417-428
-
-
Hodzic, E.1
Shang, W.2
-
14
-
-
0003638028
-
Predicting performance for tiled perfectly nested loops
-
PhD thesis, Dept. of Computer Science and Eng., Univ. of California, San Diego, Dec.
-
K. Högstedt, "Predicting Performance for Tiled Perfectly Nested Loops," PhD thesis, Dept. of Computer Science and Eng., Univ. of California, San Diego, Dec. 1999.
-
(1999)
-
-
Högstedt, K.1
-
16
-
-
0032642196
-
Selecting tile shape for minimal execution time
-
June
-
K. Högstedt, L. Carter, and J. Ferrante, "Selecting Tile Shape for Minimal Execution Time," Proc. 11th ACM Symp. Parallel Algorithms and Architectures, pp. 201-211, June 1999.
-
(1999)
Proc. 11th ACM Symp. Parallel Algorithms and Architectures
, pp. 201-211
-
-
Högstedt, K.1
Carter, L.2
Ferrante, J.3
-
17
-
-
4243832310
-
An analysis of the execution time of tiled loops
-
(journal submission), Mar.
-
K. Högstedt, L. Carter, and J. Ferrante, "An Analysis of the Execution Time of Tiled Loops," http://www-cse.ucsd.edu/ferrante/karjour.ps (journal submission), Mar. 2000.
-
(2000)
-
-
Högstedt, K.1
Carter, L.2
Ferrante, J.3
-
19
-
-
0001512318
-
The organization of computations for uniform recurrence equations
-
July
-
R.M. Karp, R.E. Miller, and S. Winograd, "The Organization of Computations for Uniform Recurrence Equations," J. ACM, vol. 14, no. 3, pp. 563-590, July 1967.
-
(1967)
J. ACM
, vol.14
, Issue.3
, pp. 563-590
-
-
Karp, R.M.1
Miller, R.E.2
Winograd, S.3
-
20
-
-
0025502605
-
Pipelined data-parallel algorithms: Part 1-Concept and modelling
-
Oct.
-
C.-T. King, W.-H. Chou, and L. Ni, "Pipelined Data-Parallel Algorithms: Part 1-Concept and Modelling," IEEE Trans. Parallel and Distributed Systems, vol. 1, no. 4, pp. 470-485, Oct. 1990.
-
(1990)
IEEE Trans. Parallel and Distributed Systems
, vol.1
, Issue.4
, pp. 470-485
-
-
King, C.-T.1
Chou, W.-H.2
Ni, L.3
-
21
-
-
0025505449
-
Pipelined data-parallel algorithms: Part 2-Design
-
Oct.
-
C.-T. King, W.-H. Chou, and L. Ni, "Pipelined Data-Parallel Algorithms: Part 2-Design," IEEE Trans. Parallel and Distributed Systems, vol. 1, no. 4, pp. 486-499, Oct. 1990.
-
(1990)
IEEE Trans. Parallel and Distributed Systems
, vol.1
, Issue.4
, pp. 486-499
-
-
King, C.-T.1
Chou, W.-H.2
Ni, L.3
-
22
-
-
0001875069
-
Scalable computing
-
J. van Leeuwen, ed. Springer Verlag
-
W.F. McColl, "Scalable Computing," Computer Science Today: Recent Trends and Developments, J. van Leeuwen, ed. Springer Verlag, vol. 1000, pp. 46-61, 1995.
-
(1995)
Computer Science Today: Recent Trends and Developments
, vol.1000
, pp. 46-61
-
-
McColl, W.F.1
-
23
-
-
0022482205
-
Partitioning and mapping algorithms into fixed size systolic arrays
-
Jan.
-
D.I. Moldovan and J.A.B. Fortes, "Partitioning and Mapping Algorithms into Fixed Size Systolic Arrays," IEEE Trans. Computers, vol. 35, no. 1, pp. 1-12, Jan. 1986.
-
(1986)
IEEE Trans. Computers
, vol.35
, Issue.1
, pp. 1-12
-
-
Moldovan, D.I.1
Fortes, J.A.B.2
-
24
-
-
0022614403
-
Computing size-independent matrix problems on systolic array processors
-
May
-
J.J. Navarro, J.M. Llabería, and M. Valero, "Computing Size-Independent Matrix Problems on Systolic Array Processors," Proc. Int'l Symp. Computer Architecture, no. 13, May 1986.
-
(1986)
Proc. Int'l Symp. Computer Architecture
, Issue.13
-
-
Navarro, J.J.1
Llabería, J.M.2
Valero, M.3
-
25
-
-
0029181784
-
Optimal tile size adjustment in compiling general DOACROSS loop nests
-
July
-
H. Ohta, Y. Saito, M. Kainaga, and H. Ono, "Optimal Tile Size Adjustment in Compiling General DOACROSS Loop Nests," Proc. Int'l Conf. Supercomputing, pp. 270-279, July 1995.
-
(1995)
Proc. Int'l Conf. Supercomputing
, pp. 270-279
-
-
Ohta, H.1
Saito, Y.2
Kainaga, M.3
Ono, H.4
-
26
-
-
84904334060
-
Communication optimizations used in the PARADIGM compiler for distributed memory multicomputers
-
D. Palermo, E. Su, J. Chandy, and P. Banerjee, "Communication Optimizations Used in the PARADIGM Compiler for Distributed Memory Multicomputers," Proc. Int'l Conf. Parallel Processing, Aug. 1994.
-
Proc. Int'l Conf. Parallel Processing, Aug. 1994
-
-
Palermo, D.1
Su, E.2
Chandy, J.3
Banerjee, P.4
-
27
-
-
0026274706
-
Tiling multidimensional iteration spaces for non shared-memory machines
-
J. Ramanujam and P. Sadayappan, "Tiling Multidimensional Iteration Spaces for Non Shared-Memory Machines," Supercomputing, pp. 111-120, 1991.
-
(1991)
Supercomputing
, pp. 111-120
-
-
Ramanujam, J.1
Sadayappan, P.2
-
28
-
-
0003929457
-
Automatic blocking of nested loops
-
Technical Report 90.38, RIACS, NASA Ames Research Center, Aug.
-
R. Schreiber and J. Dongarra, "Automatic Blocking of Nested Loops," Technical Report 90.38, RIACS, NASA Ames Research Center, Aug. 1990.
-
(1990)
-
-
Schreiber, R.1
Dongarra, J.2
-
30
-
-
0025467711
-
A bridging model for parallel computation
-
Aug.
-
L.G. Valiant, "A Bridging Model for Parallel Computation," Comm. ACM, vol. 33, no. 8, pp. 103-111, Aug. 1990.
-
(1990)
Comm. ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
33
-
-
4243740738
-
Time skewing for parallel computers
-
Technical Report TR-388, Dept. of Computer Science, Rutgers Univ., June
-
D. Wonnacott, "Time Skewing for Parallel Computers," Technical Report TR-388, Dept. of Computer Science, Rutgers Univ., June 1999.
-
(1999)
-
-
Wonnacott, D.1
-
34
-
-
0000703719
-
On tiling as a loop transformation
-
J. Xue, "On Tiling as a Loop Transformation," Parallel Processing Letters, vol. 7, no. 4, pp. 409-424, 1997.
-
(1997)
Parallel Processing Letters
, vol.7
, Issue.4
, pp. 409-424
-
-
Xue, J.1