-
1
-
-
0142134964
-
Optimal semi-oblique tiling
-
R. Andonov, S. Balev, S. V. Rajopadhye, and N. Yanev. Optimal semi-oblique tiling. IEEE Trans. Parallel Distrib. Syst., 14(9):944-960, 2003.
-
(2003)
IEEE Trans. Parallel Distrib. Syst
, vol.14
, Issue.9
, pp. 944-960
-
-
Andonov, R.1
Balev, S.2
Rajopadhye, S.V.3
Yanev, N.4
-
4
-
-
29244443735
-
-
M. Bromley, S. Heller, T. McNerney, and J. Guy L. Steele. Fortran at ten Gigaflops: the connection machine convolution compiler. In PLDI '91: Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation, 1991.
-
M. Bromley, S. Heller, T. McNerney, and J. Guy L. Steele. Fortran at ten Gigaflops: the connection machine convolution compiler. In PLDI '91: Proceedings of the ACM SIGPLAN 1991 conference on Programming language design and implementation, 1991.
-
-
-
-
5
-
-
0000209582
-
Regular partitioning for synthesizing fixed-size systolic arrays
-
A. Darte. Regular partitioning for synthesizing fixed-size systolic arrays. Integration, The VLSI J., 12(3):293-304, 1991.
-
(1991)
Integration, The VLSI J
, vol.12
, Issue.3
, pp. 293-304
-
-
Darte, A.1
-
6
-
-
4243166952
-
Tight bounds on cache use for stencil operations on rectangular grids
-
M. A. Frumkin and R. F. V. der Wijngaart. Tight bounds on cache use for stencil operations on rectangular grids. J. ACM, 49(3):434-453, 2002.
-
(2002)
J. ACM
, vol.49
, Issue.3
, pp. 434-453
-
-
Frumkin, M.A.1
der Wijngaart, R.F.V.2
-
7
-
-
0023379612
-
Solving pdes on loosely-coupled parallel processors
-
W. D. Gropp. Solving pdes on loosely-coupled parallel processors. Parallel Computing, 5(1-2):165-173, 1987.
-
(1987)
Parallel Computing
, vol.5
, Issue.1-2
, pp. 165-173
-
-
Gropp, W.D.1
-
9
-
-
0005875647
-
Hpfbench: A high performance fortran benchmark suite
-
Y. C. Hu, G. Jin, S. L. Johnsson, D. Kehagias, and N. Shalaby. Hpfbench: a high performance fortran benchmark suite. ACM Trans. Math. Softw., 26(1):99-149, 2000.
-
(2000)
ACM Trans. Math. Softw
, vol.26
, Issue.1
, pp. 99-149
-
-
Hu, Y.C.1
Jin, G.2
Johnsson, S.L.3
Kehagias, D.4
Shalaby, N.5
-
11
-
-
84958661690
-
Impact of modern memory subsystems on cache optimizations for stencil computations
-
S. Kamil, P. Husbands, L. Oliker, J. Shalf, and K. Yelick. Impact of modern memory subsystems on cache optimizations for stencil computations. In MSP '05: Proceedings of the 2005 workshop on Memory system performance, 2005.
-
(2005)
MSP '05: Proceedings of the 2005 workshop on Memory system performance
-
-
Kamil, S.1
Husbands, P.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
12
-
-
0001512318
-
The organization of computations for uniform recurrence equations
-
R. M. Karp, R. E. Miller, and S. Winograd. The organization of computations for uniform recurrence equations. J. ACM, 14(3):563-590, 1967.
-
(1967)
J. ACM
, vol.14
, Issue.3
, pp. 563-590
-
-
Karp, R.M.1
Miller, R.E.2
Winograd, S.3
-
13
-
-
0043048462
-
An infeasible interiorpoint algorithm for solving primal and dual geometric programs
-
K. O. Kortanek, X. Xu, and Y. Ye. An infeasible interiorpoint algorithm for solving primal and dual geometric programs. Math. Program., 76(1):155-181, 1997.
-
(1997)
Math. Program
, vol.76
, Issue.1
, pp. 155-181
-
-
Kortanek, K.O.1
Xu, X.2
Ye, Y.3
-
14
-
-
24644456455
-
Automatic tiling of iterative stencil loops
-
Z. Li and Y. Song. Automatic tiling of iterative stencil loops. ACM Trans. Program. Lang. Syst., 26(6):975-1028, 2004.
-
(2004)
ACM Trans. Program. Lang. Syst
, vol.26
, Issue.6
, pp. 975-1028
-
-
Li, Z.1
Song, Y.2
-
15
-
-
20344396845
-
YALMIP : A toolbox for modeling and optimization in MATLAB
-
J. Löfberg. YALMIP : A toolbox for modeling and optimization in MATLAB. In Proceedings of the CACSD Conference, 2004.
-
(2004)
Proceedings of the CACSD Conference
-
-
Löfberg, J.1
-
16
-
-
0032308685
-
Quantifying the multi-level nature of tiling interactions
-
N. Mitchell, K. Högstedt, L. Carter, and J. Ferrante. Quantifying the multi-level nature of tiling interactions. International J. of Parallel Programming, 26(6):641-670, 1998.
-
(1998)
International J. of Parallel Programming
, vol.26
, Issue.6
, pp. 641-670
-
-
Mitchell, N.1
Högstedt, K.2
Carter, L.3
Ferrante, J.4
-
17
-
-
0022482205
-
Partitioning and mapping algorithms into fixed size systolic arrays
-
351, 12
-
D. I. Moldovan and J. A. B. Fortes. Partitioning and mapping algorithms into fixed size systolic arrays. IEEE Trans. Comput., 35(1)--12, 1986.
-
(1986)
IEEE Trans. Comput
-
-
Moldovan, D.I.1
Fortes, J.A.B.2
-
18
-
-
2442670256
-
-
Available from
-
NAS Parallel Benchmarks. Available from http://www.netlib.org/parkbench/.
-
NAS Parallel Benchmarks
-
-
-
19
-
-
34548743372
-
-
PARKBENCH:, Available from
-
PARKBENCH: PARallel Kernels and BENCHmarks. Available from http://www.netlib.org/parkbench/.
-
PARallel Kernels and BENCHmarks
-
-
-
20
-
-
51249173427
-
The mapping of linear recurrence equations on regular arrays
-
P. Quinton and V. Van Dongen. The mapping of linear recurrence equations on regular arrays. Journal of VLSI Signal Processing, 1(2):95-113, 1989.
-
(1989)
Journal of VLSI Signal Processing
, vol.1
, Issue.2
, pp. 95-113
-
-
Quinton, P.1
Van Dongen, V.2
-
21
-
-
0025446495
-
Synthesizing systolic arrays from recurrence equations
-
June
-
S. V. Rajopadhye and R. M. Fujimoto. Synthesizing systolic arrays from recurrence equations. Parallel Computing, 14:163-189, June 1990.
-
(1990)
Parallel Computing
, vol.14
, pp. 163-189
-
-
Rajopadhye, S.V.1
Fujimoto, R.M.2
-
30
-
-
1542392248
-
Achieving scalable locality with time skewing
-
D. Wonnacott. Achieving scalable locality with time skewing. Int. J. Parallel Program., 30(3):181-221, 2002.
-
(2002)
Int. J. Parallel Program
, vol.30
, Issue.3
, pp. 181-221
-
-
Wonnacott, D.1
-
31
-
-
0000703719
-
On tiling as a loop transformation
-
J. Xue. On tiling as a loop transformation. Parallel Processing Letters, 7(4):409-424, 1997.
-
(1997)
Parallel Processing Letters
, vol.7
, Issue.4
, pp. 409-424
-
-
Xue, J.1
|