-
2
-
-
0346032593
-
Advanced code generation for high performance Fortran
-
Compiler Optimizations for Scalable Parallel Systems: Languages, Compilation Techniques, and Run Time Systems, Springer-Verlag
-
Adve V., and Mellor-Crummey J. Advanced code generation for high performance Fortran. Compiler Optimizations for Scalable Parallel Systems: Languages, Compilation Techniques, and Run Time Systems. Lecture Notes in Computer Science Series (2001), Springer-Verlag 553-596
-
(2001)
Lecture Notes in Computer Science Series
, pp. 553-596
-
-
Adve, V.1
Mellor-Crummey, J.2
-
3
-
-
84976724523
-
-
S.P. Amarasinghe, M.S. Lam, Communication optimization and code generation for distributed memory machines, in: Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, Albuquerque, New Mexico, USA, June 1993.
-
-
-
-
4
-
-
33750611200
-
-
R. Andonov, P. Calland, S. Niar, S. Rajopadhye, N. Yanev, First steps towards optimal oblique tile sizing, in: Proceedings of the 8th International Workshop on Compilers for Parallel Computers, Aussois, January 2000, pp. 351-366.
-
-
-
-
5
-
-
0042493449
-
Optimal scheduling for UET/UET-UCT generalized N-dimensional grid task graphs
-
Andronikos T., Koziris N., Papakonstantinou G., and Tsanakas P. Optimal scheduling for UET/UET-UCT generalized N-dimensional grid task graphs. Journal of Parallel and Distributed Computing 57 2 (1999) 140-165
-
(1999)
Journal of Parallel and Distributed Computing
, vol.57
, Issue.2
, pp. 140-165
-
-
Andronikos, T.1
Koziris, N.2
Papakonstantinou, G.3
Tsanakas, P.4
-
7
-
-
0032098025
-
On the removal of anti and output dependences
-
Calland P.Y., Darte A., Robert Y., and Vivien F. On the removal of anti and output dependences. International Journal of Parallel Programming 26 2 (1998) 285-312
-
(1998)
International Journal of Parallel Programming
, vol.26
, Issue.2
, pp. 285-312
-
-
Calland, P.Y.1
Darte, A.2
Robert, Y.3
Vivien, F.4
-
9
-
-
0032028841
-
Determining the idle time of a tiling: new results
-
Desprez F., Dongarra J., Rastello F., and Robert Y. Determining the idle time of a tiling: new results. Journal of Information Science and Engineering 14 1 (1997) 167-190
-
(1997)
Journal of Information Science and Engineering
, vol.14
, Issue.1
, pp. 167-190
-
-
Desprez, F.1
Dongarra, J.2
Rastello, F.3
Robert, Y.4
-
11
-
-
33750624678
-
-
G. Fox, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, C. Tseng, M. Wu, Fortran-D language specification, Technical Report TR-91-170, Department of Computer Science, Rice University, December 1991.
-
-
-
-
13
-
-
84981274197
-
-
G. Goumas, A. Sotiropoulos, N. Koziris, Minimizing completion time for loop tiling with computation and communication overlapping, in: Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS'01), San Francisco, April 2001.
-
-
-
-
16
-
-
0030651937
-
-
K. Högstedt, L. Carter, J. Ferrante, Determining the idle time of a tiling, in: Proceedings of the 24th ACM Symposium on Principles of Programming Languages (POPL), January 1997, pp. 160-173.
-
-
-
-
17
-
-
0032642196
-
-
K. Högstedt, L. Carter, J. Ferrante, Selecting tile shape for minimal execution time, in: Proceedings of the ACM Symposium on Parallel Algorithms and Architectures, 1999, pp. 201-211.
-
-
-
-
19
-
-
84976859541
-
-
M. Lam, E. Rothberg, M. Wolf, The cache performance and optimizations of blocked algorithms, in: Proceedings of the 4th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Santa Clara, California, USA, April 1991, pp. 63-74.
-
-
-
-
21
-
-
0022874874
-
Advanced compiler optimizations for supercomputers
-
Padua D., and Wolfe W. Advanced compiler optimizations for supercomputers. Communications of the ACM 29 12 (1986) 1184-1201
-
(1986)
Communications of the ACM
, vol.29
, Issue.12
, pp. 1184-1201
-
-
Padua, D.1
Wolfe, W.2
-
22
-
-
0004161838
-
-
Cambridge University Press, New York, NY, USA
-
Press W., Teukolsky S., Vetterling W., and Flannery B. Numerical Recipes in C: The Art of Scientific Computing (1992), Cambridge University Press, New York, NY, USA
-
(1992)
Numerical Recipes in C: The Art of Scientific Computing
-
-
Press, W.1
Teukolsky, S.2
Vetterling, W.3
Flannery, B.4
-
24
-
-
0026821247
-
Independent partitioning of algorithms with uniform dependencies
-
Shang W., and Fortes J.A.B. Independent partitioning of algorithms with uniform dependencies. IEEE Transactions on Computers 41 2 (1992) 190-206
-
(1992)
IEEE Transactions on Computers
, vol.41
, Issue.2
, pp. 190-206
-
-
Shang, W.1
Fortes, J.A.B.2
-
25
-
-
0029190371
-
-
E. Su, A. Lain, S. Ramaswamy, D.J. Palermo, E.W. Hodges, P. Banerjee, Advanced compilation techniques in the PARADIGM compiler for distributed memory multicomputers, in: Proceedings of the 9th ACM International Conference on Supercomputing (ICS), Madrid, Spain, July 1995, pp. 424-433.
-
-
-
-
26
-
-
0000778059
-
Generating efficient tiled code for distributed memory machines
-
Tang P., and Xue J. Generating efficient tiled code for distributed memory machines. Parallel Computing 26 11 (2000) 1369-1410
-
(2000)
Parallel Computing
, vol.26
, Issue.11
, pp. 1369-1410
-
-
Tang, P.1
Xue, J.2
-
27
-
-
0004012752
-
-
Prentice-Hall, Inc., Upper Saddle River, NJ, USA
-
Wilkinson B., and Allen M. Parallel Programming: Techniques and Applications using Networked Workstations and Parallel Computers (1999), Prentice-Hall, Inc., Upper Saddle River, NJ, USA
-
(1999)
Parallel Programming: Techniques and Applications using Networked Workstations and Parallel Computers
-
-
Wilkinson, B.1
Allen, M.2
-
28
-
-
85013942562
-
-
M. Wolf, M. Lam, A data locality optimizing algorithm, in: Proceedings of the ACM SIGPLAN'91 Conference on Programming Language Design and Implementation (PLDI), Toronto, Ontario, Canada, June 1991, pp. 30-44.
-
-
-
-
29
-
-
0026232450
-
A loop transformation theory and an algorithm to maximize parallelism
-
Wolf M., and Lam M. A loop transformation theory and an algorithm to maximize parallelism. IEEE Transactions on Parallel and Distributed Systems 2 4 (1991) 452-471
-
(1991)
IEEE Transactions on Parallel and Distributed Systems
, vol.2
, Issue.4
, pp. 452-471
-
-
Wolf, M.1
Lam, M.2
-
30
-
-
0003125942
-
Communication-minimal tiling of uniform dependence loops
-
Xue J. Communication-minimal tiling of uniform dependence loops. Journal of Parallel and Distributed Computing 42 1 (1997) 42-59
-
(1997)
Journal of Parallel and Distributed Computing
, vol.42
, Issue.1
, pp. 42-59
-
-
Xue, J.1
-
31
-
-
0036601528
-
Time-minimal tiling when rise is larger than zero
-
Xue J., and Cai W. Time-minimal tiling when rise is larger than zero. Parallel Computing 28 6 (2002) 915-939
-
(2002)
Parallel Computing
, vol.28
, Issue.6
, pp. 915-939
-
-
Xue, J.1
Cai, W.2
|