-
2
-
-
84947997685
-
Efficient code generation for automatic parallelization and optimization
-
Ljubljana, October
-
C. Bastoul. Efficient code generation for automatic parallelization and optimization. In ISPDC'03 IEEE Intl. Symp. on Parallel and Distributed Computing, pages 23-30, Ljubljana, October 2003.
-
(2003)
ISPDC'03 IEEE Intl. Symp. on Parallel and Distributed Computing
, pp. 23-30
-
-
Bastoul, C.1
-
4
-
-
35248818778
-
Improving data locality by chunking
-
Warsaw, april
-
C. Bastoul and P. Feautrier. Improving data locality by chunking. In CC'12 Intl. Conf. on Compiler Construction, LNCS 2622, pages 320-335, Warsaw, april 2003.
-
(2003)
CC'12 Intl. Conf. on Compiler Construction, LNCS
, vol.2622
, pp. 320-335
-
-
Bastoul, C.1
Feautrier, P.2
-
5
-
-
0032066690
-
Loop parallelization algorithms: From parallelism extraction to code generation
-
P. Boulet, A. Darte, G.-A. Silber, and F. Vivien. Loop parallelization algorithms: From parallelism extraction to code generation. Parallel Computing, 24(3):421-444, 1998.
-
(1998)
Parallel Computing
, vol.24
, Issue.3
, pp. 421-444
-
-
Boulet, P.1
Darte, A.2
Silber, G.-A.3
Vivien, F.4
-
6
-
-
32844473507
-
Facilitating the search for compositions of program transformations
-
Cambridge, June
-
A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam, and N. Vasilache. Facilitating the search for compositions of program transformations. In ACM ICS'05 International Conference on Supercomputing, pages 151-160, Cambridge, June 2005.
-
(2005)
ACM ICS'05 International Conference on Supercomputing
, pp. 151-160
-
-
Cohen, A.1
Girbal, S.2
Parello, D.3
Sigler, M.4
Temam, O.5
Vasilache, N.6
-
7
-
-
0001900752
-
Maximization of a linear function of variables subject to linear inequalities
-
T. Koopmans, editor, New York. John Wiley & Sons, Inc.
-
G. Dantzig. Maximization of a linear function of variables subject to linear inequalities. In T. Koopmans, editor, Activity Analysis of Production and Allocation, Cowles Commission Monograph No. 13, pages 339-347, New York, 1951. John Wiley & Sons, Inc.
-
(1951)
Activity Analysis of Production and Allocation, Cowles Commission Monograph No. 13
, vol.13
, pp. 339-347
-
-
Dantzig, G.1
-
8
-
-
0028436786
-
Mapping uniform loop nests onto distributed memory architectures
-
A. Darte and Y. Robert. Mapping uniform loop nests onto distributed memory architectures. Parallel Computing, 20(5)-.679-710, 1994.
-
(1994)
Parallel Computing
, vol.20
, Issue.5
, pp. 679-710
-
-
Darte, A.1
Robert, Y.2
-
9
-
-
0026109335
-
Dataflow analysis of scalar and array references
-
february
-
P. Feautrier. Dataflow analysis of scalar and array references. International Journal of Parallel Programming, 20(1):23-53, february 1991.
-
(1991)
International Journal of Parallel Programming
, vol.20
, Issue.1
, pp. 23-53
-
-
Feautrier, P.1
-
10
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem, part II: Multidimensional time
-
december
-
P. Feautrier. Some efficient solutions to the affine scheduling problem, part II: multidimensional time. Int. Journal of Parallel Programming, 21(6):389-420, december 1992.
-
(1992)
Int. Journal of Parallel Programming
, vol.21
, Issue.6
, pp. 389-420
-
-
Feautrier, P.1
-
11
-
-
14844346973
-
A complete compiler approach to auto-parallelizing c programs for Multi-DSP systems
-
march
-
B. Franke and M. O'Boyle. A complete compiler approach to auto-parallelizing c programs for Multi-DSP systems. IEEE Transactions on Parallel and Distributed Systems (TPDS), 16(3):234-245, march 2005.
-
(2005)
IEEE Transactions on Parallel and Distributed Systems (TPDS)
, vol.16
, Issue.3
, pp. 234-245
-
-
Franke, B.1
O'Boyle, M.2
-
13
-
-
84858920684
-
A case study of design space exploration for embedded multimedia applications in SoCs
-
CRI - École des Mines de Paris, february
-
I. Hurbain, C. Ancourt, F. Irigoin, M. Barreteau, J. Mattioli, and F. Paquier. A case study of design space exploration for embedded multimedia applications in SoCs. Technical Report A-361, CRI - École des Mines de Paris, february 2005.
-
(2005)
Technical Report
, vol.A-361
-
-
Hurbain, I.1
Ancourt, C.2
Irigoin, F.3
Barreteau, M.4
Mattioli, J.5
Paquier, F.6
-
14
-
-
0041562664
-
Programmable stream processors
-
august
-
U. Kapasi, S. Rixner, W. Dally, B. Khailany, J. Ho Ahn, P. Mattson, and J. Owens. Programmable stream processors. IEEE Computer, 36(8):54-62, august 2003.
-
(2003)
IEEE Computer
, vol.36
, Issue.8
, pp. 54-62
-
-
Kapasi, U.1
Rixner, S.2
Dally, W.3
Khailany, B.4
Ahn, J.H.5
Mattson, P.6
Owens, J.7
-
15
-
-
0004261309
-
A framework for unifying reordering transformations
-
University of Maryland
-
W. Kelly and W. Pugh. A framework for unifying reordering transformations. Technical Report CS-TR-3193, University of Maryland, 1993.
-
(1993)
Technical Report
, vol.CS-TR-3193
-
-
Kelly, W.1
Pugh, W.2
-
18
-
-
0004181680
-
A note on Chernikova's algorithm
-
IRISA
-
H. Le Verge. A note on Chernikova's algorithm. Technical Report 635, IRISA, 1992.
-
(1992)
Technical Report
, vol.635
-
-
Le Verge, H.1
-
19
-
-
85029516676
-
Loop parallelization in the polytope model
-
Hildesheim, August
-
C. Lengauer. Loop parallelization in the polytope model. In Int. Conf. on Concurrency Theory, LNCS 715, pages 398-416, Hildesheim, August 1993.
-
(1993)
Int. Conf. on Concurrency Theory, LNCS
, vol.715
, pp. 398-416
-
-
Lengauer, C.1
-
20
-
-
0028409782
-
A singular loop transformation framework based on nonsingular matrices
-
April
-
W. Li and K. Pingali. A singular loop transformation framework based on nonsingular matrices. International Journal of Parallel Programming, 22(2):183-205, April 1994.
-
(1994)
International Journal of Parallel Programming
, vol.22
, Issue.2
, pp. 183-205
-
-
Li, W.1
Pingali, K.2
-
21
-
-
0030645995
-
Maximizing parallelism and minimizing synchronization with affine transforms
-
Paris, January
-
A. Lim and M. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In PoPL'24 ACM Symp. on Principles of Programming Languages, pages 201-214, Paris, January 1997.
-
(1997)
PoPL'24 ACM Symp. on Principles of Programming Languages
, pp. 201-214
-
-
Lim, A.1
Lam, M.2
-
22
-
-
35048822159
-
Optimizing cache access: A tool for source-to-source transformations and real-life compiler tests
-
Pisa, august
-
R. Müller-Pfefferkorn, W. Nagel, and B. Trenkler. Optimizing cache access: A tool for source-to-source transformations and real-life compiler tests. In Euro-Par 2004 Parallel Processing, 10th International Euro-Par Conference, pages 72-81, Pisa, august 2004.
-
(2004)
Euro-par 2004 Parallel Processing, 10th International Euro-par Conference
-
-
Müller-Pfefferkorn, R.1
Nagel, W.2
Trenkler, B.3
-
23
-
-
0026278958
-
The omega test: A fast and practical integer programming algorithm for dependence analysis
-
Albuquerque, august
-
W. Pugh. The omega test: a fast and practical integer programming algorithm for dependence analysis. In Proceedings of the third ACM/IEEE conference on Supercomputing, pages 4-13, Albuquerque, august 1991.
-
(1991)
Proceedings of the Third ACM/IEEE Conference on Supercomputing
, pp. 4-13
-
-
Pugh, W.1
-
25
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
October
-
F. Quilleré, S. Rajopadhye, and D. Wilde. Generation of efficient nested loops from polyhedra. International Journal of Parallel Programming, 28(5).-469-498, October 2000.
-
(2000)
International Journal of Parallel Programming
, vol.28
, Issue.5
, pp. 469-498
-
-
Quilleré, F.1
Rajopadhye, S.2
Wilde, D.3
-
26
-
-
0029518016
-
Beyond unimodular transformations
-
J. Ramanujam. Beyond unimodular transformations. J. of Supercomputing, 9(4):365-389, 1995.
-
(1995)
J. of Supercomputing
, vol.9
, Issue.4
, pp. 365-389
-
-
Ramanujam, J.1
-
29
-
-
0028434044
-
Automating non-unimodular loop transformations for massive parallelism
-
J. Xue. Automating non-unimodular loop transformations for massive parallelism. Parallel Computing, 20(5):711-728, 1994.
-
(1994)
Parallel Computing
, vol.20
, Issue.5
, pp. 711-728
-
-
Xue, J.1
|