-
1
-
-
32844466554
-
An integrated simdization framework using virtual vectors
-
P. Wu, A. E. Eichenberger, A. Wang, and P. Zhao, "An integrated Simdization framework using virtual vectors," in ICS, 2005.
-
(2005)
ICS
-
-
Wu, P.1
Eichenberger, A.E.2
Wang, A.3
Zhao, P.4
-
3
-
-
0344908850
-
Automatic intra-register vectorization for the intel architecture
-
A. J. C. Bik, M. Girkar, P. M. Grey, and X. Tian, "Automatic intra-register vectorization for the Intel architecture," IJPP, vol. 30, no. 2, pp. 65-98, 2002.
-
(2002)
IJPP
, vol.30
, Issue.2
, pp. 65-98
-
-
Bik, A.J.C.1
Girkar, M.2
Grey, P.M.3
Tian, X.4
-
4
-
-
37149019455
-
Autovectorization in GCC - Two years later
-
June
-
D. Nuzman and A. Zaks, "Autovectorization in GCC - two years later," in the GCC Developer's summit, June 2006.
-
(2006)
GCC Developer's summit
-
-
Nuzman, D.1
Zaks, A.2
-
5
-
-
33646554301
-
Superword-level parallelism in the presence of control flow
-
March
-
J. Shin, M. Hall, and J. Chame, "Superword-level parallelism in the presence of control flow," in CGO, March 2005.
-
(2005)
CGO
-
-
Shin, J.1
Hall, M.2
Chame, J.3
-
6
-
-
63549093768
-
Outer-loop vectorization - Revisited for short SIMD architectures
-
October
-
D. Nuzman and A. Zaks, "Outer-loop vectorization - revisited for short SIMD architectures," in PACT, October 2008.
-
(2008)
PACT
-
-
Nuzman, D.1
Zaks, A.2
-
8
-
-
74049164978
-
A practical automatic polyhedral parallelization and locality optimization system
-
Jun.
-
U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan, "A practical automatic polyhedral parallelization and locality optimization system," in PLDI, Jun. 2008.
-
(2008)
PLDI
-
-
Bondhugula, U.1
Hartono, A.2
Ramanujam, J.3
Sadayappan, P.4
-
9
-
-
57349167317
-
Iterative optimization in the polyhedral model: Part II, multidimensional time
-
Jun.
-
L.-N. Pouchet, C. Bastoul, A. Cohen, and J. Cavazos, "Iterative optimization in the polyhedral model: Part II, multidimensional time," in PLDI, Jun. 2008.
-
(2008)
PLDI
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Cavazos, J.4
-
10
-
-
0023438847
-
Automatic translation of fortran programs to vector form
-
R. Allen and K. Kennedy, "Automatic translation of fortran programs to vector form," ACM Tr. on Prog. Lang. and Systems, vol. 9, no. 4, pp. 491-542, 1987.
-
(1987)
ACM Tr. on Prog. Lang. and Systems
, vol.9
, Issue.4
, pp. 491-542
-
-
Allen, R.1
Kennedy, K.2
-
12
-
-
84948740064
-
Compiler-controlled caching in superword register files for multimedia extension architectures
-
September
-
J. Shin, J. Chame, and M. W. Hall, "Compiler-controlled caching in superword register files for multimedia extension architectures," in PACT, September 2002.
-
(2002)
PACT
-
-
Shin, J.1
Chame, J.2
Hall, M.W.3
-
13
-
-
33746034953
-
Auto-vectorization of interleaved data for simd
-
D. Nuzman, I. Rosen, and A. Zaks, "Auto-vectorization of interleaved data for simd," in PLDI, 2006.
-
(2006)
PLDI
-
-
Nuzman, D.1
Rosen, I.2
Zaks, A.3
-
15
-
-
70449667592
-
Compile-time based performance prediction
-
C. Cascaval, L. Derose, D. A. Padua, and D. A. Reed, "Compile-time based performance prediction," in LCPC, 1999.
-
(1999)
LCPC
-
-
Cascaval, C.1
Derose, L.2
Padua, D.A.3
Reed, D.A.4
-
16
-
-
0037340135
-
Probabilistic miss equations: Evaluating memory hierarchy performance
-
B. B. Fraguela, R. Doallo, and E. L. Zapata, "Probabilistic miss equations: Evaluating memory hierarchy performance," IEEE Trans. Comput., vol. 52, no. 3, pp. 321-336, 2003.
-
(2003)
IEEE Trans. Comput.
, vol.52
, Issue.3
, pp. 321-336
-
-
Fraguela, B.B.1
Doallo, R.2
Zapata, E.L.3
-
17
-
-
84958731989
-
Array expansion
-
St. Malo, France, Jul.
-
P. Feautrier, "Array expansion," in ICS, St. Malo, France, Jul. 1988.
-
(1988)
ICS
-
-
Feautrier, P.1
-
18
-
-
33746593747
-
Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies
-
Jun. special issue on Microgrids.
-
S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam, "Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies," Intl. J. of Parallel Programming, vol. 34, no. 3, pp. 261-317, Jun. 2006, special issue on Microgrids.
-
(2006)
Intl. J. of Parallel Programming
, vol.34
, Issue.3
, pp. 261-317
-
-
Girbal, S.1
Vasilache, N.2
Bastoul, C.3
Cohen, A.4
Parello, D.5
Sigler, M.6
Temam, O.7
-
19
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem, part II, multidimensional time
-
Dec.
-
P. Feautrier, "Some efficient solutions to the affine scheduling problem, part II, multidimensional time," Intl. J. of Parallel Programming, vol. 21, no. 6, pp. 389-420, Dec. 1992
-
(1992)
Intl. J. of Parallel Programming
, vol.21
, Issue.6
, pp. 389-420
-
-
Feautrier, P.1
-
20
-
-
35048864273
-
-
see also Part I
-
see also Part I, one dimensional time, 21(5):315-348.
-
One Dimensional Time
, vol.21
, Issue.5
, pp. 315-348
-
-
-
22
-
-
0030645995
-
Maximizing parallelism and minimizing synchronization with affine transforms
-
Paris, Jan.
-
A. Lim and M. Lam, "Maximizing parallelism and minimizing synchronization with affine transforms," in PoPL'24, Paris, Jan. 1997, pp. 201-214.
-
(1997)
PoPL'24
, pp. 201-214
-
-
Lim, A.1
Lam, M.2
-
23
-
-
10444289646
-
Code generation in the polyhedral model is easier than you think
-
Sep.
-
C. Bastoul, "Code generation in the polyhedral model is easier than you think," in PACT, Sep. 2004.
-
(2004)
PACT
-
-
Bastoul, C.1
-
24
-
-
63549147948
-
-
C. G. Lee, "UTDSP benchmarks," http://www.eecg.toronto.edu/ ~corinna/DSP/infrastructure/UTDSP.html, 1998.
-
(1998)
UTDSP Benchmarks
-
-
Lee, C.G.1
|