-
2
-
-
79951702599
-
Efficient selection of vector instructions using dynamic programming
-
R. Barik, J. Zhao, and V. Sarkar. Efficient selection of vector instructions using dynamic programming. In Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO'43, pages 201-212, 2010.
-
(2010)
Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO'43
, pp. 201-212
-
-
Barik, R.1
Zhao, J.2
Sarkar, V.3
-
4
-
-
0001483604
-
Communication optimizations for irregular scientific computations on distributed memory architectures
-
Sep.
-
R. Das, M. Uysal, J. Saltz, and Y.-S. Hwang. Communication optimizations for irregular scientific computations on distributed memory architectures. J. Parallel Distrib. Comput., 22:462-478, Sep. 1994.
-
(1994)
J. Parallel Distrib. Comput.
, vol.22
, pp. 462-478
-
-
Das, R.1
Uysal, M.2
Saltz, J.3
Hwang, Y.-S.4
-
5
-
-
0033872689
-
AltiVec extension to PowerPC accelerates media processing
-
DOI 10.1109/40.848475
-
K. Diefendorff, P. K. Dubey, R. Hochsprung, and H. Scales. AltiVec extension to PowerPC accelerates media processing. IEEE Micro, 20:85-95, Mar./Apr. 2000. (Pubitemid 30585387)
-
(2000)
IEEE Micro
, vol.20
, Issue.2
, pp. 85-95
-
-
Diefendorff, K.1
Dubey, P.K.2
Hochsprung, R.3
Scales, H.4
-
6
-
-
4544335844
-
Vectorization for SIMD architectures with alignment constraints
-
A. E. Eichenberger, P. Wu, and K. O'Brien. Vectorization for SIMD architectures with alignment constraints. In Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation, PLDI'04, pages 82-93, 2004.
-
(2004)
Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation, PLDI'04
, pp. 82-93
-
-
Eichenberger, A.E.1
Wu, P.2
O'Brien, K.3
-
7
-
-
84871776762
-
Polly - polyhedral optimization in llvm
-
T. Grosser, H. Zheng, R. A, A. Simburger, A. Grosslinger, and L.-N. Pouchet. Polly - polyhedral optimization in llvm. In First International Workshop on Polyhedral Compilation Techniques (IMPACT'11), 2011.
-
(2011)
First International Workshop on Polyhedral Compilation Techniques (IMPACT'11)
-
-
Grosser, T.1
Zheng, H.2
Simburger, A.3
Grosslinger, A.4
Pouchet, L.-N.5
-
8
-
-
33646015987
-
Synergistic processing in Cell's multicore architecture
-
Mar.
-
M. Gschwind, H. P. Hofstee, B. Flachs, M. Hopkins, Y. Watanabe, and T. Yamazaki. Synergistic processing in Cell's multicore architecture. IEEE Micro, 26:10-24, Mar. 2006.
-
(2006)
IEEE Micro
, vol.26
, pp. 10-24
-
-
Gschwind, M.1
Hofstee, H.P.2
Flachs, B.3
Hopkins, M.4
Watanabe, Y.5
Yamazaki, T.6
-
9
-
-
36849034066
-
SPEC CPU2006 benchmark descriptions
-
Sep.
-
J. L. Henning. SPEC CPU2006 benchmark descriptions. SIGARCH Comput. Archit. News, 34:1-17, Sep. 2006.
-
(2006)
SIGARCH Comput. Archit. News
, vol.34
, pp. 1-17
-
-
Henning, J.L.1
-
10
-
-
0034250996
-
Compilation techniques for multimedia processors
-
Aug.
-
A. Krall and S. Lelait. Compilation techniques for multimedia processors. Int. J. Parallel Program., 28:347-361, Aug. 2000.
-
(2000)
Int. J. Parallel Program.
, vol.28
, pp. 347-361
-
-
Krall, A.1
Lelait, S.2
-
12
-
-
33749373820
-
Exploiting vector parallelism in software pipelined loops
-
S. Larsen, R. Rabbah, and S. Amarasinghe. Exploiting vector parallelism in software pipelined loops. In Proceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture, MICRO 38, pages 119-129, 2005.
-
(2005)
Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 38
, pp. 119-129
-
-
Larsen, S.1
Rabbah, R.2
Amarasinghe, S.3
-
13
-
-
31844445061
-
-
PhD thesis, Computer Science Dept. University of Illinois at Urbana-Champaign, Urbana, IL, May, [online
-
C. Lattner. Macroscopic Data Structure Analysis and Optimization. PhD thesis, Computer Science Dept., University of Illinois at Urbana-Champaign, Urbana, IL, May 2005. [online] http://llvm.cs.uiuc.edu.
-
(2005)
Macroscopic Data Structure Analysis and Optimization
-
-
Lattner, C.1
-
16
-
-
4544372264
-
Vectorizing for a SIMdD DSP architecture
-
CASES 2003: International Conference on Compilers, Architecture, and Synthesis for Embedded Systems
-
D. Naishlos, M. Biberstein, S. Ben-David, and A. Zaks. Vectorizing for a SIMdD DSP architecture. In Proceedings of the 2003 International Conference on Compilers, Architecture and Synthesis for Embedded Systems, CASES'03, pages 2-11, 2003. (Pubitemid 40682144)
-
(2003)
CASES 2003: International Conference on Compilers, Architecture, and Synthesis for Embedded Systems
, pp. 2-11
-
-
Naishlos, D.1
Biberstein, M.2
Ben-David, S.3
Zaks, A.4
-
17
-
-
33745205861
-
Auto-vectorization of interleaved data for SIMD
-
D. Nuzman, I. Rosen, and A. Zaks. Auto-vectorization of interleaved data for SIMD. In Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI'06, pages 132-143, 2006.
-
(2006)
Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI'06
, pp. 132-143
-
-
Nuzman, D.1
Rosen, I.2
Zaks, A.3
-
18
-
-
33745223764
-
Pointer alignment analysis for processors with SIMD instructions
-
I. Pryanishnikov, A. Krall, T. U. Wien, and N. Horspool. Pointer alignment analysis for processors with SIMD instructions. In Proceedings of the 5th Workshop on Media and Streaming Processors, pages 50-57, 2003.
-
(2003)
Proceedings of the 5th Workshop on Media and Streaming Processors
, pp. 50-57
-
-
Pryanishnikov, I.1
Krall, A.2
Wien, T.U.3
Horspool, N.4
-
20
-
-
33745222449
-
Optimizing data permutations for SIMD devices
-
G. Ren, P. Wu, and D. Padua. Optimizing data permutations for SIMD devices. In Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI'06, pages 118-131, 2006.
-
(2006)
Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI'06
, pp. 118-131
-
-
Ren, G.1
Wu, P.2
Padua, D.3
-
23
-
-
0034249157
-
A vectorizing compiler for multimedia extensions
-
Aug.
-
N. Sreraman and R. Govindarajan. A vectorizing compiler for multimedia extensions. Int. J. Parallel Program., 28:363-400, Aug. 2000.
-
(2000)
Int. J. Parallel Program.
, vol.28
, pp. 363-400
-
-
Sreraman, N.1
Govindarajan, R.2
-
24
-
-
0001790593
-
Depth-first search and linear graph algorithms
-
R. Tarjan. Depth-first search and linear graph algorithms. SIAM Journal on Computing, 1(2):146-160, 1972.
-
(1972)
SIAM Journal on Computing
, vol.1
, Issue.2
, pp. 146-160
-
-
Tarjan, R.1
-
26
-
-
32844466554
-
An integrated simdization framework using virtual vectors
-
ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
-
P. Wu, A. E. Eichenberger, A. Wang, and P. Zhao. An integrated simdization framework using virtual vectors. In Proceedings of the 19th annual International Conference on Supercomputing, ICS'05, pages 169-178, 2005. (Pubitemid 43251321)
-
(2005)
Proceedings of the International Conference on Supercomputing
, pp. 169-178
-
-
Wu, P.1
Eichenberger, A.E.2
Wang, A.3
Zhao, P.4