-
1
-
-
34548031154
-
Tradeoff between data-, instruction-, and thread-level parallelism in stream processors
-
J. H. Ahn, M. Erez, and W. J. Dally, "Tradeoff between data-, instruction-, and thread-level parallelism in stream processors," in Proc. ICS'07, 2007, pp. 126-137.
-
(2007)
Proc. ICS'07
, pp. 126-137
-
-
Ahn, J.H.1
Erez, M.2
Dally, W.J.3
-
2
-
-
85133420235
-
-
San Mateo, CA: Morgan Kaufmann
-
J. A. Fisher, P. Faraboschi, and C. Young, Embedded Computing: A VLIW Approach to Architecture, Compilers and Tools. San Mateo, CA: Morgan Kaufmann, 2004.
-
(2004)
Embedded Computing: A VLIW Approach to Architecture, Compilers and Tools
-
-
Fisher, J.A.1
Faraboschi, P.2
Young, C.3
-
3
-
-
43449128019
-
NVIDIA CUDA software and GPU parallel computing architecture
-
Oct
-
J. Nickolls and I. Buck, "NVIDIA CUDA software and GPU parallel computing architecture," in Proc. Microprocessor Forum, Oct. 2007, pp. 103-104.
-
(2007)
Proc. Microprocessor Forum
, pp. 103-104
-
-
Nickolls, J.1
Buck, I.2
-
4
-
-
57349172004
-
Biomedical image analysis on a cooperative cluster of GPUs and multicores
-
T. D. R. Hartley, U. Catalyurek, A. Ruiz, F. Igual, R. Mayo, and M. Ujaldon, "Biomedical image analysis on a cooperative cluster of GPUs and multicores," in Proc. 2008 Int. Conf. Supercomputing, pp. 15-25.
-
Proc. 2008 Int. Conf. Supercomputing
, pp. 15-25
-
-
Hartley, T.D.R.1
Catalyurek, U.2
Ruiz, A.3
Igual, F.4
Mayo, R.5
Ujaldon, M.6
-
5
-
-
62349090240
-
Non-rigid registration for large sets of microscopic images on graphics processors
-
Apr
-
A. Ruiz, M. Ujaldon, L. Cooper, and K. Huang, "Non-rigid registration for large sets of microscopic images on graphics processors," J. Signal Process. Syst., vol. 55, no. 1-3, pp. 229-250, Apr. 2008.
-
(2008)
J. Signal Process. Syst
, vol.55
, Issue.1-3
, pp. 229-250
-
-
Ruiz, A.1
Ujaldon, M.2
Cooper, L.3
Huang, K.4
-
6
-
-
63149128672
-
Larrabee: A many-core x86 architecture for visual computing
-
Jan./Feb
-
L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, P. Dubey, S. Junkins, A. Lake, R. Cavin, R. Espasa, E. Grochowski, T. Juan, M. Abrash, J. Sugerman, and P. Hanrahan, "Larrabee: A many-core x86 architecture for visual computing," ACM Trans. Graph., vol. 29, no. 1, pp. 10-21, Jan./Feb. 2009.
-
(2009)
ACM Trans. Graph
, vol.29
, Issue.1
, pp. 10-21
-
-
Seiler, L.1
Carmean, D.2
Sprangle, E.3
Forsyth, T.4
Dubey, P.5
Junkins, S.6
Lake, A.7
Cavin, R.8
Espasa, R.9
Grochowski, E.10
Juan, T.11
Abrash, M.12
Sugerman, J.13
Hanrahan, P.14
-
7
-
-
33646009337
-
Optimizing compiler for the CELL processor
-
Sept
-
A. E. Eichenberger, K. O'Brien, K. O'Brien, P. Wu, T. Chen, P. H. Oden, D. A. Prener, J. C. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, and M. Gschwind, "Optimizing compiler for the CELL processor," in Proc. 14th Int. Conf. Parallel Architectures and Compilation Techniques, Sept. 2005, pp. 161-172.
-
(2005)
Proc. 14th Int. Conf. Parallel Architectures and Compilation Techniques
, pp. 161-172
-
-
Eichenberger, A.E.1
O'Brien, K.2
O'Brien, K.3
Wu, P.4
Chen, T.5
Oden, P.H.6
Prener, D.A.7
Shepherd, J.C.8
So, B.9
Sura, Z.10
Wang, A.11
Zhang, T.12
Zhao, P.13
Gschwind, M.14
-
8
-
-
57749168614
-
Uncovering hidden loop level parallelism in sequential applications
-
Feb
-
H. Zhong, M. Mehrara, S. Lieberman, and S. Mahlke, "Uncovering hidden loop level parallelism in sequential applications," in Proc. 14th Int. Symp. High-Performance Computer Architecture, Feb. 2008, pp. 290-301.
-
(2008)
Proc. 14th Int. Symp. High-Performance Computer Architecture
, pp. 290-301
-
-
Zhong, H.1
Mehrara, M.2
Lieberman, S.3
Mahlke, S.4
-
9
-
-
84959045524
-
StreamIt: A language for streaming applications
-
W. Thies, M. Karczmarek, and S. P. Amarasinghe, "StreamIt: A language for streaming applications," in Proc. 2002 Int. Conf. Compiler Construction, 2002, pp. 179-196.
-
(2002)
Proc. 2002 Int. Conf. Compiler Construction
, pp. 179-196
-
-
Thies, W.1
Karczmarek, M.2
Amarasinghe, S.P.3
-
10
-
-
0026278958
-
The Omega test: A fast and practical integer programming algorithm for dependence analysis
-
W. Pugh, "The Omega test: A fast and practical integer programming algorithm for dependence analysis," in Proc. 1991 ACM/IEEE Conf. Supercomputing, 1991, pp. 4-13.
-
(1991)
Proc. 1991 ACM/IEEE Conf. Supercomputing
, pp. 4-13
-
-
Pugh, W.1
-
11
-
-
0017922490
-
The CRAY-1 computer system
-
R. M. Russell, "The CRAY-1 computer system," Commun. ACM, vol. 21, no. 1, pp. 63-72, 1978.
-
(1978)
Commun. ACM
, vol.21
, Issue.1
, pp. 63-72
-
-
Russell, R.M.1
-
12
-
-
33749375700
-
Automatic thread extraction with decoupled software pipelining
-
Nov
-
G. Ottoni, R. Rangan, A. Stoler, and D. I. August, "Automatic thread extraction with decoupled software pipelining," in Proc. 38th IEEE/ACM Int. Symp. Microarchitecture, Nov. 2005, pp. 105-116.
-
(2005)
Proc. 38th IEEE/ACM Int. Symp. Microarchitecture
, pp. 105-116
-
-
Ottoni, G.1
Rangan, R.2
Stoler, A.3
August, D.I.4