-
1
-
-
56849104962
-
A Novel Asynchronous Software Cache Implementation for the Cell-BE Processor
-
J. Balart, M. Gonzalez, X. Martorell, E. Ayguade, Z. Sura, T. Chen, T. Zhang, K. O'brien, and K. O'brien. A Novel Asynchronous Software Cache Implementation for the Cell-BE Processor. In LCPC'07: Proceedings of the 2007 Workshop on Languages and Compilers for Parallel Computing, 2007.
-
(2007)
LCPC'07: Proceedings of the 2007 Workshop on Languages and Compilers for Parallel Computing
-
-
Balart, J.1
Gonzalez, M.2
Martorell, X.3
Ayguade, E.4
Sura, Z.5
Chen, T.6
Zhang, T.7
O'brien, K.8
O'brien, K.9
-
2
-
-
42649120679
-
Ray Tracing on the Cell Processor
-
Sept
-
C. Benlhin, I. Wald, M. Scherbaum, and H. Friedrich. Ray Tracing on the Cell Processor. IEEE Symposium on Interactive Ray Tracing 2006, pages 15-23, Sept. 2006.
-
(2006)
IEEE Symposium on Interactive Ray Tracing 2006
, pp. 15-23
-
-
Benlhin, C.1
Wald, I.2
Scherbaum, M.3
Friedrich, H.4
-
3
-
-
0033683314
-
Application-specific memory management for embedded systems using software-controlled caches
-
New York, NY, USA, ACM
-
D. Chiou, P. Jain, L. Rudolph, and S. Devadas. Application-specific memory management for embedded systems using software-controlled caches. In DAC'00: Proceedings of the 37th Conference on Design Automation, pages 416-419, New York, NY, USA, 2000. ACM.
-
(2000)
DAC'00: Proceedings of the 37th Conference on Design Automation
, pp. 416-419
-
-
Chiou, D.1
Jain, P.2
Rudolph, L.3
Devadas, S.4
-
4
-
-
33646009337
-
Optimizing Compiler for the CELL Processor
-
Washington, DC, USA, IEEE Computer Society
-
A. E. Eiehenberger, K. O'Brien, K. O'Brien, P. Wu, T. Chen, P. H. Oden, D. A. Prener, J. C. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, and M. Gschwind. Optimizing Compiler for the CELL Processor. In PACT'05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques, pages 161-172, Washington, DC, USA, 2005. IEEE Computer Society.
-
(2005)
PACT'05: Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
, pp. 161-172
-
-
Eiehenberger, A.E.1
O'Brien, K.2
O'Brien, K.3
Wu, P.4
Chen, T.5
Oden, P.H.6
Prener, D.A.7
Shepherd, J.C.8
So, B.9
Sura, Z.10
Wang, A.11
Zhang, T.12
Zhao, P.13
Gschwind, M.14
-
5
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
K. Fatahalian, D. R. Horn, T. J. Knight, L. Leem, M. Houston, J. Y. Park, M. Erez, M. Ren, A. Aiken, W. J. Dally, and P. Hanrahan. Sequoia: Programming the memory hierarchy. In SC'06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, page 83, 2006.
-
(2006)
SC'06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
, pp. 83
-
-
Fatahalian, K.1
Horn, D.R.2
Knight, T.J.3
Leem, L.4
Houston, M.5
Park, J.Y.6
Erez, M.7
Ren, M.8
Aiken, A.9
Dally, W.J.10
Hanrahan, P.11
-
6
-
-
78651269052
-
Understanding the efficiency of GPU algorithms for matrix-matrix multiplication
-
Aug
-
K. Fatahalian, J. Sugerman, and P. Hanrahan. Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In Graphics Hardware 2004, pages 133-138, Aug. 2004.
-
(2004)
Graphics Hardware 2004
, pp. 133-138
-
-
Fatahalian, K.1
Sugerman, J.2
Hanrahan, P.3
-
7
-
-
0038558013
-
Exact genetic linkage computations for general pedigrees
-
M. Fishelson and D. Geiger. Exact genetic linkage computations for general pedigrees. Bioinformatics, 18(Suppl. 1):S189-S198. 2002.
-
(2002)
Bioinformatics
, vol.18
, Issue.SUPPL. 1
-
-
Fishelson, M.1
Geiger, D.2
-
8
-
-
34548292052
-
A memory model for scientific algorithms on graphics processors
-
Nov
-
N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. In Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, page 89, Nov. 2006.
-
(2006)
Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
, pp. 89
-
-
Govindaraju, N.K.1
Larsen, S.2
Gray, J.3
Manocha, D.4
-
10
-
-
34547500808
-
Implicit and explicit optimizations for stencil computations
-
New York, NY, USA, ACM
-
S. Kamil, K. Datta, S. Williams, L. Oliker, J. Shalf, and K. Yelick. Implicit and explicit optimizations for stencil computations. In MSPC'06: Proceedings of the 2006 Workshop on Memory System Performance and Correctness, pages 51-60, New York, NY, USA, 2006. ACM.
-
(2006)
MSPC'06: Proceedings of the 2006 Workshop on Memory System Performance and Correctness
, pp. 51-60
-
-
Kamil, S.1
Datta, K.2
Williams, S.3
Oliker, L.4
Shalf, J.5
Yelick, K.6
-
11
-
-
57349190835
-
Fast and small short vector SIMD matrix multiplication kernel for the synergistic processing element of the CELL processor
-
University of Tennessee
-
J. Kurzak, W. Alvaro, and J. Dongarra. Fast and small short vector SIMD matrix multiplication kernel for the synergistic processing element of the CELL processor. Technical Report LAPACK Working Note 189, University of Tennessee. 2007.
-
(2007)
Technical Report LAPACK Working Note
, vol.189
-
-
Kurzak, J.1
Alvaro, W.2
Dongarra, J.3
-
13
-
-
33947588048
-
A survey of general-purpose computation on graphics hardware
-
J. D. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Krüger, A. E. Lefohn, and T. J. Purcell. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum, 26(1):80-113, 2007.
-
(2007)
Computer Graphics Forum
, vol.26
, Issue.1
, pp. 80-113
-
-
Owens, J.D.1
Luebke, D.2
Govindaraju, N.3
Harris, M.4
Krüger, J.5
Lefohn, A.E.6
Purcell, T.J.7
-
16
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
R. Whaley, A. Petitet, and J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27:3-35, 2001.
-
(2001)
Parallel Computing
, vol.27
, pp. 3-35
-
-
Whaley, R.1
Petitet, A.2
Dongarra, J.3
|