1. Chen, T., Raghavan, R., Dale, J., Iwata, E.: Cell Broadband Engine Architecture and its first implementation. IBM Developer Works (November 2005)
2. NVIDIA Corporation: NVIDIA CUDA Compute Unified Device Architecture Version 2.0 (2008)
3. NVIDIA Corporation: NVIDIA Tesla GPU Computing Technical Brief (2008)
4. OpenMP Architecture Review Board: OpenMP Application Program Interface. Version 3.0 (May 2008), http://www.openmp.org
5. Ayguadé, E., Copty, N., Duran, A., Hoeflinger, J., Lin, Y., Massaioli, F., Teruel, X., Unnikrishnan, P., Zhang, G.: The Design of OpenMP Tasks. IEEE Transactions on Parallel and Distributed Systems 20(3), 404-418 (2009)
6. Ayguadé, E., Badia, R.M., Cabrera, D., Duran, A., Gonzalez, M., Igual, F., Jimenez, D., Labarta, J., Martorell, X., Mayo, R., Perez, J.M., Quintana-Orti, E.: A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures. In: Fifth International Workshop on OpenMP, IWOMP (2009)
7. Jin, H., Frumkin, M., Yan, J.: The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance. Technical Report NAS-99-011, NASA Ames Research Center (1999)
8. Kusano, K., Satoh, S., Sato, M.: Performance evaluation of the Omni OpenMP compiler. In: Third International Symposium on High Performance Computing, pp. 403-414 (2000)
9. Ferrer, R., Gonzalez, M., Silla, F., Martorell, X., Ayguadé, E.: Evaluation of Memory Performance on the Cell BE with the SARC Programming Model. In: Proceedings of the 9th Workshop on Memory Performance: Dealing with Applications, systems, and architecture (MEDEA 2008) (October 2008)
11. AMD Corporation: AMD 2007 Technology Analyst Day, http://www2.amd.com/us-en/assets/content-type/DownloadableAssets/FinancialA-DayNewsSummary121307FINAL.pdf
12. Stanford University: BrookGPU, http://graphics.stanford.edu/projects/brookgpu/
13. Stanford University: Brook Language, http://merrimac.stanford.edu/brook/
15. Ayguadé, E., Copty, N., Duran, A., Hoeflinger, J., Lin, Y., Massaioli, F., Su, E., Unnikrishnan, P., Zhang, G.: A Proposal for Task Parallelism in OpenMP. In: Chapman, B., Zheng, W., Gao, G.R., Sato, M., Ayguadé, E., Wang, D. (eds.) IWOMP 2007. LNCS, vol. 4935, pp. 1-12. Springer, Heidelberg (2008)
16. Perez, J.M., Bellens, P., Badia, R.M., Labarta, J.: CellSs: Making it easier to program the Cell Broadband Engine processor. IBM Journal of Research and Development 51(5), 593-604 (2007)
17. Duran, A., Pérez, J.M., Ayguadé, E., Badia, R.M., Labarta, J.: Extending the OpenMP Tasking Model to Allow Dependent Tasks. In: Eigenmann, R., de Supinski, B.R. (eds.) IWOMP 2008. LNCS, vol. 5004, pp. 111-122. Springer, Heidelberg (2008)
18. Dolbeau, R., Bihan, S., Bodin, F.: HMPP: A Hybrid Multi-core Parallel Programming Environment. In: Workshop on General Processing Using GPUs (2006)
19. IBM Corporation: XL C/C++ for Multicore Acceleration (January 2009), http://www-01.ibm.com/software/awdtools/xlcpp/multicore/
20. O'Brien, K., O'Brien, K., Sura, Z., Chen, T., Zhang, T.: Supporting OpenMP on Cell. International Journal of Parallel Programming (2008)
21. Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Sura, Z., Chen, T., Zhang, T., O'Brien, K., O'Brien, K.: A Novel Asynchronous Software Cache Implementation for the CELL/BE Processor. In: Adve, V., Garzarán, M.J., Petersen, P. (eds.) LCPC 2007. LNCS, vol. 5234, pp. 125-140. Springer, Heidelberg (2008)
23. Rafique, M.M., Butt, A.R., Nikolopoulos, D.S.: DMA-based prefetching for I/O-intensive workloads on the Cell architecture. In: CF 2008: Proceedings of the 2008 conference on Computing frontiers, pp. 23-32. ACM, New York (2008)
24. Chen, T., Zhang, T., Sura, Z., Gonzalez, M.: Prefetching irregular references for software cache on Cell. In: CGO 2008: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, pp. 155-164. ACM, New York (2008)
25. Ahmed, M.F., Ammar, R.A., Rajasekaran, S.: SPENK: Adding Another Level of Parallelism on the Cell Broadband Engine. In: IFMT 2008: Proceedings of the 1st international forum on Next-generation multicore/manycore technologies, pp. 1-10. ACM, New York (2008)
26. Beltran, V., Carrera, D., Torres, J., Ayguadé, E.: CellMT: A Cooperative Multi-threading Library for the Cell/B.E. In: HiPC 2009: Proceedings of the 16th Annual IEEE International Conference on High Performance Computing. IEEE Computer Society, Los Alamitos (2009)
27. Weltzer, J., Silha, E., May, C., Frey, B., Furukawa, J., Frazier, G.: PowerPC Architecture Book V. 2.02. IBM Corporation (2005)
30. Intel Corporation: Intel Xeon Processor 5000 Sequence (2009), http://www.intel.com/p/en-US/products/server/processor/xeon5000
31. Balart, J., Gonzalez, M., Martorell, X., Ayguadé, E., Labarta, J.: Runtime Address Space Computation for SDSM Systems. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, vol. 4382, pp. 330-344. Springer, Heidelberg (2007)
32. Chen, T., Sura, Z., O'Brien, K., O'Brien, J.K.: Optimizing the Use of Static Buffers for DMA on a CELL Chip. In: Almási, G.S., Caşcaval, C., Wu, P. (eds.) LCPC 2006. LNCS, vol. 4382, pp. 314-329. Springer, Heidelberg (2007)