-
1
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
November
-
Fatahalian, K., Knight, T., Houston, M., Erez, M., Horn, D., Leem, L., Park, H., Ren, M., Aiken, A., Dally, W., Hanrahan, P.: Sequoia: Programming the Memory Hierarchy. In: Proceedings of Supercomputing 2006 (November 2006)
-
(2006)
Proceedings of Supercomputing 2006
-
-
Fatahalian, K.1
Knight, T.2
Houston, M.3
Erez, M.4
Horn, D.5
Leem, L.6
Park, H.7
Ren, M.8
Aiken, A.9
Dally, W.10
Hanrahan, P.11
-
2
-
-
77954395858
-
Hierarchical place trees: A portable abstraction for task parallelism and data movement
-
Gao, G.R., Pollock, L.L., Cavazos, J., Li, X. (eds.), LCPC 2009, Springer, Heidelberg
-
Yan, Y., Zhao, J., Guo, Y., Sarkar, V.: Hierarchical place trees: A portable abstraction for task parallelism and data movement. In: Gao, G.R., Pollock, L.L., Cavazos, J., Li, X. (eds.) LCPC 2009. LNCS, vol. 5898, pp. 172-187. Springer, Heidelberg (2010)
-
(2010)
LNCS
, vol.5898
, pp. 172-187
-
-
Yan, Y.1
Zhao, J.2
Guo, Y.3
Sarkar, V.4
-
3
-
-
76749086882
-
Programming for parallelism and locality with hierarchically tiled arrays
-
New York, USA, March
-
Bikshandi, G., Guo, J., Hoeflinger, D., Almasi, G., Fraguela, B.B., Garzarán, M.J., Padua, D., von Praun, C.: Programming for parallelism and locality with hierarchically tiled arrays. In: PPoPP, New York, USA, March 29-31 (2006)
-
(2006)
PPoPP
-
-
Bikshandi, G.1
Guo, J.2
Hoeflinger, D.3
Almasi, G.4
Fraguela, B.B.5
Garzarán, M.J.6
Padua, D.7
Von Praun, C.8
-
4
-
-
70350625706
-
Performance without pain = productivity: Data layout and collective communication in UPC
-
Nishtala, R., Almasi, G., Cascaval, C.: Performance without pain = productivity: data layout and collective communication in UPC. In: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2008 (2008)
-
(2008)
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2008
-
-
Nishtala, R.1
Almasi, G.2
Cascaval, C.3
-
5
-
-
58449127539
-
CUDA-lite: Reducing GPU programming complexity
-
Amaral, J.N. (ed.), LCPC 2008, Springer, Heidelberg
-
Ueng, S., Lathara, M., Baghsorkhi, S.S., Hwu, W.W.: CUDA-lite: Reducing GPU programming complexity. In: Amaral, J.N. (ed.) LCPC 2008. LNCS, vol. 5335, pp. 1-15. Springer, Heidelberg (2008)
-
(2008)
LNCS
, vol.5335
, pp. 1-15
-
-
Ueng, S.1
Lathara, M.2
Baghsorkhi, S.S.3
Hwu, W.W.4
-
7
-
-
70350678845
-
JCUDA: A programmer-friendly interface for accelerating java programs with CUDA
-
Sips, H., Epema, D., Lin, H.-X. (eds.), Euro-Par 2009, Springer, Heidelberg
-
Yan, Y., et al.: JCUDA: a Programmer-Friendly Interface for Accelerating Java Programs with CUDA. In: Sips, H., Epema, D., Lin, H.-X. (eds.) Euro-Par 2009. LNCS, vol. 5704, pp. 887-899. Springer, Heidelberg (2009)
-
(2009)
LNCS
, vol.5704
, pp. 887-899
-
-
Yan, Y.1
-
8
-
-
67650081010
-
OpenMP to GPGPU: A compiler framework for automatic translation and optimization
-
February
-
Lee, S., Min, S.-J., Eigenmann, R.: OpenMP to GPGPU: a compiler framework for automatic translation and optimization. In: ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pp. 101-110 (February 2009)
-
(2009)
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
, pp. 101-110
-
-
Lee, S.1
Min, S.-J.2
Eigenmann, R.3
-
10
-
-
79952576354
-
-
http://www.caps-entreprise.com/fr/page/index.php?id=49&p-p=36
-
-
-
-
12
-
-
77954691442
-
A GPGPU Compiler for memory optimization and parallelism management
-
June
-
Yang, Y., Xiang, P., Kong, J., Zhou, H.: A GPGPU Compiler for Memory Optimization and Parallelism Management. In: The ACM SIGPLAN 2010 Conference on Programming Language Design and Implementation, PLDI 2010 (June 2010)
-
(2010)
The ACM SIGPLAN 2010 Conference on Programming Language Design and Implementation, PLDI 2010
-
-
Yang, Y.1
Xiang, P.2
Kong, J.3
Zhou, H.4
-
13
-
-
79952598285
-
A UPC specification extension proposal for hierarchical parallelism
-
Virginia, USA, October
-
Serres, O., Kayi, A., Anbar, A., El-Ghazawi, T.: A UPC Specification Extension Proposal for Hierarchical Parallelism. In: The 3rd Conference on Partitioned Global Address Space Programming Models, Virginia, USA (October 2009)
-
(2009)
The 3rd Conference on Partitioned Global Address Space Programming Models
-
-
Serres, O.1
Kayi, A.2
Anbar, A.3
El-Ghazawi, T.4
-
15
-
-
33746070421
-
Shared memory programming for large scale machines
-
Ottawa, Ontario, Canada, June
-
Barton, C., Caşcaval, C., Almási, G., Zheng, Y., Farreras, M., Chatterjee, S., Amaral, J.N.: Shared memory programming for large scale machines. In: Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, Ottawa, Ontario, Canada, June 11-14 (2006)
-
(2006)
Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation
-
-
Barton, C.1
Caşcaval, C.2
Almási, G.3
Zheng, Y.4
Farreras, M.5
Chatterjee, S.6
Amaral, J.N.7
-
16
-
-
33745219957
-
A performance analysis of the Berkeley UPC compiler
-
San Francisco, CA, USA, June
-
Husbands, P., Iancu, C., Yelick, K.: A performance analysis of the Berkeley UPC compiler. In: Proceedings of the 17th Annual International Conference on Supercomputing, San Francisco, CA, USA, June 23-26 (2003)
-
(2003)
Proceedings of the 17th Annual International Conference on Supercomputing
-
-
Husbands, P.1
Iancu, C.2
Yelick, K.3
-
17
-
-
79952586457
-
-
Bauer, M., Clark, J., Schkufza, E., Aiken, A.: Sequoia++ User Manual, http://sequoia.stanford.edu/
-
Sequoia++ User Manual
-
-
Bauer, M.1
Clark, J.2
Schkufza, E.3
Aiken, A.4