-
3
-
-
63549095070
-
The PARSEC benchmark suite: Characterization and architectural implications
-
ACM, October
-
C. Bienia, S. Kumar, J. P. Singh, and K. Li. The PARSEC benchmark suite: Characterization and architectural implications. In PACT '08: Proceedings of the 17th international conference on Parallel architectures and compilation techniques, pages 72-81. ACM, October 2008.
-
(2008)
PACT '08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 72-81
-
-
Bienia, C.1
Kumar, S.2
Singh, J.P.3
Li, K.4
-
4
-
-
84947585636
-
The SPMD model: Past, present and future
-
January
-
F. Darema. The SPMD Model: Past, Present and Future. Lecture Notes in Computer Science, 2131(1):1-1, January 2001.
-
(2001)
Lecture Notes in Computer Science
, vol.2131
, Issue.1
, pp. 1-1
-
-
Darema, F.1
-
5
-
-
57349092386
-
CUBA: An architecture for efficient CPU/co-processor data communication
-
ACM, June
-
I. Gelado, J. H. Kelm, S. Ryoo, S. S. Lumetta, N. Navarro, and W.- m. W. Hwu. CUBA: An architecture for efficient CPU/co-processor data communication. In ICS '08: Proceedings of the 22nd annual international conference on Supercomputing, pages 299-308. ACM, June 2008.
-
(2008)
ICS '08: Proceedings of the 22nd annual international conference on Supercomputing
, pp. 299-308
-
-
Gelado, I.1
Kelm, J.H.2
Ryoo, S.3
Lumetta, S.S.4
Navarro, N.5
Hwu, W.-M.W.6
-
6
-
-
78149276036
-
Twin peaks: A software platform for heterogeneous computing on general-purpose and graphics processors
-
ACM
-
J. Gummaraju, L. Morichetti, M. Houston, B. Sander, B. R. Gaster, and B. Zheng. Twin peaks: A software platform for heterogeneous computing on general-purpose and graphics processors. In PACT '10: Proceedings of the 19th international conference on Parallel architectures and compilation techniques, pages 205-216. ACM, 2010.
-
(2010)
PACT '10: Proceedings of the 19th international conference on Parallel architectures and compilation techniques
, pp. 205-216
-
-
Gummaraju, J.1
Morichetti, L.2
Houston, M.3
Sander, B.4
Gaster, B.R.5
Zheng, B.6
-
7
-
-
77952342828
-
-
Khronos OpenCLWorking Group., Khronos Group
-
Khronos OpenCLWorking Group. The OpenCL Specification Version 1.0. Khronos Group, 2009. http://www.khronos.org/opencl.
-
(2009)
The OpenCL Specification Version 1.0
-
-
-
8
-
-
77951157944
-
-
Morgan Kaufmann Publishers Inc., San Francisco, CA, USA. ISBN 0123814723 9780123814722
-
D. B. Kirk and W.-m. W. Hwu. Programming Massively Parallel Processors: A Hands-on Approach. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2010. ISBN 0123814723, 9780123814722.
-
(2010)
Programming Massively Parallel Processors: A Hands-on Approach
-
-
Kirk, D.B.1
Hwu, W.-M.W.2
-
9
-
-
3042658703
-
LLVM: A compilation framework for lifelong program analysis & transformation
-
Washington, DC, USA March, IEEE Computer Society
-
C. Lattner and V. Adve. LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation. In CGO '04: Proceedings of the international symposium on Code generation and optimization, pages 75-86, Washington, DC, USA, March 2004. IEEE Computer Society.
-
(2004)
CGO '04: Proceedings of the International Symposium on Code Generation and Optimization
, pp. 75-86
-
-
Lattner, C.1
Adve, V.2
-
10
-
-
78149255519
-
An OpenCL framework for heterogeneous multicores with local memory
-
ACM
-
J. Lee, J. Kim, S. Seo, S. Kim, J. Park, H. Kim, T. T. Dao, Y. Cho, S. J. Seo, S. H. Lee, S. M. Cho, H. J. Song, S.-B. Suh, and J.-D. Choi. An OpenCL framework for heterogeneous multicores with local memory. In PACT '10: Proceedings of the 19th international conference on Parallel architectures and compilation techniques, pages 193-204. ACM, 2010.
-
(2010)
PACT '10: Proceedings of the 19th international conference on Parallel Architectures and Compilation Techniques
, pp. 193-204
-
-
Lee, J.1
Kim, J.2
Seo, S.3
Kim, S.4
Park, J.5
Kim, H.6
Dao, T.T.7
Cho, Y.8
Seo, S.J.9
Lee, S.H.10
Cho, S.M.11
Song, H.J.12
Suh, S.-B.13
Choi, J.-D.14
-
11
-
-
0003502903
-
-
Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, ISBN 1-55860-320-4
-
S. S. Muchnick. Advanced compiler design and implementation. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1997. ISBN 1-55860-320-4.
-
(1997)
Advanced Compiler Design and Implementation.
-
-
Muchnick, S.S.1
-
12
-
-
80053992838
-
-
NASA Advanced Supercomputing Division. NAS Parallel Benchmarks.
-
NASA Advanced Supercomputing Division. NAS Parallel Benchmarks. http://www.nas.nasa.gov/Resources/Software/npb. html.
-
-
-
-
13
-
-
79952776981
-
-
NVIDIA
-
NVIDIA Fermi Compute Architecture White Paper. NVIDIA, 2009. http://www.nvidia.com/content/PDF/fermi-white- papers/NVIDIA-Fermi-Compute- Architecture-Whitepaper. pdf.
-
(2009)
NVIDIA Fermi Compute Architecture White Paper
-
-
-
16
-
-
79952804884
-
-
NVIDIA, July
-
NVIDIA CUDA Zone. NVIDIA, July 2010. http://www.nvidia. com/object/cuda-home-new.html.
-
(2010)
NVIDIA CUDA Zone.
-
-
-
19
-
-
70350754499
-
Adapting a messagedriven parallel application to GPU-accelerated clusters
-
Piscataway, NJ, USA, November . IEEE Press
-
J. C. Phillips, J. E. Stone, and K. Schulten. Adapting a messagedriven parallel application to GPU-accelerated clusters. In SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, pages 1-9, Piscataway, NJ, USA, November 2008. IEEE Press.
-
(2008)
SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing
, pp. 1-9
-
-
Phillips, J.C.1
Stone, J.E.2
Schulten, K.3
-
20
-
-
67650021816
-
Solving dense linear systems on platforms with multiple hardware accelerators
-
ACM
-
G. Quintana-Ortí, F. D. Igual, E. S. Quintana-Ortí, and R. A. van de Geijn. Solving dense linear systems on platforms with multiple hardware accelerators. In PPoPP '09: Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 121-130. ACM, 2009.
-
(2009)
PPoPP '09: Proceedings of the 14th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
, pp. 121-130
-
-
Quintana-Ortí, G.1
Igual, F.D.2
Quintana-Ortí, E.S.3
Van De Geijn, R.A.4
-
21
-
-
77749280360
-
The LOFAR correlator: Implementation and performance analysis
-
ACM
-
J. W. Romein, P. C. Broekema, J. D. Mol, and R. V. van Nieuwpoort. The LOFAR correlator: Implementation and performance analysis. In PPoPP '10: Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 169-178. ACM, 2010.
-
(2010)
PPoPP '10: Proceedings of the 15th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
, pp. 169-178
-
-
Romein, J.W.1
Broekema, P.C.2
Mol, J.D.3
Van Nieuwpoort, R.V.4
-
24
-
-
80054001882
-
-
The IMPACT Research Group. Parboil Benchmark suite
-
The IMPACT Research Group. Parboil Benchmark suite. http://impact.crhc. illinois.edu/parboil.php, 2009.
-
(2009)
-
-
-
25
-
-
0004096330
-
-
Technical report, Amsterdam, The Netherlands, The Netherlands
-
F. Tip. A Survey of Program Slicing Techniques. Technical report, Amsterdam, The Netherlands, The Netherlands, 1994.
-
(1994)
A Survey of Program Slicing Techniques
-
-
Tip, F.1
-
26
-
-
70350771131
-
Benchmarking gpus to tune dense linear algebra
-
Piscataway, NJ, USA, IEEE Press
-
V. Volkov and J.W. Demmel. Benchmarking gpus to tune dense linear algebra. In SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, pages 1-11, Piscataway, NJ, USA, 2008. IEEE Press.
-
(2008)
SC '08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing
, pp. 1-11
-
-
Volkov, V.1
Demmel, J.W.2
-
27
-
-
85050273691
-
Program slicing
-
Piscataway, NJ, USA, IEEE Press
-
M. Weiser. Program Slicing. In ICSE '81: Proceedings of the 5th International Conference on Software Engineering, pages 439-449, Piscataway, NJ, USA, 1981. IEEE Press.
-
(1981)
ICSE '81: Proceedings of the 5th International Conference on Software Engineering
, pp. 439-449
-
-
Weiser, M.1
-
28
-
-
78649488776
-
Adaptive optimization for petascale heterogeneous CPU/GPU computing
-
Los Alamitos, CA, USA, IEEE Computer Society
-
C. Yang, F. Wang, Y. Du, J. Chen, J. Liu, H. Yi, and K. Lu. Adaptive Optimization for Petascale Heterogeneous CPU/GPU Computing. In IEEE Cluster '10: Proceedings of IEEE International Conference on Cluster Computing, pages 19-28, Los Alamitos, CA, USA, 2010. IEEE Computer Society.
-
(2010)
IEEE Cluster '10: Proceedings of IEEE International Conference on Cluster Computing
, pp. 19-28
-
-
Yang, C.1
Wang, F.2
Du, Y.3
Chen, J.4
Liu, J.5
Yi, H.6
Lu, K.7
-
29
-
-
77957600490
-
A GPGPU compiler for memory optimization and parallelism management
-
ACM, June
-
Y. Yang, P. Xiang, J. Kong, and H. Zhou. A GPGPU compiler for memory optimization and parallelism management. In PLDI '10: Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation, pages 86-97. ACM, June 2010.
-
(2010)
PLDI '10: Proceedings of the 2010 ACM SIGPLAN conference on Programming Language Design and Implementation
, pp. 86-97
-
-
Yang, Y.1
Xiang, P.2
Kong, J.3
Zhou, H.4
|