-
1
-
-
0034449842
-
Dynamo: A transparent dynamic optimization system
-
ACM
-
Bala, V., Duesterwald, E., Banerjia, S.: Dynamo: a transparent dynamic optimization system. In: PLDI '00. ACM (2000)
-
(2000)
PLDI '00
-
-
Bala, V.1
Duesterwald, E.2
Banerjia, S.3
-
2
-
-
57349139452
-
A practical automatic polyhedral parallelizer and locality optimizer
-
ACM
-
Bondhugula, U., Hartono, A., Ramanujam, J., Sadayappan, P.: A practical automatic polyhedral parallelizer and locality optimizer. In: PLDI '08. ACM (2008)
-
(2008)
PLDI '08
-
-
Bondhugula, U.1
Hartono, A.2
Ramanujam, J.3
Sadayappan, P.4
-
3
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
IEEE
-
Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Lee, S.H., Skadron, K.: Rodinia: a benchmark suite for heterogeneous computing. In: IISWC, pp. 44-54. IEEE (2009)
-
(2009)
IISWC
, pp. 44-54
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Lee, S.H.6
Skadron, K.7
-
4
-
-
1842852979
-
Bringing skeletons out of the closet: A pragmatic manifesto for skeletal parallel programming
-
Cole, M.: Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming. Parallel Comput. 30(3), 389-406 (2004)
-
(2004)
Parallel Comput
, vol.30
, Issue.3
, pp. 389-406
-
-
Cole, M.1
-
6
-
-
84902266622
-
-
http://www.ice.rwth-aachen.de/research/tools-projects/entry/detail/dspstone/
-
-
-
-
7
-
-
84858394473
-
Adapting the polyhedral model as a framework for efficient speculative parallelization
-
Jimborean, A., Clauss, P., Pradelle, B., Mastrangelo, L., Loechner, V.: Adapting the polyhedral model as a framework for efficient speculative parallelization. In: PPoPP '12 (2012)
-
(2012)
PPoPP '12
-
-
Jimborean, A.1
Clauss, P.2
Pradelle, B.3
Mastrangelo, L.4
Loechner, V.5
-
8
-
-
84859129532
-
VMAD: An advanced dynamic program analysis and instrumentation framework
-
O'Boyle, M. (ed.) Springer, Berlin, Heidelberg
-
Jimborean, A., Mastrangelo, L., Loechner, V., Clauss, P.: VMAD: an advanced dynamic program analysis and instrumentation framework. In: O'Boyle, M. (ed.) Compiler Construction, Lecture Notes in Computer Science, vol. 7210, pp. 220-239. Springer, Berlin, Heidelberg (2012)
-
(2012)
Compiler Construction, Lecture Notes in Computer Science
, vol.7210
, pp. 220-239
-
-
Jimborean, A.1
Mastrangelo, L.2
Loechner, V.3
Clauss, P.4
-
10
-
-
34748838390
-
Speculative thread decomposition through empirical optimization
-
ACM
-
Johnson, T.A., Eigenmann, R., Vijaykumar, T.N.: Speculative thread decomposition through empirical optimization. In: PPoPP '07. ACM (2007)
-
(2007)
PPoPP '07
-
-
Johnson, T.A.1
Eigenmann, R.2
Vijaykumar, T.N.3
-
11
-
-
58149242194
-
Improving performance of optimized kernels through fast instantiations of templates
-
Khan, M.A., Charles, H.P., Barthou, D.: Improving performance of optimized kernels through fast instantiations of templates. Concurr. Comput. Pract. Exp. 21(1), 59-70 (2009)
-
(2009)
Concurr. Comput. Pract. Exp
, vol.21
, Issue.1
, pp. 59-70
-
-
Khan, M.A.1
Charles, H.P.2
Barthou, D.3
-
12
-
-
84863500114
-
Automatic speculative doall for clusters
-
ACM
-
Kim, H., Johnson, N.P., Lee, J.W., Mahlke, S.A., August, D.I.: Automatic speculative doall for clusters. In: CGO '12. ACM (2012)
-
(2012)
CGO '12
-
-
Kim, H.1
Johnson, N.P.2
Lee, J.W.3
Mahlke, S.A.4
August, D.I.5
-
13
-
-
85121161438
-
-
ACM Trans. Archit. Code Optim
-
Kotzmann, T., Wimmer, C., Mössenböck, H., Rodriguez, T., Russell, K., Cox, D.: Design of the Java HotSpot client compiler for Java 6. ACM Trans. Archit. Code Optim. 5, 7-32 (2008)
-
(2008)
Design of the Java Hotspot Client Compiler for Java 6
, vol.5
, pp. 7-32
-
-
Kotzmann, T.1
Wimmer, C.2
Mössenböck, H.3
Rodriguez, T.4
Russell, K.5
Cox, D.6
-
14
-
-
84870753907
-
Implementation of data-parallel skeletons: A case study using a coarse-grained hierarchical model
-
Li, C., Gava, F., Hains, G.: Implementation of data-parallel skeletons: a case study using a coarse-grained hierarchical model. In: ISPDC, pp. 26-33 (2012)
-
(2012)
ISPDC
, pp. 26-33
-
-
Li, C.1
Gava, F.2
Hains, G.3
-
15
-
-
33751033680
-
POSH: A TLS compiler that exploits program structure
-
Liu, W., Tuck, J., Ceze, L., Ahn, W., Strauss, K., Renau, J., Torrellas, J.: POSH: a TLS compiler that exploits program structure. In: PPoPP '06. ACM (2006)
-
(2006)
PPoPP '06. ACM
-
-
Liu, W.1
Tuck, J.2
Ceze, L.3
Ahn, W.4
Strauss, K.5
Renau, J.6
Torrellas, J.7
-
17
-
-
0031640968
-
Automatic, template-based run-time specialization: Implementation and experimental study
-
IEEE Computer Society Press
-
Noël, F., Hornof, L., Consel, C., Lawall, J.L.: Automatic, template-based run-time specialization: implementation and experimental study. In: International Conference on Computer Languages. IEEE Computer Society Press (1998)
-
(1998)
International Conference on Computer Languages
-
-
Noël, F.1
Hornof, L.2
Consel, C.3
Lawall, J.L.4
-
18
-
-
84858786637
-
Introducing 'Bones': A parallelizing source-to-source compiler based on algorithmic skeletons
-
ACM, New York, NY, USA, doi:10.1145/2159430.2159431
-
Nugteren, C., Corporaal, H.: Introducing 'Bones': a parallelizing source-to-source compiler based on algorithmic skeletons. In: Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, GPGPU-5, pp. 1-10. ACM, New York, NY, USA (2012). doi:10.1145/2159430.2159431
-
(2012)
Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, GPGPU-5
, pp. 1-10
-
-
Nugteren, C.1
Corporaal, H.2
-
19
-
-
84902246303
-
-
Polybenchs. (2010). http://www-rocq.inria.fr/pouchet/software/polybenchs
-
(2010)
Polybenchs
-
-
-
20
-
-
79952033338
-
Loop transformations: Convexity, pruning and optimization
-
ACM
-
Pouchet, L.N., Bondhugula, U., Bastoul, C., Cohen, A., Ramanujam, J., Sadayappan, P., Vasilache, N.: Loop transformations: convexity, pruning and optimization. In: POPL '11. ACM (2011)
-
(2011)
POPL '11
-
-
Pouchet, L.N.1
Bondhugula, U.2
Bastoul, C.3
Cohen, A.4
Ramanujam, J.5
Sadayappan, P.6
Vasilache, N.7
-
22
-
-
0037702458
-
Using thread-level speculation to simplify manual parallelization
-
ACM
-
Prabhu, M.K., Olukotun, K.: Using thread-level speculation to simplify manual parallelization. In: PPoPP '03. ACM (2003)
-
(2003)
PPoPP '03
-
-
Prabhu, M.K.1
Olukotun, K.2
-
23
-
-
43449123064
-
Spice: Speculative parallel iteration chunk execution
-
ACM
-
Raman, E., Vachharajani, N., Rangan, R., August, D.I.: Spice: speculative parallel iteration chunk execution. In: CGO '08. ACM (2008)
-
(2008)
CGO '08
-
-
Raman, E.1
Vachharajani, N.2
Rangan, R.3
August, D.I.4
-
24
-
-
84946439752
-
The LRPD test: Speculative run-time parallelization of loops with privatization and reduction parallelization
-
ACM
-
Rauchwerger, L., Padua, D.: The LRPD test: speculative run-time parallelization of loops with privatization and reduction parallelization. In: PLDI '95. ACM (1995)
-
(1995)
PLDI '95
-
-
Rauchwerger, L.1
Padua, D.2
-
25
-
-
84902268269
-
-
Rosetta Codes. (2011). http://rosettacode.org/wiki/Rosetta-Code
-
(2011)
-
-
-
27
-
-
0037514204
-
Compiling for template-based run-time code generation
-
DOI 10.1017/S095679680200463X
-
Smith, F., Grossman, D., Morrisett, G., Hornof, L., Jim, T.: Compiling for template-based run-time code generation. J. Funct. Program. 13(3), 677-708 (2003)
-
(2003)
Journal of Functional Programming
, vol.13
, Issue.3
, pp. 677-708
-
-
Smith, F.1
Grossman, D.2
Morrisett, G.3
Hornof, L.4
Jim, T.5