2. U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan. A practical automatic polyhedral parallelizer and locality optimizer. SIGPLAN Not., 43(6):101-113, 2008.
4. M. Frigo and V. Strumpen. The cache complexity of multithreaded cache oblivious algorithms. In SPAA '06: Proceedings of the Eighteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures, pages 271-280, New York, NY, USA, 2006. ACM.
5. M. A. Frumkin and R. F. Van der Wijngaart. Tight bounds on cache use for stencil operations on rectangular grids. Journal of the ACM, 49(3):434-453, 2002.
6. A. Hartono, M. M. Baskaran, C. Bastoul, A. Cohen, S. Krishnamoorthy, B. Norris, J. Ramanujam, and P. Sadayappan. Parametric multi-level tiling of imperfectly nested loops. In Proceedings of the 23rd International Conference on Supercomputing, pages 147-157, 2009.
8. S. Kamil, C. Chan, L. Oliker, J. Shalf, and S. Williams. An auto-tuning framework for parallel multicore stencil computations. In International Parallel & Distributed Processing Symposium (IPDPS), 2010.
9. S. Kamil, K. Datta, S. Williams, L. Oliker, J. Shalf, and K. Yelick. Implicit and explicit optimizations for stencil computations. In MSPC '06: Proceedings of the 2006 Workshop on Memory System Performance and Correctness, pages 51-60. ACM, 2006.
10. D. Kim, L. Renganarayanan, D. Rostron, S. V. Rajopadhye, and M. M. Strout. Multi-level tiling: M for the price of one. In Proceedings of the ACM/IEEE Conference on Supercomputing, page 51, 2007.
11. S. Krishnamoorthy, M. Baskaran, U. Bondhugula, J. Ramanujam, A. Rountev, and P. Sadayappan. Effective automatic parallelization of stencil computations. SIGPLAN Not., 42(6):235-244, 2007.