-
1
-
-
35648995516
-
The landscape of parallel computing research: A view from Berkeley
-
University of California, Berkeley
-
K. Asanovic, R. Bodik, B. Catanzaro, et al. The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-2183, EECS, University of California, Berkeley, 2006.
-
(2006)
Technical Report UCB/EECS-2006-2183, EECS
-
-
Asanovic, K.1
Bodik, R.2
Catanzaro, B.3
-
3
-
-
67650673172
-
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
-
08, Austin, Texas
-
K. Datta, M. Murphy, V. Volkov, et al. Stencil Computation Optimization and Auto-Tuning on State-of-the-art Multicore Architectures. In Proceedings of SC '08, Austin, Texas, 2008.
-
(2008)
Proceedings of SC
-
-
Datta, K.1
Murphy, M.2
Volkov, V.3
-
5
-
-
59749100826
-
Optimization and performance modeling of stencil computations on modern microprocessors
-
Kaushik Datta, Shoaib Kamil, Samuel Williams, Leonid Oliker, John Shalf, and Katherine Yelick. Optimization and performance modeling of stencil computations on modern microprocessors. SIAM Review, 51(1):129-159, 2009.
-
(2009)
SIAM Review
, vol.51
, Issue.1
, pp. 129-159
-
-
Datta, K.1
Kamil, S.2
Williams, S.3
Oliker, L.4
Shalf, J.5
Katherine, Yelick.6
-
7
-
-
57349139452
-
A practical automatic polyhedral parallelizer and locality optimizer
-
Bondhugula et al. A practical automatic polyhedral parallelizer and locality optimizer. SIGPLAN Not., 43(6):101-113, 2008.
-
(2008)
SIGPLAN Not.
, vol.43
, Issue.6
, pp. 101-113
-
-
Bondhugula1
-
9
-
-
0348209599
-
A fast fourier transform compiler
-
Matteo Frigo. A fast fourier transform compiler. SIGPLAN Not., 34(5):169-180, 1999.
-
(1999)
SIGPLAN Not.
, vol.34
, Issue.5
, pp. 169-180
-
-
Matteo, Frigo.1
-
11
-
-
77953968816
-
-
GreenFlash. http://www.lbl.gov/CS/html/greenflash.html.
-
GreenFlash
-
-
-
12
-
-
0000631097
-
Numerical integration of the shallow-water equations of a twisted icosahedral grid. part i: Basic design and results of tests
-
R. Heikes and D.A. Randall. Numerical integration of the shallow-water equations of a twisted icosahedral grid. part i: basic design and results of tests. Mon. Wea. Rev., 123:1862-1880, 1995.
-
(1995)
Mon. Wea. Rev.
, vol.123
, pp. 1862-1880
-
-
Heikes, R.1
Randall, D.A.2
-
13
-
-
0024903997
-
Evaluating associativity in CPU caches
-
M. D. Hill and A. J. Smith. Evaluating Associativity in CPU Caches. IEEE Trans. Comput., 38(12):1612-1630, 1989.
-
(1989)
IEEE Trans. Comput.
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith., A.J.2
-
14
-
-
78249252490
-
A generalized framework for auto-tuning stencil computations
-
S. Kamil, C. Chan, S. Williams, et al. A generalized framework for auto-tuning stencil computations. In Cray User Group, 2009.
-
(2009)
Cray User Group
-
-
Kamil, S.1
Chan, C.2
Williams, S.3
-
15
-
-
34547500808
-
Implicit and explicit optimizations for stencil computations
-
San Jose, CA
-
S. Kamil, K. Datta, S. Williams, et al. Implicit and explicit optimizations for stencil computations. In Workshop Memory Systems Performance and Correctness, San Jose, CA, 2006.
-
(2006)
Workshop Memory Systems Performance and Correctness
-
-
Kamil, S.1
Datta, K.2
Williams, S.3
-
19
-
-
84947551600
-
-
Springer
-
S. Mitra, S. C. Kothari, J. Cho, and A. Krishnaswamy. ParAgent: A Domain-Specific Semi-automatic Parallelization Tool, pages 141-148. Springer, 2000.
-
(2000)
ParAgent: A Domain-Specific Semi-automatic Parallelization Tool, Pages
, pp. 141-148
-
-
Mitra, S.1
Kothari, S.C.2
Cho, J.3
Krishnaswamy, A.4
-
20
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms. Proceedings of the IEEE special issue on
-
M. Puschel, J. Moura, J. Johnson, et al. SPIRAL: Code generation for DSP transforms. Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", 93(2):232-275, 2005.
-
(2005)
Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
, pp. 232-275
-
-
Puschel, M.1
Moura, J.2
Johnson, J.3
-
25
-
-
0343462141
-
Automated Empirical Optimization of Software and the ATLAS project
-
R. C. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimization of Software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra., J.3
-
26
-
-
67650797544
-
Roofline: An insightful visual performance model for floating-point programs and multicore architectures
-
April
-
S. Williams, A. Watterman, and D. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Communications of the ACM, April 2009.
-
(2009)
Communications of the ACM
-
-
Williams, S.1
Watterman, A.2
Patterson, D.3
|