-
1
-
-
67650797545
-
-
K. Asanovic, R. Bodik, B. Catanzaro, et al., The landscape of parallel computing research: A view from Berkeley, Technical Report UCB/EECS-2006-183, EECS, University of California, Berkeley, 2006
-
K. Asanovic, R. Bodik, B. Catanzaro, et al., The landscape of parallel computing research: A view from Berkeley, Technical Report UCB/EECS-2006-183, EECS, University of California, Berkeley, 2006
-
-
-
-
2
-
-
26344468007
-
A model for collisional processes in gases I: Small amplitude processes in charged and neutral one-component systems
-
Bhatnagar P., Gross E., and Krook M. A model for collisional processes in gases I: Small amplitude processes in charged and neutral one-component systems. Phys. Rev. 94 (1954) 511
-
(1954)
Phys. Rev.
, vol.94
, pp. 511
-
-
Bhatnagar, P.1
Gross, E.2
Krook, M.3
-
4
-
-
84899683182
-
Magnetohydrodynamic turbulence simulations on the Earth Simulator using the lattice Boltzmann method
-
J. Carter, M. Soe, L. Oliker, Y. Tsuda, G. Vahala, L. Vahala, A. Macnab, Magnetohydrodynamic turbulence simulations on the Earth Simulator using the lattice Boltzmann method, in: Proc. SC2005: High Performance Computing, Networking, and Storage Conference, 2005
-
(2005)
Proc. SC2005: High Performance Computing, Networking, and Storage Conference
-
-
Carter, J.1
Soe, M.2
Oliker, L.3
Tsuda, Y.4
Vahala, G.5
Vahala, L.6
Macnab, A.7
-
5
-
-
0037054259
-
Lattice kinetic schemes for magnetohydrodynamics
-
Dellar P. Lattice kinetic schemes for magnetohydrodynamics. J. Comput. Phys. 179 (2002)
-
(2002)
J. Comput. Phys.
, vol.179
-
-
Dellar, P.1
-
6
-
-
32844463802
-
Cache oblivious stencil computations
-
M. Frigo, V. Strumpen, Cache oblivious stencil computations, in: Proceedings of the 19th ACM International Conference on Supercomputing, ICS05, 2005, pp. 361-366
-
(2005)
Proceedings of the 19th ACM International Conference on Supercomputing, ICS05
, pp. 361-366
-
-
Frigo, M.1
Strumpen, V.2
-
7
-
-
34247376580
-
Chip multiprocessing and the cell broadband engine
-
New York, NY, USA
-
M. Gschwind, Chip multiprocessing and the cell broadband engine, in: CF '06: Computing Frontiers, New York, NY, USA, 2006, pp. 1-8
-
(2006)
CF '06: Computing Frontiers
, pp. 1-8
-
-
Gschwind, M.1
-
8
-
-
0024903997
-
Evaluating associativity in CPU caches
-
Hill M.D., and Smith A.J. Evaluating associativity in CPU caches. IEEE Trans. Comput. 38 12 (1989) 1612-1630
-
(1989)
IEEE Trans. Comput.
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
9
-
-
34547500808
-
Implicit and explicit optimizations for stencil computations
-
S. Kamil, K. Datta, S. Williams, L. Oliker, J. Shalf, K. Yelick, Implicit and explicit optimizations for stencil computations, in: Memory Systems Performance and Correctness, MSPC, 2006, pp. 51-60
-
(2006)
Memory Systems Performance and Correctness, MSPC
, pp. 51-60
-
-
Kamil, S.1
Datta, K.2
Williams, S.3
Oliker, L.4
Shalf, J.5
Yelick, K.6
-
10
-
-
33645446819
-
Lattice Boltzmann model for dissipative MHD
-
Montreux, Switzerland, June 17-21
-
A. Macnab, G. Vahala, L. Vahala, P. Pavlo, Lattice Boltzmann model for dissipative MHD, in: Proc. 29th EPS Conference on Controlled Fusion and Plasma Physics, vol. 26B, Montreux, Switzerland, June 17-21, 2002
-
(2002)
Proc. 29th EPS Conference on Controlled Fusion and Plasma Physics
, vol.26B
-
-
Macnab, A.1
Vahala, G.2
Vahala, L.3
Pavlo, P.4
-
12
-
-
24144484694
-
A performance evaluation of the Cray X1 for scientific applications
-
Valencia, Spain, June 28-30
-
L. Oliker, R. Biswas, J. Borrill, A. Canning, J. Carter, et al., A performance evaluation of the Cray X1 for scientific applications, in: VECPAR: 6th International Meeting on High Performance Computing for Computational Science, Valencia, Spain, June 28-30, 2004, pp. 51-65
-
(2004)
VECPAR: 6th International Meeting on High Performance Computing for Computational Science
, pp. 51-65
-
-
Oliker, L.1
Biswas, R.2
Borrill, J.3
Canning, A.4
Carter, J.5
-
13
-
-
33845468287
-
Leading computational methods on scalar and vector HEC platforms
-
Seattle, WA
-
L. Oliker, J. Carter, M. Wehner, A. Canning, S. Ethier, et al., Leading computational methods on scalar and vector HEC platforms, in: Proc. SC2005: High Performance Computing, Networking, and Storage Conference, Seattle, WA, 2005, p. 62
-
(2005)
Proc. SC2005: High Performance Computing, Networking, and Storage Conference
, pp. 62
-
-
Oliker, L.1
Carter, J.2
Wehner, M.3
Canning, A.4
Ethier, S.5
-
14
-
-
67650800575
-
-
OpenMP, 1997. http://openmp.org
-
(1997)
-
-
-
16
-
-
1242352441
-
Optimization and profiling of the cache performance of parallel lattice Boltzmann codes
-
Pohl T., Kowarschik M., Wilke J., Iglberger K., and Rüde U. Optimization and profiling of the cache performance of parallel lattice Boltzmann codes. Parallel Process. Lett. 13 4 (2003) 549
-
(2003)
Parallel Process. Lett.
, vol.13
, Issue.4
-
-
Pohl, T.1
Kowarschik, M.2
Wilke, J.3
Iglberger, K.4
Rüde, U.5
-
18
-
-
33646924323
-
Microarchitectures for systems on a chip in small process geometries
-
Sylvester D., and Keutzer K. Microarchitectures for systems on a chip in small process geometries. Proc. IEEE Apr. (2001) 467-489
-
(2001)
Proc. IEEE
, Issue.Apr
, pp. 467-489
-
-
Sylvester, D.1
Keutzer, K.2
-
19
-
-
67650845618
-
-
The IEEE and The Open Group, The Open Group Base Specifications Issue 6
-
The IEEE and The Open Group, The Open Group Base Specifications Issue 6, 2004
-
(2004)
-
-
-
20
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels, in: Proc. of SciDAC 2005
-
Vuduc R., Demmel J., and Yelick K. OSKI: A library of automatically tuned sparse matrix kernels, in: Proc. of SciDAC 2005. J. Phys.: Conf. Ser. June (2005) 521-530
-
(2005)
J. Phys.: Conf. Ser.
, Issue.June
, pp. 521-530
-
-
Vuduc, R.1
Demmel, J.2
Yelick, K.3
-
21
-
-
33646809359
-
On the single processor performance of simple lattice Boltzmann kernels
-
Wellein G., Zeiser T., Donath S., and Hager G. On the single processor performance of simple lattice Boltzmann kernels. Comput. Fluids 35 910 (2005)
-
(2005)
Comput. Fluids
, vol.35
, Issue.910
-
-
Wellein, G.1
Zeiser, T.2
Donath, S.3
Hager, G.4
-
22
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Whaley R.C., Petitet A., and Dongarra J. Automated empirical optimization of software and the ATLAS project. Parallel Comput. 27 1-2 (2001) 3-35
-
(2001)
Parallel Comput.
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.3
-
23
-
-
65649090648
-
-
Ph.D. Thesis, EECS Department, University of California, Berkeley, December
-
S. Williams, Auto-tuning performance on multicore computers, Ph.D. Thesis, EECS Department, University of California, Berkeley, December 2008
-
(2008)
Auto-tuning performance on multicore computers
-
-
Williams, S.1
-
24
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, Optimization of sparse matrix-vector multiplication on emerging multicore platforms, in: Proc. SC2007: High Performance Computing, Networking, and Storage Conference, 2007, pp. 1-12
-
(2007)
Proc. SC2007: High Performance Computing, Networking, and Storage Conference
, pp. 1-12
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
25
-
-
68949198052
-
The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures
-
August
-
S. Williams, D. Patterson, L. Oliker, J. Shalf, K. Yelick, The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures, in: IEEE HotChips Symposium on High-Performance Chips, HotChips 2008, August 2008
-
(2008)
IEEE HotChips Symposium on High-Performance Chips, HotChips
-
-
Williams, S.1
Patterson, D.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
26
-
-
67650797544
-
Roofline: An insightful visual performance model for floating-point programs and multicore architectures
-
Williams S., Waterman A., and Patterson D. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Commun. ACM April (2009)
-
(2009)
Commun. ACM
, Issue.April
-
-
Williams, S.1
Waterman, A.2
Patterson, D.3
-
27
-
-
0000331979
-
Lattice Boltzmann method for 3D flows with curved boundary
-
Yu D., Mei R., Shyy W., and Luo L. Lattice Boltzmann method for 3D flows with curved boundary. J. Comput. Phys. 161 (2000) 680-699
-
(2000)
J. Comput. Phys.
, vol.161
, pp. 680-699
-
-
Yu, D.1
Mei, R.2
Shyy, W.3
Luo, L.4
-
28
-
-
51049100538
-
Introducing a parallel cache-oblivious blocking approach for the lattice Boltzmann method
-
T. Zeiser, G. Wellein, G. Hager, A. Nitsure, K. Iglberger, G. Hager, Introducing a parallel cache-oblivious blocking approach for the lattice Boltzmann method, in: ICMMES-2006 Proceedings, 2006, pp. 179-188
-
(2006)
ICMMES-2006 Proceedings
, pp. 179-188
-
-
Zeiser, T.1
Wellein, G.2
Hager, G.3
Nitsure, A.4
Iglberger, K.5
Hager, G.6