-
1
-
-
26344468007
-
A model for collisional processes in gases I: Small amplitude processes in charged and neutral one-component systems
-
P. Bhatnagar, E. Gross, and M. Krook. A model for collisional processes in gases I: small amplitude processes in charged and neutral one-component systems. Phys. Rev., 94:511, 1954.
-
(1954)
Phys. Rev.
, vol.94
, pp. 511
-
-
Bhatnagar, P.1
Gross, E.2
Krook, M.3
-
3
-
-
84899683182
-
Magnetohydrodynamic turbulence simulations on the earth simulator using the lattice Boltzmann method
-
Seattle, WA
-
J. Carter, M. Soe, L. Oliker, Y. Tsuda, G. Vahala, L. Vahala, and A. Macnab. Magnetohydrodynamic turbulence simulations on the earth simulator using the lattice Boltzmann method. In SC05, Seattle, WA, 2005.
-
(2005)
SC05
-
-
Carter, J.1
Soe, M.2
Oliker, L.3
Tsuda, Y.4
Vahala, G.5
Vahala, L.6
Macnab, A.7
-
4
-
-
77953980209
-
Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures
-
Atlanta, Georgia
-
A. Chandramowlishwaran, S. Williams, L. Oliker, I. Lashuk, G. Biros, and R. Vuduc. Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures. In Interational Conference on Parallel and Distributed Computing Systems (IPDPS), Atlanta, Georgia, 2010.
-
(2010)
Interational Conference on Parallel and Distributed Computing Systems (IPDPS)
-
-
Chandramowlishwaran, A.1
Williams, S.2
Oliker, L.3
Lashuk, I.4
Biros, G.5
Vuduc, R.6
-
6
-
-
59749100826
-
Optimization and performance modeling of stencil computations on modern microprocessors
-
K. Datta, S. Kamil, S. Williams, L. Oliker, J. Shalf, and K. A. Yelick. Optimization and performance modeling of stencil computations on modern microprocessors. SIAM Review, 51(1):129-159, 2009.
-
(2009)
SIAM Review
, vol.51
, Issue.1
, pp. 129-159
-
-
Datta, K.1
Kamil, S.2
Williams, S.3
Oliker, L.4
Shalf, J.5
Yelick, K.A.6
-
7
-
-
70350771127
-
Stencil computation optimization and autotuning on state-of-the-art multicore architectures
-
nov
-
K. Datta, M. Murphy, V. Volkov, S. Williams, J. Carter, L. Oliker, D. Patterson, J. Shalf, and K. Yelick. Stencil computation optimization and autotuning on state-of-the-art multicore architectures. In Proc. SC2008: High performance computing, networking, and storage conference, nov 2008.
-
(2008)
Proc. SC2008: High Performance Computing, Networking, and Storage Conference
-
-
Datta, K.1
Murphy, M.2
Volkov, V.3
Williams, S.4
Carter, J.5
Oliker, L.6
Patterson, D.7
Shalf, J.8
Yelick, K.9
-
8
-
-
84971423310
-
Auto-tuning the 27-point stencil for multicore
-
K. Datta, S. Williams, V. Volkov, J. Carter, L. Oliker, J. Shalf, and K. Yelick. Auto-tuning the 27-point stencil for multicore. In In Proc. iWAPT2009: The Fourth International Workshop on Automatic Performance Tuning, 2009.
-
(2009)
Proc. IWAPT2009: The Fourth International Workshop on Automatic Performance Tuning
-
-
Datta, K.1
Williams, S.2
Volkov, V.3
Carter, J.4
Oliker, L.5
Shalf, J.6
Yelick, K.7
-
9
-
-
0037054259
-
Lattice kinetic schemes for magnetohydrodynamics
-
P. Dellar. Lattice kinetic schemes for magnetohydrodynamics. J. Comput. Phys., 79, 2002.
-
(2002)
J. Comput. Phys.
, vol.79
-
-
Dellar, P.1
-
12
-
-
77954022347
-
An auto-tuning framework for parallel multicore stencil computations
-
Atlanta, Georgia
-
S. Kamil, C. Chan, L. Oliker, J. Shalf, and S. Williams. An auto-tuning framework for parallel multicore stencil computations. In Interational Conference on Parallel and Distributed Computing Systems (IPDPS), Atlanta, Georgia, 2010.
-
(2010)
Interational Conference on Parallel and Distributed Computing Systems (IPDPS)
-
-
Kamil, S.1
Chan, C.2
Oliker, L.3
Shalf, J.4
Williams, S.5
-
13
-
-
84958661690
-
Impact of modern memory subsystems on cache optimizations for stencil computations
-
ACM
-
S. Kamil, P. Husbands, L. Oliker, J. Shalf, and K. Yelick. Impact of modern memory subsystems on cache optimizations for stencil computations. In Memory Systen Performance, pages 36-43. ACM, 2005.
-
(2005)
Memory Systen Performance
, pp. 36-43
-
-
Kamil, S.1
Husbands, P.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
14
-
-
33645446819
-
Lattice Boltzmann model for dissipative MHD
-
Montreux, Switzerland, June 17-21
-
A. Macnab, G. Vahala, L. Vahala, and P. Pavlo. Lattice Boltzmann model for dissipative MHD. In Proc. 29th EPS Conference on Controlled Fusion and Plasma Physics, volume 26B, Montreux, Switzerland, June 17-21, 2002.
-
(2002)
Proc. 29th EPS Conference on Controlled Fusion and Plasma Physics
, vol.26 B
-
-
Macnab, A.1
Vahala, G.2
Vahala, L.3
Pavlo, P.4
-
15
-
-
74049134929
-
Memory-efficient optimization of gyrokinetic particle-to-grid interpolation for multicore processors
-
K. Madduri, S. Williams, S. Ethier, L. Oliker, J. Shalf, E. Strohmaier, and K. Yelick. Memory-efficient optimization of gyrokinetic particle-to-grid interpolation for multicore processors. In Proc. SC2009: High performance computing, networking, and storage conference, 2009.
-
(2009)
Proc. SC2009: High Performance Computing, Networking, and Storage Conference
-
-
Madduri, K.1
Williams, S.2
Ethier, S.3
Oliker, L.4
Shalf, J.5
Strohmaier, E.6
Yelick, K.7
-
18
-
-
74049146136
-
Minimizing communication in sparse matrix solvers
-
M. Mohiyuddin, M. Hoemmen, J. Demmel, and K. Yelick. Minimizing communication in sparse matrix solvers. In Proc. SC2009: High performance computing, networking, and storage conference, 2009. http://dx.doi.org/10.1145/ 1654059.1654096.
-
(2009)
Proc. SC2009: High Performance Computing, Networking, and Storage Conference
-
-
Mohiyuddin, M.1
Hoemmen, M.2
Demmel, J.3
Yelick, K.4
-
19
-
-
78650806116
-
3.5-D blocking optimization for stencil computations on modern CPUs and GPUs
-
Washington, DC, USA, IEEE Computer Society
-
A. Nguyen, N. Satish, J. Chhugani, C. Kim, and P. Dubey. 3.5-D blocking optimization for stencil computations on modern CPUs and GPUs. In Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC'10, pages 1-13, Washington, DC, USA, 2010. IEEE Computer Society.
-
(2010)
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC'10
, pp. 1-13
-
-
Nguyen, A.1
Satish, N.2
Chhugani, J.3
Kim, C.4
Dubey, P.5
-
21
-
-
42749090414
-
Progress in lattice Boltzmann methods for magnetohydrodynamic ows relevant to fusion applications
-
M. Pattison, K. Premnath, N. Morley, and M. Abdou. Progress in lattice Boltzmann methods for magnetohydrodynamic ows relevant to fusion applications. Fusion Eng. Des., 83:557-572, 2008.
-
(2008)
Fusion Eng. Des.
, vol.83
, pp. 557-572
-
-
Pattison, M.1
Premnath, K.2
Morley, N.3
Abdou, M.4
-
22
-
-
1242352441
-
Optimization and profiling of the cache performance of parallel lattice Boltzmann codes
-
T. Pohl, M. Kowarschik, J. Wilke, K. Iglberger, and U. Rüde. Optimization and profiling of the cache performance of parallel lattice Boltzmann codes. Parallel Processing Letters, 13(4):S:549, 2003.
-
(2003)
Parallel Processing Letters
, vol.13
, Issue.4
, pp. 549
-
-
Pohl, T.1
Kowarschik, M.2
Wilke, J.3
Iglberger, K.4
Rüde, U.5
-
28
-
-
70449657442
-
Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization
-
G. Wellein, G. Hager, T. Zeiser, M. Wittmann, and H. Fehske. Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization. In International Computer Software and Applications Conference, pages 579-586, 2009.
-
(2009)
International Computer Software and Applications Conference
, pp. 579-586
-
-
Wellein, G.1
Hager, G.2
Zeiser, T.3
Wittmann, M.4
Fehske, H.5
-
29
-
-
33646809359
-
On the single processor performance of simple lattice Boltzmann kernels
-
Nov. ISSN 0045-7930
-
G. Wellein, T. Zeiser, G. Hager, and S. Donath. On the single processor performance of simple lattice Boltzmann kernels. computers & fluids, 35(8-9):910-919, Nov. 2006. ISSN 0045-7930.
-
(2006)
Computers & Fluids
, vol.35
, Issue.8-9
, pp. 910-919
-
-
Wellein, G.1
Zeiser, T.2
Hager, G.3
Donath, S.4
-
30
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
DOI 10.1016/S0167-8191(00)00087-9
-
R. C. Whaley, A. Petitet, and J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001. (Pubitemid 32264775)
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Clint Whaley, R.1
Petitet, A.2
Dongarra, J.J.3
-
32
-
-
51049106193
-
Lattice Boltzmann simulation optimization on leading multicore platforms
-
S. Williams, J. Carter, L. Oliker, J. Shalf, and K. Yelick. Lattice Boltzmann simulation optimization on leading multicore platforms. In International Parallel & Distributed Processing Symposium, 2008.
-
(2008)
International Parallel & Distributed Processing Symposium
-
-
Williams, S.1
Carter, J.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
33
-
-
67650998701
-
Lattice Boltzmann simulation optimization on leading multicore platforms
-
S. Williams, J. Carter, L. Oliker, J. Shalf, and K. Yelick. Lattice Boltzmann simulation optimization on leading multicore platforms. Journal of Parallel and Distributed Computing, 69(9):762-777, 2009.
-
(2009)
Journal of Parallel and Distributed Computing
, vol.69
, Issue.9
, pp. 762-777
-
-
Williams, S.1
Carter, J.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
34
-
-
83155177858
-
Resource-efficient, hierarchical auto-tuning of a hybrid lattice Boltzmann computation on the Cray XT4
-
S. Williams, J. Carter, L. Oliker, J. Shalf, and K. Yelick. Resource-efficient, hierarchical auto-tuning of a hybrid lattice Boltzmann computation on the Cray XT4. In Proc. CUG09: Cray User Group meeting, 2009.
-
(2009)
Proc. CUG09: Cray User Group Meeting
-
-
Williams, S.1
Carter, J.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
35
-
-
56749158843
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, and J. Demmel. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In Proc. SC2007: High performance computing, networking, and storage conference, 2007.
-
(2007)
Proc. SC2007: High Performance Computing, Networking, and Storage Conference
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
36
-
-
67650797544
-
Roofline: An insightful visual performance model for floating-point programs and multicore architectures
-
April
-
S. Williams, A. Watterman, and D. Patterson. Roofline: An insightful visual performance model for floating-point programs and multicore architectures. Communications of the ACM, April 2009.
-
(2009)
Communications of the ACM
-
-
Williams, S.1
Watterman, A.2
Patterson, D.3
-
37
-
-
0000331979
-
Lattice Boltzmann method for 3D flows with curved boundary
-
D. Yu, R. Mei, W. Shyy, and L. Luo. Lattice Boltzmann method for 3D flows with curved boundary. Journal of Comp. Physics, 161:680-699, 2000.
-
(2000)
Journal of Comp. Physics
, vol.161
, pp. 680-699
-
-
Yu, D.1
Mei, R.2
Shyy, W.3
Luo, L.4
-
38
-
-
73849092882
-
Benchmark analysis and application results for lattice Boltzmann simulations on NEC SXvector and Intel Nehalemsystems
-
T. Zeiser, G. Hager, and G. Wellein. Benchmark analysis and application results for lattice Boltzmann simulations on NEC SXvector and Intel Nehalemsystems. Parallel Processing Letters, 19(4):491-511, 2009.
-
(2009)
Parallel Processing Letters
, vol.19
, Issue.4
, pp. 491-511
-
-
Zeiser, T.1
Hager, G.2
Wellein, G.3
-
39
-
-
56349170328
-
Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method
-
T. Zeiser, G. Wellein, A. Nitsure, K. Iglberger, U. Rude, and G. Hager. Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method. Progress in Computational Fluid Dynamics, 8, 2008.
-
(2008)
Progress in Computational Fluid Dynamics
, vol.8
-
-
Zeiser, T.1
Wellein, G.2
Nitsure, A.3
Iglberger, K.4
Rude, U.5
Hager, G.6
|