-
1
-
-
77953999937
-
Parallel, GPU-based construction of space filling curves and octrees
-
poster
-
P. Ajmera, R. Goradia, S. Chandran, and S. Aluru. Fast, parallel, GPU-based construction of space filling curves and octrees. In Proc. Symp. Interactive 3D Graphics (I3D), 2008. (poster).
-
(2008)
Proc. Symp. Interactive 3D Graphics (I3D)
-
-
Ajmera, P.1
Goradia, R.2
Chandran, S.3
Aluru. Fast, S.4
-
3
-
-
35348871275
-
Hybrid MPI-thread parallelization of the fast multipole method
-
Hagenberg, Austria
-
O. Coulaud, P. Fortin, and J. Roman. Hybrid MPI-thread parallelization of the fast multipole method. In Proc. IS- PDC, Hagenberg, Austria, 2007.
-
(2007)
Proc. IS- PDC
-
-
Coulaud, O.1
Fortin, P.2
Roman., J.3
-
4
-
-
20744449792
-
The design and implementation of FFTW3
-
M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. IEEE, 93, 2005.
-
(2005)
Proc. IEEE
, vol.93
-
-
Frigo, M.1
Johnson., S.G.2
-
5
-
-
0000396658
-
A fast algorithm for particle simulations
-
L. Greengard and V. Rokhlin. A fast algorithm for particle simulations. J. Comp. Phys., 73, 1987.
-
(1987)
J. Comp. Phys.
, vol.73
-
-
Greengard, L.1
Rokhlin., V.2
-
6
-
-
48149107858
-
Fast multipole methods on graphics processors
-
N. A. Gumerov and R. Duraiswami. Fast multipole methods on graphics processors. J. Comp. Phys., 227:8290-8313, 2008.
-
(2008)
J. Comp. Phys.
, vol.227
, pp. 8290-8313
-
-
Gumerov, N.A.1
Duraiswami., R.2
-
7
-
-
18844402673
-
Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods
-
B. Hariharan and S. Aluru. Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods. Par. Co., 31(3-4):311-331, 2005.
-
(2005)
Par. Co.
, vol.31
, Issue.3-4
, pp. 311-331
-
-
Hariharan, B.1
Aluru., S.2
-
8
-
-
19944419779
-
Massively parallel implementation of a fast multipole method for distributed memory machines
-
J. Kurzak and B. M. Pettitt. Massively parallel implementation of a fast multipole method for distributed memory machines. J. Par. Distrib. Comput., 65:870-881, 2005.
-
(2005)
J. Par. Distrib. Comput.
, vol.65
, pp. 870-881
-
-
Kurzak, J.1
Pettitt., B.M.2
-
9
-
-
77954011785
-
A massively parallel adaptive fast multipole method on heterogeneous architectures
-
Nov., (to appear)
-
I. Lashuk, A. Chandramowlishwaran, H. Langston, T.-A. Nguyen, R. Sampath, A. Shringarpure, R. Vuduc, L. Ying, D. Zorin, and G. Biros. A massively parallel adaptive fast multipole method on heterogeneous architectures. In Proc. SC, Nov. 2009. (to appear).
-
(2009)
Proc. SC
-
-
Lashuk, I.1
Chandramowlishwaran, A.2
Langston, H.3
Nguyen, T.-A.4
Sampath, R.5
Shringarpure, A.6
Vuduc, R.7
Ying, L.8
Zorin, D.9
Biros, G.10
-
10
-
-
33751225374
-
Performance tuning of n-body codes on modern microprocessors: I. Direct integration with a Hermite scheme on ×86-64 architecture
-
arXiv:astro-ph/0511062v1
-
K. Nitadori, J. Makino, and P. Hut. Performance tuning of n-body codes on modern microprocessors: I. Direct integration with a Hermite scheme on ×86-64 architecture. New Astron., 12:169-181, 2006. arXiv:astro- ph/0511062v1.
-
(2006)
New Astron
, vol.12
, pp. 169-181
-
-
Nitadori, K.1
Makino, J.2
Hut., P.3
-
11
-
-
0038825209
-
Scalable and portable implementation of the fast multipole method on parallel comptuers
-
July
-
S. Ogata, T. J. Campbell, R. K. Kalia, A. Nakano, P. Vashishta, and S. Vemparala. Scalable and portable implementation of the fast multipole method on parallel comptuers. Computer Phys. Comm., 153(3):445-461, July 2003.
-
(2003)
Computer Phys. Comm.
, vol.153
, Issue.3
, pp. 445-461
-
-
Ogata, S.1
Campbell, T.J.2
Kalia, R.K.3
Nakano, A.4
Vashishta, P.5
Vemparala, S.6
-
12
-
-
79960575885
-
Adapting a message-driven parallel application to GPU-accelerated clusters
-
J. C. Phillips, J. E. Stone, and K. Schulten. Adapting a message-driven parallel application to GPU-accelerated clusters. In Proc. SC, 2008.
-
(2008)
Proc. SC
-
-
Phillips, J.C.1
Stone, J.E.2
Schulten., K.3
-
13
-
-
55349088898
-
Bottom-up construction and 2:1 balance refinement of linear octrees in parallel
-
H. Sundar, R. S. Sampath, and G. Biros. Bottom-up construction and 2:1 balance refinement of linear octrees in parallel. SIAM J. Sci. Comput., 30(5):2675-2708, 2008.
-
(2008)
SIAM J. Sci. Comput.
, vol.30
, Issue.5
, pp. 2675-2708
-
-
Sundar, H.1
Sampath, R.S.2
Biros., G.3
-
14
-
-
0027747808
-
A parallel hashed octtree n-body algorithm
-
M. S. Warren and J. K. Salmon. A parallel hashed octtree n-body algorithm. In Proc. SC, 1993.
-
(1993)
Proc. SC
-
-
Warren, M.S.1
Salmon., J.K.2
-
16
-
-
2442446356
-
A kernel-independent adaptive fast multipole method in two and three dimensions
-
May
-
L. Ying, D. Zorin, and G. Biros. A kernel-independent adaptive fast multipole method in two and three dimensions. J. Comp. Phys., 196:591-626, May 2004.
-
(2004)
J. Comp. Phys.
, vol.196
, pp. 591-626
-
-
Ying, L.1
Zorin, D.2
Biros., G.3
-
17
-
-
8344272049
-
Array regrouping and structure splitting using whole-program reference affinity
-
May
-
Y. Zhong, M. Orlovich, X. Shen, and C. Ding. Array regrouping and structure splitting using whole-program reference affinity. ACM SIGPLAN Notices, 39(6):255-266, May 2004.
-
(2004)
ACM SIGPLAN Notices
, vol.39
, Issue.6
, pp. 255-266
-
-
Zhong, Y.1
Orlovich, M.2
Shen, X.3
Ding., C.4
|