SCOPUS 정보 검색 플랫폼

Proceedings of the 2010 IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010

Volumn , Issue , 2010, Pages

Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures

(6) Chandramowlishwaran, Aparna a,b Williams, Samuel a Oliker, Leonid a Lashuk, Ilya b Biros, Georgec b Vuduc, Richard b

a LAWRENCE BERKELEY NATIONAL LABORATORY (United States)

b Georgia Institute of Technology (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BARCELONA; FAST MULTIPOLE METHOD; MULTI-CORE SYSTEMS; MULTICORE ARCHITECTURES; NUMERICAL APPROXIMATIONS; PARALLELIZATIONS; PERFORMANCE ENHANCEMENTS; POWER EFFICIENCY; SINGLE-NODE PERFORMANCE;

DATA STRUCTURES; DISTRIBUTED PARAMETER NETWORKS; MICROPROCESSOR CHIPS; OPTIMIZATION; SOFTWARE ARCHITECTURE;

TUNING;

EID: 77953980209 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2010.5470415 Document Type: Conference Paper

Times cited : (40)

References (17)

1
- 77953999937
- Parallel, GPU-based construction of space filling curves and octrees
- poster
- P. Ajmera, R. Goradia, S. Chandran, and S. Aluru. Fast, parallel, GPU-based construction of space filling curves and octrees. In Proc. Symp. Interactive 3D Graphics (I3D), 2008. (poster).
- (2008) Proc. Symp. Interactive 3D Graphics (I3D)
- Ajmera, P.¹ Goradia, R.² Chandran, S.³ Aluru. Fast, S.⁴

2
- 77951472684
- Direct n-body kernels for multicore platforms
- Sep.
- N. Arora, A. Shringarpure, and R. Vuduc. Direct n-body kernels for multicore platforms. In Proc. Int'l. Conf. Par. Proc. (ICPP), Sep. 2009.
- (2009) Proc. int'L. Conf. Par. Proc. (ICPP)
- Arora, N.¹ Shringarpure, A.² Vuduc., R.³

3
- 35348871275
- Hybrid MPI-thread parallelization of the fast multipole method
- Hagenberg, Austria
- O. Coulaud, P. Fortin, and J. Roman. Hybrid MPI-thread parallelization of the fast multipole method. In Proc. IS- PDC, Hagenberg, Austria, 2007.
- (2007) Proc. IS- PDC
- Coulaud, O.¹ Fortin, P.² Roman., J.³

4
- 20744449792
- The design and implementation of FFTW3
- M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. IEEE, 93, 2005.
- (2005) Proc. IEEE , vol.93
- Frigo, M.¹ Johnson., S.G.²

5
- 0000396658
- A fast algorithm for particle simulations
- L. Greengard and V. Rokhlin. A fast algorithm for particle simulations. J. Comp. Phys., 73, 1987.
- (1987) J. Comp. Phys. , vol.73
- Greengard, L.¹ Rokhlin., V.²

6
- 48149107858
- Fast multipole methods on graphics processors
- N. A. Gumerov and R. Duraiswami. Fast multipole methods on graphics processors. J. Comp. Phys., 227:8290-8313, 2008.
- (2008) J. Comp. Phys. , vol.227 , pp. 8290-8313
- Gumerov, N.A.¹ Duraiswami., R.²

7
- 18844402673
- Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods
- B. Hariharan and S. Aluru. Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods. Par. Co., 31(3-4):311-331, 2005.
- (2005) Par. Co. , vol.31 , Issue.3-4 , pp. 311-331
- Hariharan, B.¹ Aluru., S.²

8
- 19944419779
- Massively parallel implementation of a fast multipole method for distributed memory machines
- J. Kurzak and B. M. Pettitt. Massively parallel implementation of a fast multipole method for distributed memory machines. J. Par. Distrib. Comput., 65:870-881, 2005.
- (2005) J. Par. Distrib. Comput. , vol.65 , pp. 870-881
- Kurzak, J.¹ Pettitt., B.M.²

9
- 77954011785
- A massively parallel adaptive fast multipole method on heterogeneous architectures
- Nov., (to appear)
- I. Lashuk, A. Chandramowlishwaran, H. Langston, T.-A. Nguyen, R. Sampath, A. Shringarpure, R. Vuduc, L. Ying, D. Zorin, and G. Biros. A massively parallel adaptive fast multipole method on heterogeneous architectures. In Proc. SC, Nov. 2009. (to appear).
- (2009) Proc. SC
- Lashuk, I.¹ Chandramowlishwaran, A.² Langston, H.³ Nguyen, T.-A.⁴ Sampath, R.⁵ Shringarpure, A.⁶ Vuduc, R.⁷ Ying, L.⁸ Zorin, D.⁹ Biros, G.¹⁰

10
- 33751225374
- Performance tuning of n-body codes on modern microprocessors: I. Direct integration with a Hermite scheme on ×86-64 architecture
- arXiv:astro-ph/0511062v1
- K. Nitadori, J. Makino, and P. Hut. Performance tuning of n-body codes on modern microprocessors: I. Direct integration with a Hermite scheme on ×86-64 architecture. New Astron., 12:169-181, 2006. arXiv:astro- ph/0511062v1.
- (2006) New Astron , vol.12 , pp. 169-181
- Nitadori, K.¹ Makino, J.² Hut., P.³

11
- 0038825209
- Scalable and portable implementation of the fast multipole method on parallel comptuers
- July
- S. Ogata, T. J. Campbell, R. K. Kalia, A. Nakano, P. Vashishta, and S. Vemparala. Scalable and portable implementation of the fast multipole method on parallel comptuers. Computer Phys. Comm., 153(3):445-461, July 2003.
- (2003) Computer Phys. Comm. , vol.153 , Issue.3 , pp. 445-461
- Ogata, S.¹ Campbell, T.J.² Kalia, R.K.³ Nakano, A.⁴ Vashishta, P.⁵ Vemparala, S.⁶

12
- 79960575885
- Adapting a message-driven parallel application to GPU-accelerated clusters
- J. C. Phillips, J. E. Stone, and K. Schulten. Adapting a message-driven parallel application to GPU-accelerated clusters. In Proc. SC, 2008.
- (2008) Proc. SC
- Phillips, J.C.¹ Stone, J.E.² Schulten., K.³

13
- 55349088898
- Bottom-up construction and 2:1 balance refinement of linear octrees in parallel
- H. Sundar, R. S. Sampath, and G. Biros. Bottom-up construction and 2:1 balance refinement of linear octrees in parallel. SIAM J. Sci. Comput., 30(5):2675-2708, 2008.
- (2008) SIAM J. Sci. Comput. , vol.30 , Issue.5 , pp. 2675-2708
- Sundar, H.¹ Sampath, R.S.² Biros., G.³

14
- 0027747808
- A parallel hashed octtree n-body algorithm
- M. S. Warren and J. K. Salmon. A parallel hashed octtree n-body algorithm. In Proc. SC, 1993.
- (1993) Proc. SC
- Warren, M.S.¹ Salmon., J.K.²

15
- 48249149028
- A new parallel kernel-independent fast multipole method
- L. Ying, G. Biros, D. Zorin, and H. Langston. A new parallel kernel-independent fast multipole method. In Proc. SC, 2003.
- (2003) Proc. SC
- Ying, L.¹ Biros, G.² Zorin, D.³ Langston., H.⁴

16
- 2442446356
- A kernel-independent adaptive fast multipole method in two and three dimensions
- May
- L. Ying, D. Zorin, and G. Biros. A kernel-independent adaptive fast multipole method in two and three dimensions. J. Comp. Phys., 196:591-626, May 2004.
- (2004) J. Comp. Phys. , vol.196 , pp. 591-626
- Ying, L.¹ Zorin, D.² Biros., G.³

17
- 8344272049
- Array regrouping and structure splitting using whole-program reference affinity
- May
- Y. Zhong, M. Orlovich, X. Shen, and C. Ding. Array regrouping and structure splitting using whole-program reference affinity. ACM SIGPLAN Notices, 39(6):255-266, May 2004.
- (2004) ACM SIGPLAN Notices , vol.39 , Issue.6 , pp. 255-266
- Zhong, Y.¹ Orlovich, M.² Shen, X.³ Ding., C.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.