SCOPUS Information Search Platform

Annual ACM Symposium on Parallelism in Algorithms and Architectures

2012, Pages 182-184

Brief announcement: Towards a communication optimal Fast Multipole Method and its implications at exascale

Authors (4): Chandramowlishwaran, Aparna (a); Choi, Jee Whan (a); Madduri, Kamesh (b); Vuduc, Richard (a)

a GEORGIA INSTITUTE OF TECHNOLOGY (United States)

b PENNSYLVANIA STATE UNIVERSITY (United States)

Author keywords

Cache complexity analysis; Exascale; Fast Multipole Method; Performance modeling

Indexed keywords

ANALYTICAL PERFORMANCE MODEL; APPROXIMATION ACCURACY; ASYMPTOTICALLY LINEAR; CACHE COMPLEXITY; EXASCALE; FAST MULTIPLE METHOD; FAST MULTIPOLE METHOD; LOWER BOUNDS; MEMORY COST; ON CURRENTS; PARTICLE SYSTEMS; PERFORMANCE MODEL; PERFORMANCE MODELING; SYSTEM CONFIGURATIONS; TUNING PARAMETER; OPTIMIZATION; ALGORITHMS

EID: 84864128713
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/2312005.2312039
Document Type: Conference Paper

Times cited: 13

References (11)

1
- 58449090994
- Provably good multicore cache performance for divide-and-conquer algorithms
- G. E. Blelloch, R. A. Chowdhury, P. B. Gibbons, V. Ramachandran, S. Chen, and M. Kozuch. Provably good multicore cache performance for divide-and-conquer algorithms. In Proc. 19th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '08), pages 501-510, Philadelphia, PA, USA, 2008. SIAM.
- (2008) Proc. 19th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '08) , pp. 501-510
- Blelloch, G.E.; Chowdhury, R.A.; Gibbons, P.B.; Ramachandran, V.; Chen, S.; Kozuch, M.

2
- 77954942935
- Low depth cache-oblivious algorithms
- G. E. Blelloch, P. B. Gibbons, and H. V. Simhadri. Low depth cache-oblivious algorithms. In Proc. ACM Symposium on Parallel Algorithms and Architectures (SPAA), Thira, Santorini, Greece, July 2010.
- (2010) Proc. ACM Symposium on Parallel Algorithms and Architectures (SPAA)
- Blelloch, G.E.; Gibbons, P.B.; Simhadri, H.V.

3
- 0037482408
- The fast multipole algorithm
- J. Board and K. Schulten. The fast multipole algorithm. Computing in Science and Engineering, 2(1):76-79, January/February 2000.
- (2000) Computing in Science and Engineering , vol.2 , Issue.1 , pp. 76-79
- Board, J.; Schulten, K.

4
- 78650822594
- Diagnosis, tuning, and redesign for multicore performance: A case study of the Fast Multipole Method
- A. Chandramowlishwaran, K. Madduri, and R. Vuduc. Diagnosis, tuning, and redesign for multicore performance: A case study of the Fast Multipole Method. In Proc. ACM/IEEE Conf. Supercomputing (SC), New Orleans, LA, USA, November 2010.
- (2010) Proc. ACM/IEEE Conf. Supercomputing (SC)
- Chandramowlishwaran, A.; Madduri, K.; Vuduc, R.

5
- 77953980209
- Optimizing and tuning the Fast Multipole Method for state-of-the-art multicore architectures
- A. Chandramowlishwaran, S. Williams, L. Oliker, I. Lashuk, G. Biros, and R. Vuduc. Optimizing and tuning the Fast Multipole Method for state-of-the-art multicore architectures. In Proc. IEEE Int'l. Parallel and Distributed Processing Symp. (IPDPS), Atlanta, GA, USA, April 2010.
- (2010) Proc. IEEE Int'l. Parallel and Distributed Processing Symp. (IPDPS)
- Chandramowlishwaran, A.; Williams, S.; Oliker, L.; Lashuk, I.; Biros, G.; Vuduc, R.

6
- 0000396658
- A fast algorithm for particle simulations
- L. Greengard and V. Rokhlin. A fast algorithm for particle simulations. J. Comp. Phys., 73:325-348, 1987.
- (1987) J. Comp. Phys. , vol.73 , pp. 325-348
- Greengard, L.; Rokhlin, V.

7
- 74049157044
- A massively parallel adaptive Fast Multipole Method on heterogeneous architectures
- I. Lashuk, A. Chandramowlishwaran, H. Langston, T.-A. Nguyen, R. Sampath, A. Shringarpure, R. Vuduc, L. Ying, D. Zorin, and G. Biros. A massively parallel adaptive Fast Multipole Method on heterogeneous architectures. In Proc. ACM/IEEE Conf. Supercomputing (SC), Portland, OR, USA, November 2009.
- (2009) Proc. ACM/IEEE Conf. Supercomputing (SC)
- Lashuk, I.; Chandramowlishwaran, A.; Langston, H.; Nguyen, T.-A.; Sampath, R.; Shringarpure, A.; Vuduc, R.; Ying, L.; Zorin, D.; Biros, G.

8
- 0018457301
- A separator theorem for planar graphs
- R. Lipton and R. Tarjan. A separator theorem for planar graphs. SIAM Journal on Applied Mathematics, 36(2), 1979.
- (1979) SIAM Journal on Applied Mathematics , vol.36 , Issue.2
- Lipton, R.; Tarjan, R.

9
- 78650814738
- Petascale direct numerical simulation of blood flow on 200k cores and heterogeneous architectures
- A. Rahimian, I. Lashuk, D. Malhotra, A. Chandramowlishwaran, L. Moon, R. Sampath, A. Shringarpure, S. Veerapaneni, J. Vetter, R. Vuduc, D. Zorin, and G. Biros. Petascale direct numerical simulation of blood flow on 200k cores and heterogeneous architectures. In Proc. ACM/IEEE Conf. Supercomputing (SC), New Orleans, LA, USA, November 2010.
- (2010) Proc. ACM/IEEE Conf. Supercomputing (SC)
- Rahimian, A.; Lashuk, I.; Malhotra, D.; Chandramowlishwaran, A.; Moon, L.; Sampath, R.; Shringarpure, A.; Veerapaneni, S.; Vetter, J.; Vuduc, R.; Zorin, D.; Biros, G.

10
- 79961058189
- What GPU computing means for high-end systems
- R. Vuduc and K. Czechowski. What GPU computing means for high-end systems. IEEE Micro, 31(4):74-78, July/August 2011.
- (2011) IEEE Micro , vol.31 , Issue.4 , pp. 74-78
- Vuduc, R.; Czechowski, K.

11
- 2442446356
- A kernel-independent adaptive fast multipole algorithm in two and three dimensions
- DOI: 10.1016/j.jcp.2003.11.021
- L. Ying, D. Zorin, and G. Biros. A kernel-independent adaptive Fast Multipole Method in two and three dimensions. J. Comp. Phys., 196:591-626, May 2004. (Pubitemid 38613258)
- (2004) Journal of Computational Physics , vol.196 , Issue.2 , pp. 591-626
- Ying, L.; Biros, G.; Zorin, D.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.