-
1
-
-
58449090994
-
Provably good multicore cache performance for divide-and-conquer algorithms
-
Philadelphia, PA, USA, SIAM
-
G. E. Blelloch, R. A. Chowdhury, P. B. Gibbons, V. Ramachandran, S. Chen, and M. Kozuch. Provably good multicore cache performance for divide-and-conquer algorithms. In Proc. 19th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '08), pages 501-510, Philadelphia, PA, USA, 2008. SIAM.
-
(2008)
Proc. 19th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '08)
, pp. 501-510
-
-
Blelloch, G.E.1
Chowdhury, R.A.2
Gibbons, P.B.3
Ramachandran, V.4
Chen, S.5
Kozuch, M.6
-
2
-
-
77954942935
-
Low depth cache-oblivious algorithms
-
Thira, Santorini, Greece, July
-
G. E. Blelloch, P. B. Gibbons, and H. V. Simhadri. Low depth cache-oblivious algorithms. In Proc. ACM Symposium on Parallel Algorithms and Architectures (SPAA), Thira, Santorini, Greece, July 2010.
-
(2010)
Proc. ACM Symposium on Parallel Algorithms and Architectures (SPAA)
-
-
Blelloch, G.E.1
Gibbons, P.B.2
Simhadri, H.V.3
-
4
-
-
78650822594
-
Diagnosis, tuning, and redesign for multicore performance: A case study of the Fast Multipole Method
-
New Orleans, LA, USA, November
-
A. Chandramowlishwaran, K. Madduri, and R. Vuduc. Diagnosis, tuning, and redesign for multicore performance: A case study of the Fast Multipole Method. In Proc. ACM/IEEE Conf. Supercomputing (SC), New Orleans, LA, USA, November 2010.
-
(2010)
Proc. ACM/IEEE Conf. Supercomputing (SC)
-
-
Chandramowlishwaran, A.1
Madduri, K.2
Vuduc, R.3
-
5
-
-
77953980209
-
Optimizing and tuning the Fast Multipole Method for state-of-the-art multicore architectures
-
Atlanta, GA, USA, April
-
A. Chandramowlishwaran, S. Williams, L. Oliker, I. Lashuk, G. Biros, and R. Vuduc. Optimizing and tuning the Fast Multipole Method for state-of-the-art multicore architectures. In Proc. IEEE Int'l. Parallel and Distributed Processing Symp. (IPDPS), Atlanta, GA, USA, April 2010.
-
(2010)
Proc. IEEE Int'l. Parallel and Distributed Processing Symp. (IPDPS)
-
-
Chandramowlishwaran, A.1
Williams, S.2
Oliker, L.3
Lashuk, I.4
Biros, G.5
Vuduc, R.6
-
6
-
-
0000396658
-
A fast algorithm for particle simulations
-
L. Greengard and V. Rokhlin. A fast algorithm for particle simulations. J. Comp. Phys., 73:325-348, 1987.
-
(1987)
J. Comp. Phys.
, vol.73
, pp. 325-348
-
-
Greengard, L.1
Rokhlin, V.2
-
7
-
-
74049157044
-
A massively parallel adaptive Fast Multipole Method on heterogeneous architectures
-
Portland, OR, USA, November
-
I. Lashuk, A. Chandramowlishwaran, H. Langston, T.-A. Nguyen, R. Sampath, A. Shringarpure, R. Vuduc, L. Ying, D. Zorin, and G. Biros. A massively parallel adaptive Fast Multipole Method on heterogeneous architectures. In Proc. ACM/IEEE Conf. Supercomputing (SC), Portland, OR, USA, November 2009.
-
(2009)
Proc. ACM/IEEE Conf. Supercomputing (SC)
-
-
Lashuk, I.1
Chandramowlishwaran, A.2
Langston, H.3
Nguyen, T.-A.4
Sampath, R.5
Shringarpure, A.6
Vuduc, R.7
Ying, L.8
Zorin, D.9
Biros, G.10
-
9
-
-
78650814738
-
Petascale direct numerical simulation of blood flow on 200k cores and heterogeneous architectures
-
New Orleans, LA, USA, November
-
A. Rahimian, I. Lashuk, D. Malhotra, A. Chandramowlishwaran, L. Moon, R. Sampath, A. Shringarpure, S. Veerapaneni, J. Vetter, R. Vuduc, D. Zorin, and G. Biros. Petascale direct numerical simulation of blood flow on 200k cores and heterogeneous architectures. In Proc. ACM/IEEE Conf. Supercomputing (SC), New Orleans, LA, USA, November 2010.
-
(2010)
Proc. ACM/IEEE Conf. Supercomputing (SC)
-
-
Rahimian, A.1
Lashuk, I.2
Malhotra, D.3
Chandramowlishwaran, A.4
Moon, L.5
Sampath, R.6
Shringarpure, A.7
Veerapaneni, S.8
Vetter, J.9
Vuduc, R.10
Zorin, D.11
Biros, G.12
-
10
-
-
79961058189
-
What GPU computing means for high-end systems
-
July/August
-
R. Vuduc and K. Czechowski. What GPU computing means for high-end systems. IEEE Micro, 31(4):74-78, July/August 2011.
-
(2011)
IEEE Micro
, vol.31
, Issue.4
, pp. 74-78
-
-
Vuduc, R.1
Czechowski, K.2
-
11
-
-
2442446356
-
A kernel-independent adaptive fast multipole algorithm in two and three dimensions
-
DOI 10.1016/j.jcp.2003.11.021
-
L. Ying, D. Zorin, and G. Biros. A kernel-independent adaptive Fast Multipole Method in two and three dimensions. J. Comp. Phys., 196:591-626, May 2004. (Pubitemid 38613258)
-
(2004)
Journal of Computational Physics
, vol.196
, Issue.2
, pp. 591-626
-
-
Ying, L.1
Biros, G.2
Zorin, D.3
|