Volume 182, Issue 10, 2011, Pages 2084-2098

How to obtain efficient GPU kernels: An illustration using FMM & FGT algorithms

Author keywords

Fast Gauss transform; Fast multipole method; Fast summation methods; Heterogeneous computing

Indexed keywords

BEOWULF CLUSTER; COMMODITY HARDWARE; COMPUTATIONAL SCIENCE; CORE METHOD; FAST GAUSS TRANSFORM; FAST MULTIPOLE METHOD; FAST SUMMATION; GRAPHICS PROCESSOR; HETEROGENEOUS COMPUTING; HIGH-PERFORMANCE COMPUTING; OPEN SOURCE SOFTWARE; PERFORMANCE IMPROVEMENTS;

EID: 79960050975     PISSN: 00104655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cpc.2011.05.002     Document Type: Article
Times cited : (12)

References (20)
  • 2
    • 79960053169
    • Unwelcome advice
    • June 30
    • A. Ghuloum, Unwelcome advice, Intel Research Blog, http://blogs.intel.com/research/2008/06/unwelcome-advice.php, June 30, 2008.
    • (2008) Intel Research Blog
    • Ghuloum, A.1
  • 4
    • 13444283314
    • Advances in viscous vortex methods - Meshless spatial adaption based on radial basis function interpolation
    • DOI 10.1002/fld.811
    • L.A. Barba, A. Leonard, C.B. Allen, Advances in viscous vortex methods - meshless spatial adaption based on radial basis function interpolation, Int. J. Num. Meth. Fluids 47 (5) (2005) 387-421, doi:10.1002/fld.811 (Pubitemid 40205516).
    • (2005) International Journal for Numerical Methods in Fluids , vol.47 , Issue.5 , pp. 387-421
    • Barba, L.A.1    Leonard, A.2    Allen, C.B.3
  • 5
    • 0000396658
    • A fast algorithm for particle simulations
    • 10.1016/0021-9991
    • L. Greengard, V. Rokhlin, A fast algorithm for particle simulations, J. Comput. Phys. 73 (2) (1987) 325-348, 10.1016/0021-9991
    • (1987) J. Comput. Phys. , vol.73 , Issue.2 , pp. 325-348
    • Greengard, L.1    Rokhlin, V.2
  • 6
    • 0343881269
    • Fast adaptive multipole method for computation of electrostatic energy in simulations of polyelectrolyte DNA
    • M.O. Fenley, W.K. Olson, K. Chua, A.H. Boschitsch, Fast adaptive multipole method for computation of electrostatic energy in simulations of polyelectrolyte DNA, J. Comput. Chem. 17 (8) (1996) 976-991 (Pubitemid 126572058).
    • (1996) Journal of Computational Chemistry , vol.17 , Issue.8 , pp. 976-991
    • Fenley, M.O.1    Olson, W.K.2    Chua, K.3    Boschitsch, A.H.4
  • 7
    • 0006703126
    • On the Rokhlin-Greengard method with vortex blobs for problems posed in all space or periodic in one direction
    • 10.1006/jcph.1995.1177
    • J.T. Hamilton, G. Majda, On the Rokhlin-Greengard method with vortex blobs for problems posed in all space or periodic in one direction, J. Comput. Phys. 121 (1) (1995) 29-50, doi:10.1006/jcph.1995.1177.
    • (1995) J. Comput. Phys. , vol.121 , Issue.1 , pp. 29-50
    • Hamilton, J.T.1    Majda, G.2
  • 8
    • 85013789628
    • Fast Multipole Methods for the Helmholtz Equation in Three Dimensions
    • 1st edition, Elsevier Ltd.
    • N.A. Gumerov, R. Duraiswami, Fast Multipole Methods for the Helmholtz Equation in Three Dimensions, 1st edition, Elsevier Series in Electromagnetism, Elsevier Ltd., 2004.
    • (2004) Elsevier Series in Electromagnetism
    • Gumerov, N.A.1    Duraiswami, R.2
  • 10
    • 0001922098
    • A short course on fast multipole methods
    • Oxford University Press, accessed August 2009
    • R. Beatson, L. Greengard, A short course on fast multipole methods, in: Wavelets, Multilevel Methods and Elliptic PDEs, Oxford University Press, 1997, pp. 1-37, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.129.7826, accessed August 2009.
    • (1997) Wavelets, Multilevel Methods and Elliptic PDEs , pp. 1-37
    • Beatson, R.1    Greengard, L.2
  • 11
    • 78650545454
    • PetFMM-a dynamically load-balancing parallel fast multipole library
    • 10.1002/nme.2972
    • F.A. Cruz, M.G. Knepley, L.A. Barba, PetFMM-a dynamically load-balancing parallel fast multipole library, Int. J. Num. Meth. Eng. 85 (4) (2010) 403-428, doi:10.1002/nme.2972.
    • (2010) Int. J. Num. Meth. Eng. , vol.85 , Issue.4 , pp. 403-428
    • Cruz, F.A.1    Knepley, M.G.2    Barba, L.A.3
  • 12
    • 44849094749
    • Fast N-body simulation with CUDA
    • Addison-Wesley Professional (Ch. 31)
    • L. Nyland, M. Harris, J. Prins, Fast N-body simulation with CUDA, in: GPU Gems 3, Addison-Wesley Professional, 2007, pp. 677-695 (Ch. 31).
    • (2007) GPU Gems 3 , pp. 677-695
    • Nyland, L.1    Harris, M.2    Prins, J.3
  • 13
    • 35148867733
    • High performance direct gravitational n-body simulations on graphics processing units II: An implementation in CUDA
    • R.G. Belleman, J. Bédorf, S.F. Portegies Zwart, High performance direct gravitational n-body simulations on graphics processing units II: An implementation in CUDA, New Astron. 13 (2008) 103-112.
    • (2008) New Astron. , vol.13 , pp. 103-112
    • Belleman, R.G.1    Bédorf, J.2    Portegies Zwart, S.F.3
  • 14
    • 48149107858
    • Fast multipole methods on graphics processors
    • 10.1016/j.jcp.2008.05.023
    • N.A. Gumerov, R. Duraiswami, Fast multipole methods on graphics processors, J. Comput. Phys. 227 (18) (2008) 8290-8313, doi:10.1016/j.jcp.2008.05.023.
    • (2008) J. Comput. Phys. , vol.227 , Issue.18 , pp. 8290-8313
    • Gumerov, N.A.1    Duraiswami, R.2
  • 15
    • 0141564564
    • Application of the fast Gauss transform to option pricing
    • M. Broadie, Y. Yamamoto, Application of the fast Gauss transform to option pricing, Manage. Sci. 49 (8) (2003) 1071-1088.
    • (2003) Manage. Sci. , vol.49 , Issue.8 , pp. 1071-1088
    • Broadie, M.1    Yamamoto, Y.2
  • 19
    • 70249095018
    • Characterization of the accuracy of the fast multipole method in particle simulations
    • 10.1002/nme.2611
    • F.A. Cruz, L.A. Barba, Characterization of the accuracy of the fast multipole method in particle simulations, Int. J. Num. Meth. Eng. 79 (13) (2009) 1577-1604, doi:10.1002/nme.2611.
    • (2009) Int. J. Num. Meth. Eng. , vol.79 , Issue.13 , pp. 1577-1604
    • Cruz, F.A.1    Barba, L.A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.