메뉴 건너뛰기




Volumn 35, Issue 4, 2013, Pages

Parallel three-dimensional nonequispaced fast fourier transforms and their application to particle simulation

Author keywords

NFFT; Parallel fast summation; Parallel nonequispaced fast Fourier transform; Parallel particle mesh methods

Indexed keywords

APPROXIMATION ALGORITHMS; CHARGED PARTICLES; DISTRIBUTED COMPUTER SYSTEMS; MEMORY ARCHITECTURE; MESH GENERATION; OPEN SOURCE SOFTWARE; OPEN SYSTEMS; SIGNAL RECEIVERS;

EID: 84886844122     PISSN: 10648275     EISSN: 10957200     Source Type: Journal    
DOI: 10.1137/120888478     Document Type: Article
Times cited : (30)

References (59)
  • 1
    • 84872903743 scopus 로고    scopus 로고
    • Parallel implementation and scalability analysis of 3d fast fourier transform using 2d domain decomposition
    • O. Ayala and L.P. Wang, Parallel implementation and scalability analysis of 3D fast Fourier transform using 2D domain decomposition, Parallel Comput., 39 (2013), pp. 58-77.
    • (2013) Parallel Comput , vol.39 , pp. 58-77
    • Ayala, O.1    Wang, L.P.2
  • 2
    • 11944254490 scopus 로고
    • Force fields for silicas and aluminophosphates based on ab initio calculations
    • B.W.H. van Beest and G.J. Kramer, Force fields for silicas and aluminophosphates based on ab initio calculations, Phys. Rev. Lett., 64 (1990), pp. 1955-1958.
    • (1990) Phys. Rev. Lett , vol.64 , pp. 1955-1958
    • Van Beest, B.W.H.1    Kramer, G.J.2
  • 3
    • 3142776518 scopus 로고
    • On the fast fourier transform of functions with singularities
    • G. Beylkin, On the fast Fourier transform of functions with singularities, Appl. Comput. Harmon. Anal., 2 (1995), pp. 363-381.
    • (1995) Appl. Comput. Harmon. Anal , vol.2 , pp. 363-381
    • Beylkin, G.1
  • 4
    • 38049155737 scopus 로고    scopus 로고
    • Library support for parallel sorting in scientific computations
    • Lecture Notes in Comput. Sci., Springer, Berlin
    • H. Dachsel, M. Hofmann, and G. Rünger, Library support for parallel sorting in scientific computations, in Proceedings of the 13th International Euro-Par Conference, 4641, Lecture Notes in Comput. Sci. 2007, Springer, Berlin, pp. 695-704.
    • (2007) Proceedings of the 13th International Euro-Par Conference , vol.4641 , pp. 695-704
    • Dachsel, H.1    Hofmann, M.2    Rünger, G.3
  • 5
    • 22244433235 scopus 로고    scopus 로고
    • How to mesh up Ewald sums. I. A theoretical and numerical comparison of various particle mesh routines
    • DOI 10.1063/1.477414, PII S0021960698515425
    • M. Deserno and C. Holm, How to mesh up Ewald sums. I. A theoretical and numerical comparison of various particle mesh routines, J. Chem. Phys., 109 (1998), pp. 7678-7693. (Pubitemid 128678339)
    • (1998) Journal of Chemical Physics , vol.109 , Issue.18 , pp. 7678-7693
    • Deserno, M.1    Holm, C.2
  • 7
    • 0033102416 scopus 로고    scopus 로고
    • Nonuniform fast Fourier transform
    • A.J.W. Duijndam and M. A. Schonewille, Nonuniform fast Fourier transform, Geophysics, 64 (1999), pp. 539-551. (Pubitemid 29388687)
    • (1999) Geophysics , vol.64 , Issue.2 , pp. 539-551
    • Duijndam, A.J.W.1    Schonewille, M.A.2
  • 8
    • 0000527817 scopus 로고
    • Fast fourier transforms for nonequispaced data
    • A. Dutt and V. Rokhlin, Fast Fourier transforms for nonequispaced data, SIAM J. Sci. Comput., 14 (1993), pp. 1368-1393.
    • (1993) SIAM J. Sci. Comput , vol.14 , pp. 1368-1393
    • Dutt, A.1    Rokhlin, V.2
  • 9
    • 33847785446 scopus 로고    scopus 로고
    • Field inhomogeneity correction based on gridding reconstruction for magnetic resonance imaging
    • DOI 10.1109/TMI.2006.891502
    • H. Eggers, T. Knopp, and D. Potts, Field inhomogeneity correction based on gridding reconstruction, IEEE Trans. Med. Imaging, 26 (2007), pp. 374-384. (Pubitemid 46392345)
    • (2007) IEEE Transactions on Medical Imaging , vol.26 , Issue.3 , pp. 374-384
    • Eggers, H.1    Knopp, T.2    Potts, D.3
  • 10
    • 0011937093 scopus 로고    scopus 로고
    • Fast fourier transform for nonequispaced data
    • C.K. Chui and L.L. Schumaker, eds., Vanderbilt University Press, Nashville, TN
    • B. Elbel and G. Steidl, Fast Fourier transform for nonequispaced data, in Approximation Theory IX, C.K. Chui and L.L. Schumaker, eds., Vanderbilt University Press, Nashville, TN, 1998, pp. 39-46.
    • (1998) Approximation Theory IX , pp. 39-46
    • Elbel, B.1    Steidl, G.2
  • 11
    • 35248815996 scopus 로고    scopus 로고
    • A volumetric fft for bluegene/l, high performance computing
    • T.M. Pinkston and V.K. Prasanna, eds., Springer, Berlin
    • M. Eleftheriou, J.E. Moreira, B.G. Fitch, and R.S. Germain, A volumetric FFT for BlueGene/L, High Performance Computing, Lecture Notes in Comput. Sci. 2913, T.M. Pinkston and V.K. Prasanna, eds., Springer, Berlin, 2003, pp, 194-203.
    • (2003) Lecture Notes in Comput. Sci , vol.2913 , pp. 194-203
    • Eleftheriou, M.1    Moreira, J.E.2    Fitch, B.G.3    Germain, R.S.4
  • 13
    • 84977266737 scopus 로고
    • Die berechnung optischer und elektrostatischer gitterpotentiale
    • P.P. Ewald, Die Berechnung optischer und elektrostatischer Gitterpotentiale, Ann. Phys. (4), 369 (1921), pp. 253-287.
    • (1921) Ann. Phys. , vol.369 , Issue.4 , pp. 253-287
    • Ewald, P.P.1
  • 14
    • 33947229391 scopus 로고    scopus 로고
    • Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer
    • DOI 10.1016/j.cpc.2006.12.006, PII S0010465507000276
    • B. Fang, Y. Deng, and G. Martyna, Performance of the 3D FFT on the 6D network torus QCDOC parallel supercomputer, Comput. Phys. Comm., 176 (2007), pp. 531-538. (Pubitemid 46435804)
    • (2007) Computer Physics Communications , vol.176 , Issue.8 , pp. 531-538
    • Fang, B.1    Deng, Y.2    Martyna, G.3
  • 15
    • 27844485098 scopus 로고    scopus 로고
    • Fast nfft based summation of radial functions
    • M. Fen and G. Steidl, Fast NFFT based summation of radial functions, Sampl. Theory Signal Image Process., 3 (2004), pp. 1-28.
    • (2004) Sampl. Theory Signal Image Process , vol.3 , pp. 1-28
    • Fen, M.1    Steidl, G.2
  • 16
    • 0037306521 scopus 로고    scopus 로고
    • Nonuniform fast fourier transforms using min-max interpolation
    • J.A. Fessler and B.P. Sutton, Nonuniform fast Fourier transforms using min-max interpolation, IEEE Trans. Signal Process., 51 (2003), pp. 560-574.
    • (2003) IEEE Trans. Signal Process , vol.51 , pp. 560-574
    • Fessler, J.A.1    Sutton, B.P.2
  • 17
    • 0742289178 scopus 로고    scopus 로고
    • Non-Equispaced Fast Fourier Transforms with Applications to Tomography
    • DOI 10.1007/s00041-003-0021-1
    • K. Fourmont, Non equispaced fast Fourier transforms with applications to tomography, J. Fourier Anal. Appl., 9 (2003), pp. 431-450. (Pubitemid 38156933)
    • (2003) Journal of Fourier Analysis and Applications , vol.9 , Issue.5 , pp. 431-450
    • Fourmont, K.1
  • 18
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • DOI 10.1109/JPROC.2004.840301, Program Generation, Optimization and Platform Adaptation
    • M. Frigo and S.G. Johnson, The design and implementation of FFTW3, Proc. IEEE, 93 (2005), pp. 216-231. (Pubitemid 40851223)
    • (2005) Proceedings of the IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 20
    • 4944230740 scopus 로고    scopus 로고
    • Accelerating the nonuniform fast fourier transform
    • L. Greengard and J.-Y. Lee, Accelerating the nonuniform fast Fourier transform, SIAM Rev., 46 (2004), pp. 443-454.
    • (2004) SIAM Rev , vol.46 , pp. 443-454
    • Greengard, L.1    Lee, J.-Y.2
  • 21
    • 45449095403 scopus 로고    scopus 로고
    • Numerical simulation in molecular dynamics
    • Springer, Berlin
    • M. Griebel, S. Knapek, and G. Zumbusch, Numerical simulation in molecular dynamics, Texts Comput. Sci. Eng. 5, Springer, Berlin, 2007.
    • (2007) Texts Comput. Sci. Eng , vol.5
    • Griebel, M.1    Knapek, S.2    Zumbusch, G.3
  • 23
    • 33744992654 scopus 로고    scopus 로고
    • Ewald summation based on nonuniform fast Fourier transform
    • DOI 10.1016/j.cplett.2006.04.106, PII S0009261406006051
    • F. Hedman and A. Laaksonen, Ewald summation based on nonuniform fast Fourier transform, Chem. Phys. Lett., 425 (2006), pp. 142-147. (Pubitemid 43866661)
    • (2006) Chemical Physics Letters , vol.425 , Issue.1-3 , pp. 142-147
    • Hedman, F.1    Laaksonen, A.2
  • 24
    • 46249092554 scopus 로고    scopus 로고
    • GROMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation
    • B. Hess, C. Kutzner, D. van der Spoel, and E. Lindahl, GROMACS 4 : Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation, J. Chem. Theory Comput., 4 (2008), pp. 435-447.
    • (2008) J. Chem. Theory Comput , vol.4 , pp. 435-447
    • Hess, B.1    Kutzner, C.2    Spoel Der D.Van3    Lindahl, E.4
  • 28
    • 84886823183 scopus 로고    scopus 로고
    • JuGene: Jülich Blue Gene/P, http://www.fz-juelich.de/ias/jsc/EN/ Expertise/Supercomputers/ JUGENE/JUGENE node.html.
    • JuGene: Jülich Blue Gene/P
  • 29
    • 84886845522 scopus 로고    scopus 로고
    • The error-controlled fast multipole method for open and periodic boundary conditions
    • IAS Series 6, G. Sutmann, P. Gibbon, and T. Lippert, eds., Forschungszentrum Jülich, Jülich
    • I. Kabadshow and H. Dachsel, The error-controlled fast multipole method for open and periodic boundary conditions, in Fast Methods for Long-Range Interactions in Complex Systems, IAS Series 6, G. Sutmann, P. Gibbon, and T. Lippert, eds., Forschungszentrum Jülich, Jülich, 2011, pp. 85-113.
    • (2011) Fast Methods for Long-Range Interactions in Complex Systems , pp. 85-113
    • Kabadshow, I.1    Dachsel, H.2
  • 31
    • 70349128524 scopus 로고    scopus 로고
    • Using nfft3 - A software library for various nonequispaced fast fourier transforms
    • J. Keiner, S. Kunis, and D. Potts, Using NFFT3 - a software library for various nonequispaced fast Fourier transforms, ACM Trans. Math. Software, 36 (2009), pp. 1-30.
    • (2009) ACM Trans. Math. Software , vol.36 , pp. 1-30
    • Keiner, J.1    Kunis, S.2    Potts, D.3
  • 32
    • 84886867397 scopus 로고    scopus 로고
    • The nonequispaced fft on graphics processing units
    • S. Kunis and S. Kunis, The nonequispaced FFT on graphics processing units, PAMM, 12 (2012), pp. 7-10.
    • (2012) PAMM , vol.12 , pp. 7-10
    • Kunis, S.1    Kunis, S.2
  • 33
    • 34547503017 scopus 로고    scopus 로고
    • Stability results for scattered data interpolation by trigonometric polynomials
    • S. Kunis and D. Potts, Stability results for scattered data interpolation by trigonometric polynomials, SIAM J. Sci. Comput., 29 (2007), pp. 1403-1419.
    • (2007) SIAM J. Sci. Comput , vol.29 , pp. 1403-1419
    • Kunis, S.1    Potts, D.2
  • 34
    • 33846276245 scopus 로고    scopus 로고
    • Fast Gauss transforms with complex parameters using NFFTs
    • DOI 10.1163/156939506779874626
    • S. Kunis, D. Potts, and G. Steidl, Fast Gauss transform with complex parameters using NFFTs, J. Numer. Math., 14 (2006), pp. 295-303. (Pubitemid 46110404)
    • (2006) Journal of Numerical Mathematics , vol.14 , Issue.4 , pp. 295-303
    • Kunis, S.1    Potts, D.2    Steidl, G.3
  • 37
    • 33645981294 scopus 로고    scopus 로고
    • ESPResSo - An extensible simulation package for research on soft matter systems
    • H.J. Limbach, A. Arnold, B.A. Mann, C. Holm, and G. Berne, ESPResSo - an extensible simulation package for research on soft matter systems, Comput. Phys. Comm., 174 (2006), pp. 704-727.
    • (2006) Comput. Phys. Comm , vol.174 , pp. 704-727
    • Limbach, H.J.1    Arnold, A.2    Mann, B.A.3    Holm, C.4    Berne, G.5
  • 38
    • 0346908258 scopus 로고    scopus 로고
    • 3M code for very large-scale cosmological simulations
    • PII S1384107698000335
    • T. MacFarland, H. Couchman, F. Pearce, and J. Pichlmeier, A new parallel code for very large-scale cosmological simulations, New Astron. Rev., 3 (1998), pp. 687-705. (Pubitemid 128467176)
    • (1998) New Astronomy , vol.3 , Issue.8 , pp. 687-705
    • Macfarland, T.1    Couchman, H.M.P.2    Pearce, F.R.3    Pichlmeier, J.4
  • 41
    • 84866371103 scopus 로고    scopus 로고
    • P3DFFT: A framework for parallel computations of fourier transforms in three dimensions
    • D. Pekurovsky, P3DFFT: A framework for parallel computations of Fourier transforms in three dimensions, SIAM J. Sci. Comput., 34 (2012), pp. C192-C209.
    • (2012) SIAM J. Sci. Comput , vol.34
    • Pekurovsky, D.1
  • 45
    • 84884972272 scopus 로고    scopus 로고
    • PFFT: An extension of fftw to massively parallel architectures
    • M. Pippig, PFFT: An extension of FFTW to massively parallel architectures, SIAM J. Sci. Comput., 35 (2013), pp. C213-C236.
    • (2013) SIAM J. Sci. Comput , vol.35
    • Pippig, M.1
  • 46
    • 84886821439 scopus 로고    scopus 로고
    • Particle simulation based on nonequispaced fast fourier transforms
    • IAS Series 6, G. Sutmann, P. Gibbon, and T. Lippert, eds., Forschungszentrum Jülich, Jülich
    • M. Pippig and D. Potts, Particle simulation based on nonequispaced fast Fourier transforms, in Fast Methods for Long-Range Interactions in Complex Systems, IAS Series 6, G. Sutmann, P. Gibbon, and T. Lippert, eds., Forschungszentrum Jülich, Jülich, 2011, pp. 131-158.
    • (2011) Fast Methods for Long-Range Interactions in Complex Systems , pp. 131-158
    • Pippig, M.1    Potts, D.2
  • 49
    • 0030172529 scopus 로고    scopus 로고
    • 3M, FMM, and the Ewald method for large periodic Coulombic systems
    • PII S0010465596000434
    • E. Pollock and J. Glosli, Comments on P3M, FMM, and the Ewald method for large periodic Coulombic systems, Comput. Phys. Comm., 95 (1996), pp. 93-110. (Pubitemid 126362828)
    • (1996) Computer Physics Communications , vol.95 , Issue.2-3 , pp. 93-110
    • Pollock, E.L.1    Glosli, J.2
  • 50
    • 0348207634 scopus 로고    scopus 로고
    • Fast summation at nonequispaced knots by nfft
    • D. Potts and G. Steidl, Fast summation at nonequispaced knots by NFFT, SIAM J. Sci. Comput., 24 (2003), pp. 2013-2037.
    • (2003) SIAM J. Sci. Comput , vol.24 , pp. 2013-2037
    • Potts, D.1    Steidl, G.2
  • 51
    • 4544236321 scopus 로고    scopus 로고
    • Fast convolution with radial kernels at nonequispaced knots
    • DOI 10.1007/s00211-004-0538-5
    • D. Potts, G. Steidl, and A. Nieslony, Fast convolution with radial kernels at nonequispaced knots, Numer. Math., 98 (2004), pp. 329-351. (Pubitemid 39216472)
    • (2004) Numerische Mathematik , vol.98 , Issue.2 , pp. 329-351
    • Potts, D.1    Steidl, G.2    Nieslony, A.3
  • 52
    • 0011937243 scopus 로고    scopus 로고
    • Fast fourier transforms for nonequispaced data: A tutorial
    • J.J. Benedetto and P.J.S.G. Ferreira, eds., Birkhäuser, Boston, MA
    • D. Potts, G. Steidl, and M. Tasche, Fast Fourier transforms for nonequispaced data: A tutorial, in Modern Sampling Theory: Mathematics and Applications, J.J. Benedetto and P.J.S.G. Ferreira, eds., Birkhäuser, Boston, MA, 2001, pp. 247-270.
    • (2001) Modern Sampling Theory: Mathematics and Applications , pp. 247-270
    • Potts, D.1    Steidl, G.2    Tasche, M.3
  • 55
    • 22444454428 scopus 로고    scopus 로고
    • A note on fast fourier transforms for nonequispaced grids
    • G. Steidl, A note on fast Fourier transforms for nonequispaced grids, Adv. Comput. Math., 9 (1998), pp. 337-353.
    • (1998) Adv. Comput. Math , vol.9 , pp. 337-353
    • Steidl, G.1
  • 56
    • 77955106795 scopus 로고    scopus 로고
    • An implementation of parallel 3-d fft with 2-d decomposition on a massively parallel cluster of multi-core processors
    • R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, eds., Springer, Berlin
    • D. Takahashi, An implementation of parallel 3-D FFT with 2-D decomposition on a massively parallel cluster of multi-core processors, in Parallel Processing and Applied Mathematics, Lecture Notes in Comput. Sci. 6067, R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, eds., Springer, Berlin, 2010, pp. 606-614.
    • (2010) Parallel Processing and Applied Mathematics, Lecture Notes in Comput. Sci , vol.6067 , pp. 606-614
    • Takahashi, D.1
  • 57
    • 0028098403 scopus 로고
    • Parallel p3m with exact calculation of short range forces
    • T. Theuns, Parallel P3M with exact calculation of short range forces, Comput. Phys. Comm., 78 (1994), pp. 238-246.
    • (1994) Comput. Phys. Comm , vol.78 , pp. 238-246
    • Theuns, T.1
  • 58
    • 84981190393 scopus 로고    scopus 로고
    • OpenMP parallelization in the nfft software library
    • T. Volkmer, OpenMP Parallelization in the NFFT Software Library, Preprint TU Chemnitz, preprint 7, 2012, http://www.tu-chemnitz.de/~potts/paper/ openmpNFFT.pdf.
    • (2012) Preprint TU Chemnitz, Preprint , vol.7
    • Volkmer, T.1
  • 59
    • 0032301587 scopus 로고    scopus 로고
    • Fast approximate Fourier transforms for irregularly spaced data
    • PII S003614459731533X
    • A.F. Ware, Fast approximate Fourier transforms for irregularly spaced data, SIAM Rev., 40 (1998), pp. 838-856. (Pubitemid 128620599)
    • (1998) SIAM Review , vol.40 , Issue.4 , pp. 838-856
    • Ware, A.F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.