메뉴 건너뛰기




Volumn 180, Issue 12, 2009, Pages 2534-2543

HONEI: A collection of libraries for numerical computations targeting multiple processor architectures

Author keywords

Cell BE; CUDA; FEM for PDE; High performance computing; Mixed precision methods; Shallow water equations

Indexed keywords

CUDA; FEM FOR PDE; HIGH PERFORMANCE COMPUTING; MIXED PRECISION METHODS; SHALLOW WATER EQUATIONS;

EID: 70350602398     PISSN: 00104655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cpc.2009.04.018     Document Type: Article
Times cited : (12)

References (30)
  • 1
    • 70350591011 scopus 로고    scopus 로고
    • Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Matthew G. Knepley, Lois Curfman McInnes, Barry F. Smith, Hong Zhang, PETSc Web page, http://www.mcs.anl.gov/petsc, 2001
    • Satish Balay, Kris Buschelman, William D. Gropp, Dinesh Kaushik, Matthew G. Knepley, Lois Curfman McInnes, Barry F. Smith, Hong Zhang, PETSc Web page, http://www.mcs.anl.gov/petsc, 2001
  • 4
    • 70350605445 scopus 로고    scopus 로고
    • Alfredo Buttari, Jack J. Dongarra, Jakub Kurzak, PLASMA Web page, http://icl.cs.utk.edu/plasma, 2009
    • Alfredo Buttari, Jack J. Dongarra, Jakub Kurzak, PLASMA Web page, http://icl.cs.utk.edu/plasma, 2009
  • 5
    • 38049058008 scopus 로고    scopus 로고
    • The impact of multicore on math software
    • Proceedings of PARA 2006, Applied Parallel Computing. State of the Art in Scientific Computing, Springer
    • Buttari A., Dongarra J.J., Kurzak J., Langou J., Luszczek P., and Tomov S. The impact of multicore on math software. Proceedings of PARA 2006, Applied Parallel Computing. State of the Art in Scientific Computing. Lecture Notes in Computer Science vol. 4699 (2006), Springer 1-10
    • (2006) Lecture Notes in Computer Science , vol.4699 , pp. 1-10
    • Buttari, A.1    Dongarra, J.J.2    Kurzak, J.3    Langou, J.4    Luszczek, P.5    Tomov, S.6
  • 6
    • 70350578630 scopus 로고    scopus 로고
    • Alfredo Buttari, Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra, George Bosilca, SCOP3: A rough guide to scientific computing on the PlayStation 3, Technical report, Innovative Computing Laboratory, University of Tennessee Knoxville, 2007. UT-CS-07-595
    • Alfredo Buttari, Piotr Luszczek, Jakub Kurzak, Jack J. Dongarra, George Bosilca, SCOP3: A rough guide to scientific computing on the PlayStation 3, Technical report, Innovative Computing Laboratory, University of Tennessee Knoxville, 2007. UT-CS-07-595
  • 7
    • 70350616155 scopus 로고    scopus 로고
    • Phillip Colella, Thom H. Dunning Jr., William D. Gropp, David E. Keyes, A science-based case for large-scale simulation, Technical report, Office of Science, US Department of Energy, http://www.pnl.gov/scales, July 2003
    • Phillip Colella, Thom H. Dunning Jr., William D. Gropp, David E. Keyes, A science-based case for large-scale simulation, Technical report, Office of Science, US Department of Energy, http://www.pnl.gov/scales, July 2003
  • 8
    • 20444470676 scopus 로고    scopus 로고
    • Numerical solution of the two-dimensional shallow water equations by the application of relaxation methods
    • Delis A.I., and Katsaounis T.D. Numerical solution of the two-dimensional shallow water equations by the application of relaxation methods. Applied Mathematical Modelling 29 8 (2005) 754-783
    • (2005) Applied Mathematical Modelling , vol.29 , Issue.8 , pp. 754-783
    • Delis, A.I.1    Katsaounis, T.D.2
  • 9
    • 26444516623 scopus 로고    scopus 로고
    • Fixed and adaptive cache aware algorithms for multigrid methods
    • Multigrid Methods VI. Dick E., Riemslagh K., and Vierendeels J. (Eds), Springer
    • Douglas C.C., Hu J., Karl W., Kowarschik M., Rüde U., and Weiß C. Fixed and adaptive cache aware algorithms for multigrid methods. In: Dick E., Riemslagh K., and Vierendeels J. (Eds). Multigrid Methods VI. Lecture Notes in Computational Science and Engineering vol. 14 (2000), Springer 87-93
    • (2000) Lecture Notes in Computational Science and Engineering , vol.14 , pp. 87-93
    • Douglas, C.C.1    Hu, J.2    Karl, W.3    Kowarschik, M.4    Rüde, U.5    Weiß, C.6
  • 11
    • 70350582737 scopus 로고    scopus 로고
    • A note on cache memory methods for multigrid in three dimensions
    • Douglas C.C., and Thorne D.T. A note on cache memory methods for multigrid in three dimensions. Contemporary Mathematics 306 (2002) 167-177
    • (2002) Contemporary Mathematics , vol.306 , pp. 167-177
    • Douglas, C.C.1    Thorne, D.T.2
  • 12
    • 34548207355 scopus 로고    scopus 로고
    • Kayvon Fatahalian, Timothy J. Knight, Mike Houston, Mattan Erez, Daniel R. Horn, Larkhoon Leem, Ji Young Park, Manman Ren, Alex Aiken, William J. Dally, Pat Hanrahan, Sequoia: Programming the memory hierarchy, in: SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, November 2006
    • Kayvon Fatahalian, Timothy J. Knight, Mike Houston, Mattan Erez, Daniel R. Horn, Larkhoon Leem, Ji Young Park, Manman Ren, Alex Aiken, William J. Dally, Pat Hanrahan, Sequoia: Programming the memory hierarchy, in: SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, November 2006
  • 13
    • 70350604398 scopus 로고    scopus 로고
    • Dominik Göddeke, Robert Strzodka, Performance and accuracy of hardware-oriented native, emulated- and mixed-precision solvers in FEM simulations (Part 2: Double precision GPUs), Technical report, Fakultät für Mathematik, Technische Universität Dortmund, 2008 (Invited talk at NVISION 2008 - The World of Visual Computing, nummer 370)
    • Dominik Göddeke, Robert Strzodka, Performance and accuracy of hardware-oriented native, emulated- and mixed-precision solvers in FEM simulations (Part 2: Double precision GPUs), Technical report, Fakultät für Mathematik, Technische Universität Dortmund, 2008 (Invited talk at NVISION 2008 - The World of Visual Computing, nummer 370)
  • 14
    • 33947588604 scopus 로고    scopus 로고
    • Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
    • Göddeke D., Strzodka R., and Turek S. Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. International Journal of Parallel, Emergent and Distributed Systems 22 4 (2007) 221-256
    • (2007) International Journal of Parallel, Emergent and Distributed Systems , vol.22 , Issue.4 , pp. 221-256
    • Göddeke, D.1    Strzodka, R.2    Turek, S.3
  • 15
    • 70350584775 scopus 로고    scopus 로고
    • Kazushige Goto, GotoBLAS, http://www.tacc.utexas.edu/resources/software/#blas
    • GotoBLAS
    • Goto, K.1
  • 18
    • 70350602376 scopus 로고    scopus 로고
    • IBM Corporation, SPE Runtime Management Library, http://www-01.ibm.com/chips/techlib/techlib.nsf/pages/main, 2007
    • (2007) SPE Runtime Management Library
  • 20
    • 14044257293 scopus 로고    scopus 로고
    • Terascale implicit methods for partial differential equations
    • Recent Advances in Numerical Methods for Partial Differential Equations and Applications. Feng X., and Schulze T.P. (Eds), American Mathematical Society
    • Keyes D.E. Terascale implicit methods for partial differential equations. In: Feng X., and Schulze T.P. (Eds). Recent Advances in Numerical Methods for Partial Differential Equations and Applications. Contemporary Mathematics vol. 306 (January 2002), American Mathematical Society 29-84
    • (2002) Contemporary Mathematics , vol.306 , pp. 29-84
    • Keyes, D.E.1
  • 21
    • 34548206782 scopus 로고    scopus 로고
    • Tools and techniques for performance - exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems)
    • Julie Langou, Julien Langou, Piotr Luszczek, Jakub Kurzak, Alfredo Buttari, Jack J. Dongarra, Tools and techniques for performance - exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems), in: SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 2006, p. 113
    • (2006) SC '06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing , pp. 113
    • Langou, J.1    Langou, J.2    Luszczek, P.3    Kurzak, J.4    Buttari, A.5    Dongarra, J.J.6
  • 22
    • 44849137198 scopus 로고    scopus 로고
    • NVIDIA Tesla: A unified graphics and computing architecture
    • Lindholm E., Nickolls J., Oberman S., and Montrym J. NVIDIA Tesla: A unified graphics and computing architecture. IEEE Micro 28 2 (2008) 39-55
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 23
    • 70350580611 scopus 로고    scopus 로고
    • NVIDIA Corporation, NVIDIA CUDA Compute Unified Device Architecture Programming Guide (Version 2.0), http://www.nvidia.com/cuda, 2008
    • NVIDIA Corporation, NVIDIA CUDA Compute Unified Device Architecture Programming Guide (Version 2.0), http://www.nvidia.com/cuda, 2008
  • 27
    • 27344435504 scopus 로고    scopus 로고
    • Dac C. Pham, Shigehiro Asano, Mark Bolliger, Michael N. Day, H. Peter Hofstee, Charles R. Johns, James A. Kahle, Atsushi Kameyama, John Keaty, Yoshio Masubuchi, Mack Riley, David Shippy, Daniel L. Stasiak, Masakazu Suzuoki, M. Wang, James Warnock, Steve Weitzel, Dieter Wendel, Takeshi Yamazaki, Kazuaki Yazawa, The design and implementation of a first-generation CELL processor, in: Solid-State Circuits Conference, ISSCC 2005, Digest of Technical Papers, 1, February 2005, pp. 184-592
    • Dac C. Pham, Shigehiro Asano, Mark Bolliger, Michael N. Day, H. Peter Hofstee, Charles R. Johns, James A. Kahle, Atsushi Kameyama, John Keaty, Yoshio Masubuchi, Mack Riley, David Shippy, Daniel L. Stasiak, Masakazu Suzuoki, M. Wang, James Warnock, Steve Weitzel, Dieter Wendel, Takeshi Yamazaki, Kazuaki Yazawa, The design and implementation of a first-generation CELL processor, in: Solid-State Circuits Conference, ISSCC 2005, Digest of Technical Papers, vol. 1, February 2005, pp. 184-592
  • 28
    • 84946717199 scopus 로고    scopus 로고
    • Sony Corporation, IBM Corporation, Cell BE processor and blade systems, http://www.ibm.com/developerworks/power/cell
    • Sony Corporation, Toshiba Corporation, IBM Corporation, Cell BE processor and blade systems, http://www-03.ibm.com/technology/splash/qs20/, http://www.ibm.com/developerworks/power/cell
    • Toshiba Corporation
  • 29
    • 26444596160 scopus 로고    scopus 로고
    • Hardware-oriented numerics and concepts for PDE software
    • Turek S., Becker C., and Kilian S. Hardware-oriented numerics and concepts for PDE software. Future Generation Computer Systems 22 1-2 (2004) 217-238
    • (2004) Future Generation Computer Systems , vol.22 , Issue.1-2 , pp. 217-238
    • Turek, S.1    Becker, C.2    Kilian, S.3
  • 30
    • 34247349114 scopus 로고    scopus 로고
    • Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine Yelick, The potential of the Cell processor for scientific computing, in: CF '06: Proceedings of the ACM International Conference on Computing Frontiers, May 2006, pp. 9-20
    • Samuel Williams, John Shalf, Leonid Oliker, Shoaib Kamil, Parry Husbands, Katherine Yelick, The potential of the Cell processor for scientific computing, in: CF '06: Proceedings of the ACM International Conference on Computing Frontiers, May 2006, pp. 9-20


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.