메뉴 건너뛰기




Volumn 33, Issue 10-11, 2007, Pages 685-699

Exploring weak scalability for FEM calculations on a GPU-enhanced cluster

Author keywords

Commodity based clusters; Finite elements; Graphics processors; Heterogeneous computing; Parallel multigrid solvers

Indexed keywords

BANDWIDTH; CLUSTER ANALYSIS; COMPUTATION THEORY; ENERGY UTILIZATION;

EID: 35748969304     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.parco.2007.09.002     Document Type: Article
Times cited : (103)

References (35)
  • 1
    • 35748947064 scopus 로고    scopus 로고
    • AMD, Inc., Torrenza technology, 2006. .
  • 2
    • 35748929774 scopus 로고    scopus 로고
    • C. Becker, Strategien und Methoden zur Ausnutzung der High-Performance-Computing-Ressourcen moderner Rechnerarchitekturen für Finite Element Simulationen und ihre Realisierung in FEAST (Finite Element Analysis and Solution Tools), Ph.D. thesis, Universität Dortmund, Fachbereich Mathematik, 2007.
  • 4
    • 35748978670 scopus 로고    scopus 로고
    • A. Buttari, J. Dongarra, J. Kurzak, Limitations of the PlayStation 3 for high performance cluster computing, Tech. rep., University of Tennessee Computer Science, CS-07-594, 2007.
  • 5
    • 35748946169 scopus 로고    scopus 로고
    • A. Buttari, P. Luszczek, J. Kurzak, J. Dongarra, G. Bosilca, SCOP3: a rough guide to scientific computing on the PlayStation 3, Tech. rep., Innovative Computing Laboratory, University of Tennessee Knoxville, UT-CS-07-595, 2007.
  • 6
    • 35748957625 scopus 로고    scopus 로고
    • ClearSpeed Technology, Inc., ClearSpeed Advance Accelerator Boards, 2006. .
  • 7
    • 35748952942 scopus 로고    scopus 로고
    • Cray Inc., Cray XD1 supercomputer, 2006. .
  • 8
    • 35748933053 scopus 로고    scopus 로고
    • G. Da Graça, D. Defour, Implementation of float-float operators on graphics hardware, in: Seventh Conference on Real Numbers and Computers, RNC7, 2006.
  • 9
    • 2942655475 scopus 로고    scopus 로고
    • A column pre-ordering strategy for the unsymmetric-pattern multifrontal method
    • Davis T.A. A column pre-ordering strategy for the unsymmetric-pattern multifrontal method. ACM Transactions on Mathematical Software 30 2 (2004) 165-195
    • (2004) ACM Transactions on Mathematical Software , vol.30 , Issue.2 , pp. 165-195
    • Davis, T.A.1
  • 10
    • 84934299651 scopus 로고    scopus 로고
    • Z. Fan, F. Qiu, A. Kaufman, S. Yoakum-Stover, GPU cluster for high performance computing, in: SC'04: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, 2004.
  • 11
    • 35748965702 scopus 로고    scopus 로고
    • Genomic Sciences Center, RIKEN, The GRAPE series of processors, , , .
  • 12
    • 35748983657 scopus 로고    scopus 로고
    • D. Göddeke, R. Strzodka, J. Mohd-Yusof, P. McCormick, H. Wobker, C. Becker, S. Turek, Using GPUs to improve multigrid solver performance on a cluster, International Journal of Computational Science and Engineering, in press.
  • 13
    • 33947588604 scopus 로고    scopus 로고
    • Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
    • Göddeke D., Strzodka R., and Turek S. Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. International Journal of Parallel, Emergent and Distributed Systems 22 4 (2007) 221-256
    • (2007) International Journal of Parallel, Emergent and Distributed Systems , vol.22 , Issue.4 , pp. 221-256
    • Göddeke, D.1    Strzodka, R.2    Turek, S.3
  • 14
    • 35748969711 scopus 로고    scopus 로고
    • GPGPU, General-purpose computation using graphics hardware, 2007. .
  • 15
    • 35748970602 scopus 로고    scopus 로고
    • GraphStream, Inc., GraphStream scalable computing platform (SCP), 2006. .
  • 17
    • 35748956228 scopus 로고    scopus 로고
    • Intel, Inc., Geneseo: PCI Express technology advancement, 2006. .
  • 18
    • 35748977236 scopus 로고    scopus 로고
    • S. Kilian, ScaRC: Ein verallgemeinertes Gebietszerlegungs-/Mehrgitterkonzept auf Parallelrechnern, Ph.D. thesis, Universität Dortmund, Fachbereich Mathematik, 2001.
  • 20
    • 35748970850 scopus 로고    scopus 로고
    • Mercury Computer Systems, Inc., Cell BE accelerator boards. , .
  • 21
    • 35748981925 scopus 로고    scopus 로고
    • H. Meuer, E. Strohmaier, J.J. Dongarra, H.D. Simon, Top500 supercomputer sites, 2007. .
  • 22
    • 35748955955 scopus 로고    scopus 로고
    • F. Mueller, J. Weston, NSCU PlayStation 3 computing cluster. .
  • 23
    • 35748954198 scopus 로고    scopus 로고
    • National Center for Supercomputing Applications, Computer Science, University of Illinois, Scientific computing on the PlayStation 2. .
  • 24
    • 35748971614 scopus 로고    scopus 로고
    • NVIDIA Corporation, NVIDIA CUDA compute unified device architecture programming guide, January 2007. .
  • 26
    • 35748971275 scopus 로고    scopus 로고
    • V. Pande, Stanford University, Folding@Home on ATI GPUs, 2006. .
  • 27
    • 77951558943 scopus 로고    scopus 로고
    • M. Peercy, M. Segal, D. Gerstmann, A performance-oriented data parallel virtual machine for GPUs, in: SIGGRAPH'06: ACM SIGGRAPH 2006 Sketches, 2006.
  • 28
    • 56349149338 scopus 로고    scopus 로고
    • J.W. Sheaffer, D.P. Luebke, K. Skadron, A hardware redundancy and recovery mechanism for reliable scientific computation on graphics processors, in: T. Aila, M. Segal (Eds.), Graphics Hardware 2007, 2007.
  • 29
    • 35748966258 scopus 로고    scopus 로고
    • Sony, Toshiba, IBM, Cell BE processor and blade systems. , .
  • 30
    • 25844449063 scopus 로고    scopus 로고
    • Scientific computation for simulations on programmable graphics hardware
    • Strzodka R., Doggett M., and Kolb A. Scientific computation for simulations on programmable graphics hardware. Simulation Modelling Practice and Theory 13 8 (2005) 667-680
    • (2005) Simulation Modelling Practice and Theory , vol.13 , Issue.8 , pp. 667-680
    • Strzodka, R.1    Doggett, M.2    Kolb, A.3
  • 31
    • 33947595619 scopus 로고    scopus 로고
    • D. Tarditi, S. Puri, J. Oglesby, Accelerator: using data parallelism to program GPUs for general-purpose uses, in: Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems 2006, 2006.
  • 32
    • 84958956603 scopus 로고    scopus 로고
    • A. Thall, Extended-precision floating-point numbers for GPU computation, in: SIGGRAPH'06: ACM SIGGRAPH 2006 Research Posters, 2006.
  • 33
    • 35748958856 scopus 로고    scopus 로고
    • Tokyo Institute of Technology, Global scientific information and computing center. .
  • 34
    • 26444596160 scopus 로고    scopus 로고
    • Hardware-oriented numerics and concepts for PDE software
    • Turek S., Becker C., and Kilian S. Hardware-oriented numerics and concepts for PDE software. Future Generation Computer Systems 22 1-2 (2003) 217-238
    • (2003) Future Generation Computer Systems , vol.22 , Issue.1-2 , pp. 217-238
    • Turek, S.1    Becker, C.2    Kilian, S.3
  • 35
    • 35748934912 scopus 로고    scopus 로고
    • Stanford University Graphics Lab, GPU bench - How much does your GPU bench? 2006. Availbale at: .


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.