메뉴 건너뛰기




Volumn 4, Issue 1, 2008, Pages 3-11

Performance comparison of different parallel lattice Boltzmann implementations on multi-core multi-socket systems

Author keywords

Cache oblivious; Cache optimisation; Lattice Boltzmann; Multi core; Shared memory parallelisation; Task queueing

Indexed keywords

ITERATIVE METHODS; MEMORY ARCHITECTURE; TIME DOMAIN ANALYSIS;

EID: 56349087850     PISSN: 17427185     EISSN: 17427193     Source Type: Journal    
DOI: 10.1504/IJCSE.2008.021107     Document Type: Article
Times cited : (17)

References (16)
  • 3
    • 33845417959 scopus 로고    scopus 로고
    • Hager, G., Zeiser, T., Treibig, J. and Wellein, G. (2006) 'Optimizing performance on modern HPC systems: Learning from simple kernel benchmarks', in Krause, E., Shokin, Y., Resch, M. and Shokina, N. (Eds.): Computational Science and High Performance Computing II: The 2nd Russian-German Advanced Research Workshop, Stuttgart, Germany, 14-16 March, 2005, Notes on Numerical Fluid Mechanics and Multidisciplinary Design, 91, Springer, ISBN: 3-540-31767-8, DOI:10.1007/3-540-31768-6_23.
    • Hager, G., Zeiser, T., Treibig, J. and Wellein, G. (2006) 'Optimizing performance on modern HPC systems: Learning from simple kernel benchmarks', in Krause, E., Shokin, Y., Resch, M. and Shokina, N. (Eds.): Computational Science and High Performance Computing II: The 2nd Russian-German Advanced Research Workshop, Stuttgart, Germany, 14-16 March, 2005, Notes on Numerical Fluid Mechanics and Multidisciplinary Design, Vol. 91, Springer, ISBN: 3-540-31767-8, DOI:10.1007/3-540-31768-6_23.
  • 4
    • 56349094603 scopus 로고    scopus 로고
    • Bachelor's Thesis, Lehrstuhl für Informatik, Systemsimulation, Universität Erlangen-Nürnberg
    • Iglberger, K. (2003) Cache Optimizations for the Lattice Boltzmann Method in 3D, Bachelor's Thesis, Lehrstuhl für Informatik 10 (Systemsimulation), Universität Erlangen-Nürnberg.
    • (2003) Cache Optimizations for the Lattice Boltzmann Method in 3D , pp. 10
    • Iglberger, K.1
  • 5
    • 56349162406 scopus 로고    scopus 로고
    • Intel (2005) Intel C++ Compiler Documentation. Document number: 304967-005.
    • Intel (2005) Intel C++ Compiler Documentation. Document number: 304967-005.
  • 7
    • 0345025793 scopus 로고    scopus 로고
    • STREAM: Sustainable Memory Bandwidth in High Performance Computers
    • Technical report, University of Virginia
    • McCalpin, J.D. (2004) STREAM: Sustainable Memory Bandwidth in High Performance Computers, Technical report, University of Virginia.
    • (2004)
    • McCalpin, J.D.1
  • 8
    • 56349089138 scopus 로고    scopus 로고
    • Nitsure, A., Rüde, U., Iglberger, K., Wellein, G., Hager, G. and Feichtinger, C. (2006) 'Optimization of cache oblivious lattice Boltzmann method in 2D and 3D', in Becker, M. and Szczerbicka, H. (Eds.): Frontiers in Simulation: Simulationstechnique - 19th Symposium in Hannover, September 2006 (ASIM), pp.265-270, Erlangen, SCS Publishing House.
    • Nitsure, A., Rüde, U., Iglberger, K., Wellein, G., Hager, G. and Feichtinger, C. (2006) 'Optimization of cache oblivious lattice Boltzmann method in 2D and 3D', in Becker, M. and Szczerbicka, H. (Eds.): Frontiers in Simulation: Simulationstechnique - 19th Symposium in Hannover, September 2006 (ASIM), pp.265-270, Erlangen, SCS Publishing House.
  • 9
    • 1242352441 scopus 로고    scopus 로고
    • Optimization and profiling of the cache performance of parallel lattice Boltzmann codes
    • Pohl, T., Kowarschik, M., Wilke, J., Igelberger, K. and Rüde, U. (2003) 'Optimization and profiling of the cache performance of parallel lattice Boltzmann codes', Parallel Process. Lett., Vol. 13, No. 4, pp.549-560.
    • (2003) Parallel Process. Lett , vol.13 , Issue.4 , pp. 549-560
    • Pohl, T.1    Kowarschik, M.2    Wilke, J.3    Igelberger, K.4    Rüde, U.5
  • 10
    • 78649765479 scopus 로고    scopus 로고
    • Tiling optimizations for 3D scientific computations
    • IEEE Computer Society, Dallas, Texas, Washington, DC, USA, p, ISBN: 0-7803-9802-5
    • Rivera, G. and Tseng, C.-W. (2000) 'Tiling optimizations for 3D scientific computations', Supercomputing '00: Proceedings of the 2000 ACM/IEEE Conference on Supercomputing (CDROM), IEEE Computer Society, Dallas, Texas, Washington, DC, USA, p.32, ISBN: 0-7803-9802-5, http:// doi.ieeecomputersociety.org/10.1109/SC.2000.10015
    • (2000) Supercomputing '00: Proceedings of the 2000 ACM/IEEE Conference on Supercomputing (CDROM) , pp. 32
    • Rivera, G.1    Tseng, C.-W.2
  • 11
    • 56349139855 scopus 로고    scopus 로고
    • Schulz, M., Krafczyk, M., Tölke, J. and Rank, E. (2002) 'Parallelization strategies and efficiency of CFD computations in complex geometries using lattice Boltzmann methods on high performance computers', Breuer, M., Durst, F. and Zenger, C. (Eds.): High Performance Scientific and Engineering Computing Proceedings of the 3rd International FORTWIHR Conference on HPSEC, Erlangen, 12-14 March, 2001 21 of Lecture Notes in Computational Science and Engineering, pp.115-122, Springer, Berlin.
    • Schulz, M., Krafczyk, M., Tölke, J. and Rank, E. (2002) 'Parallelization strategies and efficiency of CFD computations in complex geometries using lattice Boltzmann methods on high performance computers', Breuer, M., Durst, F. and Zenger, C. (Eds.): High Performance Scientific and Engineering Computing Proceedings of the 3rd International FORTWIHR Conference on HPSEC, Erlangen, 12-14 March, 2001 Vol. 21 of Lecture Notes in Computational Science and Engineering, pp.115-122, Springer, Berlin.
  • 12
    • 0032236389 scopus 로고    scopus 로고
    • Accuracy and computational efficiency in 3D dispersion via lattice-Boltzmann: Models for dispersion in rough fractures and double-diffusive fingering
    • Stockman, H.W. and Glass, R.J. (1998) 'Accuracy and computational efficiency in 3D dispersion via lattice-Boltzmann: Models for dispersion in rough fractures and double-diffusive fingering', IJMPC International Journal of Modern Physics C, Vol. 9, No. 8, pp.1545-1557.
    • (1998) IJMPC International Journal of Modern Physics C , vol.9 , Issue.8 , pp. 1545-1557
    • Stockman, H.W.1    Glass, R.J.2
  • 14
    • 41249089354 scopus 로고    scopus 로고
    • Wellein, G., Lammers, P., Hager, G., Donath, S. and Zeiser, T. (2006a) 'Towards optimal performance for lattice Boltzmann applications on terascale computers', in Deane, A., Brenner, G., Emerson, D., McDonough, J., Periaux, J., Tromeur-Dervout, D., Ecer, A. and Satofuka, N. (Eds.): Parallel Computational Fluid Dynamics: Theory and Applications, Proceedings of the 2005 International Conference on Parallel Computational Fluid Dynamics, 24-27 May, College Park, MD, USA, pp.31-40, Elsevier B.V., Amsterdam, Netherlands.
    • Wellein, G., Lammers, P., Hager, G., Donath, S. and Zeiser, T. (2006a) 'Towards optimal performance for lattice Boltzmann applications on terascale computers', in Deane, A., Brenner, G., Emerson, D., McDonough, J., Periaux, J., Tromeur-Dervout, D., Ecer, A. and Satofuka, N. (Eds.): Parallel Computational Fluid Dynamics: Theory and Applications, Proceedings of the 2005 International Conference on Parallel Computational Fluid Dynamics, 24-27 May, College Park, MD, USA, pp.31-40, Elsevier B.V., Amsterdam, Netherlands.
  • 15
    • 33646809359 scopus 로고    scopus 로고
    • On the single processor performance of simple lattice Boltzmann kernels
    • Wellein, G., Zeiser, T., Donath, S. and Hager, G. (2006b) 'On the single processor performance of simple lattice Boltzmann kernels', Comput. and Fluids, Vol. 35, pp.910-919.
    • (2006) Comput. and Fluids , vol.35 , pp. 910-919
    • Wellein, G.1    Zeiser, T.2    Donath, S.3    Hager, G.4
  • 16
    • 56349170328 scopus 로고    scopus 로고
    • Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method
    • Geneve, Inderscience Publishers, Switzerland, DOI: 10.1504/PCFD.2008.018088
    • Zeiser, T., Wellein, G., Nitsure, A., Iglberger, K. and Hager, G. (2008) 'Introducing a parallel cache oblivious blocking approach for the lattice Boltzmann method', Progress in Computational Fluid Dynamics Geneve, Inderscience Publishers, Switzerland, Vol. 8, Nos. 1-4, pp.179-188, DOI: 10.1504/PCFD.2008.018088.
    • (2008) Progress in Computational Fluid Dynamics , vol.8 , Issue.1-4 , pp. 179-188
    • Zeiser, T.1    Wellein, G.2    Nitsure, A.3    Iglberger, K.4    Hager, G.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.