메뉴 건너뛰기




Volumn 18, Issue 2, 2010, Pages 107-123

Adjacency-based data reordering algorithm for acceleration of finite element computations

Author keywords

cache penalty model; Data reordering; finite element analysis; unstructured mesh

Indexed keywords

CACHE MEMORY; FINITE ELEMENT METHOD; GENES; MEMORY ARCHITECTURE; MESH GENERATION;

EID: 77954751611     PISSN: 10589244     EISSN: None     Source Type: Journal    
DOI: 10.1155/2010/273921     Document Type: Article
Times cited : (16)

References (33)
  • 2
    • 0031143627 scopus 로고    scopus 로고
    • A general topology-based mesh data structure
    • M.W. Beall and M.S. Shephard, A general topology-based mesh data structure, Int. J. Numer. Methods Engrg. 40(9) (1997), 1573-1596.
    • (1997) Int. J. Numer. Methods Engrg. , vol.40 , Issue.9 , pp. 1573-1596
    • Beall, M.W.1    Shephard, M.S.2
  • 3
    • 0031120395 scopus 로고    scopus 로고
    • Renumbering unstructured grids to improve the performance of codes on hierarchical memory machines
    • D.A. Burgess and M.B. Giles, Renumbering unstructured grids to improve the performance of codes on hierarchical memory machines, Adv. Engrg. Soft. 28 (1997), 189-201.
    • (1997) Adv. Engrg. Soft. , vol.28 , pp. 189-201
    • Burgess, D.A.1    Giles, M.B.2
  • 4
    • 33645913852 scopus 로고    scopus 로고
    • Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations
    • A.L.G.A. Coutinho and M.A.D. Martins, Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations, Int. J. Numer. Methods Engrg. 66 (2006), 431-460.
    • (2006) Int. J. Numer. Methods Engrg. , vol.66 , pp. 431-460
    • Coutinho, A.L.G.A.1    Martins, M.A.D.2
  • 6
    • 0014612601 scopus 로고
    • Reducing the bandwidth of sparse symmetric matrices
    • ACM, ACM, New York, NY, USA
    • E. Cuthill and J. McKee, Reducing the bandwidth of sparse symmetric matrices, in: Proc. 24th Nat. Conf. ACM, ACM, New York, NY, USA, 1969, pp. 157-172.
    • (1969) Proc. 24th Nat. Conf. , pp. 157-172
    • Cuthill, E.1    McKee, J.2
  • 8
    • 0016939622 scopus 로고
    • An algorithm for reducing the bandwidth and profile of sparse matrix
    • N.E. Gibbs, W.G. Poole Jr. and P.K. Stockmeyer, An algorithm for reducing the bandwidth and profile of sparse matrix, SIAM J. Numer. Anal. 13 (1976), 236-250.
    • (1976) SIAM J. Numer. Anal. , vol.13 , pp. 236-250
    • Gibbs, N.E.1    Poole Jr., W.G.2    Stockmeyer, P.K.3
  • 9
    • 0017269689 scopus 로고
    • A comparison of several bandwidth and profile reduction algorithm
    • N.E. Gibbs, W.G. Poole Jr. and P.K. Stockmeyer, A comparison of several bandwidth and profile reduction algorithm, ACM Trans. Math. Soft. 2 (1976), 322-330.
    • (1976) ACM Trans. Math. Soft. , vol.2 , pp. 322-330
    • Gibbs, N.E.1    Poole Jr., W.G.2    Stockmeyer, P.K.3
  • 10
    • 33745715056 scopus 로고    scopus 로고
    • Exploiting locality for irregular scientific codes
    • H. Han and C.W. Tseng, Exploiting locality for irregular scientific codes, IEEE Trans. Parallel Distrib. Systems 17 (2006), 606-618.
    • (2006) IEEE Trans. Parallel Distrib. Systems , vol.17 , pp. 606-618
    • Han, H.1    Tseng, C.W.2
  • 11
    • 0033689713 scopus 로고    scopus 로고
    • Self-avoiding walks over adaptive unstructured grids
    • G. Heber, R. Biswas and G.R. Gao, Self-avoiding walks over adaptive unstructured grids, Concurrency Pract. Exper. 12 (2000), 85-109.
    • (2000) Concurrency Pract. Exper. , vol.12 , pp. 85-109
    • Heber, G.1    Biswas, R.2    Gao, G.R.3
  • 12
    • 85184373524 scopus 로고    scopus 로고
    • http://www.anandtech.com/.
  • 13
    • 85184372383 scopus 로고    scopus 로고
    • http://techreport.com/articles.x/13176/3.
  • 14
    • 0034287408 scopus 로고    scopus 로고
    • Generalized- α method for integrating the filtered Navier-Stokes equations with a stabilized finite element method
    • K.E. Jansen, C.H. Whiting and G.M. Hulbert, Generalized- α method for integrating the filtered Navier-Stokes equations with a stabilized finite element method, Comput. Methods Appl. Mech. Engrg. 190(3,4) (2000), 305-319.
    • (2000) Comput. Methods Appl. Mech. Engrg. , vol.190 , Issue.3-4 , pp. 305-319
    • Jansen, K.E.1    Whiting, C.H.2    Hulbert, G.M.3
  • 15
    • 0029709954 scopus 로고    scopus 로고
    • A parallel algorithm for multilevel graph partitioning and sparse matrix ordering
    • Washington, DC, USA
    • G. Karypis and V. Kumar, A parallel algorithm for multilevel graph partitioning and sparse matrix ordering, in: 10th Intl. Parallel Processing Symposium, IEEE Computer Society, Washington, DC, USA, 1996, pp. 314-319.
    • (1996) 10th Intl. Parallel Processing Symposium, IEEE Computer Society , pp. 314-319
    • Karypis, G.1    Kumar, V.2
  • 16
    • 0032155185 scopus 로고    scopus 로고
    • Renumbering strategies for unstructured-grid solvers operating on shared-memory, cache-based parallel machines
    • R. Löhner, Renumbering strategies for unstructured-grid solvers operating on shared-memory, cache-based parallel machines, Comput. Methods Appl. Mech. Engrg. 163 (1998), 95-109.
    • (1998) Comput. Methods Appl. Mech. Engrg. , vol.163 , pp. 95-109
    • Löhner, R.1
  • 17
    • 0036571680 scopus 로고    scopus 로고
    • Minimization of indirect addressing for edge-based field solvers
    • R. Löhner and M. Galle, Minimization of indirect addressing for edge-based field solvers, Commun. Numer. Methods Engrg. 18(5) (2002), 335-343.
    • (2002) Commun. Numer. Methods Engrg. , vol.18 , Issue.5 , pp. 335-343
    • Löhner, R.1    Galle, M.2
  • 18
    • 85184359735 scopus 로고    scopus 로고
    • Ordering schemes for sparse matrices using modern programming paradigms
    • L. Oliker, X. Li, P. Husbands and R. Biswas, Ordering schemes for sparse matrices using modern programming paradigms, LBNL47803, 2000.
    • (2000) LBNL47803
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 19
    • 85184370679 scopus 로고    scopus 로고
    • Parallel conjugate gradient: Effects of ordering strategies, programming paradigms, and architectural platforms
    • L. Oliker, X. Li, P. Husbands and R. Biswas, Parallel conjugate gradient: effects of ordering strategies, programming paradigms, and architectural platforms, LBNL45828, 2000.
    • (2000) LBNL45828
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 20
    • 0036734103 scopus 로고    scopus 로고
    • Effects of ordering strategies and programming paradigms on sparse matrix computations
    • L. Oliker, X. Li, P. Husbands and R. Biswas, Effects of ordering strategies and programming paradigms on sparse matrix computations, SIAM Rev. 44 (2002), 373-393.
    • (2002) SIAM Rev , vol.44 , pp. 373-393
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 23
    • 0000048673 scopus 로고
    • GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems
    • Y. Saad and M.H. Schultz, GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems, SIAM J. Sci. Statist. Comput. 7 (1986), 856-869.
    • (1986) SIAM J. Sci. Statist. Comput. , vol.7 , pp. 856-869
    • Saad, Y.1    Schultz, M.H.2
  • 25
    • 33845690348 scopus 로고    scopus 로고
    • Efficient distributed mesh data structure for parallel automated adaptive analysis
    • E.S. Seol and M.S. Shephard, Efficient distributed mesh data structure for parallel automated adaptive analysis, Eng. Comput. 22(3,4) (2006), 197-213.
    • (2006) Eng. Comput. , vol.22 , Issue.3-4 , pp. 197-213
    • Seol, E.S.1    Shephard, M.S.2
  • 26
    • 0024752479 scopus 로고
    • A multi-element group preconditioned GMRES algorithm for nonsymmetric systems arising in finite element analysis
    • F. Shakib, T.J.R. Hughes and Z. Johan, A multi-element group preconditioned GMRES algorithm for nonsymmetric systems arising in finite element analysis, Comput. Methods Appl. Mech. Engrg. 75 (1989), 415-456.
    • (1989) Comput. Methods Appl. Mech. Engrg. , vol.75 , pp. 415-456
    • Shakib, F.1    Hughes, T.J.R.2    Johan, Z.3
  • 28
    • 33645906891 scopus 로고    scopus 로고
    • Outflow boundary conditions for three-dimensional finite element modeling of blood flow and pressure in arteries
    • I.E. Vigon-Clementel, C.A. Figueroa, K.E. Jansen and C.A. Taylor, Outflow boundary conditions for three-dimensional finite element modeling of blood flow and pressure in arteries, Comput. Mehtods Appl. Mech. Engrg. 195 (2006), 3776-3796.
    • (2006) Comput. Mehtods Appl. Mech. Engrg. , vol.195 , pp. 3776-3796
    • Vigon-Clementel, I.E.1    Figueroa, C.A.2    Jansen, K.E.3    Taylor, C.A.4
  • 29
    • 85184364576 scopus 로고    scopus 로고
    • available at
    • B. Waldecker, Amd technical briefing, available at: http:// www.nccs.gov/wp-content/training/scaling-workshop-pdfs/ AMD- ORNL-073007.pdf.
    • Amd Technical Briefing
    • Waldecker, B.1
  • 30
    • 0035104205 scopus 로고    scopus 로고
    • A stabilized finite element method for the incompressible Navier-Stokes equations using a hierarchical basis
    • C.H. Whiting and K.E. Jansen, A stabilized finite element method for the incompressible Navier-Stokes equations using a hierarchical basis, Int. J. Numer. Meth. Fluids 35 (2001), 93-116.
    • (2001) Int. J. Numer. Meth. Fluids , vol.35 , pp. 93-116
    • Whiting, C.H.1    Jansen, K.E.2
  • 31
    • 60949098907 scopus 로고    scopus 로고
    • Optimization of sparse matrix-vector multiplication on emerging multicore platforms
    • S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick and J. Demmel, Optimization of sparse matrix-vector multiplication on emerging multicore platforms, Parallel Comput. 35 (2009), 178-194.
    • (2009) Parallel Comput , vol.35 , pp. 178-194
    • Williams, S.1    Oliker, L.2    Vuduc, R.3    Shalf, J.4    Yelick, K.5    Demmel, J.6
  • 32
    • 55249091059 scopus 로고
    • Method for the calculation of velocity, rate of flow and viscous drag in arteries when the pressure gradient is known
    • J. Womersley, Method for the calculation of velocity, rate of flow and viscous drag in arteries when the pressure gradient is known, J. Physiol. 127 (1955), 553-563.
    • (1955) J. Physiol. , vol.127 , pp. 553-563
    • Womersley, J.1
  • 33
    • 77954707501 scopus 로고    scopus 로고
    • Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
    • A.N. Yzelman and R.H. Bisseling, Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods, SIAM J. Sci. Comput. 31 (2009), 3128-3254.
    • (2009) SIAM J. Sci. Comput. , vol.31 , pp. 3128-3254
    • Yzelman, A.N.1    Bisseling, R.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.