메뉴 건너뛰기




Volumn 39, Issue 2, 2007, Pages 93-112

The memory behavior of cache oblivious stencil computations

Author keywords

Analysis of algorithms; Cache oblivious algorithms; Performance analysis; Stencil computations; System simulation

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; FINITE DIFFERENCE METHOD; HEAT LOSSES; ITERATIVE METHODS; MATHEMATICAL MODELS; PROBLEM SOLVING;

EID: 33947307610     PISSN: 09208542     EISSN: 15730484     Source Type: Journal    
DOI: 10.1007/s11227-007-0111-y     Document Type: Article
Times cited : (39)

References (25)
  • 2
    • 0024082546 scopus 로고
    • The input/output complexity of sorting and related problems
    • Aggarwal A, Vitter JS (1988) The input/output complexity of sorting and related problems. Commun ACM 31(9):1116-1127
    • (1988) Commun ACM , vol.31 , Issue.9 , pp. 1116-1127
    • Aggarwal, A.1    Vitter, J.S.2
  • 3
    • 0029520385 scopus 로고
    • Space-limited procedures: A methodology for portable high-performance
    • Berlin, Germany, October, IEEE Computer Society, pp
    • Alpern B, Carter L, Ferrante J (1995) Space-limited procedures: a methodology for portable high-performance. In: Conference on programming models for massively parallel computers, Berlin, Germany, October 1995. IEEE Computer Society, pp 10-17
    • (1995) Conference on programming models for massively parallel computers , pp. 10-17
    • Alpern, B.1    Carter, L.2    Ferrante, J.3
  • 6
    • 7844234794 scopus 로고
    • RISC microprocessors and scientific computing
    • Portland, OR, November
    • Bailey DH (1993) RISC microprocessors and scientific computing. In: Supercomputing'93, Portland, OR, November 1993, pp 645-654
    • (1993) Supercomputing'93 , pp. 645-654
    • Bailey, D.H.1
  • 8
    • 0029179296 scopus 로고
    • Upper bounds to processor-time tradeoffs under bounded-speed message propagation
    • ACM Press, Santa Barbara
    • Bilardi G, Preparata FP (1995) Upper bounds to processor-time tradeoffs under bounded-speed message propagation. In: 7th ACM symposium on parallel algorithms and architectures, ACM Press, Santa Barbara, 1995, pp 185-194
    • (1995) 7th ACM symposium on parallel algorithms and architectures , pp. 185-194
    • Bilardi, G.1    Preparata, F.P.2
  • 12
    • 33646818416 scopus 로고
    • Lattice-Boltzmann fluid dynamics: A versatile tool for multi-phase and other complicated flows
    • Chen S, Doolen GD, Eggert KG (1994) Lattice-Boltzmann fluid dynamics: a versatile tool for multi-phase and other complicated flows. Los Alamos Sci 22:98-19
    • (1994) Los Alamos Sci , vol.22 , pp. 98-19
    • Chen, S.1    Doolen, G.D.2    Eggert, K.G.3
  • 15
    • 32844463802 scopus 로고    scopus 로고
    • Cache oblivious stencil computations
    • Boston, MA, June, ACM Press, pp
    • Frigo M, Strumpen V (2005) Cache oblivious stencil computations. In: International conference on supercomputing, Boston, MA, June 2005. ACM Press, pp 361-366
    • (2005) International conference on supercomputing , pp. 361-366
    • Frigo, M.1    Strumpen, V.2
  • 17
    • 1542392269 scopus 로고    scopus 로고
    • On Reducing TLB Misses in Matrix Multiplication
    • Technical Report TR-2002-55, Department of Computer Sciences, The University of Texas at Austin FLAME Working Note #9
    • Goto K, van de Geijn R (2001) On Reducing TLB Misses in Matrix Multiplication. Technical Report TR-2002-55, Department of Computer Sciences, The University of Texas at Austin (FLAME Working Note #9)
    • (2001)
    • Goto, K.1    van de Geijn, R.2
  • 19
    • 33947306789 scopus 로고    scopus 로고
    • Kowarschik M (2004) Data locality optimizations for iterative numerical algorithms and cellular automata on hierarchical memory architectures. PhD thesis, Lehrstuhl für Informatik 10 (Systemsimulation), Institut für Informatik, Universität Erlangen-Nürnberg, Erlangen, Germany, July 2004
    • Kowarschik M (2004) Data locality optimizations for iterative numerical algorithms and cellular automata on hierarchical memory architectures. PhD thesis, Lehrstuhl für Informatik 10 (Systemsimulation), Institut für Informatik, Universität Erlangen-Nürnberg, Erlangen, Germany, July 2004
  • 21
    • 84934325826 scopus 로고    scopus 로고
    • Oliker L, Canning A, Carter J, Shalf J, Ethier S (2004) Scientific computations on modern parallel vector systems. In: Supercomputing'04, Pittsburgh, PA, November 2004, IEEE. http://www.sc-conference.org/sc2004/papers. html
    • Oliker L, Canning A, Carter J, Shalf J, Ethier S (2004) Scientific computations on modern parallel vector systems. In: Supercomputing'04, Pittsburgh, PA, November 2004, IEEE. http://www.sc-conference.org/sc2004/papers. html
  • 22
    • 84934289045 scopus 로고    scopus 로고
    • Pohl T, Deserno F, Thürey N, Rüde U, Lammers P, Wellein G, Zeiser T (2004) Performance evaluation of parallel large-scale lattice Boltzmann applications on three supercomputing architectures. In: Supercomputing'04, Pittsburgh, PA, November 2004, IEEE, http://www.sc-conference.org/sc2004/papers. html
    • Pohl T, Deserno F, Thürey N, Rüde U, Lammers P, Wellein G, Zeiser T (2004) Performance evaluation of parallel large-scale lattice Boltzmann applications on three supercomputing architectures. In: Supercomputing'04, Pittsburgh, PA, November 2004, IEEE, http://www.sc-conference.org/sc2004/papers. html
  • 23
    • 0008198155 scopus 로고    scopus 로고
    • Master's thesis, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, June
    • Prokop H (1999) Cache-oblivious algorithms. Master's thesis, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, June 1999
    • (1999) Cache-oblivious algorithms
    • Prokop, H.1
  • 25
    • 0031496750 scopus 로고    scopus 로고
    • Locality of reference in LU decomposition with partial pivoting
    • Toledo S (1997) Locality of reference in LU decomposition with partial pivoting. SIAM J Matrix Anal Appl 18(4):1065-1081
    • (1997) SIAM J Matrix Anal Appl , vol.18 , Issue.4 , pp. 1065-1081
    • Toledo, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.