Volume 14, Issue 3, 2003, Pages 307-321

On the parallel execution time of tiled loops

Author keywords

Blocking; Compiler optimization; Parallel compilers; Tiling

Indexed keywords

COMPUTER SIMULATION; DIFFERENTIAL EQUATIONS; DYNAMIC PROGRAMMING; INTERPOLATION; LINEAR PROGRAMMING; PROGRAM COMPILERS; RESPONSE TIME (COMPUTER SYSTEMS);

EID: 0037962984     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2003.1189587     Document Type: Article
Times cited: 30

References (47)
  • 5
    • 4243923745
    • Matrix multiply benchmarks
    • technical report, Center for Scientific Computing, Dept. of Math., Univ. of Utah; This report is updated frequently
    • N.H.F. Beebe, "Matrix Multiply Benchmarks," technical report, Center for Scientific Computing, Dept. of Math., Univ. of Utah, 1990, This report is updated frequently.
    • (1990)
    • Beebe, N.H.F.
  • 6
  • 8
    • 0000493064
    • Estimating interlock and improving balance for pipelined machines
    • Aug.
    • D. Callahan, J. Cocke, and K. Kennedy, "Estimating Interlock and Improving Balance for Pipelined Machines," J. Parallel and Distributed Computing, vol. 5, no. 4, pp. 334-358, Aug. 1988.
    • (1988) J. Parallel and Distributed Computing , vol.5 , Issue.4 , pp. 334-358
    • Callahan, D.; Cocke, J.; Kennedy, K.
  • 10
    • 84964748976
    • Compiler blockability of numerical algorithms
    • Nov.
    • S. Carr and K. Kennedy, "Compiler Blockability of Numerical Algorithms," J. Supercomputing, pp. 114-124, Nov. 1992.
    • (1992) J. Supercomputing , pp. 114-124
    • Carr, S.; Kennedy, K.
  • 19
    • 0034299275
    • Generation of efficient nested loops from polyhedra
    • S.V. Rajopadhye, F. Quiller, and D. Wilde, "Generation of Efficient Nested Loops from Polyhedra," Int'l J. Parallel Programming, vol. 28, no. 5, pp. 469-498, 2000.
    • (2000) Int'l J. Parallel Programming , vol.28 , Issue.5 , pp. 469-498
    • Rajopadhye, S.V.; Quiller, F.; Wilde, D.
  • 21
    • 0003638028
    • Predicting performance for tiled perfectly nested loops
    • PhD thesis, Univ. of California, San Diego, Dept. of Computer Science and Eng., Dec.
    • K. Högstedt, "Predicting Performance for Tiled Perfectly Nested Loops," PhD thesis, Univ. of California, San Diego, Dept. of Computer Science and Eng., Dec. 1999.
    • (1999)
    • Högstedt, K.
  • 32
    • 0002238004
    • Tiling multidimensional iteration spaces for nonshared memory machines
    • Nov.
    • J. Ramanujam and P. Sadayappan, "Tiling Multidimensional Iteration Spaces for Nonshared Memory Machines," Supercomputing, Nov. 1991.
    • (1991) Supercomputing
    • Ramanujam, J.; Sadayappan, P.
  • 33
    • 0023384075
    • Stencils and problem partitionings: Their influence on the performance of multiple processor systems
    • July
    • D.A. Reed, L.M. Adams, and M.L. Patrick, "Stencils and Problem Partitionings: Their Influence on the Performance of Multiple Processor Systems," IEEE Trans. Computers, vol. 36, no. 7, pp. 845-858, July 1987.
    • (1987) IEEE Trans. Computers , vol.36 , Issue.7 , pp. 845-858
    • Reed, D.A.; Adams, L.M.; Patrick, M.L.
  • 34
    • 0031140581
    • Automatic selection of high-order transformations in the IBM XL FORTRAN compilers
    • V. Sarkar, "Automatic Selection of High-Order Transformations in the IBM XL FORTRAN Compilers," IBM J. Research and Development, vol. 41, no. 3, pp. 233-264, 1997.
    • (1997) IBM J. Research and Development , vol.41 , Issue.3 , pp. 233-264
    • Sarkar, V.
  • 36
    • 0037808951
    • Stanford SUIF Compiler System
    • Stanford SUIF Compiler System, http://suif.stanford.edu/, 2002.
    • (2002)
  • 37
    • 0038485309
    • Sweep3D Benchmark
    • Sweep3D Benchmark, www.llnl.gov/asci.benchmarks/asci/limtited/sweep3d/asci_sweep3d.html, 1995.
    • (1995)
  • 38
    • 0003278639
    • Automatically tuned linear algebra software
    • R.C. Whaley and J.J. Dongarra, "Automatically Tuned Linear Algebra Software," Supercomputer, 1998.
    • (1998) Supercomputer
    • Whaley, R.C.; Dongarra, J.J.
  • 39
    • 0003553286
    • Improving locality and parallelism in nested loops
    • PhD thesis, Stanford Univ., Computer Systems Laboratory, Aug.
    • M.E. Wolf, "Improving Locality and Parallelism in Nested Loops," PhD thesis, Stanford Univ., Computer Systems Laboratory, Aug. 1992.
    • (1992)
    • Wolf, M.E.
  • 41
    • 0026232450
    • A loop transformation theory and an algorithm to maximize parallelism
    • M.E. Wolf and M.S. Lam, "A Loop Transformation Theory and an Algorithm to Maximize Parallelism," IEEE Trans. Parallel and Distributed Systems, vol. 2, no. 4, pp. 452-471, 1991.
    • (1991) IEEE Trans. Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 452-471
    • Wolf, M.E.; Lam, M.S.
  • 44
    • 0024935630
    • More iteration space tiling
    • M.J. Wolfe, "More Iteration Space Tiling," Supercomputing, pp. 655-664, 1989.
    • (1989) Supercomputing , pp. 655-664
    • Wolfe, M.J.
  • 47
    • 0032315190
    • Reuse-driven tiling for improving data locality
    • J. Xue and C.-H. Huang, "Reuse-Driven Tiling for Improving Data Locality," Int'l J. Parallel Programming, vol. 26, no. 6, pp. 671-696, 1998.
    • (1998) Int'l J. Parallel Programming , vol.26 , Issue.6 , pp. 671-696
    • Xue, J.; Huang, C.-H.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.