Volume 14, Issue 10, 2003, Pages 1021-1034

An Efficient Code Generation Technique for Tiled Iteration Spaces

Author keywords

Code generation; Fourier Motzkin elimination; Loop tiling; Nonunimodular transformations; Supernodes

Indexed keywords

CODES (SYMBOLS); COMPUTATIONAL COMPLEXITY; DIGITAL STORAGE; ITERATIVE METHODS; MATHEMATICAL TRANSFORMATIONS; MATRIX ALGEBRA; PARALLEL PROCESSING SYSTEMS

EID: 0242578173     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2003.1239870     Document Type: Article
Times cited: (25)

References (40)
  • 2
    • 0029373981
    • Automatic Partitioning of Parallel Loops and Data Arrays for Distributed Shared-Memory Multiprocessors
    • Sept.
    • A. Agarwal, D. Kranz, and V. Natarajan, "Automatic Partitioning of Parallel Loops and Data Arrays for Distributed Shared-Memory Multiprocessors," IEEE Trans. Parallel and Distributed Systems, vol. 6, no. 9, pp. 943-962, Sept. 1995.
    • (1995) IEEE Trans. Parallel and Distributed Systems , vol.6 , Issue.9 , pp. 943-962
    • Agarwal, A.1    Kranz, D.2    Natarajan, V.3
  • 9
    • 0032028841
    • Determining the Idle Time of a Tiling: New Results
    • Mar.
    • F. Desprez, J. Dongarra, and Y. Robert, "Determining the Idle Time of a Tiling: New Results," J. Information Science and Eng., vol. 14, pp. 167-190, Mar. 1997.
    • (1997) J. Information Science and Eng. , vol.14 , pp. 167-190
    • Desprez, F.1    Dongarra, J.2    Robert, Y.3
  • 10
    • 0026891897
    • Partitioning and Labeling of Loops by Unimodular Transformations
    • July
    • E. D'Hollander, "Partitioning and Labeling of Loops by Unimodular Transformations," IEEE Trans. Parallel and Distributed Systems, vol. 3, no. 4, pp. 465-476, July 1992.
    • (1992) IEEE Trans. Parallel and Distributed Systems , vol.3 , Issue.4 , pp. 465-476
    • D'Hollander, E.1
  • 16
    • 0032069399
    • On Supernode Transformation with Minimized Total Running Time
    • May
    • E. Hodzic and W. Shang, "On Supernode Transformation with Minimized Total Running Time," IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 5, pp. 417-428, May 1998.
    • (1998) IEEE Trans. Parallel and Distributed Systems , vol.9 , Issue.5 , pp. 417-428
    • Hodzic, E.1    Shang, W.2
  • 23
    • 0003904906
    • The Omega Library Interface Guide
    • Computer Science Dept., Univ. of Maryland, College Park, Mar.
    • W. Kelly, V. Maslov, W. Pugh, E. Rosser, T. Shpeisman, and D. Wonnacott, "The Omega Library Interface Guide," Technical Report CS-TR-3445, Computer Science Dept., Univ. of Maryland, College Park, Mar. 1995.
    • (1995) Technical Report , vol.CS-TR-3445
    • Kelly, W.1    Maslov, V.2    Pugh, W.3    Rosser, E.4    Shpeisman, T.5    Wonnacott, D.6
  • 25
  • 26
    • 85031707006
    • Non-Unimodular Loop Transformations of Nested Loops
    • Nov.
    • J. Ramanujam, "Non-Unimodular Loop Transformations of Nested Loops," Proc. Supercomputing '92 Conf., pp. 214-223, Nov. 1992.
    • (1992) Proc. Supercomputing '92 Conf. , pp. 214-223
    • Ramanujam, J.1
  • 27
    • 0029518016
    • Beyond Unimodular Transformations
    • Oct.
    • J. Ramanujam, "Beyond Unimodular Transformations," J. Supercomputing, vol. 9, no. 4, pp. 365-389, Oct. 1995.
    • (1995) J. Supercomputing , vol.9 , Issue.4 , pp. 365-389
    • Ramanujam, J.1
  • 28
    • 38249009019
    • Tiling Multidimensional Iteration Spaces for Multicomputers
    • J. Ramanujam and P. Sadayappan, "Tiling Multidimensional Iteration Spaces for Multicomputers," J. Parallel and Distributed Computing, vol. 16, pp. 108-120, 1992.
    • (1992) J. Parallel and Distributed Computing , vol.16 , pp. 108-120
    • Ramanujam, J.1    Sadayappan, P.2
  • 29
    • 0026821247
    • Independent Partitioning of Algorithms with Uniform Dependencies
    • Feb.
    • W. Shang and J. Fortes, "Independent Partitioning of Algorithms with Uniform Dependencies," IEEE Trans. Computers, vol. 41, no. 2, pp. 190-206, Feb. 1992.
    • (1992) IEEE Trans. Computers , vol.41 , Issue.2 , pp. 190-206
    • Shang, W.1    Fortes, J.2
  • 30
    • 0029191426
    • Partitioning and Mapping Nested Loops for Linear Array Multicomputers
    • J.-P. Sheu and T.-S. Chen, "Partitioning and Mapping Nested Loops for Linear Array Multicomputers," J. Supercomputing, vol. 9, pp. 183-202, 1995.
    • (1995) J. Supercomputing , vol.9 , pp. 183-202
    • Sheu, J.-P.1    Chen, T.-S.2
  • 31
    • 0026231051
    • Partitioning and Mapping Nested Loops on Multiprocessor Systems
    • Oct.
    • J.-P. Sheu and T.-H. Tai, "Partitioning and Mapping Nested Loops on Multiprocessor Systems," IEEE Trans. Parallel and Distributed Systems, vol. 2, no. 4, pp. 430-439, Oct. 1991.
    • (1991) IEEE Trans. Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 430-439
    • Sheu, J.-P.1    Tai, T.-H.2
  • 34
    • 0000778059
    • Generating Efficient Tiled Code for Distributed Memory Machines
    • P. Tang and J. Xue, "Generating Efficient Tiled Code for Distributed Memory Machines," Parallel Computing, vol. 26, no. 11, pp. 1369-1410, 2000.
    • (2000) Parallel Computing , vol.26 , Issue.11 , pp. 1369-1410
    • Tang, P.1    Xue, J.2
  • 35
    • 0034262560
    • Chain Grouping: A Method for Partitioning Loops onto Mesh-Connected Processor Arrays
    • Sept.
    • P. Tsanakas, N. Koziris, and G. Papakonstantinou, "Chain Grouping: A Method for Partitioning Loops onto Mesh-Connected Processor Arrays," IEEE Trans. Parallel and Distributed Systems, vol. 11, no. 9, pp. 941-955, Sept. 2000.
    • (2000) IEEE Trans. Parallel and Distributed Systems , vol.11 , Issue.9 , pp. 941-955
    • Tsanakas, P.1    Koziris, N.2    Papakonstantinou, G.3
  • 37
    • 0026232450
    • A Loop Transformation Theory and an Algorithm to Maximize Parallelism
    • Oct.
    • M. Wolf and M. Lam, "A Loop Transformation Theory and an Algorithm to Maximize Parallelism," IEEE Trans. Parallel and Distributed Systems, vol. 2, no. 4, pp. 452-471, Oct. 1991.
    • (1991) IEEE Trans. Parallel and Distributed Systems , vol.2 , Issue.4 , pp. 452-471
    • Wolf, M.1    Lam, M.2
  • 38
    • 0028434044
    • Automatic Non-Unimodular Loop Transformations for Massive Parallelism
    • J. Xue, "Automatic Non-Unimodular Loop Transformations for Massive Parallelism," Parallel Computing, vol. 20, no. 5, pp. 711-728, 1994.
    • (1994) Parallel Computing , vol.20 , Issue.5 , pp. 711-728
    • Xue, J.1
  • 39
    • 0003125942
    • Communication-Minimal Tiling of Uniform Dependence Loops
    • J. Xue, "Communication-Minimal Tiling of Uniform Dependence Loops," J. Parallel and Distributed Computing, vol. 42, no. 1, pp. 42-59, 1997.
    • (1997) J. Parallel and Distributed Computing , vol.42 , Issue.1 , pp. 42-59
    • Xue, J.1
  • 40
    • 0036601528
    • Time-Minimal Tiling when Rise is Larger than Zero
    • J. Xue and W. Cai, "Time-Minimal Tiling when Rise is Larger than Zero," Parallel Computing, vol. 28, no. 6, pp. 915-939, 2002.
    • (2002) Parallel Computing , vol.28 , Issue.6 , pp. 915-939
    • Xue, J.1    Cai, W.2


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.