메뉴 건너뛰기




Volumn 29, Issue 5, 2001, Pages 545-581

Optimized unrolling of nested loops

Author keywords

Loop transformations; Loop unrolling; Unroll factors; Unroll and jam

Indexed keywords

LOOP TRANSFORMATIONS; LOOP UNROLLING; UNROLL FACTORS; UNROLL-AND-JAM;

EID: 0348126362     PISSN: 08857458     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (32)

References (31)
  • 1
  • 4
    • 0028743437 scopus 로고
    • Compiler Transformations for High-Performance Computing
    • December
    • D. F. Bacon, S. L. Graham, and O. J. Sharp, Compiler Transformations for High-Performance Computing, ACM Computing Surveys 26(4):345-420 (December 1994).
    • (1994) ACM Computing Surveys , vol.26 , Issue.4 , pp. 345-420
    • Bacon, D.F.1    Graham, S.L.2    Sharp, O.J.3
  • 5
    • 0028277074 scopus 로고
    • Scalar Replacement in the Presence of Conditional Control Flow
    • January
    • Steve Carr and Ken Kennedy, Scalar Replacement in the Presence of Conditional Control Flow, Software - Practice and Experience (1):51-77 (January 1994).
    • (1994) Software - Practice and Experience , Issue.1 , pp. 51-77
    • Carr, S.1    Kennedy, K.2
  • 8
    • 10844249011 scopus 로고
    • Compiler Solutions for the Stale-Data and False-Sharing Problems
    • IBM Santa Teresa Laboratory April
    • Mauricio Breternitz, Michael Lai, Vivek Sarkar, and Barbara Simons, Compiler Solutions for the Stale-Data and False-Sharing Problems, Technical report, TR 03.466, IBM Santa Teresa Laboratory (April 1993).
    • (1993) Technical Report , vol.TR 03.466
    • Breternitz, M.1    Lai, M.2    Sarkar, V.3    Simons, B.4
  • 9
    • 0028549474 scopus 로고
    • Improving the Ratio of Memory Operations to Floating-Point Operations in Loops
    • November
    • Steve Carr and Ken Kennedy, Improving the Ratio of Memory Operations to Floating-Point Operations in Loops, ACM TOPLAS 16(4) (November 1994).
    • (1994) ACM TOPLAS , vol.16 , Issue.4
    • Carr, S.1    Kennedy, K.2
  • 12
    • 0031380928 scopus 로고    scopus 로고
    • Unroll-and-Jam Using Uniformly Generated Sets
    • December
    • S. Carr and Y. Guan, Unroll-and-Jam Using Uniformly Generated Sets, Proc. MICRO-30, pp. 349-357 (December 1997).
    • (1997) Proc. MICRO-30 , pp. 349-357
    • Carr, S.1    Guan, Y.2
  • 15
    • 0031140581 scopus 로고    scopus 로고
    • Automatic Selection of High order Transformations in the IBM XL Fortran Compilers
    • May
    • Vivek Sarkar, Automatic Selection of High Order Transformations in the IBM XL Fortran Compilers. IBM J. Res. Dev. 41(3) (May 1997).
    • (1997) IBM J. Res. Dev. , vol.41 , Issue.3
    • Sarkar, V.1
  • 16
    • 0004062640 scopus 로고
    • Pitman, London and The MIT Press, Cambridge, Massachusetts In the series, Research Monographs in Parallel and Distributed Computing
    • Michael J. Wolfe, Optimizing Supercompilers for Supercomputers, Pitman, London and The MIT Press, Cambridge, Massachusetts (1989). In the series, Research Monographs in Parallel and Distributed Computing.
    • (1989) Optimizing Supercompilers for Supercomputers
    • Wolfe, M.J.1
  • 20
    • 0028768013 scopus 로고
    • Iterative Modulo Scheduling: An Algorithm for Software Pipelining Loops
    • San Jose, California, November
    • B. Ramakrishna Rau, Iterative Modulo Scheduling: An Algorithm for Software Pipelining Loops, Proc. 27th Ann. Int'l. Symp. Microarchitecture, San Jose, California, pp. 63-74 (November 1994).
    • (1994) Proc. 27th Ann. Int'l. Symp. Microarchitecture , pp. 63-74
    • Ramakrishna Rau, B.1
  • 22
    • 0024700878 scopus 로고
    • Determining Average Program Execution Times and their Variance
    • July
    • Vivek Sarkar, Determining Average Program Execution Times and their Variance, Proc. SIGPLAN Conf. Prog. Lang. Design and Implementation 24(7):298-312 (July 1989).
    • (1989) Proc. SIGPLAN Conf. Prog. Lang. Design and Implementation , vol.24 , Issue.7 , pp. 298-312
    • Sarkar, V.1
  • 23
    • 0026213832 scopus 로고
    • Automatic Partitioning of a Program Dependence Graph into Parallel Tasks
    • Vivek Sarkar, Automatic Partitioning of a Program Dependence Graph into Parallel Tasks, IBM J. Res. Dev 35(5/6) (1991).
    • (1991) IBM J. Res. Dev , vol.35 , Issue.5-6
    • Sarkar, V.1
  • 24
    • 10844279641 scopus 로고    scopus 로고
    • The Standard Performance Evaluation Corporation, SPEC CPU95 Benchmarks, http://open.specbench.org/osg/cpu95/ (1997).
    • (1997) SPEC CPU95 Benchmarks
  • 25
    • 3342982100 scopus 로고
    • POWER2 and PowerPC
    • September
    • IBM Corporation, POWER2 and PowerPC, Special issue of IBM J. Res. Dev. 38(5): 489-648 (September 1994).
    • (1994) IBM J. Res. Dev. , vol.38 , Issue.5 SPEC. ISSUE , pp. 489-648
  • 27
    • 84862485598 scopus 로고    scopus 로고
    • Improving the Ratio of Memory Operations to Floating-Point operations in loops
    • Copy of review can be found in the ACM digital library
    • Max Hailperin, Improving the Ratio of Memory Operations to Floating-Point operations in loops, Computing Reviews. Copy of review can be found in the ACM digital library at http://www.acm.org/pubs/citations/journals/toplas/1994-16-6/p1768-carr/.
    • Computing Reviews
    • Hailperin, M.1
  • 31
    • 0029487787 scopus 로고
    • Unrolling-Based Optmizations for Modulo Sheduling
    • December
    • Daniel M. Lavery and Wen-Mei W.Hwu, Unrolling-Based Optmizations for Modulo Sheduling, Proc. MICRO-28, pp. 327-337 (December 1995).
    • (1995) Proc. MICRO-28 , pp. 327-337
    • Lavery, D.M.1    Hwu, W.-M.W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.