메뉴 건너뛰기




Volumn 64, Issue 1, 2004, Pages 108-134

Improving effective bandwidth through compiler enhancement of global cache reuse

Author keywords

Data locality; Data transformation; Global cache reuse; Loop fusion; Program analysis; Reference affinity

Indexed keywords


EID: 1642502420     PISSN: 07437315     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jpdc.2003.09.005     Document Type: Article
Times cited : (63)

References (60)
  • 1
    • 0019567795 scopus 로고
    • On the performance enhancement of paging systems through program analysis and transformations
    • May
    • W. Abu-Sufah, D. Kuck, D. Lawrie, On the performance enhancement of paging systems through program analysis and transformations, IEEE Trans. Comput. C 30 (5) (May 1981) 341-356.
    • (1981) IEEE Trans. Comput. C , vol.30 , Issue.5 , pp. 341-356
    • Abu-Sufah, W.1    Kuck, D.2    Lawrie, D.3
  • 4
    • 0026938452 scopus 로고
    • Vector register allocation
    • October
    • J.R. Allen, K. Kennedy, Vector register allocation, IEEE Trans. Comput. 41 (10) (October 1992) 1290-1317.
    • (1992) IEEE Trans. Comput. , vol.41 , Issue.10 , pp. 1290-1317
    • Allen, J.R.1    Kennedy, K.2
  • 7
    • 0002664653 scopus 로고
    • The history of Fortran I, II, and III
    • Wexelblat (Ed.), Academic Press, New York
    • J. Backus, The history of Fortran I, II, and III, in: Wexelblat (Ed.), History of Programming Languages, Academic Press, New York, 1981, pp. 25-45.
    • (1981) History of Programming Languages , pp. 25-45
    • Backus, J.1
  • 8
    • 0039227905 scopus 로고
    • Unfavorable strides in cache memory systems
    • Technical Report RNR-92-015, NASA Ames Research Center
    • D. Bailey, Unfavorable strides in cache memory systems, Technical Report RNR-92-015, NASA Ames Research Center, 1992.
    • (1992)
    • Bailey, D.1
  • 10
    • 0003003638 scopus 로고
    • A study of replacement algorithms for a virtual-storage computer
    • L.A. Belady, A study of replacement algorithms for a virtual-storage computer, IBM Systems J. 5 (2) (1966) 78-101.
    • (1966) IBM Systems J. , vol.5 , Issue.2 , pp. 78-101
    • Belady, L.A.1
  • 17
    • 0038220597 scopus 로고
    • Profitability computations on program flow graphs
    • Technical Report RC 5123, IBM
    • J. Cocke, K. Kennedy, Profitability computations on program flow graphs, Technical Report RC 5123, IBM, 1974.
    • (1974)
    • Cocke, J.1    Kennedy, K.2
  • 22
    • 0002678692 scopus 로고
    • On estimating and enhancing cache effectiveness
    • U. Banerjee, D. Gelernter, A. Nicolau, D. Padua (Eds.). Fourth International Workshop, Springer, Santa Clara, CA, August
    • J. Ferrante, V. Sarkar, W. Thrash. On estimating and enhancing cache effectiveness. in: U. Banerjee, D. Gelernter, A. Nicolau, D. Padua (Eds.). Languages and Compilers for Parallel Computing, Fourth International Workshop, Springer, Santa Clara, CA, August 1991.
    • (1991) Languages and Compilers for Parallel Computing
    • Ferrante, J.1    Sarkar, V.2    Thrash, W.3
  • 23
    • 0001366267 scopus 로고
    • Strategies for cache and local memory management by global program transformation
    • October
    • D. Gannon, W. Jalby, K. Gallivan, Strategies for cache and local memory management by global program transformation, J. Parallel Distrib. Comput. 5 (5) (October 1988) 587-616.
    • (1988) J. Parallel Distrib. Comput. , vol.5 , Issue.5 , pp. 587-616
    • Gannon, D.1    Jalby, W.2    Gallivan, K.3
  • 25
    • 0026186967 scopus 로고
    • An implementation of interprocedural bounded regular section analysis
    • July
    • P. Havlak, K. Kennedy, An implementation of interprocedural bounded regular section analysis, IEEE Trans. Parallel Distrib. Systems 2 (3) (July 1991) 350-360.
    • (1991) IEEE Trans. Parallel Distrib. Systems , vol.2 , Issue.3 , pp. 350-360
    • Havlak, P.1    Kennedy, K.2
  • 29
    • 85030909152 scopus 로고    scopus 로고
    • Resource constrained loop fusion
    • Technical Report TR03-424, Department of Computer Science, Rice University, September
    • K. Kennedy, C. Ding, Resource constrained loop fusion, Technical Report TR03-424, Department of Computer Science, Rice University, September 2003.
    • (2003)
    • Kennedy, K.1    Ding, C.2
  • 31
    • 1242268977 scopus 로고
    • Typed fusion with applications to parallel and sequential code generation
    • Technical Report TR93-208 Dept. of Computer Science, Rice University, August (also available as CRPC-TR94370)
    • K. Kennedy, K.S. McKinley, Typed fusion with applications to parallel and sequential code generation, Technical Report TR93-208 Dept. of Computer Science, Rice University, August 1993 (also available as CRPC-TR94370).
    • (1993)
    • Kennedy, K.1    McKinley, K.S.2
  • 33
    • 84983965442 scopus 로고
    • An empirical study of FORTRAN programs
    • D. Knuth, An empirical study of FORTRAN programs, Software - Practice Experience 1 (1971) 105-133.
    • (1971) Software - Practice Experience , vol.1 , pp. 105-133
    • Knuth, D.1
  • 36
    • 0006712810 scopus 로고    scopus 로고
    • Array restructuring for cache locality
    • Ph.D. Thesis, Technical Report UW-CSE-96-08-01, University of Washington
    • S. Leung, Array restructuring for cache locality, Ph.D. Thesis, Technical Report UW-CSE-96-08-01, University of Washington, 1996.
    • (1996)
    • Leung, S.1
  • 39
    • 85030902984 scopus 로고    scopus 로고
    • SimpleScalar LLC, SimpleScalar tool set, www.simplescalar.com
    • SimpleScalar LLC, SimpleScalar tool set, www.simplescalar.com.
  • 40
    • 0003475248 scopus 로고
    • Memory storage patterns in parallel processing
    • Kluwer Academic, Boston
    • M.E. Mace, Memory storage patterns in parallel processing, Kluwer Academic, Boston, 1987.
    • (1987)
    • Mace, M.E.1
  • 44
    • 0003665539 scopus 로고    scopus 로고
    • Quantifying loop nest locality using SPEC'95 and the perfect benchmarks
    • November
    • K.S. McKinley, O. Temam, Quantifying loop nest locality using SPEC'95 and the perfect benchmarks, ACM Transactions on Computer Systems 17 (4) (November 1999) 288-336.
    • (1999) ACM Transactions on Computer Systems , vol.17 , Issue.4 , pp. 288-336
    • McKinley, K.S.1    Temam, O.2
  • 47
    • 0003690938 scopus 로고
    • Software methods for improvement of cache performance
    • Ph.D. Thesis, Dept. of Computer Science, Rice University, May
    • A. Porterfield, Software methods for improvement of cache performance, Ph.D. Thesis, Dept. of Computer Science, Rice University, May 1989.
    • (1989)
    • Porterfield, A.1
  • 48
    • 84976676720 scopus 로고
    • A practical algorithm for exact array dependence analysis
    • August
    • W. Pugh, A practical algorithm for exact array dependence analysis, Comm. ACM 35 (8) (August 1992) 102-114.
    • (1992) Comm. ACM , vol.35 , Issue.8 , pp. 102-114
    • Pugh, W.1
  • 53
    • 0013009642 scopus 로고
    • Multi-configuration simulation algorithms for the evaluation of computer architecture designs
    • Technical Report, University of Michigan
    • R.A. Sugumar, S.G. Abraham, Multi-configuration simulation algorithms for the evaluation of computer architecture designs, Technical Report, University of Michigan, 1993.
    • (1993)
    • Sugumar, R.A.1    Abraham, S.G.2
  • 54
    • 0037882891 scopus 로고
    • Ph.D. Thesis, Dept. of Computer Science, Rice University
    • K.O. Thabit, Cache Management by the Compiler. Ph.D. Thesis, Dept. of Computer Science, Rice University, 1981.
    • (1981) Cache Management By the Compiler
    • Thabit, K.O.1
  • 56
    • 0011452853 scopus 로고
    • Ph.D. Thesis, Dept. of Computer Science, University of Illinois at Urbana-Champaign, October
    • M.J. Wolfe, Optimizing Supercompilers for Supercomputers. Ph.D. Thesis, Dept. of Computer Science, University of Illinois at Urbana-Champaign, October 1982.
    • (1982) Optimizing Supercompilers for Supercomputers
    • Wolfe, M.J.1
  • 58
    • 1542392248 scopus 로고    scopus 로고
    • Achieving scalable locality with time skewing
    • June
    • D. Wonnacott, Achieving scalable locality with time skewing, Internat. J. Parallel Programming 30 (3) (June 2002).
    • (2002) Internat. J. Parallel Programming , vol.30 , Issue.3
    • Wonnacott, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.