메뉴 건너뛰기




Volumn 51, Issue 4, 2005, Pages 223-250

Generating cache hints for improved program efficiency

Author keywords

Compiler optimization; EPIC; Replacement policy; Reuse distance; Source cache hint; Target cache hint

Indexed keywords

COMPUTER PROGRAMMING; COMPUTER SOFTWARE REUSABILITY; DATA PROCESSING; OPTIMIZATION; PROGRAM COMPILERS; PROGRAM PROCESSORS;

EID: 14944380098     PISSN: 13837621     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sysarc.2004.09.004     Document Type: Article
Times cited : (83)

References (55)
  • 1
    • 0033700781 scopus 로고    scopus 로고
    • Synthesizing transformations for locality enhancement of imperfectly-nested loop nests
    • N. Ahmed, N. Mateev, K. Pingali, Synthesizing transformations for locality enhancement of imperfectly-nested loop nests, in: ICS, 2000, pp. 141-152
    • (2000) ICS , pp. 141-152
    • Ahmed, N.1    Mateev, N.2    Pingali, K.3
  • 2
    • 0003003638 scopus 로고
    • A study of replacement algorithms for a virtual-storage computer
    • L.A. Belady A study of replacement algorithms for a virtual-storage computer IBM Systems Journal 5 2 1966 78 101
    • (1966) IBM Systems Journal , vol.5 , Issue.2 , pp. 78-101
    • Belady, L.A.1
  • 4
    • 14944358600 scopus 로고    scopus 로고
    • Compiler generated multithreading to alleviate memory latency
    • Multithreaded Processors and Chip-Multiprocessors
    • K. Beyls, and E. D'Hollander Compiler generated multithreading to alleviate memory latency Multithreaded Processors and Chip-Multiprocessors Journal of Universal Computer Science 6 10 2000 968 993 (special issue)
    • (2000) Journal of Universal Computer Science , vol.6 , Issue.10 , pp. 968-993
    • Beyls, K.1    D'Hollander, E.2
  • 5
    • 4644271073 scopus 로고    scopus 로고
    • Reuse distance as a metric for cache behavior
    • August
    • K. Beyls, E.H. D'Hollander, Reuse distance as a metric for cache behavior, in: Proceedings of PDCS'01, August 2001, pp. 617-662
    • (2001) Proceedings of PDCS'01 , pp. 617-662
    • Beyls, K.1    D'Hollander, E.H.2
  • 6
    • 3042548199 scopus 로고    scopus 로고
    • An analytic model of locality and caching
    • Michigan State University, August
    • M. Brehob, R.J. Enbody, An analytic model of locality and caching, Technical Report MSU-CSE-99-31, Michigan State University, August 1999
    • (1999) Technical Report , vol.MSU-CSE-99-31
    • Brehob, M.1    Enbody, R.J.2
  • 9
    • 0030086672 scopus 로고    scopus 로고
    • Design of the HP PA 7200 CPU
    • Technical information from the laboratories of Hewlett-Packard Company.
    • K.K. Chan, C.C. Hay, J.R. Keller, G.P. Kurpanek, F.X. Schumacher, and J. Zheng Design of the HP PA 7200 CPU Technical information from the laboratories of Hewlett-Packard Company Hewlett-Packard Journal 47 1 1996 25 33
    • (1996) Hewlett-Packard Journal , vol.47 , Issue.1 , pp. 25-33
    • Chan, K.K.1    Hay, C.C.2    Keller, J.R.3    Kurpanek, G.P.4    Schumacher, F.X.5    Zheng, J.6
  • 11
    • 0034832018 scopus 로고    scopus 로고
    • Exact analysis of the cache behavior of nested loops
    • S. Chatterjee, E. Parker, P. Hanlon, A.R. Lebeck, Exact analysis of the cache behavior of nested loops, in: PLDI, 2001, pp. 286-297
    • (2001) PLDI , pp. 286-297
    • Chatterjee, S.1    Parker, E.2    Hanlon, P.3    Lebeck, A.R.4
  • 13
    • 0029717349 scopus 로고    scopus 로고
    • Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: Applications to analyze and transform scientific programs
    • ACM, May
    • P. Clauss, Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs, in: ACM International Conference on Supercomputing, ACM, May 1996, pp. 278-285
    • (1996) ACM International Conference on Supercomputing , pp. 278-285
    • Clauss, P.1
  • 14
  • 15
    • 0032066945 scopus 로고    scopus 로고
    • The Fortran parallel transformer and its programming environment
    • E. D'Hollander, F. Zhang, and Q. Wang The Fortran parallel transformer and its programming environment Journal of Information Science 106 7 1998 293 317
    • (1998) Journal of Information Science , vol.106 , Issue.7 , pp. 293-317
    • D'Hollander, E.1    Zhang, F.2    Wang, Q.3
  • 16
    • 0038716440 scopus 로고    scopus 로고
    • Predicting whole-program locality through reuse distance analysis
    • ACM
    • C. Ding, Y. Zhong, Predicting whole-program locality through reuse distance analysis, in: PLDI'03, ACM, 2003
    • (2003) PLDI'03
    • Ding, C.1    Zhong, Y.2
  • 18
    • 84957027384 scopus 로고    scopus 로고
    • Automatic parallelization in the polytope model
    • P. Feautrier, Automatic parallelization in the polytope model, in: The Data Parallel Programming Model, 1996, pp. 79-103
    • (1996) The Data Parallel Programming Model , pp. 79-103
    • Feautrier, P.1
  • 19
    • 1342264156 scopus 로고    scopus 로고
    • A compiler tool to predict memory hierarchy performance of scientific codes
    • B.B. Fraguela, R. Doallo, J. Touriño, and E.L. Zapata A compiler tool to predict memory hierarchy performance of scientific codes Parallel Computing 30 2004 225 248
    • (2004) Parallel Computing , vol.30 , pp. 225-248
    • Fraguela, B.B.1    Doallo, R.2    Touriño, J.3    Zapata, E.L.4
  • 22
    • 0024903997 scopus 로고
    • Evaluating associativity in CPU caches
    • M.D. Hill, and A.J. Smith Evaluating associativity in CPU caches IEEE Transactions on Computers 38 12 1989 1612 1630
    • (1989) IEEE Transactions on Computers , vol.38 , Issue.12 , pp. 1612-1630
    • Hill, M.D.1    Smith, A.J.2
  • 26
    • 0025429331 scopus 로고
    • Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers
    • May
    • N.P. Jouppi, Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers, in: The 17th ISCA, May 1990, pp. 364-373
    • (1990) The 17th ISCA , pp. 364-373
    • Jouppi, N.P.1
  • 31
    • 0347304618 scopus 로고    scopus 로고
    • Data-centric multi-level blocking
    • I. Kodukula, N. Ahmed, K. Pingali, Data-centric multi-level blocking, in: PLDI, 1997, pp. 346-357
    • (1997) PLDI , pp. 346-357
    • Kodukula, I.1    Ahmed, N.2    Pingali, K.3
  • 33
    • 0026918402 scopus 로고
    • Design and evaluation of a compiler algorithm for prefetching
    • T.C. Mowry, M.S. Lam, and A. Gupta Design and evaluation of a compiler algorithm for prefetching ACM SIGPLAN Notices 27 9 1992 62 73
    • (1992) ACM SIGPLAN Notices , vol.27 , Issue.9 , pp. 62-73
    • Mowry, T.C.1    Lam, M.S.2    Gupta, A.3
  • 34
    • 84898062812 scopus 로고    scopus 로고
    • Open64 compiler, < http://sourceforge.net/projects/open64>
    • Open64 Compiler
  • 35
    • 0029488249 scopus 로고
    • Cache miss heuristics and preloading techniques for general-purpose programs
    • Ann Arbor, Michigan, November 29-December 1, IEEE Computer Society TC-MICRO and ACM SIGMICRO
    • T. Ozawa, Y. Kimura, S. Nishizaki, Cache miss heuristics and preloading techniques for general-purpose programs, in: MICRO'95, Ann Arbor, Michigan, November 29-December 1, 1995, IEEE Computer Society TC-MICRO and ACM SIGMICRO, pp. 243-248
    • (1995) MICRO'95 , pp. 243-248
    • Ozawa, T.1    Kimura, Y.2    Nishizaki, S.3
  • 36
    • 0028132512 scopus 로고
    • Counting solutions to Presburger formulas: How and why
    • W. Pugh Counting solutions to Presburger formulas: how and why ACM SIGPLAN Notices 29 6 1994 121 134
    • (1994) ACM SIGPLAN Notices , vol.29 , Issue.6 , pp. 121-134
    • Pugh, W.1
  • 39
    • 84949840377 scopus 로고    scopus 로고
    • Fast, accurate and flexible data locality analysis
    • IEEE Computer Society Press
    • J. Sánchez, and A. González Fast, accurate and flexible data locality analysis PACT'98, Paris, France, October 12-18, 1998 1998 IEEE Computer Society Press 124 129
    • (1998) PACT'98, Paris, France, October 12-18, 1998 , pp. 124-129
    • Sánchez, J.1    González, A.2
  • 41
    • 0033892359 scopus 로고    scopus 로고
    • EPIC: Explicitly parallel instruction computing
    • M.S. Schlansker, and B.R.R. Cover EPIC: Explicitly parallel instruction computing IEEE Computer 33 2 2000 37 45
    • (2000) IEEE Computer , vol.33 , Issue.2 , pp. 37-45
    • Schlansker, M.S.1    Cover, B.R.R.2
  • 43
    • 84943426389 scopus 로고
    • Skewed-associative caches
    • Springer-Verlag, Proceedings of PARLE'93, Munich, Germany, June 14-17, 1993
    • A. Seznec, and F. Bodin Skewed-associative caches Proceedings of PARLE'93, Munich, Germany, June 14-17, 1993 Lecture Notes in Computer Science 1993 Springer-Verlag 305 316
    • (1993) Lecture Notes in Computer Science , pp. 305-316
    • Seznec, A.1    Bodin, F.2
  • 45
    • 0029508817 scopus 로고
    • A modified approach to data cache management
    • Ann Arbor, Michigan, November 29-December 1, IEEE Computer Society TC-MICRO and ACM SIGMICRO
    • G. Tyson, M. Farrens, J. Matthews, A.R. Pleszkun, A modified approach to data cache management, in: MICRO'95, Ann Arbor, Michigan, November 29-December 1, 1995, IEEE Computer Society TC-MICRO and ACM SIGMICRO, pp. 93-103
    • (1995) MICRO'95 , pp. 93-103
    • Tyson, G.1    Farrens, M.2    Matthews, J.3    Pleszkun, A.R.4
  • 48
    • 14944379009 scopus 로고    scopus 로고
    • The Barvinok library
    • S. Verdoolaege, The Barvinok library, < http://freshmeat.net/projects/ barvinok>
    • Verdoolaege, S.1
  • 50
    • 14944380022 scopus 로고    scopus 로고
    • Using the compiler to improve cache replacement decisions
    • September
    • Z. Wang, K. McKinley, A. Rosenberg, C. Weems, Using the compiler to improve cache replacement decisions, in: PACT'02, September 2002
    • (2002) PACT'02
    • Wang, Z.1    McKinley, K.2    Rosenberg, A.3    Weems, C.4
  • 53
    • 0034581198 scopus 로고    scopus 로고
    • Modified LRU policies for improving second-level cache behavior
    • January 8-12
    • W.A. Wong, J.-L. Baer, Modified LRU policies for improving second-level cache behavior, in: HPCA-6, January 8-12, 2000, pp. 49-60
    • (2000) HPCA-6 , pp. 49-60
    • Wong, W.A.1    Baer, J.-L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.