메뉴 건너뛰기




Volumn 53, Issue 5, 2004, Pages 547-566

Efficient and accurate analytical modeling of whole-program data cache behavior

Author keywords

Analytical modeling; Cache memories; Data locality; Modeling techniques; Performance evaluation

Indexed keywords

COMPUTER HARDWARE; COMPUTER SIMULATION; HEURISTIC METHODS; OPTIMIZATION; PROGRAM COMPILERS; SAMPLING;

EID: 3042664555     PISSN: 00189340     EISSN: None     Source Type: Journal    
DOI: 10.1109/TC.2004.1275296     Document Type: Article
Times cited : (23)

References (60)
  • 9
    • 0029717349 scopus 로고    scopus 로고
    • Counting solutions to linear and non-linear constraints through Ehrhart polynomials
    • P. Clauss, "Counting Solutions to Linear and Non-Linear Constraints through Ehrhart Polynomials," Proc. ACM Int'l Conf. Supercomputing (ICS '96), pp. 278-285, 1996.
    • (1996) Proc. ACM Int'l Conf. Supercomputing (ICS '96) , pp. 278-285
    • Clauss, P.1
  • 12
    • 0004007719 scopus 로고    scopus 로고
    • Improving effective bandwidth through compiler enhancement of global dynamic cache reuse
    • PhD thesis, Rice Univ.
    • C. Ding, "Improving Effective Bandwidth through Compiler Enhancement of Global Dynamic Cache Reuse," PhD thesis, Rice Univ., 2000.
    • (2000)
    • Ding, C.1
  • 15
    • 0001023389 scopus 로고
    • Parametric integer programming
    • P. Feautrier, "Parametric Integer Programming," Operations Research, vol. 22, pp. 243-268, 1988.
    • (1988) Operations Research , vol.22 , pp. 243-268
    • Feautrier, P.1
  • 16
    • 84957027384 scopus 로고    scopus 로고
    • Automatic parallelization in the polytope model
    • G.R. Perrin and A. Darte, eds.; Springer Verlag
    • P. Feautrier, "Automatic Parallelization in the Polytope Model," The Data Parallel Programming Model, G.R. Perrin and A. Darte, eds., pp. 79-103, Springer Verlag, 1996.
    • (1996) The Data Parallel Programming Model , pp. 79-103
    • Feautrier, P.1
  • 19
    • 0032089580 scopus 로고    scopus 로고
    • Modeling set associative caches behavior for irregular computations
    • June
    • B.B. Fraguela, R. Doallo, and E.L. Zapata, "Modeling Set Associative Caches Behavior for Irregular Computations," ACM Performance Evaluation Rev., vol. 26, no. 1, pp. 192-201, June 1998.
    • (1998) ACM Performance Evaluation Rev. , vol.26 , Issue.1 , pp. 192-201
    • Fraguela, B.B.1    Doallo, R.2    Zapata, E.L.3
  • 21
    • 0001366267 scopus 로고
    • Strategies for cache and local memory management by global program transformations
    • D. Gannon, W. Jalby, and K. Gallivan, "Strategies for Cache and Local Memory Management by Global Program Transformations," J. Parallel and Distributed Computing, vol. 5, pp. 587-616, 1988.
    • (1988) J. Parallel and Distributed Computing , vol.5 , pp. 587-616
    • Gannon, D.1    Jalby, W.2    Gallivan, K.3
  • 22
    • 0001714824 scopus 로고    scopus 로고
    • Cache miss equations: A compiler framework for analyzing and tuning memory behavior
    • S. Ghosh, M. Martonosi, and S. Malik, "Cache Miss Equations: A Compiler Framework for Analyzing and Tuning Memory Behavior," ACM Trans. Programming Languages and Systems, vol. 21, pp. 4, pp. 703-746, 1999.
    • (1999) ACM Trans. Programming Languages and Systems , vol.21 , Issue.4 , pp. 703-746
    • Ghosh, S.1    Martonosi, M.2    Malik, S.3
  • 23
    • 0005329615 scopus 로고    scopus 로고
    • Procedure placement using temporal-ordering information
    • N. Gloy and M.D. Smith, "Procedure Placement Using Temporal-Ordering Information," ACM Trans. Programming Languages and Systems, vol. 21, no. 5, pp. 1028-1075, 1999.
    • (1999) ACM Trans. Programming Languages and Systems , vol.21 , Issue.5 , pp. 1028-1075
    • Gloy, N.1    Smith, M.D.2
  • 25
    • 0033204190 scopus 로고    scopus 로고
    • Analytical modeling of set-associative caches
    • Oct.
    • J.S. Harper, D.J. Kerbyson, and G.R. Nudd, "Analytical Modeling of Set-Associative Caches," IEEE Trans. Computers, vol. 48, no. 10, pp. 1009-1024, Oct. 1999.
    • (1999) IEEE Trans. Computers , vol.48 , Issue.10 , pp. 1009-1024
    • Harper, J.S.1    Kerbyson, D.J.2    Nudd, G.R.3
  • 27
    • 12344315233 scopus 로고    scopus 로고
    • DineroIII: A uniprocessor cache simulator
    • M. Hill, "DineroIII: A Uniprocessor Cache Simulator," http://www.cs.wisc.edu/~larus/warts.html, 2004.
    • (2004)
    • Hill, M.1
  • 30
    • 84976736383 scopus 로고
    • Page placement algorithms for large real-index caches
    • R.E. Kessler and M.D. Hill, "Page Placement Algorithms for Large Real-Index Caches," ACM Trans. Computer Systems, vol. 10, no. 4, pp. 338-359, 1992.
    • (1992) ACM Trans. Computer Systems , vol.10 , Issue.4 , pp. 338-359
    • Kessler, R.E.1    Hill, M.D.2
  • 35
    • 3042676705 scopus 로고    scopus 로고
    • Solving systems of affine (In)equalities: PIP's user's guide
    • The PIP System, "Solving Systems of Affine (In)Equalities: PIP's User's Guide," http://www.prism.uvsq.fr/~paf, 2002.
    • (2002)
  • 36
    • 3042532547 scopus 로고    scopus 로고
    • SUIF: An infrastructure for research on parallelizing and optimizing compilers
    • The SUIF Compiler Group, "SUIF: An Infrastructure for Research on Parallelizing and Optimizing Compilers," http://suif.stanford.edu, 2004.
    • (2004)
  • 39
    • 0003665539 scopus 로고    scopus 로고
    • Quantifying loop nest locality using SPEC '95 and the perfect benchmarks
    • Sept.
    • K.S. McKinley and O. Temam, "Quantifying Loop Nest Locality Using SPEC '95 and the Perfect Benchmarks," ACM Trans. Computer Systems, vol. 17, no. 4, pp. 288-336, Sept. 1999.
    • (1999) ACM Trans. Computer Systems , vol.17 , Issue.4 , pp. 288-336
    • McKinley, K.S.1    Temam, O.2
  • 40
    • 1542601822 scopus 로고    scopus 로고
    • Improving memory hierarchy performance for irregular applications using data and computation reorderings
    • J.M. Mellor-Crummey, D.B. Whalley, and K. Kennedy, "Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings," Int'l J. Parallel Programming, vol. 29, no. 3, pp. 217-247, 2001.
    • (2001) Int'l J. Parallel Programming , vol.29 , Issue.3 , pp. 217-247
    • Mellor-Crummey, J.M.1    Whalley, D.B.2    Kennedy, K.3
  • 41
    • 0003690936 scopus 로고
    • Software methods for improvements of cache performance on supercomputer applications
    • PhD thesis, Dept. of Computer Science, Rice Univ., May
    • A.K. Porterfield, "Software Methods for Improvements of Cache Performance on Supercomputer Applications," PhD thesis, Dept. of Computer Science, Rice Univ., May 1989.
    • (1989)
    • Porterfield, A.K.1
  • 42
    • 84976676720 scopus 로고
    • The omega test: A fast and practical integer programming algorithm for dependence analysis
    • Aug.
    • W. Pugh, "The Omega Test: A Fast and Practical Integer Programming Algorithm for Dependence Analysis," Comm. ACM, vol. 35, no. 8, pp. 102-114, Aug. 1992.
    • (1992) Comm. ACM , vol.35 , Issue.8 , pp. 102-114
    • Pugh, W.1
  • 48
    • 0027764718 scopus 로고
    • To copy or not to copy: A compile-time technique for accessing when data copying should be used to eliminate cache conflicts
    • O. Temam, E. Granston, and W. Jalby, "To Copy or Not to Copy: A Compile-Time Technique for Accessing when Data Copying Should Be Used to Eliminate Cache Conflicts," Proc. Supercomputing '93, pp. 410-419, 1993.
    • (1993) Proc. Supercomputing '93 , pp. 410-419
    • Temam, O.1    Granston, E.2    Jalby, W.3
  • 49
    • 85031661900 scopus 로고
    • Characterizing the behavior of sparse algorithms on caches
    • O. Temam and W. Jalby, "Characterizing the Behavior of Sparse Algorithms on Caches," Proc. Supercomputing '92, pp. 578-587, 1992.
    • (1992) Proc. Supercomputing '92 , pp. 578-587
    • Temam, O.1    Jalby, W.2
  • 50
    • 0032304622 scopus 로고    scopus 로고
    • Optimizing the instruction cache performance of the operating system
    • J. Torrellas, C. Xia, and R.L. Daigle, "Optimizing the Instruction Cache Performance of the Operating System," IEEE Trans. Computers, vol. 47, no. 12, pp. 1363-1381, 1998.
    • (1998) IEEE Trans. Computers , vol.47 , Issue.12 , pp. 1363-1381
    • Torrellas, J.1    Xia, C.2    Daigle, R.L.3
  • 51
    • 0031153459 scopus 로고    scopus 로고
    • Trace-driven memory simulation: A survey
    • Sept.
    • R.A. Uhlig, and T.N. Mudge, "Trace-Driven Memory Simulation: A Survey," ACM Computing Surveys, vol. 29, no. 3, pp. 128-170, Sept. 1997.
    • (1997) ACM Computing Surveys , vol.29 , Issue.3 , pp. 128-170
    • Uhlig, R.A.1    Mudge, T.N.2
  • 56
    • 0004005802 scopus 로고
    • A library for doing polyhedral operations
    • Technical Report 785, Oregon State Univ.
    • D. Wilde, "A Library for Doing Polyhedral Operations," Technical Report 785, Oregon State Univ., 1993.
    • (1993)
    • Wilde, D.1
  • 58
    • 0031079360 scopus 로고    scopus 로고
    • Unimodular transformations of non-perfectly nested loops
    • J. Xue, "Unimodular Transformations of Non-Perfectly Nested Loops," Parallel Computing, vol. 22, no. 12, pp. 1621-1645, 1997.
    • (1997) Parallel Computing , vol.22 , Issue.12 , pp. 1621-1645
    • Xue, J.1
  • 60
    • 0032315190 scopus 로고    scopus 로고
    • Reuse-driven tiling for improving data locality
    • J. Xue and C.-H. Huang, "Reuse-Driven Tiling for Improving Data Locality," Int'l J. Parallel Programming, vol. 26, no. 6, pp. 671-696, 1998.
    • (1998) Int'l J. Parallel Programming , vol.26 , Issue.6 , pp. 671-696
    • Xue, J.1    Huang, C.-H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.