Volume 26, Issue 2, 2004, Pages 263-300

A fast and accurate framework to analyze and optimize cache memory behavior

Author keywords

Cache memories; Optimization; Sampling

Indexed keywords

ALGORITHMS; COMPUTATIONAL COMPLEXITY; MATHEMATICAL TECHNIQUES; PROBABILITY DISTRIBUTIONS; PROGRAM COMPILERS; SAMPLING

EID: 1842635044     PISSN: 01640925     EISSN: None     Source Type: Journal    
DOI: 10.1145/973097.973099     Document Type: Article
Times cited: 28

References (55)
  • 4
    • 1842822179
    • A uniform internal representation for high-level and instruction level transformations
    • Tech rep. UPC-DAC-95-02. Universitat Politècnica de Catalunya, Barcelona, Spain
    • Ayguadé, E. et al. 1995. A uniform internal representation for high-level and instruction level transformations. Tech rep. UPC-DAC-95-02. Universitat Politècnica de Catalunya, Barcelona, Spain.
    • (1995)
    • Ayguadé, E.1
  • 15
    • 84957027384
    • Automatic parallelization in the polytope model
    • G. R. Perrin and A. Darte, Eds. Lecture Notes in Computer Science; Springer-Verlag, Berlin, Germany
    • Feautrier, P. 1996. Automatic parallelization in the polytope model. In The Data Parallel Programming Model, G. R. Perrin and A. Darte, Eds. Lecture Notes in Computer Science, vol. 1132. Springer-Verlag, Berlin, Germany, 79-103.
    • (1996) The Data Parallel Programming Model , vol.1132 , pp. 79-103
    • Feautrier, P.1
  • 17
    • 0001366267
    • Strategies for cache and local memory management by global program transformations
    • Gannon, D., Jalby, W., and Gallivan, K. 1988. Strategies for cache and local memory management by global program transformations. J. Parallel. Distrib. Comput. 5, 587-616.
    • (1988) J. Parallel. Distrib. Comput. , vol.5 , pp. 587-616
    • Gannon, D.1    Jalby, W.2    Gallivan, K.3
  • 18
    • 0027640963
    • Cache performance of the spec92 benchmark suite
    • (Aug.)
    • Gee, J., Hill, M., Pnevmatikatos, D., and Smith, A. 1993. Cache performance of the spec92 benchmark suite. IEEE Micro 13, 4 (Aug.), 17-27.
    • (1993) IEEE Micro , vol.13 , Issue.4 , pp. 17-27
    • Gee, J.1    Hill, M.2    Pnevmatikatos, D.3    Smith, A.4
  • 19
    • 0345857248
    • Compiler analysis framework for tuning memory behavior
    • Ph.D. dissertation. Princeton University, Princeton, NJ
    • Ghosh, S. 1999. Compiler analysis framework for tuning memory behavior. Ph.D. dissertation. Princeton University, Princeton, NJ.
    • (1999)
    • Ghosh, S.1
  • 21
    • 0001714824
    • Cache miss equations: A compiler framework for analyzing and tuning memory behavior
    • Ghosh, S., Martonosi, M., and Malik, S. 1999. Cache miss equations: A compiler framework for analyzing and tuning memory behavior. ACM Trans. Programm. Lang. Syst. 21, 4, 703-746.
    • (1999) ACM Trans. Programm. Lang. Syst. , vol.21 , Issue.4 , pp. 703-746
    • Ghosh, S.1    Martonosi, M.2    Malik, S.3
  • 23
    • 0026284572
    • Performance debugging shared memory multiprocessor programs with mtool
    • Goldberg, A. and Hennessy, J. 1991. Performance debugging shared memory multiprocessor programs with mtool. In Proceedings of Supercomputing (SC'91). 481-490.
    • (1991) Proceedings of Supercomputing (SC'91) , pp. 481-490
    • Goldberg, A.1    Hennessy, J.2
  • 27
    • 0033077834
    • A linear algebra framework for automatic determination of optimal data layouts
    • (Feb.)
    • Kandemir, M., Choudhary, A., Banerjee, P., and Ramanujam, J. 1999. A linear algebra framework for automatic determination of optimal data layouts. IEEE Trans. Parallel Distrib. Syst. 10, 2 (Feb.), 115-135.
    • (1999) IEEE Trans. Parallel Distrib. Syst. , vol.10 , Issue.2 , pp. 115-135
    • Kandemir, M.1    Choudhary, A.2    Banerjee, P.3    Ramanujam, J.4
  • 31
    • 0028517833
    • Cache profiling and the spec benchmarks: A case study
    • (Oct.)
    • Lebeck, A. and Wood, D. 1994. Cache profiling and the spec benchmarks: A case study. IEEE Comput. 27, 10 (Oct.) 15-26.
    • (1994) IEEE Comput. , vol.27 , Issue.10 , pp. 15-26
    • Lebeck, A.1    Wood, D.2
  • 39
    • 1842721363
    • Padua, D. et al. 1994. Polaris Developer's Document. Available online at http://polaris.uiuc.edu/polaris/polaris-developer/polaris-developer.html
    • (1994)
    • Padua, D.1
  • 46
    • 0013009642
    • Multi-configuration simulation algorithms for the evaluation of computer designs
    • Ph.D. thesis, University of Michigan
    • Sugumar, R. 1993. Multi-configuration simulation algorithms for the evaluation of computer designs. Ph.D. thesis, University of Michigan.
    • (1993)
    • Sugumar, R.1
  • 48
    • 0027764718
    • To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
    • Temam, O., Granston, E., and Jalby, W. 1993. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proceedings of Supercomputing (SC'93). 410-419.
    • (1993) Proceedings of Supercomputing (SC'93) , pp. 410-419
    • Temam, O.1    Granston, E.2    Jalby, W.3
  • 49
    • 0031153459
    • Trace-driven memory simulation: A survey
    • (Sept.)
    • Uhlig, R. A. and Mudge, T. N. 1997. Trace-driven memory simulation: A survey. ACM Comput. Surv. 29, 3 (Sept.), 128-170.
    • (1997) ACM Comput. Surv. , vol.29 , Issue.3 , pp. 128-170
    • Uhlig, R.A.1    Mudge, T.N.2
  • 53
    • 0004005802
    • A library for doing polyhedral operations
    • Tech. rep. 785, Oregon State University
    • Wilde, D. 1993. A library for doing polyhedral operations. Tech. rep. 785, Oregon State University.
    • (1993)
    • Wilde, D.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.