메뉴 건너뛰기




Volumn 31, Issue 6, 2009, Pages

Program locality analysis using reuse distance

Author keywords

Program locality; Reuse distance; Stack distance; Training based analysis

Indexed keywords

ACCESS PATTERNS; ANALYSIS PROBLEMS; BENCHMARK APPLICATIONS; DATA ACCESS; DISTANCE-BASED; EXACT METHODS; MEMORY PERFORMANCE; MISS-RATE; MODERN COMPUTER SYSTEMS; NETWORK TRAFFIC ANALYSIS; ORDERS OF MAGNITUDE; PARAMETERIZED MODEL; PERFORMANCE MODELING; PREDICTION PROCESS; PROGRAM LOCALITY; PROGRAM OPTIMIZATION; QUANTITATIVE ANALYSIS; REUSE DISTANCE; STACK DISTANCE; STATISTICAL PREDICTION; TEST EXECUTION; TRAINING-BASED ANALYSIS; VIRTUAL MEMORY MANAGEMENT;

EID: 70349743894     PISSN: 01640925     EISSN: 15584593     Source Type: Journal    
DOI: 10.1145/1552309.1552310     Document Type: Article
Times cited : (138)

References (105)
  • 12
    • 14944380098 scopus 로고    scopus 로고
    • Reuse distance-based cache hint selection. in Proceedings of the 8th International Euro-Par Conference. Paderborn, Germany. Beyls,K. and D'Hollander, E. 2005. Generating cache hints for improved program efficiency
    • Beyls,K. and D'Hollander, E. 2002. Reuse distance-based cache hint selection. In Proceedings of the 8th International Euro-Par Conference. Paderborn, Germany. Beyls,K. and D'Hollander, E. 2005. Generating cache hints for improved program efficiency. J. Syst. Archit. 51, 4, 223-250.
    • (2002) J. Syst. Archit. , vol.51 , Issue.4 , pp. 223-250
    • Beyls, K.1    D'Hollander, E.2
  • 13
    • 33750304084 scopus 로고    scopus 로고
    • Discovery of locality-improving refactoring by reuse path analysis
    • Springer. Lecture Notes in Computer Science, vol
    • Beyls,K. and D'Hollander, E. 2006a. Discovery of locality-improving refactoring by reuse path analysis. In Proceedings of the High-Performance Computing and Communications Council. Springer. Lecture Notes in Computer Science, vol.4208. 220-229.
    • (2006) Proceedings of the High-Performance Computing and Communications Council , vol.4208 , pp. 220-229
    • Beyls, K.1    D'Hollander, E.2
  • 15
    • 0021477751 scopus 로고
    • Measurement of locality and the behaviour of programs
    • Bunt,R.B. and Murphy, J. M. 1984. Measurement of locality and the behaviour of programs. Comput. J. 27, 3, 238-245.
    • (1984) Comput. J. , vol.27 , Issue.3 , pp. 238-245
    • Bunt, R.B.1    Murphy, J.M.2
  • 18
    • 0000579037 scopus 로고
    • Analysis of interprocedural side effects in a parallel programming environment
    • Callahan, D., Cocke,J., and Kennedy, K. 1988a. Analysis of interprocedural side effects in a parallel programming environment. J. Paral. Distrib. Comput. 5, 5, 517-550.
    • (1988) J. Paral. Distrib. Comput. , vol.5 , Issue.5 , pp. 517-550
    • Callahan, D.1    Cocke, J.2    Kennedy, K.3
  • 19
    • 0000493064 scopus 로고
    • Estimating interlock and improving balance for pipelined machines
    • Callahan, D., Cocke,J., and Kennedy, K. 1988b. Estimating interlock and improving balance for pipelined machines. J. Paral. Distrib. Comput. 5, 4, 334-358.
    • (1988) J. Paral. Distrib. Comput. , vol.5 , Issue.4 , pp. 334-358
    • Callahan, D.1    Cocke, J.2    Kennedy, K.3
  • 20
    • 0028549474 scopus 로고
    • Improving the ratio of memory operations to floating-point operations in loops
    • Carr,S. and Kennedy, K. 1994. Improving the ratio of memory operations to floating-point operations in loops. ACM Trans. Program. Lang. Syst. 16, 6, 1768-1810.
    • (1994) ACM Trans. Program. Lang. Syst. , vol.16 , Issue.6 , pp. 1768-1810
    • Carr, S.1    Kennedy, K.2
  • 25
    • 57349121456 scopus 로고    scopus 로고
    • Measuring Temporal Locality Variation Across Program Inputs
    • Department of Computer Science, University of Rochester
    • Cheng,R. and Ding, C. 2005. Measuring temporal locality variation across program inputs. Tech. rep. TR 875, Department of Computer Science, University of Rochester.
    • (2005) Tech. Rep. TR 875
    • Cheng, R.1    Ding, C.2
  • 31
    • 0038220597 scopus 로고
    • Prof Itability Computations on Program Flow Graphs
    • IBM
    • Cocke,J. and Kennedy, K. 1974. Prof itability computations on program flow graphs. Tech. rep. RC 5123, IBM.
    • (1974) Tech. Rep. RC 5123
    • Cocke, J.1    Kennedy, K.2
  • 32
    • 0001483604 scopus 로고
    • Communication optimizations for irregular scientific computations on distributed memory architectures
    • Das, R., Uysal, M., Saltz,J., and Hwang, Y.-S. 1994. Communication optimizations for irregular scientific computations on distributed memory architectures. J. Paral. Distrib. Comput. 22,3, 462-479.
    • (1994) J. Paral. Distrib. Comput , vol.22 , Issue.3 , pp. 462-479
    • Das, R.1    Uysal, M.2    Saltz, J.3    Hwang, Y.-S.4
  • 33
    • 0036767270 scopus 로고    scopus 로고
    • Maintaining stream statistics over sliding windows
    • Datar, M., Gionis, A., Indyk,P., and Motwani, R. 2002. Maintaining stream statistics over sliding windows. SIAM J. Comput. 31, 6, 1794-1813.
    • (2002) SIAM J. Comput , vol.31 , Issue.6 , pp. 1794-1813
    • Datar, M.1    Gionis, A.2    Indyk, P.3    Motwani, R.4
  • 36
    • 1642502420 scopus 로고    scopus 로고
    • Improving effective bandwidth through compiler enhancement of global cache reuse
    • Ding,C. and Kennedy, K. 2004. Improving effective bandwidth through compiler enhancement of global cache reuse. J. Paral. Distrib. Comput. 64, 1, 108-134.
    • (2004) J. Paral. Distrib. Comput , vol.64 , Issue.1 , pp. 108-134
    • Ding, C.1    Kennedy, K.2
  • 42
    • 0001714824 scopus 로고    scopus 로고
    • Cache miss equations: A compiler framework for analyzing and tuning memory behavior
    • Ghosh, S., Martonosi, M., and Malik, S. 1999. Cache miss equations: A compiler framework for analyzing and tuning memory behavior. ACM Trans. Program. Lang. Syst. 21,4.
    • (1999) ACM Trans. Program. Lang. Syst. , vol.21 , pp. 4
    • Ghosh, S.1    Martonosi, M.2    Malik, S.3
  • 44
    • 33745715056 scopus 로고    scopus 로고
    • Exploiting locality for irregular scientific codes
    • Han,H. and Tseng, C.-W. 2006. Exploiting locality for irregular scientific codes. IEEE Trans. Paral. Distrib. Syst. 17, 7, 606-618.
    • (2006) IEEE Trans. Paral. Distrib. Syst. , vol.17 , Issue.7 , pp. 606-618
    • Han, H.1    Tseng, C.-W.2
  • 45
    • 0026186967 scopus 로고
    • An implementation of interprocedural bounded regular section analysis
    • Havlak,P. and Kennedy, K. 1991. An implementation of interprocedural bounded regular section analysis. IEEE Trans. Paral. Distrib. Syst. 2, 3, 350-360.
    • (1991) IEEE Trans. Paral. Distrib. Syst. , vol.2 , Issue.3 , pp. 350-360
    • Havlak, P.1    Kennedy, K.2
  • 46
    • 0024903997 scopus 로고
    • Evaluating associativity in CPU caches
    • Hill,M.D. and Smith, A. J. 1989. Evaluating associativity in CPU caches. IEEE Trans. Com- put. 38, 12, 1612-1630.
    • (1989) IEEE Trans. Com- Put , vol.38 , Issue.12 , pp. 1612-1630
    • Hill, M.D.1    Smith, A.J.2
  • 50
    • 15844367700 scopus 로고    scopus 로고
    • Improving whole-program locality using intra-procedural and interprocedural transformations
    • Kandemir, M. T. 2005. Improving whole-program locality using intra-procedural and interprocedural transformations. J Paral. Distrib. Comput. 65, 5, 564-582.
    • (2005) J. Paral. Distrib. Comput. , vol.65 , Issue.5 , pp. 564-582
    • Kandemir, M.T.1
  • 51
    • 36348975549 scopus 로고    scopus 로고
    • Inducing models of black-box storage arrays
    • HP Laboratories Palo Alto, CA
    • Kelly, T., Cohen, I., Goldszmidt, M., and Keeton, K. 2004. Inducing models of black-box storage arrays. Tech. rep. HPL-2004-2108, HP Laboratories Palo Alto, CA.
    • (2004) Tech. Rep. HPL , pp. 2004-2108
    • Kelly, T.1    Cohen, I.2    Goldszmidt, M.3    Keeton, K.4
  • 52
    • 0003904906 scopus 로고    scopus 로고
    • The Omega Library Interface Guide
    • Department of Computer Science, University of Maryland, College Park
    • Kelly, W., Maslov, V., Pugh, W., Rosser, E., Shpeisman,T., and Wonnacott, D. 1996. The Omega Library Interface Guide. Tech. rep., Department of Computer Science, University of Maryland, College Park.
    • (1996) Tech. Rep.
    • Kelly, W.1    Maslov, V.2    Pugh, W.3    Rosser, E.4    Shpeisman, T.5    Wonnacott, D.6
  • 55
    • 85008031236 scopus 로고    scopus 로고
    • MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research
    • KleinOsowski,A. and Lilja, D. J. 2002. MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research. Comput. Archit. Lett. 1.
    • (2002) Comput. Archit. Lett. , vol.1
    • Kleinosowski, A.1    Lilja, D.J.2
  • 57
    • 84983965442 scopus 로고
    • An empirical study of FORTRAN programs
    • Knuth, D. 1971. An empirical study of FORTRAN programs. Sof tw. Pract. Exper. 1, 105-133.
    • (1971) Sof Tw. Pract. Exper. , vol.1 , pp. 105-133
    • Knuth, D.1
  • 59
    • 0025229934 scopus 로고
    • An efficient data dependence analysis for parallelizing compilers
    • Li, Z., Yew, P., and Zhu, C. 1990. An efficient data dependence analysis for parallelizing compilers. IEEE Trans. Paral. Distrib. Syst. 1, 1, 26-34.
    • (1990) IEEE Trans. Paral. Distrib. Syst. , vol.1 , Issue.1 , pp. 26-34
    • Li, Z.1    Yew, P.2    Zhu, C.3
  • 60
    • 2942729643 scopus 로고    scopus 로고
    • Design and implementation of a lightweight dynamic optimization system
    • Liu, J., Chen, H., Yew, P.-C., and Hsu, W.-C. 2004. Design and implementation of a lightweight dynamic optimization system. J Instruct.-Level Paral. 6.
    • (2004) J. Instruct.-Level Paral , vol.6
    • Liu, J.1    Chen, H.2    Yew, P.-C.3    Hsu, W.-C.4
  • 63
    • 0014701246 scopus 로고
    • Evaluation techniques for storage hierarchies
    • Mattson, R. L., Gecsei, J., Slutz,D., and Traiger, I. L. 1970. Evaluation techniques for storage hierarchies. IBMSyst. J. 9, 2, 78-117.
    • (1970) IBM Syst. J. , vol.9 , Issue.2 , pp. 78-117
    • Mattson, R.L.1    Gecsei, J.2    Slutz, D.3    Traiger, I.L.4
  • 65
    • 34249654094 scopus 로고    scopus 로고
    • Improving memory hierarchy performance for irregular applications
    • Mellor-Crummey, J., Whalley,D., and Kennedy, K. 2001. Improving memory hierarchy performance for irregular applications. Int. J. Paral. Program. 29,3.
    • (2001) Int. J. Paral. Program , vol.29 , pp. 3
    • Mellor-Crummey, J.1    Whalley, D.2    Kennedy, K.3
  • 66
    • 0006946256 scopus 로고
    • Efficient Methods for Calculating the Success Function of Fixed Space Replacement Policies
    • Lawrence Berkeley Laboratory
    • Olken, F. 1981. Efficient methods for calculating the success function of fixed space replacement policies. Tech. rep. LBL-12370, Lawrence Berkeley Laboratory.
    • (1981) Tech. Rep. LBL-12370
    • Olken, F.1
  • 77
    • 34249677927 scopus 로고    scopus 로고
    • Predicting locality phases for dynamic memory optimization
    • Shen, X., Zhong,Y., and Ding, C. 2007. Predicting locality phases for dynamic memory optimization. J. Paral. Distrib. Comput. 67, 7, 783-796.
    • (2007) J. Paral. Distrib. Comput. , vol.67 , Issue.7 , pp. 783-796
    • Shen, X.1    Zhong, Y.2    Ding, C.3
  • 78
    • 0022093855 scopus 로고
    • Self adjusting binary search trees
    • Sleator,D.D. and Tarjan, R. E. 1985. Self adjusting binary search trees. J. ACM 32,3.
    • (1985) J. ACM , vol.32 , pp. 3
    • Sleator, D.D.1    Tarjan, R.E.2
  • 79
    • 0037683142 scopus 로고    scopus 로고
    • The EELRU adaptive replacement algorithm
    • Smaragdakis, Y., Kaplan,S., and Wilson, P. 2003. The EELRU adaptive replacement algorithm. Perform. Eval. 53, 2, 93-123.
    • (2003) Perform. Eval , vol.53 , Issue.2 , pp. 93-123
    • Smaragdakis, Y.1    Kaplan, S.2    Wilson, P.3
  • 85
    • 0013009642 scopus 로고
    • Multi-configuration simulation algorithms for the evaluation of computer architecture designs
    • University of Michigan
    • Sugumar,R.A. and Abraham, S. G. 1993. Multi-configuration simulation algorithms for the evaluation of computer architecture designs. Tech. rep., University of Michigan.
    • (1993) Tech. Rep.
    • Sugumar, R.A.1    Abraham, S.G.2
  • 87
    • 0037882891 scopus 로고
    • Ph.D. thesis, Department of Computer Science, Rice University
    • Thabit, K. O. 1981. Cache management by the compiler. Ph.D. thesis, Department of Computer Science, Rice University.
    • (1981) Cache Management by the Compiler
    • Thabit, K.O.1
  • 88
    • 0024607881 scopus 로고
    • Efficient (stack) algorithms for analysis of write-back and sector memories
    • Thompson,J.G. and Smith, A. J. 1989. Efficient (stack) algorithms for analysis of write-back and sector memories. ACM Trans. Comput. Syst. 7, 1, 78-117.
    • (1989) ACM Trans. Comput. Syst. , vol.7 , Issue.1 , pp. 78-117
    • Thompson, J.G.1    Smith, A.J.2
  • 91
    • 84976706468 scopus 로고
    • Efficient trace-driven simulation methods for cache performance analysis
    • Wang,W. and Baer, J.-L. 1991. Efficient trace-driven simulation methods for cache performance analysis. ACM Trans. Comput. Syst. 9,3.
    • (1991) ACM Trans. Comput. Syst. , vol.9 , pp. 3
    • Wang, W.1    Baer, J.-L.2
  • 94
    • 1542392248 scopus 로고    scopus 로고
    • Achieving scalable locality with time skewing
    • Wonnacott, D. 2002. Achieving scalable locality with time skewing. Int. J. Paral. Program. 30,3.
    • (2002) Int. J. Paral. Program , vol.30 , pp. 3
    • Wonnacott, D.1
  • 95
    • 3042664555 scopus 로고    scopus 로고
    • Efficient and accurate analytical modeling of whole-program data cache behavior
    • Xue, J. and Vera, X. 2004. Efficient and accurate analytical modeling of whole-program data cache behavior. IEEE Trans. Comput. 53,5.
    • (2004) IEEE Trans. Comput , vol.53 , pp. 5
    • Xue, J.1    Vera, X.2
  • 102
    • 33947360666 scopus 로고    scopus 로고
    • Miss rate prediction across program inputs and cache configurations
    • Zhong, Y., Dropsho, S. G., Shen, X., Studer, A., and Ding, C. 2007. Miss rate prediction across program inputs and cache configurations. IEEE Trans. Comput. 56, 3, 328-343.
    • (2007) IEEE Trans. Comput. , vol.56 , Issue.3 , pp. 328-343
    • Zhong, Y.1    Dropsho, S.G.2    Shen, X.3    Studer, A.4    Ding, C.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.