메뉴 건너뛰기




Volumn 31, Issue 1, 2013, Pages

Efficient reuse distance analysis of multicore scaling for loop-based parallel programs

Author keywords

Cache performance; Chip multiprocessors; Reuse distance

Indexed keywords

APPLICATION PARAMETERS; CACHE PERFORMANCE; CHIP MULTIPROCESSOR; EXISTING PROBLEMS; MULTI-CORE PROCESSOR; PREDICTION ACCURACY; REUSE DISTANCE; SCALING PREDICTION;

EID: 84874865302     PISSN: 07342071     EISSN: 15577333     Source Type: Journal    
DOI: 10.1145/2427631.2427632     Document Type: Article
Times cited : (28)

References (40)
  • 4
    • 56449124998 scopus 로고    scopus 로고
    • PARSEC vs. SPLASH2: A quantitative comparison of two multithreaded benchmark suites on chip-multiprocessors
    • IEEE Computer Society
    • Bienia, C., Kumar, S., and Li, K. 2008a. PARSEC vs. SPLASH2: A quantitative comparison of two multithreaded benchmark suites on chip-multiprocessors. In Proceedings of the IEEE International Symposium on Workload Characterization. IEEE Computer Society, 47-56.
    • (2008) Proceedings of the IEEE International Symposium on Workload Characterization , pp. 47-56
    • Bienia, C.1    Kumar, S.2    Li, K.3
  • 7
    • 21244474546 scopus 로고    scopus 로고
    • Predicting inter-thread cache contention on a chip multi-processor architecture
    • Proceedings - 11th International Symposium on High-Performance Computer Architecture, HPCA-11 2005
    • Chandra, D., Guo, F., Kim, S., and Solihin, Y. 2005. Predicting inter-thread cache contention on a chip multi-processor architecture. In Proceedings of the 11th International Symposium on High-Performance Computer Architecture. IEEE Computer Society, 340-351. (Pubitemid 41731513)
    • (2005) Proceedings - International Symposium on High-Performance Computer Architecture , pp. 340-351
    • Chandra, D.1    Guo, F.2    Kim, S.3    Solihin, Y.4
  • 8
    • 33746683732 scopus 로고    scopus 로고
    • Maximizing CMP throughput with mediocre cores
    • DOI 10.1109/PACT.2005.42, 1515580, 14th International Conference on Parallel Architectures and Compilation Techniques, PACT 2005
    • Davis, J., Laudon, J., and Olukotun, K. 2005. Maximizing CMP throughput with mediocre cores. In Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques. IEEE Computer Society, 51-62. (Pubitemid 44159727)
    • (2005) Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT , vol.2005 , pp. 51-62
    • Davis, J.D.1    Laudon, J.2    Olukotun, K.3
  • 11
    • 67650312346 scopus 로고    scopus 로고
    • A mechanistic performance model for superscalar out of order processors
    • 3:1-3:37
    • Eyerman, S., Eeckhout, L., and Karkhanis, T. 2009. A mechanistic performance model for superscalar out of order processors. ACM Trans. Comput. Syst. 27, 2, 3:1-3:37.
    • (2009) ACM Trans. Comput. Syst. , vol.27 , Issue.2
    • Eyerman, S.1    Eeckhout, L.2    Karkhanis, T.3
  • 20
    • 33744504467 scopus 로고    scopus 로고
    • Power-performance implications of thread-level parallelism on chip multiprocessors
    • DOI 10.1109/ISPASS.2005.1430567, 1430567, ISPASS 2005 - IEEE International Symposium on Performance Analysis of Systems and Software
    • Li, J. and Martinez, J. F. 2005. Power-Performance implications of thread-level parallelism on chip multiprocessors. In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software. IEEE Computer Society, 124-134. (Pubitemid 43804310)
    • (2005) ISPASS 2005 - IEEE International Symposium on Performance Analysis of Systems and Software , vol.2005 , pp. 124-134
    • Li, J.1    Martinez, J.F.2
  • 24
    • 0014701246 scopus 로고
    • Evaluation techniques for storage hierarchies
    • Mattson, R. L., Gecsei, J., Slutz, D. R., and Traiger, I. L. 1970. Evaluation techniques for storage hierarchies. IBM Syst. J. 9, 2, 78-117.
    • (1970) IBM Syst. J. , vol.9 , Issue.2 , pp. 78-117
    • Mattson, R.L.1    Gecsei, J.2    Slutz, D.R.3    Traiger, I.L.4
  • 25
    • 80051967684 scopus 로고    scopus 로고
    • Using pin as a memory reference generator for multiprocessor simulation
    • McCurdy, C. and Fischer, C. 2005. Using pin as a memory reference generator for multiprocessor simulation. SIGARCH Comput. Archit. News 33, 5, 39-44.
    • (2005) SIGARCH Comput. Archit. News , vol.33 , Issue.5 , pp. 39-44
    • McCurdy, C.1    Fischer, C.2
  • 36
    • 27544495466 scopus 로고    scopus 로고
    • Victim replication: Maximizing capacity while hiding wire delay in tiled chip multiprocessors
    • Proceedings - 32nd International Symposium on Computer Architecture, ISCA 2005
    • Zhang, M. and Asanovic, K. 2005. Victim replication: Maximizing capacity while hiding wire delay in tiled chip multiprocessors. In Proceedings of the 32nd International Symposium on Computer Architecture. IEEE Computer Society, 336-345. (Pubitemid 41543452)
    • (2005) Proceedings - International Symposium on Computer Architecture , pp. 336-345
    • Zhang, M.1    Asanovic, K.2
  • 40
    • 70349743894 scopus 로고    scopus 로고
    • Program locality analysis using reuse distance
    • 20:1-20:39
    • Zhong, Y., Shen, X., and Ding, C. 2009. Program locality analysis using reuse distance. ACMTrans. Program. Lang. Syst. 31, 6, 20:1-20:39.
    • (2009) ACMTrans. Program. Lang. Syst. , vol.31 , Issue.6
    • Zhong, Y.1    Shen, X.2    Ding, C.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.