메뉴 건너뛰기




Volumn , Issue , 2010, Pages 133-140

Resource-aware compiler prefetching for many-cores

Author keywords

Optimizing compilers; Parallel architectures

Indexed keywords

CACHE HIERARCHIES; CACHE MISS; COMPILER ALGORITHMS; HARDWARE AND SOFTWARE; MANY-CORE ARCHITECTURE; MEMORY LEVEL PARALLELISMS; OPTIMIZING COMPILERS; OUT-OF-ORDER PROCESSORS; PREFETCHES; PREFETCHING; PREFETCHING ALGORITHM; RESOURCE AWARE; STATE OF THE ART;

EID: 77956435385     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISPDC.2010.16     Document Type: Conference Paper
Times cited : (9)

References (23)
  • 2
    • 77956424086 scopus 로고    scopus 로고
    • Empirical evaluation of multi-socket, multi-core memory concurrency
    • Renaissance Computing Institute January [Online]. Available
    • A. Porterfield, R. Fowler, A. Mandel, and M. Y. Lim, "Empirical evaluation of multi-socket, multi-core memory concurrency," Renaissance Computing Institute, Tech. Rep. RENCI TR-09-01, January 2009. [Online]. Available: http://www.renci.org/publications/techreports/TR-09-01.pdf
    • (2009) Tech. Rep. RENCI TR-09-01
    • Porterfield, A.1    Fowler, R.2    Mandel, A.3    Lim, M.Y.4
  • 7
    • 0026918402 scopus 로고
    • Design and evaluation of a compiler algorithm for prefetching
    • T. C. Mowry, M. S. Lam, and A. Gupta, "Design and evaluation of a compiler algorithm for prefetching," SIGPLAN Not., vol. 27, no. 9, pp. 62-73, 1992.
    • (1992) SIGPLAN Not. , vol.27 , Issue.9 , pp. 62-73
    • Mowry, T.C.1    Lam, M.S.2    Gupta, A.3
  • 8
    • 46449113366 scopus 로고    scopus 로고
    • Layout-accurate design and implementation of a high-throughput interconnection network for single-chip parallel processing
    • A. O. Balkan, M. N. Horak, G. Qu, and U. Vishkin, "Layout-accurate design and implementation of a high-throughput interconnection network for single-chip parallel processing," hoti, pp. 21-28, 2007.
    • (2007) Hoti , pp. 21-28
    • Balkan, A.O.1    Horak, M.N.2    Qu, G.3    Vishkin, U.4
  • 11
    • 52049104934 scopus 로고    scopus 로고
    • A pilot study to compare programming effort for two parallel programming models
    • L. Hochstein, V. R. Basili, U. Vishkin, and J. Gilbert, "A pilot study to compare programming effort for two parallel programming models," Journal of Systems and Software, vol. 81, no. 11, pp. 1920 - 1930, 2008.
    • (2008) Journal of Systems and Software , vol.81 , Issue.11 , pp. 1920-1930
    • Hochstein, L.1    Basili, V.R.2    Vishkin, U.3    Gilbert, J.4
  • 13
    • 0020177251 scopus 로고
    • Cache memories
    • A. J. Smith, "Cache memories," ACM Comput. Surv., vol. 14, no. 3, pp. 473-530, 1982.
    • (1982) ACM Comput. Surv. , vol.14 , Issue.3 , pp. 473-530
    • Smith, A.J.1
  • 17
    • 0034818343 scopus 로고    scopus 로고
    • Reducing dram latencies with an integrated memory hierarchy design
    • W. F. Lin, S. K. Reinhardt, and D. Burger, "Reducing dram latencies with an integrated memory hierarchy design," hpca, vol. 00, p. 0301, 2001.
    • (2001) HPCA , pp. 0301
    • Lin, W.F.1    Reinhardt, S.K.2    Burger, D.3
  • 18
    • 84976656398 scopus 로고
    • Effective cache prefetching on bus-based multiprocessors
    • D. M. Tullsen and S. J. Eggers, "Effective cache prefetching on bus-based multiprocessors," ACM Trans. Comput. Syst., vol. 13, no. 1, pp. 57-88, 1995.
    • (1995) ACM Trans. Comput. Syst. , vol.13 , Issue.1 , pp. 57-88
    • Tullsen, D.M.1    Eggers, S.J.2
  • 19
    • 0031988272 scopus 로고    scopus 로고
    • Tolerating latency in multiprocessors through compiler-inserted prefetching
    • T. C. Mowry, "Tolerating latency in multiprocessors through compiler-inserted prefetching," ACM Trans. Comput. Syst., vol. 16, no. 1, pp. 55-92, 1998.
    • (1998) ACM Trans. Comput. Syst. , vol.16 , Issue.1 , pp. 55-92
    • Mowry, T.C.1
  • 20
    • 0029341212 scopus 로고
    • Sequential hardware prefetching in shared-memory multiprocessors
    • F. Dahlgren, M. Dubois, and P. Stenström, "Sequential hardware prefetching in shared-memory multiprocessors," IEEE Trans. Parallel Distrib. Syst., vol. 6, no. 7, pp. 733-746, 1995.
    • (1995) IEEE Trans. Parallel Distrib. Syst. , vol.6 , Issue.7 , pp. 733-746
    • Dahlgren, F.1    Dubois, M.2    Stenström, P.3
  • 23
    • 33646497615 scopus 로고    scopus 로고
    • Springer-Verlag ch. Improving the Performance of GCC by Exploiting IA-64 Architectural Features
    • C. Yang, X. Yang, and J. Xue, Advances in Computer Systems Archiecture. Springer-Verlag, 2005, vol. 3740/2005, ch. Improving the Performance of GCC by Exploiting IA-64 Architectural Features, pp. 236-251.
    • (2005) Advances in Computer Systems Archiecture , vol.3740 , Issue.2005 , pp. 236-251
    • Yang, C.1    Yang, X.2    Xue, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.