메뉴 건너뛰기




Volumn , Issue , 2013, Pages 1575-1586

Compiler-based data prefetching and streaming non-temporal store generation for the intel(R) xeon phi(TM) coprocessor

Author keywords

compiler; coprocessor; Intel Xeon Phi; non temporal store; performance; prefetching

Indexed keywords

PROGRAM COMPILERS;

EID: 84899707922     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPSW.2013.231     Document Type: Conference Paper
Times cited : (36)

References (28)
  • 5
    • 84899731921 scopus 로고    scopus 로고
    • Intel Corporation
    • Intel Corporation, "Intel-R Composer XE 2013," 2013, http://software.intel.com/en-us/intel-composer-xe/.
    • (2013) Intel-R Composer XE 2013
  • 17
    • 0035510681 scopus 로고    scopus 로고
    • Designing a modern memory hierarchy with hardware prefetching
    • nov
    • W.-F. Lin, S. Reinhardt, and D. Burger, "Designing a modern memory hierarchy with hardware prefetching," Computers, IEEE Transactions on, vol. 50, no. 11, pp. 1202-1218, nov 2001.
    • (2001) Computers, IEEE Transactions on , vol.50 , Issue.11 , pp. 1202-1218
    • Lin, W.-F.1    Reinhardt, S.2    Burger, D.3
  • 19
    • 0001589803 scopus 로고    scopus 로고
    • Data prefetch mechanisms
    • Jun
    • S. P. Vanderwiel and D. J. Lilja, "Data prefetch mechanisms," ACM Comput. Surv., vol. 32, no. 2, pp. 174-199, Jun. 2000.
    • (2000) ACM Comput. Surv. , vol.32 , Issue.2 , pp. 174-199
    • Vanderwiel, S.P.1    Lilja, D.J.2
  • 21
    • 54249085014 scopus 로고    scopus 로고
    • A case study on compiler optimizations for the intel core 2 duo processor
    • Dec
    • A. J. C. Bik, D. L. Kreitzer, and X. Tian, "A case study on compiler optimizations for the intel core 2 duo processor," Int. J. Parallel Program., vol. 36, no. 6, pp. 571-591, Dec. 2008.
    • (2008) Int. J. Parallel Program. , vol.36 , Issue.6 , pp. 571-591
    • Bik, A.J.C.1    Kreitzer, D.L.2    Tian, X.3
  • 23
    • 84859463353 scopus 로고    scopus 로고
    • When prefetching works, when it doesn't, and why
    • Mar.
    • J. Lee, H. Kim, and R. Vuduc, "When prefetching works, when it doesn?t, and why," ACM Trans. Archit. Code Optim., vol. 9, no. 1, pp. 2:1-2:29, Mar. 2012.
    • (2012) ACM Trans. Archit. Code Optim. , vol.9 , Issue.1 , pp. 21-229
    • Lee, J.1    Kim, H.2    Vuduc, R.3
  • 24
    • 0034224812 scopus 로고    scopus 로고
    • Implementing streaming SIMD extensions on the pentium III processor
    • Jul
    • S. K. Raman, V. Pentkovski, and J. Keshava, "Implementing Streaming SIMD Extensions on the Pentium III Processor," IEEE Micro, vol. 20, no. 4, pp. 47-57, Jul. 2000.
    • (2000) IEEE Micro , vol.20 , Issue.4 , pp. 47-57
    • Raman, S.K.1    Pentkovski, V.2    Keshava, J.3
  • 25
    • 84899713228 scopus 로고    scopus 로고
    • Altivec technology: A second generation simd microprocessor architecture
    • M. Phillip, "Altivec. technology: A second generation simd microprocessor architecture," Hot Chips 10, 1998.
    • (1998) Hot Chips , vol.10
    • Phillip, M.1
  • 26
    • 0032633255 scopus 로고    scopus 로고
    • AMD 3DNow! technology: Architecture and implementations
    • Mar
    • S. Oberman, G. Favor, and F. Weber, "AMD 3DNow! Technology: Architecture and Implementations," IEEE Micro, vol. 19, no. 2, pp. 37-48, Mar. 1999.
    • (1999) IEEE Micro , vol.19 , Issue.2 , pp. 37-48
    • Oberman, S.1    Favor, G.2    Weber, F.3
  • 28
    • 84867435009 scopus 로고    scopus 로고
    • An early evaluation of the scalability of graph algorithms on the intel mic architecture
    • E. Saule and U. Catalyurek, "An early evaluation of the scalability of graph algorithms on the intel mic architecture," in IPDPS Workshops' 12, 2012, pp. 1629-1639.
    • (2012) IPDPS Workshops , vol.12 , pp. 1629-1639
    • Saule, E.1    Catalyurek, U.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.