-
1
-
-
84871137952
-
-
Available at
-
Intel compiler. Available at http://software.intel.com/en-us/intel- parallel-studio-xe.
-
Intel Compiler
-
-
-
2
-
-
84903789660
-
-
Available at
-
Intel's VTune. Available at www.intel.com/Software/Products.
-
Intel's VTune
-
-
-
3
-
-
33747416213
-
The efficacy of software prefetching and locality optimizations on future memory systems
-
A.-H. Badawy, A. Aggarwal, D. Yeung, and C.-W. Tseng. The efficacy of software prefetching and locality optimizations on future memory systems. JILP'2004, 6(7).
-
JILP'2004
, vol.6
, Issue.7
-
-
Badawy, A.-H.1
Aggarwal, A.2
Yeung, D.3
Tseng, C.-W.4
-
5
-
-
77956435385
-
Resource-aware compiler prefetching for many-cores
-
G. C. Caragea, A. Tzannes, F. Keceli, R. Barua, and U. Vishkin. Resource-aware compiler prefetching for many-cores. In ISPDC'2010, pages 133-140.
-
ISPDC'2010
, pp. 133-140
-
-
Caragea, G.C.1
Tzannes, A.2
Keceli, F.3
Barua, R.4
Vishkin, U.5
-
6
-
-
0028202735
-
A performance study of software and hardware data prefetching schemes
-
T.-F. Chen and J.-L. Baer. A performance study of software and hardware data prefetching schemes. In ISCA'94, pages 223-232.
-
ISCA'94
, pp. 223-232
-
-
Chen, T.-F.1
Baer, J.-L.2
-
7
-
-
79953124483
-
Inter-core prefetching for multicore processors using migrating helper threads
-
M. Kamruzzaman, S. Swanson, and D. M. Tullsen. Inter-core prefetching for multicore processors using migrating helper threads. In ASPLOS'11, pages 393-404.
-
ASPLOS'11
, pp. 393-404
-
-
Kamruzzaman, M.1
Swanson, S.2
Tullsen, D.M.3
-
8
-
-
0036949290
-
Design and evaluation of compiler algorithms for pre-execution
-
D. Kim and D. Yeung. Design and evaluation of compiler algorithms for pre-execution. In ASPLOS'02, pages 159-170.
-
ASPLOS'02
, pp. 159-170
-
-
Kim, D.1
Yeung, D.2
-
9
-
-
0026153646
-
An architecture for software controlled data prefetching
-
A. C. Klaiber and H. M. Levy. An architecture for software controlled data prefetching. In ISCA'91, pages 43-53.
-
ISCA'91
, pp. 43-53
-
-
Klaiber, A.C.1
Levy, H.M.2
-
10
-
-
84899707922
-
Compiler-based data prefetching and streaming non-temporal store generation for intel xeon phi coprocessor
-
R. Krishnaiyer, E. Kultursay, P. Chawla, S. Preis, A. Zvezdin, and H. Saito. Compiler-based data prefetching and streaming non-temporal store generation for intel xeon phi coprocessor. In Workshop on Multithreaded Architectures and Applications, 2013.
-
(2013)
Workshop on Multithreaded Architectures and Applications
-
-
Krishnaiyer, R.1
Kultursay, E.2
Chawla, P.3
Preis, S.4
Zvezdin, A.5
Saito, H.6
-
11
-
-
37549032725
-
Ibm power6 microarchitecture
-
H. Q. Le, W. J. Starke, J. S. Fields, F. P. O'Connell, D. Q. Nguyen, B. J. Ronchetti, W. M. Sauer, E. M. Schwarz, and M. T. Vaden. Ibm power6 microarchitecture. IBM Journal of Research and Development, 51(6):639-662, 2007.
-
(2007)
IBM Journal of Research and Development
, vol.51
, Issue.6
, pp. 639-662
-
-
Le, H.Q.1
Starke, W.J.2
Fields, J.S.3
O'Connell, F.P.4
Nguyen, D.Q.5
Ronchetti, B.J.6
Sauer, W.M.7
Schwarz, E.M.8
Vaden, M.T.9
-
12
-
-
84859463353
-
When prefetching works, when it does not, and why
-
J. Lee, H. Kim, and R. Vuduc. When prefetching works, when it does not, and why. TACO'2012, 9(1):29.
-
TACO'2012
, vol.9
, Issue.1
, pp. 29
-
-
Lee, J.1
Kim, H.2
Vuduc, R.3
-
13
-
-
0026918402
-
Design and evaluation of a compiler algorithm for prefetching
-
T. C. Mowry, M. S. Lam, and A. Gupta. Design and evaluation of a compiler algorithm for prefetching. In ASPLOS'92, pages 62-73.
-
ASPLOS'92
, pp. 62-73
-
-
Mowry, T.C.1
Lam, M.S.2
Gupta, A.3
-
14
-
-
35248838281
-
The specification of source-to-source transformations for the compile-time optimization of parallel object-oriented scientific applications
-
D. J. Quinlan, M. Schordan, B. Philip, and M. Kowarschik. The specification of source-to-source transformations for the compile-time optimization of parallel object-oriented scientific applications. In LCPC'01, pages 383-394.
-
LCPC'01
, pp. 383-394
-
-
Quinlan, D.J.1
Schordan, M.2
Philip, B.3
Kowarschik, M.4
-
15
-
-
0029727692
-
Improving the effectiveness of software prefetching with adaptive executions
-
R. H. Saavedra and D. Park. Improving the effectiveness of software prefetching with adaptive executions. In PACT'96, pages 68-78.
-
PACT'96
, pp. 68-78
-
-
Saavedra, R.H.1
Park, D.2
-
17
-
-
67650091160
-
A compiler-directed data prefetching scheme for chip multiprocessors
-
S. W. Son, M. Kandemir, M. Karakoy, and D. Chakrabarti. A compiler-directed data prefetching scheme for chip multiprocessors. In PPOPP'09, pages 209-218.
-
PPOPP'09
, pp. 209-218
-
-
Son, S.W.1
Kandemir, M.2
Karakoy, M.3
Chakrabarti, D.4
-
18
-
-
0038345683
-
Guided region prefetching: A cooperative hardware/software approach
-
Z. Wang, D. Burger, K. S. McKinley, S. K. Reinhardt, and C. C. Weems. Guided region prefetching: A cooperative hardware/software approach. In ISCA'03, pages 388-398.
-
ISCA'03
, pp. 388-398
-
-
Wang, Z.1
Burger, D.2
McKinley, K.S.3
Reinhardt, S.K.4
Weems, C.C.5
|