-
5
-
-
84899731921
-
-
Intel Corporation
-
Intel Corporation, "Intel-R Composer XE 2013," 2013, http://software.intel.com/en-us/intel-composer-xe/.
-
(2013)
Intel-R Composer XE 2013
-
-
-
6
-
-
84864831385
-
Can traditional programming bridge the ninja performance gap for parallel computing applications?
-
N. Satish, C. Kim, J. Chhugani, H. Saito, R. Krishnaiyer, M. Smelyanskiy, M. Girkar, and P. Dubey, "Can traditional programming bridge the ninja performance gap for parallel computing applications?" in Proceedings of the 39th Annual International Symposium on Computer Architecture, ser. ISCA, 2012.
-
(2012)
Proceedings of the 39th Annual International Symposium on Computer Architecture, Ser. ISCA
-
-
Satish, N.1
Kim, C.2
Chhugani, J.3
Saito, H.4
Krishnaiyer, R.5
Smelyanskiy, M.6
Girkar, M.7
Dubey, P.8
-
7
-
-
67650816174
-
Specomp: A new benchmark suite for measuring parallel computer performance
-
V. Aslot, M. J. Domeika, R. Eigenmann, G. Gaertner, W. B. Jones, and B. Parady, "Specomp: A new benchmark suite for measuring parallel computer performance," in Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming, ser. WOMPAT, 2001.
-
(2001)
Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming, Ser. WOMPAT
-
-
Aslot, V.1
Domeika, M.J.2
Eigenmann, R.3
Gaertner, G.4
Jones, W.B.5
Parady, B.6
-
9
-
-
84899688494
-
The nas parallel benchmarkssummary and preliminary results
-
D. H. Bailey, E. Barszcz, J. T. Barton, D. S. Browning, R. L. Carter, L. Dagum, R. A. Fatoohi, P. O. Frederickson, T. A. Lasinski, R. S. Schreiber, H. D. Simon, V. Venkatakrishnan, and S. K. Weeratunga, "The nas parallel benchmarkssummary and preliminary results," in Proceedings of the 1991 ACM/IEEE conference on Supercomputing, ser. Supercomputing, 1991.
-
(1991)
Proceedings of the 1991 ACM/IEEE Conference on Supercomputing, Ser. Supercomputing
-
-
Bailey, D.H.1
Barszcz, E.2
Barton, J.T.3
Browning, D.S.4
Carter, R.L.5
Dagum, L.6
Fatoohi, R.A.7
Frederickson, P.O.8
Lasinski, T.A.9
Schreiber, R.S.10
Simon, H.D.11
Venkatakrishnan, V.12
Weeratunga, S.K.13
-
10
-
-
70649092154
-
Rodinia: A benchmark suite for heterogeneous computing
-
S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S.-H. Lee, and K. Skadron, "Rodinia: A benchmark suite for heterogeneous computing," in Proceedings of the 2009 IEEE International Symposium on Workload Characterization, ser. IISWC, 2009.
-
(2009)
Proceedings of the 2009 IEEE International Symposium on Workload Characterization, Ser. IISWC
-
-
Che, S.1
Boyer, M.2
Meng, J.3
Tarjan, D.4
Sheaffer, J.W.5
Lee, S.-H.6
Skadron, K.7
-
11
-
-
0003690936
-
-
Ph.D. dissertation Houston, TX, USA Rice University
-
A. K. Porterfield, "Software methods for improvement of cache performance on supercomputer applications," Ph.D. dissertation, Houston, TX, USA, 1989, Rice University.
-
(1989)
Software Methods for Improvement of Cache Performance on Supercomputer Applications
-
-
Porterfield, A.K.1
-
12
-
-
0026138044
-
Software prefetching
-
D. Callahan, K. Kennedy, and A. Porterfield, "Software prefetching," in Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, ser. ASPLOS IV, 1991.
-
(1991)
Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, Ser. ASPLOS
, vol.4
-
-
Callahan, D.1
Kennedy, K.2
Porterfield, A.3
-
16
-
-
0035691709
-
Dynamic speculative precomputation
-
J. D. Collins, D. M. Tullsen, H. Wang, and J. P. Shen, "Dynamic speculative precomputation," in Proceedings of the 34th annual ACM/IEEE international symposium on Microarchitecture, ser. MICRO 34, 2001.
-
(2001)
Proceedings of the 34th Annual ACM/IEEE International Symposium on Microarchitecture, Ser. MICRO 34
-
-
Collins, J.D.1
Tullsen, D.M.2
Wang, H.3
Shen, J.P.4
-
17
-
-
0035510681
-
Designing a modern memory hierarchy with hardware prefetching
-
nov
-
W.-F. Lin, S. Reinhardt, and D. Burger, "Designing a modern memory hierarchy with hardware prefetching," Computers, IEEE Transactions on, vol. 50, no. 11, pp. 1202-1218, nov 2001.
-
(2001)
Computers, IEEE Transactions on
, vol.50
, Issue.11
, pp. 1202-1218
-
-
Lin, W.-F.1
Reinhardt, S.2
Burger, D.3
-
18
-
-
84867511610
-
Making data prefetch smarter: Adaptive prefetching on power7
-
V. Jim?enez, R. Gioiosa, F. J. Cazorla, A. Buyuktosunoglu, P. Bose, and F. P. O'Connell, "Making data prefetch smarter: adaptive prefetching on power7," in Proceedings of the 21st international conference on Parallel architectures and compilation techniques, ser. PACT, 2012.
-
(2012)
Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques, Ser. PACT
-
-
Jimenez, V.1
Gioiosa, R.2
Cazorla, F.J.3
Buyuktosunoglu, A.4
Bose, P.5
O'Connell, F.P.6
-
19
-
-
0001589803
-
Data prefetch mechanisms
-
Jun
-
S. P. Vanderwiel and D. J. Lilja, "Data prefetch mechanisms," ACM Comput. Surv., vol. 32, no. 2, pp. 174-199, Jun. 2000.
-
(2000)
ACM Comput. Surv.
, vol.32
, Issue.2
, pp. 174-199
-
-
Vanderwiel, S.P.1
Lilja, D.J.2
-
20
-
-
35048892897
-
Effect of optimizations on performance of openmp programs
-
Springer Berlin Heidelberg
-
X. Tian and M. Girkar, "Effect of optimizations on performance of openmp programs," in High Performance Computing-HiPC 2004, ser. Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2005, pp. 133-143.
-
High Performance Computing-HiPC 2004, Ser. Lecture Notes in Computer Science
, vol.2005
, pp. 133-143
-
-
Tian, X.1
Girkar, M.2
-
21
-
-
54249085014
-
A case study on compiler optimizations for the intel core 2 duo processor
-
Dec
-
A. J. C. Bik, D. L. Kreitzer, and X. Tian, "A case study on compiler optimizations for the intel core 2 duo processor," Int. J. Parallel Program., vol. 36, no. 6, pp. 571-591, Dec. 2008.
-
(2008)
Int. J. Parallel Program.
, vol.36
, Issue.6
, pp. 571-591
-
-
Bik, A.J.C.1
Kreitzer, D.L.2
Tian, X.3
-
22
-
-
33746291130
-
Impact of compiler-based data-prefetching techniques on spec omp application performance
-
X. Tian, R. Krishnaiyer, H. Saito, M. Girkar, and W. Li, "Impact of compiler-based data-prefetching techniques on spec omp application performance," in Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium, ser. IPDPS '05, 2005.
-
(2005)
Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium, Ser. IPDPS '05
-
-
Tian, X.1
Krishnaiyer, R.2
Saito, H.3
Girkar, M.4
Li, W.5
-
23
-
-
84859463353
-
When prefetching works, when it doesn't, and why
-
Mar.
-
J. Lee, H. Kim, and R. Vuduc, "When prefetching works, when it doesn?t, and why," ACM Trans. Archit. Code Optim., vol. 9, no. 1, pp. 2:1-2:29, Mar. 2012.
-
(2012)
ACM Trans. Archit. Code Optim.
, vol.9
, Issue.1
, pp. 21-229
-
-
Lee, J.1
Kim, H.2
Vuduc, R.3
-
24
-
-
0034224812
-
Implementing streaming SIMD extensions on the pentium III processor
-
Jul
-
S. K. Raman, V. Pentkovski, and J. Keshava, "Implementing Streaming SIMD Extensions on the Pentium III Processor," IEEE Micro, vol. 20, no. 4, pp. 47-57, Jul. 2000.
-
(2000)
IEEE Micro
, vol.20
, Issue.4
, pp. 47-57
-
-
Raman, S.K.1
Pentkovski, V.2
Keshava, J.3
-
25
-
-
84899713228
-
Altivec technology: A second generation simd microprocessor architecture
-
M. Phillip, "Altivec. technology: A second generation simd microprocessor architecture," Hot Chips 10, 1998.
-
(1998)
Hot Chips
, vol.10
-
-
Phillip, M.1
-
26
-
-
0032633255
-
AMD 3DNow! technology: Architecture and implementations
-
Mar
-
S. Oberman, G. Favor, and F. Weber, "AMD 3DNow! Technology: Architecture and Implementations," IEEE Micro, vol. 19, no. 2, pp. 37-48, Mar. 1999.
-
(1999)
IEEE Micro
, vol.19
, Issue.2
, pp. 37-48
-
-
Oberman, S.1
Favor, G.2
Weber, F.3
-
27
-
-
43049114025
-
Applications tuning for sreaming simd extensions
-
J. Abel, K. Balasubramanian, M. Bargeron, T. Craver, and M. Phlipot, "Applications tuning for sreaming simd extensions," Intel Technology Journal Q2, 1999.
-
(1999)
Intel Technology Journal
-
-
Abel, J.1
Balasubramanian, K.2
Bargeron, M.3
Craver, T.4
Phlipot, M.5
-
28
-
-
84867435009
-
An early evaluation of the scalability of graph algorithms on the intel mic architecture
-
E. Saule and U. Catalyurek, "An early evaluation of the scalability of graph algorithms on the intel mic architecture," in IPDPS Workshops' 12, 2012, pp. 1629-1639.
-
(2012)
IPDPS Workshops
, vol.12
, pp. 1629-1639
-
-
Saule, E.1
Catalyurek, U.2
|