-
1
-
-
67650998701
-
Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms
-
Williams, S., Carter, J., Oliker, L., Shalf, J., Yelick, K.A.: Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms. Journal of Parallel and Distributed Computing (2009)
-
(2009)
Journal of Parallel and Distributed Computing
-
-
Williams, S.1
Carter, J.2
Oliker, L.3
Shalf, J.4
Yelick, K.A.5
-
2
-
-
3042632157
-
MPI versus MPI+OpenMP on IBM SP for the NAS benchmarks
-
IEEE Computer Society, Los Alamitos
-
Cappello, F., Etiemble, D.: MPI versus MPI+OpenMP on IBM SP for the NAS benchmarks. In: Supercomputing 2000: Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), Washington, DC, USA. IEEE Computer Society, Los Alamitos (2000)
-
(2000)
Supercomputing 2000: Proceedings of the 2000 ACM/IEEE Conference on Supercomputing (CDROM), Washington, DC, USA
-
-
Cappello, F.1
Etiemble, D.2
-
3
-
-
70450079998
-
Handling OS jitter on multicore multithreaded systems
-
IEEE Computer Society Press, Los Alamitos
-
Mann, P.D.V., Mittaly, U.: Handling OS jitter on multicore multithreaded systems. In: IPDPS 2009: Proceedings of the 2009 IEEE International Symposium on Parallel and Distributed Processing, Washington, DC, USA. IEEE Computer Society Press, Los Alamitos (2009)
-
(2009)
IPDPS 2009: Proceedings of the 2009 IEEE International Symposium on Parallel and Distributed Processing, Washington, DC, USA
-
-
Mann, P.D.V.1
Mittaly, U.2
-
5
-
-
78249252490
-
A generalized framework for auto-tuning stencil computations
-
Kamil, S., Chan, C., Williams, S., Oliker, L., Shalf, J., Howison, M., Bethel, E.W.: A generalized framework for auto-tuning stencil computations. In: Proceedings of the Cray User Group Conference (2009)
-
Proceedings of the Cray User Group Conference (2009)
-
-
Kamil, S.1
Chan, C.2
Williams, S.3
Oliker, L.4
Shalf, J.5
Howison, M.6
Bethel, E.W.7
-
6
-
-
0029191296
-
Cilk: An efficient multithreaded runtime system
-
Blumofe, R.D., Joerg, C.F., Kuszmaul, B.C., Leiserson, C.E., Randall, K.H., Zhou, Y.: Cilk: An efficient multithreaded runtime system. Journal of Parallel and Distributed Computing (1995)
-
(1995)
Journal of Parallel and Distributed Computing
-
-
Blumofe, R.D.1
Joerg, C.F.2
Kuszmaul, B.C.3
Leiserson, C.E.4
Randall, K.H.5
Zhou, Y.6
-
7
-
-
74049102092
-
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
-
ACM, New York
-
Song, F., YarKhan, A., Dongarra, J.: Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems. In: SC 2009: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. ACM, New York (2009)
-
(2009)
SC 2009: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
-
-
Song, F.1
YarKhan, A.2
Dongarra, J.3
-
8
-
-
84877019178
-
The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q
-
IEEE Computer Society Press, Los Alamitos
-
Petrini, F., Kerbyson, D.J., Pakin, S.: The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q. In: SC 2003: Proceedings of the 2003 ACM/IEEE conference on Supercomputing, Washington, DC, USA, IEEE Computer Society Press, Los Alamitos (2003)
-
(2003)
SC 2003: Proceedings of the 2003 ACM/IEEE Conference on Supercomputing, Washington, DC, USA
-
-
Petrini, F.1
Kerbyson, D.J.2
Pakin, S.3
-
9
-
-
78149278896
-
-
Klug, T., Ott, M., Weidendorfer, J., Trinitis, C., Müchen, T.U.: Autopin, automated optimization of thread-to-core pinning on multicore systems (2008)
-
(2008)
Autopin, Automated Optimization of Thread-to-core Pinning on Multicore Systems
-
-
Klug, T.1
Ott, M.2
Weidendorfer, J.3
Trinitis, C.4
Müchen, T.U.5
|