[1] Intel Xeon Quad Processor. http://www.intel.com/products/processor/xeon5000.
[4] ALMÁSI, G., HEIDELBERGER, P., ARCHER, C. J., MARTORELL, X., ERWAY, C. C., MOREIRA, J. E., STEINMACHER-BUROW, B., AND ZHENG, Y. Optimization of MPI collective communication on BlueGene/L systems. In ICS '05: Proceedings of the 19th annual international conference on Supercomputing (New York, NY, USA, 2005), ACM Press, pp. 253-262.
[5] ASANOVIC, K., BODIK, R., CATANZARO, B. C., GEBIS, J. J., HUSBANDS, P., KEUTZER, K., PATTERSON, D. A., PLISHKER, W. L., SHALF, J., WILLIAMS, S. W., AND YELICK, K. A. The landscape of parallel computing research: A view from Berkeley. Tech. Rep. UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
[6] BALAJI, P., BUNTINAS, D., BALAY, S., SMITH, B., THAKUR, R., AND GROPP, W. Nonuniformly communicating noncontiguous data: A case study with PETSc and MPI. In IEEE Parallel and Distributed Processing Symposium (IPDPS) (2006).
[7] BELL, C., BONACHEA, D., NISHTALA, R., AND YELICK, K. Optimizing bandwidth limited problems using one-sided communication and overlap. In The 20th Int'l Parallel and Distributed Processing Symposium (IPDPS) (2006).
[8] BRUCK, J., HO, C.-T., UPFAL, E., KIPNIS, S., AND WEATHERSBY, D. Efficient algorithms for all-to-all communications in multiport message-passing systems. IEEE Trans. Parallel Distrib. Syst. 8, 11 (1997), 1143-1156.
[9] COARFA, C., DOTSENKO, Y., ECKHARDT, J., AND MELLOR-CRUMMEY, J. Co-array Fortran performance and potential: An NPB experimental study. In 16th Int'l Workshop on Languages and Compilers for Parallel Computing (LCPC) (October 2003).
[10] CULLER, D. E., KARP, R. M., PATTERSON, D. A., SAHAY, A., SCHAUSER, K. E., SANTOS, E., SUBRAMONIAN, R., AND VON EICKEN, T. LogP: Towards a realistic model of parallel computation. In Proc. 4th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (1993), pp. 1-12.
[11] DATTA, K., MURPHY, M., VOLKOV, V., WILLIAMS, S., CARTER, J., OLIKER, L., PATTERSON, D., SHALF, J., AND YELICK, K. Stencil Computation Optimization and Auto-tuning on State-of-the-Art Multicore Architectures. In Supercomputing 2008 (SC08) (November 2008).
[12] MAMIDALA, A., KUMAR, R., DE, D., AND PANDA, D. K. MPI collectives on modern multicore clusters: Performance optimizations and communication characteristics. In Int'l Symposium on Cluster Computing and the Grid, Lyon, France (May 2008).
[13] MELLOR-CRUMMEY, J. M., AND SCOTT, M. L. Algorithms for scalable synchronization on shared-memory multiprocessors. ACM Trans. Comput. Syst. 9, 1 (1991), 21-65.
[14] MPI: A message-passing interface standard, v1.1. Technical report, University of Tennessee, Knoxville, June 12, 1995.
[18] UPC language specifications, v1.2. Tech. Rep. LBNL-59208, Berkeley National Lab, 2005.
[19] WILLIAMS, S., CARTER, J., OLIKER, L., SHALF, J., AND YELICK, K. Lattice Boltzmann simulation optimization on leading multicore platforms. In International Conference on Parallel and Distributed Computing Systems (IPDPS) (2008).
[20] WILLIAMS, S., OLIKER, L., VUDUC, R., SHALF, J., YELICK, K., AND DEMMEL, J. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In Proceedings of Supercomputing 2007 (2007).
[21] YELICK, K., SEMENZATO, L., PIKE, G., MIYAMOTO, C., LIBLIT, B., KRISHNAMURTHY, A., HILFINGER, P., GRAHAM, S., GAY, D., COLELLA, P., AND AIKEN, A. Titanium: A high-performance Java dialect. In Proc. of ACM 1998 Workshop on Java for High-Performance Network Computing (February 1998).