-
1
-
-
0028757636
-
A high performance parallel algorithm for 1-d fft
-
R. C. Agarwal, F. G. Gustavson, and M. Zubair. A high performance parallel algorithm for 1-d fft. In SC, pages 34-40, 1994.
-
(1994)
SC
, pp. 34-40
-
-
Agarwal, R.C.1
Gustavson, F.G.2
Zubair, M.3
-
3
-
-
25844504635
-
QSNETII: Defining high-performance network design
-
J. Beecroft , et al. QSNETII: Defining high-performance network design. IEEE Micro, 25(4):34-47, 2005.
-
(2005)
IEEE Micro
, vol.25
, Issue.4
, pp. 34-47
-
-
Beecroft, J.1
-
5
-
-
85033334323
-
-
C. Bell, D. Bonachea, R. Nishtala, and K. Yelick. Optimizing bandwidth limited problems using one-sided communication and overlap. Technical Report LBNL-59207, Berkeley National Lab, 2005.
-
C. Bell, D. Bonachea, R. Nishtala, and K. Yelick. Optimizing bandwidth limited problems using one-sided communication and overlap. Technical Report LBNL-59207, Berkeley National Lab, 2005.
-
-
-
-
7
-
-
85033341619
-
-
The Berkeley UPC Compiler
-
The Berkeley UPC Compiler, 2002. http://upc.lbl.gov.
-
(2002)
-
-
-
8
-
-
33847094060
-
GASNet specification
-
Technical Report CSD-02-1207, University of California, Berkeley, October
-
D. Bonachea. GASNet specification. Technical Report CSD-02-1207, University of California, Berkeley, October 2002.
-
(2002)
-
-
Bonachea, D.1
-
9
-
-
33746759468
-
Proposal for extending the UPC memory copy library functions and supporting extensions to GASNet, v1.0
-
Technical Report LBNL-56495, Berkeley National Lab, October 2004
-
D. Bonachea. Proposal for extending the UPC memory copy library functions and supporting extensions to GASNet, v1.0. Technical Report LBNL-56495, Berkeley National Lab, October 2004.
-
-
-
Bonachea, D.1
-
13
-
-
1142293067
-
A Performance Analysis of the Berkeley UPC Compiler
-
June
-
W. Chen, D. Bonachea, J. Duell, P. Husband, C. Iancu, and K. Yelick. A Performance Analysis of the Berkeley UPC Compiler. In Proc. of Int'l Conference on Supercomputing (ICS), June 2003.
-
(2003)
Proc. of Int'l Conference on Supercomputing (ICS)
-
-
Chen, W.1
Bonachea, D.2
Duell, J.3
Husband, P.4
Iancu, C.5
Yelick, K.6
-
16
-
-
33845393854
-
Transformations to parallel codes for communication-computation overlap
-
November
-
A. Danalis, K.-Y. Kim, L. Pollock, and M. Swany. Transformations to parallel codes for communication-computation overlap. In Supercomputing 2005, November 2005.
-
(2005)
Supercomputing 2005
-
-
Danalis, A.1
Kim, K.-Y.2
Pollock, L.3
Swany, M.4
-
18
-
-
0031997862
-
A method for exploiting communication/computation overlap in hypercubes
-
L. Díaz, M. Valero-García, and A. González. A method for exploiting communication/computation overlap in hypercubes. Parallel Computing, 24(2):221-245, 1998.
-
(1998)
Parallel Computing
, vol.24
, Issue.2
, pp. 221-245
-
-
Díaz, L.1
Valero-García, M.2
González, A.3
-
19
-
-
0035980881
-
Scalable parallel FFT for spectral simulations on a beowulf cluster
-
P. Dmitruk, et al. Scalable parallel FFT for spectral simulations on a beowulf cluster. Parallel Computing, 2001.
-
(2001)
Parallel Computing
-
-
Dmitruk, P.1
-
20
-
-
80052802178
-
UPC performance and potential: A NPB experimental study
-
T. El-Ghazawi and F. Cantonnet. UPC performance and potential: A NPB experimental study. In Supercomputing, 2002.
-
(2002)
Supercomputing
-
-
El-Ghazawi, T.1
Cantonnet, F.2
-
21
-
-
33847169750
-
Automatic generation and tuning of MPI collective communication routines
-
A. Faraj and X. Yuan. Automatic generation and tuning of MPI collective communication routines. In Proc. Supercomputing, 2005.
-
(2005)
Proc. Supercomputing
-
-
Faraj, A.1
Yuan, X.2
-
22
-
-
20744449792
-
The design and implementation of FFTW3
-
M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. of the IEEE, 93(2):216-231, 2005.
-
(2005)
Proc. of the IEEE
, vol.93
, Issue.2
, pp. 216-231
-
-
Frigo, M.1
Johnson, S.G.2
-
23
-
-
85033334967
-
-
home
-
GASNet home page. http://gasnet.cs.berkeley.edu/.
-
GASNet
-
-
-
24
-
-
33847106262
-
Survey of MPI call usage
-
D. Han and T. Jones. Survey of MPI call usage. In SciComp, 2004.
-
(2004)
SciComp
-
-
Han, D.1
Jones, T.2
-
25
-
-
85033330031
-
-
P. Hilfinger, D. Bonachea, D. Gay, S. Graham, B. Liblit, G. Pike, and K. Yelick. Titanium language reference manual. Tech Report UCB/CSD-01-1163, U.C. Berkeley, November 2001.
-
P. Hilfinger, D. Bonachea, D. Gay, S. Graham, B. Liblit, G. Pike, and K. Yelick. Titanium language reference manual. Tech Report UCB/CSD-01-1163, U.C. Berkeley, November 2001.
-
-
-
-
27
-
-
33847152014
-
Building multirail Infiniband clusters: MPI-level design
-
J. Liu, A. Vishnu, and D. K. Panda. Building multirail Infiniband clusters: MPI-level design. In SuperComputing, 2004.
-
(2004)
SuperComputing
-
-
Liu, J.1
Vishnu, A.2
Panda, D.K.3
-
28
-
-
3042721503
-
High performance RDMA-based mpi implementation over Infiniband
-
J. Liu, J. Wu, and D. K. Panda. High performance RDMA-based mpi implementation over Infiniband. Int'l J. of Parallel Prog., 2004.
-
(2004)
Int'l J. of Parallel Prog
-
-
Liu, J.1
Wu, J.2
Panda, D.K.3
-
29
-
-
0003413675
-
A message-passing interface standard, v1.1
-
MPI:, Technical report, University of Tennessee, Knoxville, June 12
-
MPI: A message-passing interface standard, v1.1. Technical report, University of Tennessee, Knoxville, June 12, 1995.
-
(1995)
-
-
-
30
-
-
85033343554
-
-
MPI-2: a message-passing interface standard. Int'l J. of High Performance Computing Applications, 12:1-299, 1998.
-
MPI-2: a message-passing interface standard. Int'l J. of High Performance Computing Applications, 12:1-299, 1998.
-
-
-
-
31
-
-
0006168939
-
ARMCI: A portable remote memory copy library for distributed array libraries and compiler run-time systems
-
J. Nieplocha and B. Carpenter. ARMCI: A portable remote memory copy library for distributed array libraries and compiler run-time systems. In Proc. RTSPP IPPS/SDP'99, 1999.
-
(1999)
Proc. RTSPP IPPS/SDP'99
-
-
Nieplocha, J.1
Carpenter, B.2
-
32
-
-
0002081678
-
Co-array fortran for parallel programming
-
R. Numrich and J. Reid. Co-array fortran for parallel programming. In ACM Fortran Forum 17, 2, 1-31., 1998.
-
(1998)
ACM Fortran Forum
, vol.17
, Issue.2
, pp. 1-31
-
-
Numrich, R.1
Reid, J.2
-
33
-
-
33845425848
-
Scientific computations on modern parallel vector systems
-
L. Oliker, et al. Scientific computations on modern parallel vector systems. In Proc. of Supercomputing, 2004.
-
(2004)
Proc. of Supercomputing
-
-
Oliker, L.1
-
34
-
-
0035342056
-
A comparison of optimal FFTs on torus and hypercube multicomputers
-
P. Swartztrauber and S. Hammond. A comparison of optimal FFTs on torus and hypercube multicomputers. Parallel Computing, 2001.
-
(2001)
Parallel Computing
-
-
Swartztrauber, P.1
Hammond, S.2
-
35
-
-
85033349291
-
-
UPC consortium home
-
UPC consortium home page. http://upc.gwu.edu/.
-
-
-
-
36
-
-
85033327980
-
-
UPC language specifications, v1.2. Technical Report LBNL-59208, Berkeley National Lab, 2005.
-
UPC language specifications, v1.2. Technical Report LBNL-59208, Berkeley National Lab, 2005.
-
-
-
-
38
-
-
84942813297
-
Programming the Infiniband network architecture for high performance message passing systems
-
V. Velusamy, et al. Programming the Infiniband network architecture for high performance message passing systems. In ISCA, 2003.
-
(2003)
ISCA
-
-
Velusamy, V.1
|