-
1
-
-
33751044112
-
Overlap of computation and communication on shared-memory networks-of-workstations
-
T. S. Abdelrahman and G. Liu. Overlap of computation and communication on shared-memory networks-of-workstations. Cluster computing, pages 35-45, 2001.
-
(2001)
Cluster Computing
, pp. 35-45
-
-
Abdelrahman, T.S.1
Liu, G.2
-
2
-
-
84870548923
-
An overview of the BlueGene/L supercomputer
-
Baltimore, MD, November
-
N. Adiga, G. Almasi, G. Almasi, Y. Aridor, R. Barik, D. Beece, R. Bellofatto, G. Bhanot, R. Bickford, M. Blumrich, A. Bright, and J. An overview of the BlueGene/L supercomputer. In SC2002 - High Performance Networking and Computing, Baltimore, MD, November 2002.
-
(2002)
SC2002 - High Performance Networking and Computing
-
-
Adiga, N.1
Almasi, G.2
Almasi, G.3
Aridor, Y.4
Barik, R.5
Beece, D.6
Bellofatto, R.7
Bhanot, G.8
Bickford, R.9
Blumrich, M.10
Bright, A.11
-
3
-
-
35048819077
-
An overview of the BlueGene/L system software organization
-
Lecture Notes in Computer Science, Klagenfurt, Austria, August. Springer-Verlag
-
G. Almasi, R. Bellofatto, J. Brunheroto, C. Caşcaval, J. G. C. nos, L. Ceze, P. Crumley, C. Erway, J. Gagliano, D. Lieber, X. Martorell, J. E. Moreira, A. Sanomiya, and K. Strauss. An overview of the BlueGene/L system software organization, In Proceedings of Euro-Par 2003 Conference, Lecture Notes in Computer Science, Klagenfurt, Austria, August 2003. Springer-Verlag.
-
(2003)
Proceedings of Euro-Par 2003 Conference
-
-
Almasi, G.1
Bellofatto, R.2
Brunheroto, J.3
Caşcaval, C.4
Nos, J.G.C.5
Ceze, L.6
Crumley, P.7
Erway, C.8
Gagliano, J.9
Lieber, D.10
Martorell, X.11
Moreira, J.E.12
Sanomiya, A.13
Strauss, K.14
-
4
-
-
84900342836
-
SPEComp: A new benchmark suite for measuring parallel computer performance
-
July
-
V. Aslot, M. Domeika, R. Eigenmann, G. Gaertner, W. B. Jones, and B. Parady. SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance. In Proc. of the Workshop on OpenMP Applications and Tools (WOMPAT2001), Lecture Notes in Computer Science, 2104, pages 1-10, July 2001.
-
(2001)
Proc. of the Workshop on OpenMP Applications and Tools (WOMPAT2001), Lecture Notes in Computer Science, 2104
, pp. 1-10
-
-
Aslot, V.1
Domeika, M.2
Eigenmann, R.3
Gaertner, G.4
Jones, W.B.5
Parady, B.6
-
6
-
-
84986512474
-
Charmm: A program for macromolecular energy, minimization, and dynamics calculations
-
B. R. Brooks, R. E. Bruccoleri, B. D. Olafson, D. J. States, S. Swaminathan, and M. Karplus. Charmm: A program for macromolecular energy, minimization, and dynamics calculations. J. Comp. Chem., 4:187-217, 1983.
-
(1983)
J. Comp. Chem.
, vol.4
, pp. 187-217
-
-
Brooks, B.R.1
Bruccoleri, R.E.2
Olafson, B.D.3
States, D.J.4
Swaminathan, S.5
Karplus, M.6
-
8
-
-
84976831704
-
Compiler optimizations for improving data locality
-
New York, NY, USA. ACM Press
-
S. Carr, K. S. McKinley, and C.-W. Tseng. Compiler optimizations for improving data locality. In ASPLOS-VI: Proceedings of the sixth international conference on Architectural support for programming languages and operating systems, pages 252-262, New York, NY, USA, 1994. ACM Press.
-
(1994)
ASPLOS-VI: Proceedings of the Sixth International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 252-262
-
-
Carr, S.1
McKinley, K.S.2
Tseng, C.-W.3
-
9
-
-
0023996569
-
A single-program-multiple-data computational model for epex/fortran
-
F. Darema, D. A. George, V. A. Norton, and G. F. Pfister. A single-program-multiple-data computational model for epex/fortran. Parallel Computing, 7(1):11-24, 1988.
-
(1988)
Parallel Computing
, vol.7
, Issue.1
, pp. 11-24
-
-
Darema, F.1
George, D.A.2
Norton, V.A.3
Pfister, G.F.4
-
10
-
-
0037883031
-
The design and implementation of a parallel unstructured Euler solver using software primitives
-
AIAA-92-0562
-
R. Das, D. Mavriplis, J. Saltz, S. Gupta, and R. Ponnusamy. The design and implementation of a parallel unstructured Euler solver using software primitives, AIAA-92-0562. In Proceedings of the 30th Aerospace Sciences Meeting, 1992.
-
(1992)
Proceedings of the 30th Aerospace Sciences Meeting
-
-
Das, R.1
Mavriplis, D.2
Saltz, J.3
Gupta, S.4
Ponnusamy, R.5
-
11
-
-
0001483604
-
Communication optimizations for irregular scientific computations on distributed memory architectures
-
R. Das, M. Uysal, J. Saltz, and Y.-S. S. Hwang. Communication optimizations for irregular scientific computations on distributed memory architectures. Journal of Parallel and Distributed Computing, 22(3):462-478, 1994.
-
(1994)
Journal of Parallel and Distributed Computing
, vol.22
, Issue.3
, pp. 462-478
-
-
Das, R.1
Uysal, M.2
Saltz, J.3
Hwang, Y.-S.S.4
-
12
-
-
0029322543
-
Distributed memory compiler design for sparse problems
-
R. Das, J. Wu, J. Saltz, H. Berryman, and S. Hiranandani. Distributed memory compiler design for sparse problems. IEEE Trans. Comput., 44(6):737-753, 1995.
-
(1995)
IEEE Trans. Comput.
, vol.44
, Issue.6
, pp. 737-753
-
-
Das, R.1
Wu, J.2
Saltz, J.3
Berryman, H.4
Hiranandani, S.5
-
13
-
-
0032667957
-
Improving cache performance in dynamic applications through data and computation reorganization at run time
-
New York, NY, USA, ACM Press
-
C. Ding and K. Kennedy, Improving cache performance in dynamic applications through data and computation reorganization at run time. In PLDI '99; Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, pages 229-241, New York, NY, USA, 1999, ACM Press.
-
(1999)
PLDI '99; Proceedings of the ACM SIGPLAN 1999 Conference on Programming Language Design and Implementation
, pp. 229-241
-
-
Ding, C.1
Kennedy, K.2
-
14
-
-
0003413672
-
MPI: A message-passing interface standard
-
M. P. I. Forum
-
M. P. I. Forum. MPI: A Message-Passing Interface Standard. Technical Report UT-CS-94-230, 1994.
-
(1994)
Technical Report UT-CS-94-230
-
-
-
16
-
-
0030243005
-
A high-performance, portable implementation of the MPI message passing interface standard
-
Sept.
-
W. Gropp, E. Lusk, N. Doss, and A. Skjellum. A high-performance, portable implementation of the MPI message passing interface standard. Parallel Computing, 22(6):789-828, Sept. 1996.
-
(1996)
Parallel Computing
, vol.22
, Issue.6
, pp. 789-828
-
-
Gropp, W.1
Lusk, E.2
Doss, N.3
Skjellum, A.4
-
17
-
-
0039436975
-
An HPF compiler for the IBM SP2
-
San Diego, CA
-
M. Gupta, S. Midkiff, E. Schonberg, V. Seshadri, D. Shields, K. Wang, W. Ching, and T. Ngo. An HPF compiler for the IBM SP2. In Proceedings of Supercomputing '95, San Diego, CA, 1995.
-
(1995)
Proceedings of Supercomputing '95
-
-
Gupta, M.1
Midkiff, S.2
Schonberg, E.3
Seshadri, V.4
Shields, D.5
Wang, K.6
Ching, W.7
Ngo, T.8
-
18
-
-
84958961828
-
A comparison of locality transformations for irregular codes
-
London, UK. Springer-Verlag
-
H. Han and C.-W. Tseng. A comparison of locality transformations for irregular codes. In LCR '00: Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, pages 70-84, London, UK, 2000. Springer-Verlag.
-
(2000)
LCR '00: Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers
, pp. 70-84
-
-
Han, H.1
Tseng, C.-W.2
-
21
-
-
1142307058
-
-
Technical report, Berkeley, CA, USA
-
P. N. Hilfinger, D. Bonachea, D. Gay, S. Graham, B. Liblit, G. Pike, and K. Yelick. Titanium language reference manual. Technical report, Berkeley, CA, USA, 2001.
-
(2001)
Titanium Language Reference Manual
-
-
Hilfinger, P.N.1
Bonachea, D.2
Gay, D.3
Graham, S.4
Liblit, B.5
Pike, G.6
Yelick, K.7
-
22
-
-
0034291415
-
Porting and performance evaluation of irregular codes using OpenMP
-
D. Hisley, G. Agrawal, P. Satya-narayana, and L. Pollock. Porting and performance evaluation of irregular codes using OpenMP. Concurrency; Practice and Experience, 12(12):1241-1259, 2000.
-
(2000)
Concurrency; Practice and Experience
, vol.12
, Issue.12
, pp. 1241-1259
-
-
Hisley, D.1
Agrawal, G.2
Satya-Narayana, P.3
Pollock, L.4
-
24
-
-
0033733534
-
A loop transformation algorithm for communication overlapping
-
K. Ishizaki, H. Komatsu, and T. Nakatani. "a loop transformation algorithm for communication overlapping". International Journal of Parallel Programming, 28(2):135-154, 2000.
-
(2000)
International Journal of Parallel Programming
, vol.28
, Issue.2
, pp. 135-154
-
-
Ishizaki, K.1
Komatsu, H.2
Nakatani, T.3
-
25
-
-
0003648799
-
The OpenMP implementation of NAS parallel benchmarks and its performance
-
H. Jin, M. Frumkin, and J. Yan. The OpenMP Implementation of NAS Parallel Benchmarks and Its Performance. Technical Report NAS-99-011.
-
Technical Report NAS-99-011
-
-
Jin, H.1
Frumkin, M.2
Yan, J.3
-
26
-
-
0025550566
-
Loop distribution with arbitrary control flow
-
Washington, DC, USA. IEEE Computer Society
-
K. Kennedy and K. S. McKinley. Loop distribution with arbitrary control flow. In Supercomputing '90; Proceedings of the 1990 ACM/IEEE conference on Supercomputing, pages 407-416, Washington, DC, USA, 1990. IEEE Computer Society.
-
(1990)
Supercomputing '90; Proceedings of the 1990 ACM/IEEE Conference on Supercomputing
, pp. 407-416
-
-
Kennedy, K.1
McKinley, K.S.2
-
27
-
-
85012982762
-
Techniques to overlap computation and communication in irregular iterative applications
-
New York, NY, USA. ACM Press
-
A. Lain and P. Banerjee. Techniques to overlap computation and communication in irregular iterative applications. In ICS '94: Proceedings of the 8th international conference on Supercomputing, pages 236-245, New York, NY, USA, 1994. ACM Press.
-
(1994)
ICS '94: Proceedings of the 8th International Conference on Supercomputing
, pp. 236-245
-
-
Lain, A.1
Banerjee, P.2
-
29
-
-
0346675759
-
Compiler and software distributed shared memory support for irregular applications
-
H. Lu, A. L. Cox, S. Dwarkadas, R. Rajamony, and W. Zwaenepoel. Compiler and software distributed shared memory support for irregular applications. In Proc. of the Sixth ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (PPOPP'97), pages 48-56, 1997.
-
(1997)
Proc. of the Sixth ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (PPOPP'97)
, pp. 48-56
-
-
Lu, H.1
Cox, A.L.2
Dwarkadas, S.3
Rajamony, R.4
Zwaenepoel, W.5
-
30
-
-
0347133221
-
Optimizing OpenMP programs on software distributed shared memory systems
-
June
-
S.-J. Min, A. Basumallik, and R. Eigenmann. Optimizing OpenMP programs on Software Distributed Shared Memory Systems. International Journal of Parallel Programming, 31(3):225-249, June 2003.
-
(2003)
International Journal of Parallel Programming
, vol.31
, Issue.3
, pp. 225-249
-
-
Min, S.-J.1
Basumallik, A.2
Eigenmann, R.3
-
32
-
-
0028752664
-
Enabling unimodular transformations
-
Los Alamitos, CA, USA, IEEE Computer Society Press
-
R. Sass and M. Mutka. Enabling unimodular transformations. In Supercomputing '94: Proceedings of the 1994 conference on Supercomputing, pages 753-762, Los Alamitos, CA, USA, 1994. IEEE Computer Society Press.
-
(1994)
Supercomputing '94: Proceedings of the 1994 Conference on Supercomputing
, pp. 753-762
-
-
Sass, R.1
Mutka, M.2
-
33
-
-
44949282112
-
Partitioning of unstructured problems for parallel processing
-
H. Simon. Partitioning of unstructured problems for parallel processing. Computing Systems in Engineering, 2(2-3):135-148, 1991.
-
(1991)
Computing Systems in Engineering
, vol.2
, Issue.2-3
, pp. 135-148
-
-
Simon, H.1
-
35
-
-
33751027415
-
Array privatization for shared and distributed memory machines
-
P. Tu and D. Padua. Array privatization for shared and distributed memory machines (extended abstract). SIGPLAN Not., 28(1):64-67, 1993.
-
(1993)
SIGPLAN Not.
, vol.28
, Issue.1
, pp. 64-67
-
-
Tu, P.1
Padua, D.2
-
36
-
-
0029712979
-
Data-localization for fortran macro-dataflow computation using partial static task assignment
-
Philadelphia, Pennsylvania, USA. ACM Press
-
A. Yoshida, K. Koshizuka, and H. Kasahara. Data-localization for fortran macro-dataflow computation using partial static task assignment. In ICS '96: Proceedings of the 10th international conference on Supercomputing, pages 61-68, Philadelphia, Pennsylvania, USA, 1996. ACM Press.
-
(1996)
ICS '96: Proceedings of the 10th International Conference on Supercomputing
, pp. 61-68
-
-
Yoshida, A.1
Koshizuka, K.2
Kasahara, H.3
|