[2] William Gropp, Ewing Lusk, and Anthony Skjellum, Using MPI: Portable Parallel Programming with the Message-Passing Interface, MIT Press, 2nd edition, 1999.
[3] Jesper Larsson Träff, Rolf Hempel, Hubert Ritzdorf, and Falk Zimmermann, "Flattening on the Fly: Efficient Handling of MPI Derived Datatypes," in Proceedings of the 6th European PVM/MPI Users' Group Meeting, Lecture Notes in Computer Science, Vol. 1697, Springer, pp. 109-116, 1999.
[4] William Gropp, Ewing Lusk, and Deborah Swider, "Improving the Performance of MPI Derived Datatypes," in Proceedings of the Third MPI Developer's and User's Conference, MPI Software Technology Press, pp. 25-30, March 1999.
[7] Monica Lam, Edward E. Rothberg, and Michael E. Wolf, "The Cache Performance of Blocked Algorithms," in Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 63-74, April 1991.
[8] M. Kandemir, J. Ramanujam, and A. Choudhary, "Cache Locality by a Combination of Loop and Data Transformations," IEEE Transactions on Computers (TC), 48(2):159-167, February 1999.
[12] R. Clint Whaley, Antoine Petitet, and Jack Dongarra, "Automated Empirical Optimizations of Software and the ATLAS Project," Parallel Computing, 27(1-2):3-25, 2001.
[14] David Culler, Richard Karp, David Patterson, Abhijit Sahay, Klaus Erik Schauser, Eunice Santos, Ramesh Subramonian, and Thorsten von Eicken, "LogP: Towards a Realistic Model of Parallel Computation," in Proceedings of the Fourth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 1-12, May 1993.
[16] Ashley Saulsbury, Fredrik Dahlgren, and Per Stenström, "Recency-based TLB Preloading," in Proceedings of the 27th Annual International Symposium on Computer Architecture, pp. 117-127, June 2000.
[17] R. H. Saavedra, R. S. Gaines, and M. J. Carlton, "Micro Benchmark Analysis of the KSR1," in Proceedings of Supercomputing '93, pp. 202-213, December 1993.