-
1
-
-
49149108553
-
Memory and thread placement effects as a function of cache usage: A study of the Gaussian chemistry code on the SunFire X4600 M2
-
R. Yang, J. Antony, P. P. Janes, and A. P. Rendell, "Memory and Thread Placement Effects as a Function of Cache Usage: A Study of the Gaussian Chemistry Code on the SunFire X4600 M2," in Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks (i-span 2008), 2008, pp. 31-36.
-
(2008)
Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks (i-span 2008)
, pp. 31-36
-
-
Yang, R.1
Antony, J.2
Janes, P.P.3
Rendell, A.P.4
-
3
-
-
78149274127
-
Adaptive MPI multirail tuning for non-uniform input/output access
-
S. Moreaud, B. Goglin, and R. Namyst, "Adaptive MPI Multirail Tuning for Non-Uniform Input/Output Access," in Recent Advances in the Message Passing Interface. The 17th European MPI User's Group Meeting (EuroMPI 2010), ser. Lecture Notes in Computer Science, E. G. Rainer Keller and J. Dongarra, Eds., vol. 6305. Stuttgart, Germany: Springer-Verlag, Sep. 2010, pp. 239-248. [Online]. Available: http://hal.inria.fr/inria-00486178
-
(2010)
Recent Advances in the Message Passing Interface. The 17th European MPI User's Group Meeting (EuroMPI 2010)
, vol.6305
, pp. 239-248
-
-
Moreaud, S.1
Goglin, B.2
Namyst, R.3
-
4
-
-
0037957323
-
The AMD Opteron processor for multiprocessor servers
-
C. N. Keltcher, K. J. McGrath, A. Ahmed, and P. Conway, "The AMD Opteron Processor for Multiprocessor Servers," IEEE Micro, vol. 23, no. 2, pp. 66-76, Mar. 2003.
-
(2003)
IEEE Micro
, vol.23
, Issue.2
, pp. 66-76
-
-
Keltcher, C.N.1
McGrath, K.J.2
Ahmed, A.3
Conway, P.4
-
5
-
-
70449693703
-
-
Intel Corp., "An Introduction to the Intel QuickPath Interconnect," Jan. 2009. [Online]. Available: http://www.intel.com/technology/quickpath/introduction.pdf
-
(2009)
An Introduction to the Intel QuickPath Interconnect
-
-
-
6
-
-
78249264728
-
Near-optimal placement of MPI processes on hierarchical NUMA architecture
-
E. Jeannot and G. Mercier, "Near-optimal placement of MPI processes on hierarchical NUMA architecture," in Proceedings of the 16th International Euro-Par Conference, ser. Lecture Notes in Computer Science, vol. 6272. Ischia, Italy: Springer, Aug. 2010.
-
(2010)
Proceedings of the 16th International Euro-Par Conference, ser. Lecture Notes in Computer Science
, vol.6272
-
-
Jeannot, E.1
Mercier, G.2
-
7
-
-
35248859849
-
Improving the performance of collective operations in MPICH
-
R. Thakur and W. Gropp, "Improving the Performance of Collective Operations in MPICH," in Proceedings of the 10th European PVM/MPI Users' Group Meeting (Euro PVM/MPI 2003), Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes in Computer Science, vol. 2840. Venice, Italy: Springer, Sep. 2003, pp. 257-267.
-
(2003)
Proceedings of the 10th European PVM/MPI Users' Group Meeting (Euro PVM/MPI 2003)
, vol.2840
, pp. 257-267
-
-
Thakur, R.1
Gropp, W.2
-
8
-
-
56449097786
-
MPI support for multi-core architectures: Optimized shared memory collectives
-
R. L. Graham and G. Shipman, "MPI Support for Multi-core Architectures: Optimized Shared Memory Collectives," in Proceedings of the 15th European PVM/MPI Users' Group Meeting, Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes in Computer Science, vol. 5208. Dublin, Ireland: Springer-Verlag, Sep. 2008, pp. 130-140.
-
(2008)
Proceedings of the 15th European PVM/MPI Users' Group Meeting, Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes in Computer Science
, vol.5208
, pp. 130-140
-
-
Graham, R.L.1
Shipman, G.2
-
9
-
-
50649091849
-
MPI collectives on modern multicore clusters: Performance optimizations and communication characteristics
-
A. R. Mamidala, R. Kumar, D. De, and D. K. Panda, "MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics," in Proceedings of the Int'l Symposium on Cluster Computing and the Grid (CCGrid). Lyon, France: IEEE Computer Society Press, May 2008.
-
(2008)
Proceedings of the Int'l Symposium on Cluster Computing and the Grid (CCGrid)
-
-
Mamidala, A.R.1
Kumar, R.2
De, D.3
Panda, D.K.4
-
10
-
-
51049092668
-
Scaling alltoall collective on multi-core systems
-
R. Kumar, A. Mamidala, and D. K. Panda, "Scaling Alltoall Collective on Multi-core Systems," in Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS '08. Miami, USA: IEEE Computer Society Press, Apr. 2008.
-
(2008)
Workshop on Communication Architecture for Clusters, Held in Conjunction with IPDPS '08
-
-
Kumar, R.1
Mamidala, A.2
Panda, D.K.3
-
11
-
-
35048884271
-
Open MPI: Goals, concept, and design of a next generation MPI implementation
-
E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra, J. M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R. H. Castain, D. J. Daniel, R. L. Graham, and T. S. Woodall, "Open MPI: Goals, concept, and design of a next generation MPI implementation," in Proceedings, 11th European PVM/MPI Users' Group Meeting, Budapest, Hungary, Sep. 2004, pp. 97-104.
-
(2004)
Proceedings, 11th European PVM/MPI Users' Group Meeting
, pp. 97-104
-
-
Gabriel, E.1
Fagg, G.E.2
Bosilca, G.3
Angskun, T.4
Dongarra, J.J.5
Squyres, J.M.6
Sahay, V.7
Kambadur, P.8
Barrett, B.9
Lumsdaine, A.10
Castain, R.H.11
Daniel, D.J.12
Graham, R.L.13
Woodall, T.S.14
-
12
-
-
33745201924
-
The component architecture of Open MPI: Enabling third-party collective algorithms
-
J. M. Squyres and A. Lumsdaine, "The component architecture of Open MPI: Enabling third-party collective algorithms," in Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications, V. Getov and T. Kielmann, Eds. St. Malo, France: Springer, July 2004, pp. 167-185.
-
(2004)
Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications
, pp. 167-185
-
-
Squyres, J.M.1
Lumsdaine, A.2
-
13
-
-
83455261521
-
Tuned: An Open MPI collective communications component
-
G. Fagg, G. Bosilca, J. Pješivac-Grbović, T. Angskun, and J. Dongarra, "Tuned: An Open MPI Collective Communications Component," in Distributed and Parallel Systems, P. Kacsuk, T. Fahringer, and Z. Németh, Eds. Springer, 2007, pp. 65-72.
-
(2007)
Distributed and Parallel Systems
, pp. 65-72
-
-
Fagg, G.1
Bosilca, G.2
Pješivac-Grbović, J.3
Angskun, T.4
Dongarra, J.5
-
14
-
-
33947638068
-
-
"Intel MPI Benchmarks," http://software.intel.com/en-us/articles/intel-mpi-benchmarks/.
-
Intel MPI Benchmarks
-
-
-
15
-
-
77950930965
-
MiAMI: Multi-core aware processor affinity for TCP/IP over multiple network interfaces
-
H.-C. Jang and H.-W. Jin, "MiAMI: Multi-core Aware Processor Affinity for TCP/IP over Multiple Network Interfaces," in Proceedings of the 17th Annual Symposium on High-Performance Interconnects (HotI'09), New York, USA, Aug. 2009, pp. 73-82.
-
(2009)
Proceedings of the 17th Annual Symposium on High-Performance Interconnects (HotI'09)
, pp. 73-82
-
-
Jang, H.-C.1
Jin, H.-W.2
-
16
-
-
70450059324
-
Designing multi-leader-based allgather algorithms for multi-core clusters
-
K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop, and D. K. Panda, "Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters," in CAC 2009: The 9th Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2009. Roma, Italy: IEEE Computer Society Press, May 2009.
-
(2009)
CAC 2009: The 9th Workshop on Communication Architecture for Clusters
-
-
Kandalla, K.1
Subramoni, H.2
Santhanaraman, G.3
Koop, M.4
Panda, D.K.5