SCOPUS 정보 검색 플랫폼

IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum

Volumn , Issue , 2011, Pages 788-794

Dodging non-uniform I/O access in hierarchical collective operations for multicore clusters

(2) Goglin, Brice a Moreaud, Stéphanie a

a UNIVERSITÉ DE BORDEAUX (France)

Author keywords

[No Author keywords available]

Indexed keywords

BROADCAST OPERATIONS; CLUSTER TOPOLOGY; COLLECTIVE OPERATIONS; HYPERTRANSPORT; I/O DEVICE; INPUT/OUTPUT; MEMORY BANKS; MULTI-CORE CLUSTER; MULTIPLE STRATEGY; NETWORK ACCESS; NON UNIFORM MEMORY ACCESS; SCALABILITY ISSUE; SINGLE PROCESSORS; THROUGHPUT IMPROVEMENT;

DISTRIBUTED PARAMETER NETWORKS;

MEMORY ARCHITECTURE;

EID: 83455229663 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2011.222 Document Type: Conference Paper

Times cited : (13)

References (16)

1
- 49149108553
- Memory and thread placement effects as a function of cache usage: A study of the Gaussian chemistry code on the SunFire X4600 M2
- R. Yang, J. Antony, P. P. Janes, and A. P. Rendell, "Memory and Thread Placement Effects as a Function of Cache Usage: A Study of the Gaussian Chemistry Code on the SunFire X4600 M2," in Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks (i-span 2008), 2008, pp. 31-36.
- (2008) Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks (i-span 2008) , pp. 31-36
- Yang, R.¹ Antony, J.² Janes, P.P.³ Rendell, A.P.⁴

2
- 62449293027
- Impact of NUMA effects on high-speed networking with multi-opteron machines
- Cambridge, USA: ACTA Press, Nov. [Online]. Available
- S. Moreaud and B. Goglin, "Impact of NUMA Effects on High-Speed Networking with Multi-Opteron Machines," in The 19th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2007). Cambridge, USA: ACTA Press, Nov. 2007. [Online]. Available: http://hal.inria.fr/ inria-00175747
- (2007) The 19th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2007)
- Moreaud, S.¹ Goglin, B.²

3
- 78149274127
- Adaptive MPI multirail tuning for non-uniform input/output access
- ser. Lecture Notes in Computer Science, E. G. Rainer Keller and J. Dongarra, Eds. Stuttgart, Germany: Springer-Verlag, Sep. [Online]. Available
- S. Moreaud, B. Goglin, and R. Namyst, "Adaptive MPI Multirail Tuning for Non-Uniform Input/Output Access," in Recent Advances in the Message Passing Interface. The 17th European MPI User's Group Meeting (EuroMPI 2010), ser. Lecture Notes in Computer Science, E. G. Rainer Keller and J. Dongarra, Eds., vol. 6305. Stuttgart, Germany: Springer-Verlag, Sep. 2010, pp. 239-248. [Online]. Available: http://hal.inria.fr/inria-00486178
- (2010) Recent Advances in the Message Passing Interface. The 17th European MPI User's Group Meeting (EuroMPI 2010) , vol.6305 , pp. 239-248
- Moreaud, S.¹ Goglin, B.² Namyst, R.³

4
- 0037957323
- The AMD opteron processor for multiprocessor servers
- Mar.
- C. N. Keltcher, K. J. McGrath, A. Ahmed, and P. Conway, "The AMD Opteron Processor for Multiprocessor Servers," IEEE Micro, vol. 23, no. 2, pp. 66-76, Mar. 2003.
- (2003) IEEE Micro , vol.23 , Issue.2 , pp. 66-76
- Keltcher, C.N.¹ McGrath, K.J.² Ahmed, A.³ Conway, P.⁴

5
- 70449693703
- I. Corp. Jan. [Online]. Available
- I. Corp., "An Introduction to the Intel QuicPath Interconnect," Jan. 2009. [Online]. Available: http://www.intel.com/technology/quickpath/ introduction.pdf
- (2009) An Introduction to the Intel QuicPath Interconnect

6
- 78249264728
- Near-optimal placement of MPI processes on hierarchical NUMA architecture
- Ischia, Italy: Springer, Aug.
- E. Jeannot and G. Mercier, "Near-optimal placement of MPI processes on hierarchical NUMA architecture," in Proceedings of the 16th International Euro-Par Conference, Lecture Notes in Computer Science, ser. Lecture Notes in Computer Science, vol. 6272. Ischia, Italy: Springer, Aug. 2010.
- (2010) Proceedings of the 16th International Euro-par Conference, Lecture Notes in Computer Science, ser. Lecture Notes in Computer Science , vol.6272
- Jeannot, E.¹ Mercier, G.²

7
- 35248859849
- Improving the performance of collective operations in MPICH
- Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes in Computer Science Venice, Italy: Springer, Sep.
- R. Thakur and W. Gropp, "Improving the Performance of Collective Operations in MPICH," in Proceedings of the 10th European PVM/MPI Users' Group Meeting (Euro PVM/MPI 2003), Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes in Computer Science, vol. 2840. Venice, Italy: Springer, Sep. 2003, pp. 257-267.
- (2003) Proceedings of the 10th European PVM/MPI Users' Group Meeting (Euro PVM/MPI 2003) , vol.2840 , pp. 257-267
- Thakur, R.¹ Gropp, W.²

8
- 56449097786
- MPI support for multi-core architectures: Optimized shared memory collectives
- Dublin, Ireland: Springer-Verlag, Sep.
- R. L. Graham and G. Shipman, "MPI Support for Multi-core Architectures: Optimized Shared Memory Collectives," in Proceedings of the 15th European PVM/MPI Users' Group Meeting, Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes In Computer Science, vol. 5208. Dublin, Ireland: Springer-Verlag, Sep. 2008, pp. 130-140.
- (2008) Proceedings of the 15th European PVM/MPI Users' Group Meeting, Recent Advances in Parallel Virtual Machine and Message Passing Interface, ser. Lecture Notes in Computer Science , vol.5208 , pp. 130-140
- Graham, R.L.¹ Shipman, G.²

9
- 50649091849
- MPI collectives on modern multicore clusters: Performance optimizations and communication characteristics
- Lyon, France: IEEE Computer Society Press, May
- A. R. Mamidala, R. Kumar, D. De, and D. K. Panda, "MPI Collectives on Modern Multicore Clusters: Performance Optimizations and Communication Characteristics," in Proceedings of the Int'l Symposium on Cluster Computing and the Grid (CCGrid). Lyon, France: IEEE Computer Society Press, May 2008.
- (2008) Proceedings of the Int'l Symposium on Cluster Computing and the Grid (CCGrid)
- Mamidala, A.R.¹ Kumar, R.² De, D.³ Panda, D.K.⁴

10
- 51049092668
- Scaling alltoall collective on multi-core systems
- Miami, USA: IEEE Computer Society Press, Apr.
- R. Kumar, A. Mamidala, and D. K. Panda, "Scaling Alltoall Collective on Multi-core Systems," in Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS '08. Miami, USA: IEEE Computer Society Press, Apr. 2008.
- (2008) Workshop on Communication Architecture for Clusters, Held in Conjunction with IPDPS '08
- Kumar, R.¹ Mamidala, A.² Panda, D.K.³

11
- 35048884271
- Open MPI: Goals, concept, and design of a next generation MPI implementation
- Budapest, Hungary, Sep.
- E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra, J. M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R. H. Castain, D. J. Daniel, R. L. Graham, and T. S. Woodall, "Open MPI: Goals, concept, and design of a next generation MPI implementation," in Proceedings, 11th European PVM/MPI Users' Group Meeting, Budapest, Hungary, Sep. 2004, pp. 97-104.
- (2004) Proceedings, 11th European PVM/MPI Users' Group Meeting , pp. 97-104
- Gabriel, E.¹ Fagg, G.E.² Bosilca, G.³ Angskun, T.⁴ Dongarra, J.J.⁵ Squyres, J.M.⁶ Sahay, V.⁷ Kambadur, P.⁸ Barrett, B.⁹ Lumsdaine, A.¹⁰ Castain, R.H.¹¹ Daniel, D.J.¹² Graham, R.L.¹³ Woodall, T.S.¹⁴

12
- 33745201924
- The component architecture of open MPI: Enabling third-party collective algorithms
- V. Getov and T. Kielmann, Eds. St. Malo, France: Springer, July
- J. M. Squyres and A. Lumsdaine, "The component architecture of open MPI: Enabling third-party collective algorithms," in Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications, V. Getov and T. Kielmann, Eds. St. Malo, France: Springer, July 2004, pp. 167-185.
- (2004) Proceedings, 18th ACM International Conference on Supercomputing, Workshop on Component Models and Systems for Grid Applications , pp. 167-185
- Squyres, J.M.¹ Lumsdaine, A.²

13
- 83455261521
- Tuned: An open MPI collective communications component
- P. Kacsuk, T. Fahringer, and Z. Nmeth, Eds. Springer
- G. Fagg, G. Bosilca, J. Pjeivac-Grbovi, T. Angskun, and J. Dongarra, "Tuned: An Open MPI Collective Communications Component," in Distributed and Parallel Systems, P. Kacsuk, T. Fahringer, and Z. Nmeth, Eds. Springer, 2007, pp. 65-72.
- (2007) Distributed and Parallel Systems , pp. 65-72
- Fagg, G.¹ Bosilca, G.² Pjeivac-Grbovi, J.³ Angskun, T.⁴ Dongarra, J.⁵

14
- 33947638068
- "Intel MPI Benchmarks," http://software.intel.com/en-us/ articles/intel-mpi-benchmarks/.
- Intel MPI Benchmarks

15
- 77950930965
- MiAMI: Multi-core aware processor affinity for TCP/IP over multiple network interfaces
- New York, USA, Aug.
- H.-C. Jang and H.-W. Jin, "MiAMI: Multi-core Aware Processor Affinity for TCP/IP over Multiple Network Interfaces," in Proceedings of the 17th Annual Symposium on HighPerformance Interconnects (HotI'09), New York, USA, Aug. 2009, pp. 73-82.
- (2009) Proceedings of the 17th Annual Symposium on HighPerformance Interconnects (HotI'09) , pp. 73-82
- Jang, H.-C.¹ Jin, H.-W.²

16
- 70450059324
- Designing multi-leader-based allgather algorithms for multi-core clusters
- held in conjunction with IPDPS 2009. Roma, Italy: IEEE Computer Society Press, May
- K. Kandalla, H. Subramoni, G. Santhanaraman, M. Koop, and D. K. Panda, "Designing Multi-Leader-Based Allgather Algorithms for Multi-Core Clusters," in CAC 2009: The 9th Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2009. Roma, Italy: IEEE Computer Society Press, May 2009.
- (2009) CAC 2009: The 9th Workshop on Communication Architecture for Clusters
- Kandalla, K.¹ Subramoni, H.² Santhanaraman, G.³ Koop, M.⁴ Panda, D.K.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.