SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 6305 LNCS, Issue , 2010, Pages 229-238

Load balancing for regular meshes on SMPs with MPI

(2) Kale, Vivek a Gropp, William a

a UNIVERSITY OF ILLINOIS AT URBANA CHAMPAIGN (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DOMAIN DECOMPOSITIONS; JACOBI ALGORITHM; LOAD-BALANCING; MPI APPLICATIONS; PARALLEL COMPUTER; PERFORMANCE GAIN; REGULAR MESHES; SCHEDULING STRATEGIES; SYSTEM NOISE;

DOMAIN DECOMPOSITION METHODS; INTERFACES (COMPUTER); SHAPE MEMORY EFFECT;

MESSAGE PASSING;

EID: 78149277691 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-15646-5_24 Document Type: Conference Paper

Times cited : (12)

References (9)

1
- 67650998701
- Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms
- Williams, S., Carter, J., Oliker, L., Shalf, J., Yelick, K.A.: Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms. Journal of Parallel and Distributed Computing (2009)
- (2009) Journal of Parallel and Distributed Computing
- Williams, S.¹ Carter, J.² Oliker, L.³ Shalf, J.⁴ Yelick, K.A.⁵

2
- 3042632157
- MPI versus MPI+OpenMP on IBM SP for the NAS benchmarks
- IEEE Computer Society, Los Alamitos
- Cappello, F., Etiemble, D.: MPI versus MPI+OpenMP on IBM SP for the NAS benchmarks. In: Supercomputing 2000: Proceedings of the 2000 ACM/IEEE conference on Supercomputing (CDROM), Washington, DC, USA. IEEE Computer Society, Los Alamitos (2000)
- (2000) Supercomputing 2000: Proceedings of the 2000 ACM/IEEE Conference on Supercomputing (CDROM), Washington, DC, USA
- Cappello, F.¹ Etiemble, D.²

3
- 70450079998
- Handling OS jitter on multicore multithreaded systems
- IEEE Computer Society Press, Los Alamitos
- Mann, P.D.V., Mittaly, U.: Handling OS jitter on multicore multithreaded systems. In: IPDPS 2009: Proceedings of the 2009 IEEE International Symposium on Parallel and Distributed Processing, Washington, DC, USA. IEEE Computer Society Press, Los Alamitos (2009)
- (2009) IPDPS 2009: Proceedings of the 2009 IEEE International Symposium on Parallel and Distributed Processing, Washington, DC, USA
- Mann, P.D.V.¹ Mittaly, U.²

4
- 69049084475
- The bottom-up implementation of one MILC lattice QCD application on the Cell blade
- Shi, G., Kindratenko, V., Gottlieb, S.: The bottom-up implementation of one MILC lattice QCD application on the Cell blade. International Journal of Parallel Programming 37 (2009)
- (2009) International Journal of Parallel Programming , vol.37
- Shi, G.¹ Kindratenko, V.² Gottlieb, S.³

5
- 78249252490
- A generalized framework for auto-tuning stencil computations
- Kamil, S., Chan, C., Williams, S., Oliker, L., Shalf, J., Howison, M., Bethel, E.W.: A generalized framework for auto-tuning stencil computations. In: Proceedings of the Cray User Group Conference (2009)
- Proceedings of the Cray User Group Conference (2009)
- Kamil, S.¹ Chan, C.² Williams, S.³ Oliker, L.⁴ Shalf, J.⁵ Howison, M.⁶ Bethel, E.W.⁷

6
- 0029191296
- Cilk: An efficient multithreaded runtime system
- Blumofe, R.D., Joerg, C.F., Kuszmaul, B.C., Leiserson, C.E., Randall, K.H., Zhou, Y.: Cilk: An efficient multithreaded runtime system. Journal of Parallel and Distributed Computing (1995)
- (1995) Journal of Parallel and Distributed Computing
- Blumofe, R.D.¹ Joerg, C.F.² Kuszmaul, B.C.³ Leiserson, C.E.⁴ Randall, K.H.⁵ Zhou, Y.⁶

7
- 74049102092
- Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
- ACM, New York
- Song, F., YarKhan, A., Dongarra, J.: Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems. In: SC 2009: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. ACM, New York (2009)
- (2009) SC 2009: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
- Song, F.¹ YarKhan, A.² Dongarra, J.³

8
- 84877019178
- The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q
- IEEE Computer Society Press, Los Alamitos
- Petrini, F., Kerbyson, D.J., Pakin, S.: The case of the missing supercomputer performance: Achieving optimal performance on the 8,192 processors of ASCI Q. In: SC 2003: Proceedings of the 2003 ACM/IEEE conference on Supercomputing, Washington, DC, USA, IEEE Computer Society Press, Los Alamitos (2003)
- (2003) SC 2003: Proceedings of the 2003 ACM/IEEE Conference on Supercomputing, Washington, DC, USA
- Petrini, F.¹ Kerbyson, D.J.² Pakin, S.³

9
- 78149278896
- Klug, T., Ott, M., Weidendorfer, J., Trinitis, C., Müchen, T.U.: Autopin, automated optimization of thread-to-core pinning on multicore systems (2008)
- (2008) Autopin, Automated Optimization of Thread-to-core Pinning on Multicore Systems
- Klug, T.¹ Ott, M.² Weidendorfer, J.³ Trinitis, C.⁴ Müchen, T.U.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.