SCOPUS Information Search Platform

Proceedings - IEEE International Conference on Cluster Computing, ICCC

Volume , Issue , 2009, Pages

Analytical modeling and optimization for affinity based thread scheduling on multicore systems

(3) Song, Fengguang a; Moore, Shirley b; Dongarra, Jack c

a University of Tennessee (United States)

b Oak Ridge National Laboratory (United States)

c University of Manchester (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

ANALYTICAL MODEL; ANALYTICAL MODELING; CHOLESKY FACTORIZATIONS; DATA DEPENDENCE; ESTIMATED COSTS; MEMORY HIERARCHY; MULTI-CORE SYSTEMS; NEAR-OPTIMAL SOLUTIONS; NP-HARDNESS; PROGRAM PERFORMANCE; REAL-WORLD APPLICATION; SCHEDULING PROBLEM; SUBMODELS; THREAD SCHEDULING;

APPROXIMATION ALGORITHMS; CLUSTER COMPUTING; COMPUTATIONAL EFFICIENCY; COMPUTATIONAL FLUID DYNAMICS; COMPUTER SCIENCE; COMPUTER SIMULATION; COST BENEFIT ANALYSIS; COSTS; MODELS; OPTIMIZATION; SHAPE MEMORY EFFECT; TECHNICAL PRESENTATIONS;

MULTITASKING;

EID: 72049130291 PISSN: 15525244 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CLUSTR.2009.5289173 Document Type: Conference Paper

Times cited : (15)

References (16)

1
- 70350626161
- R. Golla, "Niagara2: A highly threaded server-on-a-chip," 2007.
- (2007) Niagara2: A Highly Threaded Server-on-a-chip
- Golla, R.¹

2
- 49249086142
- Larrabee: A many-core X86 architecture for visual computing
- L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan, and P. Hanrahan, "Larrabee: a many-core X86 architecture for visual computing," ACM Trans. Graph., vol.27, no.3, pp. 1-15, 2008.
- (2008) ACM Trans. Graph. , vol.27 , Issue.3 , pp. 1-15
- Seiler, L.¹ Carmean, D.² Sprangle, E.³ Forsyth, T.⁴ Abrash, M.⁵ Dubey, P.⁶ Junkins, S.⁷ Lake, A.⁸ Sugerman, J.⁹ Cavin, R.¹⁰ Espasa, R.¹¹ Grochowski, E.¹² Juan, T.¹³ Hanrahan, P.¹⁴

3
- 37549032725
- IBM Power6 microarchitecture
- H. Q. Le, W. J. Starke, J. S. Fields, F. P. O'Connell, D. Q. Nguyen, B. J. Ronchetti, W. M. Sauer, E. M. Schwarz, and M. T. Vaden, "IBM Power6 microarchitecture," IBM J. Res. Dev., vol.51, no.6, pp. 639-662, 2007.
- (2007) IBM J. Res. Dev. , vol.51 , Issue.6 , pp. 639-662
- Le, H.Q.¹ Starke, W.J.² Fields, J.S.³ O'Connell, F.P.⁴ Nguyen, D.Q.⁵ Ronchetti, B.J.⁶ Sauer, W.M.⁷ Schwarz, E.M.⁸ Vaden, M.T.⁹

4
- 27544432558
- The impact of performance asymmetry in emerging multicore architectures
- S. Balakrishnan, R. Rajwar, M. Upton, and K. K. Lai, "The impact of performance asymmetry in emerging multicore architectures." in ISCA. IEEE Computer Society, 2005, pp. 506-517.
- (2005) ISCA. IEEE Computer Society , pp. 506-517
- Balakrishnan, S.¹ Rajwar, R.² Upton, M.³ Lai, K.K.⁴

5
- 34247174509
- Core architecture optimization for heterogeneous chip multiprocessors
- R. Kumar, D. M. Tullsen, and N. P. Jouppi, "Core architecture optimization for heterogeneous chip multiprocessors." in PACT, E. R. Altman, K. Skadron, and B. G. Zorn, Eds. ACM, 2006, pp. 23-32.
- (2006) PACT , pp. 23-32
- Kumar, R.¹ Tullsen, D.M.² Jouppi, N.P.³

6
- 4644370318
- Single-ISA heterogeneous multi-core architectures for multithreaded workload performance
- R. Kumar, D. M. Tullsen, P. Ranganathan, N. P. Jouppi, and K. I. Farkas, "Single-ISA heterogeneous multi-core architectures for multithreaded workload performance." in ISCA. IEEE Computer Society, 2004, pp. 64-75.
- (2004) ISCA. IEEE Computer Society , pp. 64-75
- Kumar, R.¹ Tullsen, D.M.² Ranganathan, P.³ Jouppi, N.P.⁴ Farkas, K.I.⁵

7
- 34548088089
- Feedback-directed thread scheduling with memory considerations
- F. Song, S. Moore, and J. Dongarra, "Feedback-directed thread scheduling with memory considerations," in HPDC '07: Proceedings of the 16th international symposium on High performance distributed computing, 2007, pp. 97-106.
- (2007) HPDC '07: Proceedings of the 16th International Symposium on High Performance Distributed Computing , pp. 97-106
- Song, F.¹ Moore, S.² Dongarra, J.³

8
- 72049121618
- Analytical modeling for affinity-based thread scheduling on multicore platforms
- F. Song, S. Moore, and J. Dongarra, "Analytical modeling for affinity-based thread scheduling on multicore platforms," University of Tennessee, Computer Science Tech. Rep. UT-CS-08-626, 2008.
- (2008) Analytical Modeling for Affinity-based Thread Scheduling on Multicore Platforms
- Song, F.¹ Moore, S.² Dongarra, J.³

9
- 0003876316
- Task scheduling in parallel and distributed systems
- H. El-Rewini, T. G. Lewis, and H. H. Ali, Task scheduling in parallel and distributed systems. Upper Saddle River, NJ, USA: Prentice-Hall, Inc., 1994.
- (1994) Task Scheduling in Parallel and Distributed Systems
- El-Rewini, H.¹ Lewis, T.G.² Ali, H.H.³

10
- 0030259746
- Thread scheduling for cache locality
- J. Philbin, J. Edler, O. J. Anshus, C. C. Douglas, and K. Li, "Thread scheduling for cache locality." in ASPLOS, 1996, pp. 60-71.
- (1996) ASPLOS , pp. 60-71
- Philbin, J.¹ Edler, J.² Anshus, O.J.³ Douglas, C.C.⁴ Li, K.⁵

11
- 0346502782
- Restructuring computations for temporal data cache locality
- V. K. Pingali, S. A. McKee, W. C. Hsieh, and J. B. Carter, "Restructuring computations for temporal data cache locality." International Journal of Parallel Programming, vol.31, no.4, pp. 305-338, 2003.
- (2003) International Journal of Parallel Programming , vol.31 , Issue.4 , pp. 305-338
- Pingali, V.K.¹ McKee, S.A.² Hsieh, W.C.³ Carter, J.B.⁴

12
- 0242422577
- Implementing the MPI process topology mechanism
- J. L. Träff, "Implementing the MPI process topology mechanism," in Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing. Los Alamitos, CA, USA: IEEE Computer Society Press, 2002, pp. 1-14.
- (2002) Supercomputing '02: Proceedings of the 2002 ACM/IEEE Conference on Supercomputing , pp. 1-14
- Träff, J.L.¹

13
- 3042618790
- Improving the locality of the sparse matrix-vector product on shared memory multiprocessors
- J. C. Pichel, D. B. Heras, J. C. Cabaleiro, and F. F. Rivera, "Improving the locality of the sparse matrix-vector product on shared memory multiprocessors." in PDP. IEEE Computer Society, 2004, pp. 66-71.
- (2004) PDP. IEEE Computer Society , pp. 66-71
- Pichel, J.C.¹ Heras, D.B.² Cabaleiro, J.C.³ Rivera, F.F.⁴

14
- 33846984912
- A new technique to reduce false sharing in parallel irregular codes based on distance functions
- J. C. Pichel, D. B. Heras, J. C. Cabaleiro, and F. F. Rivera, "A new technique to reduce false sharing in parallel irregular codes based on distance functions," in ISPAN. IEEE Computer Society, 2005, pp. 306-311.
- (2005) ISPAN. IEEE Computer Society , pp. 306-311
- Pichel, J.C.¹ Heras, D.B.² Cabaleiro, J.C.³ Rivera, F.F.⁴

15
- 0031215997
- How good is recursive bisection?
- H. D. Simon and S.-H. Teng, "How good is recursive bisection?" SIAM J. Sci. Comput., vol.18, no.5, pp. 1436-1445, 1997.
- (1997) SIAM J. Sci. Comput. , vol.18 , Issue.5 , pp. 1436-1445
- Simon, H.D.¹ Teng, S.-H.²

16
- 0012453312
- University of Florida sparse matrix collection
- T. Davis, "University of Florida sparse matrix collection." [Online]. Available: http://www.cise.ufl.edu/research/sparse
- University of Florida Sparse Matrix Collection
- Davis, T.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.