SCOPUS Information Search Platform

Proceedings - International Symposium on Code Generation and Optimization, CGO 2012

Volume , Issue , 2012, Pages 230-241

Matching memory access patterns and data placement for NUMA systems

(2) Majo, Zoltan a; Gross, Thomas R. a

a ETH ZURICH (Switzerland)

Author keywords

Data placement; NUMA; Scheduling

Indexed keywords

ACCESS PATTERNS; DATA ACCESS PATTERNS; DATA DISTRIBUTION; DATA PARTITIONING; DATA PLACEMENT; LOOP SCHEDULING; MEMORY ACCESS PATTERNS; MULTI CORE; NON-UNIFORM MEMORY ARCHITECTURE; NUMA; NUMA SYSTEMS; PARALLEL PROGRAM; REMOTE ACCESS; REMOTE MEMORY ACCESS;

APPLICATION PROGRAMMING INTERFACES (API); MEMORY ARCHITECTURE; NETWORK COMPONENTS; OPTIMIZATION; PARALLEL ARCHITECTURES;

SCHEDULING;

EID: 84863469851 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2259016.2259046 Document Type: Conference Paper

Times cited : (47)

References (19)

1
- 33751022080
- Programming for parallelism and locality with hierarchically tiled arrays
- Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP'06
- G. Bikshandi, J. Guo, D. Hoeflinger, G. Almasi, B. B. Fraguela, M. J. Garzarán, D. Padua, and C. von Praun. Programming for parallelism and locality with hierarchically tiled arrays. In PPoPP'06, pages 48-57, New York, NY, USA, 2006. ACM. (Pubitemid 44758674)
- (2006) Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP , vol.2006 , pp. 48-57
- Bikshandi, G.¹ Guo, J.² Hoeflinger, D.³ Almasi, G.⁴ Fraguela, B.B.⁵ Garzaran, M.J.⁶ Padua, D.⁷ Von Praun, C.⁸

2
- 84863430235
- A case for NUMA-aware contention management on multicore systems
- Berkeley, CA, USA, USENIX Association
- S. Blagodurov, S. Zhuravlev, M. Dashti, and A. Fedorova. A case for NUMA-aware contention management on multicore systems. In USENIX ATC'11, Berkeley, CA, USA, 2011. USENIX Association.
- (2011) USENIX ATC'11
- Blagodurov, S.¹ Zhuravlev, S.² Dashti, M.³ Fedorova, A.⁴

3
- 0346043334
- Data distribution support on distributed shared memory multiprocessors
- R. Chandra, D.-K. Chen, R. Cox, D. E. Maydan, N. Nedeljkovic, and J. M. Anderson. Data distribution support on distributed shared memory multiprocessors. In PLDI'97, pages 334-345, New York, NY, USA, 1997. ACM. (Pubitemid 127453709)
- (1997) SIGPLAN Notices (ACM Special Interest Group on Programming Languages) , vol.32 , Issue.5 , pp. 334-345
- Chandra, R.¹ Chen, D.-K.² Cox, R.³ Maydan, D.E.⁴ Nedeljkovic, N.⁵ Anderson, J.M.⁶

4
- 0242276320
- Generalized multipartitioning of multi-dimensional arrays for parallelizing line-sweep computations
- DOI 10.1016/S0743-7315(03)00103-5
- A. Darte, J. Mellor-Crummey, R. Fowler, and D. Chavarría-Miranda. Generalized multipartitioning of multi-dimensional arrays for parallelizing line-sweep computations. J. Parallel Distrib. Comput., 63:887-911, September 2003. (Pubitemid 37364493)
- (2003) Journal of Parallel and Distributed Computing , vol.63 , Issue.9 , pp. 887-911
- Darte, A.¹ Mellor-Crummey, J.² Fowler, R.³ Chavarria-Miranda, D.⁴

5
- 77953995394
- What can performance counters do for memory subsystem analysis?
- New York, NY, USA, ACM
- S. Eranian. What can performance counters do for memory subsystem analysis? In MSPC'08, pages 26-30, New York, NY, USA, 2008. ACM.
- (2008) MSPC'08 , pp. 26-30
- Eranian, S.¹

6
- 0003648799
- The OpenMP implementation of NAS parallel benchmarks and its performance
- H. Jin, M. Frumkin, and J. Yan. The OpenMP implementation of NAS parallel benchmarks and its performance. Technical report, NASA Ames Research Center, 1999.
- (1999) Technical Report, NASA Ames Research Center
- Jin, H.¹ Frumkin, M.² Yan, J.³

7
- 77954696758
- Cache topology aware computation mapping for multicores
- New York, NY, USA, ACM
- M. Kandemir, T. Yemliha, S. Muralidhara, S. Srikantaiah, M. J. Irwin, and Y. Zhang. Cache topology aware computation mapping for multicores. In PLDI'10, pages 74-85, New York, NY, USA, 2010. ACM.
- (2010) PLDI'10 , pp. 74-85
- Kandemir, M.¹ Yemliha, T.² Muralidhara, S.³ Srikantaiah, S.⁴ Irwin, M.J.⁵ Zhang, Y.⁶

8
- 85023182347
- Locality and loop scheduling on NUMA multiprocessors
- CRC Press, Inc.
- H. Li, H. L. Sudarsan, M. Stumm, and K. C. Sevcik. Locality and loop scheduling on NUMA multiprocessors. In ICPP'93, pages 140-147. CRC Press, Inc, 1993.
- (1993) ICPP'93 , pp. 140-147
- Li, H.¹ Sudarsan, H.L.² Stumm, M.³ Sevcik, K.C.⁴

9
- 79959898692
- Memory management in NUMA multicore systems: Trapped between cache contention and interconnect overhead
- New York, NY, USA, ACM
- Z. Majo and T. R. Gross. Memory management in NUMA multicore systems: trapped between cache contention and interconnect overhead. In ISMM'11, pages 11-20, New York, NY, USA, 2011. ACM.
- (2011) ISMM'11 , pp. 11-20
- Majo, Z.¹ Gross, T.R.²

10
- 78650178980
- Feedback-directed page placement for cc-NUMA via hardware-generated memory traces
- December
- J. Marathe, V. Thakkar, and F. Mueller. Feedback-directed page placement for cc-NUMA via hardware-generated memory traces. J. Par. Distrib. Comput., 70:1204-1219, December 2010.
- (2010) J. Par. Distrib. Comput. , vol.70 , pp. 1204-1219
- Marathe, J.¹ Thakkar, V.² Mueller, F.³

11
- 77952562600
- Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms
- C. McCurdy and J. S. Vetter. Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms. In ISPASS'10, pages 87-96, 2010.
- (2010) ISPASS'10 , pp. 87-96
- McCurdy, C.¹ Vetter, J.S.²

12
- 0033700063
- A case for user-level dynamic page migration
- New York, NY, USA, ACM
- D. S. Nikolopoulos, T. S. Papatheodorou, C. D. Polychronopoulos, J. Labarta, and E. Ayguadé. A case for user-level dynamic page migration. In ICS'00, pages 119-130, New York, NY, USA, 2000. ACM.
- (2000) ICS'00 , pp. 119-130
- Nikolopoulos, D.S.¹ Papatheodorou, T.S.² Polychronopoulos, C.D.³ Labarta, J.⁴ Ayguadé, E.⁵

13
- 79959204964
- Is data distribution necessary in OpenMP?
- Washington, DC, USA, IEEE Computer Society
- D. S. Nikolopoulos, T. S. Papatheodorou, C. D. Polychronopoulos, J. Labarta, and E. Ayguadé. Is data distribution necessary in OpenMP? In Supercomputing'00, Washington, DC, USA, 2000. IEEE Computer Society.
- (2000) Supercomputing'00
- Nikolopoulos, D.S.¹ Papatheodorou, T.S.² Polychronopoulos, C.D.³ Labarta, J.⁴ Ayguadé, E.⁵

14
- 34548030923
- Thread clustering: Sharing-aware scheduling on SMP-CMP-SMT multiprocessors
- DOI 10.1145/1272996.1273004, Operating Systems Review - Proceedings of the 2007 EuroSys Conference
- D. Tam, R. Azimi, and M. Stumm. Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors. In EuroSys'07, pages 47-58, New York, NY, USA, 2007. ACM. (Pubitemid 47281574)
- (2007) Operating Systems Review (ACM) , pp. 47-58
- Tam, D.¹ Azimi, R.² Stumm, M.³

15
- 0030652844
- Automatic partitioning of data and computations on scalable shared memory multiprocessors
- August
- S. Tandri and T. Abdelrahman. Automatic partitioning of data and computations on scalable shared memory multiprocessors. In ICPP'97, pages 64-73, August 1997.
- (1997) ICPP'97 , pp. 64-73
- Tandri, S.¹ Abdelrahman, T.²

16
- 0028346514
- Impact of sharing-based thread placement on multithreaded architectures
- Los Alamitos, CA, USA, IEEE Computer Society Press
- R. Thekkath and S. J. Eggers. Impact of sharing-based thread placement on multithreaded architectures. In ISCA'94, pages 176-186, Los Alamitos, CA, USA, 1994. IEEE Computer Society Press.
- (1994) ISCA'94 , pp. 176-186
- Thekkath, R.¹ Eggers, S.J.²

17
- 84934274832
- Using hardware counters to automatically improve memory performance
- Washington, DC, USA, IEEE Computer Society
- M. M. Tikir and J. K. Hollingsworth. Using hardware counters to automatically improve memory performance. In Supercomputing'04, Washington, DC, USA, 2004. IEEE Computer Society.
- (2004) Supercomputing'04
- Tikir, M.M.¹ Hollingsworth, J.K.²

18
- 2842582520
- Operating system support for improving data locality on CC-NUMA compute servers
- New York, NY, USA, ACM
- B. Verghese, S. Devine, A. Gupta, and M. Rosenblum. Operating system support for improving data locality on CC-NUMA compute servers. In ASPLOS'96, pages 279-289, New York, NY, USA, 1996. ACM.
- (1996) ASPLOS'96 , pp. 279-289
- Verghese, B.¹ Devine, S.² Gupta, A.³ Rosenblum, M.⁴

19
- 77749340037
- Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?
- New York, NY, USA, ACM
- E. Z. Zhang, Y. Jiang, and X. Shen. Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs? In PPoPP'10, pages 203-212, New York, NY, USA, 2010. ACM.
- (2010) PPoPP'10 , pp. 203-212
- Zhang, E.Z.¹ Jiang, Y.² Shen, X.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.