SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2013, Pages 1080-1087

A multi-level optimization method for stencil computation on the domain that is bigger than memory capacity of GPU

Author keywords

GPU memory capacity; multi level optimization; stencil computation; temporal blocking

Indexed keywords

DISTRIBUTED COMPUTER SYSTEMS; PROBLEM SOLVING;

2 LAYER; MEMORY ACCESS; MEMORY CAPACITY; MULTILEVEL OPTIMIZATION; PROBLEM SIZE; STENCIL COMPUTATIONS; TEMPORAL BLOCKING;

GRAPHICS PROCESSING UNIT;

EID: 84899705665 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPSW.2013.58 Document Type: Conference Paper

Times cited : (13)

References (11)

1
- 70350771127
- Stencil computation optimization and autotuning on state-of-The-Art multicore architectures
- K. Datta, M. Murphy, V. Volkov, S. Williams,J. Carter, L. Oliker, D. Patterson, J. Shalf, and K. Yelick, "Stencil computation optimization and autotuning on state-of-The-Art multicore architectures,"In Proceedings of the 2008 ACM/IEEE Conference on Supercomputing (SC08), pp. 1-12, 2008.
- (2008) Proceedings of the 2008 ACM/IEEE Conference on Supercomputing (SC08) , pp. 1-12
- Datta, K.¹ Murphy, M.² Volkov, V.³ Williams, S.⁴ Carter, J.⁵ Oliker, L.⁶ Patterson, D.⁷ Shalf, J.⁸ Yelick, K.⁹

2
- 83155190228
- Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer
- Takashi Shimokawabe, Takayuki Aoki, Tomohiro Takaki, Akinori Yamanaka, Akira Nukada, Toshio Endo, Naoya Maruyama, and Satoshi Matsuoka, "Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer,"In Proceedings of IEEE/ACM International Conference on Supercomputing (SC11), pp. 1-11, 2011.
- (2011) Proceedings of IEEE/ACM International Conference on Supercomputing (SC11) , pp. 1-11
- Shimokawabe, T.¹ Aoki, T.² Takaki, T.³ Yamanaka, A.⁴ Nukada, A.⁵ Endo, T.⁶ Maruyama, N.⁷ Matsuoka, S.⁸

3
- 84893593562
- Physis: An implicitly-parallel programming model for stencil computing on large-scale GPU-Accelerated supercomputers
- Naoya Maruyama, Tatsuo Nomura, Kento Sato, and Satoshi Matsuoka, "Physis: An implicitly-parallel programming model for stencil computing on large-scale GPU-Accelerated supercomputers," IEEE SC11,2011.
- (2011) IEEE SC11
- Maruyama, N.¹ Nomura, T.² Sato, K.³ Matsuoka, S.⁴

6
- 70449657442
- Efcient temporal blocking for stencil computations by multicore-Aware wavefront parallelization
- Gerhard Wellein, Georg Hager, Thomas Zeiser, Markus Wittmann and Holger Fehske, "Ef-cient temporal blocking for stencil computations by multicore-Aware wavefront parallelization," Computer Software and Applications Conference, vol.1, pp. 579-586, 2009.
- (2009) Computer Software and Applications Conference , vol.1 , pp. 579-586
- Wellein, G.¹ Hager, G.² Zeiser, T.³ Wittmann, M.⁴ Fehske, H.⁵

8
- 79958272014
- 3.5-D blocking optimization for stencil computations on modern CPUs and GPUs
- Anthony Nguyen, Nadathur Satish, Jatin Chhugani, Changkyu Kim, and Pradeep Dubey, "3.5-D blocking optimization for stencil computations on modern CPUs and GPUs," IEEE SC10, 2010.
- (2010) IEEE SC10
- Nguyen, A.¹ Satish, N.² Chhugani, J.³ Kim, C.⁴ Dubey, P.⁵

9
- 79953768747
- Overcoming the GPU memory limitation on FDTDthrough the use of overlappingsubgrids
- Leonardo Mattes and Sergio Kofuji, "Overcoming the GPU memory limitation on FDTDthrough the use of overlappingsubgrids," ICMMT, pp.1536-1539, 2010.
- (2010) ICMMT , pp. 1536-1539
- Mattes, L.¹ Kofuji, S.²

10
- 77954903012
- The use of overlapping subgrids to accelerate the FDTD on GPU devices
- Leonardo Mattes and Sergio Kofuji, "The use of overlapping subgrids to accelerate the FDTD on GPU devices,"Radar Conference, pp. 807-810, 2010.
- (2010) Radar Conference , pp. 807-810
- Mattes, L.¹ Kofuji, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.