SCOPUS 정보 검색 플랫폼

Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2012

Volumn , Issue , 2012, Pages 567-573

Communication library to overlap computation and communication for OpenCL application

(3) Komoda, Toshiya a Miwa, Shinobu a Nakamura, Hiroshi a

a UNIVERSITY OF TOKYO (Japan)

Author keywords

Accelerators; Double Buffering; OpenCL; Stream Graph

Indexed keywords

APPLICATION DEVELOPERS; COMMUNICATION LIBRARY; COMMUNICATION OPTIMIZATION; COMMUNICATION PATTERN; DOUBLE BUFFERING; ERROR PRONES; EXPERT PROGRAMMERS; IMAGE PROCESSING APPLICATIONS; LOW LEVEL; MEMORY MANAGEMENT; OPENCL; PARALLEL PROGRAMMING ENVIRONMENT; PERFORMANCE IMPROVEMENTS; PROGRAMMING INTERFACE; PROTOTYPE SYSTEM; STREAM GRAPH;

COMMUNICATION SYSTEMS; DISTRIBUTED PARAMETER NETWORKS; IMAGE PROCESSING; OPTIMIZATION; PARALLEL PROGRAMMING; PARTICLE ACCELERATORS;

ACCELERATION;

EID: 84867427249 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPSW.2012.68 Document Type: Conference Paper

Times cited : (6)

References (12)

1
- 78651550268
- Scalable parallel programming with cuda
- March
- J. Nickolls, I. Buck, M. Garland, and K. Skadron, "Scalable parallel programming with cuda," Queue, vol. 6, pp. 40-53, March 2008.
- (2008) Queue , vol.6 , pp. 40-53
- Nickolls, J.¹ Buck, I.² Garland, M.³ Skadron, K.⁴

2
- 70349100958
- Rev. 1.2, [Online]. Available
- OpenCL Specification, Khronous OpenCL Working Group Std., Rev. 1.2, 2011. [Online]. Available: http://www.khronos.org/opencl/
- (2011) OpenCL Specification

3
- 79959904195
- Automatic CPU-GPU communication management and optimization
- T. B. Jablin, P. Prabhu, J. A. Jablin, N. P. Johnson, S. R. Beard, and D. I. August, "Automatic CPU-GPU communication management and optimization," in Proc. the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation(PLDI'11), New York, NY, USA, 2011, pp. 142-151.
- Proc. the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation(PLDI'11), New York, NY, USA, 2011 , pp. 142-151
- Jablin, T.B.¹ Prabhu, P.² Jablin, J.A.³ Johnson, N.P.⁴ Beard, S.R.⁵ August, D.I.⁶

4
- 67650081010
- OpenMP to GPGPU: A compiler framework for automatic translation and optimization
- S. Lee, S.-J. Min, and R. Eigenmann, "OpenMP to GPGPU: a compiler framework for automatic translation and optimization," in Proc. the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming (PPoPP'09), New York, NY, USA, 2009, pp. 101-110.
- Proc. the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP'09), New York, NY, USA, 2009 , pp. 101-110
- Lee, S.¹ Min, S.-J.² Eigenmann, R.³

5
- 0037521913
- Streamit: A language for streaming applications
- W. Thies, M. Karczmarek, and S. Amarasinghe, "Streamit: A language for streaming applications," in Proc. International Conference on Compiler Construction(CC'02), Grenoble, France, Apr 2002.
- Proc. International Conference on Compiler Construction(CC'02), Grenoble, France, Apr 2002
- Thies, W.¹ Karczmarek, M.² Amarasinghe, S.³

6
- 47349118686
- A practical approach to exploiting coarse-grained pipeline parallelism in C programs
- W. Thies, V. Chandrasekhar, and S. Amarasinghe, "A practical approach to exploiting coarse-grained pipeline parallelism in C programs," in Proc. the 40th Annual IEEE/ACM International Symposium on Microarchitecture(MICRO'07), Chicago,Illinois, USA, dec. 2007, pp. 356-369.
- Proc. the 40th Annual IEEE/ACM International Symposium on Microarchitecture(MICRO'07), Chicago,Illinois, USA, Dec. 2007 , pp. 356-369
- Thies, W.¹ Chandrasekhar, V.² Amarasinghe, S.³

7
- 79959906704
- Kremlin: Rethinking and rebooting gprof for the multicore age
- S. Garcia, D. Jeon, C. M. Louie, and M. B. Taylor, "Kremlin: rethinking and rebooting gprof for the multicore age," in Proc. the 32nd ACM SIGPLAN conference on Programming language design and implementation (PLDI '11), New York, NY, USA, 2011, pp. 458-469.
- Proc. the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI '11), New York, NY, USA, 2011 , pp. 458-469
- Garcia, S.¹ Jeon, D.² Louie, C.M.³ Taylor, M.B.⁴

8
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S.-H. Lee, and K. Skadron, "Rodinia: A benchmark suite for heterogeneous computing," in Proc. IEEE International Symposium on Workload Characterization (IISWC '09), Washington, DC, USA, 2009, pp. 44-54.
- Proc. IEEE International Symposium on Workload Characterization (IISWC '09), Washington, DC, USA, 2009 , pp. 44-54
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.W.⁵ Lee, S.-H.⁶ Skadron, K.⁷

9
- 83155190228
- Peta-scale phase-field simulation for dendritic solidification on the tsubame 2.0 supercomputer
- T. Shimokawabe, T. Aoki, T. Takaki, T. Endo, A. Yamanaka, N. Maruyama, A. Nukada, and S. Matsuoka, "Peta-scale phase-field simulation for dendritic solidification on the tsubame 2.0 supercomputer," in Proc. the 2011 ACM/IEEE conference on Supercomputing (SC'11), New York, NY, USA, 2011, pp. 3:1-3:11.
- Proc. the 2011 ACM/IEEE Conference on Supercomputing (SC'11), New York, NY, USA, 2011
- Shimokawabe, T.¹ Aoki, T.² Takaki, T.³ Endo, T.⁴ Yamanaka, A.⁵ Maruyama, N.⁶ Nukada, A.⁷ Matsuoka, S.⁸

10
- 80054863945
- Medical ultrasound imaging: To GPU or not to GPU?
- H. K.-H. So, J. Chen, B. Y. Yiu, and A. C. Yu, "Medical ultrasound imaging: To GPU or not to GPU?" IEEE Micro, vol. 31, pp. 54-65, 2011.
- (2011) IEEE Micro , vol.31 , pp. 54-65
- So, H.K.-H.¹ Chen, J.² Yiu, B.Y.³ Yu, A.C.⁴

11
- 84866867636
- [Online]. Available
- Cuda toolkit 4.0. NVIDIA Corporation. [Online]. Available: http://developer.nvidia.com/cuda-toolkit-40
- Cuda Toolkit 4.0

12
- 83155190224
- Physis: An implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers
- N. Maruyama, T. Nomura, K. Sato, and S. Matsuoka, "Physis: an implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers." in Proc. the 2011 ACM/IEEE conference on Supercomputing (SC'11), New York, NY, USA, 2011, pp. 1-12.
- Proc. the 2011 ACM/IEEE Conference on Supercomputing (SC'11), New York, NY, USA, 2011 , pp. 1-12
- Maruyama, N.¹ Nomura, T.² Sato, K.³ Matsuoka, S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.