SCOPUS 정보 검색 플랫폼

Proceedings - 25th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2011

Volumn , Issue , 2011, Pages 467-478

Automated architecture-aware mapping of streaming applications onto GPUs

(4) Hagiescu, Andrei a Huynh, Huynh Phung b Wong, Weng Fai a Goh, Rick Siow Mong b

a NATIONAL UNIVERSITY OF SINGAPORE (Singapore)

b INSTITUTE OF HIGH PERFORMANCE COMPUTING (Singapore)

Author keywords

GPU; stream processing; StreamIt

Indexed keywords

ARCHITECTURAL FEATURES; DATA MOVEMENTS; GENERAL PURPOSE; GPU; GPU PROGRAMMING; GRAPHIC PROCESSING UNITS; MEMORY ACCESS; MEMORY FOOTPRINT; MEMORY HIERARCHY; NUMBER OF THREADS; OFF-CHIP; POOR PERFORMANCE; PROCESSING CORE; PROGRAMMING LANGUAGE; SHARED MEMORIES; STREAM PROCESSING; STREAMING APPLICATIONS; STREAMIT; THREAD GROUPS;

DISTRIBUTED PARAMETER NETWORKS; MEMORY ARCHITECTURE; MULTIPROCESSING SYSTEMS; PROGRAM COMPILERS;

COMPUTER HARDWARE DESCRIPTION LANGUAGES;

EID: 80053240142 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2011.52 Document Type: Conference Paper

Times cited : (25)

References (19)

1
- 0036959649
- A stream compiler for communication-exposed architectures
- M. I. Gordon and et al., "A stream compiler for communication- exposed architectures," in ASPLOS '02, Oct 2002.
- ASPLOS '02, Oct 2002
- Gordon, M.I.¹

2
- 77951154340
- The GPU computing era
- J. Nickolls and W. J. Dally, "The GPU computing era," IEEE Micro, vol. 30, pp. 56-69, 2010.
- (2010) IEEE Micro , vol.30 , pp. 56-69
- Nickolls, J.¹ Dally, W.J.²

3
- 33947588048
- A survey of general-purpose computation on graphics hardware
- J. D. Owens and et al., "A survey of general-purpose computation on graphics hardware," Computer Graphics Forum, vol. 26, no. 1, pp. 80-113, 2007.
- (2007) Computer Graphics Forum , vol.26 , Issue.1 , pp. 80-113
- Owens, J.D.¹

4
- 70349100958
- Khronos OpenCL Working Group, version 1.0.29, 8 December
- Khronos OpenCL Working Group, The OpenCL Specification, version 1.0.29, 8 December 2008.
- (2008) The OpenCL Specification

5
- 84870629709
- Nvidia cuda. Http://www.nvidia.com/object/cuda
- Nvidia Cuda

6
- 84877609547
- Brook for GPUs: Stream computing on graphics hardware
- New York, NY, USA: ACM
- I. Buck and et al., "Brook for GPUs: stream computing on graphics hardware," in SIGGRAPH '04. New York, NY, USA: ACM, 2004, pp. 777-786.
- (2004) SIGGRAPH '04 , pp. 777-786
- Buck, I.¹

7
- 34547423880
- Exploiting coarse-grained task, data, and pipeline parallelism in stream programs
- New York, NY, USA: ACM
- M. I. Gordon, W. Thies, and S. Amarasinghe, "Exploiting coarse-grained task, data, and pipeline parallelism in stream programs," in ASPLOS '06. New York, NY, USA: ACM, 2006, pp. 151-162.
- (2006) ASPLOS '06 , pp. 151-162
- Gordon, M.I.¹ Thies, W.² Amarasinghe, S.³

8
- 57349172999
- Orchestrating the execution of stream programs on multicore platforms
- M. Kudlur and S. Mahlke, "Orchestrating the execution of stream programs on multicore platforms," in PLDI '08, 2008, pp. 114-124.
- (2008) PLDI '08 , pp. 114-124
- Kudlur, M.¹ Mahlke, S.²

9
- 67650563116
- Software pipelined execution of stream programs on GPUs
- A. Udupa, R. Govindarajan, and M. J. Thazhuthaveetil, "Software pipelined execution of stream programs on GPUs," in CGO '09, 2009, pp. 200-209.
- (2009) CGO '09 , pp. 200-209
- Udupa, A.¹ Govindarajan, R.² Thazhuthaveetil, M.J.³

10
- 0023138886
- Static scheduling of synchronous data flow programs for digital signal processing
- E. A. Lee and D. G. Messerschmitt, "Static scheduling of synchronous data flow programs for digital signal processing," IEEE Trans. Comput., vol. 36, no. 1, pp. 24-35, 1987.
- (1987) IEEE Trans. Comput. , vol.36 , Issue.1 , pp. 24-35
- Lee, E.A.¹ Messerschmitt, D.G.²

11
- 79959466764
- Optimization principles and application performance evaluation of a multithreaded gpu using cuda
- S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W.-m. W. Hwu, "Optimization principles and application performance evaluation of a multithreaded gpu using cuda," in PPoPP '08, 2008, pp. 73-82.
- (2008) PPoPP '08 , pp. 73-82
- Ryoo, S.¹ Rodrigues, C.I.² Baghsorkhi, S.S.³ Stone, S.S.⁴ Kirk, D.B.⁵ Hwu, W.-M.W.⁶

12
- 84924180957
- Apr.
- X. Ye, D. Fan, W. Lin, N. Yuan, and P. Ienne, "High performance comparison-based sorting algorithm on many-core GPUs," Apr. 2010, pp. 1-10.
- (2010) High Performance Comparison-based Sorting Algorithm on Many-core GPUs , pp. 1-10
- Ye, X.¹ Fan, D.² Lin, W.³ Yuan, N.⁴ Ienne, P.⁵

13
- 80053263954
- Hpc project, par4all. Par4All
- Hpc project, par4all. HPC Project, Par4All.
- HPC Project

14
- 74549119494
- Heterogeneous multicore parallel programming for graphics processing units
- F. Bodin and S. Bihan, "Heterogeneous multicore parallel programming for graphics processing units," Sci. Program., vol. 17, no. 4, pp. 325-336, 2009.
- (2009) Sci. Program. , vol.17 , Issue.4 , pp. 325-336
- Bodin, F.¹ Bihan, S.²

15
- 77952281697
- Implementing the PGI accelerator model
- M. Wolfe, "Implementing the PGI accelerator model," in GPGPU '10, 2010, pp. 43-50.
- (2010) GPGPU '10 , pp. 43-50
- Wolfe, M.¹

16
- 31844444712
- Cache aware optimization of stream programs
- J. Sermulins, W. Thies, R. Rabbah, and S. Amarasinghe, "Cache aware optimization of stream programs," SIGPLAN Not., vol. 40, no. 7, pp. 115-126, 2005.
- (2005) SIGPLAN Not. , vol.40 , Issue.7 , pp. 115-126
- Sermulins, J.¹ Thies, W.² Rabbah, R.³ Amarasinghe, S.⁴

17
- 70350121632
- Compiler-directed scratchpad memory management via graph coloring
- L. Li, H. Feng, and J. Xue, "Compiler-directed scratchpad memory management via graph coloring," ACM Trans. Archit. Code Optim., vol. 6, no. 3, pp. 1-17, 2009.
- (2009) ACM Trans. Archit. Code Optim. , vol.6 , Issue.3 , pp. 1-17
- Li, L.¹ Feng, H.² Xue, J.³

18
- 77953977802
- Streamit benchmarks. http://groups.csail.mit.edu/cag/streamit/shtml/ benchmarks.shtml.
- Streamit Benchmarks

19
- 80053234976
- ATI stream computing programming guide. http://developer.amd.com/gpu/ ATIStreamSDK/assets/ATI-Stream-SDK-OpenCL-Programming-Guide.pdf.
- ATI Stream Computing Programming Guide

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.