-
2
-
-
43449094719
-
Program optimization space pruning for a multithreaded GPU
-
DOI 10.1145/1356058.1356084, Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
-
S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S.-Z. Ueng, J. A. Stratton, and W.-M. W. Hwu, "Program Optimization Space Pruning for a Multithreaded GPU," in CGO '08: Proc. of the sixth annual IEEE/ACM Intl. Symp. on Code Generation and Optimization, 2008, pp. 195-204. (Pubitemid 351667266)
-
(2008)
Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
, pp. 195-204
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.S.3
Baghsorkhi, S.S.4
Ueng, S.-Z.5
Stratton, J.A.6
Hwu, W.-M.W.7
-
3
-
-
67650530896
-
-
[Online]. Available
-
ATI CTM Guide. [Online]. Available: http://ati.amd.com/companyinfo/ researcher/documents/ATI CTM Guide.pdf
-
ATI CTM Guide
-
-
-
4
-
-
67650543629
-
-
[Online]. Available
-
NVIDIA CUDA. [Online]. Available: http://www.nvidia.com/cuda
-
-
-
-
5
-
-
34547423880
-
Exploiting Coarsegrained Task, Data, and Pipeline Parallelism in Stream Programs
-
M. I. Gordon, W. Thies, and S. Amarasinghe, "Exploiting Coarsegrained Task, Data, and Pipeline Parallelism in Stream Programs," in ASPLOS-XII: Proc. of the 12th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, 2006, pp. 151-162.
-
ASPLOS-XII: Proc. of the 12th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, 2006
, pp. 151-162
-
-
Gordon, M.I.1
Thies, W.2
Amarasinghe, S.3
-
6
-
-
84959045524
-
StreamIt: A Language for Streaming Applications
-
W. Thies, M. Karczmarek, and S. P. Amarasinghe, "StreamIt: A Language for Streaming Applications," in CC '02: Proc. of the 11th Intl. Conf. on Compiler Construction, 2002, pp. 179-196.
-
CC '02: Proc. of the 11th Intl. Conf. on Compiler Construction, 2002
, pp. 179-196
-
-
Thies, W.1
Karczmarek, M.2
Amarasinghe, S.P.3
-
7
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
DOI 10.1145/1015706.1015800, ACM Transactions on Graphics - Proceedings of ACM SIGGRAPH 2004
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan, "Brook for GPUs: Stream Computing on Graphics Hardware," ACM Trans. on Graphics, vol.23, no.3, pp. 777-786, 2004. (Pubitemid 40163782)
-
(2004)
ACM Transactions on Graphics
, vol.23
, Issue.3
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
8
-
-
33947595619
-
Accelerator: Using data parallelism to program GPUs for general-purpose uses
-
DOI 10.1145/1168857.1168898, ASPLOS XII: Twelfth International Conference on Architectural Support for Programming Languages and Operating Systems
-
D. Tarditi, S. Puri, and J. Oglesby, "Accelerator: Using Data Parallelism to Program GPUs for General-Purpose Uses," in ASPLOS-XII: Proc. of the 12th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, 2006, pp. 325-335. (Pubitemid 47168412)
-
(2006)
International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS
, pp. 325-335
-
-
Tarditi, D.1
Puri, S.2
Oglesby, J.3
-
10
-
-
29144520630
-
Optimizing Stream Programs using Linear State Space Analysis
-
S. Agrawal, W. Thies, and S. Amarasinghe, "Optimizing Stream Programs using Linear State Space Analysis," in CASES '05: Proc. of the 2005 Intl. Conf. on Compilers, Architectures and Synthesis for Embedded Systems, 2005, pp. 126-136.
-
CASES '05: Proc. of the 2005 Intl. Conf. on Compilers, Architectures and Synthesis for Embedded Systems, 2005
, pp. 126-136
-
-
Agrawal, S.1
Thies, W.2
Amarasinghe, S.3
-
11
-
-
0023138886
-
STATIC SCHEDULING of SYNCHRONOUS DATA FLOW PROGRAMS for DIGITAL SIGNAL PROCESSING
-
E. A. Lee and D. G. Messerschmitt, "Static Scheduling of Synchronous Data Flow Programs for Digital Signal Processing," IEEE Trans. on Computers, vol.36, no.1, pp. 24-35, 1987. (Pubitemid 17517473)
-
(1987)
IEEE Transactions on Computers
, vol.C-36
, Issue.1
, pp. 24-35
-
-
Lee Edward, A.1
Messerschmitt David, G.2
-
12
-
-
0028731856
-
Looped Schedules for Dataflow Descriptions of Multirate Signal Processing Algorithms
-
S. S. Bhattacharyya and E. A. Lee, "Looped Schedules for Dataflow Descriptions of Multirate Signal Processing Algorithms," Formal Methods in System Design, vol.5, no.3, pp. 183-205, 1994.
-
(1994)
Formal Methods in System Design
, vol.5
, Issue.3
, pp. 183-205
-
-
Bhattacharyya, S.S.1
Lee, E.A.2
-
13
-
-
0242696254
-
Phased Scheduling of Stream Programs
-
M. Karczmarek, W. Thies, and S. Amarasinghe, "Phased Scheduling of Stream Programs," in LCTES '03: Proc. of the 2003 ACM SIGPLAN Conf. on Language, Compiler, and Tool Support for Embedded Systems, 2003, pp. 103-112.
-
LCTES '03: Proc. of the 2003 ACM SIGPLAN Conf. on Language, Compiler, and Tool Support for Embedded Systems, 2003
, pp. 103-112
-
-
Karczmarek, M.1
Thies, W.2
Amarasinghe, S.3
-
14
-
-
0028768026
-
Minimizing Register Requirements under Resource-constrained Rate-optimal Software Pipelining
-
R. Govindarajan, E. R. Altman, and G. R. Gao, "Minimizing Register Requirements Under Resource-constrained Rate-optimal Software Pipelining," in MICRO 27: Proc. of the 27th annual Intl. Symp. on Microarchitecture, 1994, pp. 85-94.
-
MICRO 27: Proc. of the 27th Annual Intl. Symp. on Microarchitecture, 1994
, pp. 85-94
-
-
Govindarajan, R.1
Altman, E.R.2
Gao, G.R.3
-
16
-
-
0026976353
-
Code generation schema for modulo scheduled loops
-
B. R. Rau, M. S. Schlansker, and P. P. Tirumalai, "Code Generation Schema for Modulo Scheduled Loops," in MICRO 25: Proc. of the 25th annual Intl. Symp. on Microarchitecture, 1992, pp. 158-169. (Pubitemid 23633740)
-
(1992)
Proceedings of the 25th Annual International Symposium on Microarchitecture
, pp. 158-169
-
-
Ramakrishna Rau, B.1
Schlansker Michael, S.2
Tirumalai, P.P.3
-
17
-
-
67650513069
-
-
StreamIt Home Page. [Online]. Available
-
StreamIt Home Page. [Online]. Available: http://www.cag.lcs.mit.edu/ streamit/
-
-
-
-
18
-
-
2942564428
-
Buffer merging - A powerful technique for reducing memory requirements of synchronous dataflow specifications
-
DOI 10.1145/989995.989999
-
P. K. Murthy and S. S. Bhattacharyya, "Buffer Merging-A Powerful Technique for Reducing Memory Requirements of Synchronous Dataflow Specifications," ACM Trans. on Design and Automation of Electronic Systems, vol.9, no.2, pp. 212-237, 2004. (Pubitemid 38732390)
-
(2004)
ACM Transactions on Design Automation of Electronic Systems
, vol.9
, Issue.2
, pp. 212-237
-
-
Murthy, P.K.1
Bhattacharyya, S.S.2
-
19
-
-
80455123249
-
Well-Behaved Dataflow Programs for DSP Computation
-
vol.5, Mar
-
G. Gao, R. Govindarajan, and P. Panangaden, "Well-Behaved Dataflow Programs for DSP Computation," ICASSP-92: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1992., vol.5, pp. 561-564 vol.5, Mar 1992.
-
(1992)
ICASSP-92: IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, 1992
, vol.5
, pp. 561-564
-
-
Gao, G.1
Govindarajan, R.2
Panangaden, P.3
-
20
-
-
0028602312
-
Minimizing Memory Requirements in Rate-optimal Schedules
-
R. Govindarajan, G. Gao, and P. Desai, "Minimizing Memory Requirements in Rate-optimal Schedules," in ASAP '94: Proc. of the 1994 Intl. Conf. on Application Specific Array Processors, Aug 1994, pp. 75-86.
-
ASAP '94: Proc. of the 1994 Intl. Conf. on Application Specific Array Processors, Aug 1994
, pp. 75-86
-
-
Govindarajan, R.1
Gao, G.2
Desai, P.3
-
21
-
-
0036959649
-
A Stream Compiler for Communication-Exposed Architectures
-
M. I. Gordon, W. Thies, M. Karczmarek, J. Lin, A. S. Meli, A. A. Lamb, C. Leger, J. Wong, H. Hoffmann, D. Maze, and S. Amarasinghe, "A Stream Compiler for Communication-Exposed Architectures," in ASPLOS-X: Proc. of the 10th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, 2002, pp. 291-303.
-
ASPLOS-X: Proc. of the 10th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, 2002
, pp. 291-303
-
-
Gordon, M.I.1
Thies, W.2
Karczmarek, M.3
Lin, J.4
Meli, A.S.5
Lamb, A.A.6
Leger, C.7
Wong, J.8
Hoffmann, H.9
Maze, D.10
Amarasinghe, S.11
-
22
-
-
67650528532
-
A Lightweight Streaming Layer for Multicore Execution
-
D. Zhang, Q. J. Li, R. Rabbah, and S. Amarasinghe, "A Lightweight Streaming Layer for Multicore Execution," SIGARCH Computer Architecture News, vol.36, no.2, pp. 18-27, 2008.
-
(2008)
SIGARCH Computer Architecture News
, vol.36
, Issue.2
, pp. 18-27
-
-
Zhang, D.1
Li, Q.J.2
Rabbah, R.3
Amarasinghe, S.4
-
23
-
-
79959466764
-
Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu, "Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA," in PPoPP '08: Proc. of the 13th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, 2008, pp. 73-82.
-
PPoPP '08: Proc. of the 13th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, 2008
, pp. 73-82
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.M.W.6
|