-
1
-
-
43649092214
-
-
Advanced Micro Devices, Inc, 1.01 edition
-
Advanced Micro Devices, Inc. ATI CTM Guide, 1.01 edition, 2006.
-
(2006)
ATI CTM Guide
-
-
-
3
-
-
4644295630
-
Evaluating the Imagine stream architecture
-
J. H. Ahn,W. J. Dally, B. Khailany, U. J. Kapasi, and A. Das. Evaluating the Imagine stream architecture. In Proc. 31st Int'l Symp. on Computer Architecture, page 14, 2004.
-
(2004)
Proc. 31st Int'l Symp. on Computer Architecture
, pp. 14
-
-
Ahn, J.H.1
Dally, W.J.2
Khailany, B.3
Kapasi, U.J.4
Das, A.5
-
4
-
-
57349130987
-
StoreGPU: Exploiting graphics processing units to accelerate distributed storage systems
-
S. Al-Kiswany, A. Gharaibeh, E. Santos-Neto, G. Yuan, and M. Ripeanu. StoreGPU: exploiting graphics processing units to accelerate distributed storage systems. In Proc. 17th Int'l Symp. on High Performance Distributed Computing, pages 165-174, 2008.
-
(2008)
Proc. 17th Int'l Symp. on High Performance Distributed Computing
, pp. 165-174
-
-
Al-Kiswany, S.1
Gharaibeh, A.2
Santos-Neto, E.3
Yuan, G.4
Ripeanu, M.5
-
6
-
-
0033725306
-
Methodology for I/O cell placement and checking in ASIC designs using area-array power grid
-
P. Buffet, J. Natonio, R. Proctor, Y. Sun, and G. Yasar. Methodology for I/O cell placement and checking in ASIC designs using area-array power grid. In IEEE Custom Integrated Circuits Conference, 2000.
-
(2000)
IEEE Custom Integrated Circuits Conference
-
-
Buffet, P.1
Natonio, J.2
Proctor, R.3
Sun, Y.4
Yasar, G.5
-
7
-
-
34247371330
-
Cell Broadband Engine interconnect and memory interface
-
Palo Alto, CA, August
-
S. Clark, K. Haselhorst, K. Imming, J. Irish, D. Krolak, and T. Ozguner. Cell Broadband Engine interconnect and memory interface. In Hot Chips 17, Palo Alto, CA, August 2005.
-
(2005)
Hot Chips 17
-
-
Clark, S.1
Haselhorst, K.2
Imming, K.3
Irish, J.4
Krolak, D.5
Ozguner, T.6
-
8
-
-
84877083867
-
Merrimac: Supercomputing with streams
-
W. J. Dally, F. Labonte, A. Das, P. Hanrahan, J.-H. Ahn, J. Gummaraju, M. Erez, N. Jayasena, I. Buck, T. J. Knight, and U. J. Kapasi. Merrimac: Supercomputing with streams. In SC '03: Proc. 2003 ACM/IEEE Conf. on Supercomputing, page 35, 2003.
-
(2003)
SC '03: Proc. 2003 ACM/IEEE Conf. on Supercomputing
, pp. 35
-
-
Dally, W.J.1
Labonte, F.2
Das, A.3
Hanrahan, P.4
Ahn, J.-H.5
Gummaraju, J.6
Erez, M.7
Jayasena, N.8
Buck, I.9
Knight, T.J.10
Kapasi, U.J.11
-
10
-
-
33750834456
-
-
V. del Barrio, C. Gonzalez, J. Roca, A. Fernandez, and E. E. ATTILA: a cycle-level execution-driven simulator for modern GPU architectures. Int'l Symp. on Performance Analysis of Systems and Software, pages 231-241, March 2006.
-
V. del Barrio, C. Gonzalez, J. Roca, A. Fernandez, and E. E. ATTILA: a cycle-level execution-driven simulator for modern GPU architectures. Int'l Symp. on Performance Analysis of Systems and Software, pages 231-241, March 2006.
-
-
-
-
15
-
-
38349041620
-
Accelerating Large Graph Algorithms on the GPU Using CUDA
-
P. Harish and P. J. Narayanan. Accelerating Large Graph Algorithms on the GPU Using CUDA. In HiPC, pages 197-208, 2007.
-
(2007)
HiPC
, pp. 197-208
-
-
Harish, P.1
Narayanan, P.J.2
-
16
-
-
0003278283
-
The Microarchitecture of the Pentium® 4 Processor
-
G. Hinton, D. Sager, M. Upton, D. Boggs, D. Carmean, A. Kyker, and P. Roussel. The Microarchitecture of the Pentium® 4 Processor. Intel® Technology Journal, 5(1), 2001.
-
(2001)
Intel® Technology Journal
, vol.5
, Issue.1
-
-
Hinton, G.1
Sager, D.2
Upton, M.3
Boggs, D.4
Carmean, D.5
Kyker, A.6
Roussel, P.7
-
18
-
-
67650692011
-
-
Illinois Microarchitecture Project utilizing Advanced Compiler Technology Research Group
-
Illinois Microarchitecture Project utilizing Advanced Compiler Technology Research Group. Parboil benchmark suite. http://www.crhc.uiuc.edu/IMPACT/ parboil.php.
-
Parboil benchmark suite
-
-
-
19
-
-
70349173991
-
-
Infineon. 256Mbit GDDR3 DRAM, Revision 1.03 (Part No. HYB18H256321AF). http://www.infineon.com, December 2005.
-
Infineon. 256Mbit GDDR3 DRAM, Revision 1.03 (Part No. HYB18H256321AF). http://www.infineon.com, December 2005.
-
-
-
-
20
-
-
84955473128
-
Exploring the VLSI scalability of stream processors
-
B. Khailany, W. J. Dally, S. Rixner, U. J. Kapasi, J. D. Owens, and B. Towles. Exploring the VLSI scalability of stream processors. In Proc. 9th Int'l Symp. on High Performance Computer Architecture, page 153, 2003.
-
(2003)
Proc. 9th Int'l Symp. on High Performance Computer Architecture
, pp. 153
-
-
Khailany, B.1
Dally, W.J.2
Rixner, S.3
Kapasi, U.J.4
Owens, J.D.5
Towles, B.6
-
21
-
-
0019892368
-
Lockup-free Instruction Fetch/Prefetch Cache Organization
-
D. Kroft. Lockup-free Instruction Fetch/Prefetch Cache Organization. In Proc. 8th Int'l Symp. Computer Architecture, pages 81-87, 1981.
-
(1981)
Proc. 8th Int'l Symp. Computer Architecture
, pp. 81-87
-
-
Kroft, D.1
-
22
-
-
44849137198
-
NVIDIA Tesla: A Unified Graphics and Computing Architecture
-
E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym. NVIDIA Tesla: A Unified Graphics and Computing Architecture. IEEE Micro, 28(2):39-55, 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
25
-
-
70349167821
-
-
Marco Chiappetta. ATI Radeon HD 2900 XT - R600 Has Arrived. http://www.hothardware.com/printarticle.aspx?articleid=966.
-
Marco Chiappetta. ATI Radeon HD 2900 XT - R600 Has Arrived. http://www.hothardware.com/printarticle.aspx?articleid=966.
-
-
-
-
27
-
-
51049099597
-
-
J. Michalakes and M. Vachharajani. GPU acceleration of numerical weather prediction. IPDPS 2008: IEEE Int'l Symp. on Parallel and Distributed Processing, pages 1-7, April 2008.
-
J. Michalakes and M. Vachharajani. GPU acceleration of numerical weather prediction. IPDPS 2008: IEEE Int'l Symp. on Parallel and Distributed Processing, pages 1-7, April 2008.
-
-
-
-
29
-
-
78651550268
-
Scalable Parallel Programming with CUDA
-
Mar.-Apr
-
J. Nickolls, I. Buck, M. Garland, and K. Skadron. Scalable Parallel Programming with CUDA. ACM Queue, 6(2):40-53, Mar.-Apr. 2008.
-
(2008)
ACM Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
30
-
-
70349186177
-
-
NVIDIA. CUDA ZONE. http://www.nvidia.com/cuda.
-
NVIDIA. CUDA ZONE. http://www.nvidia.com/cuda.
-
-
-
-
31
-
-
70349170944
-
-
NVIDIA. Geforce 8 series. http://www.nvidia.com/page/geforce8.html.
-
NVIDIA. Geforce 8 series. http://www.nvidia.com/page/geforce8.html.
-
-
-
-
32
-
-
84872053761
-
-
NVIDIA Corporation. NVIDIA CUDA SDK code samples. http://developer. download.nvidia.com/compute/cuda/sdk/website/samples.html.
-
NVIDIA CUDA SDK code samples
-
-
-
33
-
-
70349170942
-
-
NVIDIA Corporation. NVIDIA CUDA Programming Guide, 1.1 edition, 2007.
-
NVIDIA Corporation. NVIDIA CUDA Programming Guide, 1.1 edition, 2007.
-
-
-
-
35
-
-
70349183057
-
-
NVIDIA Corporation. PTX: Parallel Thread Execution ISA, 1.1 edition, 2007.
-
NVIDIA Corporation. PTX: Parallel Thread Execution ISA, 1.1 edition, 2007.
-
-
-
-
37
-
-
70349167820
-
-
Pcchen. N-Queens Solver. http://forums.nvidia.com/index.php?showtopic= 76893.
-
Pcchen. N-Queens Solver. http://forums.nvidia.com/index.php?showtopic= 76893.
-
-
-
-
38
-
-
27344435504
-
-
D. Pham, S. Asano, M. Bolliger, M. D. , H. Hofstee, C. Johns, J. Kahle, A.Kameyama, J. Keaty, Y. Masubuchi, D. S. M. Riley, D. Stasiak, M. Suzuoki, M. Wang, J. Warnock, S. W. D. Wendel, T.Yamazaki, and K. Yazawa. The design and implementation of a first-generation Cell processor. Digest of Technical Papers, IEEE Int'l Solid-State Circuits Conference (ISSCC), pages 184-592 1, 10-10 Feb. 2005.
-
D. Pham, S. Asano, M. Bolliger, M. D. , H. Hofstee, C. Johns, J. Kahle, A.Kameyama, J. Keaty, Y. Masubuchi, D. S. M. Riley, D. Stasiak, M. Suzuoki, M. Wang, J. Warnock, S. W. D. Wendel, T.Yamazaki, and K. Yazawa. The design and implementation of a first-generation Cell processor. Digest of Technical Papers, IEEE Int'l Solid-State Circuits Conference (ISSCC), pages 184-592 Vol. 1, 10-10 Feb. 2005.
-
-
-
-
39
-
-
0033691565
-
Memory access scheduling
-
S. Rixner, W. J. Dally, U. J. Kapasi, P. Mattson, and J. D. Owens. Memory access scheduling. In Proc. 27th Int'l Symp. on Computer Architecture, pages 128-138, 2000.
-
(2000)
Proc. 27th Int'l Symp. on Computer Architecture
, pp. 128-138
-
-
Rixner, S.1
Dally, W.J.2
Kapasi, U.J.3
Mattson, P.4
Owens, J.D.5
-
40
-
-
43449094719
-
Program optimization space pruning for a multithreaded GPU
-
April
-
S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S.-Z. Ueng, J. Stratton, and W.W. Hwu. Program optimization space pruning for a multithreaded GPU. In Proc. 6th Int'l Symp. on Code Generation and Optimization (CGO), pages 195-204, April 2008.
-
(2008)
Proc. 6th Int'l Symp. on Code Generation and Optimization (CGO)
, pp. 195-204
-
-
Ryoo, S.1
Rodrigues, C.2
Stone, S.3
Baghsorkhi, S.4
Ueng, S.-Z.5
Stratton, J.6
Hwu, W.W.7
-
41
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In Proc. 13th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, pages 73-82, 2008.
-
(2008)
Proc. 13th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming
, pp. 73-82
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.W.6
-
42
-
-
38849131252
-
High-throughput sequence alignment using Graphics Processing Units
-
M. Schatz, C. Trapnell, A. Delcher, and A. Varshney. High-throughput sequence alignment using Graphics Processing Units. BMC Bioinformatics, 8(1):474, 2007.
-
(2007)
BMC Bioinformatics
, vol.8
, Issue.1
, pp. 474
-
-
Schatz, M.1
Trapnell, C.2
Delcher, A.3
Varshney, A.4
|