-
1
-
-
83155188972
-
CudaDMA: Optimizing GPU memory bandwidth via warp specialization
-
M. Bauer, H. Cook, and B. Khailany. CudaDMA: Optimizing GPU Memory Bandwidth via Warp Specialization. In Proc. 2011 Intl. Conf. for High Performance Computing, Networking, Storage and Analysis (SC), pages 12:1-12:11, 2011.
-
(2011)
Proc. 2011 Intl. Conf. for High Performance Computing, Networking, Storage and Analysis (SC)
, pp. 12:1-12:11
-
-
Bauer, M.1
Cook, H.2
Khailany, B.3
-
2
-
-
80051839847
-
Meraculous: De novo genome assembly with short paired-end reads
-
J. Chapman, I. Ho, S. Sunkara, S. Luo, G. Schroth, and D. Rokhsar. Meraculous: De Novo Genome Assembly with Short Paired-End Reads. PLoS ONE, (8):e23501, 2011.
-
(2011)
PLoS ONE
, Issue.8
-
-
Chapman, J.1
Ho, I.2
Sunkara, S.3
Luo, S.4
Schroth, G.5
Rokhsar, D.6
-
4
-
-
0001483604
-
Communication optimizations for irregular scientific computations on distributed memory architectures
-
R. Das, M. Uysal, J. Saltz, and Y. Hwang. Communication Optimizations for Irregular Scientific Computations on Distributed Memory Architectures. Journal of Parallel and Distributed Computing, pages 462-478, 1994.
-
(1994)
Journal of Parallel and Distributed Computing
, pp. 462-478
-
-
Das, R.1
Uysal, M.2
Saltz, J.3
Hwang, Y.4
-
5
-
-
77952251540
-
An asymmetric distributed shared memory model for heterogeneous parallel systems
-
I. Gelado, J. Stone, J. Cabezas, S. Patel, N. Navarro, and W. Hwu. An Asymmetric Distributed Shared Memory Model for Heterogeneous Parallel Systems. In Proc. 15th Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 347-358, 2010.
-
(2010)
Proc. 15th Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
, pp. 347-358
-
-
Gelado, I.1
Stone, J.2
Cabezas, J.3
Patel, S.4
Navarro, N.5
Hwu, W.6
-
7
-
-
80053240142
-
Automated architecture-aware mapping of streaming applications onto GPUs
-
A. Hagiescu, H. Huynh, W. Wong, and R. Goh. Automated Architecture-Aware Mapping of Streaming Applications onto GPUs. In Proc. 25th IEEE Intl. Parallel Distributed Processing Symp. (IPDPS), pages 467-478, 2011.
-
(2011)
Proc. 25th IEEE Intl. Parallel Distributed Processing Symp. (IPDPS)
, pp. 467-478
-
-
Hagiescu, A.1
Huynh, H.2
Wong, W.3
Goh, R.4
-
9
-
-
84858381077
-
Scalable framework for mapping streaming applications onto multi-GPU systems
-
H. Huynh, A. Hagiescu, W. Wong, and R. Goh. Scalable Framework for Mapping Streaming Applications onto Multi-GPU Systems. In Proc. 17th Symp. on Principles and Practice of Parallel Programming (PPoPP), pages 1-10, 2012.
-
(2012)
Proc. 17th Symp. on Principles and Practice of Parallel Programming (PPoPP)
, pp. 1-10
-
-
Huynh, H.1
Hagiescu, A.2
Wong, W.3
Goh, R.4
-
10
-
-
84863423999
-
Dynamically managed data for CPU-GPU architectures
-
T. Jablin, J. Jablin, P. Prabhu, F. Liu, and D. August. Dynamically Managed Data for CPU-GPU Architectures. In Proc. 10th Intl. Symp. on Code Generation and Optimization (CGO), pages 165-174, 2012.
-
(2012)
Proc. 10th Intl. Symp. on Code Generation and Optimization (CGO)
, pp. 165-174
-
-
Jablin, T.1
Jablin, J.2
Prabhu, P.3
Liu, F.4
August, D.5
-
11
-
-
79959904195
-
Automatic CPU-GPU communication management and optimization
-
T. Jablin, P. Prabhu, J. Jablin, N. Johnson, S. Beard, and D. August. Automatic CPU-GPU Communication Management and Optimization. In Proc. 32nd Conf. on Programming Language Design and Implementation (PLDI), pages 142-151, 2011.
-
(2011)
Proc. 32nd Conf. on Programming Language Design and Implementation (PLDI)
, pp. 142-151
-
-
Jablin, T.1
Prabhu, P.2
Jablin, J.3
Johnson, N.4
Beard, S.5
August, D.6
-
15
-
-
0028741448
-
Run-time and compile-time support for adaptive irregular problems
-
S. Sharma, R. Ponnusamy, B. Moon, Y. Hwang, R. Das, and J. Saltz. Run-time and Compile-time Support for Adaptive Irregular Problems. In Proc. of the 1994 Conf. on Supercomputing, pages 97-106, 1994.
-
(1994)
Proc. of the 1994 Conf. on Supercomputing
, pp. 97-106
-
-
Sharma, S.1
Ponnusamy, R.2
Moon, B.3
Hwang, Y.4
Das, R.5
Saltz, J.6
-
16
-
-
58449127539
-
CUDA-lite: Reducing GPU programming complexity
-
S. Ueng, M. Lathara, S. Baghsorkhi, and W. Hwu. CUDA-Lite: Reducing GPU Programming Complexity. In Languages and Compilers for Parallel Computing, volume 5335, pages 1-15. 2008.
-
(2008)
Languages and Compilers for Parallel Computing
, vol.5335
, pp. 1-15
-
-
Ueng, S.1
Lathara, M.2
Baghsorkhi, S.3
Hwu, W.4
-
17
-
-
80053277662
-
OpinionFinder: A system for subjectivity analysis
-
HLT-Demo '05
-
T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. OpinionFinder: A System for Subjectivity Analysis. In Proc. HLT/EMNLP on Interactive Demonstrations, HLT-Demo '05, pages 34-35, 2005.
-
(2005)
Proc. HLT/EMNLP on Interactive Demonstrations
, pp. 34-35
-
-
Wilson, T.1
Hoffmann, P.2
Somasundaran, S.3
Kessler, J.4
Wiebe, J.5
Choi, Y.6
Cardie, C.7
Riloff, E.8
Patwardhan, S.9
-
19
-
-
77954691442
-
A GPGPU compiler for memory optimization and parallelism management
-
Y. Yang, P. Xiang, J. Kong, and H. Zhou. A GPGPU Compiler for Memory Optimization and Parallelism Management. In Proc. 2010 Conf. on Programming Language Design and Implementation (PLDI), pages 86-97, 2010.
-
(2010)
Proc. 2010 Conf. on Programming Language Design and Implementation (PLDI)
, pp. 86-97
-
-
Yang, Y.1
Xiang, P.2
Kong, J.3
Zhou, H.4
-
20
-
-
79953126288
-
On-the-fly elimination of dynamic irregularities for GPU computing
-
E. Zhang, Y. Jiang, Z. Guo, K. Tian, and X. Shen. On-the-fly Elimination of Dynamic Irregularities for GPU Computing. In Proc. 16th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pages 369-380, 2011.
-
(2011)
Proc. 16th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
, pp. 369-380
-
-
Zhang, E.1
Jiang, Y.2
Guo, Z.3
Tian, K.4
Shen, X.5