-
2
-
-
84858778295
-
Quantifying numa and contention effects in multi-gpu systems
-
ser. GPGPU-4. New York, NY, USA: ACM
-
K. Spafford, J. S. Meredith, and J. S. Vetter, "Quantifying numa and contention effects in multi-gpu systems," in Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, ser. GPGPU-4. New York, NY, USA: ACM, 2011, pp. 11:1-11:7.
-
(2011)
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units
-
-
Spafford, K.1
Meredith, J.S.2
Vetter, J.S.3
-
3
-
-
0025467711
-
A bridging model for parallel computation
-
L. G. Valiant, "A bridging model for parallel computation," Commun. ACM, 1990.
-
(1990)
Commun. ACM
-
-
Valiant, L.G.1
-
4
-
-
84863457471
-
Characterization and transformation of unstructured control flow in gpu applications
-
ACM, June
-
H. Wu, G. Diamos, S. Li, and S. Yalamanchili, "Characterization and transformation of unstructured control flow in gpu applications," in The First International Workshop on Characterizing Applications for Heterogeneous Exascale Systems. ACM, June 2011.
-
(2011)
The First International Workshop on Characterizing Applications for Heterogeneous Exascale Systems
-
-
Wu, H.1
Diamos, G.2
Li, S.3
Yalamanchili, S.4
-
5
-
-
0021458622
-
Chap - A simd graphics processor
-
A. Levinthal and T. Porter, "Chap - a simd graphics processor," SIGGRAPH Comput. Graph., vol. 18, no. 3, pp. 77-82, 1984.
-
(1984)
SIGGRAPH Comput. Graph.
, vol.18
, Issue.3
, pp. 77-82
-
-
Levinthal, A.1
Porter, T.2
-
6
-
-
47349104432
-
Dynamic warp formation and scheduling for efficient gpu control flow
-
W. W. L. Fung, I. Sham, G. Yuan, and T. M. Aamodt, "Dynamic warp formation and scheduling for efficient gpu control flow," in Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, 2007.
-
Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, 2007
-
-
Fung, W.W.L.1
Sham, I.2
Yuan, G.3
Aamodt, T.M.4
-
8
-
-
77952579552
-
Demystifying gpu microarchitecture through microbenchmarking
-
H. Wong, M.-M. Papadopoulou, M. Sadooghi-Alvandi, and A. Moshovos, "Demystifying gpu microarchitecture through microbenchmarking," in Performance Analysis of Systems Software (ISPASS), 2010 IEEE International Symposium on, 2010, pp. 235-246.
-
Performance Analysis of Systems Software (ISPASS), 2010 IEEE International Symposium On, 2010
, pp. 235-246
-
-
Wong, H.1
Papadopoulou, M.-M.2
Sadooghi-Alvandi, M.3
Moshovos, A.4
-
9
-
-
84863364461
-
Real-time adaptive gpu multi-agent path planning
-
Wen-mei W. Hwu, Ed. Morgan Kaufmann, Sep.
-
Wen-mei W. Hwu, "Real-time adaptive gpu multi-agent path planning," in GPU Computing Gems, Wen-mei W. Hwu, Ed. Morgan Kaufmann, Sep. 2010, vol. 2.
-
(2010)
GPU Computing Gems
, vol.2
-
-
Hwu, W.W.1
-
10
-
-
69949100622
-
Optimizing data intensive gpgpu computations for dna sequence alignment
-
August
-
C. Trapnell and M. C. Schatz, "Optimizing data intensive gpgpu computations for dna sequence alignment," Parallel Computing, vol. 35, pp. 429-440, August 2009.
-
(2009)
Parallel Computing
, vol.35
, pp. 429-440
-
-
Trapnell, C.1
Schatz, M.C.2
-
11
-
-
70350729133
-
Accelerating monte carlo simulations of photon transport in a voxelized geometry using a massively parallel gpu
-
A. Badal and A. Badano, "Accelerating monte carlo simulations of photon transport in a voxelized geometry using a massively parallel gpu," Medical Physics 36, 2009.
-
(2009)
Medical Physics
, vol.36
-
-
Badal, A.1
Badano, A.2
-
12
-
-
79952149071
-
Gpu implementation of extended gaussian mixture model for background subtraction
-
V. Pham, P. Vo, H. V. Thanh, and B. L. Hoai, "Gpu implementation of extended gaussian mixture model for background subtraction," International Conference on Computing and Telecommunication Technologies, 2010.
-
International Conference on Computing and Telecommunication Technologies, 2010
-
-
Pham, V.1
Vo, P.2
Thanh, H.V.3
Hoai, B.L.4
-
13
-
-
70749119824
-
Monte carlo simulation of photon migration in 3d turbid media accelerated by graphics processing units
-
17
-
Q. Fang and D. A. Boas, "Monte carlo simulation of photon migration in 3d turbid media accelerated by graphics processing units," Optical Express 17, vol. 17, pp. 20178-20190.
-
Optical Express
, vol.17
, pp. 20178-20190
-
-
Fang, Q.1
Boas, D.A.2
-
15
-
-
77956373685
-
Optix: A general purpose ray tracing engine
-
13, July
-
S. G. Parker, J. Bigler, A. Dietrich, H. Friedrich, J. Hoberock, D. Luebke, D. McAllister, M. McGuire, K. Morley, A. Robison, and M. Stich, "Optix: a general purpose ray tracing engine," ACM Transactions on Graphics, vol. 29, pp. 66:1-66:13, July 2010.
-
(2010)
ACM Transactions on Graphics
, vol.29
-
-
Parker, S.G.1
Bigler, J.2
Dietrich, A.3
Friedrich, H.4
Hoberock, J.5
Luebke, D.6
McAllister, D.7
McGuire, M.8
Morley, K.9
Robison, A.10
Stich, M.11
-
16
-
-
78149233155
-
Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneous systems, in
-
G. Diamos, A. Kerr, S. Yalamanchili, and N. Clark, "Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneous systems," in Proceedings of PACT '10, 2010.
-
Proceedings of PACT '10, 2010
-
-
Diamos, G.1
Kerr, A.2
Yalamanchili, S.3
Clark, N.4
-
17
-
-
70649104826
-
A characterization and analysis of ptx kernels
-
A. Kerr, G. Diamos, and S. Yalamanchili, "A characterization and analysis of ptx kernels," in IISWC09: IEEE International Symposium on Workload Characterization, Austin, TX, USA, October 2009.
-
IISWC09: IEEE International Symposium on Workload Characterization, Austin, TX, USA, October 2009
-
-
Kerr, A.1
Diamos, G.2
Yalamanchili, S.3
-
18
-
-
0015330108
-
The illiac iv system
-
apr.
-
W. Bouknight, S. Denenberg, D. McIntyre, J. Randall, A. Sameh, and D. Slotnick, "The illiac iv system," Proceedings of the IEEE, vol. 60, no. 4, pp. 369-388, apr. 1972.
-
(1972)
Proceedings of the IEEE
, vol.60
, Issue.4
, pp. 369-388
-
-
Bouknight, W.1
Denenberg, S.2
McIntyre, D.3
Randall, J.4
Sameh, A.5
Slotnick, D.6
-
21
-
-
77954976292
-
Dynamic warp subdivision for integrated branch and memory divergence tolerance
-
ser. ISCA '10. New York, NY, USA: ACM
-
J. Meng, D. Tarjan, and K. Skadron, "Dynamic warp subdivision for integrated branch and memory divergence tolerance," in Proceedings of the 37th annual international symposium on Computer architecture, ser. ISCA '10. New York, NY, USA: ACM, 2010, pp. 235-246.
-
(2010)
Proceedings of the 37th Annual International Symposium on Computer Architecture
, pp. 235-246
-
-
Meng, J.1
Tarjan, D.2
Skadron, K.3
-
23
-
-
78049504879
-
System and method for managing divergent threads in a simd architecture
-
Patent US 7 353 369, April
-
B. W. Coon and E. J. Lindholm, "System and method for managing divergent threads in a simd architecture," Patent US 7 353 369, April, 2008.
-
(2008)
-
-
Coon, B.W.1
Lindholm, E.J.2
|