-
1
-
-
84960162408
-
-
"Global flight network." [Online]. Available: http://www.visualizing.org/datasets/global-flights-network
-
Global Flight Network
-
-
-
2
-
-
84860351763
-
The case for GPGPU spatial multitasking
-
J. Adriaens, K. Compton, N. S. Kim, and M. Schulte, "The case for GPGPU spatial multitasking," in High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on, 2012.
-
(2012)
High Performance Computer Architecture (HPCA), 2012 IEEE 18th International Symposium on
-
-
Adriaens, J.1
Compton, K.2
Kim, N.S.3
Schulte, M.4
-
3
-
-
41249087856
-
General purpose molecular dynamics simulations fully implemented on graphics processing units
-
J. A. Anderson, C. D. Lorenz, and A. Travesset, "General purpose molecular dynamics simulations fully implemented on graphics processing units," Journal of Computational Physics, vol. 227, no. 10, pp. 5342-5359, 2008.
-
(2008)
Journal of Computational Physics
, vol.227
, Issue.10
, pp. 5342-5359
-
-
Anderson, J.A.1
Lorenz, C.D.2
Travesset, A.3
-
5
-
-
70349169075
-
Analyzing CUDA workloads using a detailed GPU simulator
-
April
-
A. Bakhoda, G. Yuan, W. Fung, H. Wong, and T. Aamodt, "Analyzing CUDA workloads using a detailed GPU simulator," in 2009 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2009), April 2009, pp. 163-174.
-
(2009)
2009 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2009)
, pp. 163-174
-
-
Bakhoda, A.1
Yuan, G.2
Fung, W.3
Wong, H.4
Aamodt, T.5
-
6
-
-
84867546922
-
Nested data-parallelism on the GPU
-
ACM
-
L. Bergstrom and J. Reppy, "Nested data-parallelism on the GPU," in ACM SIGPLAN Notices, vol. 47, no. 9. ACM, 2012, pp. 247-258.
-
(2012)
ACM SIGPLAN Notices
, vol.47
, Issue.9
, pp. 247-258
-
-
Bergstrom, L.1
Reppy, J.2
-
7
-
-
84873458159
-
A quantitative study of irregular programs on GPUs
-
IEEE
-
M. Burtscher, R. Nasre, and K. Pingali, "A quantitative study of irregular programs on GPUs," in Workload Characterization (IISWC), 2012 IEEE International Symposium on. IEEE, 2012, pp. 141-151.
-
(2012)
Workload Characterization (IISWC), 2012 IEEE International Symposium on
, pp. 141-151
-
-
Burtscher, M.1
Nasre, R.2
Pingali, K.3
-
8
-
-
84858427151
-
An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm
-
M. Burtscher and K. Pingali, "An efficient CUDA implementation of the tree-based Barnes Hut n-body algorithm," GPU Computing Gems Emerald Edition, p. 75, 2011.
-
(2011)
GPU Computing Gems Emerald Edition
, p. 75
-
-
Burtscher, M.1
Pingali, K.2
-
9
-
-
84893628986
-
Pannotia: Understanding irregular GPGPU graph applications
-
IEEE
-
S. Che, B. M. Beckmann, S. K. Reinhardt, and K. Skadron, "Pannotia: Understanding irregular GPGPU graph applications," in Workload Characterization (IISWC), 2013 IEEE International Symposium on. IEEE, 2013, pp. 185-195.
-
(2013)
Workload Characterization (IISWC), 2013 IEEE International Symposium on
, pp. 185-195
-
-
Che, S.1
Beckmann, B.M.2
Reinhardt, S.K.3
Skadron, K.4
-
11
-
-
84960108780
-
-
US Patent
-
B. W. Coon, J. R. Nickolls, J. E. Lindholm, R. J. Stoll, N. Wang, and J. H. Choquette, "Thread group scheduler for computing on a parallel thread processor," US Patent 8,732,713, 2014.
-
(2014)
Thread Group Scheduler for Computing on a Parallel Thread Processor
-
-
Coon, B.W.1
Nickolls, J.R.2
Lindholm, J.E.3
Stoll, R.J.4
Wang, N.5
Choquette, J.H.6
-
12
-
-
84875193084
-
Relational algorithms for multi-bulk-synchronous processors
-
February
-
G. Diamos, H. Wu, J. Wang, A. Lele, and S. Yalamanchili, "Relational algorithms for multi-bulk-synchronous processors," in 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'13), February 2013.
-
(2013)
18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'13)
-
-
Diamos, G.1
Wu, H.2
Wang, J.3
Lele, A.4
Yalamanchili, S.5
-
13
-
-
47349104432
-
Dynamic warp formation and scheduling for efficient GPU control flow
-
IEEE Computer Society
-
W. W. Fung, I. Sham, G. Yuan, and T. M. Aamodt, "Dynamic warp formation and scheduling for efficient GPU control flow," in Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 2007, pp. 407-420.
-
(2007)
Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 407-420
-
-
Fung, W.W.1
Sham, I.2
Yuan, G.3
Aamodt, T.M.4
-
14
-
-
84865327496
-
Can GPGPU programming be liberated from the data-parallel bottleneck?
-
B. R. Gaster and L. Howes, "Can GPGPU programming be liberated from the data-parallel bottleneck?" Computer, vol. 45, no. 8, pp. 42-52, 2012.
-
(2012)
Computer
, vol.45
, Issue.8
, pp. 42-52
-
-
Gaster, B.R.1
Howes, L.2
-
15
-
-
84870690379
-
A study of persistent threads style GPU programming for GPGPU workloads
-
K. Gupta, J. A. Stuart, and J. D. Owens, "A study of persistent threads style GPU programming for GPGPU workloads," in Innovative Parallel Computing (InPar), 2012. IEEE, 2012, pp. 1-14.
-
(2012)
Innovative Parallel Computing (InPar), 2012. IEEE
, pp. 1-14
-
-
Gupta, K.1
Stuart, J.A.2
Owens, J.D.3
-
16
-
-
85015559680
-
An algorithmic framework for performing collaborative filtering
-
J. L. Herlocker, J. A. Konstan, A. Borchers, and J. Riedl, "An algorithmic framework for performing collaborative filtering," in Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, 1999.
-
(1999)
Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
-
-
Herlocker, J.L.1
Konstan, J.A.2
Borchers, A.3
Riedl, J.4
-
19
-
-
70349191933
-
Lonestar: A suite of parallel irregular programs
-
M. Kulkarni, M. Burtscher, C. Casçaval, and K. Pingali, "Lonestar: A suite of parallel irregular programs," in ISPASS '09: IEEE International Symposium on Performance Analysis of Systems and Software, 2009.
-
(2009)
ISPASS '09: IEEE International Symposium on Performance Analysis of Systems and Software
-
-
Kulkarni, M.1
Burtscher, M.2
Casçaval, C.3
Pingali, K.4
-
20
-
-
85047004205
-
Locality-aware mapping of nested parallel patterns on GPUs
-
H. Lee, K. Brown, A. Sujeeth, T. Rompf, and K. Olukotun, "Locality-aware mapping of nested parallel patterns on GPUs," in the 47th International Symposium on Microarchitecture (MICRO '14), 2014.
-
(2014)
The 47th International Symposium on Microarchitecture (MICRO '14)
-
-
Lee, H.1
Brown, K.2
Sujeeth, A.3
Rompf, T.4
Olukotun, K.5
-
21
-
-
85019691440
-
Testing intrusion detection systems: A critique of the 1998 and 1999 DARPA intrusion detection system evaluations as performed by Lincoln Laboratory
-
J. McHugh, "Testing intrusion detection systems: A critique of the 1998 and 1999 DARPA intrusion detection system evaluations as performed by Lincoln Laboratory," ACM Transactions on Information and System Security, vol. 3, no. 4, pp. 262-294, 2000.
-
(2000)
ACM Transactions on Information and System Security
, vol.3
, Issue.4
, pp. 262-294
-
-
McHugh, J.1
-
22
-
-
77954976292
-
Dynamic warp subdivision for integrated branch and memory divergence tolerance
-
J. Meng, D. Tarjan, and K. Skadron, "Dynamic warp subdivision for integrated branch and memory divergence tolerance," in ACM SIGARCH Computer Architecture News, vol. 38, no. 3, 2010, pp. 235-246.
-
(2010)
ACM SIGARCH Computer Architecture News
, vol.38
, Issue.3
, pp. 235-246
-
-
Meng, J.1
Tarjan, D.2
Skadron, K.3
-
23
-
-
84858391043
-
Scalable GPU graph traversal
-
D. Merrill, M. Garland, and A. Grimshaw, "Scalable GPU graph traversal," in 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'12, 2012.
-
(2012)
17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP'12
-
-
Merrill, D.1
Garland, M.2
Grimshaw, A.3
-
25
-
-
84893303174
-
GPU accelerated item-based collaborative filtering for big-data applications
-
IEEE
-
C. H. Nadungodage, Y. Xia, J. J. Lee, M. Lee, and C. S. Park, "GPU accelerated item-based collaborative filtering for big-data applications," in Big Data, 2013 IEEE International Conference on. IEEE, 2013, pp. 175-180.
-
(2013)
Big Data, 2013 IEEE International Conference on
, pp. 175-180
-
-
Nadungodage, C.H.1
Xia, Y.2
Lee, J.J.3
Lee, M.4
Park, C.S.5
-
26
-
-
84960162412
-
-
NVIDIA
-
NVIDIA, "Hyperq sample," 2012.
-
(2012)
Hyperq Sample
-
-
-
29
-
-
84905454859
-
Fine-grain task aggregation and coordination on GPUs
-
M. S. Orr, B. M. Beckmann, S. K. Reinhardt, and D. A. Wood, "Fine-grain task aggregation and coordination on GPUs," in Proceedings of the 41st Annual International Symposium on Computer Architecture, ser. ISCA '14, 2014.
-
(2014)
Proceedings of the 41st Annual International Symposium on Computer Architecture, Ser. ISCA '14
-
-
Orr, M.S.1
Beckmann, B.M.2
Reinhardt, S.K.3
Wood, D.A.4
-
30
-
-
77956373685
-
OptiX: A general purpose ray tracing engine
-
ACM
-
S. G. Parker, J. Bigler, A. Dietrich, H. Friedrich, J. Hoberock, D. Luebke, D. McAllister, M. McGuire, K. Morley, A. Robison et al., "OptiX: A general purpose ray tracing engine," in ACM Transactions on Graphics (TOG), vol. 29, no. 4. ACM, 2010, p. 66.
-
(2010)
ACM Transactions on Graphics (TOG)
, vol.29
, Issue.4
, p. 66
-
-
Parker, S.G.1
Bigler, J.2
Dietrich, A.3
Friedrich, H.4
Hoberock, J.5
Luebke, D.6
McAllister, D.7
McGuire, M.8
Morley, K.9
Robison, A.10
-
33
-
-
77954050570
-
Performance study of mapping irregular computations on GPUs
-
IEEE
-
S. Solomon and P. Thulasiraman, "Performance study of mapping irregular computations on GPUs," in Parallel & Distributed Processing, Workshops and PhD Forum (IPDPSW), 2010 IEEE International Symposium on. IEEE, 2010, pp. 1-8.
-
(2010)
Parallel & Distributed Processing, Workshops and PhD Forum (IPDPSW), 2010 IEEE International Symposium on
, pp. 1-8
-
-
Solomon, S.1
Thulasiraman, P.2
-
35
-
-
84905509992
-
Enabling preemptive multiprogramming on GPUs
-
I. Tanasic, I. Gelado, J. Cabezas, A. Ramirez, N. Navarro, and M. Valero, "Enabling preemptive multiprogramming on GPUs," in Proceedings of the 41st Annual International Symposium on Computer Architecture, 2014.
-
(2014)
Proceedings of the 41st Annual International Symposium on Computer Architecture
-
-
Tanasic, I.1
Gelado, I.2
Cabezas, J.3
Ramirez, A.4
Navarro, N.5
Valero, M.6
-
37
-
-
80052350460
-
Gregex: GPU based high speed regular expression matching engine
-
IEEE
-
L. Wang, S. Chen, Y. Tang, and J. Su, "Gregex: GPU based high speed regular expression matching engine," in Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS), 2011 Fifth International Conference on. IEEE, 2011, pp. 366-370.
-
(2011)
Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS), 2011 Fifth International Conference on
, pp. 366-370
-
-
Wang, L.1
Chen, S.2
Tang, Y.3
Su, J.4
-
38
-
-
84875184822
-
Kernel weaver: Automatically fusing database primitives for efficient GPU computation
-
H. Wu, G. Diamos, S. Cadambi, and S. Yalamanchili, "Kernel weaver: Automatically fusing database primitives for efficient GPU computation," in Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012.
-
(2012)
Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
-
-
Wu, H.1
Diamos, G.2
Cadambi, S.3
Yalamanchili, S.4
-
40
-
-
79953126288
-
On-the-fly elimination of dynamic irregularities for GPU computing
-
E. Z. Zhang, Y. Jiang, Z. Guo, K. Tian, and X. Shen, "On-the-fly elimination of dynamic irregularities for GPU computing," in ACM SIGARCH Computer Architecture News, vol. 39, no. 1, 2011, pp. 369-380.
-
(2011)
ACM SIGARCH Computer Architecture News
, vol.39
, Issue.1
, pp. 369-380
-
-
Zhang, E.Z.1
Jiang, Y.2
Guo, Z.3
Tian, K.4
Shen, X.5