-
1
-
-
84882309498
-
Long timestep molecular dynamics on the graphical processing unit
-
James C Sweet, Ronald J Nowling, Trevor Cickovski, Christopher R Sweet, Vijay S Pande, and Jesus A Izaguirre, "Long Timestep Molecular Dynamics on the Graphical Processing Unit," Journal of Chemical Theory and Computation, vol. 9, no. 8, pp. 3267-3281, 2013.
-
(2013)
Journal of Chemical Theory and Computation
, vol.9
, Issue.8
, pp. 3267-3281
-
-
Sweet, J.C.1
Nowling, R.J.2
Cickovski, T.3
Sweet, C.R.4
Pande, V.S.5
Izaguirre, J.A.6
-
2
-
-
78651415181
-
GPU-BLAST: Using graphics processors to accelerate protein sequence alignment
-
Panagiotis D Vouzis and Nikolaos V Sahinidis, "GPU-BLAST: Using graphics processors to accelerate protein sequence alignment," Bioinformatics, vol. 27, no. 2, pp. 182-188, 2011.
-
(2011)
Bioinformatics
, vol.27
, Issue.2
, pp. 182-188
-
-
Vouzis, P.D.1
Sahinidis, N.V.2
-
3
-
-
84901824166
-
-
Agent-Directed Simulation Symposium (ADS 2014), Simulation Series, 46 # 1, Curran Associates, Inc.
-
Klaus Kofler, Gregory Davis, and Sandra Gesing, "SAMPO: An Agent-based Mosquito Point Model in OpenCL," Agent-Directed Simulation Symposium (ADS 2014), Simulation Series Vol 46 #1, pp. 36-45, Curran Associates, Inc., ISBN 9781629939469, 2014.
-
(2014)
SAMPO: An Agent-based Mosquito Point Model in OpenCL
, pp. 36-45
-
-
Kofler, K.1
Davis, G.2
Gesing, S.3
-
4
-
-
84965007676
-
-
CUDA, (http://www.nvidia.com/object/cuda-home-new.html).
-
CUDA
-
-
-
5
-
-
84965015769
-
-
OpenCL, (https://www.khronos.org/opencl/).
-
OpenCL
-
-
-
6
-
-
84965041649
-
-
OpenACC, (http://www.openacc-standard.org/).
-
OpenACC
-
-
-
7
-
-
84944903287
-
The impact of Docker containers on the performance of genomic pipelines
-
Docker
-
Docker (https://www.docker.com/). Paolo Di Tommaso, Emilio Palumbo, Maria Chatzou, Pablo Prieto, Michael L Heuer, and Cedric Notredame, "The impact of Docker containers on the performance of genomic pipelines," PeerJ PrePrints, 2015.
-
(2015)
PeerJ PrePrints
-
-
Di Tommaso, P.1
Palumbo, E.2
Chatzou, M.3
Prieto, P.4
Heuer, M.L.5
Notredame, C.6
-
8
-
-
84923686283
-
An updated performance comparison of virtual machines and linux containers
-
Wes Felter, Alexandre Ferreira, Ram Rajamony, and Juan Rubio. "An updated performance comparison of virtual machines and linux containers." technology 28 (2014): 32.
-
(2014)
Technology
, vol.28
, pp. 32
-
-
Felter, W.1
Ferreira, A.2
Rajamony, R.3
Rubio, J.4
-
10
-
-
84965039615
-
-
Growth Statistics (http://venturebeat.com/2015/04/14/docker-raises-95m-led-by-insight-venture-partners/)
-
Growth Statistics
-
-
-
11
-
-
84965039606
-
-
Eighth Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2015), Prague, January
-
Li-Wen Chang, Abdul Dakkak, Christopher I Rodrigues, Wenmei Hwu, "Tangram: a High-level Language for Performance Portable Code Synthesis," Eighth Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2015), Prague, January 2015.
-
(2015)
Tangram: A High-level Language for Performance Portable Code Synthesis
-
-
Chang, L.1
Dakkak, A.2
Rodrigues, C.I.3
Hwu, W.4
-
12
-
-
84899692998
-
A large-scale cross-architecture evaluation of thread-coarsening
-
Networking, Storage and Analysis
-
Alberto Magni, Christophe Dubach, and Michael O'Boyle, "A large-scale cross-architecture evaluation of thread-coarsening," in Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, 2013.
-
(2013)
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
-
-
Magni, A.1
Dubach, C.2
O'Boyle, M.3
-
13
-
-
84888866287
-
Parboil: A revised benchmark suite for scientific and commercial throughput computing
-
John A Stratton, Christopher Rodrigues, I-Jui Sung, Nady Obeid, Li-Wen Chang, Nasser Anssari, Geng D Liu, and Wenmei W Hwu, "Parboil: A Revised Benchmark Suite for Scientific and Commercial Throughput Computing," Center for Reliable and High-Performance Computing, 2012.
-
(2012)
Center for Reliable and High-Performance Computing
-
-
Stratton, J.A.1
Rodrigues, C.2
Sung, I.3
Obeid, N.4
Chang, L.5
Anssari, N.6
Liu, G.D.7
Hwu, W.W.8
-
15
-
-
78149276036
-
Twin peaks: A software platform for heterogeneous computing on general-purpose and graphics processors
-
Jayanth Gummaraju, Laurent Morichetti, Michael Houston, Ben Sander, Benedict R Gaster, and Bixia Zheng, "Twin peaks: a software platform for heterogeneous computing on general-purpose and graphics processors," in Parallel Architectures and Compilation Techniques, 2010, pp. 205-216.
-
(2010)
Parallel Architectures and Compilation Techniques
, pp. 205-216
-
-
Gummaraju, J.1
Morichetti, L.2
Houston, M.3
Sander, B.4
Gaster, B.R.5
Zheng, B.6
-
16
-
-
84962271091
-
Pocl: A performance-portable OpenCL implementation
-
Pekka Jääskeläinen, Carlos Sánchez de La Lama, Erik Schnetter, Kalle Raiskila, Jarmo Takala, and Heikki Berg, "pocl: A performance-portable OpenCL implementation," International Journal of Parallel Programming, pp. 1-34, 2014.
-
(2014)
International Journal of Parallel Programming
, pp. 1-34
-
-
Jääskeläinen, P.1
De La Lama, C.S.2
Schnetter, E.3
Raiskila, K.4
Takala, J.5
Berg, H.6
-
17
-
-
84859143447
-
Improving performance of OpenCL on CPUs
-
Ralf Karrenberg and Sebastian Hack, "Improving performance of OpenCL on CPUs," in Compiler Construction, 2012, pp. 1-20.
-
(2012)
Compiler Construction
, pp. 1-20
-
-
Karrenberg, R.1
Hack, S.2
-
18
-
-
84961314978
-
Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures
-
Hee-Seok Kim, Izzat El Hajj, John Stratton, Steven Lumetta, and Wen-Mei Hwu, "Locality-centric thread scheduling for bulk-synchronous programming models on CPU architectures," in Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2015, pp. 257-268.
-
(2015)
Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization
, pp. 257-268
-
-
Kim, H.1
El Hajj, I.2
Stratton, J.3
Lumetta, S.4
Hwu, W.5
-
19
-
-
84864054886
-
SnuCL: An OpenCL framework for heterogeneous CPU/GPU clusters
-
Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, and Jaejin Lee, "SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters," Proceedings of the 26th ACM international conference on Supercomputing, pp. 341-352, 2012.
-
(2012)
Proceedings of the 26th ACM International Conference on Supercomputing
, pp. 341-352
-
-
Kim, J.1
Seo, S.2
Lee, J.3
Nah, J.4
Jo, G.5
Lee, J.6
-
20
-
-
58449109179
-
MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs
-
John A Stratton, Sam S Stone, and Wen-Mei W Hwu, "MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs," Languages and Compilers for Parallel Computing, pp. 16-30, 2008.
-
(2008)
Languages and Compilers for Parallel Computing
, pp. 16-30
-
-
Stratton, J.A.1
Stone, S.S.2
Hwu, W.W.3
-
21
-
-
84937693610
-
Porple: An extensible optimizer for portable data placement on GPU
-
Guoyang Chen, Bo Wu, Dong Li, and Xipeng Shen, "Porple: An extensible optimizer for portable data placement on GPU," in Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014, pp. 88-100.
-
(2014)
Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 88-100
-
-
Chen, G.1
Wu, B.2
Li, D.3
Shen, X.4
-
22
-
-
78649824847
-
Exploiting memory access patterns to improve memory performance in data-parallel architectures
-
Byunghyun Jang, Dana Schaa, Perhaad Mistry, and David Kaeli, "Exploiting memory access patterns to improve memory performance in data-parallel architectures," IEEE Transactions on Parallel and Distributed Systems, vol. 22, pp. 105-118, 2011.
-
(2011)
IEEE Transactions on Parallel and Distributed Systems
, vol.22
, pp. 105-118
-
-
Jang, B.1
Schaa, D.2
Mistry, P.3
Kaeli, D.4
-
24
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
Clint R Whaley, Antoine Petitet, and Jack Dongarra, "Automated empirical optimizations of software and the ATLAS project," Parallel Computing, vol. 27, pp. 3-35, 2001.
-
(2001)
Parallel Computing
, vol.27
, pp. 3-35
-
-
Whaley, C.R.1
Petitet, A.2
Dongarra, J.3
-
25
-
-
1542396679
-
Spiral: A generator for platform-adapted libraries of signal processing algorithms
-
Markus Püschel, José Moura, Bryan Singer, Jianxin Xiong, Jeremy Johnson, David Padua, Manuela Veloso, and Robert Johnson, "Spiral: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms," International Journal of High Performance Computing Applications, vol. 18, pp. 21-45, 2004.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, pp. 21-45
-
-
Püschel, M.1
Moura, J.2
Singer, B.3
Xiong, J.4
Johnson, J.5
Padua, D.6
Veloso, M.7
Johnson, R.8
-
26
-
-
84877042382
-
A scalable cross-platform infrastructure for application performance tuning using hardware counters
-
Shirley Browne, Jack Dongarra, Nathan Garner, Kevin London, and Philip Mucci, "A scalable cross-platform infrastructure for application performance tuning using hardware counters," in Supercomputing, ACM/IEEE 2000 Conference, 2000, p. 42.
-
(2000)
Supercomputing, ACM/IEEE 2000 Conference
, pp. 42
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
London, K.4
Mucci, P.5
-
27
-
-
84939147992
-
A collection-oriented programming model for performance portability
-
Saurav Muralidharan, Michael Garland, Bryan Catanzaro, Albert Sidelnik, and Mary Hall, "A collection-oriented programming model for performance portability," in Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015, pp. 263-264.
-
(2015)
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 263-264
-
-
Muralidharan, S.1
Garland, M.2
Catanzaro, B.3
Sidelnik, A.4
Hall, M.5
-
28
-
-
84969824612
-
Thrust: A productivity-oriented library for CUDA
-
December
-
Nathan Bell and Jared Hoberock, "Thrust: A productivity-oriented library for CUDA," Astrophysics Source Code Library, vol. 1, December 2012.
-
(2012)
Astrophysics Source Code Library
, vol.1
-
-
Bell, N.1
Hoberock, J.2
-
29
-
-
70450227331
-
PetaBricks: A language and compiler for algorithmic choice
-
June
-
Jason Ansel, Cy Chan, Yee Lok Wong, Marek Olszewski, Qin Zhao, Alan Edelman, and Saman Amarasinghe, "PetaBricks: A Language and Compiler for Algorithmic Choice," SIGPLAN Not., vol. 44, no. 6, pp. 38-49, June 2009.
-
(2009)
SIGPLAN Not.
, vol.44
, Issue.6
, pp. 38-49
-
-
Ansel, J.1
Chan, C.2
Lok Wong, Y.3
Olszewski, M.4
Zhao, Q.5
Edelman, A.6
Amarasinghe, S.7
-
30
-
-
84905980170
-
Delite: A compiler architecture for performance-oriented embedded domain-specific languages
-
Arvind Sujeeth, Kevin Brown, Hyoukjoong Lee, Tiark Rompf, Hassan Chafi, Martin Odersky, and Kunle Olukotun, "Delite: A compiler architecture for performance-oriented embedded domain-specific languages," ACM Transactions on Embedded Computing Systems (TECS), vol. 13, p. 134, 2014.
-
(2014)
ACM Transactions on Embedded Computing Systems (TECS)
, vol.13
, pp. 134
-
-
Sujeeth, A.1
Brown, K.2
Lee, H.3
Rompf, T.4
Chafi, H.5
Odersky, M.6
Olukotun, K.7
-
31
-
-
84988864524
-
An agent-based model of the population dynamics of Anopheles gambiae
-
S. M. Niaz Arifin, Ying Zhou, Gregory J. Davis, James E. Gentile, Gregory R. Madey, and Frank H. Collins, "An agent-based model of the population dynamics of Anopheles gambiae," Malaria Journal, vol. 13, no. 1, p. 424, 2014.
-
(2014)
Malaria Journal
, vol.13
, Issue.1
, pp. 424
-
-
Niaz Arifin, S.M.1
Zhou, Y.2
Davis, G.J.3
Gentile, J.E.4
Madey, G.R.5
Collins, F.H.6