-
1
-
-
84884825354
-
-
[Online]. Available
-
NVIDIA, "GPU Accelerated Applications," 2012. [Online]. Available: http://www.nvidia.com/object/gpu-accelerated-applications.html
-
(2012)
GPU Accelerated Applications
-
-
-
2
-
-
80052312080
-
Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community
-
J. S. Vetter, R. Glassbrook, J. Dongarra, K. Schwan, B. Loftis, S. McNally, J. Meredith, J. Rogers, P. Roth, K. Spafford, and S. Yalamanchili, "Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community," Computing in Science and Engineering, vol. 13, 2011.
-
(2011)
Computing in Science and Engineering
, vol.13
-
-
Vetter, J.S.1
Glassbrook, R.2
Dongarra, J.3
Schwan, K.4
Loftis, B.5
McNally, S.6
Meredith, J.7
Rogers, J.8
Roth, P.9
Spafford, K.10
Yalamanchili, S.11
-
3
-
-
77956939092
-
An integrative approach for in silico glioma research
-
L. A. D. Cooper, J. Kong, D. A. Gutman, F. Wang, S. R. Cholleti, T. C. Pan, P. M. Widener, A. Sharma, T. Mikkelsen, A. E. Flanders, D. L. Rubin, E. G. V. Meir, T. M. Kurc, C. S. Moreno, D. J. Brat, and J. H. Saltz, "An integrative approach for in silico glioma research," IEEE Trans Biomed Eng., vol. 57, no. 10, pp. 2617-2621, 2010.
-
(2010)
IEEE Trans Biomed Eng
, vol.57
, Issue.10
, pp. 2617-2621
-
-
Cooper, L.A.D.1
Kong, J.2
Gutman, D.A.3
Wang, F.4
Cholleti, S.R.5
Pan, T.C.6
Widener, P.M.7
Sharma, A.8
Mikkelsen, T.9
Flanders, A.E.10
Rubin, D.L.11
Meir, E.G.V.12
Kurc, T.M.13
Moreno, C.S.14
Brat, D.J.15
Saltz, J.H.16
-
5
-
-
84866880754
-
-
11 February [Online]. Available
-
NVIDIA, NVIDIA Performance Primitives(NPP), 11 February 2011. [Online]. Available: http://developer.nvidia.com/npp
-
(2011)
NVIDIA Performance Primitives(NPP)
-
-
-
6
-
-
79960214735
-
Advances on watershed processing on GPU architecture
-
Proceedings of the 10th International Conference on Mathematical Morphology, ser.
-
A. Körbes, G. B. Vitor, R. de Alencar Lotufo, and J. V. Ferreira, "Advances on watershed processing on GPU architecture," in Proceedings of the 10th International Conference on Mathematical Morphology, ser. ISMM'11, 2011.
-
(2011)
ISMM'11
-
-
Körbes, A.1
Vitor, G.B.2
De Alencar Lotufo, R.3
Ferreira, J.V.4
-
7
-
-
0027576716
-
Morphological grayscale reconstruction in image analysis: Applications and efficient algorithms
-
DOI 10.1109/83.217222
-
L. Vincent, "Morphological grayscale reconstruction in image analysis: Applications and efficient algorithms," IEEE Transactions on Image Processing, vol. 2, pp. 176-201, 1993. (Pubitemid 23692871)
-
(1993)
IEEE Transactions on Image Processing
, vol.2
, Issue.2
, pp. 176-201
-
-
Vincent, L.1
-
8
-
-
84884860785
-
A Fast Parallel Implementation of Queue-based Morphological Reconstruction using GPUs
-
Emory University, January
-
G. Teodoro, T. Pan, T. M. Kurc, L. Cooper, J. Kong, and J. H. Saltz, "A Fast Parallel Implementation of Queue-based Morphological Reconstruction using GPUs," Emory University, Center for Comprehensive Informatics Technical Report CCI-TR-2012-2, January 2012.
-
(2012)
Center for Comprehensive Informatics Technical Report CCI-TR-2012-2
-
-
Teodoro, G.1
Pan, T.2
Kurc, T.M.3
Cooper, L.4
Kong, J.5
Saltz, J.H.6
-
9
-
-
84884853346
-
Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines
-
to appear
-
G. Teodoro, T. Pan, T. Kurc, J. Kong, L. Cooper, and J. Saltz, "Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines," Parallel Computing, 2013, to appear.
-
(2013)
Parallel Computing
-
-
Teodoro, G.1
Pan, T.2
Kurc, T.3
Kong, J.4
Cooper, L.5
Saltz, J.6
-
10
-
-
85127625501
-
Efficient Computation of Morphological Greyscale Reconstruction
-
MEMICS, ser. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany
-
P. Karas, "Efficient Computation of Morphological Greyscale Reconstruction," in MEMICS, ser. OASICS, vol. 16. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany, 2010.
-
(2010)
OASICS
, vol.16
-
-
Karas, P.1
-
12
-
-
84879777611
-
A Study on Connected Components Labeling algorithms using GPUs
-
V. M. A. Oliveira and R. de Alencar Lotufo, "A Study on Connected Components Labeling algorithms using GPUs," in SIBGRAPI, 2010.
-
(2010)
SIBGRAPI
-
-
Oliveira, V.M.A.1
De Alencar Lotufo, R.2
-
13
-
-
84866883109
-
Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems
-
G. Teodoro, T. M. Kurc, T. Pan, L. A. Cooper, J. Kong, P. Widener, and J. H. Saltz, "Accelerating Large Scale Image Analyses on Parallel, CPU-GPU Equipped Systems," in 26th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2012, pp. 1093-1104.
-
26th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2012
, pp. 1093-1104
-
-
Teodoro, G.1
Kurc, T.M.2
Pan, T.3
Cooper, L.A.4
Kong, J.5
Widener, P.6
Saltz, J.H.7
-
14
-
-
77957759721
-
Merge: A programming model for heterogeneous multi-core systems
-
DOI 10.1145/1346281.1346318, ASPLOS XIII - Thirteenth International Conference on Architectural Support for Programming Languages and Operating Systems
-
M. D. Linderman, J. D. Collins, H. Wang, and T. H. Meng, "Merge: a programming model for heterogeneous multi-core systems," SIGPLAN Not., vol. 43, no. 3, pp. 287-296, 2008. (Pubitemid 351585414)
-
(2008)
International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS
, pp. 287-296
-
-
Linderman, M.D.1
Collins, J.D.2
Wang, H.3
Meng, T.H.4
-
15
-
-
63549097654
-
Mars: A MapReduce Framework on Graphics Processors
-
B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang, "Mars: A MapReduce Framework on Graphics Processors," in Parallel Architectures and Compilation Techniques, 2008.
-
(2008)
Parallel Architectures and Compilation Techniques
-
-
He, B.1
Fang, W.2
Luo, Q.3
Govindaraju, N.K.4
Wang, T.5
-
17
-
-
79959904195
-
Automatic CPU-GPU communication management and optimization
-
Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, ser.
-
T. B. Jablin, P. Prabhu, J. A. Jablin, N. P. Johnson, S. R. Beard, and D. I. August, "Automatic CPU-GPU communication management and optimization," in Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, ser. PLDI '11, 2011, pp. 142-151.
-
(2011)
PLDI '11
, pp. 142-151
-
-
Jablin, T.B.1
Prabhu, P.2
Jablin, J.A.3
Johnson, N.P.4
Beard, S.R.5
August, D.I.6
-
18
-
-
70350641505
-
Starpu: A unified platform for task scheduling on heterogeneous multicore architectures
-
C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier, "Starpu: A unified platform for task scheduling on heterogeneous multicore architectures," in Euro-Par '09: Proceedings of the 15th International Euro-Par Conference on Parallel Processing, 2009, pp. 863-874.
-
Euro-Par '09: Proceedings of the 15th International Euro-Par Conference on Parallel Processing, 2009
, pp. 863-874
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
Wacrenier, P.-A.4
-
19
-
-
57349153933
-
Harmony: An execution model and runtime for heterogeneous many core systems
-
Proceedings of the 17th international symposium on High performance distributed computing, ser. New York, NY, USA: ACM
-
G. F. Diamos and S. Yalamanchili, "Harmony: an execution model and runtime for heterogeneous many core systems," in Proceedings of the 17th international symposium on High performance distributed computing, ser. HPDC '08. New York, NY, USA: ACM, 2008, pp. 197-200.
-
(2008)
HPDC '08
, pp. 197-200
-
-
Diamos, G.F.1
Yalamanchili, S.2
-
20
-
-
72049125355
-
Coordinating the use of GPU and CPU for improving performance of compute intensive applications
-
G. Teodoro, R. Sachetto, O. Sertel, M. Gurcan, W. M. Jr., U. Catalyurek, and R. Ferreira, "Coordinating the use of GPU and CPU for improving performance of compute intensive applications," in IEEE Cluster, 2009, pp. 1-10.
-
(2009)
IEEE Cluster
, pp. 1-10
-
-
Teodoro, G.1
Sachetto, R.2
Sertel, O.3
Gurcan, M.4
M Jr., W.5
Catalyurek, U.6
Ferreira, R.7
-
21
-
-
70450029523
-
A framework for efficient and scalable execution of domain-specific templates on GPUs
-
N. Sundaram, A. Raghunathan, and S. T. Chakradhar, "A framework for efficient and scalable execution of domain-specific templates on GPUs," in IPDPS '09: Proceedings of the 2009 IEEE International Symposium on Parallel and Distributed Processing, 2009, pp. 1-12.
-
IPDPS '09: Proceedings of the 2009 IEEE International Symposium on Parallel and Distributed Processing, 2009
, pp. 1-12
-
-
Sundaram, N.1
Raghunathan, A.2
Chakradhar, S.T.3
-
22
-
-
78650028532
-
Run-time optimizations for replicated dataflows on heterogeneous environments
-
G. Teodoro, T. D. R. Hartley, U. Catalyurek, and R. Ferreira, "Run-time optimizations for replicated dataflows on heterogeneous environments," in Proc. of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC), 2010, pp. 13-24.
-
Proc. of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC), 2010
, pp. 13-24
-
-
Teodoro, G.1
Hartley, T.D.R.2
Catalyurek, U.3
Ferreira, R.4
-
23
-
-
80955167923
-
Performance Portability of a GPU Enabled Factorization with the DAGuE Framework
-
G. Bosilca, A. Bouteiller, T. Herault, P. Lemarinier, N. Saengpatsa, S. Tomov, and J. Dongarra, "Performance Portability of a GPU Enabled Factorization with the DAGuE Framework," in 2011 IEEE International Conference on Cluster Computing (CLUSTER), sept. 2011, pp. 395-402.
-
2011 IEEE International Conference on Cluster Computing (CLUSTER), Sept. 2011
, pp. 395-402
-
-
Bosilca, G.1
Bouteiller, A.2
Herault, T.3
Lemarinier, P.4
Saengpatsa, N.5
Tomov, S.6
Dongarra, J.7
-
24
-
-
77954709868
-
Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations
-
ACM
-
V. Ravi, W. Ma, D. Chiu, and G. Agrawal, "Compiler and runtime support for enabling generalized reduction computations on heterogeneous parallel configurations," in Proceedings of the 24th ACM International Conference on Supercomputing. ACM, 2010, p. 137146.
-
(2010)
Proceedings of the 24th ACM International Conference on Supercomputing
, pp. 137146
-
-
Ravi, V.1
Ma, W.2
Chiu, D.3
Agrawal, G.4
-
25
-
-
79952786877
-
Automatic dataflow application tuning for heterogeneous systems
-
IEEE
-
T. D. R. Hartley, E. Saule, and Ü. V. Çatalyürek, "Automatic dataflow application tuning for heterogeneous systems," in HiPC. IEEE, 2010, pp. 1-10.
-
(2010)
HiPC
, pp. 1-10
-
-
Hartley, T.D.R.1
Saule, E.2
Çatalyürek, Ü.V.3
-
26
-
-
84858060473
-
Porting irregular reductions on heterogeneous CPU-GPU configurations
-
X. Huo, V. Ravi, and G. Agrawal, "Porting irregular reductions on heterogeneous CPU-GPU configurations," in 18th International Conference on High Performance Computing (HiPC), dec. 2011, pp. 1-10.
-
18th International Conference on High Performance Computing (HiPC), Dec. 2011
, pp. 1-10
-
-
Huo, X.1
Ravi, V.2
Agrawal, G.3
-
27
-
-
67650081010
-
OpenMP to GPGPU: A compiler framework for automatic translation and optimization
-
S. Lee, S.-J. Min, and R. Eigenmann, "OpenMP to GPGPU: a compiler framework for automatic translation and optimization," in PPoPP '09: Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, 2009, pp. 101-110.
-
PPoPP '09: Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009
, pp. 101-110
-
-
Lee, S.1
Min, S.-J.2
Eigenmann, R.3
-
28
-
-
82655162782
-
Ptask: Operating system abstractions to manage gpus as compute devices
-
Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles, ser. New York, NY, USA: ACM
-
C. J. Rossbach, J. Currey, M. Silberstein, B. Ray, and E. Witchel, "Ptask: operating system abstractions to manage gpus as compute devices," in Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles, ser. SOSP '11. New York, NY, USA: ACM, 2011, pp. 233-248.
-
(2011)
SOSP '11
, pp. 233-248
-
-
Rossbach, C.J.1
Currey, J.2
Silberstein, M.3
Ray, B.4
Witchel, E.5
-
29
-
-
84866856745
-
Productive Programming of GPU Clusters with OmpSs
-
J. Bueno, J. Planas, A. Duran, R. Badia, X. Martorell, E. Ayguade, and J. Labarta, "Productive Programming of GPU Clusters with OmpSs," in 2012 IEEE 26th International Parallel Distributed Processing Symposium (IPDPS), may 2012, pp. 557-568.
-
2012 IEEE 26th International Parallel Distributed Processing Symposium (IPDPS), May 2012
, pp. 557-568
-
-
Bueno, J.1
Planas, J.2
Duran, A.3
Badia, R.4
Martorell, X.5
Ayguade, E.6
Labarta, J.7
-
30
-
-
84861790553
-
Optimizing dataflow applications on heterogeneous environments
-
G. Teodoro, T. Hartley, U. Catalyurek, and R. Ferreira, "Optimizing dataflow applications on heterogeneous environments," Cluster Computing, vol. 15, pp. 125-144, 2012.
-
(2012)
Cluster Computing
, vol.15
, pp. 125-144
-
-
Teodoro, G.1
Hartley, T.2
Catalyurek, U.3
Ferreira, R.4
-
31
-
-
84867630228
-
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators
-
The 19th European MPI Users' Group Meeting (EuroMPI 2012), ser. S. B. Jesper Larsson Träff and J. Dongarra, Eds., Vienna, Autriche: Springer
-
C. Augonnet, O. Aumage, N. Furmento, R. Namyst, and S. Thibault, "StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators," in The 19th European MPI Users' Group Meeting (EuroMPI 2012), ser. LNCS, S. B. Jesper Larsson Träff and J. Dongarra, Eds., vol. 7490. Vienna, Autriche: Springer, 2012.
-
(2012)
LNCS
, vol.7490
-
-
Augonnet, C.1
Aumage, O.2
Furmento, N.3
Namyst, R.4
Thibault, S.5
-
32
-
-
0032680295
-
Cluster I/O with River: Making the Fast Case Common
-
R. H. Arpaci-Dusseau, E. Anderson, N. Treuhaft, D. E. Culler, J. M. Hellerstein, D. A. Patterson, and K. Yelick, "Cluster I/O with River: Making the Fast Case Common," in IOPADS '99: Input/Output for Parallel and Distributed Systems, Atlanta, GA, May 1999.
-
IOPADS '99: Input/Output for Parallel and Distributed Systems, Atlanta, GA, May 1999
-
-
Arpaci-Dusseau, R.H.1
Anderson, E.2
Treuhaft, N.3
Culler, D.E.4
Hellerstein, J.M.5
Patterson, D.A.6
Yelick, K.7
-
33
-
-
0038633579
-
Dynamic Querying of Streaming Data with the dQUOB System
-
B. Plale and K. Schwan, "Dynamic Querying of Streaming Data with the dQUOB System," IEEE Trans. Parallel Distrib. Syst., vol. 14, no. 4, pp. 422-432, 2003.
-
(2003)
IEEE Trans. Parallel Distrib. Syst.
, vol.14
, Issue.4
, pp. 422-432
-
-
Plale, B.1
Schwan, K.2
-
34
-
-
70449633075
-
An integrated framework for performance-based optimization of scientific workflows
-
V. S. Kumar, P. Sadayappan, G. Mehta, K. Vahi, E. Deelman, V. Ratnakar, J. Kim, Y. Gil, M. W. Hall, T. M. Kurc, and J. H. Saltz, "An integrated framework for performance-based optimization of scientific workflows," in HPDC, 2009, pp. 177-186.
-
(2009)
HPDC
, pp. 177-186
-
-
Kumar, V.S.1
Sadayappan, P.2
Mehta, G.3
Vahi, K.4
Deelman, E.5
Ratnakar, V.6
Kim, J.7
Gil, Y.8
Hall, M.W.9
Kurc, T.M.10
Saltz, J.H.11
-
35
-
-
34548312273
-
An Efficient and Reliable Scientific Workflow System
-
vol. 0
-
T. Tavares, G. Teodoro, T. Kurc, R. Ferreira, D. Guedes, W. J. Meira, U. Catalyurek, S. Hastings, S. Oster, S. Langella, and J. Saltz, "An Efficient and Reliable Scientific Workflow System," IEEE International Symposium on Cluster Computing and the Grid, vol. 0, pp. 445-452, 2007.
-
(2007)
IEEE International Symposium on Cluster Computing and the Grid
, pp. 445-452
-
-
Tavares, T.1
Teodoro, G.2
Kurc, T.3
Ferreira, R.4
Guedes, D.5
Meira, W.J.6
Catalyurek, U.7
Hastings, S.8
Oster, S.9
Langella, S.10
Saltz, J.11
-
36
-
-
42149182947
-
A run-time system for efficient execution of scientific workflows on distributed environments
-
G. Teodoro, T. Tavares, R. Ferreira, T. Kurc, J. Meira, Wagner, D. Guedes, T. Pan, and J. Saltz, "A run-time system for efficient execution of scientific workflows on distributed environments," International Journal of Parallel Programming, vol. 36, pp. 250-266, 2008.
-
(2008)
International Journal of Parallel Programming
, vol.36
, pp. 250-266
-
-
Teodoro, G.1
Tavares, T.2
Ferreira, R.3
Kurc, T.4
Meira, J.5
Wagner6
Guedes, D.7
Pan, T.8
Saltz, J.9
-
37
-
-
78649984950
-
Dataspaces: An interaction and coordination framework for coupled simulation workflows
-
C. Docan, M. Parashar, and S. Klasky, "Dataspaces: an interaction and coordination framework for coupled simulation workflows," in HPDC, 2010, pp. 25-36.
-
(2010)
HPDC
, pp. 25-36
-
-
Docan, C.1
Parashar, M.2
Klasky, S.3
-
38
-
-
77955093126
-
Datastager: Scalable data staging services for petascale applications
-
H. Abbasi, M. Wolf, G. Eisenhauer, S. Klasky, K. Schwan, and F. Zheng, "Datastager: scalable data staging services for petascale applications," Cluster Computing, vol. 13, no. 3, pp. 277-290, 2010.
-
(2010)
Cluster Computing
, vol.13
, Issue.3
, pp. 277-290
-
-
Abbasi, H.1
Wolf, M.2
Eisenhauer, G.3
Klasky, S.4
Schwan, K.5
Zheng, F.6
-
39
-
-
57349114683
-
Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)
-
J. F. Lofstead, S. Klasky, K. Schwan, N. Podhorszki, and C. Jin, "Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)," in CLADE, 2008, pp. 15-24.
-
(2008)
CLADE
, pp. 15-24
-
-
Lofstead, J.F.1
Klasky, S.2
Schwan, K.3
Podhorszki, N.4
Jin, C.5
|