-
1
-
-
64549111741
-
Self- configuring applications for heterogeneous systems: Program composition and optimization using cognitive techniques
-
M.W. Hall, Y. Gil, and R.F. Lucas, "Self- Configuring Applications for Heterogeneous Systems: Program Composition and Optimization Using Cognitive Techniques," Proc. IEEE, vol. 96, no. 5, 2008, pp. 849-862.
-
(2008)
Proc. IEEE
, vol.96
, Issue.5
, pp. 849-862
-
-
Hall, M.W.1
Gil, Y.2
Lucas, R.F.3
-
2
-
-
80054854472
-
Explicit platform descriptions for heterogeneous many-core architectures
-
IEEE Press
-
M. Sandrieser, S. Benkner, and S. Pllana, "Explicit Platform Descriptions for Heterogeneous Many-Core Architectures," Proc. 16th Int'l Workshop High-Level Parallel Programming Models and Supportive Environments, Int'l Parallel and Distributed Processing Symp., IEEE Press, 2011, p. 42.
-
(2011)
Proc. 16th Int'l Workshop High-Level Parallel Programming Models and Supportive Environments, Int'l Parallel and Distributed Processing Symp.
, pp. 42
-
-
Sandrieser, M.1
Benkner, S.2
Pllana, S.3
-
3
-
-
84864712980
-
A framework for performance-aware composition ofexplicitly parallel components
-
IOS Press
-
C.W. Kessler and W. Löwe, "A Framework for Performance-Aware Composition ofExplicitly Parallel Components," Parallel Computing: Architectures, Algorithms, and Applications, IOS Press, 2007, pp. 227-234.
-
(2007)
Parallel Computing: Architectures, Algorithms, and Applications
, pp. 227-234
-
-
Kessler, C.W.1
Löwe, W.2
-
4
-
-
79959546345
-
Auto-tuning skePU: A multi-backend skeleton programming framework for multi- GPU systems
-
ACM Press
-
U. Dastgeer, J. Enmyren, and C. Kessler, "Auto-Tuning SkePU: A Multi-backend Skeleton Programming Framework for Multi- GPU Systems," Proc. 4th Int'l Workshop Multicore Software Eng., ACM Press, 2011, pp. 25-32.
-
(2011)
Proc. 4th Int'l Workshop Multicore Software Eng.
, pp. 25-32
-
-
Dastgeer, U.1
Enmyren, J.2
Kessler, C.3
-
5
-
-
78651103346
-
StarPU: A unified platform for task scheduling on heterogeneous multicore architectures
-
C. Augonnet et al., "StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures," Concurrency and Computation: Practice and Experience, vol. 23, no. 2, 2011, pp. 187-198.
-
(2011)
Concurrency and Computation: Practice and Experience
, vol.23
, Issue.2
, pp. 187-198
-
-
Augonnet, C.1
-
6
-
-
0036504666
-
Performance-effective and low-complexity task scheduling for heterogeneous computing
-
DOI 10.1109/71.993206
-
H. Topcuoglu, S. Hariri, and M.-Y. Wu, "Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing," IEEE Trans. Parallel and Distributed Systems, vol. 13, no. 3, 2002, pp. 260-274. (Pubitemid 34448780)
-
(2002)
IEEE Transactions on Parallel and Distributed Systems
, vol.13
, Issue.3
, pp. 260-274
-
-
Topcuoglu, H.1
Hariri, S.2
Wu, M.-Y.3
-
8
-
-
77953973785
-
GPU sample sort
-
IEEE CS Press, doi:10.1109/IPDPS. 2010.5470444
-
N. Leischner, V. Osipov, and P. Sanders, "GPU Sample Sort," Proc. 24th IEEE Int'l Parallel and Distributed Processing Symp., IEEE CS Press, 2010, doi:10.1109/IPDPS. 2010.5470444.
-
(2010)
Proc. 24th IEEE Int'l Parallel and Distributed Processing Symp.
-
-
Leischner, N.1
Osipov, V.2
Sanders, P.3
-
9
-
-
38049143862
-
MCSTL: The multi-core standard template library
-
Springer
-
J. Singler, P. Sanders, and F. Putze, "MCSTL: The Multi-core Standard Template Library," Proc. 13th Int'l Euro-Par Conf. Parallel Processing, LNCS 4641, Springer, 2007, pp. 682-694.
-
(2007)
Proc. 13th Int'l Euro-Par Conf. Parallel Processing, LNCS 4641
, pp. 682-694
-
-
Singler, J.1
Sanders, P.2
Putze, F.3
-
10
-
-
78650913187
-
Cache-aware lock-free queues for multiple producers/consumers and weak memory consistency
-
Springer
-
A. Gidenstam, H. Sundell, and P. Tsigas, "Cache-Aware Lock-Free Queues for Multiple Producers/Consumers and Weak Memory Consistency," Proc. 14th Int'l Conf. Principles of Distributed Systems, LNCS 6490, Springer, 2010, pp. 302-317.
-
(2010)
Proc. 14th Int'l Conf. Principles of Distributed Systems, LNCS 6490
, pp. 302-317
-
-
Gidenstam, A.1
Sundell, H.2
Tsigas, P.3
-
12
-
-
72749088260
-
NBFEB: A universal scalable easy-to-use synchronization primitive for many-core architectures
-
Springer
-
P.H. Ha, P. Tsigas, and O.J. Anshus, "NBFEB: A Universal Scalable Easy-to-Use Synchronization Primitive for Many-Core Architectures," Proc. 13th Int'l Conf. Principles of Distributed Systems, LNCS 5923, Springer, 2009, pp. 189-203.
-
(2009)
Proc. 13th Int'L Conf. Principles of Distributed Systems, LNCS 5923
, pp. 189-203
-
-
Ha, P.H.1
Tsigas, P.2
Anshus, O.J.3
-
13
-
-
80053251324
-
QR factorization on a multicore node enhanced with multiple GPU accelerators
-
IEEE Press
-
E. Agullo et al., "QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators," Proc. 25th IEEE Int'l Parallel and Distributed Processing Symp., IEEE Press, 2011, p. 32.
-
(2011)
Proc. 25th IEEE Int'l Parallel and Distributed Processing Symp.
, pp. 32
-
-
Agullo, E.1
-
14
-
-
58149269099
-
A class of parallel tiled linear algebra algorithms for multicore architectures
-
A. Buttari et al., "A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures," Parallel Computing, vol. 35, no. 1, 2009, pp. 38-53.
-
(2009)
Parallel Computing
, vol.35
, Issue.1
, pp. 38-53
-
-
Buttari, A.1
|