-
1
-
-
85013050645
-
Automatic calibration of performance models on heterogeneous multicore architectures
-
HPPC, Delft, The Netherlands, August
-
Cédric Augonnet, Samuel Thibault, and Raymond Namyst. Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures. In Proceedings of the Euro- Par Workshops, HPPC, Delft, The Netherlands, August 2009.
-
(2009)
Proceedings of the Euro-Par Workshops
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
-
2
-
-
79951756959
-
StarPU: A runtime system for scheduling tasks over accelerator-based multicore machines
-
March
-
Cédric Augonnet, Samuel Thibault, and Raymond Namyst. StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines. Technical Report 7240, INRIA, March 2010.
-
(2010)
Technical Report 7240, INRIA
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
-
3
-
-
84857478099
-
StarPU: A unified platform for task scheduling on heterogeneous multicore architectures
-
Accepted for publication
-
Cédric Augonnet, Samuel Thibault, Raymond Namyst, and Pierre-André Wacrenier. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. Concurrency and Computation: Practice and Experience, Euro- Par 2009 best papers issue, 2010. Accepted for publication.
-
(2010)
Concurrency and Computation: Practice and Experience, Euro- Par 2009 Best Papers Issue
-
-
Augonnet, C.1
Thibault, S.2
Namyst, R.3
Wacrenier, P.-A.4
-
4
-
-
70350635626
-
An extension of the StarSs programming model for platforms with multiple GPUs
-
Eduard Ayguadé, Rosa M. Badia, Francisco D. Igual, Jesús Labarta, Rafael Mayo, and Enrique S. Quintana-Ortí. An Extension of the StarSs Programming Model for Platforms with Multiple GPUs. In Euro-Par, pages 851-862, 2009.
-
(2009)
Euro-Par
, pp. 851-862
-
-
Ayguadé, E.1
Badia, R.M.2
Igual, F.D.3
Labarta, J.4
Mayo, R.5
Quintana-Ortí, E.S.6
-
6
-
-
34548207355
-
Sequoia: Programming the memory hierarchy
-
Kayvon Fatahalian, Timothy J. Knight, Mike Houston, Mattan Erez, Daniel Reiter Horn, Larkhoon Leem, Ji Young Park, Manman Ren, Alex Aiken, William J. Dally, and Pat Hanrahan. Sequoia: Programming the memory hierarchy. In Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 2006.
-
(2006)
Proceedings of the 2006 ACM/IEEE Conference on Supercomputing
-
-
Fatahalian, K.1
Knight, T.J.2
Houston, M.3
Erez, M.4
Horn, D.R.5
Leem, L.6
Park, J.Y.7
Ren, M.8
Aiken, A.9
Dally, W.J.10
Hanrahan, P.11
-
7
-
-
77952251540
-
An asymmetric distributed shared memory model for heterogeneous parallel systems
-
Pittsburgh, PA, USA, March
-
Isaac Gelado, Javier Cabezas, John E. Stone, Sanjay Patel, Nacho Navarro, and Wen-mei W. Hwu. An Asymmetric Distributed Shared Memory Model for Heterogeneous Parallel Systems. In ASPLOS'10, Pittsburgh, PA, USA, March 2010.
-
(2010)
ASPLOS'10
-
-
Gelado, I.1
Cabezas, J.2
Stone, J.E.3
Patel, S.4
Navarro, N.5
Hwu, W.-M.W.6
-
8
-
-
67651156160
-
Density functional theory calculation on many-cores hybrid central processing unit-graphic processing unit architectures
-
Luigi Genovese, Matthieu Ospici, Thierry Deutsch, Jean-Francois Méhaut, Alexey Neelov, and Stefan Goedecker. Density functional theory calculation on many-cores hybrid central processing unit-graphic processing unit architectures. J Chem Phys, 131(3):034103, 2009.
-
(2009)
J Chem Phys
, vol.131
, Issue.3
, pp. 034103
-
-
Genovese, L.1
Ospici, M.2
Deutsch, T.3
Méhaut, J.-F.4
Neelov, A.5
Goedecker, S.6
-
9
-
-
72049106942
-
GPU clusters for high-performance computing
-
Volodymyr V. Kindratenko, Jeremy Enos, Guochun Shi, Michael T. Showerman, Galen W. Arnold, John E. Stone, James C. Phillips, and Wen mei W. Hwu. GPU clusters for high-performance computing. In CLUSTER, pages 1-8, 2009.
-
(2009)
CLUSTER
, pp. 1-8
-
-
Kindratenko, V.V.1
Enos, J.2
Shi, G.3
Showerman, M.T.4
Arnold, G.W.5
Stone, J.E.6
Phillips, J.C.7
Hwu, W.M.W.8
-
10
-
-
72049099859
-
Message passing for GPGPU clusters: CudaMPI
-
Orion Sky Lawlor. Message Passing for GPGPU Clusters: cudaMPI. In IEEE Cluster PPAC Workshop, 2009.
-
(2009)
IEEE Cluster PPAC Workshop
-
-
Lawlor, O.S.1
-
11
-
-
63549088652
-
COMIC: A coherent shared memory interface for Cell BE
-
New York, NY, USA, ACM
-
Jaejin Lee, Sangmin Seo, Chihun Kim, Junghyun Kim, Posung Chun, Zehra Sura, Jungwon Kim, and SangYong Han. COMIC: a coherent shared memory interface for Cell BE. In PACT'08: Proceedings of the 17th international conference on Parallel architectures and compilation techniques, pages 303-314, New York, NY, USA, 2008. ACM.
-
(2008)
PACT'08: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 303-314
-
-
Lee, J.1
Seo, S.2
Kim, C.3
Kim, J.4
Chun, P.5
Sura, Z.6
Kim, J.7
Han, S.8
-
12
-
-
77954042160
-
Scalable high performant cholesky factorization for multicore with GPU accelerators
-
November
-
Hatem Ltaief, Stanimire Tomov, Rajib Nath, Peng Du, and Jack Dongarra. Scalable High Performant Cholesky Factorization for Multicore with GPU Accelerators. Technical Report 223, LAPACK Working Note, November 2009.
-
(2009)
Technical Report 223, LAPACK Working Note
-
-
Ltaief, H.1
Tomov, S.2
Nath, R.3
Du, P.4
Dongarra, J.5
-
13
-
-
77954725202
-
Overlapping communication and computation by using a hybrid MPI/SMPSs approach
-
New York, NY, USA, ACM
-
Vladimir Marjanović, Jesús Labarta, Eduard Ayguadé, and Mateo Valero. Overlapping communication and computation by using a hybrid MPI/SMPSs approach. In ICS'10: Proceedings of the 24th ACM International Conference on Supercomputing, pages 5-16, New York, NY, USA, 2010. ACM.
-
(2010)
ICS'10: Proceedings of the 24th ACM International Conference on Supercomputing
, pp. 5-16
-
-
Marjanović, V.1
Labarta, J.2
Ayguadé, E.3
Valero, M.4
-
14
-
-
33646596525
-
MPI Microtask for programming the cell broadband engine processor
-
M. Ohara, H. Inoue, Y. Sohda, H. Komatsu, and T. Nakatani. MPI Microtask for programming the Cell Broadband Engine Processor. IBM Syst. J., 45(1), 2006.
-
(2006)
IBM Syst. J.
, vol.45
, Issue.1
-
-
Ohara, M.1
Inoue, H.2
Sohda, Y.3
Komatsu, H.4
Nakatani, T.5
-
15
-
-
0036504666
-
Performance-effective and low-complexity task scheduling for heterogeneous computing
-
DOI 10.1109/71.993206
-
H. Topcuoglu, S. Hariri, and Min-You Wu. Performanceeffective and low-complexity task scheduling for heterogeneous computing. Parallel and Distributed Systems, IEEE Transactions on, 13(3):260-274, Mar 2002. (Pubitemid 34448780)
-
(2002)
IEEE Transactions on Parallel and Distributed Systems
, vol.13
, Issue.3
, pp. 260-274
-
-
Topcuoglu, H.1
Hariri, S.2
Wu, M.-Y.3
|