-
2
-
-
0025467711
-
A bridging model for parallel computation
-
Valiant LG. A bridging model for parallel computation. Communications of the ACM 1990; 33 (8): 103-111.
-
(1990)
Communications of the ACM
, vol.33
, Issue.8
, pp. 103-111
-
-
Valiant, L.G.1
-
3
-
-
0032297719
-
BSPlib: The BSP programming library
-
PII S0167819198000933
-
Hill JMD, McColl B, et al. BSPlib: the BSP programming library. Parallel Computing 1998; 24 (14): 1947-1980. (Pubitemid 128424040)
-
(1998)
Parallel Computing
, vol.24
, Issue.14
, pp. 1947-1980
-
-
Hill, J.M.D.1
McColl, B.2
Stefanescu, D.C.3
Goudreau, M.W.4
Lang, K.5
Rao, S.B.6
Suel, T.7
Tsantilas, T.8
Bisseling, R.H.9
-
4
-
-
0037303080
-
The Paderborn University BSP (PUB) library
-
DOI: 10.1016/S0167-8191(02)00218-1
-
Bonorden O, Juurlink B, et al. The Paderborn University BSP (PUB) library. Parallel Computing 2003; 29 (2): 187-207. DOI: 10.1016/S0167-8191(02)00218-1.
-
(2003)
Parallel Computing
, vol.29
, Issue.2
, pp. 187-207
-
-
Bonorden, O.1
Juurlink, B.2
-
6
-
-
0002806690
-
OpenMP: An industry standard API for shared-memory programming
-
Dagum L, Menon R. OpenMP: an industry standard API for shared-memory programming. Computational Science and Engineering 1998; 5 (1): 46-55.
-
(1998)
Computational Science and Engineering
, vol.5
, Issue.1
, pp. 46-55
-
-
Dagum, L.1
Menon, R.2
-
7
-
-
54949115201
-
-
Scientific and Engineering Computation Series, The MIT Press: Cambridge, MA
-
Chapman B, Jost G, et al. Using OpenMP: Portable Shared Memory Parallel Programming, Scientific and Engineering Computation Series, The MIT Press: Cambridge, MA, 2007.
-
(2007)
Using OpenMP: Portable Shared Memory Parallel Programming
-
-
Chapman, B.1
Jost, G.2
-
10
-
-
14744296887
-
CGMGRAPH/CGMLIB: Implementing and testing CGM graph algorithms on PC clusters and shared memory machines
-
DOI 10.1177/1094342005051196
-
Chan A, Dehne F, et al. CGMgraph/CGMlib: implementing and testing CGM graph algorithms on PC clusters and shared memory machines. International Journal of High Performance Computing Applications 2005; 19 (1): 81-97. (Pubitemid 40329108)
-
(2005)
International Journal of High Performance Computing Applications
, vol.19
, Issue.1
, pp. 81-97
-
-
Chan, A.1
Dehne, F.2
Taylor, R.3
-
11
-
-
0032155556
-
Titanium: A high-performance Java dialect
-
Yelick K, Semenzato L, et al. Titanium: a high-performance Java dialect. Concurrency: Practice and Experience 1998; 10 (11-13): 825-836. DOI: 10.1002/(SICI)1096-9128(199809/11)10:11/13<825::AID-CPE383>3.0.CO;2-H. (Pubitemid 128445433)
-
(1998)
Concurrency: Practice and Experience
, vol.10
, Issue.11-13
, pp. 825-836
-
-
Yelick, K.1
Semenzato, L.2
Pike, G.3
Miyamoto, C.4
Liblit, B.5
Krishnamurthy, A.6
Hilfinger, P.7
Graham, S.8
Gay, D.9
Colella, P.10
Aiken, A.11
-
12
-
-
34548717526
-
Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java
-
IEEE Press: Long Beach, CA, USA
-
Kaminsky A. Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java. In International Parallel and Distributed Processing Symposium, IEEE Press: Long Beach, CA, USA, 2007; 1-8.
-
(2007)
International Parallel and Distributed Processing Symposium
, pp. 1-8
-
-
Kaminsky, A.1
-
13
-
-
0347528600
-
High-level parallel software development with Python and BSP
-
Hinsen K. High-level parallel software development with Python and BSP. Parallel Processing Letters 2003; 13 (3): 473-484.
-
(2003)
Parallel Processing Letters
, vol.13
, Issue.3
, pp. 473-484
-
-
Hinsen, K.1
-
14
-
-
0346098076
-
The bulk-synchronous parallel random access machine
-
PII S0304397597001977
-
Tiskin A. The bulk-synchronous parallel random access machine. Theoretical Computer Science 1998; 196 (1-2): 109-130. DOI: 10.1016/S0304-3975(97)00197-7. (Pubitemid 128458405)
-
(1998)
Theoretical Computer Science
, vol.196
, Issue.1-2
, pp. 109-130
-
-
Tiskin, A.1
-
15
-
-
49249137934
-
BSGP: Bulk-synchronous GPU programming
-
August
-
Hou Q, Zhou K, et al. BSGP: bulk-synchronous GPU programming. ACM Transactions on Graphics August 2008; 27 (3): 19.1-19.12.
-
(2008)
ACM Transactions on Graphics
, vol.27
, Issue.3
, pp. 19.1-19.12
-
-
Hou, Q.1
Zhou, K.2
-
16
-
-
17444414573
-
A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
-
DOI 10.1137/S0036144502409019
-
Vastenhouw B, Bisseling RH. A two-dimensional data distribution method for parallel sparse matrix-vector multiplication. SIAM Review 2005; 47 (1): 67-95. (Pubitemid 40535972)
-
(2005)
SIAM Review
, vol.47
, Issue.1
, pp. 67-95
-
-
Vastenhouw, B.1
Bisseling, R.H.2
-
17
-
-
33847119013
-
Parallel hypergraph partitioning for scientific computing
-
IEEE Press: Long Beach, CA, USA
-
Devine KD, Boman EG, et al. Parallel hypergraph partitioning for scientific computing. In International Parallel and Distributed Processing Symposium, IEEE Press: Long Beach, CA, USA, 2006; 102.
-
(2006)
International Parallel and Distributed Processing Symposium
, pp. 102
-
-
Devine, K.D.1
Boman, E.G.2
-
18
-
-
77954707501
-
Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
-
Yzelman AN, Bisseling RH. Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods. SIAM Journal on Scientific Computing 2009; 31 (4): 3128-3154.
-
(2009)
SIAM Journal on Scientific Computing
, vol.31
, Issue.4
, pp. 3128-3154
-
-
Yzelman, A.N.1
Bisseling, R.H.2
-
19
-
-
0031269220
-
Improving the memory-system performance of sparse-matrix vector multiplication
-
Toledo S. Improving the memory-system performance of sparse-matrix vector multiplication. IBM Journal of Research and Development 1997; 41 (6): 711-725. (Pubitemid 127557044)
-
(1997)
IBM Journal of Research and Development
, vol.41
, Issue.6
, pp. 711-725
-
-
Toledo, S.1
-
20
-
-
60949098907
-
Optimization of sparse matrix-vector multiplication on emerging multicore platforms
-
DOI: 10.1016/j.parco.2008.12.006
-
Williams S, Oliker L, et al. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 2009; 35 (3): 178-194. DOI: 10.1016/j.parco.2008.12.006.
-
(2009)
Parallel Computing
, vol.35
, Issue.3
, pp. 178-194
-
-
Williams, S.1
Oliker, L.2
-
22
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
IEEE Press: Los Alamitos, CA
-
Frigo M, Johnson SG. FFTW: An adaptive software architecture for the FFT. In Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 3, IEEE Press: Los Alamitos, CA, 1998; 1381-1384.
-
(1998)
Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing
, vol.3
, pp. 1381-1384
-
-
Frigo, M.1
Johnson, S.G.2
-
23
-
-
0004236492
-
-
3rd ed., Johns Hopkins Studies in the Mathematical Sciences, The Johns Hopkins University Press: Baltimore, MD
-
Golub GH, Van Loan CF. Matrix Computations, 3rd ed., Johns Hopkins Studies in the Mathematical Sciences, The Johns Hopkins University Press: Baltimore, MD, 1996.
-
(1996)
Matrix Computations
-
-
Golub, G.H.1
Van Loan, C.F.2
-
25
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
DOI 10.1016/S0167-8191(00)00087-9
-
Whaley RC, Petitet A, et al. Automated empirical optimizations of software and the ATLAS project. Parallel Computing 2001; 27 (1-2): 3-35. (Pubitemid 32264775)
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Clint Whaley, R.1
Petitet, A.2
Dongarra, J.J.3
-
26
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
DOI 10.1088/1742-6596/16/1/071
-
Vuduc R, Demmel JW, et al. OSKI: a library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series 2005; 16 (1): 521-530. (Pubitemid 41259393)
-
(2005)
Journal of Physics: Conference Series
, vol.16
, Issue.1
, pp. 521-530
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
-
27
-
-
84930675361
-
A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve
-
Springer: Berlin, in press
-
Yzelman AN, Bisseling RH. A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve. In Progress in Industrial Mathematics at ECMI 2010, Springer: Berlin, 2011; in press.
-
(2011)
Progress in Industrial Mathematics at ECMI 2010
-
-
Yzelman, A.N.1
Bisseling, R.H.2
-
28
-
-
70449629588
-
Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
-
ACM: New York, NY, USA
-
Buluç A, Fineman JT, et al. Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks. In SPAA '09: Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures, ACM: New York, NY, USA, 2009; 233-244.
-
(2009)
SPAA '09: Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures
, pp. 233-244
-
-
Buluç, A.1
Fineman, J.T.2
-
29
-
-
79551511651
-
Utilizing recursive storage in sparse matrix-vector multiplication - Preliminary considerations
-
Philip T. (ed.). ISCA: Hawaii, USA
-
Martone M, Filippone S, et al. Utilizing recursive storage in sparse matrix-vector multiplication - preliminary considerations. In Proceedings of the ISCA 25th International Conference on Computers and Their Applications (CATA), Philip T (ed.). ISCA: Hawaii, USA, 2010; 300-305.
-
(2010)
Proceedings of the ISCA 25th International Conference on Computers and Their Applications (CATA)
, pp. 300-305
-
-
Martone, M.1
Filippone, S.2