SCOPUS Information Retrieval Platform

Concurrency and Computation: Practice and Experience

Volume 24, Issue 5, 2012, Pages 533-553

An object-oriented bulk synchronous parallel library for multicore programming

Author keywords

BSP; bulk synchronous parallel; dense LU decomposition; fast Fourier transform; multicore; parallel computing; shared memory; sparse matrix vector multiplication

Indexed keywords

FAST FOURIER TRANSFORMS; MATRIX ALGEBRA; MEMORY ARCHITECTURE; MULTICORE PROGRAMMING; PARALLEL PROCESSING SYSTEMS; SOFTWARE ARCHITECTURE;

BULK SYNCHRONOUS PARALLEL; LU DECOMPOSITION; MULTI CORE; SHARED MEMORY; SPARSE MATRIX-VECTOR MULTIPLICATION;

OBJECT ORIENTED PROGRAMMING;

EID: 84858077252 PISSN: 15320626 EISSN: 15320634 Source Type: Journal
DOI: 10.1002/cpe.1843 Document Type: Conference Paper

Times cited : (18)

References (29)

1
- 57849138004
- A bridging model for multi-core computing
- Springer: Berlin
- Valiant LG. A bridging model for multi-core computing. In Algorithms-ESA 2008, Lecture Notes in Computer Science, Vol. 5193. Springer: Berlin, 2008; 13-28.
- (2008) Algorithms - ESA 2008, Lecture Notes in Computer Science , vol.5193 , pp. 13-28
- Valiant, L.G.¹

2
- 0025467711
- A bridging model for parallel computation
- Valiant LG. A bridging model for parallel computation. Communications of the ACM 1990; 33 (8): 103-111.
- (1990) Communications of the ACM , vol.33 , Issue.8 , pp. 103-111
- Valiant, L.G.¹

3
- 0032297719
- BSPlib: The BSP programming library
- PII S0167819198000933
- Hill JMD, McColl B, et al. BSPlib: the BSP programming library. Parallel Computing 1998; 24 (14): 1947-1980. (Pubitemid 128424040)
- (1998) Parallel Computing , vol.24 , Issue.14 , pp. 1947-1980
- Hill, J.M.D.¹ McColl, B.² Stefanescu, D.C.³ Goudreau, M.W.⁴ Lang, K.⁵ Rao, S.B.⁶ Suel, T.⁷ Tsantilas, T.⁸ Bisseling, R.H.⁹

4
- 0037303080
- The Paderborn University BSP (PUB) library
- DOI: 10.1016/S0167-8191(02)00218-1
- Bonorden O, Juurlink B, et al. The Paderborn University BSP (PUB) library. Parallel Computing 2003; 29 (2): 187-207. DOI: 10.1016/S0167-8191(02)00218-1.
- (2003) Parallel Computing , vol.29 , Issue.2 , pp. 187-207
- Bonorden, O.¹ Juurlink, B.²

5
- 4744357005
- Oxford University Press: Oxford, UK
- Bisseling RH. Parallel Scientific Computation: A Structured Approach using BSP and MPI. Oxford University Press: Oxford, UK, 2004.
- (2004) Parallel Scientific Computation: A Structured Approach Using BSP and MPI
- Bisseling, R.H.¹

6
- 0002806690
- OpenMP: An industry standard API for shared-memory programming
- Dagum L, Menon R. OpenMP: an industry standard API for shared-memory programming. Computational Science and Engineering 1998; 5 (1): 46-55.
- (1998) Computational Science and Engineering , vol.5 , Issue.1 , pp. 46-55
- Dagum, L.¹ Menon, R.²

7
- 54949115201
- Scientific and Engineering Computation Series, The MIT Press: Cambridge, MA
- Chapman B, Jost G, et al. Using OpenMP: Portable Shared Memory Parallel Programming, Scientific and Engineering Computation Series, The MIT Press: Cambridge, MA, 2007.
- (2007) Using OpenMP: Portable Shared Memory Parallel Programming
- Chapman, B.¹ Jost, G.²

8
- 84858075823
- An object-oriented programming model for BSP computations
- Lecomber D. An object-oriented programming model for BSP computations. Proceedings of the PPECC Workshop on Parallel and Distributed Computing, 1994.
- (1994) Proceedings of the PPECC Workshop on Parallel and Distributed Computing
- Lecomber, D.¹

9
- 0030496241
- Scalable parallel computational geometry for coarse grained multicomputers
- Dehne F, Fabri A, et al. Scalable parallel computational geometry for coarse grained multicomputers. International Journal on Computational Geometry and Applications 1996; 6 (3): 379-400.
- (1996) International Journal on Computational Geometry and Applications , vol.6 , Issue.3 , pp. 379-400
- Dehne, F.¹ Fabri, A.²

10
- 14744296887
- CGMGRAPH/CGMLIB: Implementing and testing CGM graph algorithms on PC clusters and shared memory machines
- DOI 10.1177/1094342005051196
- Chan A, Dehne F. CGMgraph/CGMlib: implementing and testing CGM graph algorithms on PC clusters and shared memory machines. International Journal of High Performance Computing Applications 2005; 19: 81-97. (Pubitemid 40329108)
- (2005) International Journal of High Performance Computing Applications , vol.19 , Issue.1 , pp. 81-97
- Chan, A.¹ Dehne, F.² Taylor, R.³

11
- 0032155556
- Titanium: A high-performance Java dialect
- Yelick K, Semenzato L, et al. Titanium: a high-performance Java dialect. Concurrency: Practice and Experience 1998; 10 (11-13): 825-836. DOI: 10.1002/(SICI)1096-9128(199809/11)10:11/13<825::AID-CPE383>3.0.CO;2-H. (Pubitemid 128445433)
- (1998) Concurrency Practice and Experience , vol.10 , Issue.11-13 , pp. 825-836
- Yelick, K.¹ Semenzato, L.² Pike, G.³ Miyamoto, C.⁴ Liblit, B.⁵ Krishnamurthy, A.⁶ Hilfinger, P.⁷ Graham, S.⁸ Gay, D.⁹ Colella, P.¹⁰ Aiken, A.¹¹

12
- 34548717526
- Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java
- IEEE Press: Long Beach, CA, USA
- Kaminsky A. Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java. In International Parallel and Distributed Processing Symposium, IEEE Press: Long Beach, CA, USA, 2007; 1-8.
- (2007) International Parallel and Distributed Processing Symposium , pp. 1-8
- Kaminsky, A.¹

13
- 0347528600
- High-level parallel software development with Python and BSP
- Hinsen K. High-level parallel software development with Python and BSP. Parallel Processing Letters 2003; 13 (3): 473-484.
- (2003) Parallel Processing Letters , vol.13 , Issue.3 , pp. 473-484
- Hinsen, K.¹

14
- 0346098076
- The bulk-synchronous parallel random access machine
- PII S0304397597001977
- Tiskin A. The bulk-synchronous parallel random access machine. Theoretical Computer Science 1998; 196 (1-2): 109-130. DOI: 10.1016/S0304-3975(97)00197-7. (Pubitemid 128458405)
- (1998) Theoretical Computer Science , vol.196 , Issue.1-2 , pp. 109-130
- Tiskin, A.¹

15
- 49249137934
- BSGP: Bulk-synchronous GPU programming
- August
- Hou Q, Zhou K, et al. BSGP: bulk-synchronous GPU programming. ACM Transactions on Graphics August 2008; 27 (3): 19.1-19.12.
- (2008) ACM Transactions on Graphics , vol.27 , Issue.3 , pp. 19.1-19.12
- Hou, Q.¹ Zhou, K.²

16
- 17444414573
- A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
- DOI 10.1137/S0036144502409019
- Vastenhouw B, Bisseling RH. A two-dimensional data distribution method for parallel sparse matrix-vector multiplication. SIAM Review 2005; 47 (1): 67-95. (Pubitemid 40535972)
- (2005) SIAM Review , vol.47 , Issue.1 , pp. 67-95
- Vastenhouw, B.¹ Bisseling, R.H.²

17
- 33847119013
- Parallel hypergraph partitioning for scientific computing
- IEEE Press: Long Beach, CA, USA
- Devine KD, Boman EG, et al. Parallel hypergraph partitioning for scientific computing. In International Parallel and Distributed Processing Symposium, IEEE Press: Long Beach, CA, USA, 2006; 102.
- (2006) International Parallel and Distributed Processing Symposium , pp. 102
- Devine, K.D.¹ Boman, E.G.²

18
- 77954707501
- Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
- Yzelman AN, Bisseling RH. Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods. SIAM Journal on Scientific Computing 2009; 31 (4): 3128-3154.
- (2009) SIAM Journal on Scientific Computing , vol.31 , Issue.4 , pp. 3128-3154
- Yzelman, A.N.¹ Bisseling, R.H.²

19
- 0031269220
- Improving the memory-system performance of sparse-matrix vector multiplication
- Toledo S. Improving the memory-system performance of sparse-matrix vector multiplication. IBM Journal of Research and Development 1997; 41 (6): 711-725. (Pubitemid 127557044)
- (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
- Toledo, S.¹

20
- 60949098907
- Optimization of sparse matrix-vector multiplication on emerging multicore platforms
- DOI: 10.1016/j.parco.2008.12.006
- Williams S, Oliker L, et al. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 2009; 35 (3): 178-194. DOI: 10.1016/j.parco.2008.12.006.
- (2009) Parallel Computing , vol.35 , Issue.3 , pp. 178-194
- Williams, S.¹ Oliker, L.²

21
- 84858070367
- Preprint
- Yzelman AN, Bisseling RH. Two-dimensional cache-oblivious sparse matrix-vector multiplication, 2011. Preprint.
- (2011) Two-dimensional Cache-oblivious Sparse Matrix-vector Multiplication
- Yzelman, A.N.¹ Bisseling, R.H.²

22
- 0031636309
- FFTW: An adaptive software architecture for the FFT
- IEEE Press: Los Alamitos, CA
- Frigo M, Johnson SG. FFTW: An adaptive software architecture for the FFT. In Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 3, IEEE Press: Los Alamitos, CA, 1998; 1381-1384.
- (1998) Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.3 , pp. 1381-1384
- Frigo, M.¹ Johnson, S.G.²

23
- 0004236492
- 3rd ed., Johns Hopkins Studies in the Mathematical Sciences, The Johns Hopkins University Press: Baltimore, MD
- Golub GH, Van Loan CF. Matrix Computations, 3rd ed., Johns Hopkins Studies in the Mathematical Sciences, The Johns Hopkins University Press: Baltimore, MD, 1996.
- (1996) Matrix Computations
- Golub, G.H.¹ Van Loan, C.F.²

24
- 84857494597
- [February 2011]
- Goto K, Milfeld K. GotoBLAS2. [February 2011].
- GotoBLAS2
- Goto, K.¹ Milfeld, K.²

25
- 0343462141
- Automated empirical optimizations of software and the ATLAS project
- DOI 10.1016/S0167-8191(00)00087-9
- Whaley RC, Petitet A, et al. Automated empirical optimizations of software and the ATLAS project. Parallel Computing 2001; 27 (1-2): 3-35. (Pubitemid 32264775)
- (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
- Clint Whaley, R.¹ Petitet, A.² Dongarra, J.J.³

26
- 24344485098
- OSKI: A library of automatically tuned sparse matrix kernels
- DOI 10.1088/1742-6596/16/1/071
- Vuduc R, Demmel JW, et al. OSKI: a library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series 2005; 16: 521-530. (Pubitemid 41259393)
- (2005) Journal of Physics: Conference Series , vol.16 , Issue.1 , pp. 521-530
- Vuduc, R.¹ Demmel, J.W.² Yelick, K.A.³

27
- 84930675361
- A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve
- Springer: Berlin, in press
- Yzelman AN, Bisseling RH. A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve. In Progress in Industrial Mathematics at ECMI 2010, Springer: Berlin, 2011; in press.
- (2011) Progress in Industrial Mathematics at ECMI 2010
- Yzelman, A.N.¹ Bisseling, R.H.²

28
- 70449629588
- Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks
- ACM: New York, NY, USA
- Buluç A, Fineman JT, et al. Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks. In SPAA '09: Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures, ACM: New York, NY, USA, 2009; 233-244.
- (2009) SPAA '09: Proceedings of the Twenty-first Annual Symposium on Parallelism in Algorithms and Architectures , pp. 233-244
- Buluç, A.¹ Fineman, J.T.²

29
- 79551511651
- Utilizing recursive storage in sparse matrix-vector multiplication - Preliminary considerations
- Philip T. (ed.). ISCA: Hawaii, USA
- Martone M, Filippone S, et al. Utilizing recursive storage in sparse matrix-vector multiplication-preliminary considerations. In Proceedings of the ISCA 25th International Conference on Computers and Their Applications (CATA), Philip T (ed.). ISCA: Hawaii, USA, 2010; 300-305.
- (2010) Proceedings of the ISCA 25th International Conference on Computers and Their Applications (CATA) , pp. 300-305
- Martone, M.¹ Filippone, S.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.