메뉴 건너뛰기




Volumn 24, Issue 5, 2012, Pages 533-553

An object-oriented bulk synchronous parallel library for multicore programming

Author keywords

BSP; bulk synchronous parallel; dense LU decomposition; fast Fourier transform; multicore; parallel computing; shared memory; sparse matrix vector multiplication

Indexed keywords

FAST FOURIER TRANSFORMS; MATRIX ALGEBRA; MEMORY ARCHITECTURE; MULTICORE PROGRAMMING; PARALLEL PROCESSING SYSTEMS; SOFTWARE ARCHITECTURE;

EID: 84858077252     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1843     Document Type: Conference Paper
Times cited : (18)

References (29)
  • 2
    • 0025467711 scopus 로고
    • A bridging model for parallel computation
    • Valiant LG,. A bridging model for parallel computation. Communications of the ACM 1990; 33 (8): 103-111.
    • (1990) Communications of the ACM , vol.33 , Issue.8 , pp. 103-111
    • Valiant, L.G.1
  • 4
    • 0037303080 scopus 로고    scopus 로고
    • The Paderborn University BSP (PUB) library
    • DOI: 10.1016/S0167-8191(02)00218-1
    • Bonorden O, Juurlink B, et al,. The Paderborn University BSP (PUB) library. Parallel Computing 2003; 29 (2): 187-207. DOI: 10.1016/S0167-8191(02) 00218-1.
    • (2003) Parallel Computing , vol.29 , Issue.2 , pp. 187-207
    • Bonorden, O.1    Juurlink, B.2
  • 6
    • 0002806690 scopus 로고    scopus 로고
    • OpenMP: An industry standard API for shared-memory programming
    • Dagum L, Menon R,. OpenMP: an industry standard API for shared-memory programming. Computational Science and Engineering 1998; 5 (1): 46-55.
    • (1998) Computational Science and Engineering , vol.5 , Issue.1 , pp. 46-55
    • Dagum, L.1    Menon, R.2
  • 10
    • 14744296887 scopus 로고    scopus 로고
    • CGMGRAPH/CGMLIB: Implementing and testing CGM graph algorithms on PC clusters and shared memory machines
    • DOI 10.1177/1094342005051196
    • Chan A, Dehne F,. CGMgraph/CGMlib: implementing and testing CGM graph algorithms on PC clusters and shared memory machines. International Journal of High Performance Computing Applications 2005; 19: 81-97. (Pubitemid 40329108)
    • (2005) International Journal of High Performance Computing Applications , vol.19 , Issue.1 , pp. 81-97
    • Chan, A.1    Dehne, F.2    Taylor, R.3
  • 12
    • 34548717526 scopus 로고    scopus 로고
    • Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java
    • IEEE Press: Long Beach, CA, USA
    • Kaminsky A,. Parallel Java: A unified API for shared memory and cluster parallel programming in 100% Java. In International Parallel and Distributed Processing Symposium, IEEE Press: Long Beach, CA, USA, 2007; 1-8.
    • (2007) International Parallel and Distributed Processing Symposium , pp. 1-8
    • Kaminsky, A.1
  • 13
    • 0347528600 scopus 로고    scopus 로고
    • High-level parallel software development with Python and BSP
    • Hinsen K,. High-level parallel software development with Python and BSP. Parallel Processing Letters 2003; 13 (3): 473-484.
    • (2003) Parallel Processing Letters , vol.13 , Issue.3 , pp. 473-484
    • Hinsen, K.1
  • 14
    • 0346098076 scopus 로고    scopus 로고
    • The bulk-synchronous parallel random access machine
    • PII S0304397597001977
    • Tiskin A,. The bulk-synchronous parallel random access machine. Theoretical Computer Science 1998; 196 (1-2): 109-130. DOI: 10.1016/S0304- 3975(97)00197-7. (Pubitemid 128458405)
    • (1998) Theoretical Computer Science , vol.196 , Issue.1-2 , pp. 109-130
    • Tiskin, A.1
  • 15
    • 49249137934 scopus 로고    scopus 로고
    • BSGP: Bulk-synchronous GPU programming
    • August
    • Hou Q, Zhou K, et al,. BSGP: bulk-synchronous GPU programming. ACM Transactions on Graphics August 2008; 27 (3): 19.1-19.12.
    • (2008) ACM Transactions on Graphics , vol.27 , Issue.3 , pp. 191-1912
    • Hou, Q.1    Zhou, K.2
  • 16
    • 17444414573 scopus 로고    scopus 로고
    • A two-dimensional data distribution method for parallel sparse matrix-vector multiplication
    • DOI 10.1137/S0036144502409019
    • Vastenhouw B, Bisseling RH,. A two-dimensional data distribution method for parallel sparse matrix-vector multiplication. SIAM Review 2005; 47 (1): 67-95. (Pubitemid 40535972)
    • (2005) SIAM Review , vol.47 , Issue.1 , pp. 67-95
    • Vastenhouw, B.1    Bisseling, R.H.2
  • 18
    • 77954707501 scopus 로고    scopus 로고
    • Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods
    • Yzelman AN, Bisseling RH,. Cache-oblivious sparse matrix-vector multiplication by using sparse matrix partitioning methods. SIAM Journal on Scientific Computing 2009; 31 (4): 3128-3154.
    • (2009) SIAM Journal on Scientific Computing , vol.31 , Issue.4 , pp. 3128-3154
    • Yzelman, A.N.1    Bisseling, R.H.2
  • 19
    • 0031269220 scopus 로고    scopus 로고
    • Improving the memory-system performance of sparse-matrix vector multiplication
    • Toledo S,. Improving the memory-system performance of sparse-matrix vector multiplication. IBM Journal of Research and Development 1997; 41 (6): 711-725. (Pubitemid 127557044)
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
    • Toledo, S.1
  • 20
    • 60949098907 scopus 로고    scopus 로고
    • Optimization of sparse matrix-vector multiplication on emerging multicore platforms
    • DOI: 10.1016/j.parco.2008.12.006
    • Williams S, Oliker L, et al,. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 2009; 35 (3): 178-194. DOI: 10.1016/j.parco.2008.12.006.
    • (2009) Parallel Computing , vol.35 , Issue.3 , pp. 178-194
    • Williams, S.1    Oliker, L.2
  • 23
    • 0004236492 scopus 로고    scopus 로고
    • 3rd ed., Johns Hopkins Studies in the Mathematical Sciences, The Johns Hopkins University Press: Baltimore, MD
    • Golub GH, Van Loan CF,. Matrix Computations, 3rded., Johns Hopkins Studies in the Mathematical Sciences, The Johns Hopkins University Press: Baltimore, MD, 1996.
    • (1996) Matrix Computations
    • Golub, G.H.1    Van Loan, C.F.2
  • 25
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • DOI 10.1016/S0167-8191(00)00087-9
    • Whaley RC, Petitet A, et al,. Automated empirical optimizations of software and the ATLAS project. Parallel Computing 2001; 27 (1-2): 3-35. (Pubitemid 32264775)
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Clint Whaley, R.1    Petitet, A.2    Dongarra, J.J.3
  • 26
    • 24344485098 scopus 로고    scopus 로고
    • OSKI: A library of automatically tuned sparse matrix kernels
    • DOI 10.1088/1742-6596/16/1/071
    • Vuduc R, Demmel JW, et al,. OSKI: a library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series 2005; 16: 521-530. (Pubitemid 41259393)
    • (2005) Journal of Physics: Conference Series , vol.16 , Issue.1 , pp. 521-530
    • Vuduc, R.1    Demmel, J.W.2    Yelick, K.A.3
  • 27
    • 84930675361 scopus 로고    scopus 로고
    • A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve
    • Springer: Berlin, in press
    • Yzelman AN, Bisseling RH,. A cache-oblivious sparse matrix-vector multiplication scheme based on the Hilbert curve. In Progress in Industrial Mathematics at ECMI 2010, Springer: Berlin, 2011. in press.
    • (2011) Progress in Industrial Mathematics at ECMI 2010
    • Yzelman, A.N.1    Bisseling, R.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.