



Volume 38, Issue 3, 2012, Pages

Parallel and cache-efficient in-place matrix storage format conversion

Author keywords

Blocked matrix data layout; In place matrix transposition; Parallel and cache efficient algorithms

Indexed keywords

CACHE-EFFICIENT; DATA LAYOUTS; MATRIX TRANSPOSITION; NUMERICAL LIBRARY; PARALLEL ENVIRONMENT; STORAGE FORMATS

EID: 84862107202     PISSN: 00983500     EISSN: 15577295     Source Type: Journal    
DOI: 10.1145/2168773.2168775     Document Type: Article
Times cited: 40

References (30)
  • 1
    • 0016567155
    • A computer algorithm for transposing nonsquare matrices
    • ALLTOP, W. O. 1975. A computer algorithm for transposing nonsquare matrices. IEEE Trans. Comput. 24, 10, 1038-1040.
    • (1975) IEEE Trans. Comput. , vol.24 , Issue.10 , pp. 1038-1040
    • Alltop, W.O.1
  • 2
    • 33746256776
    • Cache oblivious matrix multiplication using an element ordering based on a Peano curve
    • DOI 10.1016/j.laa.2006.03.018, PII S0024379506001595
    • BADER, M. AND ZENGER, C. 2006. Cache oblivious matrix multiplication using an element ordering based on a Peano curve. Linear Algebra Its Appl. 417, 2-3, 301-313. (Pubitemid 44093596)
    • (2006) Linear Algebra and Its Applications , vol.417 , Issue.2-3 , pp. 301-313
    • Bader, M.1    Zenger, C.2
  • 4
    • 34247127338
    • A method for transposing a matrix
    • BERMAN, M. F. 1958. A method for transposing a matrix. J. ACM 5, 4, 383-384.
    • (1958) J. ACM , vol.5 , Issue.4 , pp. 383-384
    • Berman, M.F.1
  • 5
    • 38049061832
    • Algorithm 302: Transpose vector stored array
    • BOOTHROYD, J. 1967. Algorithm 302: Transpose vector stored array. Comm. ACM 10, 5, 292-293.
    • (1967) Comm. ACM , vol.10 , Issue.5 , pp. 292-293
    • Boothroyd, J.1
  • 6
    • 84976753838
    • Algorithm 380: In-situ transposition of a rectangular matrix
    • BREBNER, M. A. AND LAFLIN, S. 1970. Algorithm 380: In-situ transposition of a rectangular matrix. Comm. ACM 13, 5, 324-326.
    • (1970) Comm. ACM , vol.13 , Issue.5 , pp. 324-326
    • Brebner, M.A.1    Laflin, S.2
  • 7
    • 84976823916
    • Algorithm 467: Matrix transposition in place
    • BRENNER, N. 1973. Algorithm 467: Matrix transposition in place. Comm. ACM 16, 11, 692-694.
    • (1973) Comm. ACM , vol.16 , Issue.11 , pp. 692-694
    • Brenner, N.1
  • 8
    • 0006359598
    • Algorithm 513: Analysis of in-situ transposition
    • CATE, E. G. AND TWIGG, D. W. 1977. Algorithm 513: Analysis of in-situ transposition. ACM Trans. Math. Softw. 3, 1, 104-110.
    • (1977) ACM Trans. Math. Softw. , vol.3 , Issue.1 , pp. 104-110
    • Cate, E.G.1    Twigg, D.W.2
  • 10
    • 60649117581
    • QR factorization for the CELL Broadband Engine
    • DONGARRA, J. AND KURZAK, J. 2009. QR factorization for the CELL Broadband Engine. Sci. Progr. 17, 1-2, 31-42.
    • (2009) Sci. Progr. , vol.17 , Issue.1-2 , pp. 31-42
    • Dongarra, J.1    Kurzak, J.2
  • 11
    • 0029492798
    • Transposing a matrix on a vector computer
    • DOW, M. 1995. Transposing a matrix on a vector computer. Parall. Comput. 21, 12, 1997-2005.
    • (1995) Parall. Comput. , vol.21 , Issue.12 , pp. 1997-2005
    • Dow, M.1
  • 12
    • 1842832833
    • Recursive blocked algorithms and hybrid data structures for dense matrix library software
    • ELMROTH, E., GUSTAVSON, F., JONSSON, I., AND KÅGSTRÖM, B. 2004. Recursive blocked algorithms and hybrid data structures for dense matrix library software. SIAM Rev. 46, 1, 3-45.
    • (2004) SIAM Rev. , vol.46 , Issue.1 , pp. 3-45
    • Elmroth, E.1    Gustavson, F.2    Jonsson, I.3    Kågström, B.4
  • 14
    • 0016939423
    • Array permutation by index-digit permutation
    • FRASER, D. 1976. Array permutation by index-digit permutation. J. ACM 23, 2, 298-309.
    • (1976) J. ACM , vol.23 , Issue.2 , pp. 298-309
    • Fraser, D.1
  • 15
    • 84972622535
    • The impact of hierarchical memory systems on linear algebra algorithm design
    • GALLIVAN, K., JALBY, W., MEIER, U., AND SAMEH, A. 1988. The impact of hierarchical memory systems on linear algebra algorithm design. Int. J. Supercomput. Appl. 2, 1, 12-48.
    • (1988) Int. J. Supercomput. Appl. , vol.2 , Issue.1 , pp. 12-48
    • Gallivan, K.1    Jalby, W.2    Meier, U.3    Sameh, A.4
  • 19
    • 65849486487
    • Distributed SBP Cholesky factorization algorithms with near-optimal scheduling
    • GUSTAVSON, F., KARLSSON, L., AND KÅGSTRÖM, B. 2009. Distributed SBP Cholesky factorization algorithms with near-optimal scheduling. ACM Trans. Math. Softw. 36, 2, 11:1-11:25.
    • (2009) ACM Trans. Math. Softw. , vol.36 , Issue.2 , pp. 11:1-11:25
    • Gustavson, F.1    Karlsson, L.2    Kågström, B.3
  • 21
    • 0042235298
    • Tiling, block data layout, and memory hierarchy performance
    • HONG, B., PARK, N., AND PRASANNA, V. K. 2003. Tiling, block data layout, and memory hierarchy performance. IEEE Trans. Parall. Distrib. Syst. 14, 7, 640-654.
    • (2003) IEEE Trans. Parall. Distrib. Syst. , vol.14 , Issue.7 , pp. 640-654
    • Hong, B.1    Park, N.2    Prasanna, V.K.3
  • 26
    • 0001941270
    • Mathematical analysis of algorithms
    • North-Holland
    • KNUTH, D. E. 1971. Mathematical analysis of algorithms. In Proceedings of IFIP Congress. North-Holland, 19-27.
    • (1971) Proceedings of IFIP Congress , pp. 19-27
    • Knuth, D.E.1
  • 28
  • 29
    • 84968507674
    • A problem in Abelian groups, with application to the transposition of a matrix on an electronic computer
    • PALL, G. AND SEIDEN, E. 1960. A problem in Abelian groups, with application to the transposition of a matrix on an electronic computer. Math. Comput. 14, 70, 189-192.
    • (1960) Math. Comput. , vol.14 , Issue.70 , pp. 189-192
    • Pall, G.1    Seiden, E.2
  • 30
    • 38049073987
    • Transposing matrices in a digital computer
    • WINDLEY, P. F. 1959. Transposing matrices in a digital computer. Comput. J. 2, 1, 47-48.
    • (1959) Comput. J. , vol.2 , Issue.1 , pp. 47-48
    • Windley, P.F.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.