SCOPUS Information Search Platform

Volume 2006, 2006

The general matrix multiply-add operation on 2D torus

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; OPTIMIZATION; PARALLEL PROCESSING SYSTEMS; PROBLEM SOLVING; RESOURCE ALLOCATION; SCHEDULING;

CANNON'S ALGORITHM; DATA ALLOCATIONS; DATA ROLLING; TOROIDAL ARRAY PROCESSOR;

COMPUTATIONAL METHODS;

EID: 33847114052 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IPDPS.2006.1639613 Document Type: Conference Paper

Times cited: 10

References (20)

1
- 0028545949
- A high performance matrix multiplication algorithm on a distributed-memory parallel computer, using overlapped communication
- R. Agarwal, F. Gustavson, and M. Zubair. A high performance matrix multiplication algorithm on a distributed-memory parallel computer, using overlapped communication. IBM J. of Res. and Develop., 38(6):673-681, 1994.
- (1994) IBM J. of Res. and Develop. , vol.38 , Issue.6 , pp. 673-681
- Agarwal, R.¹ Gustavson, F.² Zubair, M.³

2
- 0003712293
- PhD thesis, Montana State University
- L. Cannon. A Cellular Computer to Implement the Kalman Filter Algorithm. PhD thesis, Montana State University, 1969.
- (1969) A Cellular Computer to Implement the Kalman Filter Algorithm
- Cannon, L.¹

4
- 4043097206
- Elsevier
- W. Dally and B. Towles. Principles and Practices of Interconnection Networks. Elsevier, 2004.
- (2004) Principles and Practices of Interconnection Networks
- Dally, W.¹ Towles, B.²

6
- 0025402476
- A set of level 3 basic linear algebra subprograms
- J. J. Dongarra, J. Du Croz, I. Duff, and S. Hammarling. A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Software, 16:1-17, 1990.
- (1990) ACM Trans. Math. Software , vol.16 , pp. 1-17
- Dongarra, J.J.¹ Du Croz, J.² Duff, I.³ Hammarling, S.⁴

7
- 0023983122
- An extended set of FORTRAN basic linear algebra subprograms
- J. J. Dongarra, J. Du Croz, S. Hammarling, and R. J. Hanson. An extended set of FORTRAN basic linear algebra subprograms. ACM Trans. Math. Software, 14:1-17, 1988.
- (1988) ACM Trans. Math. Software , vol.14 , pp. 1-17
- Dongarra, J.J.¹ Du Croz, J.² Hammarling, S.³ Hanson, R.J.⁴

8
- 0023288009
- Matrix algorithms on a hypercube I: Matrix multiplication
- G. Fox, S. Otto, and A. Hey. Matrix algorithms on a hypercube I: Matrix multiplication. Parallel Computing, 4:17-31, 1987.
- (1987) Parallel Computing , vol.4 , pp. 17-31
- Fox, G.¹ Otto, S.² Hey, A.³

9
- 0004236492
- Johns Hopkins, Baltimore, Maryland
- G. H. Golub and C. F. Van Loan. Matrix Computations. Johns Hopkins, Baltimore, Maryland, 1989.
- (1989) Matrix Computations
- Golub, G.H.¹ Van Loan, C.F.²

12
- 0032155271
- GEMM-based level 3 BLAS: High-performance model implementations and performance evaluation benchmark
- B. Kågström, P. Ling, and C. Van Loan. GEMM-based level 3 BLAS: high-performance model implementations and performance evaluation benchmark. ACM Trans. Math. Software, 24(3):268-302, 1998.
- (1998) ACM Trans. Math. Software , vol.24 , Issue.3 , pp. 268-302
- Kågström, B.¹ Ling, P.² Van Loan, C.³

13
- 0003859414
- Prentice Hall
- S. Kung. VLSI Array Processors. Prentice Hall, 1988.
- (1988) VLSI Array Processors
- Kung, S.¹

15
- 0018515759
- Basic linear algebra subprograms for FORTRAN usage
- C. L. Lawson, R. J. Hanson, D. R. Kincaid, and F. T. Krogh. Basic linear algebra subprograms for FORTRAN usage. ACM Trans. Math. Software, 5:308-323, 1979.
- (1979) ACM Trans. Math. Software , vol.5 , pp. 308-323
- Lawson, C.L.¹ Hanson, R.J.² Kincaid, D.R.³ Krogh, F.T.⁴

16
- 0038553717
- Modular mappings and data distribution independent computations
- H. J. Lee and J. A. Fortes. Modular mappings and data distribution independent computations. Parallel Processing Letters, 7(2): 169-180, 1997.
- (1997) Parallel Processing Letters , vol.7 , Issue.2 , pp. 169-180
- Lee, H.J.¹ Fortes, J.A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.