SCOPUS Information Retrieval Platform

IEEE Transactions on Parallel and Distributed Systems

Volume 10, Issue 12, 1999, Pages 1201-1216

Algorithmic redistribution methods for block-cyclic decompositions

(2) Petitet, Antoine P. (b); Dongarra, Jack J. (a,b)

a IEEE (United States)

b University of Tennessee (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMIC REDISTRIBUTION METHODS; BLOCK-CYCLIC DECOMPOSITION;

ALGORITHMS; COMPUTER ARCHITECTURE; COMPUTER SYSTEMS PROGRAMMING; MATHEMATICAL OPERATORS; MATRIX ALGEBRA; RESPONSE TIME (COMPUTER SYSTEMS);

DISTRIBUTED COMPUTER SYSTEMS;

EID: 0033309435 PISSN: 10459219 EISSN: None Source Type: Journal
DOI: 10.1109/71.819944 Document Type: Article

Times cited : (36)

References (51)

1
- 12444260176
- Technical Report CSD-TR-91-007, Purdue Univ., West Lafayette, Ind.
- M. Aboelaze, N. Chrisochoides, and E. Houstis, "The Parallelization of Level 2 and 3 BLAS Operations on Distributed-Memory Machines," Technical Report CSD-TR-91-007, Purdue Univ., West Lafayette, Ind., 1991.
- (1991) The Parallelization of Level 2 and 3 BLAS Operations on Distributed-Memory Machines
- Aboelaze, M.¹ Chrisochoides, N.² Houstis, E.³

2
- 0028545949
- A High Performance Matrix Multiplication Algorithm on a Distributed-Memory Parallel Computer, Using Overlapped Communication
- R. Agarwal, F. Gustavson, and M. Zubair, "A High Performance Matrix Multiplication Algorithm on a Distributed-Memory Parallel Computer, Using Overlapped Communication," IBM J. Research and Development, vol. 38, no. 6, pp.673-681, 1994.
- (1994) IBM J. Research and Development , vol.38 , Issue.6 , pp. 673-681
- Agarwal, R.¹ Gustavson, F.² Zubair, M.³

3
- 0029218542
- SP2 System Architecture
- T. Agerwala, J. Martin, J. Mirza, D. Sadler, D. Dias, and M. Snir, "SP2 System Architecture," IBM Systems J., vol. 34, no. 2, pp. 153-184, 1995.
- (1995) IBM Systems J. , vol.34 , Issue.2 , pp. 153-184
- Agerwala, T.¹ Martin, J.² Mirza, J.³ Sadler, D.⁴ Dias, D.⁵ Snir, M.⁶

4
- 0003873564
- Technical Report A-278-CRI, CRI-Ecole des Mines, Fontainebleau, France
- C. Ancourt, F. Coelho, F. Irigoin, and R. Keryell, "A Linear Algebra Framework for Static HPF Code Distribution," Technical Report A-278-CRI, CRI-Ecole des Mines, Fontainebleau, France, 1995. (Available at http://www.cri.ensmp.fr.)
- (1995) A Linear Algebra Framework for Static HPF Code Distribution
- Ancourt, C.¹ Coelho, F.² Irigoin, F.³ Keryell, R.⁴

5
- 0003706460
- Philadelphia, Penn.: SIAM
- E. Anderson, Z. Bai, C. Bischof, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, S. Ostrouchov, and D. Sorensen, LAPACK Users' Guide. Philadelphia, Penn.: SIAM, 1995.
- (1995) LAPACK Users' Guide
- Anderson, E.¹ Bai, Z.² Bischof, C.³ Demmel, J.⁴ Dongarra, J.⁵ Du Croz, J.⁶ Greenbaum, A.⁷ Hammarling, S.⁸ McKenney, A.⁹ Ostrouchov, S.¹⁰ Sorensen, D.¹¹

6
- 0039408378
- Technical Report ECA-TR-147, Boeing Computer Services, Seattle, Wash.
- C. Ashcraft, "The Distributed Solution of Linear Systems Using the Torus-Wrap Data Mapping," Technical Report ECA-TR-147, Boeing Computer Services, Seattle, Wash., 1990.
- (1990) The Distributed Solution of Linear Systems Using the Torus-Wrap Data Mapping
- Ashcraft, C.¹

7
- 4243278459
- master's thesis, Mississippi State Univ.
- P. Bangalore, "The Data-Distribution-Independent Approach to Scalable Parallel Libraries," master's thesis, Mississippi State Univ., 1995.
- (1995) The Data-Distribution-Independent Approach to Scalable Parallel Libraries
- Bangalore, P.¹

8
- 24344465959
- Technical Report UT CS-96-326, LAPACK Working Note 111, Univ. Tennessee
- J. Bilmes, K. Asanovic, J. Demmel, D. Lam, and C. Chin, "Optimizing Matrix Multiply using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology," Technical Report UT CS-96-326, LAPACK Working Note 111, Univ. Tennessee, 1996.
- (1996) Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology
- Bilmes, J.¹ Asanovic, K.² Demmel, J.³ Lam, D.⁴ Chin, C.⁵

9
- 84943678690
- Parallel LU Decomposition on a Transputer Network
- G. van Zee and J. van der Vorst, eds.
- R. Bisseling and J. van der Vorst, "Parallel LU Decomposition on a Transputer Network," Lecture Notes in Computer Science, G. van Zee and J. van der Vorst, eds., vol. 384, pp. 61-77, 1989.
- (1989) Lecture Notes in Computer Science , vol.384 , pp. 61-77
- Bisseling, R.¹ Van Der Vorst, J.²

10
- 0041187787
- Parallel Triangular System Solving on a Mesh Network of Transputers
- R. Bisseling and J. van der Vorst, "Parallel Triangular System Solving on a Mesh Network of Transputers," SIAM J. Scientific and Statistical Computing, vol. 12, pp. 787-799, 1991.
- (1991) SIAM J. Scientific and Statistical Computing , vol.12 , pp. 787-799
- Bisseling, R.¹ Van Der Vorst, J.²

11
- 0003615167
- Philadelphia, Penn.: SIAM
- L. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, and R.C. Whaley, ScaLAPACK Users' Guide. Philadelphia, Penn.: SIAM, 1997.
- (1997) ScaLAPACK Users' Guide
- Blackford, L.¹ Choi, J.² Cleary, A.³ D'Azevedo, E.⁴ Demmel, J.⁵ Dhillon, I.⁶ Dongarra, J.⁷ Hammarling, S.⁸ Henry, G.⁹ Petitet, A.¹⁰ Stanley, K.¹¹ Walker, D.¹² Whaley, R.C.¹³

12
- 0027558054
- Implementation of BLAS Level 3 and LINPACK Benchmark on the AP1000
- R. Brent and P. Strazdins, "Implementation of BLAS Level 3 and LINPACK Benchmark on the AP1000," Fujitsu Scientific and Technical J., vol. 5, no. 1, pp. 61-70, 1993.
- (1993) Fujitsu Scientific and Technical J. , vol.5 , Issue.1 , pp. 61-70
- Brent, R.¹ Strazdins, P.²

13
- 0002742410
- Generating Local Addresses and Communication Sets for Data Parallel Programs
- S. Chatterjee, J. Gilbert, F. Long, R. Schreiber, and S. Tseng, "Generating Local Addresses and Communication Sets for Data Parallel Programs," J. Parallel and Distributed Computing, vol. 26, pp. 72-84, 1995.
- (1995) J. Parallel and Distributed Computing , vol.26 , pp. 72-84
- Chatterjee, S.¹ Gilbert, J.² Long, F.³ Schreiber, R.⁴ Tseng, S.⁵

14
- 33749935843
- Technical Report UT CS-97-369, LAPACK Working Note 129, Univ. Tennessee
- J. Choi, "A New Parallel Matrix Multiplication Algorithm on Distributed-Memory Concurrent Computers," Technical Report UT CS-97-369, LAPACK Working Note 129, Univ. Tennessee, 1997.
- (1997) A New Parallel Matrix Multiplication Algorithm on Distributed-Memory Concurrent Computers
- Choi, J.¹

15
- 0028530654
- PUMMA: Parallel Universal Matrix Multiplication Algorithms on Distributed-Memory Concurrent Computers
- J. Choi, J. Dongarra, and D. Walker, "PUMMA: Parallel Universal Matrix Multiplication Algorithms on Distributed-Memory Concurrent Computers," Concurrency: Practice and Experience, vol. 6, no. 7, pp. 543-570, 1994.
- (1994) Concurrency: Practice and Experience , vol.6 , Issue.7 , pp. 543-570
- Choi, J.¹ Dongarra, J.² Walker, D.³

16
- 0030241311
- PB-BLAS: A Set of Parallel Block Basic Linear Algebra Subroutines
- J. Choi, J. Dongarra, and D. Walker, "PB-BLAS: A Set of Parallel Block Basic Linear Algebra Subroutines," Concurrency: Practice and Experience, vol. 8, no. 7, pp. 517-535, 1996.
- (1996) Concurrency: Practice and Experience , vol.8 , Issue.7 , pp. 517-535
- Choi, J.¹ Dongarra, J.² Walker, D.³

17
- 0031221523
- Parallel Implementation of BLAS: General Techniques for Level 3 BLAS
- A. Chtchelkanova, J. Gunnels, G. Morrow, J. Overfelt, and R. van de Geijn, "Parallel Implementation of BLAS: General Techniques for Level 3 BLAS," Concurrency: Practice and Experience, vol. 9, no. 9, pp. 837-857, 1997.
- (1997) Concurrency: Practice and Experience , vol.9 , Issue.9 , pp. 837-857
- Chtchelkanova, A.¹ Gunnels, J.² Morrow, G.³ Overfelt, J.⁴ Van De Geijn, R.⁵

18
- 0006488807
- QR Factorization of a Dense Matrix on a Hypercube Multiprocessor
- E. Chu and A. George, "QR Factorization of a Dense Matrix on a Hypercube Multiprocessor," SIAM J. Scientific and Statistical Computing, vol. 11, pp. 990-1,028, 1990.
- (1990) SIAM J. Scientific and Statistical Computing , vol.11 , pp. 990-1028
- Chu, E.¹ George, A.²

19
- 0028443077
- A Parallel Block Implementation of Level 3 BLAS for MIMD Vector Processors
- M. Dayde, I. Duff, and A. Petitet, "A Parallel Block Implementation of Level 3 BLAS for MIMD Vector Processors," ACM Trans. Mathematical Software, vol. 20, no. 2, pp. 178-193, 1994.
- (1994) ACM Trans. Mathematical Software , vol.20 , Issue.2 , pp. 178-193
- Dayde, M.¹ Duff, I.² Petitet, A.³

20
- 0032002536
- Scheduling Block-Cyclic Array Redistribution
- F. Desprez, J. Dongarra, A. Petitet, C. Randriamaro, and Y. Robert, "Scheduling Block-Cyclic Array Redistribution," IEEE Trans. Parallel and Distributed Systems, vol. 9, no. 2, pp. 192-205, 1998.
- (1998) IEEE Trans. Parallel and Distributed Systems , vol.9 , Issue.2 , pp. 192-205
- Desprez, F.¹ Dongarra, J.² Petitet, A.³ Randriamaro, C.⁴ Robert, Y.⁵

21
- 0000778168
- Scalability Issues in the Design of a Library for Dense Linear Algebra
- J. Dongarra, R. van de Geijn, and D. Walker, "Scalability Issues in the Design of a Library for Dense Linear Algebra," J. Parallel and Distributed Computing, vol. 22, no. 3, pp. 523-537, 1994.
- (1994) J. Parallel and Distributed Computing , vol.22 , Issue.3 , pp. 523-537
- Dongarra, J.¹ Van De Geijn, R.² Walker, D.³

22
- 0029324485
- Software Libraries for Linear Algebra Computations on High Performance Computers
- J. Dongarra and D. Walker, "Software Libraries for Linear Algebra Computations on High Performance Computers," SIAM Review, vol. 37, no. 2, pp. 151-180, 1995.
- (1995) SIAM Review , vol.37 , Issue.2 , pp. 151-180
- Dongarra, J.¹ Walker, D.²

23
- 0012493293
- Technical Report UT CS-95-281, LAPACK Working Note 94, Univ. Tennessee
- J. Dongarra and R.C. Whaley, "A User's Guide to the BLACS v1.0," Technical Report UT CS-95-281, LAPACK Working Note 94, Univ. Tennessee, 1995. (http://www.netlib.org/blacs/)
- (1995) A User's Guide to the BLACS V1.0
- Dongarra, J.¹ Whaley, R.C.²

24
- 0003506603
- Englewood Cliffs, N.J.: Prentice Hall
- G. Fox, M. Johnson, G. Lyzenga, S. Otto, J. Salmon, and D. Walker, Solving Problems on Concurrent Processors. Englewood Cliffs, N.J.: Prentice Hall, 1988.
- (1988) Solving Problems on Concurrent Processors
- Fox, G.¹ Johnson, M.² Lyzenga, G.³ Otto, S.⁴ Salmon, J.⁵ Walker, D.⁶

25
- 0023288009
- Matrix Algorithms on a Hypercube I: Matrix Multiplication
- G. Fox, S. Otto, and A. Hey, "Matrix Algorithms on a Hypercube I: Matrix Multiplication," Parallel Computing, vol. 3, pp. 17-31, 1987.
- (1987) Parallel Computing , vol.3 , pp. 17-31
- Fox, G.¹ Otto, S.² Hey, A.³

26
- 0039821547
- LU Factorization Algorithms on Distributed-Memory Multiprocessor Architectures
- G. Geist and C. Romine, "LU Factorization Algorithms on Distributed-Memory Multiprocessor Architectures," SIAM J. Scientific and Statistical Computing, vol. 9, pp. 639-649, 1988.
- (1988) SIAM J. Scientific and Statistical Computing , vol.9 , pp. 639-649
- Geist, G.¹ Romine, C.²

27
- 0001615713
- Parallel Solution of Triangular Systems on Distributed-Memory Multiprocessors
- M. Heath and C. Romine, "Parallel Solution of Triangular Systems on Distributed-Memory Multiprocessors," SIAM J. Scientific and Statistical Computing, vol. 9, pp. 558-588, 1988.
- (1988) SIAM J. Scientific and Statistical Computing , vol.9 , pp. 558-588
- Heath, M.¹ Romine, C.²

28
- 33749521746
- personal communication
- B. Hendrickson, E. Jessup, and C. Smith, "A Parallel Eigensolver for Dense Symmetric Matrices," personal communication, 1996.
- (1996) A Parallel Eigensolver for Dense Symmetric Matrices
- Hendrickson, B.¹ Jessup, E.² Smith, C.³

29
- 0000667923
- The Torus-Wrap Mapping for Dense Matrix Calculations on Massively Parallel Computers
- Sept.
- B. Hendrickson and D. Womble, "The Torus-Wrap Mapping for Dense Matrix Calculations on Massively Parallel Computers," J. Scientific and Statistical Computing, vol. 15, no. 5, pp. 1,201-1,226, Sept. 1994.
- (1994) J. Scientific and Statistical Computing , vol.15 , Issue.5 , pp. 1201-1226
- Hendrickson, B.¹ Womble, D.²

30
- 0040770650
- Technical Report UT CS-94-244, LAPACK Working Note 79, Univ. Tennessee
- G. Henry and R. van de Geijn, "Parallelizing the QR Algorithm for the Unsymmetric Algebraic Eigenvalue Problem: Myths and Reality," Technical Report UT CS-94-244, LAPACK Working Note 79, Univ. Tennessee, 1994.
- (1994) Parallelizing the QR Algorithm for the Unsymmetric Algebraic Eigenvalue Problem: Myths and Reality
- Henry, G.¹ Van De Geijn, R.²

31
- 0028529387
- Matrix Multiplication on the Intel Touchstone DELTA
- S. Huss-Lederman, E. Jacobson, A. Tsao, and G. Zhang, "Matrix Multiplication on the Intel Touchstone DELTA," Concurrency: Practice and Experience, vol. 6, no. 7, pp. 571-594, 1994.
- (1994) Concurrency: Practice and Experience , vol.6 , Issue.7 , pp. 571-594
- Huss-Lederman, S.¹ Jacobson, E.² Tsao, A.³ Zhang, G.⁴

32
- 0040831411
- Technical Report UMINF 95-18, Dept. Computing Science, Umeå Univ.
- B. Kågström, P. Ling, and C. van Loan, "GEMM-Based Level 3 BLAS: High-Performance Model Implementations and Performance Evaluation Benchmark," Technical Report UMINF 95-18, Dept. Computing Science, Umeå Univ., 1995.
- (1995) GEMM-Based Level 3 BLAS: High-Performance Model Implementations and Performance Evaluation Benchmark
- Kågström, B.¹ Ling, P.² Van Loan, C.³

33
- 0029484078
- Processor Mapping Techniques towards Efficient Data Redistribution
- E. Kalns and L. Ni, "Processor Mapping Techniques towards Efficient Data Redistribution," IEEE Trans. Parallel and Distributed Systems, vol. 6, no. 12, pp. 1,234-1,247, 1995.
- (1995) IEEE Trans. Parallel and Distributed Systems , vol.6 , Issue.12 , pp. 1234-1247
- Kalns, E.¹ Ni, L.²

34
- 0029192689
- A Linear-Time Algorithm for Computing the Memory Access Sequence in Data Parallel Programs
- K. Kennedy, N. Nedeljković, and A. Sethi, "A Linear-Time Algorithm for Computing the Memory Access Sequence in Data Parallel Programs," Proc. Fifth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, 1995.
- (1995) Proc. Fifth ACM SIGPLAN Symp. Principles and Practice of Parallel Programming
- Kennedy, K.¹ Nedeljković, N.² Sethi, A.³

35
- 0003487717
- Cambridge, Mass.: MIT Press
- C. Koelbel, D. Loveman, R. Schreiber, G. Steele, and M. Zosel, The High Performance Fortran Handbook. Cambridge, Mass.: MIT Press, 1994.
- (1994) The High Performance Fortran Handbook
- Koelbel, C.¹ Loveman, D.² Schreiber, R.³ Steele, G.⁴ Zosel, M.⁵

36
- 0003901150
- Redwood City, Calif.: Benjamin/Cummings Publishing Company, Inc.
- V. Kumar, A. Grama, A. Gupta, and G. Karypis, Introduction to Parallel Computing. Redwood City, Calif.: Benjamin/Cummings Publishing Company, Inc., 1994.
- (1994) Introduction to Parallel Computing
- Kumar, V.¹ Grama, A.² Gupta, A.³ Karypis, G.⁴

37
- 0013317481
- A New Method for Solving Triangular Systems on Distributed-Memory Message-Passing Multiprocessor
- G. Li and T. Coleman, "A New Method for Solving Triangular Systems on Distributed-Memory Message-Passing Multiprocessor," SIAM J. Scientific and Statistical Computing, vol. 10, no. 2, pp. 382-396, 1989.
- (1989) SIAM J. Scientific and Statistical Computing , vol.10 , Issue.2 , pp. 382-396
- Li, G.¹ Coleman, T.²

38
- 0000828297
- Block-Cyclic Dense Linear Algebra
- W. Lichtenstein and S.L. Johnsson, "Block-Cyclic Dense Linear Algebra," SIAM J. Scientific and Statistical Computing, vol. 14, no. 6, pp. 1,259-1,288, 1993.
- (1993) SIAM J. Scientific and Statistical Computing , vol.14 , Issue.6 , pp. 1259-1288
- Lichtenstein, W.¹ Johnsson, S.L.²

39
- 33749927010
- Technical Report CENG 97-10, Dept. Electrical Engineering-Systems, Univ. Southern California, Los Angeles, Calif.
- Y. Lim, P. Bhat, and V. Prasanna, "Efficient Algorithms for Block-Cyclic Redistribution of Arrays," Technical Report CENG 97-10, Dept. Electrical Engineering-Systems, Univ. Southern California, Los Angeles, Calif., 1997.
- (1997) Efficient Algorithms for Block-Cyclic Redistribution of Arrays
- Lim, Y.¹ Bhat, P.² Prasanna, V.³

40
- 0028464291
- Multiplication of Matrices of Arbitrary Shapes on a Data Parallel Computer
- K. Mathur and S.L. Johnsson, "Multiplication of Matrices of Arbitrary Shapes on a Data Parallel Computer," Parallel Computing, vol. 20, pp. 919-951, 1994.
- (1994) Parallel Computing , vol.20 , pp. 919-951
- Mathur, K.¹ Johnsson, S.L.²

41
- 0242505768
- doctoral thesis, Univ. Tennessee, Knoxville
- A. Petitet, Algorithmic Redistribution Methods for Block Cyclic Decompositions, doctoral thesis, Univ. Tennessee, Knoxville, 1996.
- (1996) Algorithmic Redistribution Methods for Block Cyclic Decompositions
- Petitet, A.¹

42
- 0004395831
- Fast Runtime Block Cyclic Data Redistribution on Multiprocessors
- L. Prylli and B. Tourancheau, "Fast Runtime Block Cyclic Data Redistribution on Multiprocessors," J. Parallel and Distributed Computing, vol. 45, 1997.
- (1997) J. Parallel and Distributed Computing , vol.45
- Prylli, L.¹ Tourancheau, B.²

43
- 0029231855
- Matrix Factorization using Distributed Panels on the Fujitsu AP1000
- P. Strazdins, "Matrix Factorization using Distributed Panels on the Fujitsu AP1000," Proc. IEEE First Int'l Conf. Algorithms and Architectures for Parallel Processing (ICA3PP-95), 1995.
- (1995) Proc. IEEE First Int'l Conf. Algorithms and Architectures for Parallel Processing (ICA3PP-95)
- Strazdins, P.¹

44
- 33749948602
- A High Performance Version of Parallel LAPACK: Preliminary Report
- Fujitsu Parallel Computing Center
- P. Strazdins and H. Koesmarno, "A High Performance Version of Parallel LAPACK: Preliminary Report," Proc. Sixth Parallel Computing Workshop, Fujitsu Parallel Computing Center, 1996.
- (1996) Proc. Sixth Parallel Computing Workshop
- Strazdins, P.¹ Koesmarno, H.²

45
- 0029218595
- The SP2 High-Performance Switch
- C. Stunkel, D. Shea, B. Abali, M. Atkins, C. Bender, D. Grice, P. Hochshild, D. Joseph, B. Nathanson, R. Swetz, R. Stucke, M. Tsao, and P. Varker, "The SP2 High-Performance Switch," IBM Systems J., vol. 34, no. 2, pp. 185-204, 1995.
- (1995) IBM Systems J. , vol.34 , Issue.2 , pp. 185-204
- Stunkel, C.¹ Shea, D.² Abali, B.³ Atkins, M.⁴ Bender, C.⁵ Grice, D.⁶ Hochshild, P.⁷ Joseph, D.⁸ Nathanson, B.⁹ Swetz, R.¹⁰ Stucke, R.¹¹ Tsao, M.¹² Varker, P.¹³

46
- 0000606960
- Fast Address Sequence Generation for Data Parallel Programs Using Integer Lattices
- P. Sadayappan et al., eds., Springer-Verlag
- A. Thirumalai and J. Ramanujam, "Fast Address Sequence Generation for Data Parallel Programs Using Integer Lattices," Languages and Compilers for Parallel Computing: Lecture Notes in Computer Science. P. Sadayappan et al., eds., Springer-Verlag, 1996.
- (1996) Languages and Compilers for Parallel Computing: Lecture Notes in Computer Science
- Thirumalai, A.¹ Ramanujam, J.²

47
- 0031123769
- SUMMA: Scalable Universal Matrix Multiplication Algorithm
- R. van de Geijn and J. Watts, "SUMMA: Scalable Universal Matrix Multiplication Algorithm," Concurrency: Practice and Experience, vol. 9, no. 4, pp. 255-274, 1997.
- (1997) Concurrency: Practice and Experience , vol.9 , Issue.4 , pp. 255-274
- Van De Geijn, R.¹ Watts, J.²

48
- 84990712105
- Experiments with Multicomputer LU Decomposition
- E. van de Velde, "Experiments with Multicomputer LU Decomposition," Concurrency: Practice and Experience, vol. 2, pp. 1-26, 1990.
- (1990) Concurrency: Practice and Experience , vol.2 , pp. 1-26
- Van De Velde, E.¹

49
- 0030282238
- Redistribution of Block-Cyclic Data Distributions Using MPI
- D. Walker and S. Otto, "Redistribution of Block-Cyclic Data Distributions Using MPI," Concurrency: Practice and Experience, vol. 8, no. 9, pp. 707-728, 1996.
- (1996) Concurrency: Practice and Experience , vol.8 , Issue.9 , pp. 707-728
- Walker, D.¹ Otto, S.²

50
- 0010224751
- Runtime Performance of Parallel Array Assignment: An Empirical Study
- L. Wang, J. Stichnoth, and S. Chatterjee, "Runtime Performance of Parallel Array Assignment: An Empirical Study," Proc. Supercomputing, 1996. (http://www.supercomp.org/sc96/proceedings/).
- (1996) Proc. Supercomputing
- Wang, L.¹ Stichnoth, J.² Chatterjee, S.³

51
- 0003418094
- Technical Report UT CS-97-366, LAPACK Working Note 131, Univ. Tennessee
- R. Whaley and J. Dongarra, "Automatically Tuned Linear Algebra Software," Technical Report UT CS-97-366, LAPACK Working Note 131, Univ. Tennessee, 1997.
- (1997) Automatically Tuned Linear Algebra Software
- Whaley, R.¹ Dongarra, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.