SCOPUS Information Retrieval Platform

Journal of Parallel and Distributed Computing

Volume 47, Issue 2, 1997, Pages 168-184

Models and scheduling algorithms for mixed data and task parallel programs

(3) Chakrabarti, Soumen (a); Demmel, James (a, b); Yelick, Katherine (a)

a UNIVERSITY OF CALIFORNIA (United States)

b UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 0031574566
PISSN: 07437315
EISSN: None
Source Type: Journal
DOI: 10.1006/jpdc.1997.1413
Document Type: Article

Times cited: (23)

References (45)

1
- 0029199162
- Empirical evaluation of the CRAY-T3D: A compiler perspective
- ACM SIGARCH
- R. Arpaci, D. Culler, A. Krishnamurthy, S. Steinberg, K. Yelick, 1995, Empirical evaluation of the CRAY-T3D: A compiler perspective, International Symposium on Computer Architecture, ACM SIGARCH.
- (1995) International Symposium on Computer Architecture
- Arpaci, R.¹ Culler, D.² Krishnamurthy, A.³ Steinberg, S.⁴ Yelick, K.⁵

2
- 0010921883
- Programming abstractions for dynamically partitioning and coordinating localized scientific calculations running on multiprocessors
- Baden S. B. Programming abstractions for dynamically partitioning and coordinating localized scientific calculations running on multiprocessors. SIAM J. Sci. Statist. Comput. 12:1991;145-157.
- (1991) SIAM J. Sci. Statist. Comput. , vol.12 , pp. 145-157
- Baden, S.B.¹

3
- 0001175581
- Design of a parallel nonsymmetric eigenroutine toolbox
- SIAM
- Z. Bai, J. Demmel, 1993, Design of a parallel nonsymmetric eigenroutine toolbox, Part I, Proceedings of the Sixth SIAM Conference on Parallel Processing for Scientific Computing, SIAM.
- (1993) Proceedings of the Sixth SIAM Conference on Parallel Processing for Scientific Computing , Issue.PART I
- Bai, Z.¹ Demmel, J.²

4
- 0028583189
- Parallel performance of a symmetric eigensolver based on the invariant subspace decomposition approach
- May, Knoxville, TN, IEEE Press, New York
- C. Bischof, S. Huss-Lederman, X. Sun, A. Tsao, T. Turnbull, May 1994, Parallel performance of a symmetric eigensolver based on the invariant subspace decomposition approach, Scalable High Performance Computing Conference, Knoxville, TN, pp. 32-39, IEEE Press, New York.
- (1994) Scalable High Performance Computing Conference , pp. 32-39
- Bischof, C.¹ Huss-Lederman, S.² Sun, X.³ Tsao, A.⁴ Turnbull, T.⁵

5
- 0003229750
- Scalapack: A portable linear algebra library for distributed memory computers - design issues and performance
- L. S. Blackford, J. Choi, A. Cleary, J. Demmel, I. Dhillon, J. J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. W. Walker, R. C. Whaley, Scalapack: A portable linear algebra library for distributed memory computers - Design issues and performance, Proceedings of Supercomputing '96.
- Proceedings of Supercomputing '96
- Blackford, L.S.¹ Choi, J.² Cleary, A.³ Demmel, J.⁴ Dhillon, I.⁵ Dongarra, J.J.⁶ Hammarling, S.⁷ Henry, G.⁸ Petitet, A.⁹ Stanley, K.¹⁰ Walker, D.W.¹¹ Whaley, R.C.¹²

6
- 43949161602
- Implementation of a portable nested data-parallel language
- Blelloch G. E., Chatterjee S., Hardwick J. C., Sipelstein J., Zagha M. Implementation of a portable nested data-parallel language. J. Parallel Distrib. Comput. 21:1994;4-14.
- (1994) J. Parallel Distrib. Comput. , vol.21 , pp. 4-14
- Blelloch, G.E.¹ Chatterjee, S.² Hardwick, J.C.³ Sipelstein, J.⁴ Zagha, M.⁵

7
- 0002634823
- Scheduling multithreaded computations by work stealing
- New York: IEEE Press
- Blumofe R., Leiserson C. Scheduling multithreaded computations by work stealing. Foundations of Computer Science (FOCS), Santa Fe, NM. November 1994;IEEE Press, New York.
- (1994) Foundations of Computer Science (FOCS), Santa Fe, NM
- Blumofe, R.¹ Leiserson, C.²

8
- 0030398638
- Resource scheduling for parallel database and scientific applications
- June Italy, Assoc. Comput. Mach. New York
- S. Chakrabarti, S. Muthukrishnan, June 1996, Resource scheduling for parallel database and scientific applications, Symposium on Parallel Algorithms and Architectures (SPAA), Italy, Assoc. Comput. Mach. New York.
- (1996) Symposium on Parallel Algorithms and Architectures (SPAA)
- Chakrabarti, S.¹ Muthukrishnan, S.²

9
- 0029183528
- Scheduling problems in parallel query optimization
- C. Chekuri, W. Hasan, R. Motwani, 1995, Scheduling problems in parallel query optimization, ACM Symposium on Principles of Database Systems.
- (1995) ACM Symposium on Principles of Database Systems
- Chekuri, C.¹ Hasan, W.² Motwani, R.³

10
- 0003978709
- A proposal for a set of parallel basic linear algebra subprograms
- CS-95-292:May
- Choi J., Dongarra J., Ostrouchov S., Petitet A., Walker D., Whaley R. C. A proposal for a set of parallel basic linear algebra subprograms. Computer Science Dept. Technical Report. CS-95-292:May 1995.
- (1995) Computer Science Dept. Technical Report
- Choi, J.¹ Dongarra, J.² Ostrouchov, S.³ Petitet, A.⁴ Walker, D.⁵ Whaley, R.C.⁶

11
- 0000659575
- A divide and conquer method for the symmetric tridiagonal eigenproblem
- Cuppen J. A divide and conquer method for the symmetric tridiagonal eigenproblem. Numer. Math. 36:1981;177-195.
- (1981) Numer. Math. , vol.36 , pp. 177-195
- Cuppen, J.¹

12
- 0002922503
- The performance of finding eigenvalues and eigenvectors of dense symmetric matrices on distributed memory computers
- SIAM, Philadelphia
- J. Demmel, K. Stanley, 1994, The performance of finding eigenvalues and eigenvectors of dense symmetric matrices on distributed memory computers, Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, SIAM, Philadelphia.
- (1994) Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing
- Demmel, J.¹ Stanley, K.²

13
- 0010866205
- Performance complexity of LU
- Desprez F., Tourancheau B., Dongarra J. J. Performance complexity of LU. Technical report. [LAPACK Working Note 67]:Feb. 1994.
- (1994) Technical Report , vol.67
- Desprez, F.¹ Tourancheau, B.² Dongarra, J.J.³

14
- 0003517895
- A proposal for a user-level message passing interface in a distributed memory environment
- ORNL/TM-12231:February
- Dongarra J., Hempel R., Hey A., Walker D. A proposal for a user-level message passing interface in a distributed memory environment. Technical Report. ORNL/TM-12231:February 1993.
- (1993) Technical Report
- Dongarra, J.¹ Hempel, R.² Hey, A.³ Walker, D.⁴

15
- 0026991394
- A look at scalable dense linear algebra libraries
- April IEEE Comput. Soc. Los Alamitos, CA
- J. Dongarra, R. van de Geijn, D. Walker, April 1992, A look at scalable dense linear algebra libraries, Scalable High-Performance Computing Conference, IEEE Comput. Soc. Los Alamitos, CA.
- (1992) Scalable High-Performance Computing Conference
- Dongarra, J.¹ Van De Geijn, R.² Walker, D.³

16
- 0010921884
- Fortran 90
- Fortran 90.

17
- 0003487728
- High performance Fortran language specification, version 1.0
- CRPC-TR92225:May
- Forum H. P. F. High performance Fortran language specification, version 1.0. Technical Report. CRPC-TR92225:May 1993.
- (1993) Technical Report
- Forum, H.P.F.¹

18
- 0028599384
- A compilation system that integrates high performance Fortran and Fortran M
- IEEE, New York
- I. Foster, M. Xu, B. Avalani, A. Chowdhary, 1994, A compilation system that integrates high performance Fortran and Fortran M, Scalable High Performance Computing Conference, pp. 293-300, IEEE, New York.
- (1994) Scalable High Performance Computing Conference , pp. 293-300
- Foster, I.¹ Xu, M.² Avalani, B.³ Chowdhary, A.⁴

19
- 0003287437
- Bounds for multiprocessor scheduling with resource constraints
- Garey M. R., Graham R. L. Bounds for multiprocessor scheduling with resource constraints. SIAM J. Comput. 4:1975;187-200.
- (1975) SIAM J. Comput. , vol.4 , pp. 187-200
- Garey, M.R.¹ Graham, R.L.²

20
- 0003645035
- Englewood Cliffs: Prentice-Hall
- George A., Liu J. Computer Solution of Large Sparse Positive Definite Systems. 1981;Prentice-Hall, Englewood Cliffs.
- (1981) Computer Solution of Large Sparse Positive Definite Systems
- George, A.¹ Liu, J.²

21
- 0027606922
- On the granularity and clustering of directed acyclic task graphs
- Gerasoulis A., Yang T. On the granularity and clustering of directed acyclic task graphs. IEEE Trans. Parallel Distrib. Syst. 4:1993;686-701.
- (1993) IEEE Trans. Parallel Distrib. Syst. , vol.4 , pp. 686-701
- Gerasoulis, A.¹ Yang, T.²

22
- 0010865720
- The analysis of a nested dissection algorithm
- Gilbert J., Tarjan R. The analysis of a nested dissection algorithm. Numer. Math. 50:1987;377-404.
- (1987) Numer. Math. , vol.50 , pp. 377-404
- Gilbert, J.¹ Tarjan, R.²

23
- 0014477093
- Bounds on multiprocessor timing anomalies
- Graham R. L. Bounds on multiprocessor timing anomalies. SIAM J. Appl. Math. 17:1969;416-429.
- (1969) SIAM J. Appl. Math. , vol.17 , pp. 416-429
- Graham, R.L.¹

24
- 0003487728
- High performance Fortran language specification version 1.0
- Draft, Jan.
- High Performance Fortran Forum, High performance Fortran language specification version 1.0, Draft, Jan. 1993.
- (1993) High Performance Fortran Forum

25
- 84976813879
- Compiling Fortran D for MIMD distributed-memory machines
- Hiranandani S., Kennedy K., Tseng C. Compiling Fortran D for MIMD distributed-memory machines. Comm. Assoc. Comput. Mach. 35:1992;66-80.
- (1992) Comm. Assoc. Comput. Mach. , vol.35 , pp. 66-80
- Hiranandani, S.¹ Kennedy, K.² Tseng, C.³

26
- 0030699816
- High performance Fortran for highly unstructured problems
- Hu Y. C., Johnsson S. L., Teng S.-H. High performance Fortran for highly unstructured problems. Principles and Practice of Parallel Programming (PPoPP). 1997.
- (1997) Principles and Practice of Parallel Programming (PPoPP)
- Hu, Y.C.¹ Johnsson, S.L.² Teng, S.-H.³

27
- 0023328834
- Communication efficient basic linear algebra computations on hypercube architectures
- Johnsson S. L. Communication efficient basic linear algebra computations on hypercube architectures. J. Parallel Distrib. Comput. 4:1987.
- (1987) J. Parallel Distrib. Comput. , vol.4
- Johnsson, S.L.¹

28
- 0010807779
- On the concurrency of C++
- May, Ontario, Canada
- X. Li, H. Huang, May 1993, On the concurrency of C++, Proceedings ICCI'93. Fifth International Conference on Computing and Information, pp. 215-219, Ontario, Canada.
- (1993) Proceedings ICCI'93. Fifth International Conference on Computing and Information , pp. 215-219
- Li, X.¹ Huang, H.²

29
- 0026840122
- The multifrontal method for sparse matrix solution: Theory and practice
- Liu J. W. H. The multifrontal method for sparse matrix solution: Theory and practice. SIAM Rev. 34:1992;82-109.
- (1992) SIAM Rev. , vol.34 , pp. 82-109
- Liu, J.W.H.¹

30
- 0028195126
- Scheduling malleable and nonmalleable parallel tasks
- ACM-SIAM, New York
- W. Ludwig, P. Tiwari, 1994, Scheduling malleable and nonmalleable parallel tasks, Symposium on Discrete Algorithms (SODA), pp. 167-176, ACM-SIAM, New York.
- (1994) Symposium on Discrete Algorithms (SODA) , pp. 167-176
- Ludwig, W.¹ Tiwari, P.²

31
- 0011612024
- Implementing an efficient portable global memory layer on distributed memory multiprocessors
- UCB/CSD-94-810:May
- Luna S. Implementing an efficient portable global memory layer on distributed memory multiprocessors. Technical Report. UCB/CSD-94-810:May 1994.
- (1994) Technical Report
- Luna, S.¹

32
- 0027868988
- Parallelization and distribution of a coupled atmosphere-ocean general circulation model
- Mechoso C. R., Ma C.-C., Farrara J., Spahr J. A., Moore R. W. Parallelization and distribution of a coupled atmosphere-ocean general circulation model. Monthly Weather Rev. 121:1993;2062-2076.
- (1993) Monthly Weather Rev. , vol.121 , pp. 2062-2076
- Mechoso, C.R.¹ Ma, C.-C.² Farrara, J.³ Spahr, J.A.⁴ Moore, R.W.⁵

33
- 0025418536
- Towards an architecture-independent analysis of parallel algorithms
- Papadimitriou C. H., Yannakakis M. Towards an architecture-independent analysis of parallel algorithms. SIAM J. Comput. 19:1990;322-328.
- (1990) SIAM J. Comput. , vol.19 , pp. 322-328
- Papadimitriou, C.H.¹ Yannakakis, M.²

34
- 84904357426
- A convex programming approach for exploiting data and functional parallelism on distributed memory multiprocessors
- IEEE, New York
- S. Ramaswamy, S. Sapatnekar, P. Banerjee, 1994, A convex programming approach for exploiting data and functional parallelism on distributed memory multiprocessors, International Conference on Parallel Processing (ICPP), IEEE, New York.
- (1994) International Conference on Parallel Processing (ICPP)
- Ramaswamy, S.¹ Sapatnekar, S.² Banerjee, P.³

35
- 0004116414
- New York: McGraw-Hill
- Rudin W. Real and Complex Analysis. 1974;McGraw-Hill, New York.
- (1974) Real and Complex Analysis
- Rudin, W.¹

36
- 0003764585
- University of California
- J. Rutter, 1994, A serial implementation of Cuppen's divide and conquer algorithm for the symmetric eigenvalue problem, University of California.
- (1994) A Serial Implementation of Cuppen's Divide and Conquer Algorithm for the Symmetric Eigenvalue Problem
- Rutter, J.¹

37
- 0026213832
- Automatic partitioning of a program dependence graph into parallel tasks
- Sarkar V. Automatic partitioning of a program dependence graph into parallel tasks. IBM J. Res. Devel. 35:1991.
- (1991) IBM J. Res. Devel. , vol.35
- Sarkar, V.¹

38
- 0010921885
- Modeling the performance of linear systems solvers on distributed memory multiprocessors
- Stanley K., Demmel J. Modeling the performance of linear systems solvers on distributed memory multiprocessors. Technical report. 1994.
- (1994) Technical Report
- Stanley, K.¹ Demmel, J.²

39
- 0027845715
- Exploiting task and data parallelism on a multicomputer
- New York: ACM-SIGPLAN. p. 13-22
- Subhlok J., Stichnoth J., O'Hallaron D., Gross T. Exploiting task and data parallelism on a multicomputer. Principles and Practice of Parallel Programming (PPoPP), San Diego. May 1993;ACM-SIGPLAN, New York. p. 13-22.
- (1993) Principles and Practice of Parallel Programming (PPoPP), San Diego
- Subhlok, J.¹ Stichnoth, J.² O'Hallaron, D.³ Gross, T.⁴

40
- 0029181476
- Optimal mapping of sequences of data parallel tasks
- p. 134-143
- Subhlok J., Vondran G. Optimal mapping of sequences of data parallel tasks. Principles and Practice of Parallel Programming (PPoPP). 1995; p. 134-143.
- (1995) Principles and Practice of Parallel Programming (PPoPP)
- Subhlok, J.¹ Vondran, G.²

41
- 0030655338
- A new model for integrated nested task and data parallel programming
- Subhlok J., Yang B. A new model for integrated nested task and data parallel programming. Principles and Practice of Parallel Programming (PPoPP). 1997.
- (1997) Principles and Practice of Parallel Programming (PPoPP)
- Subhlok, J.¹ Yang, B.²

42
- 0010931024
- Runtime array redistribution in HPF programs
- Thakur R., Choudhary A., Fox G. Runtime array redistribution in HPF programs. Technical Report. SCCS-601:1994.
- (1994) Technical Report , vol.601
- Thakur, R.¹ Choudhary, A.² Fox, G.³

43
- 0027796598
- Parallel timing simulation on a distributed memory multiprocessor
- November Santa Clara, CA
- C.-P. Wen, K. Yelick, November 1993, Parallel timing simulation on a distributed memory multiprocessor, International Conference on CAD, Santa Clara, CA.
- (1993) International Conference on CAD
- Wen, C.-P.¹ Yelick, K.²

44
- 0003687069
- Basic linear algebra communication subprograms: Analysis and implementation across multiple parallel architectures
- Whaley R. C. Basic linear algebra communication subprograms: Analysis and implementation across multiple parallel architectures. Technical report. [LAPACK Working Note 73]:June 1994.
- (1994) Technical Report , vol.73
- Whaley, R.C.¹

45
- 0010921886
- Basic linear algebra communication subroutines: Analysis and implementation across multiple parallel architectures
- Whaley R. C. Basic linear algebra communication subroutines: Analysis and implementation across multiple parallel architectures. Technical report. [LAPACK Working Note 73]:June 1994.
- (1994) Technical Report , vol.73
- Whaley, R.C.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.