메뉴 건너뛰기




Volumn , Issue , 2007, Pages 15-23

KAAPI: A thread scheduling runtime system for data flow computations on cluster of multi-processors

Author keywords

Cluster; Data flow; Multi core; Multi processor; Work stealing

Indexed keywords

ALGEBRA; CLUSTER ANALYSIS; COMPUTATION THEORY; MULTIPROCESSING SYSTEMS; PROBLEM SOLVING;

EID: 36849003422     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1278177.1278182     Document Type: Article
Times cited : (124)

References (39)
  • 4
    • 0003966887 scopus 로고
    • NESL: A Nested Data-Parallel Language
    • Technical Report CMU-CS-93-129, April
    • G.E. Blelloch. NESL: A Nested Data-Parallel Language. Technical Report CMU-CS-93-129, April 1993.
    • (1993)
    • Blelloch, G.E.1
  • 8
    • 0038036149 scopus 로고    scopus 로고
    • Space-efficient scheduling of multithreaded computations
    • R.D. Blumofe and C.E. Leiserson. Space-efficient scheduling of multithreaded computations. SIAM Journal on Computing, 1(27):202-229, 1997.
    • (1997) SIAM Journal on Computing , vol.1 , Issue.27 , pp. 202-229
    • Blumofe, R.D.1    Leiserson, C.E.2
  • 12
    • 0034421561 scopus 로고    scopus 로고
    • Efficient scheduling of strict multithreaded computations
    • P. Fatourou and P.G. Spirakis. Efficient scheduling of strict multithreaded computations. Theory of Computing Systems, 33(3):173-232, 2000.
    • (2000) Theory of Computing Systems , vol.33 , Issue.3 , pp. 173-232
    • Fatourou, P.1    Spirakis, P.G.2
  • 14
    • 0031622953 scopus 로고    scopus 로고
    • The implementation of the cilk-5 multithreaded language
    • M. Frigo, C.E. Leiserson, and K.H. Randall. The implementation of the cilk-5 multithreaded language. In Sigplan'98, pages 212-223, 1998.
    • (1998) Sigplan'98 , pp. 212-223
    • Frigo, M.1    Leiserson, C.E.2    Randall, K.H.3
  • 15
    • 84908538816 scopus 로고    scopus 로고
    • F. Galilée, J.-L. Roch, G. Cavalheiro, and M. Doreille. Athapascan-1: On-line building data flow graph in a parallel language. In IEEE, editor, Pact'98, pages 88-95, Paris, France, October 1998.
    • F. Galilée, J.-L. Roch, G. Cavalheiro, and M. Doreille. Athapascan-1: On-line building data flow graph in a parallel language. In IEEE, editor, Pact'98, pages 88-95, Paris, France, October 1998.
  • 16
    • 3042751916 scopus 로고    scopus 로고
    • Athapascan: Api for asynchronous parallel programming
    • Technical Report RR-0276, APACHE, INRIA Rhône-Alpes, February 2003
    • T. Gautier, R. Revire, and Roch. Athapascan: Api for asynchronous parallel programming. Technical Report RR-0276, APACHE, INRIA Rhône-Alpes, February 2003.
    • Gautier, T.1    Revire, R.2    Roch3
  • 18
    • 74049155634 scopus 로고    scopus 로고
    • Grid5000. http://www.grid5000.org.
    • Grid5000
  • 19
    • 0029774839 scopus 로고    scopus 로고
    • L. J. Hendren, G. R. Gao, X. Tang, Y Zhu, X. Xue, H. Cai, and P. Ouellet. Compiling c for the earth multithreaded architecture. In IEEE, editor, Pact'96, pages 12-23, Boston, USA, 1996.
    • L. J. Hendren, G. R. Gao, X. Tang, Y Zhu, X. Xue, H. Cai, and P. Ouellet. Compiling c for the earth multithreaded architecture. In IEEE, editor, Pact'96, pages 12-23, Boston, USA, 1996.
  • 20
    • 36849018472 scopus 로고    scopus 로고
    • High Performance Fortran Forum. High Performance Fortran language specification, version 1.0. Technical Report CRPC-TR.92225, Houston, Tex., 1993.
    • High Performance Fortran Forum. High Performance Fortran language specification, version 1.0. Technical Report CRPC-TR.92225, Houston, Tex., 1993.
  • 21
    • 0024647955 scopus 로고
    • Scheduling precedence graphs in systems with interprocessor communication times
    • J.-J. Hwang, Y.-C. Chow, F. D. Anger, and C.-Y. Lee. Scheduling precedence graphs in systems with interprocessor communication times. SIAM J. Comput., 18(2):244-257, 1989.
    • (1989) SIAM J. Comput , vol.18 , Issue.2 , pp. 244-257
    • Hwang, J.-J.1    Chow, Y.-C.2    Anger, F.D.3    Lee, C.-Y.4
  • 22
    • 27144432456 scopus 로고    scopus 로고
    • A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing
    • Lisboa, Portugal, August
    • S. Jafar, T. Gautier, A. Krings, and J-L. Roch. A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing. In Proceedings of (LNCS) EuroPar'05, Lisboa, Portugal, August 2005.
    • (2005) Proceedings of (LNCS) EuroPar'05
    • Jafar, S.1    Gautier, T.2    Krings, A.3    Roch, J.-L.4
  • 25
    • 36849058134 scopus 로고    scopus 로고
    • Kaapi. http://kaapi.gforge.inria.fr.
    • Kaapi
  • 27
    • 36849021558 scopus 로고    scopus 로고
    • A. Mainwaring and D. Culler. Active message applications programming interface and communication subsystem organization. Technical Report CSD-96-918.
    • A. Mainwaring and D. Culler. Active message applications programming interface and communication subsystem organization. Technical Report CSD-96-918.
  • 29
    • 36849000808 scopus 로고    scopus 로고
    • Scheduling threads for low space requirement and good locality. Number TR CMU-CS-99-121
    • may, Extended version of the paper published in Spaa'99
    • G.J. Narlikar. Scheduling threads for low space requirement and good locality. Number TR CMU-CS-99-121, may 1999. Extended version of the paper published in Spaa'99.
    • (1999)
    • Narlikar, G.J.1
  • 30
    • 36849073681 scopus 로고
    • Information Technology - Portable Operating Systems Interface (POSIX) - Part: System Application Program Interface (API) - Amendment 2: Threads Extension [C Language]
    • Institute of Electrical and Inc. Electronic Engineers, IEEE, New York, NY
    • Institute of Electrical and Inc. Electronic Engineers. Information Technology - Portable Operating Systems Interface (POSIX) - Part: System Application Program Interface (API) - Amendment 2: Threads Extension [C Language]. IEEE Standard 1003.1c-1995, IEEE, New York, NY, 1995.
    • (1995) IEEE Standard 1003.1c-1995
  • 31
    • 0040769741 scopus 로고    scopus 로고
    • Experimental analysis of the dual recursive bipartitioning algorithm for static mapping
    • Technical Report 1038-96
    • F. Pellegrini and J. Roman. Experimental analysis of the dual recursive bipartitioning algorithm for static mapping. Technical Report 1038-96, 1996.
    • (1996)
    • Pellegrini, F.1    Roman, J.2
  • 33
    • 35248844154 scopus 로고    scopus 로고
    • Efficient and easy parallel implementation of large numerical simulation
    • Springer, editor, Venice, Italy
    • R. Revire, F. Zara, and T. Gautier. Efficient and easy parallel implementation of large numerical simulation. In Springer, editor, Proceedings of ParSim03 of EuroPVM/MPI03, pages 663-666, Venice, Italy, 2003.
    • (2003) Proceedings of ParSim03 of EuroPVM/MPI03 , pp. 663-666
    • Revire, R.1    Zara, F.2    Gautier, T.3
  • 35
    • 36849008151 scopus 로고    scopus 로고
    • Takaken. http://www.ic-net.or.Jp/home/ta.kaken/e/queen.
    • Takaken
  • 36
    • 36849010181 scopus 로고    scopus 로고
    • TakTuk. http://taktuk.gforge.inria.fr.
    • TakTuk
  • 38
    • 24344485098 scopus 로고    scopus 로고
    • OSKI: A library of automatically tuned sparse matrix kernels
    • January
    • R. Vuduc, J. W. Demmel, and K. A. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. Journal of Physics Conference Series, 16:521-530, January 2005.
    • (2005) Journal of Physics Conference Series , vol.16 , pp. 521-530
    • Vuduc, R.1    Demmel, J.W.2    Yelick, K.A.3
  • 39
    • 0028498486 scopus 로고
    • DSC: Scheduling Parallel Tasks on an Unbounded Number of Processors
    • T. Yang and A. Gerasoulis. DSC: Scheduling Parallel Tasks on an Unbounded Number of Processors. IEEE Trans. Parallel Distrib. Syst., 5(9):951-967, 1994.
    • (1994) IEEE Trans. Parallel Distrib. Syst , vol.5 , Issue.9 , pp. 951-967
    • Yang, T.1    Gerasoulis, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.