메뉴 건너뛰기




Volumn , Issue , 2008, Pages 123-132

SuperMatrix: A multithreaded runtime scheduling system for algorithms-by-blocks

Author keywords

Algorithms by blocks; Dependency analysis; Dynamic scheduling; Out of order execution

Indexed keywords

ALGORITHMS-BY-BLOCKS; CC-NUMA; DEPENDENCY ANALYSIS; DYNAMIC SCHEDULING; EXCELLENT PERFORMANCE; HIGH LEVEL OF ABSTRACTION; LIBRARY DEVELOPERS; MATRIX OPERATIONS; MICRO ARCHITECTURES; MULTICORE ARCHITECTURES; MULTITHREADED; OUT-OF-ORDER EXECUTION; PARALLELIZATIONS; RUN-TIME SCHEDULING; RUNTIME SYSTEMS; SCHEDULING OPERATIONS; SEPARATION OF CONCERNS; SUPERSCALAR;

EID: 67650056933     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (70)

References (35)
  • 1
    • 0037834788 scopus 로고    scopus 로고
    • OpenMP issues arising in the development of parallel BLAS and LAPACK libraries
    • C. Addison, Y. Ren, and M. van Waveren. OpenMP issues arising in the development of parallel BLAS and LAPACK libraries. Scientific Programming, 11(2), 2003.
    • (2003) Scientific Programming , vol.11 , Issue.2
    • Addison, C.1    Ren, Y.2    Van Waveren, M.3
  • 8
    • 17644370328 scopus 로고    scopus 로고
    • Representing linear algebra algorithms in code: The FLAME application programming interfaces
    • March
    • Paolo Bientinesi, Enrique S. Quintana-Orti, and Robert A. van de Geijn. Representing linear algebra algorithms in code: The FLAME application programming interfaces. ACM Transaction? on Mathematical Software, 31(1):27-59, March 2005.
    • (2005) ACM Transaction? on Mathematical Software , vol.31 , Issue.1 , pp. 27-59
    • Bientinesi, P.1    Quintana-Orti, E.S.2    Van De Geijn, R.A.3
  • 16
    • 1842832833 scopus 로고    scopus 로고
    • Recursive blocked algorithms and hybrid data structures for dense matrix library software
    • Erik Elmroth, Fred Gustavson, Isak Jonsson, and Bo Kagstrom. Recursive blocked algorithms and hybrid data structures for dense matrix library software. SIAMReview, 46(1):3-45, 2004.
    • (2004) SIAMReview , vol.46 , Issue.1 , pp. 3-45
    • Elmroth, E.1    Gustavson, F.2    Jonsson, I.3    Kagstrom, B.4
  • 17
    • 79959386800 scopus 로고    scopus 로고
    • Kazushige Goto. http ://www.tace.utexas.edu/resources/software.
  • 21
    • 35248867212 scopus 로고
    • BLAS based on block data structures
    • Cornell University, February
    • Greg Henry. BLAS based on block data structures. Theory Center Technical Report CTC92TR89, Cornell University, February 1992.
    • (1992) Theory Center Technical Report CTC92TR89
    • Henry, G.1
  • 24
    • 50249166476 scopus 로고    scopus 로고
    • Solving systems of linear equations on the Cell processor using Cholesky factorization
    • Innovative Computing Laboratory, University of Tennesse, April
    • Jakub Kurzak, Alfredo Buttari, and Jack Dongarra. Solving systems of linear equations on the Cell processor using Cholesky factorization. Technical Report UT-CS-07-596, Innovative Computing Laboratory, University of Tennesse, April 2007.
    • (2007) Technical Report UT-CS-07-596
    • Kurzak, J.1    Buttari, A.2    Dongarra, J.3
  • 25
    • 35248868578 scopus 로고    scopus 로고
    • Implementing linear algebra routines on multi-core processors with pipelining and a look ahead
    • University of Tennessee, September
    • Jakub Kurzak and Jack Dongarra. Implementing linear algebra routines on multi-core processors with pipelining and a look ahead. LAPACK Working Note 178 Technical Report UT-CS-06-581, University of Tennessee, September 2006.
    • (2006) LAPACK Working Note 178 Technical Report UT-CS-06-581
    • Kurzak, J.1    Dongarra, J.2
  • 26
    • 0012525494 scopus 로고    scopus 로고
    • Programming parallel applications in CiIk
    • Charles Leiserson and Aske Plaat. Programming parallel applications in CiIk. SINEWS: SIAM News, 31, 1998.
    • (1998) SINEWS: SIAM News , vol.31
    • Leiserson, C.1    Plaat, A.2
  • 27
    • 47349106165 scopus 로고    scopus 로고
    • An API for manipulating matrices stored by blocks
    • Department of Computer Sciences, The University of Texas at Austin, May
    • Tze Meng Low and Robert van de Geijn. An API for manipulating matrices stored by blocks. FLAME Working Note #12 TR-2004-15, Department of Computer Sciences, The University of Texas at Austin, May 2004.
    • (2004) FLAME Working Note #12 TR-2004-15
    • Low, T.M.1    Van De Geijn, R.2
  • 31
    • 0035003299 scopus 로고    scopus 로고
    • A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization
    • June
    • Peter Strazdins. A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization. International Journal of Parallel and Distributed Systems and Networks, 4(1):26-35, June 2001.
    • (2001) International Journal of Parallel and Distributed Systems and Networks , vol.4 , Issue.1 , pp. 26-35
    • Strazdins, P.1
  • 32
    • 0003081830 scopus 로고
    • An efficient algorithm for exploiting multiple arithmetic units
    • R. Tomasulo. An efficient algorithm for exploiting multiple arithmetic units. IBM Journal of Research and Development, 11(1), 1967.
    • (1967) IBM Journal of Research and Development , vol.11 , Issue.1
    • Tomasulo, R.1
  • 33
    • 0037173976 scopus 로고    scopus 로고
    • A framework for highperformance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels
    • Vinod Valsalam and Anthony Skjellum. A framework for highperformance matrix multiplication based on hierarchical abstractions, algorithms and optimized low-level kernels. Concurrency and Computation: Practice and Experience, 14(10):805-840, 2002.
    • (2002) Concurrency and Computation: Practice and Experience , vol.14 , Issue.10 , pp. 805-840
    • Valsalam, V.1    Skjellum, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.