메뉴 건너뛰기




Volumn , Issue , 2007, Pages

Multi-threading and one-sided communication in parallel LU factorization

Author keywords

Dense linear algebra; Latency tolerance; Multithreading

Indexed keywords

ALGEBRA; COMPUTERS; FACTORIZATION; HIGH PERFORMANCE LIQUID CHROMATOGRAPHY;

EID: 56749169455     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1362622.1362664     Document Type: Conference Paper
Times cited : (31)

References (34)
  • 5
    • 0038036149 scopus 로고    scopus 로고
    • Space-Efficient Scheduling of Multithreaded Computations
    • Blumofe R. and Leiserson C. Space-Efficient Scheduling of Multithreaded Computations. SIAM J. on Computing, 27, 1 (1998), 202-229.
    • (1998) SIAM J. on Computing , vol.27 , Issue.1 , pp. 202-229
    • Blumofe, R.1    Leiserson, C.2
  • 6
    • 56749116502 scopus 로고    scopus 로고
    • Bonachea D. GASNet Specification, v1. 1. U.C. Berkeley Technical Report CSD-02-1207, 2001.
    • Bonachea D. GASNet Specification, v1. 1. U.C. Berkeley Technical Report CSD-02-1207, 2001.
  • 7
    • 56749089227 scopus 로고    scopus 로고
    • Proposal for Extending the UPC Memory Copy Library Functions and Supporting Extensions to GASNet, vl.O
    • LBNL-54983, 2004
    • Bonachea D. Proposal for Extending the UPC Memory Copy Library Functions and Supporting Extensions to GASNet, vl.O. Lawrence Berkeley National Laboratory Technical Report LBNL-54983, 2004.
    • Lawrence Berkeley National Laboratory Technical Report
    • Bonachea, D.1
  • 12
    • 0030244536 scopus 로고    scopus 로고
    • Choi J., Dongarra J., Ostrouchov S., Petitet A., Walker D., and Whaley, R.C. The Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines. Scientific Programming, 5, (1996), 173-184.
    • Choi J., Dongarra J., Ostrouchov S., Petitet A., Walker D., and Whaley, R.C. The Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines. Scientific Programming, 5, (1996), 173-184.
  • 13
    • 56749144542 scopus 로고    scopus 로고
    • Ebcioglu K., Saraswat V., and Sarkar, V. X10: an Experimental Language for High Productivity Programming of Scalable Systems. In Proceedings of the P-PHEC 2005 Workshop, held in conjunction with HPCA 2005, 2005.
    • Ebcioglu K., Saraswat V., and Sarkar, V. X10: an Experimental Language for High Productivity Programming of Scalable Systems. In Proceedings of the P-PHEC 2005 Workshop, held in conjunction with HPCA 2005, 2005.
  • 17
    • 1542392269 scopus 로고    scopus 로고
    • On reducing TLB misses in matrix multiplication
    • Technical Report TR-2002-55, The University of Texas at Austin, Department of Computer Sciences, Also published as FLAME Working Note #9
    • Goto K. and van de Geijn R. On reducing TLB misses in matrix multiplication. Technical Report TR-2002-55, The University of Texas at Austin, Department of Computer Sciences, 2002. Also published as FLAME Working Note #9.
    • (2002)
    • Goto, K.1    van de Geijn, R.2
  • 18
  • 19
    • 0031273280 scopus 로고    scopus 로고
    • Recursion Leads to Automatic Variable Blocking for Dense Linear-Algebra Algorithms
    • Gustavson F. 1997. Recursion Leads to Automatic Variable Blocking for Dense Linear-Algebra Algorithms. IBM Journal of Research and Development, 41, 6 (1997), 737-755.
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 737-755
    • Gustavson, F.1
  • 21
    • 84976817516 scopus 로고
    • CHARM++ : A Portable Concurrent Object Oriented System Based On C++
    • Kale L. V. and Krishnan S. CHARM++ : A Portable Concurrent Object Oriented System Based On C++, ACM Sigplan Notes, 28, 10 (1993), 91-108.
    • (1993) ACM Sigplan Notes , vol.28 , Issue.10 , pp. 91-108
    • Kale, L.V.1    Krishnan, S.2
  • 24
    • 56749138808 scopus 로고    scopus 로고
    • Li X. and Demmel J. SuperLU-DIST: A Scalable Distributed-Memory Sparse Direct Solver for Unsymmetric Linear Systems. ACM TOMS, 31, 3 (2003), 110-140.
    • Li X. and Demmel J. SuperLU-DIST: A Scalable Distributed-Memory Sparse Direct Solver for Unsymmetric Linear Systems. ACM TOMS, 31, 3 (2003), 110-140.
  • 26
    • 56749112867 scopus 로고    scopus 로고
    • Luszczek P., Dongarra J., Koester D., Rabenseifner R., Lucas B., Kepner J., McCalpin J., Bailey D., and Takahashi D. Introduction to the HPC Challenge Benchmark Suite. SC2005 (submitted), Seattle, WA, 2005.
    • Luszczek P., Dongarra J., Koester D., Rabenseifner R., Lucas B., Kepner J., McCalpin J., Bailey D., and Takahashi D. Introduction to the HPC Challenge Benchmark Suite. SC2005 (submitted), Seattle, WA, 2005.
  • 27
    • 0031599142 scopus 로고    scopus 로고
    • Mersenne Twister: A 623-dimensionally equidistributed uniform pseudorandom number generator
    • Matsumoto M. and Nishimura T. Mersenne Twister: A 623-dimensionally equidistributed uniform pseudorandom number generator. ACM Transactions on Modeling and Computer Simulation, 8, 1 (1998), 3-30.
    • (1998) ACM Transactions on Modeling and Computer Simulation , vol.8 , Issue.1 , pp. 3-30
    • Matsumoto, M.1    Nishimura, T.2
  • 32
    • 56749134764 scopus 로고    scopus 로고
    • Snir M., Otto S., Huss-Lederman S., Walker D., and Dongarra J. MPI: The Complete Reference - 2nd Edition: 1. The MIT Press. ISBN 0-262-57123-4, 1998.
    • Snir M., Otto S., Huss-Lederman S., Walker D., and Dongarra J. MPI: The Complete Reference - 2nd Edition: Volume 1. The MIT Press. ISBN 0-262-57123-4, 1998.
  • 34
    • 34447571243 scopus 로고    scopus 로고
    • UPC Consortium, Available at:, 2005
    • UPC Consortium. UPC Language Specifications, v1.2. Available at: http://upc.lbl.gov/docs/user/upc-spec-1.2.pdf, 2005.
    • UPC Language Specifications, v1.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.