메뉴 건너뛰기




Volumn , Issue , 2007, Pages 116-125

Representation-transparent matrix algorithms with scalable performance

Author keywords

Dilated integers; Doppled integers; Matrix template library; Morton order

Indexed keywords

COMPUTER SIMULATION; ITERATIVE METHODS; LINEAR ALGEBRA; NATURAL SCIENCES COMPUTING; OBJECT ORIENTED PROGRAMMING;

EID: 34548020763     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1274971.1274989     Document Type: Conference Paper
Times cited : (35)

References (25)
  • 1
    • 33748317896 scopus 로고    scopus 로고
    • Fast additions on masked integers
    • May
    • M. D. Adams and D. S. Wise. Fast additions on masked integers. SIGPLAN Not., 41(5):39-45, May 2006. http://doi.acm.org/10.1145/1149982.1149987
    • (2006) SIGPLAN Not , vol.41 , Issue.5 , pp. 39-45
    • Adams, M.D.1    Wise, D.S.2
  • 2
    • 34547509407 scopus 로고    scopus 로고
    • Seven at one stroke: Results from a cache-oblivious paradigm for scalable matrix algorithms. In AiSPC
    • ACM Press, New York, Oct
    • M. D. Adams and D. S. Wise. Seven at one stroke: Results from a cache-oblivious paradigm for scalable matrix algorithms. In AiSPC '06: Proc. 2006 Wkshp. Memory System Performance and Correctness, pages 41-50. ACM Press, New York, Oct. 2006. http://doi.acm.org/10.1146/1178597.1178604
    • (2006) 06: Proc. 2006 Wkshp. Memory System Performance and Correctness , pp. 41-50
    • Adams, M.D.1    Wise, D.S.2
  • 3
    • 34548028138 scopus 로고    scopus 로고
    • Advanced Micro Devices, Inc, Sunnyvale, CA. AMD Core Math Library ACML
    • Advanced Micro Devices, Inc., Sunnyvale, CA. AMD Core Math Library (ACML), 2006. http://developer.and.com/acml.jsp
    • (2006)
  • 4
    • 34548048927 scopus 로고    scopus 로고
    • Families of algorithms related to the inversion of a symmetric positive definite matrix
    • Accepted upon revisions
    • P. Bientinesi, B. Gunter, and R. A. van de Geijn. Families of algorithms related to the inversion of a symmetric positive definite matrix. ACM Trans. Math. Softw., 2007. Accepted upon revisions. http://www.cs.utexas.edu/ueers/ pauldj/pubs/TOMS.SPD.pdf
    • (2007) ACM Trans. Math. Softw
    • Bientinesi, P.1    Gunter, B.2    van de Geijn, R.A.3
  • 6
    • 0036870763 scopus 로고    scopus 로고
    • Recursive array layouts and fast parallel matrix multiplication
    • Nov
    • S. Chatterjee, A. R. Lebeck, P. K. Patnala, and M. Thottenthodi. Recursive array layouts and fast parallel matrix multiplication. IEEE Trans. Parallel Distrib. Syst., 13(11):1105-1123, Nov. 2002. http://dx.doi.org/10. 1109/TPDS.2002.1058095
    • (2002) IEEE Trans. Parallel Distrib. Syst , vol.13 , Issue.11 , pp. 1105-1123
    • Chatterjee, S.1    Lebeck, A.R.2    Patnala, P.K.3    Thottenthodi, M.4
  • 11
    • 49149109685 scopus 로고    scopus 로고
    • Anatomy of high-performance matrix multiplication
    • Technical report, Univ. of Texas, Austin. Submittted for publication. Visited Sept
    • K. Goto and R. A. van de Geijn. Anatomy of high-performance matrix multiplication. Technical report, Univ. of Texas, Austin. Submittted for publication. Visited Sept. 2006. http://www.cs.utexas.edu/ueers/ilame/pubs/GOTO. TOMS.pdf
    • (2006)
    • Goto, K.1    van de Geijn, R.A.2
  • 12
    • 63449107836 scopus 로고    scopus 로고
    • Fundamental algebraic concepts in concept-enabled G++
    • Technical Report 638, Indiana University, Oct
    • P. Gottschling. Fundamental algebraic concepts in concept-enabled G++. Technical Report 638, Indiana University, Oct. 2006. http://www.cs.indiana.edu/ cgi-bin/techreports/TRNNN.cgi?trnum-TR638
    • (2006)
    • Gottschling, P.1
  • 13
    • 34248336283 scopus 로고    scopus 로고
    • Analyzing block locality in Morton-order and Morton-hybrid matrices
    • P. Foglia, C. A. Prete, S. B. Bartolini, and R. Giorgi, editors, ACM Press, New York, Sept
    • K. P. Lorton and D. S. Wise. Analyzing block locality in Morton-order and Morton-hybrid matrices. In P. Foglia, C. A. Prete, S. B. Bartolini, and R. Giorgi, editors, Proc. 7th MEDEA Wkshp. MEmory performance: DEaling with Applications, systems and architecture, pages 5-12. ACM Press, New York, Sept. 2006. http://doi.acm.org/10.1146/1166133.1166134
    • (2006) Proc. 7th MEDEA Wkshp. MEmory performance: DEaling with Applications, systems and architecture , pp. 5-12
    • Lorton, K.P.1    Wise, D.S.2
  • 15
    • 0003460690 scopus 로고
    • A computer oriented geodetic data base and a new technique in file sequencing
    • Technical report, IBM Ltd, Ottawa, Ontario, Mar
    • G. M. Morton. A computer oriented geodetic data base and a new technique in file sequencing. Technical report, IBM Ltd., Ottawa, Ontario, Mar. 1966.
    • (1966)
    • Morton, G.M.1
  • 17
    • 0042235298 scopus 로고    scopus 로고
    • Tiling, block data layout, and memory hierarchy performance
    • July
    • N. Park, B. Hong, and V. K. Prasanna. Tiling, block data layout, and memory hierarchy performance. IEEE Trans. Parallel Distrib. Syst., 14(7):640-654, July 2003. http://dx.doi.org/10.1109/TPDS.2003.1214317
    • (2003) IEEE Trans. Parallel Distrib. Syst , vol.14 , Issue.7 , pp. 640-654
    • Park, N.1    Hong, B.2    Prasanna, V.K.3
  • 19
    • 0000058088 scopus 로고
    • Finding neighbors of equal size in linear quadtrees and octrees in constant time
    • May
    • G. Schrack. Finding neighbors of equal size in linear quadtrees and octrees in constant time. CVGIP: Image Underst, 55(3):221-230, May 1992.
    • (1992) CVGIP: Image Underst , vol.55 , Issue.3 , pp. 221-230
    • Schrack, G.1
  • 20
    • 84947577532 scopus 로고    scopus 로고
    • The matrix template library: A generic programming approach to high performance numerical linear algebra
    • D. Caromel, R. R. Oldehoeft, and M. Tholburn, editors, Computing in Object-Oriented Parallel Environments, of, Springer, Berlin
    • J. G. Siek and A. Lumsdaine. The matrix template library: A generic programming approach to high performance numerical linear algebra. In D. Caromel, R. R. Oldehoeft, and M. Tholburn, editors, Computing in Object-Oriented Parallel Environments, volume 1505 of Lecture Notes in Comput. Sci., pages 59-70. Springer, Berlin, 1998. http://springerink.metapress. com/link.asp?id-95b3nt4qngm2kj8d
    • (1998) Lecture Notes in Comput. Sci , vol.1505 , pp. 59-70
    • Siek, J.G.1    Lumsdaine, A.2
  • 21
    • 0017095793 scopus 로고
    • Untersuchungen des Zeitgewinns durch neue Algorithmen zur Matrix-Multiplication
    • J. Spieß. Untersuchungen des Zeitgewinns durch neue Algorithmen zur Matrix-Multiplication. Computing, 17:23-36, 1976.
    • (1976) Computing , vol.17 , pp. 23-36
    • Spieß, J.1
  • 22
    • 31744441193 scopus 로고
    • The Standard Template Library -how do you build an algorithm that is both generic and efficient?
    • Oct
    • A. Stepanov. The Standard Template Library -how do you build an algorithm that is both generic and efficient? Byte Magazine, 20(10), Oct. 1995.
    • (1995) Byte Magazine , vol.20 , Issue.10
    • Stepanov, A.1
  • 23
    • 33748473004 scopus 로고    scopus 로고
    • Is Morton layout competitive for large two-dimensional arrays, yet?
    • Sept
    • J. Thiyagalingam, O. Beckmann, and P. H. J. Kelly. Is Morton layout competitive for large two-dimensional arrays, yet? Concur. Comput Prac. Exper., 18(11):1509-1539, Sept. 2006. http://dx.doi.org/10.1002/cpe.1018
    • (2006) Concur. Comput Prac. Exper , vol.18 , Issue.11 , pp. 1509-1539
    • Thiyagalingam, J.1    Beckmann, O.2    Kelly, P.H.J.3
  • 24
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the ATLAS project
    • Jan
    • R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, Jan. 2001. http://dx.doi.org/10.1016/80167-8191(00)00087-9
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.