-
1
-
-
33748317896
-
Fast additions on masked integers
-
May
-
M. D. Adams and D. S. Wise. Fast additions on masked integers. SIGPLAN Not., 41(5):39-45, May 2006. http://doi.acm.org/10.1145/1149982.1149987
-
(2006)
SIGPLAN Not
, vol.41
, Issue.5
, pp. 39-45
-
-
Adams, M.D.1
Wise, D.S.2
-
2
-
-
34547509407
-
Seven at one stroke: Results from a cache-oblivious paradigm for scalable matrix algorithms. In AiSPC
-
ACM Press, New York, Oct
-
M. D. Adams and D. S. Wise. Seven at one stroke: Results from a cache-oblivious paradigm for scalable matrix algorithms. In AiSPC '06: Proc. 2006 Wkshp. Memory System Performance and Correctness, pages 41-50. ACM Press, New York, Oct. 2006. http://doi.acm.org/10.1146/1178597.1178604
-
(2006)
06: Proc. 2006 Wkshp. Memory System Performance and Correctness
, pp. 41-50
-
-
Adams, M.D.1
Wise, D.S.2
-
3
-
-
34548028138
-
-
Advanced Micro Devices, Inc, Sunnyvale, CA. AMD Core Math Library ACML
-
Advanced Micro Devices, Inc., Sunnyvale, CA. AMD Core Math Library (ACML), 2006. http://developer.and.com/acml.jsp
-
(2006)
-
-
-
4
-
-
34548048927
-
Families of algorithms related to the inversion of a symmetric positive definite matrix
-
Accepted upon revisions
-
P. Bientinesi, B. Gunter, and R. A. van de Geijn. Families of algorithms related to the inversion of a symmetric positive definite matrix. ACM Trans. Math. Softw., 2007. Accepted upon revisions. http://www.cs.utexas.edu/ueers/ pauldj/pubs/TOMS.SPD.pdf
-
(2007)
ACM Trans. Math. Softw
-
-
Bientinesi, P.1
Gunter, B.2
van de Geijn, R.A.3
-
5
-
-
0032659795
-
Recursive array layouts and fast parallel matrix multiplication
-
ACM Press, New York, June
-
S. Chatterjee, A. R. Lebeck, P. K. Patnala, and M. Thottenthodi. Recursive array layouts and fast parallel matrix multiplication. In Proc. 11th ACM Symp. Parallel Algorithms and Architectures, pages 222-231. ACM Press, New York, June 1999. http://doi.acm.org/10.1145/305619.306845
-
(1999)
Proc. 11th ACM Symp. Parallel Algorithms and Architectures
, pp. 222-231
-
-
Chatterjee, S.1
Lebeck, A.R.2
Patnala, P.K.3
Thottenthodi, M.4
-
6
-
-
0036870763
-
Recursive array layouts and fast parallel matrix multiplication
-
Nov
-
S. Chatterjee, A. R. Lebeck, P. K. Patnala, and M. Thottenthodi. Recursive array layouts and fast parallel matrix multiplication. IEEE Trans. Parallel Distrib. Syst., 13(11):1105-1123, Nov. 2002. http://dx.doi.org/10. 1109/TPDS.2002.1058095
-
(2002)
IEEE Trans. Parallel Distrib. Syst
, vol.13
, Issue.11
, pp. 1105-1123
-
-
Chatterjee, S.1
Lebeck, A.R.2
Patnala, P.K.3
Thottenthodi, M.4
-
8
-
-
0003555195
-
-
Society for Industrial and Applied Mathematics, Philadelphia
-
J. J. Dongarra, C. B. Moler, J. R. Bunch, and G. W. Stewart. LINPACK Users' Guide. Society for Industrial and Applied Mathematics, Philadelphia, 1979.
-
(1979)
LINPACK Users' Guide
-
-
Dongarra, J.J.1
Moler, C.B.2
Bunch, J.R.3
Stewart, G.W.4
-
9
-
-
0030688479
-
Auto-blocking matrix multiplication, or tracking BLAS3 performance from source code
-
July
-
J. D. Frens and D. S. Wise. Auto-blocking matrix multiplication, or tracking BLAS3 performance from source code. Proc. 6th ACM SIGPLAN Symp. on Principles and Practice of Parallel Program., SIGPLAN Not., 32(7):206-216, July 1997. http://doi.acm.org/10.1146/263764.263789
-
(1997)
Proc. 6th ACM SIGPLAN Symp. on Principles and Practice of Parallel Program., SIGPLAN Not
, vol.32
, Issue.7
, pp. 206-216
-
-
Frens, J.D.1
Wise, D.S.2
-
10
-
-
0033350255
-
Cache-oblivious algorithms
-
IEEE Computer Soc. Press, Washington, DC, Oct
-
M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran. Cache-oblivious algorithms. In Proc. 40th Ann. Symp. Foundations of Computer Science, pages 285-298. IEEE Computer Soc. Press, Washington, DC, Oct. 1999. http://dx.doi.org/10.1109/8FFCS.1999.814600
-
(1999)
Proc. 40th Ann. Symp. Foundations of Computer Science
, pp. 285-298
-
-
Frigo, M.1
Leiserson, C.E.2
Prokop, H.3
Ramachandran, S.4
-
11
-
-
49149109685
-
Anatomy of high-performance matrix multiplication
-
Technical report, Univ. of Texas, Austin. Submittted for publication. Visited Sept
-
K. Goto and R. A. van de Geijn. Anatomy of high-performance matrix multiplication. Technical report, Univ. of Texas, Austin. Submittted for publication. Visited Sept. 2006. http://www.cs.utexas.edu/ueers/ilame/pubs/GOTO. TOMS.pdf
-
(2006)
-
-
Goto, K.1
van de Geijn, R.A.2
-
12
-
-
63449107836
-
Fundamental algebraic concepts in concept-enabled G++
-
Technical Report 638, Indiana University, Oct
-
P. Gottschling. Fundamental algebraic concepts in concept-enabled G++. Technical Report 638, Indiana University, Oct. 2006. http://www.cs.indiana.edu/ cgi-bin/techreports/TRNNN.cgi?trnum-TR638
-
(2006)
-
-
Gottschling, P.1
-
13
-
-
34248336283
-
Analyzing block locality in Morton-order and Morton-hybrid matrices
-
P. Foglia, C. A. Prete, S. B. Bartolini, and R. Giorgi, editors, ACM Press, New York, Sept
-
K. P. Lorton and D. S. Wise. Analyzing block locality in Morton-order and Morton-hybrid matrices. In P. Foglia, C. A. Prete, S. B. Bartolini, and R. Giorgi, editors, Proc. 7th MEDEA Wkshp. MEmory performance: DEaling with Applications, systems and architecture, pages 5-12. ACM Press, New York, Sept. 2006. http://doi.acm.org/10.1146/1166133.1166134
-
(2006)
Proc. 7th MEDEA Wkshp. MEmory performance: DEaling with Applications, systems and architecture
, pp. 5-12
-
-
Lorton, K.P.1
Wise, D.S.2
-
15
-
-
0003460690
-
A computer oriented geodetic data base and a new technique in file sequencing
-
Technical report, IBM Ltd, Ottawa, Ontario, Mar
-
G. M. Morton. A computer oriented geodetic data base and a new technique in file sequencing. Technical report, IBM Ltd., Ottawa, Ontario, Mar. 1966.
-
(1966)
-
-
Morton, G.M.1
-
17
-
-
0042235298
-
Tiling, block data layout, and memory hierarchy performance
-
July
-
N. Park, B. Hong, and V. K. Prasanna. Tiling, block data layout, and memory hierarchy performance. IEEE Trans. Parallel Distrib. Syst., 14(7):640-654, July 2003. http://dx.doi.org/10.1109/TPDS.2003.1214317
-
(2003)
IEEE Trans. Parallel Distrib. Syst
, vol.14
, Issue.7
, pp. 640-654
-
-
Park, N.1
Hong, B.2
Prasanna, V.K.3
-
19
-
-
0000058088
-
Finding neighbors of equal size in linear quadtrees and octrees in constant time
-
May
-
G. Schrack. Finding neighbors of equal size in linear quadtrees and octrees in constant time. CVGIP: Image Underst, 55(3):221-230, May 1992.
-
(1992)
CVGIP: Image Underst
, vol.55
, Issue.3
, pp. 221-230
-
-
Schrack, G.1
-
20
-
-
84947577532
-
The matrix template library: A generic programming approach to high performance numerical linear algebra
-
D. Caromel, R. R. Oldehoeft, and M. Tholburn, editors, Computing in Object-Oriented Parallel Environments, of, Springer, Berlin
-
J. G. Siek and A. Lumsdaine. The matrix template library: A generic programming approach to high performance numerical linear algebra. In D. Caromel, R. R. Oldehoeft, and M. Tholburn, editors, Computing in Object-Oriented Parallel Environments, volume 1505 of Lecture Notes in Comput. Sci., pages 59-70. Springer, Berlin, 1998. http://springerink.metapress. com/link.asp?id-95b3nt4qngm2kj8d
-
(1998)
Lecture Notes in Comput. Sci
, vol.1505
, pp. 59-70
-
-
Siek, J.G.1
Lumsdaine, A.2
-
21
-
-
0017095793
-
Untersuchungen des Zeitgewinns durch neue Algorithmen zur Matrix-Multiplication
-
J. Spieß. Untersuchungen des Zeitgewinns durch neue Algorithmen zur Matrix-Multiplication. Computing, 17:23-36, 1976.
-
(1976)
Computing
, vol.17
, pp. 23-36
-
-
Spieß, J.1
-
22
-
-
31744441193
-
The Standard Template Library -how do you build an algorithm that is both generic and efficient?
-
Oct
-
A. Stepanov. The Standard Template Library -how do you build an algorithm that is both generic and efficient? Byte Magazine, 20(10), Oct. 1995.
-
(1995)
Byte Magazine
, vol.20
, Issue.10
-
-
Stepanov, A.1
-
23
-
-
33748473004
-
Is Morton layout competitive for large two-dimensional arrays, yet?
-
Sept
-
J. Thiyagalingam, O. Beckmann, and P. H. J. Kelly. Is Morton layout competitive for large two-dimensional arrays, yet? Concur. Comput Prac. Exper., 18(11):1509-1539, Sept. 2006. http://dx.doi.org/10.1002/cpe.1018
-
(2006)
Concur. Comput Prac. Exper
, vol.18
, Issue.11
, pp. 1509-1539
-
-
Thiyagalingam, J.1
Beckmann, O.2
Kelly, P.H.J.3
-
24
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Jan
-
R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, Jan. 2001. http://dx.doi.org/10.1016/80167-8191(00)00087-9
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
25
-
-
0034819362
-
Language support for Morton-order matrices
-
July
-
D. S. Wise, J. D. Frens, Y. Gu, and G. A. Alexander. Language support for Morton-order matrices. Proc. 8th ACM SIGPLAN Symp. on Principles and Practice of Parallel Program., SIGPLAN Not., 36(7):24-33, July 2001. http://doi.acm.org/10.1146/879639.379669
-
(2001)
Proc. 8th ACM SIGPLAN Symp. on Principles and Practice of Parallel Program., SIGPLAN Not
, vol.36
, Issue.7
, pp. 24-33
-
-
Wise, D.S.1
Frens, J.D.2
Gu, Y.3
Alexander, G.A.4
|