-
1
-
-
84973744454
-
Large dense numerical linear algebra in 1993: The parallel computing influence
-
A. Edelman, "Large Dense Numerical Linear Algebra in 1993: The Parallel Computing Influence," Int'l J. Supercomputer Applications, vol. 7, pp. 113-128, 1993.
-
(1993)
Int'l J. Supercomputer Applications
, vol.7
, pp. 113-128
-
-
Edelman, A.1
-
2
-
-
0029324485
-
Software libraries for linear algebra computations on high performance computers
-
J. J. Dongarra and D. W. Walker, "Software Libraries for Linear Algebra Computations on High Performance Computers," SIAM Rev., vol. 37, pp. 151-180, 1995.
-
(1995)
SIAM Rev.
, vol.37
, pp. 151-180
-
-
Dongarra, J.J.1
Walker, D.W.2
-
3
-
-
0000667923
-
The torus-wrap mapping for dense matrix calculations on massively parallel computers
-
B. A. Hendrickson and D. E. Womble, "The Torus-Wrap Mapping for Dense Matrix Calculations on Massively Parallel Computers," SIAM J. Scientific Computing, vol. 15, no. 5, pp. 1201-1226, 1994.
-
(1994)
SIAM J. Scientific Computing
, vol.15
, Issue.5
, pp. 1201-1226
-
-
Hendrickson, B.A.1
Womble, D.E.2
-
4
-
-
0025448609
-
Origin and development of the method of moments for field computation
-
DOI 10.1109/74.80522
-
R. Harrington, "Origin and Development of the Method of Moments for Field Computation," IEEE Antennas and Propagation Magazine, vol. 32, no. 3, pp. 31-35, June 1990. (Pubitemid 20725243)
-
(1990)
IEEE Antennas and Propagation Magazine
, vol.32
, Issue.3
, pp. 31-35
-
-
Harrington Roger1
-
5
-
-
0000589993
-
Panel methods in computational fluid dynamics
-
Jan.
-
J. L. Hess, "Panel Methods in Computational Fluid Dynamics," Ann. Rev. of Fluid Mechanics, vol. 22, pp. 225-274, Jan. 1990.
-
(1990)
Ann. Rev. of Fluid Mechanics
, vol.22
, pp. 225-274
-
-
Hess, J.L.1
-
7
-
-
84985321100
-
Stability of block LU factorization
-
J. W. Demmel, N. J. Higham, and R. S. Schreiber, "Stability of Block LU Factorization," Numerical Linear Algebra with Applications, vol. 2, no. 2, pp. 173-190, 1995.
-
(1995)
Numerical Linear Algebra with Applications
, vol.2
, Issue.2
, pp. 173-190
-
-
Demmel, J.W.1
Higham, N.J.2
Schreiber, R.S.3
-
8
-
-
0026913668
-
Stability of block algorithms with fast level-3 BLAS
-
Sept.
-
J. W. Demmel and N. J. Higham, "Stability of Block Algorithms with Fast Level-3 BLAS," ACM Trans. Math. Software, vol. 18, no. 3, pp. 274-291, Sept. 1992.
-
(1992)
ACM Trans. Math. Software
, vol.18
, Issue.3
, pp. 274-291
-
-
Demmel, J.W.1
Higham, N.J.2
-
12
-
-
0039821550
-
On the parallelization of blocked LU factorization algorithms on distributed memory architectures
-
G. von Laszewski, M. Parashar, A. G. Mohamed, and G. C. Fox, "On the Parallelization of Blocked LU Factorization Algorithms on Distributed Memory Architectures," Supercomputing'92: Proc. ACM/IEEE Conf. Supercomputing, pp. 170-179, 1992.
-
(1992)
Supercomputing'92: Proc. ACM/IEEE Conf. Supercomputing
, pp. 170-179
-
-
Von Laszewski, G.1
Parashar, M.2
Mohamed, A.G.3
Fox, G.C.4
-
13
-
-
45449117672
-
Implementation and optimization of dense LU decomposition on the stream processor
-
R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, eds., Springer
-
Y. Zhang, T. Tang, G. Li, and X. Yang, "Implementation and Optimization of Dense LU Decomposition on the Stream Processor," Parallel Processing and Applied Mathematics, R. Wyrzykowski, J. Dongarra, K. Karczewski, and J. Wasniewski, eds., pp. 78-88, Springer, 2008.
-
(2008)
Parallel Processing and Applied Mathematics
, pp. 78-88
-
-
Zhang, Y.1
Tang, T.2
Li, G.3
Yang, X.4
-
14
-
-
82555164731
-
Multi-FPGA based high performance LU decomposition
-
Sept.
-
A. Sudarsanam, S. Young, A. Dasu, and T. Hauser, "Multi-FPGA Based High Performance LU Decomposition," Proc. 10th High Performance Embedded Computing (HPEC) Workshop, Sept. 2006.
-
(2006)
Proc. 10th High Performance Embedded Computing (HPEC) Workshop
-
-
Sudarsanam, A.1
Young, S.2
Dasu, A.3
Hauser, T.4
-
17
-
-
12444323064
-
A high-performance and energy-efficient architecture for floating-point based LU decomposition on FPGAs
-
Apr.
-
G. Govindu, S. Choi, V. Prasanna, V. Daga, S. Gangadharpalli, and V. Sridhar, "A High-Performance and Energy-Efficient Architecture for Floating-Point Based LU Decomposition on FPGAs," Proc. 18th Int'l Parallel and Distributed Processing Symp., p. 149, Apr. 2004.
-
(2004)
Proc. 18th Int'l Parallel and Distributed Processing Symp.
, pp. 149
-
-
Govindu, G.1
Choi, S.2
Prasanna, V.3
Daga, V.4
Gangadharpalli, S.5
Sridhar, V.6
-
18
-
-
47049109081
-
High-performance designs for linear algebra operations on reconfigurable hardware
-
Aug.
-
L. Zhuo and V. K. Prasanna, "High-Performance Designs for Linear Algebra Operations on Reconfigurable Hardware," IEEE Trans. Computers, vol. 57, no. 8, pp. 1057-1071, Aug. 2008.
-
(2008)
IEEE Trans. Computers
, vol.57
, Issue.8
, pp. 1057-1071
-
-
Zhuo, L.1
Prasanna, V.K.2
-
19
-
-
63049121558
-
Portable and scalable FPGA-based acceleration of a direct linear system solver
-
Dec.
-
W. Zhang, V. Betz, and J. Rose, "Portable and Scalable FPGA-Based Acceleration of a Direct Linear System Solver," Proc. Int'l Conf. Field-Programmable Technology (FPT'08), pp. 17-24, Dec. 2008.
-
(2008)
Proc. Int'l Conf. Field-Programmable Technology (FPT'08)
, pp. 17-24
-
-
Zhang, W.1
Betz, V.2
Rose, J.3
-
20
-
-
82555171856
-
-
"SRC Supercomputers," http://www.srccomp.com/, 2008.
-
(2008)
SRC Supercomputers
-
-
-
23
-
-
33845468997
-
LU-GPU: Efficient algorithms for solving dense linear systems on graphics hardware
-
Nov.
-
N. Galoppo, N. Govindaraju, M. Henson, and D. Manocha, "LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware," Proc. ACM/IEEE Conf. Supercomputing (SC), p. 3, Nov. 2005.
-
(2005)
Proc. ACM/IEEE Conf. Supercomputing (SC)
, pp. 3
-
-
Galoppo, N.1
Govindaraju, N.2
Henson, M.3
Manocha, D.4
-
25
-
-
33646731004
-
Performance study of LU decomposition on the programmable GPU
-
F. Ino, M. Matsui, K. Goda, and K. Hagihara, "Performance Study of LU Decomposition on the Programmable GPU," Proc. Int'l Conf. High Performance Computing (HiPC), vol. 3769, pp. 83-94, 2005.
-
(2005)
Proc. Int'l Conf. High Performance Computing (HiPC)
, vol.3769
, pp. 83-94
-
-
Ino, F.1
Matsui, M.2
Goda, K.3
Hagihara, K.4
-
26
-
-
77954080759
-
Dense linear algebra solvers for multicore with GPU accelerators
-
Jan.
-
S. Tomov, R. Nath, H. Ltaief, and J. Dongarra, "Dense Linear Algebra Solvers for Multicore with GPU Accelerators," Proc. Int'l Workshop High-Level Parallel Programming Models and Supportive Environments (HIPS'10), Jan. 2010.
-
(2010)
Proc. Int'l Workshop High-Level Parallel Programming Models and Supportive Environments (HIPS'10)
-
-
Tomov, S.1
Nath, R.2
Ltaief, H.3
Dongarra, J.4
-
29
-
-
82555163061
-
QDR II SRAM interface for virtex-5 devices
-
Oct.
-
L. Gopalakrishnan, "QDR II SRAM Interface for Virtex-5 Devices," Xilinx Application Note (XAPP853), http://www.xilinx.com/support/ documentation/application-notes/xapp853.pdf, Oct. 2008.
-
(2008)
Xilinx Application Note (XAPP853)
-
-
Gopalakrishnan, L.1
-
30
-
-
57049186554
-
High-performance mixed-precision linear solver for FPGAs
-
Dec.
-
J. Sun, G. Peterson, and O. Storaasli, "High-Performance Mixed-Precision Linear Solver for FPGAs," IEEE Trans. Computers, vol. 57, no. 12, pp. 1614-1623, Dec. 2008.
-
(2008)
IEEE Trans. Computers
, vol.57
, Issue.12
, pp. 1614-1623
-
-
Sun, J.1
Peterson, G.2
Storaasli, O.3
-
31
-
-
82555169975
-
-
"AMD Core Math Library (ACML)," http://developer.amd.com/cpu/ Libraries/acml/Pages/default.aspx, 2011.
-
(2011)
AMD Core Math Library (ACML)
-
-
-
33
-
-
77953997924
-
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects
-
E. Agullo, J. Demmel, J. Dongarra, B. Hadri, J. Kurzak, J. Langou, H. Ltaief, P. Luszczek, and S. Tomov, "Numerical Linear Algebra on Emerging Architectures: The PLASMA and MAGMA Projects," J. Physics: Conference Series, vol. 180, 2009.
-
(2009)
J. Physics: Conference Series
, vol.180
-
-
Agullo, E.1
Demmel, J.2
Dongarra, J.3
Hadri, B.4
Kurzak, J.5
Langou, J.6
Ltaief, H.7
Luszczek, P.8
Tomov, S.9
|