-
1
-
-
77956607557
-
A numerical optimization-based methodology for application robustification: Transforming applications for error tolerance
-
J. Sloan, D. Kesler, R. Kumar, and A. Rahimi, "A numerical optimization-based methodology for application robustification: Transforming applications for error tolerance," in 40th IEEE/IFIP International Conference on Dependable Systems and Networks, 2010, July 2010.
-
40th IEEE/IFIP International Conference on Dependable Systems and Networks, 2010, July 2010
-
-
Sloan, J.1
Kesler, D.2
Kumar, R.3
Rahimi, A.4
-
2
-
-
1842582489
-
Making typical silicon matter with razor
-
T. Austin, D. Blaauw, T. Mudge, and K. Flautner, "Making typical silicon matter with razor," Computer, vol. 37, pp. 57-65, 2004.
-
(2004)
Computer
, vol.37
, pp. 57-65
-
-
Austin, T.1
Blaauw, D.2
Mudge, T.3
Flautner, K.4
-
4
-
-
0000135303
-
Methods of conjugate gradients for solving linear systems
-
M. Hestenes and E. Stiefel, "Methods of conjugate gradients for solving linear systems," J. Research Natl Bureau of Standards, vol. 49, no. 6, 1952.
-
(1952)
J. Research Natl Bureau of Standards
, vol.49
, Issue.6
-
-
Hestenes, M.1
Stiefel, E.2
-
5
-
-
62949205696
-
Fpga based high performance double-precision matrix multiplication
-
Washington, DC, USA: IEEE Computer Society
-
V. B. Y. Kumar, S. Joshi, S. B. Patkar, and H. Narayanan, "Fpga based high performance double-precision matrix multiplication," in VLSID '09: Proceedings of the 2009 22nd International Conference on VLSI Design. Washington, DC, USA: IEEE Computer Society, 2009, pp. 341-346.
-
(2009)
VLSID '09: Proceedings of the 2009 22nd International Conference on VLSI Design
, pp. 341-346
-
-
Kumar, V.B.Y.1
Joshi, S.2
Patkar, S.B.3
Narayanan, H.4
-
6
-
-
84947242005
-
A hierarchical sparse matrix storage format for vector processors
-
P. Stathis, S. Vassiliadis, and S. Cotofana, "A hierarchical sparse matrix storage format for vector processors," in Parallel and Distributed Processing Symposium, 2003. Proceedings. International, 22-26 2003, p. 8 pp.
-
Parallel and Distributed Processing Symposium, 2003. Proceedings. International, 22-26 2003
, pp. 8
-
-
Stathis, P.1
Vassiliadis, S.2
Cotofana, S.3
-
7
-
-
0003473816
-
-
Philadelphia, PA: SIAM
-
R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. M. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. V. D. Vorst, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods. Philadelphia, PA: SIAM, 1994.
-
(1994)
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods
-
-
Barrett, R.1
Berry, M.2
Chan, T.F.3
Demmel, J.4
Donato, J.M.5
Dongarra, J.6
Eijkhout, V.7
Pozo, R.8
Romine, C.9
Vorst, H.V.D.10
-
8
-
-
47349126591
-
Sparse matrix-vector multiplication design on fpgas
-
J. Sun, G. Peterson, and O. Storaasli, "Sparse matrix-vector multiplication design on fpgas," in Field-Programmable Custom Computing Machines, 2007. FCCM 2007. 15th Annual IEEE Symposium on, 23- 25 2007, pp. 349 -352.
-
Field-Programmable Custom Computing Machines, 2007. FCCM 2007. 15th Annual IEEE Symposium On, 23- 25 2007
, pp. 349-352
-
-
Sun, J.1
Peterson, G.2
Storaasli, O.3
-
9
-
-
34147157830
-
Sparse matrix computations on reconfigurable hardware
-
G. R. Morris and V. K. Prasanna, "Sparse matrix computations on reconfigurable hardware," Computer, vol. 40, no. 3, pp. 58-64, 2007.
-
(2007)
Computer
, vol.40
, Issue.3
, pp. 58-64
-
-
Morris, G.R.1
Prasanna, V.K.2
-
11
-
-
70350368872
-
Efficient sparse matrix-vector multiplication on CUDA
-
NVIDIA Corporation, Dec.
-
N. Bell and M. Garland, "Efficient sparse matrix-vector multiplication on CUDA," NVIDIA Corporation, NVIDIA Technical Report NVR-2008-004, Dec. 2008.
-
(2008)
NVIDIA Technical Report NVR-2008-004
-
-
Bell, N.1
Garland, M.2
-
13
-
-
34047144377
-
Scalable and modular algorithms for floating-point matrix multiplication on reconfigurable computing systems
-
L. Zhuo and V. K. Prasanna, "Scalable and modular algorithms for floating-point matrix multiplication on reconfigurable computing systems," IEEE Trans. Parallel Distrib. Syst., vol. 18, no. 4, pp. 433-448, 2007.
-
(2007)
IEEE Trans. Parallel Distrib. Syst.
, vol.18
, Issue.4
, pp. 433-448
-
-
Zhuo, L.1
Prasanna, V.K.2
-
14
-
-
20344376214
-
64-bit floating-point fpga matrix multiplication
-
New York, NY, USA: ACM
-
Y. Dou, S. Vassiliadis, G. K. Kuzmanov, and G. N. Gaydadjiev, "64-bit floating-point fpga matrix multiplication," in FPGA '05: Proceedings of the 2005 ACM/SIGDA 13th international symposium on Fieldprogrammable gate arrays. New York, NY, USA: ACM, 2005, pp. 86-95.
-
(2005)
FPGA '05: Proceedings of the 2005 ACM/SIGDA 13th International Symposium on Fieldprogrammable Gate Arrays
, pp. 86-95
-
-
Dou, Y.1
Vassiliadis, S.2
Kuzmanov, G.K.3
Gaydadjiev, G.N.4
-
15
-
-
84859456270
-
A high performance fpga-based accelerator for blas library implementation
-
S. Rousseaux, D. Hubaux, P. Guisset, and J.-D. Legat, "A high performance fpga-based accelerator for blas library implementation," in RSSI'07: Proceedings of the Third Annual Reconfigurable Systems Summer Institute, July 2007.
-
RSSI'07: Proceedings of the Third Annual Reconfigurable Systems Summer Institute, July 2007
-
-
Rousseaux, S.1
Hubaux, D.2
Guisset, P.3
Legat, J.-D.4
-
16
-
-
34548826218
-
Hardware acceleration of matrix multiplication on a xilinx fpga
-
Washington, DC, USA: IEEE Computer Society
-
N. Dave, K. Fleming, M. King, M. Pellauer, and M. Vijayaraghavan, "Hardware acceleration of matrix multiplication on a xilinx fpga," in MEMOCODE '07: Proceedings of the 5th IEEE/ACM International Conference on Formal Methods and Models for Codesign. Washington, DC, USA: IEEE Computer Society, 2007, pp. 97-100.
-
(2007)
MEMOCODE '07: Proceedings of the 5th IEEE/ACM International Conference on Formal Methods and Models for Codesign
, pp. 97-100
-
-
Dave, N.1
Fleming, K.2
King, M.3
Pellauer, M.4
Vijayaraghavan, M.5
-
17
-
-
79961190886
-
Hardware realization of matrix multiplication using field programmable gate array
-
August
-
S. M. Qasim, S. A. Abbasi, and B. A. Almashary, "Hardware realization of matrix multiplication using field programmable gate array," in MASAUM Journal of Computing, vol. 1, August 2009, pp. 21-25.
-
(2009)
MASAUM Journal of Computing
, vol.1
, pp. 21-25
-
-
Qasim, S.M.1
Abbasi, S.A.2
Almashary, B.A.3
-
18
-
-
79961188036
-
Floating point matrix multiplication on a reconfigurable computing system
-
Springer Berlin Heidelberg
-
C. Sajish, Y. Abhyankar, S. Ghotgalkar, and K. Venkates, "Floating point matrix multiplication on a reconfigurable computing system," in Proceedings of the International Conference on High Performance Computing and Applications. Springer Berlin Heidelberg, 2005, pp. 113-122.
-
(2005)
Proceedings of the International Conference on High Performance Computing and Applications
, pp. 113-122
-
-
Sajish, C.1
Abhyankar, Y.2
Ghotgalkar, S.3
Venkates, K.4
-
19
-
-
47049109081
-
High-performance designs for linear algebra operations on reconfigurable hardware
-
L. Zhuo and V. K. Prasanna, "High-performance designs for linear algebra operations on reconfigurable hardware," IEEE Trans. Comput., vol. 57, no. 8, pp. 1057-1071, 2008.
-
(2008)
IEEE Trans. Comput.
, vol.57
, Issue.8
, pp. 1057-1071
-
-
Zhuo, L.1
Prasanna, V.K.2
-
20
-
-
50949166640
-
Evaluation and tuning of the level 3 cublas for graphics processors
-
S. Barrachina, M. Castillo, F. Igual, R. Mayo, and E. Quintana-Orti, "Evaluation and tuning of the level 3 cublas for graphics processors," in Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, april 2008, pp. 1 -8.
-
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium On, April 2008
, pp. 1-8
-
-
Barrachina, S.1
Castillo, M.2
Igual, F.3
Mayo, R.4
Quintana-Orti, E.5
|