SCOPUS 정보 검색 플랫폼

Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011

Volumn , Issue , 2011, Pages 94-101

A hardware acceleration technique for gradient descent and conjugate gradient

(3) Kesler, David a Deka, Biplab a Kumar, Rakesh a

a University of Illinois at Urbana Champaign (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AS GRAPH; COMPUTATION TIME; CONJUGATE GRADIENT; CONJUGATE GRADIENT ALGORITHMS; GRADIENT DESCENT; HARDWARE ACCELERATION; HARDWARE ACCELERATORS; LEAST SQUARE; LINEAR ALGEBRA OPERATIONS; NON-ITERATIVE; NUMERICAL OPTIMIZATIONS; PERFORMANCE LOSS; PROCESSOR POWER; ROBUSTIFICATION; SOFTWARE SUPPORT; SPARSE MATRICES;

ALGORITHMS; LINEAR ALGEBRA; PATTERN MATCHING;

CONJUGATE GRADIENT METHOD;

EID: 79961187689 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SASP.2011.5941086 Document Type: Conference Paper

Times cited : (6)

References (20)

1
- 77956607557
- A numerical optimization-based methodology for application robustification: Transforming applications for error tolerance
- J. Sloan, D. Kesler, R. Kumar, and A. Rahimi, "A numerical optimization-based methodology for application robustification: Transforming applications for error tolerance," in 40th IEEE/IFIP International Conference on Dependable Systems and Networks, 2010, July 2010.
- 40th IEEE/IFIP International Conference on Dependable Systems and Networks, 2010, July 2010
- Sloan, J.¹ Kesler, D.² Kumar, R.³ Rahimi, A.⁴

2
- 1842582489
- Making typical silicon matter with razor
- T. Austin, D. Blaauw, T. Mudge, and K. Flautner, "Making typical silicon matter with razor," Computer, vol. 37, pp. 57-65, 2004.
- (2004) Computer , vol.37 , pp. 57-65
- Austin, T.¹ Blaauw, D.² Mudge, T.³ Flautner, K.⁴

3
- 32844466040
- Springer Publishing
- J. A. Snyman, Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-Based Algorithms. Springer Publishing, 2005.
- (2005) Practical Mathematical Optimization: An Introduction to Basic Optimization Theory and Classical and New Gradient-Based Algorithms
- Snyman, J.A.¹

4
- 0000135303
- Methods of conjugate gradients for solving linear systems
- M. Hestenes and E. Stiefel, "Methods of conjugate gradients for solving linear systems," J. Research Natl Bureau of Standards, vol. 49, no. 6, 1952.
- (1952) J. Research Natl Bureau of Standards , vol.49 , Issue.6
- Hestenes, M.¹ Stiefel, E.²

5
- 62949205696
- Fpga based high performance double-precision matrix multiplication
- Washington, DC, USA: IEEE Computer Society
- V. B. Y. Kumar, S. Joshi, S. B. Patkar, and H. Narayanan, "Fpga based high performance double-precision matrix multiplication," in VLSID '09: Proceedings of the 2009 22nd International Conference on VLSI Design. Washington, DC, USA: IEEE Computer Society, 2009, pp. 341-346.
- (2009) VLSID '09: Proceedings of the 2009 22nd International Conference on VLSI Design , pp. 341-346
- Kumar, V.B.Y.¹ Joshi, S.² Patkar, S.B.³ Narayanan, H.⁴

6
- 84947242005
- A hierarchical sparse matrix storage format for vector processors
- P. Stathis, S. Vassiliadis, and S. Cotofana, "A hierarchical sparse matrix storage format for vector processors," in Parallel and Distributed Processing Symposium, 2003. Proceedings. International, 22-26 2003, p. 8 pp.
- Parallel and Distributed Processing Symposium, 2003. Proceedings. International, 22-26 2003 , pp. 8
- Stathis, P.¹ Vassiliadis, S.² Cotofana, S.³

7
- 0003473816
- Philadelphia, PA: SIAM
- R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. M. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. V. D. Vorst, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods. Philadelphia, PA: SIAM, 1994.
- (1994) Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods
- Barrett, R.¹ Berry, M.² Chan, T.F.³ Demmel, J.⁴ Donato, J.M.⁵ Dongarra, J.⁶ Eijkhout, V.⁷ Pozo, R.⁸ Romine, C.⁹ Vorst, H.V.D.¹⁰

8
- 47349126591
- Sparse matrix-vector multiplication design on fpgas
- J. Sun, G. Peterson, and O. Storaasli, "Sparse matrix-vector multiplication design on fpgas," in Field-Programmable Custom Computing Machines, 2007. FCCM 2007. 15th Annual IEEE Symposium on, 23- 25 2007, pp. 349 -352.
- Field-Programmable Custom Computing Machines, 2007. FCCM 2007. 15th Annual IEEE Symposium On, 23- 25 2007 , pp. 349-352
- Sun, J.¹ Peterson, G.² Storaasli, O.³

9
- 34147157830
- Sparse matrix computations on reconfigurable hardware
- G. R. Morris and V. K. Prasanna, "Sparse matrix computations on reconfigurable hardware," Computer, vol. 40, no. 3, pp. 58-64, 2007.
- (2007) Computer , vol.40 , Issue.3 , pp. 58-64
- Morris, G.R.¹ Prasanna, V.K.²

10
- 20244390636
- Floating-point sparse matrix-vector multiply for fpgas
- M. deLorimier and A. DeHon, "Floating-point sparse matrix-vector multiply for fpgas," in Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays, 2005.
- Proceedings of the 2005 ACM/SIGDA 13th International Symposium on Field-programmable Gate Arrays, 2005
- DeLorimier, M.¹ DeHon, A.²

11
- 70350368872
- Efficient sparse matrix-vector multiplication on CUDA
- NVIDIA Corporation, Dec.
- N. Bell and M. Garland, "Efficient sparse matrix-vector multiplication on CUDA," NVIDIA Corporation, NVIDIA Technical Report NVR-2008-004, Dec. 2008.
- (2008) NVIDIA Technical Report NVR-2008-004
- Bell, N.¹ Garland, M.²

12
- 74049163483
- Optimizing sparse matrix-vector multiplication on gpus
- IBM, Apr.
- M. M. Baskaran and R. Bordawekar, "Optimizing sparse matrix-vector multiplication on gpus," IBM, IBM Research Report RC24704, Apr. 2009.
- (2009) IBM Research Report RC24704
- Baskaran, M.M.¹ Bordawekar, R.²

13
- 34047144377
- Scalable and modular algorithms for floating-point matrix multiplication on reconfigurable computing systems
- L. Zhuo and V. K. Prasanna, "Scalable and modular algorithms for floating-point matrix multiplication on reconfigurable computing systems," IEEE Trans. Parallel Distrib. Syst., vol. 18, no. 4, pp. 433-448, 2007.
- (2007) IEEE Trans. Parallel Distrib. Syst. , vol.18 , Issue.4 , pp. 433-448
- Zhuo, L.¹ Prasanna, V.K.²

14
- 20344376214
- 64-bit floating-point fpga matrix multiplication
- New York, NY, USA: ACM
- Y. Dou, S. Vassiliadis, G. K. Kuzmanov, and G. N. Gaydadjiev, "64-bit floating-point fpga matrix multiplication," in FPGA '05: Proceedings of the 2005 ACM/SIGDA 13th international symposium on Fieldprogrammable gate arrays. New York, NY, USA: ACM, 2005, pp. 86-95.
- (2005) FPGA '05: Proceedings of the 2005 ACM/SIGDA 13th International Symposium on Fieldprogrammable Gate Arrays , pp. 86-95
- Dou, Y.¹ Vassiliadis, S.² Kuzmanov, G.K.³ Gaydadjiev, G.N.⁴

15
- 84859456270
- A high performance fpga-based accelerator for blas library implementation
- S. Rousseaux, D. Hubaux, P. Guisset, and J.-D. Legat, "A high performance fpga-based accelerator for blas library implementation," in RSSI'07: Proceedings of the Third Annual Reconfigurable Systems Summer Institute, July 2007.
- RSSI'07: Proceedings of the Third Annual Reconfigurable Systems Summer Institute, July 2007
- Rousseaux, S.¹ Hubaux, D.² Guisset, P.³ Legat, J.-D.⁴

16
- 34548826218
- Hardware acceleration of matrix multiplication on a xilinx fpga
- Washington, DC, USA: IEEE Computer Society
- N. Dave, K. Fleming, M. King, M. Pellauer, and M. Vijayaraghavan, "Hardware acceleration of matrix multiplication on a xilinx fpga," in MEMOCODE '07: Proceedings of the 5th IEEE/ACM International Conference on Formal Methods and Models for Codesign. Washington, DC, USA: IEEE Computer Society, 2007, pp. 97-100.
- (2007) MEMOCODE '07: Proceedings of the 5th IEEE/ACM International Conference on Formal Methods and Models for Codesign , pp. 97-100
- Dave, N.¹ Fleming, K.² King, M.³ Pellauer, M.⁴ Vijayaraghavan, M.⁵

17
- 79961190886
- Hardware realization of matrix multiplication using field programmable gate array
- August
- S. M. Qasim, S. A. Abbasi, and B. A. Almashary, "Hardware realization of matrix multiplication using field programmable gate array," in MASAUM Journal of Computing, vol. 1, August 2009, pp. 21-25.
- (2009) MASAUM Journal of Computing , vol.1 , pp. 21-25
- Qasim, S.M.¹ Abbasi, S.A.² Almashary, B.A.³

18
- 79961188036
- Floating point matrix multiplication on a reconfigurable computing system
- Springer Berlin Heidelberg
- C. Sajish, Y. Abhyankar, S. Ghotgalkar, and K. Venkates, "Floating point matrix multiplication on a reconfigurable computing system," in Proceedings of the International Conference on High Performance Computing and Applications. Springer Berlin Heidelberg, 2005, pp. 113-122.
- (2005) Proceedings of the International Conference on High Performance Computing and Applications , pp. 113-122
- Sajish, C.¹ Abhyankar, Y.² Ghotgalkar, S.³ Venkates, K.⁴

19
- 47049109081
- High-performance designs for linear algebra operations on reconfigurable hardware
- L. Zhuo and V. K. Prasanna, "High-performance designs for linear algebra operations on reconfigurable hardware," IEEE Trans. Comput., vol. 57, no. 8, pp. 1057-1071, 2008.
- (2008) IEEE Trans. Comput. , vol.57 , Issue.8 , pp. 1057-1071
- Zhuo, L.¹ Prasanna, V.K.²

20
- 50949166640
- Evaluation and tuning of the level 3 cublas for graphics processors
- S. Barrachina, M. Castillo, F. Igual, R. Mayo, and E. Quintana-Orti, "Evaluation and tuning of the level 3 cublas for graphics processors," in Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, april 2008, pp. 1 -8.
- Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium On, April 2008 , pp. 1-8
- Barrachina, S.¹ Castillo, M.² Igual, F.³ Mayo, R.⁴ Quintana-Orti, E.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.