메뉴 건너뛰기




Volumn , Issue , 2012, Pages 149-152

A linear algebra core design for efficient level-3 BLAS

Author keywords

BLAS; Co design; Linear Algebra; Low Power

Indexed keywords

ALGORITHM/ARCHITECTURE CO-DESIGN; BASIC LINEAR ALGEBRA SUBPROGRAMS; BLAS; CMOS TECHNOLOGY; CO-DESIGNS; COMPUTATIONAL KERNELS; CORE DESIGN; DOUBLE PRECISION; LEVEL-3 BLAS; LOW POWER; POWER EFFICIENCY; PROCESSING ELEMENTS;

EID: 84870726245     PISSN: 10636862     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASAP.2012.18     Document Type: Conference Paper
Times cited : (10)

References (12)
  • 1
    • 77954995378 scopus 로고    scopus 로고
    • Understanding sources of inefficiency in general-purpose chips
    • R. Hameed et al., "Understanding sources of inefficiency in general-purpose chips," ISCA '10, 2010.
    • (2010) ISCA '10
    • Hameed, R.1
  • 2
    • 80055100054 scopus 로고    scopus 로고
    • A high-performance, low-power linear algebra core
    • A. Pedram et al., "A high-performance, low-power linear algebra core," ASAP '11, pp. 35-41, 2011.
    • (2011) ASAP '11 , pp. 35-41
    • Pedram, A.1
  • 3
    • 48849089104 scopus 로고    scopus 로고
    • High-performance implementation of the level-3 BLAS
    • K. Goto et al, "High-performance implementation of the level-3 BLAS," ACM Trans. Math. Softw., vol. 35, no. 1, pp. 1-14, 2008.
    • (2008) ACM Trans. Math. Softw. , vol.35 , Issue.1 , pp. 1-14
    • Goto, K.1
  • 4
    • 70350771131 scopus 로고    scopus 로고
    • Benchmarking GPUs to tune dense linear algebra
    • V. Volkov and J. Demmel, "Benchmarking GPUs to tune dense linear algebra," SC 2008, 2008.
    • (2008) SC 2008
    • Volkov, V.1    Demmel, J.2
  • 5
    • 0027693790 scopus 로고
    • General-purpose systolic arrays
    • K. Johnson et al., "General-purpose systolic arrays," Computer, vol. 26, no. 11, pp. 20-31, 1993.
    • (1993) Computer , vol.26 , Issue.11 , pp. 20-31
    • Johnson, K.1
  • 6
    • 70450237431 scopus 로고    scopus 로고
    • Rigel: An architecture and scalable programming interface for a 1000-core accelerator
    • J. Kelm et al., "Rigel: an architecture and scalable programming interface for a 1000-core accelerator," ISCA '09, 2009.
    • (2009) ISCA '09
    • Kelm, J.1
  • 7
    • 85008053864 scopus 로고    scopus 로고
    • An 80-tile sub-100-w teraflops processor in 65-nm cmos
    • S. Vangal et al., "An 80-tile sub-100-w teraflops processor in 65-nm cmos," IEEE J. of Solid-State Circuits, vol. 43, no. 1, 2008.
    • (2008) IEEE J. of Solid-State Circuits , vol.43 , Issue.1
    • Vangal, S.1
  • 8
    • 47049109081 scopus 로고    scopus 로고
    • High-performance designs for linear algebra operations on reconfigurable hardware
    • L. Zhuo and V. Prasanna, "High-performance designs for linear algebra operations on reconfigurable hardware," IEEE Trans. on Computers, vol. 57, no. 8, 2008.
    • (2008) IEEE Trans. on Computers , vol.57 , Issue.8
    • Zhuo, L.1    Prasanna, V.2
  • 9
    • 77951476028 scopus 로고    scopus 로고
    • High-performance floating-point implementation using FPGAs
    • M. Parker, "High-performance floating-point implementation using FPGAs," in MILCOM, 2009.
    • (2009) MILCOM
    • Parker, M.1
  • 10
    • 33750919950 scopus 로고    scopus 로고
    • Energy- and time-efficient matrix multiplication on FPGAs
    • J.-W. Jang et al., "Energy- and time-efficient matrix multiplication on FPGAs," IEEE Trans on VLSI Systems,, vol. 13, no. 11, 2005.
    • (2005) IEEE Trans on VLSI Systems , vol.13 , Issue.11
    • Jang, J.-W.1
  • 11
    • 80055092218 scopus 로고    scopus 로고
    • Energy-efficient floating-point arithmetic for software-defined radio architectures
    • S. Gilani et al., "Energy-efficient floating-point arithmetic for software-defined radio architectures," in ASAP2011.
    • ASAP2011
    • Gilani, S.1
  • 12
    • 41349122721 scopus 로고    scopus 로고
    • Architecting efficient interconnects for large caches with cacti 6.0
    • January
    • N. Muralimanohar et al., "Architecting efficient interconnects for large caches with cacti 6.0," IEEE Micro, vol. 28, pp. 69-79, January 2008.
    • (2008) IEEE Micro , vol.28 , pp. 69-79
    • Muralimanohar, N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.