메뉴 건너뛰기




Volumn 61, Issue 5, 2012, Pages 745-751

Low-cost binary128 floating-point FMA unit design with SIMD support

Author keywords

binary128; computer arithmetic; Floating point; fused multiply add; implementation; SIMD

Indexed keywords

BINARY128; COMPUTER ARITHMETIC; FLOATING POINTS; FUSED MULTIPLY-ADD; IMPLEMENTATION; SIMD;

EID: 84859719343     PISSN: 00189340     EISSN: None     Source Type: Journal    
DOI: 10.1109/TC.2011.77     Document Type: Article
Times cited : (30)

References (26)
  • 1
    • 0025211732 scopus 로고
    • Design of the IBM RISC System/6000 floating-point execution unit
    • R.K. Montoye, E. Hokenek, and S.L. Runyon, "Design of the IBM RISC System/6000 Floating-Point Execution Unit," IBM J. Research & Development, vol. 34, pp. 59-70, 1990 (Pubitemid 20686677)
    • (1990) IBM Journal of Research and Development , vol.34 , Issue.1 , pp. 59-70
    • Montoye, R.K.1    Hokenek, E.2    Runyon, S.L.3
  • 2
    • 0034224812 scopus 로고    scopus 로고
    • Implementing streaming simd extensions on the pentium iii processor
    • July/Aug
    • S.K. Raman, V. Pentkovski, and J. Keshava, "Implementing Streaming SIMD Extensions on the Pentium III Processor," IEEE Micro, vol. 20, no. 4, pp. 47-57, July/Aug. 2000
    • (2000) IEEE Micro , vol.20 , Issue.4 , pp. 47-57
    • Raman, S.K.1    Pentkovski, V.2    Keshava, J.3
  • 3
    • 0037957323 scopus 로고    scopus 로고
    • The amd opteron processor for multiprocessor servers
    • Mar./Apr
    • C. Keltcher, K. McGrath, A. Ahmed, and P. Conway, "The AMD Opteron Processor for Multiprocessor Servers," IEEE Micro, vol. 23, no. 2, pp. 66-76, Mar./Apr. 2003
    • (2003) IEEE Micro , vol.23 , Issue.2 , pp. 66-76
    • Keltcher, C.1    McGrath, K.2    Ahmed, A.3    Conway, P.4
  • 4
    • 21044454029 scopus 로고    scopus 로고
    • Design and exploitation of a high-performance simd floating-point unit for blue gene/l
    • S. Chatterjee and L.R. Bachega, "Design and Exploitation of a High-Performance SIMD Floating-Point Unit for Blue Gene/L," IBM J. Research and Development, vol. 49, pp. 377-392, 2005
    • (2005) IBM J. Research and Development , vol.49 , pp. 377-392
    • Chatterjee, S.1    Bachega, L.R.2
  • 8
    • 3242732342 scopus 로고    scopus 로고
    • Floating-point multiply-add-fused with reduced latency
    • Aug
    • T. Lang and J.D. Bruguera, "Floating-Point Multiply-Add-Fused with Reduced Latency," IEEE Trans. Computers, vol. 53, no. 8, pp. 988-1003, Aug. 2004
    • (2004) IEEE Trans. Computers , vol.53 , Issue.8 , pp. 988-1003
    • Lang, T.1    Bruguera, J.D.2
  • 12
    • 27944446098 scopus 로고    scopus 로고
    • The vector floating-point unit in a synergistic processor element of a cell processor
    • S.M. Mueller et al., "The Vector Floating-Point Unit in a Synergistic Processor Element of a CELL Processor," Proc. IEEE 17th Symp. Computer Arithmetic (ARITH ), 2005
    • (2005) Proc. IEEE 17th Symp. Computer Arithmetic (ARITH )
    • Mueller, S.M.1
  • 16
    • 38049053683 scopus 로고    scopus 로고
    • A new architecture for multiple-precision floating-point multiply-add fused unit design
    • June
    • L. Huang, L. Shen, K. Dai, and Z.Wang, "A New Architecture for Multiple-Precision Floating-Point Multiply-Add Fused Unit Design," Proc. IEEE 18th Symp. Computer Arithmetic, pp. 69-76, June 2007
    • (2007) Proc. IEEE 18th Symp. Computer Arithmetic , pp. 69-76
    • Huang, L.1    Shen, L.2    Dai, K.3    Wang, Z.4
  • 17
    • 77949346568 scopus 로고    scopus 로고
    • A comparative study of subword parallel adders for multimedia applications
    • S. Ma, L. Huang, M. Lai, and Z. Wang, "A Comparative Study of Subword Parallel Adders for Multimedia Applications," The Eight Int'l Conf. ASIC, 2009
    • (2009) The Eight Int'l Conf. ASIC
    • Ma, S.1    Huang, L.2    Lai, M.3    Wang, Z.4
  • 20
    • 75449106575 scopus 로고    scopus 로고
    • Low-power multiple-precision iterative floating-point multiplier with simd support
    • Feb
    • D. Tan, C.E. Lemonds, and M.J. Schulte, "Low-Power Multiple-Precision Iterative Floating-Point Multiplier with SIMD Support," IEEE Trans. Computers, vol. 58, no. 2, pp. 175-187, Feb. 2009
    • (2009) IEEE Trans. Computers , vol.58 , Issue.2 , pp. 175-187
    • Tan, D.1    Lemonds, C.E.2    Schulte, M.J.3
  • 21
    • 38049038473 scopus 로고    scopus 로고
    • Multi-functional floating-point maf designs with dot product support
    • Jan
    • M. Gok and M.M. Ozbilen, "Multi-Functional Floating-Point MAF Designs with Dot Product Support," Microelectronics J., vol. 39, pp. 30-43, Jan. 2008
    • (2008) Microelectronics J , vol.39 , pp. 30-43
    • Gok, M.1    Ozbilen, M.M.2
  • 22
    • 0001083804 scopus 로고
    • A reduced-area scheme for carry-select adders
    • Oct
    • A. Tyagi, "A Reduced-Area Scheme for Carry-Select Adders," IEEE Trans. Computers, vol. 42, no. 10, pp. 1163-1170, Oct. 1993
    • (1993) IEEE Trans. Computers , vol.42 , Issue.10 , pp. 1163-1170
    • Tyagi, A.1
  • 23
    • 0033204413 scopus 로고    scopus 로고
    • Leading-one prediction with concurrent position correction
    • Oct
    • J. Bruguera and T. Lang, "Leading-One Prediction with Concurrent Position Correction," IEEE Trans. Computers, vol. 48, no. 10, pp. 298-305, Oct. 1999
    • (1999) IEEE Trans. Computers , vol.48 , Issue.10 , pp. 298-305
    • Bruguera, J.1    Lang, T.2
  • 24
    • 84969498435 scopus 로고    scopus 로고
    • Architectural design of a fast floating-point multiplication-add fused unit using signed-digit addition
    • L. Chen and J. Cheng, "Architectural Design of a Fast Floating-Point Multiplication-Add Fused Unit Using Signed-Digit Addition," Proc. Euromicro Symp. Digital Systems Design, p. 346, 2001
    • (2001) Proc. Euromicro Symp. Digital Systems Design , pp. 346
    • Chen, L.1    Cheng, J.2
  • 26
    • 0000044838 scopus 로고    scopus 로고
    • Comparison of single- and dual-pass multiply-add fused floating-point units
    • R.M. Jessani and M. Putrino, "Comparison of Single-and Dual-Pass Multiply-Add Fused Floating-Point Units," IEEE Trans. Computers, vol. 47, no. 9, pp. 927-937, Sept. 1998. (Pubitemid 128737483)
    • (1998) IEEE Transactions on Computers , vol.47 , Issue.9 , pp. 927-937
    • Jessani, R.M.1    Putrino, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.