SCOPUS Information Retrieval Platform

IEEE Transactions on Computers

Volume 61, Issue 5, 2012, Pages 745-751

Low-cost binary128 floating-point FMA unit design with SIMD support

(5) Huang, Libo (a); Ma, Sheng (a); Shen, Li (a); Wang, Zhiying (a); Xiao, Nong (a)

(a) NATIONAL UNIVERSITY OF DEFENSE TECHNOLOGY (China)

Author keywords

binary128; computer arithmetic; Floating point; fused multiply add; implementation; SIMD

Indexed keywords

BINARY128; COMPUTER ARITHMETIC; FLOATING POINTS; FUSED MULTIPLY-ADD; IMPLEMENTATION; SIMD; COMPUTER ARCHITECTURE; COMPUTER HARDWARE; DIGITAL ARITHMETIC; HARDWARE; DESIGN

EID: 84859719343 PISSN: 00189340 EISSN: None Source Type: Journal
DOI: 10.1109/TC.2011.77 Document Type: Article

Times cited : (30)

References (26)

1
- 0025211732
- Design of the IBM RISC System/6000 floating-point execution unit
- R.K. Montoye, E. Hokenek, and S.L. Runyon, "Design of the IBM RISC System/6000 Floating-Point Execution Unit," IBM J. Research & Development, vol. 34, pp. 59-70, 1990 (Pubitemid 20686677)
- (1990) IBM Journal of Research and Development , vol.34 , Issue.1 , pp. 59-70
- Montoye, R.K.¹ Hokenek, E.² Runyon, S.L.³

2
- 0034224812
- Implementing streaming SIMD extensions on the Pentium III processor
- July/Aug
- S.K. Raman, V. Pentkovski, and J. Keshava, "Implementing Streaming SIMD Extensions on the Pentium III Processor," IEEE Micro, vol. 20, no. 4, pp. 47-57, July/Aug. 2000
- (2000) IEEE Micro , vol.20 , Issue.4 , pp. 47-57
- Raman, S.K.¹ Pentkovski, V.² Keshava, J.³

3
- 0037957323
- The AMD Opteron processor for multiprocessor servers
- Mar./Apr
- C. Keltcher, K. McGrath, A. Ahmed, and P. Conway, "The AMD Opteron Processor for Multiprocessor Servers," IEEE Micro, vol. 23, no. 2, pp. 66-76, Mar./Apr. 2003
- (2003) IEEE Micro , vol.23 , Issue.2 , pp. 66-76
- Keltcher, C.¹ McGrath, K.² Ahmed, A.³ Conway, P.⁴

4
- 21044454029
- Design and exploitation of a high-performance SIMD floating-point unit for Blue Gene/L
- S. Chatterjee and L.R. Bachega, "Design and Exploitation of a High-Performance SIMD Floating-Point Unit for Blue Gene/L," IBM J. Research and Development, vol. 49, pp. 377-392, 2005
- (2005) IBM J. Research and Development , vol.49 , pp. 377-392
- Chatterjee, S.¹ Bachega, L.R.²

5
- 33644962118
- High precision numerical accuracy in physics research
- F. Dinechin and G. Villard, "High Precision Numerical Accuracy in Physics Research," Nuclear Instruments and Methods in Physics Research, vol. 559, pp. 207-210, 2006
- (2006) Nuclear Instruments and Methods in Physics Research , vol.559 , pp. 207-210
- Dinechin, F.¹ Villard, G.²

6
- 84944324519
- A quadruple precision and dual double precision floating-point multiplier
- A. Akkas and M. Schulte, "A Quadruple Precision and Dual Double Precision Floating-Point Multiplier," Proc. Euromicro Symp. Digital System Design (DSD '03), pp. 76-81, 2003
- (2003) Proc. Euromicro Symp. Digital System Design (DSD '03) , pp. 76-81
- Akkas, A.¹ Schulte, M.²

7
- 85096105190
- ANSI/IEEE Standard 754-2008
- IEEE Standard for Floating-Point Arithmetic, ANSI/IEEE Standard 754-2008, 2008
- (2008) IEEE Standard for Floating-Point Arithmetic

8
- 3242732342
- Floating-point multiply-add-fused with reduced latency
- Aug
- T. Lang and J.D. Bruguera, "Floating-Point Multiply-Add-Fused with Reduced Latency," IEEE Trans. Computers, vol. 53, no. 8, pp. 988-1003, Aug. 2004
- (2004) IEEE Trans. Computers , vol.53 , Issue.8 , pp. 988-1003
- Lang, T.¹ Bruguera, J.D.²

9
- 0034874223
- Leading zero anticipation and detection - A comparison of methods
- M.S. Schmookler and K.J. Nowka, "Leading Zero Anticipation and Detection - A Comparison of Methods," Proc. IEEE 15th Symp. Computer Arithmetic (ARITH), pp. 7-12, June 2001 (Pubitemid 32797841)
- (2001) Proceedings - Symposium on Computer Arithmetic , pp. 7-12
- Schmookler, M.S.¹ Nowka, K.J.²

10
- 0031372083
- A dual mode IEEE multiplier
- G. Even, S. Mueller, and P.-M. Seidel, "A Dual Mode IEEE Multiplier," Proc. IEEE Second Int'l Conf. Innovative Systems in Silicon, pp. 282-289, 1997
- (1997) Proc. IEEE Second Int'l Conf. Innovative Systems in Silicon , pp. 282-289
- Even, G.¹ Mueller, S.² Seidel, P.-M.³

11
- 0033348139
- Enhanced floating point coprocessor for embedded signal processing and graphics applications
- C. Hinds, "An Enhanced Floating Point Coprocessor for Embedded Signal Processing and Graphics Applications," Proc. 33rd Asilomar Conf. Signals, Systems, and Computers, pp. 147-151, 1999 (Pubitemid 30591954)
- (1999) Conference Record of the Asilomar Conference on Signals, Systems and Computers , vol.1 , pp. 147-151
- Hinds Chris, N.¹

12
- 27944446098
- The vector floating-point unit in a synergistic processor element of a CELL processor
- S.M. Mueller et al., "The Vector Floating-Point Unit in a Synergistic Processor Element of a CELL Processor," Proc. IEEE 17th Symp. Computer Arithmetic (ARITH), 2005
- (2005) Proc. IEEE 17th Symp. Computer Arithmetic (ARITH)
- Mueller, S.M.¹

13
- 0033348139
- Enhanced floating point coprocessor for embedded signal processing and graphics applications
- C. Hinds, "An Enhanced Floating Point Coprocessor for Embedded Signal Processing and Graphics Applications," Proc. 33rd Asilomar Conf. Signals, Systems, and Computers, pp. 147-151, 1999 (Pubitemid 30591954)
- (1999) Conference Record of the Asilomar Conference on Signals, Systems and Computers , vol.1 , pp. 147-151
- Hinds Chris, N.¹

14
- 33646000524
- The IAX architecture: Interval arithmetic extension
- R. Kolla et al., "The IAX Architecture: Interval Arithmetic Extension," Technical Report 225, Universitat Wurzburg, 1999
- (1999) Technical Report 225, Universitat Wurzburg
- Kolla, R.¹

15
- 0031342304
- Multimedia extensions for general-purpose processors
- R. Lee, "Multimedia Extensions for General-Purpose Processors," Proc. IEEE Workshop Signal Processing Systems, pp. 9-23, 1997
- (1997) Proc. IEEE Workshop Signal Processing Systems , pp. 9-23
- Lee, R.¹

16
- 38049053683
- A new architecture for multiple-precision floating-point multiply-add fused unit design
- June
- L. Huang, L. Shen, K. Dai, and Z. Wang, "A New Architecture for Multiple-Precision Floating-Point Multiply-Add Fused Unit Design," Proc. IEEE 18th Symp. Computer Arithmetic, pp. 69-76, June 2007
- (2007) Proc. IEEE 18th Symp. Computer Arithmetic , pp. 69-76
- Huang, L.¹ Shen, L.² Dai, K.³ Wang, Z.⁴

17
- 77949346568
- A comparative study of subword parallel adders for multimedia applications
- S. Ma, L. Huang, M. Lai, and Z. Wang, "A Comparative Study of Subword Parallel Adders for Multimedia Applications," The Eighth Int'l Conf. ASIC, 2009
- (2009) The Eighth Int'l Conf. ASIC
- Ma, S.¹ Huang, L.² Lai, M.³ Wang, Z.⁴

18
- 0042134563
- Multiple-precision fixed-point vector multiply-accumulator using shared segmentation
- D. Tan, A. Danysh, and M. Liebelt, "Multiple-Precision Fixed-Point Vector Multiply-Accumulator Using Shared Segmentation," Proc. 16th IEEE Symp. Computer Arithmetic (ARITH-16), pp. 12-19, 2003
- (2003) Proc. 16th IEEE Symp. Computer Arithmetic (ARITH-16) , pp. 12-19
- Tan, D.¹ Danysh, A.² Liebelt, M.³

19
- 4143102743
- Multiplier architectures for media processing
- S. Krithivasan and M.J. Schulte, "Multiplier Architectures for Media Processing," Proc. 37th Asilomar Conf. Signals, Systems, and Computers, pp. 2193-2197, 2003
- (2003) Proc. 37th Asilomar Conf. Signals, Systems, and Computers , pp. 2193-2197
- Krithivasan, S.¹ Schulte, M.J.²

20
- 75449106575
- Low-power multiple-precision iterative floating-point multiplier with SIMD support
- Feb
- D. Tan, C.E. Lemonds, and M.J. Schulte, "Low-Power Multiple-Precision Iterative Floating-Point Multiplier with SIMD Support," IEEE Trans. Computers, vol. 58, no. 2, pp. 175-187, Feb. 2009
- (2009) IEEE Trans. Computers , vol.58 , Issue.2 , pp. 175-187
- Tan, D.¹ Lemonds, C.E.² Schulte, M.J.³

21
- 38049038473
- Multi-functional floating-point MAF designs with dot product support
- Jan
- M. Gok and M.M. Ozbilen, "Multi-Functional Floating-Point MAF Designs with Dot Product Support," Microelectronics J., vol. 39, pp. 30-43, Jan. 2008
- (2008) Microelectronics J , vol.39 , pp. 30-43
- Gok, M.¹ Ozbilen, M.M.²

22
- 0001083804
- A reduced-area scheme for carry-select adders
- Oct
- A. Tyagi, "A Reduced-Area Scheme for Carry-Select Adders," IEEE Trans. Computers, vol. 42, no. 10, pp. 1163-1170, Oct. 1993
- (1993) IEEE Trans. Computers , vol.42 , Issue.10 , pp. 1163-1170
- Tyagi, A.¹

23
- 0033204413
- Leading-one prediction with concurrent position correction
- Oct
- J. Bruguera and T. Lang, "Leading-One Prediction with Concurrent Position Correction," IEEE Trans. Computers, vol. 48, no. 10, pp. 298-305, Oct. 1999
- (1999) IEEE Trans. Computers , vol.48 , Issue.10 , pp. 298-305
- Bruguera, J.¹ Lang, T.²

24
- 84969498435
- Architectural design of a fast floating-point multiplication-add fused unit using signed-digit addition
- L. Chen and J. Cheng, "Architectural Design of a Fast Floating-Point Multiplication-Add Fused Unit Using Signed-Digit Addition," Proc. Euromicro Symp. Digital Systems Design, p. 346, 2001
- (2001) Proc. Euromicro Symp. Digital Systems Design , p. 346
- Chen, L.¹ Cheng, J.²

25
- 50249180329
- Floating-point fused multiply-add architectures
- E. Quinnell, E. Swartzlander, and C. Lemonds, "Floating-Point Fused Multiply-Add Architectures," Proc. 41st Asilomar Conf. Signals, Systems, and Computers, (ACSSC '07), pp. 331-337, 2007
- (2007) Proc. 41st Asilomar Conf. Signals, Systems, and Computers, (ACSSC '07) , pp. 331-337
- Quinnell, E.¹ Swartzlander, E.² Lemonds, C.³

26
- 0000044838
- Comparison of single- and dual-pass multiply-add fused floating-point units
- R.M. Jessani and M. Putrino, "Comparison of Single- and Dual-Pass Multiply-Add Fused Floating-Point Units," IEEE Trans. Computers, vol. 47, no. 9, pp. 927-937, Sept. 1998 (Pubitemid 128737483)
- (1998) IEEE Transactions on Computers , vol.47 , Issue.9 , pp. 927-937
- Jessani, R.M.¹ Putrino, M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.