SCOPUS 정보 검색 플랫폼

ACM Transactions on Reconfigurable Technology and Systems

Volumn 6, Issue 1, 2013, Pages

Self-alignment schemes for the implementation of addition-related floating-point operators

(2) Ould Bachir, Tarek a David, Jean Pierre a

a ÉCOLE POLYTECHNIQUE DE MONTRÉAL (Canada)

Author keywords

Accumulator; Floating point; FPGA; Redundant arithmetic; Self alignment technique; Summation

Indexed keywords

ACCUMULATOR; FLOATING-POINT; REDUNDANT ARITHMETIC; SELF-ALIGNMENT TECHNIQUES; SUMMATION;

ALIGNMENT; FIELD PROGRAMMABLE GATE ARRAYS (FPGA); HARDWARE; SEMICONDUCTOR DEVICE MANUFACTURE;

DIGITAL ARITHMETIC;

EID: 84877898997 PISSN: 19367406 EISSN: 19367414 Source Type: Journal
DOI: 10.1145/2457443.2457444 Document Type: Article

Times cited : (10)

References (41)

1
- 84937078021
- Signed-digit number representations for fast parallel arithmetic
- Avizienis, A. 1961. Signed-digit number representations for fast parallel arithmetic. IEEE Trans. Electron. Comput. 10, 3, 389-400.
- (1961) IEEE Trans. Electron. Comput. , vol.10 , Issue.3 , pp. 389-400
- Avizienis, A.¹

2
- 0029227047
- A new VLSI vector arithmetic coprocessor for the PC
- IEEE
- Baumhof, C. 1995. A new VLSI vector arithmetic coprocessor for the PC. In Proceedings of the 12th Symposium on Computer Arithmetic. IEEE, 210-215.
- (1995) Proceedings of the 12th Symposium on Computer Arithmetic , pp. 210-215
- Baumhof, C.¹

3
- 57049102841
- Automatic generation of modular multipliers for FPGA applications
- Beuchat, J.-L. and Muller, J.-M. 2008. Automatic generation of modular multipliers for FPGA applications. IEEE Trans. Comput. 57, 12, 1600-1613.
- (2008) IEEE Trans. Comput. , vol.57 , Issue.12 , pp. 1600-1613
- Beuchat, J.-L.¹ Muller, J.-M.²

4
- 77951701991
- A hardware accelerator for the fast retrieval of DIALIGN biological sequence alignments in linear space
- Boukerche, A., Correa, J. M., Melo, A., and Jacobi, R. P. 2010. A hardware accelerator for the fast retrieval of DIALIGN biological sequence alignments in linear space. IEEE Trans. Comput. 59, 6, 808-821.
- (2010) IEEE Trans. Comput. , vol.59 , Issue.6 , pp. 808-821
- Boukerche, A.¹ Correa, J.M.² Melo, A.³ Jacobi, R.P.⁴

5
- 77956007831
- Molecular dynamics simulations on high-performance reconfigurable computing systems
- Article 23
- Chiu, M. and Herbordt, M. C. 2010. Molecular dynamics simulations on high-performance reconfigurable computing systems. ACM Trans. Reconfig. Technol. Syst. 3, 4, Article 23.
- (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.4
- Chiu, M.¹ Herbordt, M.C.²

6
- 80055034384
- Designing custom arithmetic data paths with FloPoCo
- de Dinechin, F. and Pasca, B. 2011. Designing custom arithmetic data paths with FloPoCo. IEEE Des. Test Comput. 28, 4, 18-27.
- (2011) IEEE Des. Test Comput. , vol.28 , Issue.4 , pp. 18-27
- De Dinechin, F.¹ Pasca, B.²

7
- 79551558541
- Pipelined FPGA adders
- IEEE
- de Dinechin, F., Nguyen, H. D., and Pasca, B. 2010. Pipelined FPGA adders. In Proceedings of the International Conference on Field Programmable Logic and Applications (FPL'10). IEEE, 422-427.
- (2010) Proceedings of the International Conference on Field Programmable Logic and Applications (FPL'10) , pp. 422-427
- De Dinechin, F.¹ Nguyen, H.D.² Pasca, B.³

8
- 78650423490
- An FPGA-specific approach to floating-point accumulation and sum-of-products
- IEEE
- de Dinechin, F., Pasca, B., Creţ, O., and Tudoran, R. 2008. An FPGA-specific approach to floating-point accumulation and sum-of-products. In Proceedings of the International Conference on ICECE Technology. IEEE, 33-40.
- (2008) Proceedings of the International Conference on ICECE Technology , pp. 33-40
- De Dinechin, F.¹ Pasca, B.² Creţ, O.³ Tudoran, R.⁴

9
- 1842462616
- Morgan Kaufmann, San Francisco, CA
- Ercegovac, M. and Lan, T. 2004. Digital Arithmetic. Morgan Kaufmann, San Francisco, CA.
- (2004) Digital Arithmetic
- Ercegovac, M.¹ Lan, T.²

10
- 0043136386
- The case for a redundant format in floating point arithmetic
- IEEE
- Fahmy, H. A. H. and Flynn, M. J. 2003. The case for a redundant format in floating point arithmetic. In Proceedings of the IEEE Symposium on Computer Arithmetic. IEEE, 95-102.
- (2003) Proceedings of the IEEE Symposium on Computer Arithmetic , pp. 95-102
- Fahmy, H.A.H.¹ Flynn, M.J.²

11
- 0035573133
- Improving the effectiveness of floating point arithmetic
- IEEE
- Fahmy, H. A. H., Liddicoat, A. A., and Flynn, M. J. 2001. Improving the effectiveness of floating point arithmetic. In Conference Record of the 35th Asilomar Conference on Signals, Systems and Computers. Vol. 1, IEEE, 875-879.
- (2001) Conference Record of the 35th Asilomar Conference on Signals, Systems and Computers , vol.1 , pp. 875-879
- Fahmy, H.A.H.¹ Liddicoat, A.A.² Flynn, M.J.³

12
- 0003439428
- Ph.D. dissertation. Stanford University
- Farmwald, P. M. 1981. On the design of high performance digital arithmetic units. Ph.D. dissertation. Stanford University.
- (1981) On the Design of High Performance Digital Arithmetic Units
- Farmwald, P.M.¹

13
- 84866919831
- Fast, efficient floating-point adders and multipliers for FPGAs
- Article 11
- Hemmert, K. S. and Underwood, K. D. 2010. Fast, efficient floating-point adders and multipliers for FPGAs. ACM Trans. Reconfig. Technol. Syst. 3, 3, Article 11.
- (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.3
- Hemmert, K.S.¹ Underwood, K.D.²

14
- 0000155361
- The accuracy of floating point summation
- Higham, N. J. 1993. The accuracy of floating point summation. SIAM J. Sci. Comput. 14, 783-799.
- (1993) SIAM J. Sci. Comput. , vol.14 , pp. 783-799
- Higham, N.J.¹

15
- 70350777563
- IEEE
- IEEE. 2008. IEEE-754, Standard for floating-point arithmetic.
- (2008) IEEE-754, Standard for Floating-point Arithmetic

16
- 77950318688
- Redundant-digit floating-point addition scheme based on a stored rounding value
- Jaberipur, G., Parhami, B., and Gorgin, S. 2010. Redundant-digit floating-point addition scheme based on a stored rounding value. IEEE Trans. Comput. 59, 5, 694-706.
- (2010) IEEE Trans. Comput. , vol.59 , Issue.5 , pp. 694-706
- Jaberipur, G.¹ Parhami, B.² Gorgin, S.³

17
- 82555168407
- FPGA-based high-performance and scalable block LU decomposition architecture
- Jaiswal, M. K. and Chandrachoodan, N. 2012. FPGA-based high-performance and scalable block LU decomposition architecture. IEEE Trans. Comput. 61, 1, 60-72.
- (2012) IEEE Trans. Comput. , vol.61 , Issue.1 , pp. 60-72
- Jaiswal, M.K.¹ Chandrachoodan, N.²

18
- 0002165479
- A survey of error analysis
- Kahan, W. 1971. A survey of error analysis. In Proceedings of the IFIP Congress. 1214-1239.
- (1971) Proceedings of the IFIP Congress , pp. 1214-1239
- Kahan, W.¹

19
- 36049032847
- Optimistic parallelization of floating-point accumulation
- IEEE
- Kapre, N. and DeHon, A. 2007. Optimistic parallelization of floating-point accumulation. In Proceedings of the IEEE Symposium on Computer Arithmetic. IEEE, 205-216.
- (2007) Proceedings of the IEEE Symposium on Computer Arithmetic , pp. 205-216
- Kapre, N.¹ DeHon, A.²

20
- 67649887173
- A floating-point unit for 4D vector inner product with reduced latency
- Kim, D. and Kim. L.-S. 2009. A floating-point unit for 4D vector inner product with reduced latency. IEEE Trans. Comput. 58, 7, 890-901.
- (2009) IEEE Trans. Comput. , vol.58 , Issue.7 , pp. 890-901
- Kim, D.¹ Kim, L.-S.²

21
- 0013015990
- A. K. Peters, Ltd., Natick, MA
- Koren, I. 2001. Computer Arithmetic Algorithms. A. K. Peters, Ltd., Natick, MA.
- (2001) Computer Arithmetic Algorithms
- Koren, I.¹

22
- 4444325161
- Springer
- Kulisch, U. 2002. Advanced Arithmetic for Digital Computer: Design of Arithmetic Units. Springer.
- (2002) Advanced Arithmetic for Digital Computer: Design of Arithmetic Units
- Kulisch, U.¹

23
- 74349114314
- FPGA floating point datapath compiler
- Langhammer, M. and VanCourt, T. 2009. FPGA floating point datapath compiler. In Proceedings of the IEEE Symposium on Field Programmable Custom Computing Machines. 259-262.
- (2009) Proceedings of the IEEE Symposium on Field Programmable Custom Computing Machines. , pp. 259-262
- Langhammer, M.¹ VanCourt, T.²

24
- 0033733825
- Accelerating pipelined integer and floating-point accumulations in configurable hardware with delayed addition techniques
- Luo, Z. and Martonosi, M. 2000. Accelerating pipelined integer and floating-point accumulations in configurable hardware with delayed addition techniques. IEEE Trans. Comput. 49, 3, 208-218.
- (2000) IEEE Trans. Comput. , vol.49 , Issue.3 , pp. 208-218
- Luo, Z.¹ Martonosi, M.²

25
- 33846985251
- An FPGA-based floating-point jacobi iterative solver
- IEEE
- Morris, G. R. and Prasanna, V. K. 2005. An FPGA-based floating-point Jacobi iterative solver. In Proceedings of the 8th International Symposium on Parallel Architectures, Algorithms and Networks. IEEE.
- (2005) Proceedings of the 8th International Symposium on Parallel Architectures, Algorithms and Networks
- Morris, G.R.¹ Prasanna, V.K.²

26
- 33749583441
- On the definition of ulp(x)
- Muller, J.-M. 2005. On the definition of ulp(x). Tech. rep. RR-5504. INRIA.
- (2005) Tech. Rep. RR-5504. INRIA
- Muller, J.-M.¹

27
- 77955911632
- Birkhauser
- Muller, J.-M., Brisebarre, N., de Dinechin, F., Jeannerod, C.-P., Lefèvre, V., Melquiond, G., Revol, N., Stehlé, D., and Torres, S. 2010. Handbook of Floating-Point Arithmetic. Birkhauser.
- (2010) Handbook of Floating-Point Arithmetic
- Muller, J.-M.¹ Brisebarre, N.² De Dinechin, F.³ Jeannerod, C.-P.⁴ Lefèvre, V.⁵ Melquiond, G.⁶ Revol, N.⁷ Stehlé, D.⁸ Torres, S.⁹

28
- 84867539182
- Accelerating machine-learning algorithms on FPGAs using pattern-based decomposition
- Nagarajan, K., Holland, B., George, A., Slatton, A., and Lam, H. 2009. Accelerating machine-learning algorithms on FPGAs using pattern-based decomposition. J. Signal Process. Syst. 60, 1, 1-21.
- (2009) J. Signal Process. Syst. , vol.60 , Issue.1 , pp. 1-21
- Nagarajan, K.¹ Holland, B.² George, A.³ Slatton, A.⁴ Lam, H.⁵

29
- 77954309015
- Performing floating-point accumulation on a modern fpga in single and double precision
- IEEE
- Ould-Bachir, T. and David, J.-P. 2010. Performing floating-point accumulation on a modern fpga in single and double precision. In Proceedings of the International Symposium on Field-Programmable Custom Computing Machines (FCCM'10). IEEE, 105-108.
- (2010) Proceedings of the International Symposium on Field-Programmable Custom Computing Machines (FCCM'10) , pp. 105-108
- Ould-Bachir, T.¹ David, J.-P.²

30
- 0025210204
- Generalized signed-digit number systems: A unifying framework for redundant number representations
- Parhami, B. 1990. Generalized signed-digit number systems: a unifying framework for redundant number representations. IEEE Trans. Comput. 39, 1, 89-98.
- (1990) IEEE Trans. Comput. , vol.39 , Issue.1 , pp. 89-98
- Parhami, B.¹

31
- 0002342184
- INTLAB: Interval Laboratory
- Tibor Csendes Ed., Kluwer Academic Publishers, Dordrecht
- Rump, S. M. 1999. INTLAB: INTerval LABoratory. In Developments in Reliable Computing, Tibor Csendes Ed., Kluwer Academic Publishers, Dordrecht, 77-104.
- (1999) Developments in Reliable Computing , pp. 77-104
- Rump, S.M.¹

32
- 77956237657
- Ultimately fast accurate summation
- Rump, S. M. 2009. Ultimately fast accurate summation. SIAM J. Sci. Comput. 31, 5, 3466-3502.
- (2009) SIAM J. Sci. Comput. , vol.31 , Issue.5 , pp. 3466-3502
- Rump, S.M.¹

33
- 55049129860
- Accurate floating-point summation Part I: Faithful rounding
- Rump, S, M., Ogita, T., and Oishi, S. 2008. Accurate floating-point summation Part I: Faithful rounding. SIAM J. Sci. Comput. 31, 1, 189-224.
- (2008) SIAM J. Sci. Comput. , vol.31 , Issue.1 , pp. 189-224
- Rump, S.M.¹ Ogita, T.² Oishi, S.³

34
- 84855422103
- FFT implementation with fused floating-point operations
- Swartzlander, E., Jr. and Saleh, H. 2012. FFT implementation with fused floating-point operations. IEEE Trans. Comput. 61, 2, 284-288.
- (2012) IEEE Trans. Comput. , vol.61 , Issue.2 , pp. 284-288
- Swartzlander Jr., E.¹ Saleh, H.²

35
- 84855352110
- Accelerating matrix operations with improved deeply pipelined vector reduction
- Tai, Y.-G., Lo, C.-T. D., and Psarris, K. 2012. Accelerating matrix operations with improved deeply pipelined vector reduction. IEEE Trans. Parallel Distrib. Syst. 23, 2, 202-210.
- (2012) IEEE Trans. Parallel Distrib. Syst. , vol.23 , Issue.2 , pp. 202-210
- Tai, Y.-G.¹ Lo, C.-T.D.² Psarris, K.³

36
- 70350787064
- Multi-operand floating-point addition
- IEEE
- Tenca, A. F. 2009. Multi-operand floating-point addition. In Proceedings of the IEEE Symposium on Computer Arithmetic. IEEE, 161-168.
- (2009) Proceedings of the IEEE Symposium on Computer Arithmetic , pp. 161-168
- Tenca, A.F.¹

37
- 33749527748
- A 6.2-GFLOPS floating-point multiply-accumulator with conditional Normalization
- Vangal, S. R., Hoskote, Y. V., Borkar, N. Y., and Alvandpour, A. 2006. A 6.2-GFLOPS floating-point multiply-accumulator with conditional normalization. IEEE J. Solid-State Circuits 41, 10, 2314-2323.
- (2006) IEEE J. Solid-State Circuits , vol.41 , Issue.10 , pp. 2314-2323
- Vangal, S.R.¹ Hoskote, Y.V.² Borkar, N.Y.³ Alvandpour, A.⁴

38
- 79951755290
- Synthesis of floating-point addition clusters on FPGAs using carry-save arithmetic
- IEEE
- Verma, A., Verma, A. K., Parandeh-Afshar, H., Brisk, P., and Ienne, P. 2010. Synthesis of floating-point addition clusters on FPGAs using carry-save arithmetic. In Proceedings of the International Conference on Field Programmable Logic and Applications. IEEE, 19-24.
- (2010) Proceedings of the International Conference on Field Programmable Logic and Applications , pp. 19-24
- Verma, A.¹ Verma, A.K.² Parandeh-Afshar, H.³ Brisk, P.⁴ Ienne, P.⁵

39
- 84856968087
- VFloat: A variable precision fixed- and floating-point library for reconfigurable hardware
- Article 16
- Wang, X. and Leeser, M. 2010. VFloat: A variable precision fixed- and floating-point library for reconfigurable hardware. ACM Trans. Reconfig. Technol. Syst. 3, 3, Article 16.
- (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.3
- Wang, X.¹ Leeser, M.²

40
- 84862668124
- Characterization of fixed and reconfigurable multi-core devices for application acceleration
- Article 19
- Williams, J., Massie, C., George, A. D., Richardson, J., Gosrani, K., and Lam, H. 2010. Characterization of fixed and reconfigurable multi-core devices for application acceleration. ACM Trans. Reconfig. Technol. Syst. 3, 4, Article 19.
- (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.4
- Williams, J.¹ Massie, C.² George, A.D.³ Richardson, J.⁴ Gosrani, K.⁵ Lam, H.⁶

41
- 47049109081
- High-performance designs for linear algebra operations on reconfigurable hardware
- Zhuo, L. and Prasanna, V. K. 2008. High-performance designs for linear algebra operations on reconfigurable hardware. IEEE Trans. Comput. 57, 8, 1057-1071.
- (2008) IEEE Trans. Comput. , vol.57 , Issue.8 , pp. 1057-1071
- Zhuo, L.¹ Prasanna, V.K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.