메뉴 건너뛰기




Volumn 6, Issue 1, 2013, Pages

Self-alignment schemes for the implementation of addition-related floating-point operators

Author keywords

Accumulator; Floating point; FPGA; Redundant arithmetic; Self alignment technique; Summation

Indexed keywords

ACCUMULATOR; FLOATING-POINT; REDUNDANT ARITHMETIC; SELF-ALIGNMENT TECHNIQUES; SUMMATION;

EID: 84877898997     PISSN: 19367406     EISSN: 19367414     Source Type: Journal    
DOI: 10.1145/2457443.2457444     Document Type: Article
Times cited : (10)

References (41)
  • 1
    • 84937078021 scopus 로고
    • Signed-digit number representations for fast parallel arithmetic
    • Avizienis, A. 1961. Signed-digit number representations for fast parallel arithmetic. IEEE Trans. Electron. Comput. 10, 3, 389-400.
    • (1961) IEEE Trans. Electron. Comput. , vol.10 , Issue.3 , pp. 389-400
    • Avizienis, A.1
  • 3
    • 57049102841 scopus 로고    scopus 로고
    • Automatic generation of modular multipliers for FPGA applications
    • Beuchat, J.-L. and Muller, J.-M. 2008. Automatic generation of modular multipliers for FPGA applications. IEEE Trans. Comput. 57, 12, 1600-1613.
    • (2008) IEEE Trans. Comput. , vol.57 , Issue.12 , pp. 1600-1613
    • Beuchat, J.-L.1    Muller, J.-M.2
  • 4
    • 77951701991 scopus 로고    scopus 로고
    • A hardware accelerator for the fast retrieval of DIALIGN biological sequence alignments in linear space
    • Boukerche, A., Correa, J. M., Melo, A., and Jacobi, R. P. 2010. A hardware accelerator for the fast retrieval of DIALIGN biological sequence alignments in linear space. IEEE Trans. Comput. 59, 6, 808-821.
    • (2010) IEEE Trans. Comput. , vol.59 , Issue.6 , pp. 808-821
    • Boukerche, A.1    Correa, J.M.2    Melo, A.3    Jacobi, R.P.4
  • 5
    • 77956007831 scopus 로고    scopus 로고
    • Molecular dynamics simulations on high-performance reconfigurable computing systems
    • Article 23
    • Chiu, M. and Herbordt, M. C. 2010. Molecular dynamics simulations on high-performance reconfigurable computing systems. ACM Trans. Reconfig. Technol. Syst. 3, 4, Article 23.
    • (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.4
    • Chiu, M.1    Herbordt, M.C.2
  • 6
    • 80055034384 scopus 로고    scopus 로고
    • Designing custom arithmetic data paths with FloPoCo
    • de Dinechin, F. and Pasca, B. 2011. Designing custom arithmetic data paths with FloPoCo. IEEE Des. Test Comput. 28, 4, 18-27.
    • (2011) IEEE Des. Test Comput. , vol.28 , Issue.4 , pp. 18-27
    • De Dinechin, F.1    Pasca, B.2
  • 13
    • 84866919831 scopus 로고    scopus 로고
    • Fast, efficient floating-point adders and multipliers for FPGAs
    • Article 11
    • Hemmert, K. S. and Underwood, K. D. 2010. Fast, efficient floating-point adders and multipliers for FPGAs. ACM Trans. Reconfig. Technol. Syst. 3, 3, Article 11.
    • (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.3
    • Hemmert, K.S.1    Underwood, K.D.2
  • 14
    • 0000155361 scopus 로고
    • The accuracy of floating point summation
    • Higham, N. J. 1993. The accuracy of floating point summation. SIAM J. Sci. Comput. 14, 783-799.
    • (1993) SIAM J. Sci. Comput. , vol.14 , pp. 783-799
    • Higham, N.J.1
  • 16
    • 77950318688 scopus 로고    scopus 로고
    • Redundant-digit floating-point addition scheme based on a stored rounding value
    • Jaberipur, G., Parhami, B., and Gorgin, S. 2010. Redundant-digit floating-point addition scheme based on a stored rounding value. IEEE Trans. Comput. 59, 5, 694-706.
    • (2010) IEEE Trans. Comput. , vol.59 , Issue.5 , pp. 694-706
    • Jaberipur, G.1    Parhami, B.2    Gorgin, S.3
  • 17
    • 82555168407 scopus 로고    scopus 로고
    • FPGA-based high-performance and scalable block LU decomposition architecture
    • Jaiswal, M. K. and Chandrachoodan, N. 2012. FPGA-based high-performance and scalable block LU decomposition architecture. IEEE Trans. Comput. 61, 1, 60-72.
    • (2012) IEEE Trans. Comput. , vol.61 , Issue.1 , pp. 60-72
    • Jaiswal, M.K.1    Chandrachoodan, N.2
  • 20
    • 67649887173 scopus 로고    scopus 로고
    • A floating-point unit for 4D vector inner product with reduced latency
    • Kim, D. and Kim. L.-S. 2009. A floating-point unit for 4D vector inner product with reduced latency. IEEE Trans. Comput. 58, 7, 890-901.
    • (2009) IEEE Trans. Comput. , vol.58 , Issue.7 , pp. 890-901
    • Kim, D.1    Kim, L.-S.2
  • 24
    • 0033733825 scopus 로고    scopus 로고
    • Accelerating pipelined integer and floating-point accumulations in configurable hardware with delayed addition techniques
    • Luo, Z. and Martonosi, M. 2000. Accelerating pipelined integer and floating-point accumulations in configurable hardware with delayed addition techniques. IEEE Trans. Comput. 49, 3, 208-218.
    • (2000) IEEE Trans. Comput. , vol.49 , Issue.3 , pp. 208-218
    • Luo, Z.1    Martonosi, M.2
  • 28
    • 84867539182 scopus 로고    scopus 로고
    • Accelerating machine-learning algorithms on FPGAs using pattern-based decomposition
    • Nagarajan, K., Holland, B., George, A., Slatton, A., and Lam, H. 2009. Accelerating machine-learning algorithms on FPGAs using pattern-based decomposition. J. Signal Process. Syst. 60, 1, 1-21.
    • (2009) J. Signal Process. Syst. , vol.60 , Issue.1 , pp. 1-21
    • Nagarajan, K.1    Holland, B.2    George, A.3    Slatton, A.4    Lam, H.5
  • 30
    • 0025210204 scopus 로고
    • Generalized signed-digit number systems: A unifying framework for redundant number representations
    • Parhami, B. 1990. Generalized signed-digit number systems: a unifying framework for redundant number representations. IEEE Trans. Comput. 39, 1, 89-98.
    • (1990) IEEE Trans. Comput. , vol.39 , Issue.1 , pp. 89-98
    • Parhami, B.1
  • 31
    • 0002342184 scopus 로고    scopus 로고
    • INTLAB: Interval Laboratory
    • Tibor Csendes Ed., Kluwer Academic Publishers, Dordrecht
    • Rump, S. M. 1999. INTLAB: INTerval LABoratory. In Developments in Reliable Computing, Tibor Csendes Ed., Kluwer Academic Publishers, Dordrecht, 77-104.
    • (1999) Developments in Reliable Computing , pp. 77-104
    • Rump, S.M.1
  • 32
    • 77956237657 scopus 로고    scopus 로고
    • Ultimately fast accurate summation
    • Rump, S. M. 2009. Ultimately fast accurate summation. SIAM J. Sci. Comput. 31, 5, 3466-3502.
    • (2009) SIAM J. Sci. Comput. , vol.31 , Issue.5 , pp. 3466-3502
    • Rump, S.M.1
  • 33
    • 55049129860 scopus 로고    scopus 로고
    • Accurate floating-point summation Part I: Faithful rounding
    • Rump, S, M., Ogita, T., and Oishi, S. 2008. Accurate floating-point summation Part I: Faithful rounding. SIAM J. Sci. Comput. 31, 1, 189-224.
    • (2008) SIAM J. Sci. Comput. , vol.31 , Issue.1 , pp. 189-224
    • Rump, S.M.1    Ogita, T.2    Oishi, S.3
  • 34
    • 84855422103 scopus 로고    scopus 로고
    • FFT implementation with fused floating-point operations
    • Swartzlander, E., Jr. and Saleh, H. 2012. FFT implementation with fused floating-point operations. IEEE Trans. Comput. 61, 2, 284-288.
    • (2012) IEEE Trans. Comput. , vol.61 , Issue.2 , pp. 284-288
    • Swartzlander Jr., E.1    Saleh, H.2
  • 35
    • 84855352110 scopus 로고    scopus 로고
    • Accelerating matrix operations with improved deeply pipelined vector reduction
    • Tai, Y.-G., Lo, C.-T. D., and Psarris, K. 2012. Accelerating matrix operations with improved deeply pipelined vector reduction. IEEE Trans. Parallel Distrib. Syst. 23, 2, 202-210.
    • (2012) IEEE Trans. Parallel Distrib. Syst. , vol.23 , Issue.2 , pp. 202-210
    • Tai, Y.-G.1    Lo, C.-T.D.2    Psarris, K.3
  • 37
    • 33749527748 scopus 로고    scopus 로고
    • A 6.2-GFLOPS floating-point multiply-accumulator with conditional Normalization
    • Vangal, S. R., Hoskote, Y. V., Borkar, N. Y., and Alvandpour, A. 2006. A 6.2-GFLOPS floating-point multiply-accumulator with conditional normalization. IEEE J. Solid-State Circuits 41, 10, 2314-2323.
    • (2006) IEEE J. Solid-State Circuits , vol.41 , Issue.10 , pp. 2314-2323
    • Vangal, S.R.1    Hoskote, Y.V.2    Borkar, N.Y.3    Alvandpour, A.4
  • 39
    • 84856968087 scopus 로고    scopus 로고
    • VFloat: A variable precision fixed- and floating-point library for reconfigurable hardware
    • Article 16
    • Wang, X. and Leeser, M. 2010. VFloat: A variable precision fixed- and floating-point library for reconfigurable hardware. ACM Trans. Reconfig. Technol. Syst. 3, 3, Article 16.
    • (2010) ACM Trans. Reconfig. Technol. Syst. , vol.3 , Issue.3
    • Wang, X.1    Leeser, M.2
  • 41
    • 47049109081 scopus 로고    scopus 로고
    • High-performance designs for linear algebra operations on reconfigurable hardware
    • Zhuo, L. and Prasanna, V. K. 2008. High-performance designs for linear algebra operations on reconfigurable hardware. IEEE Trans. Comput. 57, 8, 1057-1071.
    • (2008) IEEE Trans. Comput. , vol.57 , Issue.8 , pp. 1057-1071
    • Zhuo, L.1    Prasanna, V.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.