



Volume 18, Issue 4, 2010, Pages 578-590

Improving FPGA performance for carry-save arithmetic

Author keywords

Carry save arithmetic; Field programmable gate array (FPGA); Generalized parallel counter (GPC)

Indexed keywords

AREA UTILIZATION; CARRY-SAVE; DATA-FLOW TRANSFORMATIONS; DSP APPLICATION; ENERGY CONSUMPTION; HARDWARE COMPILERS; MULTIINPUT; MULTIPLY ACCUMULATIONS; PARALLEL COUNTERS; PROGRAMMABLE COUNTER;

EID: 77950296034     PISSN: 10638210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TVLSI.2009.2014380     Document Type: Article
Times cited: (28)

References (45)
  • 1
    • 77950298526
    • A suggestion for a fast multiplier
    • Dec.
    • C.S. Wallace, "A suggestion for a fast multiplier," IEEE Trans. Electron. Comput., vol. EC-13, no.6, p.754, Dec. 1964.
    • (1964) IEEE Trans. Electron. Comput. , vol.EC-13 , Issue.6 , pp. 754
    • Wallace, C.S.1
  • 2
    • 0001342967
    • Some schemes for parallel multipliers
    • Mar.
    • L. Dadda, "Some schemes for parallel multipliers," Alta Freq., vol.34, pp. 349-356, Mar. 1965.
    • (1965) Alta Freq. , vol.34 , pp. 349-356
    • Dadda, L.1
  • 3
    • 52649128342
    • Data-flow transformations to maximize the use of carry-save representation in arithmetic circuits
    • Oct.
    • A. K. Verma, P. Brisk, and P. Ienne, "Data-flow transformations to maximize the use of carry-save representation in arithmetic circuits," IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., vol.27, no.10, pp. 1761-1774, Oct. 2008.
    • (2008) IEEE Trans. Comput.-Aided Design Integr. Circuits Syst. , vol.27 , Issue.10 , pp. 1761-1774
    • Verma, A.K.1    Brisk, P.2    Ienne, P.3
  • 4
    • 49749090125
    • FPGA implementation of high speed FIR filters using add and shift method
    • San Jose, CA, Oct.
    • S. Mirzaei, A. Hosangadi, and R. Kastner, "FPGA implementation of high speed FIR filters using add and shift method," in Proc. Int. Conf. Comput. Des., San Jose, CA, Oct. 2006, pp. 308-313.
    • (2006) Proc. Int. Conf. Comput. Des. , pp. 308-313
    • Mirzaei, S.1    Hosangadi, A.2    Kastner, R.3
  • 7
    • 33749527748
    • A 6.2-Gflops floating-point multiply-accumulator with conditional normalization
    • Oct.
    • S. R. Vangal, Y. V. Hoskote, N. Y. Borkar, and A. Alvandpour, "A 6.2-Gflops floating-point multiply-accumulator with conditional normalization," IEEE J. Solid-State Circuits, vol.41, no.10, pp. 2314-2323, Oct. 2006.
    • (2006) IEEE J. Solid-State Circuits , vol.41 , Issue.10 , pp. 2314-2323
    • Vangal, S.R.1    Hoskote, Y.V.2    Borkar, N.Y.3    Alvandpour, A.4
  • 10
    • 49749108872
    • Improving synthesis of compressor trees on FPGAs via integer linear programming
    • Munich, Germany, Mar.
    • H. Parandeh-Afshar, P. Brisk, and P. Ienne, "Improving synthesis of compressor trees on FPGAs via integer linear programming," in Proc. Int. Conf. Des. Autom. Test Eur., Munich, Germany, Mar. 2008, pp. 1256-1261.
    • (2008) Proc. Int. Conf. Des. Autom. Test Eur. , pp. 1256-1261
    • Parandeh-Afshar, H.1    Brisk, P.2    Ienne, P.3
  • 12
  • 13
    • 0017542921
    • A compact high-speed parallel multiplication scheme
    • Oct.
    • W. J. Stenzel, W. J. Kubitz, and G. H. Garcia, "A compact high-speed parallel multiplication scheme," IEEE Trans. Comput., vol.C-26, no.10, pp. 948-957, Oct. 1977.
    • (1977) IEEE Trans. Comput. , vol.C-26 , Issue.10 , pp. 948-957
    • Stenzel, W.J.1    Kubitz, W.J.2    Garcia, G.H.3
  • 14
    • 77950298633
    • Synthesis of generalized parallel counters
    • Sep.
    • S. Dormido and M. A. Canto, "Synthesis of generalized parallel counters," IEEE Trans. Comput., vol.C-30, no.9, pp. 699-703, Sep. 1981.
    • (1981) IEEE Trans. Comput. , vol.C-30 , Issue.9 , pp. 699-703
    • Dormido, S.1    Canto, M.A.2
  • 15
    • 0020165895
    • An upper bound for the synthesis of generalized parallel counters
    • Aug.
    • S. Dormido and M. A. Canto, "An upper bound for the synthesis of generalized parallel counters," IEEE Trans. Comput., vol.C-31, no.8, pp. 802-805, Aug. 1982.
    • (1982) IEEE Trans. Comput. , vol.C-31 , Issue.8 , pp. 802-805
    • Dormido, S.1    Canto, M.A.2
  • 16
    • 77950300624
    • Altera Corporation, San Jose, CA, Feb. [Online]. Available
    • "Stratix III Device Handbook, Vol. 1 and 2" Altera Corporation, San Jose, CA, Feb. 2009. [Online]. Available: http://www.altera.com/
    • (2009) Stratix III Device Handbook , vol.1-2
  • 17
    • 43249104067
    • Xilinx Corporation, San Jose, CA [Online]. Available
    • "Virtex-5 User Guide" Xilinx Corporation, San Jose, CA, 2007. [Online]. Available: http://www.xilinx.com/
    • (2007) Virtex-5 User Guide
  • 18
    • 77950298967
    • Xilinx Corporation, San Jose, CA, Jan. [Online]. Available
    • "Virtex-5 FPGA Xtreme DSP Design Considerations" Xilinx Corporation, San Jose, CA, Jan. 2009. [Online]. Available: http://www.xilinx.com/
    • (2009) Virtex-5 FPGA Xtreme DSP Design Considerations
  • 20
    • 50149120746
    • Architecting hard crossbars on FPGAs and increasing their area-efficiency with shadow clusters
    • Kitakyushu, Japan, Dec.
    • P. Jamieson and J. Rose, "Architecting hard crossbars on FPGAs and increasing their area-efficiency with shadow clusters," in Proc. IEEE Int. Conf. Field Programmable Technol., Kitakyushu, Japan, Dec. 2007, pp. 57-64.
    • (2007) Proc. IEEE Int. Conf. Field Programmable Technol. , pp. 57-64
    • Jamieson, P.1    Rose, J.2
  • 24
    • 0029701117
    • DP-FPGA: An FPGA architecture optimized for datapaths
    • D. Cherepacha and D. Lewis, "DP-FPGA: An FPGA architecture optimized for datapaths," VLSI Des., vol.4, no.4, pp. 329-343, 1996.
    • (1996) VLSI Des. , vol.4 , Issue.4 , pp. 329-343
    • Cherepacha, D.1    Lewis, D.2
  • 28
    • 46249097191
    • Multi-bit carry chains for high-performance reconfigurable fabrics
    • Madrid, Spain, Aug.
    • M. T. Frederick and A. K. Somani, "Multi-bit carry chains for high-performance reconfigurable fabrics," in Proc. Int. Conf. Field Programmable Logic Appl., Madrid, Spain, Aug. 2006, pp. 1-6.
    • (2006) Proc. Int. Conf. Field Programmable Logic Appl. , pp. 1-6
    • Frederick, M.T.1    Somani, A.K.2
  • 30
    • 0034449694
    • Configurable arithmetic arrays with data-driven control
    • Pacific Grove, CA, Oct./Nov.
    • B. Parhami, "Configurable arithmetic arrays with data-driven control," in Proc. Asilomar Conf. Signals, Syst., Comput., Pacific Grove, CA, Oct./Nov. 2000, pp. 89-93.
    • (2000) Proc. Asilomar Conf. Signals, Syst., Comput. , pp. 89-93
    • Parhami, B.1
  • 31
    • 44149086928
    • A network of time-division multiplexed wiring for FPGAs
    • Apr. Newcastle University, U.K.
    • R. Francis, S. Moore, and R. Mullins, "A network of time-division multiplexed wiring for FPGAs," in Proc. 2nd IEEE Symp. Networks-on-Chip, Apr. 2008, pp. 35-44, Newcastle University, U.K.
    • (2008) Proc. 2nd IEEE Symp. Networks-on-Chip , pp. 35-44
    • Francis, R.1    Moore, S.2    Mullins, R.3
  • 32
    • 84956852816
    • A novel field programmable gate array architecture for high speed arithmetic processing
    • Tallinn, Estonia, Aug./Sep.
    • N. L. Miller and S. F. Quigley, "A novel field programmable gate array architecture for high speed arithmetic processing," in Proc. 8th Int. Workshop Field-Programmable Logic Appl., Tallinn, Estonia, Aug./Sep. 1998, pp. 386-390.
    • (1998) Proc. 8th Int. Workshop Field-Programmable Logic Appl. , pp. 386-390
    • Miller, N.L.1    Quigley, S.F.2
  • 34
    • 0023799446
    • The reconfigurable arithmetic processor
    • Honolulu, HI, May/Jun.
    • S. Fiske and W. J. Dally, "The reconfigurable arithmetic processor," in Proc. 15th Int. Symp. Comput. Archit., Honolulu, HI, May/Jun. 1988, pp. 30-36.
    • (1988) Proc. 15th Int. Symp. Comput. Archit. , pp. 30-36
    • Fiske, S.1    Dally, W.J.2
  • 35
    • 36849022584
    • A 5-GHz mesh interconnect for a teraflops processor
    • Sep./Oct.
    • Y. Hoskote, S. Vangal, A. Singh, N. Borkar, and S. Borkar, "A 5-GHz mesh interconnect for a teraflops processor," IEEE Micro, vol.27, no.5, pp. 51-61, Sep./Oct. 2007.
    • (2007) IEEE Micro , vol.27 , Issue.5 , pp. 51-61
    • Hoskote, Y.1    Vangal, S.2    Singh, A.3    Borkar, N.4    Borkar, S.5
  • 36
    • 30544444390
    • Technology mapping and architecture evaluation for k/m-macrocell-based FPGAs
    • Jan.
    • J. Cong and H. Huang, "Technology mapping and architecture evaluation for k/m-macrocell-based FPGAs," ACM Trans. Des. Autom. Electron. Syst., vol.10, no.1, pp. 3-23, Jan. 2005.
    • (2005) ACM Trans. Des. Autom. Electron. Syst. , vol.10 , Issue.1 , pp. 3-23
    • Cong, J.1    Huang, H.2
  • 37
    • 50249128247
    • Design, synthesis and evaluation of heterogeneous FPGA with mixed LUTs and macro-gates
    • San Jose, CA, Nov.
    • Y. Hu, S. Das, S. Trimberger, and L. He, "Design, synthesis and evaluation of heterogeneous FPGA with mixed LUTs and macro-gates," in Proc. Int. Conf. Comput.-Aided Des., San Jose, CA, Nov. 2007, pp. 188-193.
    • (2007) Proc. Int. Conf. Comput.-Aided Des. , pp. 188-193
    • Hu, Y.1    Das, S.2    Trimberger, S.3    He, L.4
  • 38
    • 84957870821
    • VPR: A new packing, placement, and routing tool for FPGA research
    • London, U.K., Sep.
    • V. Betz and J. Rose, "VPR: A new packing, placement, and routing tool for FPGA research," in Proc. 7th Int. Workshop Field-Programmable Logic Appl., London, U.K., Sep. 1997, pp. 213-222.
    • (1997) Proc. 7th Int. Workshop Field-Programmable Logic Appl. , pp. 213-222
    • Betz, V.1    Rose, J.2
  • 40
    • 30544455212
    • A detailed power model for field-programmable gate arrays
    • Apr.
    • K. K. W. Poon, S. J. E. Wilton, and A. Yan, "A detailed power model for field-programmable gate arrays," ACM Trans. Des. Autom. Electron. Syst., vol.10, no.2, pp. 279-302, Apr. 2005.
    • (2005) ACM Trans. Des. Autom. Electron. Syst. , vol.10 , Issue.2 , pp. 279-302
    • Poon, K.K.W.1    Wilton, S.J.E.2    Yan, A.3
  • 41
    • 43749091973
    • Activity-based power estimation and characterization of DSP and multiplier blocks in FPGAs
    • Bangkok, Thailand, Dec.
    • N. C. K. Choy and S. J. E. Wilton, "Activity-based power estimation and characterization of DSP and multiplier blocks in FPGAs," in Proc. IEEE Int. Conf. Field Programmable Technol., Bangkok, Thailand, Dec. 2006, pp. 253-256.
    • (2006) Proc. IEEE Int. Conf. Field Programmable Technol. , pp. 253-256
    • Choy, N.C.K.1    Wilton, S.J.E.2
  • 43
    • 0028711580
    • A survey of power estimation techniques in VLSI circuits
    • Dec.
    • F. N. Najm, "A survey of power estimation techniques in VLSI circuits," IEEE Trans. Very Large Scale Integr.(VLSI) Syst., vol.2, no.4, pp. 446-455, Dec. 1994.
    • (1994) IEEE Trans. Very Large Scale Integr.(VLSI) Syst. , vol.2 , Issue.4 , pp. 446-455
    • Najm, F.N.1
  • 44
    • 0031339427
    • MediaBench: A tool for evaluating and synthesizing multimedia and communications systems
    • Research Triangle Park, NC, Dec.
    • C. Lee, M. Potkonjak, and W. H. Mangione-Smith, "MediaBench: A tool for evaluating and synthesizing multimedia and communications systems," in Proc. 30th Int. Symp. Microarchitecture, Research Triangle Park, NC, Dec. 1997, pp. 330-335.
    • (1997) Proc. 30th Int. Symp. Microarchitecture , pp. 330-335
    • Lee, C.1    Potkonjak, M.2    Mangione-Smith, W.H.3
  • 45


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.