메뉴 건너뛰기




Volumn 153, Issue 4, 2006, Pages 249-260

Exploiting mixed-mode parallelism for matrix operations on the HERA architecture through reconfiguration

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTATION THEORY; COMPUTER ARCHITECTURE; DATA STORAGE EQUIPMENT; FIELD PROGRAMMABLE GATE ARRAYS; LOGIC DESIGN; SCHEDULING;

EID: 33745683356     PISSN: 13502387     EISSN: None     Source Type: Journal    
DOI: 10.1049/ip-cdt:20045136     Document Type: Article
Times cited : (15)

References (43)
  • 1
    • 2442598143 scopus 로고    scopus 로고
    • FPGAs vs. CPUs: Trends in peak floating-point performance
    • Monterey, CA, February
    • Underwood, K.: ' FPGAs vs. CPUs: trends in peak floating-point performance ', 12th ACM/SIGDA Int. Symp. on Field Programmable Gate Arrays, Monterey, CA, February 2004, p. 171-180
    • (2004) ACM/SIGDA Int. Symp. on Field Programmable Gate Arrays , pp. 171-180
    • Underwood, K.1
  • 2
    • 12444290912 scopus 로고    scopus 로고
    • Scalable and modular algorithms for floating-point matrix multiplication on FPGAs
    • April
    • Zhuo, L., and Prasanna, V.K.: ' Scalable and modular algorithms for floating-point matrix multiplication on FPGAs ', 18th Int. Parallel and Distributed Processing Symp., April 2004, p. 92-101
    • (2004) Int. Parallel and Distributed Processing Symp. , pp. 92-101
    • Zhuo, L.1    Prasanna, V.K.2
  • 5
    • 0012561327 scopus 로고    scopus 로고
    • Coming challenges in microarchitecture and architecture
    • Ronen, R., Mendelson, A., Lai, K., Lu, S.-L., Pollack, F., and Shen, J.: ' Coming challenges in microarchitecture and architecture ', Proc. IEEE, 2001, 89, (3), p. 325-340
    • (2001) Proc. IEEE , vol.89 , Issue.3 , pp. 325-340
    • Ronen, R.1    Mendelson, A.2    Lai, K.3    Lu, S.-L.4    Pollack, F.5    Shen, J.6
  • 8
    • 0000227930 scopus 로고    scopus 로고
    • Reconfigurable computing: A survey of systems and software
    • Compton, K., and Hauck, S.: ' Reconfigurable computing: a survey of systems and software ', ACM Comput. Surv., 2002, 34, (2), p. 171-210
    • (2002) ACM Comput. Surv. , vol.34 , Issue.2 , pp. 171-210
    • Compton, K.1    Hauck, S.2
  • 9
    • 33745715748 scopus 로고    scopus 로고
    • Synthesizable reconfigurable array targeting distributed arithmetic for system-on-chip applications
    • Khawam, S., Arslan, T., and Westall, F.: ' Synthesizable reconfigurable array targeting distributed arithmetic for system-on-chip applications ', 12th Reconfigurable Architectures Workshop, 2004
    • (2004) Reconfigurable Architectures Workshop
    • Khawam, S.1    Arslan, T.2    Westall, F.3
  • 10
    • 3543135955 scopus 로고    scopus 로고
    • Evolutionary algorithm for the promotion of evolvable hardware
    • Tyrrell, A.M., Krohling, R.A., and Zhou, Y.: ' Evolutionary algorithm for the promotion of evolvable hardware ', IEE Proc., Comput. Digit. Tech., 2004, 151, (4), p. 267-275
    • (2004) IEE Proc., Comput. Digit. Tech. , vol.151 , Issue.4 , pp. 267-275
    • Tyrrell, A.M.1    Krohling, R.A.2    Zhou, Y.3
  • 13
    • 0040674218 scopus 로고    scopus 로고
    • Mixed-mode system heterogeneous computing
    • Eshaghian, M.M., Heterogeneous computing, Artech House, Norwood, MA
    • Siegel, H.J., Maheswaran, M., Watson, D.W., Antonio, J.K., and Atallah, M.J.: ' Mixed-mode system heterogeneous computing ', Eshaghian, M.M., Heterogeneous computing, (Artech House, Norwood, MA 1996), p. 19-65
    • (1996) Heterogeneous Computing , pp. 19-65
    • Siegel, H.J.1    Maheswaran, M.2    Watson, D.W.3    Antonio, J.K.4    Atallah, M.J.5
  • 16
    • 1842533207 scopus 로고    scopus 로고
    • Parallel LU factorization of sparse matrices on FPGA-based configurable computing engines
    • Wang, X., and Ziavras, S.G.: ' Parallel LU factorization of sparse matrices on FPGA-based configurable computing engines ', Concurrency Comput. Pract. Exp., 2004, 16, (4), p. 319-343
    • (2004) Concurrency Comput. Pract. Exp. , vol.16 , Issue.4 , pp. 319-343
    • Wang, X.1    Ziavras, S.G.2
  • 18
    • 11844292749 scopus 로고    scopus 로고
    • HERA: A reconfigurable and mixed-mode parallel computing engine on platform FPGAs
    • Boston, Massachusetts, November
    • Wang, X., and Ziavras, S.G.: ' HERA: a reconfigurable and mixed-mode parallel computing engine on platform FPGAs ', 16th Int. Conf. Parallel and Distributed Computing and Systems, Boston, Massachusetts, November 9-11 2004, p. 374-379
    • (2004) Int. Conf. Parallel and Distributed Computing and Systems , pp. 374-379
    • Wang, X.1    Ziavras, S.G.2
  • 19
    • 0035341885 scopus 로고    scopus 로고
    • Reconfigurable computing and digital signal processing: A survey
    • Tessier, R., and Burleson, W.: ' Reconfigurable computing and digital signal processing: a survey ', J. VLSI Signal Process., 2001, 28, (1-2), p. 7-27
    • (2001) J. VLSI Signal Process. , vol.28 , Issue.1-2 , pp. 7-27
    • Tessier, R.1    Burleson, W.2
  • 21
  • 25
    • 0034187952 scopus 로고    scopus 로고
    • MorphoSys: An integrated reconfigurable system for data-parallel and computation-Intensive Applications
    • Singh, H., Lee, M.-H., Lu, G., Kurdahi, F.J., Bagherzadeh, N., and Filho, E.M.C.: ' MorphoSys: an integrated reconfigurable system for data-parallel and computation-Intensive Applications ', IEEE Trans. Comput., 2000, 49, (5), p. 465-481
    • (2000) IEEE Trans. Comput. , vol.49 , Issue.5 , pp. 465-481
    • Singh, H.1    Lee, M.-H.2    Lu, G.3    Kurdahi, F.J.4    Bagherzadeh, N.5    Filho, E.M.C.6
  • 27
    • 0030394522 scopus 로고    scopus 로고
    • MATRIX: A reconfigurable computing architecture with configurable instruction distribution and deployable resources
    • Mirsky, E., and DeHon, A.: ' MATRIX: a reconfigurable computing architecture with configurable instruction distribution and deployable resources ', 1996 IEEE Symp. FPGAs for Custom Computing Machines, 1996, p. 157-166
    • (1996) 1996 IEEE Symp. FPGAs for Custom Computing Machines , pp. 157-166
    • Mirsky, E.1    Dehon, A.2
  • 30
    • 85086484200 scopus 로고    scopus 로고
    • A multiprocessor-on-a-programmable-chip reconfigurable system for matrix operations with power-grid case studies
    • Special Issue on Parallel and Distrib. Sci. Eng. Comput., in Press
    • Wang, X., and Ziavras, S.G.: ' A multiprocessor-on-a-programmable-chip reconfigurable system for matrix operations with power-grid case studies ', Int. J. Comput. Sci. Eng., Special Issue on Parallel and Distrib. Sci. Eng. Comput., in press
    • Int. J. Comput. Sci. Eng.
    • Wang, X.1    Ziavras, S.G.2
  • 31
    • 33745687464 scopus 로고    scopus 로고
    • Annapolis Micro Systems, Inc., Available at http://www.annapmicro.com/
  • 32
    • 33745687465 scopus 로고    scopus 로고
    • Codito Technologies Pvt. Ltd. Available at: http://www.codito.com/prodtech_framework.html
  • 33
    • 33745715747 scopus 로고    scopus 로고
    • OpenMP. Available at: http://www.openmp.org
  • 34
    • 33947401902 scopus 로고    scopus 로고
    • Coprocessor design to support MPI primitives in configurable multiprocessors
    • in Press
    • Ziavras, S.G., Gerbessiotis, A., and Bafna, R.: ' Coprocessor design to support MPI primitives in configurable multiprocessors ', Integr. VLSI J., in press
    • Integr. VLSI J.
    • Ziavras, S.G.1    Gerbessiotis, A.2    Bafna, R.3
  • 35
    • 33745687448 scopus 로고    scopus 로고
    • A framework for dynamic resource management and scheduling on reconfigurable mixed-mode multiprocessors
    • Singapore, December
    • Wang, X., and Ziavras, S.G.: ' A framework for dynamic resource management and scheduling on reconfigurable mixed-mode multiprocessors ', IEEE Int. Conf. on Field-Programmable Technology, Singapore, December 2005, p. 51-58
    • (2005) IEEE Int. Conf. on Field-Programmable Technology , pp. 51-58
    • Wang, X.1    Ziavras, S.G.2
  • 36
    • 33745701672 scopus 로고    scopus 로고
    • Tensilica. Available at: http://tensilica.com
  • 37
    • 0036709503 scopus 로고    scopus 로고
    • Reconfigurable instruction set processors from a hardware/software perspective
    • Barat, F., Rudy, L., and Geert, D.: ' Reconfigurable instruction set processors from a hardware/software perspective ', IEEE Trans. Softw. Eng., 2002, 28, (9), p. 847-862
    • (2002) IEEE Trans. Softw. Eng. , vol.28 , Issue.9 , pp. 847-862
    • Barat, F.1    Rudy, L.2    Geert, D.3
  • 38
    • 33745701666 scopus 로고    scopus 로고
    • Automatic performance tuning of linear algebra kernels
    • http://bebop.cs.berkeley.edu/pubs/SciDAC_250102.pdf
    • Demmel, J., and Yelick, K.: ' Automatic performance tuning of linear algebra kernels ', TOPS-SciDAC (http://www.tops-scidac.org), 2002 January), Available at: http://bebop.cs.berkeley.edu/pubs/SciDAC_250102.pdf
    • (2002) TOPS-SciDAC
    • Demmel, J.1    Yelick, K.2
  • 39
    • 33745701670 scopus 로고    scopus 로고
    • Intel Math Kernel Library (MKL) 8.0. Available at: http://www.intel.com/cd/software/products/asmo-na/eng/perflib/mkl/219823.htm
  • 40
    • 84858903302 scopus 로고    scopus 로고
    • Accelerating blocked matrix-matrix multiplication using a software-managed memory hierarchy with DMA
    • Wunderlich, R., Püschel, M., and Hoe, J.: ' Accelerating blocked matrix-matrix multiplication using a software-managed memory hierarchy with DMA ', High Performance Embedded Computing Workshop, 2005, MIT
    • (2005) High Performance Embedded Computing Workshop
    • Wunderlich, R.1    Püschel, M.2    Hoe, J.3
  • 41
    • 0017636064 scopus 로고
    • An efficient heuristic cluster algorithm for tearing large-scale networks
    • Sangiovanni-Vincentelli, A., Chen, L.K., and Chua, L.O.: ' An efficient heuristic cluster algorithm for tearing large-scale networks ', IEEE Trans. Circuits Syst., 1977, 24, (12), p. 709-717
    • (1977) IEEE Trans. Circuits Syst. , vol.24 , Issue.12 , pp. 709-717
    • Sangiovanni-Vincentelli, A.1    Chen, L.K.2    Chua, L.O.3
  • 42
    • 33745687456 scopus 로고    scopus 로고
    • Matrix Market, Available at: http://math.nist.gov/MatrixMarket/
  • 43
    • 70349183865 scopus 로고    scopus 로고
    • TMS320C6711/11B/11C/11D
    • TMS320C6711/11B/11C/11D Floating-Point Digital Signal Processors. Available at: http://focus.ti.com/docs/prod/folders/print/tms320c6711.html
    • Floating-Point Digital Signal Processors


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.