-
1
-
-
47049106841
-
-
Xilinx Incorporated
-
Xilinx Incorporated, http://www.xilinx.com, 2008.
-
(2008)
-
-
-
5
-
-
2442575888
-
A Quantitative Analysis of the Speedup Factors of FPGAs over Processors
-
Feb
-
Z. Guo, W. Najjar, F. Vahid, and K. Vissers, "A Quantitative Analysis of the Speedup Factors of FPGAs over Processors," Proc. 12th ACM/SIGDA Int'l Symp. Field Programmable Gate Arrays, pp. 162-170, Feb. 2004.
-
(2004)
Proc. 12th ACM/SIGDA Int'l Symp. Field Programmable Gate Arrays
, pp. 162-170
-
-
Guo, Z.1
Najjar, W.2
Vahid, F.3
Vissers, K.4
-
6
-
-
47049092496
-
Reconfigurable Computing with Multiscale Data Fusion for Remote Sensing
-
Feb
-
V. Aggarwal, A. George, and K. Slatton, "Reconfigurable Computing with Multiscale Data Fusion for Remote Sensing," Proc. 14th ACM/SIGDA Int'l Symp. Field Programmable Gate Arrays, p. 235, Feb. 2006.
-
(2006)
Proc. 14th ACM/SIGDA Int'l Symp. Field Programmable Gate Arrays
, p. 235
-
-
Aggarwal, V.1
George, A.2
Slatton, K.3
-
10
-
-
47049130960
-
-
Cray Inc
-
Cray Inc., http://www.cray.com/, 2008.
-
(2008)
-
-
-
11
-
-
47049098886
-
-
SRC Computers, Inc
-
SRC Computers, Inc., http://www.srccomp.com/, 2008.
-
(2008)
-
-
-
12
-
-
47049109366
-
-
Silicon Graphics, Inc
-
Silicon Graphics, Inc., http://www.sgi.com/, 2008.
-
(2008)
-
-
-
13
-
-
24944539760
-
High-Performance Algorithm Engineering for Parallel Computation
-
D. Bader, B. Moret, and P. Sanders, "High-Performance Algorithm Engineering for Parallel Computation," Lecture Notes in Computer Science, vol. 2547, pp. 1-23, 2002.
-
(2002)
Lecture Notes in Computer Science
, vol.2547
, pp. 1-23
-
-
Bader, D.1
Moret, B.2
Sanders, P.3
-
14
-
-
0018515759
-
Basic Linear Algebra Subprograms for FORTRAN Usage
-
C. Lawson, R. Hanson, D. Kincaid, and F. Krogh, "Basic Linear Algebra Subprograms for FORTRAN Usage," ACM Trans. Math. Software, vol. 5, no. 3, pp. 308-323, 1979.
-
(1979)
ACM Trans. Math. Software
, vol.5
, Issue.3
, pp. 308-323
-
-
Lawson, C.1
Hanson, R.2
Kincaid, D.3
Krogh, F.4
-
17
-
-
0003473816
-
-
second ed. SIAM
-
R. Barrett, M. Berry, T.F. Chan, J. Demmel, J. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. van der Vorst, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, second ed. SIAM, 1994.
-
(1994)
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods
-
-
Barrett, R.1
Berry, M.2
Chan, T.F.3
Demmel, J.4
Donato, J.5
Dongarra, J.6
Eijkhout, V.7
Pozo, R.8
Romine, C.9
van der Vorst, H.10
-
20
-
-
0343462141
-
-
-
R.C. Whaley, A. Petitet, and J.J. Dongarra, "Automated Empirical Optimization of Software and the ATLAS Project," Parallel Computing vol. 27, nos. 1-2, pp. 3-35, also available as Univ. of Tennessee LAPACK Working Note #147, UT-CS-00-448, 2000 (www.netlib.org/lapack/lawns/lawn147.ps), 2001.
-
-
-
-
21
-
-
0003706460
-
-
Aug. 1999
-
E. Anderson, Z. Bai, C. Bischof, S. Blackford, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen, "LAPACK User's Guide Third Edition," http://www.netlib.org/lapack/lug/lapack_lug.html, Aug. 1999.
-
LAPACK User's Guide Third Edition
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Blackford, S.4
Demmel, J.5
Dongarra, J.6
Du Croz, J.7
Greenbaum, A.8
Hammarling, S.9
McKenney, A.10
Sorensen, D.11
-
22
-
-
0003615167
-
-
SIAM
-
L.S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, and R.C. Whaley, ScaLAPACK Users' Guide, SIAM, 1997.
-
(1997)
ScaLAPACK Users' Guide
-
-
Blackford, L.S.1
Choi, J.2
Cleary, A.3
D'Azevedo, E.4
Demmel, J.5
Dhillon, I.6
Dongarra, J.7
Hammarling, S.8
Henry, G.9
Petitet, A.10
Stanley, K.11
Walker, D.12
Whaley, R.C.13
-
23
-
-
0031221523
-
Parallel Implementation of BLAS: General Techniques for Level 3 BLAS
-
A. Chtchelkanova, J. Gunnels, G. Morrow, J. Overfelt, and R. van de Geijn, "Parallel Implementation of BLAS: General Techniques for Level 3 BLAS," Concurrency: Practice and Experience, vol. 9, no. 9, pp. 837-857, 1997.
-
(1997)
Concurrency: Practice and Experience
, vol.9
, Issue.9
, pp. 837-857
-
-
Chtchelkanova, A.1
Gunnels, J.2
Morrow, G.3
Overfelt, J.4
van de Geijn, R.5
-
24
-
-
0000227930
-
Reconfigurable Computing: A Survey of Systems and Software
-
June
-
K. Compton and S. Hauck, "Reconfigurable Computing: A Survey of Systems and Software," ACM Computing Surveys, vol. 34, no. 2, pp. 171-210, June 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.2
, pp. 171-210
-
-
Compton, K.1
Hauck, S.2
-
27
-
-
34147177681
-
Using FPGA Devices to Accelerate Biomolecular Simulations
-
Mar
-
S. Alam, P. Agarwal, M. Smith, J. Vetter, and D. Caliga, "Using FPGA Devices to Accelerate Biomolecular Simulations," Computer, vol. 40, no. 3, pp. 66-73, Mar. 2007.
-
(2007)
Computer
, vol.40
, Issue.3
, pp. 66-73
-
-
Alam, S.1
Agarwal, P.2
Smith, M.3
Vetter, J.4
Caliga, D.5
-
28
-
-
0033488513
-
Optimizing FPGA-Based Vector Product Designs
-
Apr
-
D. Benyamin, W. Luk, and J. Villasenor, "Optimizing FPGA-Based Vector Product Designs," Proc. Seventh Ann. IEEE Symp. Field-Programmable Custom Computing Machines, pp. 188-197, Apr. 1999.
-
(1999)
Proc. Seventh Ann. IEEE Symp. Field-Programmable Custom Computing Machines
, pp. 188-197
-
-
Benyamin, D.1
Luk, W.2
Villasenor, J.3
-
31
-
-
20344376214
-
64-Bit Floating-Point FPGA Matrix Multiplication
-
Feb
-
Y. Dou, S. Vassiliadis, G. Kuzmanov, and G. Gaydadjiev, "64-Bit Floating-Point FPGA Matrix Multiplication," Proc. 13th ACM/SIGDA Int'l Symp. Field Programmable Gate Arrays, Feb. 2005.
-
(2005)
Proc. 13th ACM/SIGDA Int'l Symp. Field Programmable Gate Arrays
-
-
Dou, Y.1
Vassiliadis, S.2
Kuzmanov, G.3
Gaydadjiev, G.4
-
35
-
-
34147110975
-
Sparse Matrix-Vector Multiplication Kernel on a Reconfigurable Computer
-
Sept
-
S. Akella, M. Smith, R. Mills, S. Alam, R. Barrett, and J. Vetter, "Sparse Matrix-Vector Multiplication Kernel on a Reconfigurable Computer," Proc. Workshop High Performance Embedded Computing, Sept. 2005.
-
(2005)
Proc. Workshop High Performance Embedded Computing
-
-
Akella, S.1
Smith, M.2
Mills, R.3
Alam, S.4
Barrett, R.5
Vetter, J.6
-
36
-
-
12444323064
-
A High-Performance and Energy-Efficient Architecture for Floating-Point Based LU Decomposition on FPGAs
-
June
-
G. Govindu, S. Choi, V.K. Prasanna, V. Daga, S. Gangadharpalli, and V. Sridhar, "A High-Performance and Energy-Efficient Architecture for Floating-Point Based LU Decomposition on FPGAs," Proc. Int'l Conf. Eng. Reconfigurable Systems and Algorithms, June 2004.
-
(2004)
Proc. Int'l Conf. Eng. Reconfigurable Systems and Algorithms
-
-
Govindu, G.1
Choi, S.2
Prasanna, V.K.3
Daga, V.4
Gangadharpalli, S.5
Sridhar, V.6
-
37
-
-
47049096571
-
-
-
V. Daga, G. Govindu, S. Gangadharpalli, V. Sridhar, and V.K. Prasanna, "Efficient Floating-Point Based Block LU Decomposition on FPGAs," Proc. Int'l Conf. Eng. Reconfigurable Systems and Algorithms, June 2004.
-
-
-
-
41
-
-
34047144377
-
Scalable and Modular Algorithms for Floating-Point Matrix Multiplication on Reconfigurable Computing Systems
-
Apr
-
L. Zhuo and V. Prasanna, "Scalable and Modular Algorithms for Floating-Point Matrix Multiplication on Reconfigurable Computing Systems," IEEE Trans. Parallel and Distributed Systems, vol. 18, no. 4, pp. 433-448, Apr. 2007.
-
(2007)
IEEE Trans. Parallel and Distributed Systems
, vol.18
, Issue.4
, pp. 433-448
-
-
Zhuo, L.1
Prasanna, V.2
-
42
-
-
0004116989
-
-
second ed. The MIT Press
-
T.H. Cormen, C.E. Leiserson, R.L. Rivest, and C. Stein, Introduction to Algorithms, second ed. The MIT Press, 2001.
-
(2001)
Introduction to Algorithms
-
-
Cormen, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
Stein, C.4
-
43
-
-
85064764845
-
Out of Core, Out of Mind: Practical Parallel I/O
-
D. Womble, D. Greenberg, R. Riesen, and S. Wheat, "Out of Core, Out of Mind: Practical Parallel I/O," Proc. Scalable Parallel Libraries Conf., pp. 10-16, citeseer.ist.psu.edu/womble93out.html, 1993.
-
(1993)
Proc. Scalable Parallel Libraries Conf
, pp. 10-16
-
-
Womble, D.1
Greenberg, D.2
Riesen, R.3
Wheat, S.4
-
44
-
-
47049101775
-
-
Mentor Graphics Corp
-
Mentor Graphics Corp., http://www.mentor.com/, 2008.
-
(2008)
-
-
-
45
-
-
47049114782
-
-
AMD Core Math Library, http://developer.amd.com/acml.aspx, 2008.
-
(2008)
AMD Core Math Library
-
-