-
1
-
-
84867544211
-
-
ALTERA CORPORATION. 2008. http://www.altera.com/literature/.
-
(2008)
-
-
-
2
-
-
2442575887
-
An FPGA implementation of the two-dimensional finite-difference time-domain (FDTD) algorithm
-
CHEN, W., KOSMAS, P., LEESER, M., AND RAPPAPORT, C. 2004. An FPGA implementation of the two-dimensional finite-difference time-domain (FDTD) algorithm. In Proceedings of the ACM/SIGDA 12th International Symposium on Field Programmable Gate Arrays (FPGA'04). 213-222.
-
(2004)
Proceedings of the ACM/SIGDA 12th International Symposium on Field Programmable Gate Arrays (FPGA'04)
, pp. 213-222
-
-
Chen, W.1
Kosmas, P.2
Leeser, M.3
Rappaport, C.4
-
4
-
-
0000227930
-
Reconfigurable computing: A survey of systems and software
-
COMPTON, K. AND HAUCK, S. 2002. Reconfigurable computing: A survey of systems and software. ACM Comput. Surv. 34, 2, 171-210.
-
(2002)
ACM Comput. Surv.
, vol.34
, Issue.2
, pp. 171-210
-
-
Compton, K.1
Hauck, S.2
-
6
-
-
20344376214
-
64-bit floating-point FPGA matrix multiplication
-
DOU, Y., VASSILIADIS, S., KUZMANOV, G. K., AND GAYDADJIEV, G. N. 2005. 64-bit floating-point FPGA matrix multiplication. In Proceedings of the International Symposium on Field-Programmable Gate Arrays. 86-95.
-
(2005)
Proceedings of the International Symposium on Field-Programmable Gate Arrays
, pp. 86-95
-
-
Dou, Y.1
Vassiliadis, S.2
Kuzmanov, G.K.3
Gaydadjiev, G.N.4
-
7
-
-
17844401837
-
FPGA-based acceleration of the 3D finite-difference time-domain method
-
DURBANO, J. P., ORTIZ, F. E., HUMPHREY, J. R., CURT, P. F., AND PRATHER, D. W. 2004. FPGA-based acceleration of the 3D finite-difference time-domain method. In Proceedings of the 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines. 156-163.
-
(2004)
Proceedings of the 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
, pp. 156-163
-
-
Durbano, J.P.1
Ortiz, F.E.2
Humphrey, J.R.3
Curt, P.F.4
Prather, D.W.5
-
8
-
-
0032632125
-
Computational RAM: Implementing processors in memory
-
ELLIOTT, D. G., STUMM, M., SNELGROVE, W., COJOCARU, C., AND MCKENZIE, R. 1999. Computational RAM: Implementing processors in memory. Des. Test Comput. 16, 1, 32-41.
-
(1999)
Des. Test Comput.
, vol.16
, Issue.1
, pp. 32-41
-
-
Elliott, D.G.1
Stumm, M.2
Snelgrove, W.3
Cojocaru, C.4
McKenzie, R.5
-
10
-
-
33746313271
-
Power and energy profiling of scientific applications on distributed systems
-
IEEE Computer Society Press, Los Alamitos, CA
-
FENG, X., GE, R., AND CAMERON, K. W. 2005. Power and energy profiling of scientific applications on distributed systems. In Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium. IEEE Computer Society Press, Los Alamitos, CA.
-
(2005)
Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium
-
-
Feng, X.1
Ge, R.2
Cameron, K.W.3
-
14
-
-
18644369929
-
Accelerating seismic migration using FPGA-based coprocessor platform
-
IEEE Computer Society Press, Los Alamitos, CA
-
HE, C., LU, M., AND SUN, C. 2004. Accelerating seismic migration using FPGA-based coprocessor platform. In Proceedings of the 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines. IEEE Computer Society Press, Los Alamitos, CA. 207-216.
-
(2004)
Proceedings of the 12th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
, pp. 207-216
-
-
He, C.1
Lu, M.2
Sun, C.3
-
15
-
-
33746130849
-
Time domain numerical simulation for transient waves on reconfigurable coprocessor platform
-
IEEE Computer Society Press, Los Alamitos, CA
-
HE, C., ZHAO, W., AND LU, M. 2005. Time domain numerical simulation for transient waves on reconfigurable coprocessor platform. In Proceedings of the 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines. IEEE Computer Society Press, Los Alamitos, CA. 127-136.
-
(2005)
Proceedings of the 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
, pp. 127-136
-
-
He, C.1
Zhao, W.2
Lu, M.3
-
17
-
-
84976696809
-
PACS: A parallel microprocessor array for scientific calculations
-
HOSHINO, T., KAWAI, T., SHIRAKAWA, T., HIGASHINO, J., YAMAOKA, A., ITO, H., SATO, T., AND SAWADA, K. 1983. PACS: A parallel microprocessor array for scientific calculations. ACM Trans. Comput. Syst. 1, 3, 195-221.
-
(1983)
ACM Trans. Comput. Syst.
, vol.1
, Issue.3
, pp. 195-221
-
-
Hoshino, T.1
Kawai, T.2
Shirakawa, T.3
Higashino, J.4
Yamaoka, A.5
Ito, H.6
Sato, T.7
Sawada, K.8
-
18
-
-
0027693790
-
General-purpose systolic arrays
-
JOHNSON, K. T., HURSON, A., AND SHIRAZI, B. 1993. General-purpose systolic arrays. Computer 26, 11, 20-31.
-
(1993)
Computer
, vol.26
, Issue.11
, pp. 20-31
-
-
Johnson, K.T.1
Hurson, A.2
Shirazi, B.3
-
20
-
-
46549092989
-
Application of a fractional-step method to incompressible Navier-Stokes equations
-
KIM, J. AND MOIN, P. 1985. Application of a fractional-step method to incompressible Navier-Stokes equations. J. Comput. Physics 59, 308-323.
-
(1985)
J. Comput. Physics
, vol.59
, pp. 308-323
-
-
Kim, J.1
Moin, P.2
-
21
-
-
0019923189
-
Why systolic architecture?
-
KUNG, H. T. 1982. Why systolic architecture? Computer 15, 1, 37-46.
-
(1982)
Computer
, vol.15
, Issue.1
, pp. 37-46
-
-
Kung, H.T.1
-
22
-
-
63049107218
-
Exploiting memory hierarchy for a computational fluid dynamics accelerator on FPGAs
-
MORISHITA, H., OSANA, Y., FUJITA, N., AND AMANO, H. 2008. Exploiting memory hierarchy for a computational fluid dynamics accelerator on FPGAs. In Proceedings of the International Conference on Field-Programmable Technology (FPT'08). 193-200.
-
(2008)
Proceedings of the International Conference on Field-Programmable Technology (FPT'08)
, pp. 193-200
-
-
Morishita, H.1
Osana, Y.2
Fujita, N.3
Amano, H.4
-
25
-
-
34547427510
-
A scalable FPGA-based multiprocessor
-
PATEL, A., MADILL, C. A., SALDANA, M., COMIS, C., POMES, R., AND CHOW, P. 2006. A scalable FPGA-based multiprocessor. In Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines. 111-120.
-
(2006)
Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
, pp. 111-120
-
-
Patel, A.1
Madill, C.A.2
Saldana, M.3
Comis, C.4
Pomes, R.5
Chow, P.6
-
26
-
-
0031096193
-
A case for intelligent RAM: IRAM
-
PATTERSON, D., ANDERSON, T., CARDWELL, N., FROMM, R., KEETON, K., KOZYRAKIS, C., THOMAS, R., AND YELICK, K. 1997a. A case for intelligent RAM: IRAM. IEEE Micro 17, 2, 34-44.
-
(1997)
IEEE Micro
, vol.17
, Issue.2
, pp. 34-44
-
-
Patterson, D.1
Anderson, T.2
Cardwell, N.3
Fromm, R.4
Keeton, K.5
Kozyrakis, C.6
Thomas, R.7
Yelick, K.8
-
27
-
-
0031383426
-
Intelligent RAM (IRAM): The industrial setting, applications, and architectures
-
PATTERSON, D., ASANOVIC, K., BROWN, A., FROMM, R., GOLBUS, J., GRIBSTAD, B., KEETON, K., KOZYRAKIS, C., MARTIN, D., PERISSAKIS, S., THOMAS, R., TREUHAFT, N., AND YELICK, K. 1997b. Intelligent RAM (IRAM): The industrial setting, applications, and architectures. In Proceedings of the International Conference on Computer Design. 2-9.
-
(1997)
Proceedings of the International Conference on Computer Design
, pp. 2-9
-
-
Patterson, D.1
Asanovic, K.2
Brown, A.3
Fromm, R.4
Golbus, J.5
Gribstad, B.6
Keeton, K.7
Kozyrakis, C.8
Martin, D.9
Perissakis, S.10
Thomas, R.11
Treuhaft, N.12
Yelick, K.13
-
28
-
-
3042542332
-
A systolic memory architecture for fast codebook design based on MMPDCL algorithm
-
SANO, K., TAKAGI, C., EGAWA, R., SUZUKI, K., AND NAKAMURA, T. 2004. A systolic memory architecture for fast codebook design based on MMPDCL algorithm. In Proceedings of the International Conference on Information Technology (ITCC'04). 572-578.
-
(2004)
Proceedings of the International Conference on Information Technology (ITCC'04)
, pp. 572-578
-
-
Sano, K.1
Takagi, C.2
Egawa, R.3
Suzuki, K.4
Nakamura, T.5
-
33
-
-
50149121349
-
FPGA-based streaming computation for lattice Boltzmann method
-
SANO, K., PELL, O., LUK, W., AND YAMAMOTO, S. 2007b. FPGA-based streaming computation for lattice Boltzmann method. In Proceedings of the International Conference on Field-Programmable Technology (FPT'07). 233-236.
-
(2007)
Proceedings of the International Conference on Field-Programmable Technology (FPT'07)
, pp. 233-236
-
-
Sano, K.1
Pell, O.2
Luk, W.3
Yamamoto, S.4
-
34
-
-
62949112216
-
Scalable FPGA-array for high-performance and power-efficient computation based on difference schemes
-
DOI: 10.1109/HPRCTA.2008.4745679
-
SANO, K., LUZHOU, W., HATSUDA, Y., AND YAMAMOTO, S. 2008a. Scalable FPGA-array for high-performance and power-efficient computation based on difference schemes. In Proceedings of the International Workshop on High-Performance Reconfigurable Computing Technology and Applications (HPRCTA'08). DOI: 10.1109/HPRCTA.2008.4745679.
-
(2008)
Proceedings of the International Workshop on High-Performance Reconfigurable Computing Technology and Applications (HPRCTA'08)
-
-
Sano, K.1
Luzhou, W.2
Hatsuda, Y.3
Yamamoto, S.4
-
35
-
-
63049136296
-
Evaluating power and energy consumption of FPGA-based custom computing machines for scientific floating-point computation
-
SANO, K., NISHIKAWA, T., AOKI, T., AND YAMAMOTO, S. 2008b. Evaluating power and energy consumption of FPGA-based custom computing machines for scientific floating-point computation. In Proceedings of the International Conference on Field-Programmable Technology (FPT'08). 301-304.
-
(2008)
Proceedings of the International Conference on Field-Programmable Technology (FPT'08)
, pp. 301-304
-
-
Sano, K.1
Nishikawa, T.2
Aoki, T.3
Yamamoto, S.4
-
37
-
-
33846789210
-
A hardware/software approach to molecular dynamics on reconfigurable computers
-
SCROFANO, R., GOKHALE, M. B., TROUW, F., AND PRASANNA, V. K. 2006. A hardware/software approach to molecular dynamics on reconfigurable computers. In Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines. 23-34.
-
(2006)
Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
, pp. 23-34
-
-
Scrofano, R.1
Gokhale, M.B.2
Trouw, F.3
Prasanna, V.K.4
-
38
-
-
44049100960
-
Accelerating molecular dynamics simulations with reconfigurable computers
-
SCROFANO, R., GOKHALE, M. B., TROUW, F., AND PRASANNA, V. K. 2008. Accelerating molecular dynamics simulations with reconfigurable computers. IEEE Trans. Paral. Distrib. Syst. 19, 6, 764-778.
-
(2008)
IEEE Trans. Paral. Distrib. Syst.
, vol.19
, Issue.6
, pp. 764-778
-
-
Scrofano, R.1
Gokhale, M.B.2
Trouw, F.3
Prasanna, V.K.4
-
40
-
-
23944454327
-
Towards an RCC-based accelerator for computational fluid dynamics applications
-
SMITH, W. D. AND SCHNORE, A. R. 2003. Towards an RCC-based accelerator for computational fluid dynamics applications. J. Supercomput. 30, 3, 239-261.
-
(2003)
J. Supercomput.
, vol.30
, Issue.3
, pp. 239-261
-
-
Smith, W.D.1
Schnore, A.R.2
-
41
-
-
62949245114
-
Evaluating FPGAs for floating point performance
-
DOI: 10.1109/HPRCTA.2008.4745680
-
STRENSKI, D., SIMKINS, J., WALKE, R., AND WITTIG, R. 2008. Evaluating FPGAs for floating point performance. In Proceedings of the International Workshop on High-Performance Reconfigurable Computing Technology and Applications (HPRCTA'08). DOI: 10.1109/HPRCTA.2008.4745680.
-
(2008)
Proceedings of the International Workshop on High-Performance Reconfigurable Computing Technology and Applications (HPRCTA'08)
-
-
Strenski, D.1
Simkins, J.2
Walke, R.3
Wittig, R.4
-
42
-
-
0001875899
-
The accuracy of the fractional step method
-
STRIKWERDA, J. C. AND LEE, Y. S. 1999. The accuracy of the fractional step method. SIAM J. Numer. Anal. 37, 1, 37-47.
-
(1999)
SIAM J. Numer. Anal.
, vol.37
, Issue.1
, pp. 37-47
-
-
Strikwerda, J.C.1
Lee, Y.S.2
-
43
-
-
0003502626
-
-
Artech House, Inc., Norwood, MA
-
TAFLOVE, A. AND HAGNESS, S. C. 1996. Computational Electrodynamics - The Finite Difference Time-Domain Method. Artech House, Inc., Norwood, MA.
-
(1996)
Computational Electrodynamics - The Finite Difference Time-Domain Method
-
-
Taflove, A.1
Hagness, S.C.2
-
46
-
-
0030104367
-
Programmable active memories: Reconfigurable systems come of age
-
VUILLEMIN, J. E., BERTIN, P., RONCIN, D., SHAND, M., TOUATI, H. H., AND BOUCARD, P. 1996. Programmable active memories: reconfigurable systems come of age. IEEE Trans. VLSI Syst. 4, 1, 56-69.
-
(1996)
IEEE Trans. VLSI Syst.
, vol.4
, Issue.1
, pp. 56-69
-
-
Vuillemin, J.E.1
Bertin, P.2
Roncin, D.3
Shand, M.4
Touati, H.H.5
Boucard, P.6
-
47
-
-
0034505886
-
20-GFLOPS QR processor on a Xilinx Virtex-E FPGA
-
WALKE, R. L., SMITH, R. W. M., AND LIGHTBODY, G. 2000. 20-GFLOPS QR processor on a Xilinx Virtex-E FPGA. In Proceedings of SPIE: Advanced Signal Processing Algorithms, Architectures and Implementations X. Vol. 4116, 300-310.
-
(2000)
Proceedings of SPIE: Advanced Signal Processing Algorithms, Architectures and Implementations X
, vol.4116
, pp. 300-310
-
-
Walke, R.L.1
Smith, R.W.M.2
Lightbody, G.3
-
48
-
-
65949107549
-
Roofline: An insightful visual performance model for multicore architectures
-
WILLIAMS, S., WATERMAN, A., AND PATTERSON, D. 2009. Roofline: an insightful visual performance model for multicore architectures. Comm. ACM 52, 4, 65-76.
-
(2009)
Comm. ACM
, vol.52
, Issue.4
, pp. 65-76
-
-
Williams, S.1
Waterman, A.2
Patterson, D.3
-
50
-
-
84894021661
-
Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media
-
YEE, K. S. 1966. Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media. IEEE Trans. Antennas Prop. 14, 302-307.
-
(1966)
IEEE Trans. Antennas Prop.
, vol.14
, pp. 302-307
-
-
Yee, K.S.1
-
51
-
-
34648814129
-
High-performance reduction circuits using deeply pipelined operators on FPGAs
-
ZHUO, L., MORRIS, G. R., AND PRASANNA, V. K. 2007. High-performance reduction circuits using deeply pipelined operators on FPGAs. IEEE Trans. Paral. Distrib. Syst. 18, 10, 1377-1392.
-
(2007)
IEEE Trans. Paral. Distrib. Syst.
, vol.18
, Issue.10
, pp. 1377-1392
-
-
Zhuo, L.1
Morris, G.R.2
Prasanna, V.K.3
-
53
-
-
34047144377
-
Scalable and modular algorithms for floating-point matrix multiplication on reconfigurable computing systems
-
ZHUO, L. AND PRASANNA, V. K. 2007. Scalable and modular algorithms for floating-point matrix multiplication on reconfigurable computing systems. IEEE Trans. Paral. Distrib. Syst. 18, 4, 433-448.
-
(2007)
IEEE Trans. Paral. Distrib. Syst.
, vol.18
, Issue.4
, pp. 433-448
-
-
Zhuo, L.1
Prasanna, V.K.2