-
1
-
-
74349084542
-
A massively parallel FPGA-based coprocessor for support vector machines
-
FCCM'09 IEEE
-
Cadambi, S., Durdanovic, I., Jakkula, V., Sankaradass, M., Cosatto, E., Chakradhar, S., Graf, H.P. A massively parallel fpga-based coprocessor for support vector machines. In 17th IEEE Symposium on Field Programmable Custom Computing Machines, 2009. FCCM'09 (2009) IEEE, 115-122.
-
(2009)
17th IEEE Symposium on Field Programmable Custom Computing Machines, 2009
, pp. 115-122
-
-
Cadambi, S.1
Durdanovic, I.2
Jakkula, V.3
Sankaradass, M.4
Cosatto, E.5
Chakradhar, S.6
Graf, H.P.7
-
2
-
-
77955007393
-
A dynamically configurable coprocessor for convolutional neural networks
-
Saint Malo, France, June ACM
-
Chakradhar, S., Sankaradas, M., Jakkula, V., Cadambi, S. A dynamically configurable coprocessor for convolutional neural networks. In International Symposium on Computer Architecture (Saint Malo, France, June 2010). ACM 38(3): 247-257.
-
(2010)
International Symposium on Computer Architecture
, vol.38
, Issue.3
, pp. 247-257
-
-
Chakradhar, S.1
Sankaradas, M.2
Jakkula, V.3
Cadambi, S.4
-
4
-
-
84873463816
-
BenchNN: On the broad potential application scope of hardware neural network accelerators
-
Chen, T., Chen, Y., Duranton, M., Guo, Q., Hashmi, A., Lipasti, M., Nere, A., Qiu, S., Sebag, M., Temam, O. BenchNN: On the broad potential application scope of hardware neural network accelerators. In International Symposium on Workload Characterization, 2012.
-
(2012)
International Symposium on Workload Characterization
-
-
Chen, T.1
Chen, Y.2
Duranton, M.3
Guo, Q.4
Hashmi, A.5
Lipasti, M.6
Nere, A.7
Qiu, S.8
Sebag, M.9
Temam, O.10
-
5
-
-
84897780584
-
Diannao: A small-footprint high-throughput accelerator for ubiquitous machine-learning
-
March ACM
-
Chen, T., Du, Z., Sun, N., Wang, J., Wu, C., Chen, Y., Temam, O. Diannao: A small-footprint high-throughput accelerator for ubiquitous machine-learning. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), (March 2014). ACM 49(4): 269-284.
-
(2014)
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
, vol.49
, Issue.4
, pp. 269-284
-
-
Chen, T.1
Du, Z.2
Sun, N.3
Wang, J.4
Wu, C.5
Chen, Y.6
Temam, O.7
-
6
-
-
84937706638
-
Dadiannao: A machine-learning supercomputer
-
December IEEE Computer Society
-
Chen, Y., Luo, T., Liu, S., Zhang, S., He, L., Wang, J., Li, L., Chen, T., Xu, Z., Sun, N., Temam, O. Dadiannao: A machine-learning supercomputer. In ACM/IEEE International Symposium on Microarchitecture (MICRO) (December 2014). IEEE Computer Society, 609-622.
-
(2014)
ACM/IEEE International Symposium on Microarchitecture (MICRO)
, pp. 609-622
-
-
Chen, Y.1
Luo, T.2
Liu, S.3
Zhang, S.4
He, L.5
Wang, J.6
Li, L.7
Chen, T.8
Xu, Z.9
Sun, N.10
Temam, O.11
-
7
-
-
84897484337
-
Deep learning with cots HPC systems
-
Coates, A., Huval, B., Wang, T., Wu, D.J., Ng, A.Y. Deep learning with cots HPC systems. In International Conference on Machine Learning, 2013: 1337-1345.
-
(2013)
International Conference on Machine Learning
, pp. 1337-1345
-
-
Coates, A.1
Huval, B.2
Wang, T.3
Wu, D.J.4
Ng, A.Y.5
-
8
-
-
85198028989
-
ImageNet: A large-scale hierarchical image database
-
IEEE
-
Deng, J. Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L. ImageNet: A large-scale hierarchical image database. In Conference on Computer Vision and Pattern Recognition (CVPR) (2009). IEEE, 248-255.
-
(2009)
Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 248-255
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
9
-
-
84959912559
-
Shidiannao: Shifting vision processing closer to the sensor
-
ACM
-
Du, Z., Fasthuber, R., Chen, T., Ienne, P., Li, L., Luo, T., Feng, X., Chen, Y., Temam, O. Shidiannao: Shifting vision processing closer to the sensor. In Proceedings of the 42nd ACM/IEEE International Symposium on Computer Architecture (ISCA'15) (2015). ACM, 92-104.
-
(2015)
Proceedings of the 42nd ACM/IEEE International Symposium on Computer Architecture (ISCA'15)
, pp. 92-104
-
-
Du, Z.1
Fasthuber, R.2
Chen, T.3
Ienne, P.4
Li, L.5
Luo, T.6
Feng, X.7
Chen, Y.8
Temam, O.9
-
10
-
-
80052528714
-
Dark silicon and the end of multicore scaling
-
June IEEE
-
Esmaeilzadeh, H., Blem, E., Amant, R.S., Sankaralingam, K., Burger, D. Dark silicon and the end of multicore scaling. In Proceedings of the 38th International Symposium on Computer Architecture (ISCA) (June 2011). IEEE, 365-376.
-
(2011)
Proceedings of the 38th International Symposium on Computer Architecture (ISCA)
, pp. 365-376
-
-
Esmaeilzadeh, H.1
Blem, E.2
Amant, R.S.3
Sankaralingam, K.4
Burger, D.5
-
11
-
-
84876591853
-
Neural acceleration for general-purpose approximate programs
-
Dec IEEE Computer Society
-
Esmaeilzadeh, H., Sampson, A., Ceze, L., Burger, D. Neural acceleration for general-purpose approximate programs. In Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture (Dec 2012). IEEE Computer Society, 449-460.
-
(2012)
Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
, pp. 449-460
-
-
Esmaeilzadeh, H.1
Sampson, A.2
Ceze, L.3
Burger, D.4
-
12
-
-
80054919955
-
NeuFlow: A runtime reconfigurable dataflow processor for vision
-
June IEEE
-
Farabet, C., Martini, B., Corda, B., Akselrod, P., Culurciello, E., LeCun, Y. NeuFlow: A runtime reconfigurable dataflow processor for vision. In CVPR Workshop (June 2011). IEEE, 109-116.
-
(2011)
CVPR Workshop
, pp. 109-116
-
-
Farabet, C.1
Martini, B.2
Corda, B.3
Akselrod, P.4
Culurciello, E.5
LeCun, Y.6
-
13
-
-
80054919955
-
Neuflow: A runtime reconfigurable dataflow processor for vision
-
IEEE
-
Farabet, C., Martini, B., Corda, B., Akselrod, P., Culurciello, E., LeCun, Y. Neuflow: A runtime reconfigurable dataflow processor for vision. In 2011 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2011). IEEE, 109-116.
-
(2011)
2011 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
, pp. 109-116
-
-
Farabet, C.1
Martini, B.2
Corda, B.3
Akselrod, P.4
Culurciello, E.5
LeCun, Y.6
-
14
-
-
50149091681
-
Hyperspectral images clustering on reconfigurable hardware using the k-means algorithm
-
SBCCI 2003 IEEE
-
Frery, A., de Araujo, C., Alice, H., Cerqueira, J., Loureiro, J.A., de Lima, M.E., Oliveira, M., Horta, M., et al. Hyperspectral images clustering on reconfigurable hardware using the k-means algorithm. In Proceedings of the 16th Symposium on Integrated Circuits and Systems Design, 2003. SBCCI 2003 (2003). IEEE, 99-104.
-
(2003)
Proceedings of the 16th Symposium on Integrated Circuits and Systems Design, 2003
, pp. 99-104
-
-
Frery, A.1
De Araujo, C.2
Alice, H.3
Cerqueira, J.4
Loureiro, J.A.5
De Lima, M.E.6
Oliveira, M.7
Horta, M.8
-
15
-
-
77954995378
-
Understanding sources of inefficiency in general-purpose chips
-
New York, New York, USA ACM
-
Hameed, R., Qadeer, W., Wachs, M., Azizi, O., Solomatnikov, A., Lee, B.C., Richardson, S., Kozyrakis, C., Horowitz, M. Understanding sources of inefficiency in general-purpose chips. In International Symposium on Computer Architecture (New York, New York, USA, 2010). ACM, 38(3): 37-47.
-
(2010)
International Symposium on Computer Architecture
, vol.38
, Issue.3
, pp. 37-47
-
-
Hameed, R.1
Qadeer, W.2
Wachs, M.3
Azizi, O.4
Solomatnikov, A.5
Lee, B.C.6
Richardson, S.7
Kozyrakis, C.8
Horowitz, M.9
-
17
-
-
80052112395
-
FPGA implementation of k-means algorithm for bioinformatics application: An accelerated approach to clustering microarray data
-
IEEE
-
Hussain, H.M., Benkrid, K., Seker, H., Erdogan, A.T. Fpga implementation of k-means algorithm for bioinformatics application: An accelerated approach to clustering microarray data. In 2011 NASA/ESA Conference on Adaptive Hardware and Systems (AHS) (2011). IEEE, 248-255.
-
(2011)
2011 NASA/ESA Conference on Adaptive Hardware and Systems (AHS)
, pp. 248-255
-
-
Hussain, H.M.1
Benkrid, K.2
Seker, H.3
Erdogan, A.T.4
-
18
-
-
84862328133
-
Life after dennard and how I learned to love the picojoule (keynote)
-
Keynote presentation, Sao Paolo, Dec.
-
Keckler, S. Life after Dennard and how I learned to love the Picojoule (keynote). In International Symposium on Microarchitecture, Keynote presentation, Sao Paolo, Dec. 2011.
-
(2011)
International Symposium on Microarchitecture
-
-
Keckler, S.1
-
19
-
-
73249114232
-
GOPS 496 mW real-time multi-object recognition processor with bio-inspired neural perception engine
-
Jan.
-
Kim, J.Y., Kim, M., Lee, S., Oh, J., Kim, K., Yoo, H.-J.A. GOPS 496 mW real-time multi-object recognition processor with bio-inspired neural perception engine. IEEE Journal of Solid-State Circuits 45, 1 (Jan. 2010), 32-45.
-
(2010)
IEEE Journal of Solid-State Circuits
, vol.45
, Issue.1
, pp. 32-45
-
-
Kim, J.Y.1
Kim, M.2
Lee, S.3
Oh, J.4
Kim, K.5
Yoo, H.-J.A.6
-
20
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
Krizhevsky, A., Sutskever, I., Hinton, G. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (2012), 1-9.
-
(2012)
Advances in Neural Information Processing Systems
, pp. 1-9
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.3
-
22
-
-
33750683304
-
Towards hardware acceleration of neuroevolution for multimedia processing applications on mobile devices
-
Springer, Berlin Heidelberg
-
Larkin, D., Kinane, A., O'Connor, N.E. Towards hardware acceleration of neuroevolution for multimedia processing applications on mobile devices. In Neural Information Processing (2006). Springer, Berlin Heidelberg, 1178-1188.
-
(2006)
Neural Information Processing
, pp. 1178-1188
-
-
Larkin, D.1
Kinane, A.2
O'Connor, N.E.3
-
24
-
-
84867135575
-
Building high-level features using large scale unsupervised learning
-
June
-
Le, Q.V., Ranzato, M.A., Monga, R., Devin, M., Chen, K., Corrado, G.S., Dean, J., Ng, A.Y. Building high-level features using large scale unsupervised learning. In International Conference on Machine Learning, June 2012.
-
(2012)
International Conference on Machine Learning
-
-
Le, Q.V.1
Ranzato, M.A.2
Monga, R.3
Devin, M.4
Chen, K.5
Corrado, G.S.6
Dean, J.7
Ng, A.Y.8
-
25
-
-
84930630277
-
Deep learning
-
LeCun, Y., Bengio, Y., Hintion, G. Deep learning. Nature 521, 7553 (2015), 436-444.
-
(2015)
Nature
, vol.521
, Issue.7553
, pp. 436-444
-
-
LeCun, Y.1
Bengio, Y.2
Hintion, G.3
-
26
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86 11 (1998), 2278-2324.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
27
-
-
76749146060
-
McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
-
New York, NY, USA ACM
-
Li, S., Ahn, J.H., Strong, R.D., Brockman, J.B., Tullsen, D.M., Jouppi, N.P. McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures. In Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 42 (New York, NY, USA, 2009). ACM, 469-480.
-
(2009)
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO
, vol.42
, pp. 469-480
-
-
Li, S.1
Ahn, J.H.2
Strong, R.D.3
Brockman, J.B.4
Tullsen, D.M.5
Jouppi, N.P.6
-
28
-
-
84939194962
-
Pudiannao: A polyvalent machine learning accelerator
-
ACM
-
Liu, D., Chen, T., Liu, S., Zhou, J., Zhou, S., Teman, O., Feng, X., Zhou, X., Chen, Y. Pudiannao: A polyvalent machine learning accelerator. In International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) (2015). ACM, 369-381.
-
(2015)
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS)
, pp. 369-381
-
-
Liu, D.1
Chen, T.2
Liu, S.3
Zhou, J.4
Zhou, S.5
Teman, O.6
Feng, X.7
Zhou, X.8
Chen, Y.9
-
29
-
-
84863551827
-
Accelerating neuromorphic vision algorithms for recognition
-
ACM
-
Maashri, A.A., Debole, M., Cotter, M., Chandramoorthy, N., Xiao, Y., Narayanan, V., Chakrabarti, C. Accelerating neuromorphic vision algorithms for recognition. In Proceedings of the 49th Annual Design Automation Conference (2012). ACM, 579-584.
-
(2012)
Proceedings of the 49th Annual Design Automation Conference
, pp. 579-584
-
-
Maashri, A.A.1
Debole, M.2
Cotter, M.3
Chandramoorthy, N.4
Xiao, Y.5
Narayanan, V.6
Chakrabarti, C.7
-
30
-
-
84866606976
-
A 0.41 μa standby leakage 32 kb embedded SRAM with low-voltage resume-standby utilizing all digital current comparator in 28 nm hkmg CMOS
-
Maeda, N., Komatsu, S., Morimoto, M., Shimazaki, Y. A 0.41 μa standby leakage 32 kb embedded SRAM with low-voltage resume-standby utilizing all digital current comparator in 28 nm hkmg CMOS. In International Symposium on VLSI Circuits (VLSIC), 2012.
-
(2012)
International Symposium on VLSI Circuits (VLSIC)
-
-
Maeda, N.1
Komatsu, S.2
Morimoto, M.3
Shimazaki, Y.4
-
31
-
-
84859452113
-
A massively parallel, energy efficient programmable accelerator for learning and classification
-
Majumdar, A., Cadambi, S., Becchi, M., Chakradhar, S.T., Graf, H.P. A massively parallel, energy efficient programmable accelerator for learning and classification. ACM Trans. Arch. Code Optim. (TACO) 9, 1 (2012), 6.
-
(2012)
ACM Trans. Arch. Code Optim. (TACO)
, vol.9
, Issue.1
, pp. 6
-
-
Majumdar, A.1
Cadambi, S.2
Becchi, M.3
Chakradhar, S.T.4
Graf, H.P.5
-
32
-
-
79953123438
-
An energy-efficient heterogeneous system for embedded learning and classification
-
Majumdar, A., Cadambi, S., Chakradhar, S.T. An energy-efficient heterogeneous system for embedded learning and classification. Embedded Systems Letters 3, 1 (2011), 42-45.
-
(2011)
Embedded Systems Letters
, vol.3
, Issue.1
, pp. 42-45
-
-
Majumdar, A.1
Cadambi, S.2
Chakradhar, S.T.3
-
34
-
-
34047242620
-
Real-time k-means clustering for color images on reconfigurable hardware
-
Aug IEEE
-
Maruyama, T. Real-time k-means clustering for color images on reconfigurable hardware. In 18th International Conference on Pattern Recognition (ICPR) (Aug 2006). IEEE, Volume 2, 816-819.
-
(2006)
18th International Conference on Pattern Recognition (ICPR)
, vol.2
, pp. 816-819
-
-
Maruyama, T.1
-
37
-
-
84881162326
-
Convolution engine: Balancing efficiency & flexibility in specialized computing
-
ACM
-
Qadeer, W., Hameed, R., Shacham, O., Venkatesan, P., Kozyrakis, C., Horowitz, M.A. Convolution engine: Balancing efficiency & flexibility in specialized computing. In International Symposium on Computer Architecture, 2013). ACM, 41(3), 24-35.
-
International Symposium on Computer Architecture, 2013)
, vol.41
, Issue.3
, pp. 24-35
-
-
Qadeer, W.1
Hameed, R.2
Shacham, O.3
Venkatesan, P.4
Kozyrakis, C.5
Horowitz, M.A.6
-
38
-
-
84874575248
-
Convolutional neural networks applied to house numbers digit classification
-
Sermanet, P., Chintala, S., LeCun, Y. Convolutional neural networks applied to house numbers digit classification. In Pattern Recognition (ICPR), 2012.
-
(2012)
Pattern Recognition (ICPR)
-
-
Sermanet, P.1
Chintala, S.2
LeCun, Y.3
-
40
-
-
84885624310
-
Parallel architectures for the KNN classifier-design of soft IP cores and FPGA implementations
-
Stamoulias, I., Manolakos, E.S. Parallel architectures for the KNN classifier-design of soft IP cores and FPGA implementations. ACM Transactions on Embedded Computing Systems (TECS) 13, 2 (2013), 22.
-
(2013)
ACM Transactions on Embedded Computing Systems (TECS)
, vol.13
, Issue.2
, pp. 22
-
-
Stamoulias, I.1
Manolakos, E.S.2
-
41
-
-
84944392428
-
-
Dec IEEE Computer Society
-
Swanson, S., Michelson, K., Schwerin, A., Oskin, M. Wavescalar. In ACM/IEEE International Symposium on Microarchitecture (MICRO) (Dec 2003). IEEE Computer Society, 291.
-
(2003)
ACM/IEEE International Symposium on Microarchitecture (MICRO)
, pp. 291
-
-
Swanson, S.1
Michelson, K.2
Schwerin, A.3
Wavescalar, O.M.4
-
43
-
-
84864858301
-
A defect-tolerant accelerator for emerging highperformance applications
-
Sep Portland, Oregon
-
Temam, O. A defect-tolerant accelerator for emerging highperformance applications. In International Symposium on Computer Architecture (Sep 2012). Portland, Oregon, 40(3), 356-367.
-
(2012)
International Symposium on Computer Architecture
, vol.40
, Issue.3
, pp. 356-367
-
-
Temam, O.1
-
45
-
-
77952388227
-
Scaling deep trench based EDRAM on SOI to 32 nm and beyond
-
IEEE
-
Wang, G., Anand, D., Butt, N., Cestero, A., Chudzik, M., Ervin, J., Fang, S., Freeman, G., Ho, H., Khan, B., Kim, B., Kong, W., Krishnan, R., Krishnan, S., Kwon, O., Liu, J., McStay, K., Nelson, E., Nummy, K., Parries, P., Sim, J., Takalkar, R., Tessier, A., Todi, R., Malik, R., Stiffler, S., Iyer, S. Scaling deep trench based EDRAM on SOI to 32 nm and beyond. In IEEE International Electron Devices Meeting (IEDM) (2009). IEEE, 1-4.
-
(2009)
IEEE International Electron Devices Meeting (IEDM)
, pp. 1-4
-
-
Wang, G.1
Anand, D.2
Butt, N.3
Cestero, A.4
Chudzik, M.5
Ervin, J.6
Fang, S.7
Freeman, G.8
Ho, H.9
Khan, B.10
Kim, B.11
Kong, W.12
Krishnan, R.13
Krishnan, S.14
Kwon, O.15
Liu, J.16
McStay, K.17
Nelson, E.18
Nummy, K.19
Parries, P.20
Sim, J.21
Takalkar, R.22
Tessier, A.23
Todi, R.24
Malik, R.25
Stiffler, S.26
Iyer, S.27
more..
-
46
-
-
0000459353
-
The lack of a priori distinctions between learning algorithms
-
Wolpert, D.H. The lack of a priori distinctions between learning algorithms. Neural Comput. 8, 7 (1996), 1341-1390.
-
(1996)
Neural Comput.
, vol.8
, Issue.7
, pp. 1341-1390
-
-
Wolpert, D.H.1
-
47
-
-
38049032490
-
FPGA implementation of KNN classifier based on wavelet transform and partial distance search
-
June Springer Berlin Heidelberg
-
Yeh, Y.-J., Li, H.-Y., Hwang, W.-J., Fang, C.-Y. Fpga implementation of KNN classifier based on wavelet transform and partial distance search. In Image Analysis (June 2007). Springer Berlin Heidelberg, 512-521.
-
(2007)
Image Analysis
, pp. 512-521
-
-
Yeh, Y.-J.1
Li, H.-Y.2
Hwang, W.-J.3
Fang, C.-Y.4
|