SCOPUS 정보 검색 플랫폼

Proceedings - International Symposium on High-Performance Computer Architecture

Volumn 2016-April, Issue , 2016, Pages 14-26

TABLA: A unified template-based framework for accelerating statistical machine learning

(7) Mahajan, Divya a Park, Jongse a Amaro, Emmanuel a Sharma, Hardik a Yazdanbakhsh, Amir a Kim, Joon Kyung a Esmaeilzadeh, Hadi a

a GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACCELERATION; ALGORITHMS; ARM PROCESSORS; ARTIFICIAL INTELLIGENCE; COMPUTER ARCHITECTURE; COMPUTER HARDWARE; COMPUTER PROGRAMMING LANGUAGES; FIELD PROGRAMMABLE GATE ARRAYS (FPGA); GENERAL PURPOSE COMPUTERS; HARDWARE; HIGH LEVEL LANGUAGES; INTEGRATED CIRCUIT DESIGN; LEARNING SYSTEMS; LOGIC SYNTHESIS; OPTIMIZATION; PROGRAM PROCESSORS; RECONFIGURABLE HARDWARE; SUPERCOMPUTERS;

GENERAL PURPOSE PROCESSORS; HIGH-LEVEL ABSTRACTION; OBJECTIVE FUNCTIONS; OPTIMIZATION PROBLEMS; PERFORMANCE BENEFITS; STATISTICAL MACHINE LEARNING; STOCHASTIC GRADIENT DESCENT; STOCHASTIC OPTIMIZATION PROBLEMS;

LEARNING ALGORITHMS;

EID: 84965008656 PISSN: 15300897 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/HPCA.2016.7446050 Document Type: Conference Paper

Times cited : (163)

References (58)

1
- 79961040286
- Toward dark silicon in servers
- July-Aug.
- N. Hardavellas, M. Ferdman, B. Falsafi, and A. Ailamaki. Toward dark silicon in servers. IEEE Micro, 31(4):6-15, July-Aug. 2011.
- (2011) IEEE Micro , vol.31 , Issue.4 , pp. 6-15
- Hardavellas, N.¹ Ferdman, M.² Falsafi, B.³ Ailamaki, A.⁴

2
- 80052528714
- Dark silicon and the end of multicore scaling
- Hadi Esmaeilzadeh, Emily Blem, Renee St. Amant, Karthikeyan Sankaralingam, and Doug Burger. Dark silicon and the end of multicore scaling. In ISCA, 2011.
- (2011) ISCA
- Esmaeilzadeh, H.¹ Blem, E.² St Amant, R.³ Sankaralingam, K.⁴ Burger, D.⁵

3
- 77952256041
- Conservation cores: Reducing the energy of mature computations
- Ganesh Venkatesh, Jack Sampson, Nathan Goulding, Saturnino Garcia, Vladyslav Bryksin, Jose Lugo-Martinez, Steven Swanson, and Michael Bedford Taylor. Conservation cores: Reducing the energy of mature computations. In ASPLOS, 2010.
- (2010) ASPLOS
- Venkatesh, G.¹ Sampson, J.² Goulding, N.³ Garcia, S.⁴ Bryksin, V.⁵ Lugo-Martinez, J.⁶ Swanson, S.⁷ Taylor, M.B.⁸

4
- 0016116644
- Design of ion-implanted mosfet's with very small physical dimensions
- October
- R. H. Dennard, F. H. Gaensslen, V. L. Rideout, E. Bassous, and A. R. LeBlanc. Design of ion-implanted mosfet's with very small physical dimensions. IEEE Journal of Solid-State Circuits, 9, October 1974.
- (1974) IEEE Journal of Solid-State Circuits , vol.9
- Dennard, R.H.¹ Gaensslen, F.H.² Rideout, V.L.³ Bassous, E.⁴ LeBlanc, A.R.⁵

5
- 84860270793
- CPU DB: Recording microprocessor history
- April
- Andrew Danowitz, Kyle Kelley, James Mao, John P. Stevenson, and Mark Horowitz. CPU DB: Recording microprocessor history. ACM Queue, 10(4):10:10-10:27, April 2012.
- (2012) ACM Queue , vol.10 , Issue.4 , pp. 1010-1027
- Danowitz, A.¹ Kelley, K.² Mao, J.³ Stevenson, J.P.⁴ Horowitz, M.⁵

6
- 83755217707
- John Gantz and David Reinsel. Extracting value from chaos.
- Extracting Value from Chaos
- Gantz, J.¹ Reinsel, D.²

7
- 84905454486
- A reconfigurable fabric for accelerating large-scale datacenter services
- June
- Andrew Putnam, Adrian Caulfield, Eric Chung, Derek Chiou, Kypros Constantinides, John Demme, Hadi Esmaeilzadeh, Jeremy Fowers, Gopi Prashanth, Jan Gray, Michael Haselman, Scott Hauck, Stephen Heil, Amir Hormati, Joo-Young Kim, Sitaram Lanka, James R. Larus, Eric Peterson, Aaron Smith, Jason Thong, Phillip Yi Xiao, and Doug Burger. A reconfigurable fabric for accelerating large-scale datacenter services. In ISCA, June 2014.
- (2014) ISCA
- Putnam, A.¹ Caulfield, A.² Chung, E.³ Chiou, D.⁴ Constantinides, K.⁵ Demme, J.⁶ Esmaeilzadeh, H.⁷ Fowers, J.⁸ Prashanth, G.⁹ Gray, J.¹⁰ Haselman, M.¹¹ Hauck, S.¹² Heil, S.¹³ Hormati, A.¹⁴ Kim, J.-Y.¹⁵ Lanka, S.¹⁶ Larus, J.R.¹⁷ Peterson, E.¹⁸ Smith, A.¹⁹ Thong, J.²⁰ Xiao, P.Y.²¹ Burger, D.²² more..

8
- 79955890625
- Dynamically specialized datapaths for energy efficient computing
- Venkatraman Govindaraju, Chen-Han Ho, and Karthikeyan Sankaralingam. Dynamically specialized datapaths for energy efficient computing. In HPCA, 2011.
- (2011) HPCA
- Govindaraju, V.¹ Ho, C.-H.² Sankaralingam, K.³

9
- 84858776502
- QsCores: Trading dark silicon for scalable energy efficiency with quasi-specific cores
- Ganesh Venkatesh, John Sampson, Nathan Goulding, Sravanthi Kota Venkata, Steven Swanson, and Michael Taylor. QsCores: Trading dark silicon for scalable energy efficiency with quasi-specific cores. In MICRO, 2011.
- (2011) MICRO
- Venkatesh, G.¹ Sampson, J.² Goulding, N.³ Venkata, S.K.⁴ Swanson, S.⁵ Taylor, M.⁶

10
- 84863374615
- Bundled execution of recurring traces for energy-efficient general purpose processing
- Shantanu Gupta, Shuguang Feng, Amin Ansari, Scott Mahlke, and David August. Bundled execution of recurring traces for energy-efficient general purpose processing. In MICRO, 2011.
- (2011) MICRO
- Gupta, S.¹ Feng, S.² Ansari, A.³ Mahlke, S.⁴ August, D.⁵

11
- 84939202658
- Sirius: An open end-to-end voice and vision personal assistant and its implications for future warehouse scale computers
- Johann Hauswald, Michael A. Laurenzano, Yunqi Zhang, Cheng Li, Austin Rovinski, Arjun Khurana, Ron Dreslinski, Trevor Mudge, Vinicius Petrucci, Lingjia Tang, and Jason Mars. Sirius: An open end-to-end voice and vision personal assistant and its implications for future warehouse scale computers. In ASPLOS, 2015.
- (2015) ASPLOS
- Hauswald, J.¹ Laurenzano, M.A.² Zhang, Y.³ Li, C.⁴ Rovinski, A.⁵ Khurana, A.⁶ Dreslinski, R.⁷ Mudge, T.⁸ Petrucci, V.⁹ Tang, L.¹⁰ Mars, J.¹¹

12
- 84856089356
- Technical Report MSR-TR-2008-130, Microsoft Research September
- Scott Sirowy and Alessandro Forin. Where's the beef why FPGAs are so fast. Technical Report MSR-TR-2008-130, Microsoft Research, September 2008.
- (2008) Where's the Beef Why FPGAs Are so Fast
- Sirowy, S.¹ Forin, A.²

13
- 84906684989
- Xilinx
- Xilinx. Zynq-7000 all programmable soc, 2014.
- (2014) Zynq-7000 All Programmable Soc

14
- 84930507208
- Intel Corporation
- Intel Corporation. Disrupting the data center to create the digital services economy.
- Disrupting the Data Center to Create the Digital Services Economy

15
- 0004055894
- Cambridge university press
- Stephen Boyd and Lieven Vandenberghe. Convex optimization. Cambridge university press, 2004.
- (2004) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

16
- 84862644049
- Towards a unified architecture for in-RDBMS analytics
- Xixuan Feng, Arun Kumar, Benjamin Recht, and Christopher Ré. Towards a unified architecture for in-RDBMS analytics. In Proceedings of the International Conference on Management of Data, SIGMOD '12, 2012.
- (2012) Proceedings of the International Conference on Management of Data, SIGMOD '12
- Feng, X.¹ Kumar, A.² Recht, B.³ Ré, C.⁴

17
- 0003558120
- Kluwer Academic Publishers
- David C Ku and Giovanni De Micheli. High level synthesis of ASICs under timing and synchronization constraints. Kluwer Academic Publishers, 1992.
- (1992) High Level Synthesis of ASICs under Timing and Synchronization Constraints
- Ku, D.C.¹ Micheli, G.D.²

18
- 84913555165
- arXiv preprint arXiv: 1408.5093
- Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093, 2014.
- (2014) Caffe: Convolutional Architecture for Fast Feature Embedding
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

19
- 84886567160
- M. Lichman. UCI machine learning repository, 2013.
- (2013) UCI Machine Learning Repository
- Lichman, M.¹

20
- 82555195625
- Second workshop on information heterogeneity and fusion in recommender systems (HetRec)
- Iván Cantador, Peter Brusilovsky, and Tsvi Kuflik. Second workshop on information heterogeneity and fusion in recommender systems (HetRec). In Proceedings of the ACM conference on Recommender systems, RecSys 2011, 2011.
- (2011) Proceedings of the ACM Conference on Recommender Systems, RecSys 2011
- Cantador, I.¹ Brusilovsky, P.² Kuflik, T.³

21
- 84879746225
- Grouplens
- Grouplens. Movielens dataset.
- Movielens Dataset

22
- 0010615068
- Neurocomputing using the MasPar MP-1
- K. W. Przytula and V. K. Prasnna, editors chapter 2 Prentice-Hall
- Kamil A. Grajski. Neurocomputing, using the MasPar MP-1. In K. W. Przytula and V. K. Prasnna, editors, Parallel Digital Implementations of Neural Networks, chapter 2, pages 51-76. Prentice-Hall, 1993.
- (1993) Parallel Digital Implementations of Neural Networks , pp. 51-76
- Kamil, A.¹ Grajski²

23
- 84876591853
- Neural acceleration for general-purpose approximate programs
- Hadi Esmaeilzadeh, Adrian Sampson, Luis Ceze, and Doug Burger. Neural acceleration for general-purpose approximate programs. In MICRO, 2012.
- (2012) MICRO
- Esmaeilzadeh, H.¹ Sampson, A.² Ceze, L.³ Burger, D.⁴

24
- 0032203257
- Gradient-based learning applied to document recognition
- Yann Lecun, LÃl'on Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. In Proceedings of the IEEE, pages 2278-2324, 1998.
- (1998) Proceedings of the IEEE , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

25
- 84965003162
- Nvidia. Jetson
- Nvidia. Jetson. http://www.nvidia.com/object/jetson-tk1-embedded-dev-kit.html, 2015.
- (2015)

26
- 50949133669
- Liblinear: A library for large linear classification
- June
- Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. Liblinear: A library for large linear classification. J. Mach. Learn. Res., 9:1871-1874, June 2008.
- (2008) J. Mach. Learn. Res. , vol.9 , pp. 1871-1874
- Fan, R.-E.¹ Chang, K.-W.² Hsieh, C.-J.³ Wang, X.-R.⁴ Lin, C.-J.⁵

27
- 84876211743
- MLPACK: A scalable C++ machine learning library
- Ryan R. Curtin, James R. Cline, Neil P. Slagle, William B. March, P. Ram, Nishant A. Mehta, and Alexander G. Gray. MLPACK: A scalable C++ machine learning library. Journal of Machine Learning Research, 14:801-805, 2013.
- (2013) Journal of Machine Learning Research , vol.14 , pp. 801-805
- Curtin, R.R.¹ Cline, J.R.² Slagle, N.P.³ March, W.B.⁴ Ram, P.⁵ Mehta, N.A.⁶ Gray, A.G.⁷

28
- 84874092321
- Model-driven level 3 BLAS performance optimization on loongson 3A processor
- Zhang Xianyi, Wang Qian, and Zhang Yunquan. Model-driven level 3 BLAS performance optimization on loongson 3A processor. In ICPADS, 2012.
- (2012) ICPADS
- Xianyi, Z.¹ Qian, W.² Yunquan, Z.³

29
- 84863614151
- Factorization machines with libFM
- May
- Steffen Rendle. Factorization machines with libFM. ACM Trans. Intell. Syst. Technol., 3(3):57:1-57:22, May 2012.
- (2012) ACM Trans. Intell. Syst. Technol. , vol.3 , Issue.3 , pp. 571-5722
- Rendle, S.¹

30
- 79955702502
- Libsvm: A library for support vector machines
- May
- Chih-Chung Chang and Chih-Jen Lin. Libsvm: A library for support vector machines. ACM Trans. Intell. Syst. Technol., 2(3):27:1-27:27, May 2011.
- (2011) ACM Trans. Intell. Syst. Technol. , vol.2 , Issue.3 , pp. 271-2727
- Chang, C.-C.¹ Lin, C.-J.²

31
- 33745858913
- Technical report Department of Computer Science University of Copenhagen (DIKU)
- S. Nissen. Implementation of a fast artificial neural network library (FANN). Technical report, Department of Computer Science University of Copenhagen (DIKU), 2003. http://fann.sf.net.
- (2003) Implementation of A Fast Artificial Neural Network Library (FANN)
- Nissen, S.¹

32
- 84939194962
- PuDianNao: A polyvalent machine learning accelerator
- Daofu Liu, Tianshi Chen, Shaoli Liu, Jinhong Zhou, Shengyuan Zhou, Olivier Teman, Xiaobing Feng, Xuehai Zhou, and Yunji Chen. PuDianNao: A polyvalent machine learning accelerator. In ASPLOS, 2015.
- (2015) ASPLOS
- Liu, D.¹ Chen, T.² Liu, S.³ Zhou, J.⁴ Zhou, S.⁵ Teman, O.⁶ Feng, X.⁷ Zhou, X.⁸ Chen, Y.⁹

33
- 84899626479
- GPU acceleration for support vector machines
- Andreas Athanasopoulos, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. GPU acceleration for support vector machines. In 12th International Workshop on Image Analysis for Multimedia Interactive Services, 2011.
- (2011) 12th International Workshop on Image Analysis for Multimedia Interactive Services
- Athanasopoulos, A.¹ Dimou, A.² Mezaris, V.³ Kompatsiaris, I.⁴

34
- 84944081816
- CoRR
- Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, and Evan Shelhamer. cudnn: Efficient primitives for deep learning. CoRR, 2014.
- (2014) Cudnn: Efficient Primitives for Deep Learning
- Chetlur, S.¹ Woolley, C.² Vandermersch, P.³ Cohen, J.⁴ Tran, J.⁵ Catanzaro, B.⁶ Shelhamer, E.⁷

35
- 84885624310
- Parallel architectures for the knn classifier-design of soft IP cores and FPGA implementations
- September
- Ioannis Stamoulias and Elias S. Manolakos. Parallel architectures for the knn classifier-design of soft IP cores and FPGA implementations. ACM Trans. Embed. Comput. Syst., 13(2):22:1-22:21, September 2013.
- (2013) ACM Trans. Embed. Comput. Syst. , vol.13 , Issue.2 , pp. 221-2221
- Stamoulias, I.¹ Manolakos, E.S.²

36
- 77955985658
- IP-cores design for the knn classifier
- May
- E.S. Manolakos and I. Stamoulias. IP-cores design for the knn classifier. In ISCAS, May 2010.
- (2010) ISCAS
- Manolakos, E.S.¹ Stamoulias, I.²

37
- 80052112395
- AHS, June
- H.M. Hussain, K. Benkrid, H. Seker, and A.T. Erdogan. FPGA implementation of K-means algorithm for bioinformatics application: An accelerated approach to clustering microarray data. In AHS, June 2011.
- (2011) FPGA Implementation of K-means Algorithm for Bioinformatics Application: An Accelerated Approach to Clustering Microarray Data
- Hussain, H.M.¹ Benkrid, K.² Seker, H.³ Erdogan, A.T.⁴

38
- 34047242620
- Real-time K-Means clustering for color images on reconfigurable hardware
- Tsutomu Maruyama. Real-time K-Means clustering for color images on reconfigurable hardware. In ICPR, pages 816-819, 2006.
- (2006) ICPR , pp. 816-819
- Maruyama, T.¹

39
- 50149091681
- Hyperspectral images clustering on reconfigurable hardware using the k-means algorithm
- Sept
- A.Gda.S. Filho, A.C. Frery, C.C. de Araujo, H. Alice, J. Cerqueira, J.A. Loureiro, M.E. de Lima, Mdas.G.S. Oliveira, and M.M. Horta. Hyperspectral images clustering on reconfigurable hardware using the k-means algorithm. In SBCCI, Sept 2003.
- (2003) SBCCI
- Gda, A.¹ Filho, S.² Frery, A.C.³ De Araujo, C.C.⁴ Alice, H.⁵ Cerqueira, J.⁶ Loureiro, J.A.⁷ De Lima, M.E.⁸ Oliveira, M.G.S.⁹ Horta, M.M.¹⁰

40
- 77954269943
- A heterogeneous FPGA architecture for support vector machine training
- May
- M. Papadonikolakis and C. Bouganis. A heterogeneous FPGA architecture for support vector machine training. In FCCM, May 2010.
- (2010) FCCM
- Papadonikolakis, M.¹ Bouganis, C.²

41
- 74349084542
- A massively parallel fpga-based coprocessor for support vector machines
- April
- S. Cadambi, I. Durdanovic, V. Jakkula, M. Sankaradass, E. Cosatto, S. Chakradhar, and H.P. Graf. A massively parallel fpga-based coprocessor for support vector machines. In FCCM, April 2009.
- (2009) FCCM
- Cadambi, S.¹ Durdanovic, I.² Jakkula, V.³ Sankaradass, M.⁴ Cosatto, E.⁵ Chakradhar, S.⁶ Graf, H.P.⁷

42
- 79953123438
- An energy-efficient heterogeneous system for embedded learning and classification
- March
- A. Majumdar, S. Cadambi, and S.T. Chakradhar. An energy-efficient heterogeneous system for embedded learning and classification. Embedded Systems Letters, IEEE, 3(1):42-45, March 2011.
- (2011) Embedded Systems Letters IEEE , vol.3 , Issue.1 , pp. 42-45
- Majumdar, A.¹ Cadambi, S.² Chakradhar, S.T.³

43
- 84859452113
- A massively parallel, energy efficient programmable accelerator for learning and classification
- March
- Abhinandan Majumdar, Srihari Cadambi, Michela Becchi, Srimat T. Chakradhar, and Hans Peter Graf. A massively parallel, energy efficient programmable accelerator for learning and classification. ACM Trans. Archit. Code Optim., 9(1):6:1-6:30, March 2012.
- (2012) ACM Trans. Archit. Code Optim. , vol.9 , Issue.1 , pp. 61-630
- Majumdar, A.¹ Cadambi, S.² Becchi, M.³ Chakradhar, S.T.⁴ Graf, H.P.⁵

44
- 80054919955
- NeuFlow: A runtime reconfigurable dataflow processor for vision
- June
- C. Farabet, B. Martini, B. Corda, P. Akselrod, E. Culurciello, and Y. LeCun. NeuFlow: A runtime reconfigurable dataflow processor for vision. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference on, pages 109-116, June 2011.
- (2011) Computer Vision and Pattern Recognition Workshops (CVPRW) 2011 IEEE Computer Society Conference on , pp. 109-116
- Farabet, C.¹ Martini, B.² Corda, B.³ Akselrod, P.⁴ Culurciello, E.⁵ LeCun, Y.⁶

45
- 84897780584
- DianNao: A small-footprint high-throughput accelerator for ubiquitous machine-learning
- Tianshi Chen, Zidong Du, Ninghui Sun, JiaWang, ChengyongWu, Yunji Chen, and Olivier Temam. DianNao: a small-footprint high-throughput accelerator for ubiquitous machine-learning. In ASPLOS, 2014.
- (2014) ASPLOS
- Chen, T.¹ Du, Z.² Sun, N.³ Wang, J.⁴ Wu, C.⁵ Chen, Y.⁶ Temam, O.⁷

46
- 84863551827
- Accelerating neuromorphic vision algorithms for recognition
- June
- A.A. Maashri, M. DeBole, M. Cotter, N. Chandramoorthy, Yang Xiao, V. Narayanan, and C. Chakrabarti. Accelerating neuromorphic vision algorithms for recognition. In DAC, June 2012.
- (2012) DAC
- Maashri, A.A.¹ DeBole, M.² Cotter, M.³ Chandramoorthy, N.⁴ Xiao, Y.⁵ Narayanan, V.⁶ Chakrabarti, C.⁷

47
- 84934280945
- SNNAP: Approximate computing on programmable socs via neural acceleration
- Thierry Moreau, Mark Wyse, Jacob Nelson, Adrian Sampson, Hadi Esmaeilzadeh, Luis Ceze, and Mark Oskin. SNNAP: Approximate computing on programmable socs via neural acceleration. In HPCA, 2015.
- (2015) HPCA
- Moreau, T.¹ Wyse, M.² Nelson, J.³ Sampson, A.⁴ Esmaeilzadeh, H.⁵ Ceze, L.⁶ Oskin, M.⁷

48
- 79961187689
- A hardware acceleration technique for gradient descent and conjugate gradient
- June
- D. Kesler, B. Deka, and R. Kumar. A hardware acceleration technique for gradient descent and conjugate gradient. In SASP, June 2011.
- (2011) SASP
- Kesler, D.¹ Deka, B.² Kumar, R.³

49
- 79953230605
- Constantinides. A high throughput FPGAbased floating point conjugate gradient implementation for dense matrices
- January
- Antonio Roldao and George A. Constantinides. A high throughput FPGAbased floating point conjugate gradient implementation for dense matrices. ACM Trans. Reconfigurable Technol. Syst., 3(1):1:1-1:19, January 2010.
- (2010) ACM Trans. Reconfigurable Technol. Syst. , vol.3 , Issue.1 , pp. 11-119
- Roldao, A.¹ George, A.²

50
- 34147131364
- A hybrid approach for mapping conjugate gradient onto an fpga-augmented reconfigurable supercomputer
- April
- G.R. Morris, V.K. Prasanna, and R.D.,erson. A hybrid approach for mapping conjugate gradient onto an fpga-augmented reconfigurable supercomputer. In FCCM, April 2006.
- (2006) FCCM
- Morris, G.R.¹ Prasanna, V.K.² Erson, R.D.³

51
- 60349119698
- An implementation of the conjugate gradient algorithm on fpgas
- April
- D. DuBois, A. DuBois, T. Boorman, C. Connor, and S. Poole. An implementation of the conjugate gradient algorithm on fpgas. In FCCM, April 2008.
- (2008) FCCM
- DuBois, D.¹ DuBois, A.² Boorman, T.³ Connor, C.⁴ Poole, S.⁵

52
- 77954069604
- FPGA implementation of kNN classifier based on wavelet transform and partial distance search
- Yao-Jung Yeh, Hui-Ya Li,Wen-Jyi Hwang, and Chiung-Yao Fang. FPGA implementation of kNN classifier based on wavelet transform and partial distance search. In SCIA, 2007.
- (2007) SCIA
- Yeh, Y.-J.¹ Li, H.-Y.² Hwang, W.-J.³ Fang, C.-Y.⁴

53
- 54949115901
- CHiMPS: A high-level compilation flow for hybrid CPU-FPGA architectures
- Andrew R. Putnam, Dave Bennett, Eric Dellinger, Jeff Mason, and Prasanna Sundararajan. CHiMPS: A high-level compilation flow for hybrid CPU-FPGA architectures. In FPGA, 2008.
- (2008) FPGA
- Putnam, A.R.¹ Bennett, D.² Dellinger, E.³ Mason, J.⁴ Sundararajan, P.⁵

54
- 77955733335
- Fpmr: Mapreduce framework on fpga
- Yi Shan, BoWang, Jing Yan, YuWang, Ningyi Xu, and Huazhong Yang. Fpmr: Mapreduce framework on fpga. In FPGA, 2010.
- (2010) FPGA
- Shan, Y.¹ Wang, B.² Yan, J.³ Wang, Y.⁴ Xu, N.⁵ Yang, H.⁶

55
- 84912524416
- A high memory bandwidth fpga accelerator for sparse matrix-vector multiplication
- IEEE, May
- Jeremy Fowers, Kalin Ovtcharov, Karin Strauss, Eric Chung, and Greg Stitt. A high memory bandwidth fpga accelerator for sparse matrix-vector multiplication. In FCCM. IEEE, May 2014.
- (2014) FCCM
- Fowers, J.¹ Ovtcharov, K.² Strauss, K.³ Chung, E.⁴ Stitt, G.⁵

56
- 79952918458
- CoRAM: An in-fabric memory architecture for fpga-based computing
- Eric S. Chung, James C. Hoe, and Ken Mai. CoRAM: An in-fabric memory architecture for fpga-based computing. In FPGA, 2011.
- (2011) FPGA
- Chung, E.S.¹ Hoe, J.C.² Mai, K.³

57
- 84881142714
- LINQits: Big data on little clients
- Eric S. Chung, John D. Davis, and Jaewon Lee. LINQits: Big data on little clients. In ISCA, 2013.
- (2013) ISCA
- Chung, E.S.¹ Davis, J.D.² Lee, J.³

58
- 84898616656
- FPL, Sept
- M. King, A. Khan, A. Agarwal, O. Arcas, and Arvind. Generating infrastructure for FPGA-accelerated applications. In FPL, Sept 2013.
- (2013) Generating Infrastructure for FPGA-accelerated Applications
- King, M.¹ Khan, A.² Agarwal, A.³ Arcas, O.⁴ Arvind⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.