SCOPUS 정보 검색 플랫폼

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volumn , Issue , 2017, Pages

Loss-aware binarization of deep networks

(3) Hou, Lu a Yao, Quanming a Kwok, James T a

a HONG KONG UNIVERSITY OF SCIENCE AND TECHNOLOGY (Hong Kong)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION ALGORITHMS; RECURRENT NEURAL NETWORKS;

BINARIZATION ALGORITHM; CLOSED FORM SOLUTIONS; FEEDFORWARD AND RECURRENT NETWORKS; MATRIX APPROXIMATION; NETWORK WEIGHTS; NEURAL NETWORK MODEL; NEWTON ALGORITHM; SECOND MOMENTS;

DEEP NEURAL NETWORKS;

EID: 85050926955 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (121)

References (28)

1
- 85009395458
- BinaryConnect: Training deep neural networks with binary weights during propagations
- M. Courbariaux, Y. Bengio, and J.P. David. BinaryConnect: Training deep neural networks with binary weights during propagations. In NIPS, pp. 3105-3113, 2015.
- (2015) NIPS , pp. 3105-3113
- Courbariaux, M.¹ Bengio, Y.² David, J.P.³

2
- 84965117097
- Equilibrated adaptive learning rates for non-convex optimization
- Y. Dauphin, H. de Vries, and Y. Bengio. Equilibrated adaptive learning rates for non-convex optimization. In NIPS, pp. 1504-1512, 2015a.
- (2015) NIPS , pp. 1504-1512
- Dauphin, Y.¹ De Vries, H.² Bengio, Y.³

3
- 84945969537
- Technical Report
- Y. Dauphin, H. de Vries, J. Chung, and Y. Bengio. RMSprop and equilibrated adaptive learning rates for non-convex optimization. Technical Report arXiv:1502.04390, 2015b.
- (2015) RMSprop and Equilibrated Adaptive Learning Rates for Non-Convex Optimization
- Dauphin, Y.¹ De Vries, H.² Chung, J.³ Bengio, Y.⁴

4
- 80052250414
- Adaptive subgradient methods for online learning and stochastic optimization
- J. Duchi, E. Hazan, and Y. Singer. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research, 12:2121-2159, 2011.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 2121-2159
- Duchi, J.¹ Hazan, E.² Singer, Y.³

5
- 84862277874
- Understanding the difficulty of training deep feedforward neural networks
- X. Glorot and Y. Bengio. Understanding the difficulty of training deep feedforward neural networks. In AISTAT, pp. 249-256, 2010.
- (2010) AISTAT , pp. 249-256
- Glorot, X.¹ Bengio, Y.²

6
- 84940682866
- Technical Report
- Y. Gong, L. Liu, M. Yang, and L. Bourdev. Compressing deep convolutional networks using vector quantization. Technical Report arXiv:1412.6115, 2014.
- (2014) Compressing Deep Convolutional Networks Using Vector Quantization
- Gong, Y.¹ Liu, L.² Yang, M.³ Bourdev, L.⁴

7
- 84893401626
- arXiv preprint
- I.J. Goodfellow, D. Warde-Farley, P. Lamblin, V. Dumoulin, M. Mirza, R. Pascanu, J. Bergstra, F. Bastien, and Y. Bengio. Pylearn2: a machine learning research library. arXiv preprint arXiv:1308.4214, 2013.
- (2013) Pylearn2: A Machine Learning Research Library
- Goodfellow, I.J.¹ Warde-Farley, D.² Lamblin, P.³ Dumoulin, V.⁴ Mirza, M.⁵ Pascanu, R.⁶ Bergstra, J.⁷ Bastien, F.⁸ Bengio, Y.⁹

8
- 85083950579
- Deep compression: Compressing deep neural network with pruning, trained quantization and Huffman coding
- S. Han, H. Mao, and W.J. Dally. Deep compression: Compressing deep neural network with pruning, trained quantization and Huffman coding. In ICLR, 2016.
- (2016) ICLR
- Han, S.¹ Mao, H.² Dally, W.J.³

9
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Computation, pp. 1735-1780, 1997.
- (1997) Neural Computation , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

10
- 85013626529
- Binarized neural networks
- I. Hubara, M. Courbariaux, D. Soudry, R. El-Yaniv, and Y. Bengio. Binarized neural networks. In NIPS, pp. 4107-4115, 2016.
- (2016) NIPS , pp. 4107-4115
- Hubara, I.¹ Courbariaux, M.² Soudry, D.³ El-Yaniv, R.⁴ Bengio, Y.⁵

11
- 84959876313
- Visualizing and understanding recurrent networks
- A. Karpathy, J. Johnson, and F.-F. Li. Visualizing and understanding recurrent networks. In ICLR, 2016.
- (2016) ICLR
- Karpathy, A.¹ Johnson, J.² Li, F.-F.³

12
- 85083951289
- Compression of deep convolutional neural networks for fast and low power mobile applications
- Y.-D. Kim, E. Park, S. Yoo, T. Choi, L. Yang, and D. Shin. Compression of deep convolutional neural networks for fast and low power mobile applications. In ICLR, 2016.
- (2016) ICLR
- Kim, Y.-D.¹ Park, E.² Yoo, S.³ Choi, T.⁴ Yang, L.⁵ Shin, D.⁶

13
- 85083951076
- A method for stochastic optimization
- D. Kingma and J. Ba. Adam: A method for stochastic optimization. In ICLR, 2015.
- (2015) ICLR
- Kingma, D.¹ Adam, J.Ba.²

14
- 84930630277
- Deep learning
- Y. LeCun, Y. Bengio, and G. Hinton. Deep learning. Nature, 521(7553):436-444, 2015.
- (2015) Nature , vol.521 , Issue.7553 , pp. 436-444
- LeCun, Y.¹ Bengio, Y.² Hinton, G.³

15
- 84910594199
- Proximal Newton-type methods for minimizing composite functions
- J.D. Lee, Y. Sun, and M.A. Saunders. Proximal Newton-type methods for minimizing composite functions. SIAM Journal on Optimization, 24(3):1420-1443, 2014.
- (2014) SIAM Journal on Optimization , vol.24 , Issue.3 , pp. 1420-1443
- Lee, J.D.¹ Sun, Y.² Saunders, M.A.³

16
- 85016060111
- Technical Report
- F. Li and B. Liu. Ternary weight networks. Technical Report arXiv:1605.04711, 2016.
- (2016) Ternary Weight Networks
- Li, F.¹ Liu, B.²

17
- 85083953250
- Neural networks with few multiplications
- Z. Lin, M. Courbariaux, R. Memisevic, and Y. Bengio. Neural networks with few multiplications. In ICLR, 2016.
- (2016) ICLR
- Lin, Z.¹ Courbariaux, M.² Memisevic, R.³ Bengio, Y.⁴

18
- 84872565347
- Training deep and recurrent networks with Hessian-free optimization
- Springer
- J. Martens and I. Sutskever. Training deep and recurrent networks with Hessian-free optimization. In Neural Networks: Tricks of the trade, pp. 479-535. Springer, 2012.
- (2012) Neural Networks: Tricks of the Trade , pp. 479-535
- Martens, J.¹ Sutskever, I.²

19
- 84965128773
- Tensorizing neural networks
- A. Novikov, D. Podoprikhin, A. Osokin, and D.P. Vetrov. Tensorizing neural networks. In NIPS, pp. 442-450, 2015.
- (2015) NIPS , pp. 442-450
- Novikov, A.¹ Podoprikhin, D.² Osokin, A.³ Vetrov, D.P.⁴

20
- 85083950291
- Revisiting natural gradient for deep networks
- R. Pascanu and Y. Bengio. Revisiting natural gradient for deep networks. In ICLR, 2014.
- (2014) ICLR
- Pascanu, R.¹ Bengio, Y.²

21
- 84892982833
- On the difficulty of training recurrent neural networks
- R. Pascanu, T. Mikolov, and Y. Bengio. On the difficulty of training recurrent neural networks. In ICLR, pp. 1310-1318, 2013.
- (2013) ICLR , pp. 1310-1318
- Pascanu, R.¹ Mikolov, T.² Bengio, Y.³

22
- 84928251916
- DC proximal Newton for nonconvex optimization problems
- A. Rakotomamonjy, R. Flamary, and G. Gasso. DC proximal Newton for nonconvex optimization problems. IEEE Transactions on Neural Networks and Learning Systems, 27(3):636-647, 2016.
- (2016) IEEE Transactions on Neural Networks and Learning Systems , vol.27 , Issue.3 , pp. 636-647
- Rakotomamonjy, A.¹ Flamary, R.² Gasso, G.³

23
- 84990055874
- XNOR-Net: ImageNet classification using binary convolutional neural networks
- M. Rastegari, V. Ordonez, J. Redmon, and A. Farhadi. XNOR-Net: ImageNet classification using binary convolutional neural networks. In ECCV, 2016.
- (2016) ECCV
- Rastegari, M.¹ Ordonez, V.² Redmon, J.³ Farhadi, A.⁴

24
- 84979557463
- arXiv e-prints, abs/1605.02688, May
- Theano Development Team. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints, abs/1605.02688, May 2016. URL http://arxiv.org/abs/1605.02688.
- (2016) Theano: A Python Framework for Fast Computation of Mathematical Expressions

25
- 84943546021
- T. Tieleman and G. Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude, 2012.
- (2012) Lecture 6.5-Rmsprop: Divide the Gradient by a Running Average of Its Recent Magnitude
- Tieleman, T.¹ Hinton, G.²

26
- 77951160349
- The concave-convex procedure (CCCP)
- A.L. Yuille and A. Rangarajan. The concave-convex procedure (CCCP). NIPS, 2:1033-1040, 2002.
- (2002) NIPS , vol.2 , pp. 1033-1040
- Yuille, A.L.¹ Rangarajan, A.²

27
- 84969736572
- Technical Report
- M.D. Zeiler. ADADELTA: An adaptive learning rate method. Technical Report arXiv:1212.5701, 2012.
- (2012) ADADELTA: An Adaptive Learning Rate Method
- Zeiler, M.D.¹

28
- 85023600253
- Technical Report
- S. Zhou, Z. Ni, X. Zhou, H. Wen, Y. Wu, and Y. Zou. DoReFa-Net: Training low bitwidth convolutional neural networks with low bitwidth gradients. Technical Report arXiv:1606.06160, 2016.
- (2016) DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
- Zhou, S.¹ Ni, Z.² Zhou, X.³ Wen, H.⁴ Wu, Y.⁵ Zou, Y.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.