SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4984-4988

Small-footprint high-performance deep neural network-based speech recognition using split-VQ

(3) Wang, Yongqiang a Li, Jinyu a Gong, Yifan a

a MICROSOFT (United States)

Author keywords

DNN; model compression; on device speech recognition; split VQ

Indexed keywords

AUDIO SIGNAL PROCESSING; BACKPROPAGATION; DEEP NEURAL NETWORKS; MATRIX ALGEBRA; SPEECH; SPEECH COMMUNICATION; VECTOR QUANTIZATION;

COMMON PRACTICES; MODEL COMPRESSION; PERFORMANCE DEGRADATION; SCALAR QUANTIZATION; SMALL FOOTPRINTS; SPEECH RECOGNITION SYSTEMS; SPLIT VECTOR QUANTIZATION; SPLIT VQ;

SPEECH RECOGNITION;

EID: 84946014836 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178919 Document Type: Conference Paper

Times cited : (51)

References (23)

1
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, N. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups," IEEE Signal Processing Magazine, 2012.
- (2012) IEEE Signal Processing Magazine
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, N.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

2
- 84878539964
- Application of pretrained deep neural networks to large vocabulary speech recognition
- N. Jaitly, P. Nguyen, A. Senior, and V. Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition," in Proceedings of Interspeech, 2012.
- (2012) Proceedings of Interspeech
- Jaitly, N.¹ Nguyen, P.² Senior, A.³ Vanhoucke, V.⁴

3
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proceedings of Interspeech, 2011, pp. 437-440.
- (2011) Proceedings of Interspeech , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

4
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

5
- 84890491198
- Recent advances in deep learning for speech research at Microsoft
- L. Deng, J. Li, J.-T. Huang, K. Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams, et al., "Recent advances in deep learning for speech research at Microsoft," in Proceedings of ICASSP, 2013, pp. 8604-8608.
- (2013) Proceedings of ICASSP , pp. 8604-8608
- Deng, L.¹ Li, J.² Huang, J.-T.³ Yao, K.⁴ Yu, D.⁵ Seide, F.⁶ Seltzer, M.⁷ Zweig, G.⁸ He, X.⁹ Williams, J.¹⁰

6
- 84906251664
- Accurate and compact large vocabulary speech recognition on mobile devices
- X. Lei, A. Senior, A. Gruenstein, and J. Sorensen, "Accurate and compact large vocabulary speech recognition on mobile devices," in Proceedings of Interspeech, 2013, pp. 662-665.
- (2013) Proceedings of Interspeech , pp. 662-665
- Lei, X.¹ Senior, A.² Gruenstein, A.³ Sorensen, J.⁴

7
- 84905252895
- Small-footprint keyword spotting using deep neural networks
- G. Chen, C. Parada, and G. Heiglod, "Small-footprint keyword spotting using deep neural networks," in Proceedings of ICASSP, 2014.
- (2014) Proceedings of ICASSP
- Chen, G.¹ Parada, C.² Heiglod, G.³

8
- 84910047185
- Boundary contraction training for acoustic models based on discrete deep neural networks
- R. Takeda, N. Kanda, and N. Nukaga, "Boundary contraction training for acoustic models based on discrete deep neural networks," in Proceedings of Interspeech, 2014.
- (2014) Proceedings of Interspeech
- Takeda, R.¹ Kanda, N.² Nukaga, N.³

9
- 84905252894
- Deep neural networks for small footprint textdependent speaker verification
- E. Variani, X. Lei, E. McDermott, I. Moreno, and J. Gonzalez-Dominguez, "Deep neural networks for small footprint textdependent speaker verification," in Proceedings of ICASSP, 2014, pp. 4052-4056.
- (2014) Proceedings of ICASSP , pp. 4052-4056
- Variani, E.¹ Lei, X.² McDermott, E.³ Moreno, I.⁴ Gonzalez-Dominguez, J.⁵

10
- 84910035297
- Learning small-size DNN with output-distribution-based criteria
- J. Li, R. Zhao, J.-T. Huang, and Y. Gong, "Learning small-size DNN with output-distribution-based criteria," in Proceedings of Interspeech, 2014.
- (2014) Proceedings of Interspeech
- Li, J.¹ Zhao, R.² Huang, J.-T.³ Gong, Y.⁴

11
- 0027662338
- Pruning algorithms: A survey
- R. Reed, "Pruning algorithms: a survey," IEEE Transactions on Neural Networks, vol. 4, no. 5, pp. 740-747, 1993.
- (1993) IEEE Transactions on Neural Networks , vol.4 , Issue.5 , pp. 740-747
- Reed, R.¹

12
- 84905224450
- Reshaping deep neural network for fast decoding by node-pruning
- T. He, Y. Fan, Y. Qian, T. Tan, and K. Yu, "Reshaping deep neural network for fast decoding by node-pruning," in Proceedings of ICASSP, 2014, pp. 245-249.
- (2014) Proceedings of ICASSP , pp. 245-249
- He, T.¹ Fan, Y.² Qian, Y.³ Tan, T.⁴ Yu, K.⁵

13
- 84867606668
- Exploiting sparseness in deep neural networks for large vocabulary speech recognition
- D. Yu, F. Seide, G. Li, and L. Deng, "Exploiting sparseness in deep neural networks for large vocabulary speech recognition," in Proceedings of ICASSP, 2012, pp. 4409-4412.
- (2012) Proceedings of ICASSP , pp. 4409-4412
- Yu, D.¹ Seide, F.² Li, G.³ Deng, L.⁴

14
- 84910028276
- Pruning deep neural networks by optimal brain damage
- C. Liu, Z. Zhang, and D. Wang, "Pruning deep neural networks by optimal brain damage," in Proceedings of Interspeech, 2014.
- (2014) Proceedings of Interspeech
- Liu, C.¹ Zhang, Z.² Wang, D.³

15
- 84890454527
- Low-rank matrix factorization for deep neural network training with high-dimensional output targets
- T. N. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, "Low-rank matrix factorization for deep neural network training with high-dimensional output targets," in Proceedings of ICASSP, 2013, pp. 6655-6659.
- (2013) Proceedings of ICASSP , pp. 6655-6659
- Sainath, T.N.¹ Kingsbury, B.² Sindhwani, V.³ Arisoy, E.⁴ Ramabhadran, B.⁵

16
- 84906227589
- Restructuring of deep neural network acoustic models with singular value decomposition
- J. Xue, J. Li, and Y. Gong, "Restructuring of deep neural network acoustic models with singular value decomposition," in Proceedings of Interspeech, 2013, pp. 2365-2369.
- (2013) Proceedings of Interspeech , pp. 2365-2369
- Xue, J.¹ Li, J.² Gong, Y.³

17
- 84867754966
- Improving the speed of neural networks on CPUs
- V. Vanhoucke, A. Senior, and M. Mao, "Improving the speed of neural networks on CPUs," in Proc. Deep Learning and Unsupervised Feature Learning, NIPS Workshop, 2011.
- (2011) Proc. Deep Learning and Unsupervised Feature Learning, NIPS Workshop
- Vanhoucke, V.¹ Senior, A.² Mao, M.³

18
- 0003959189
- Springer
- A. Gersho and R. M. Gray, Vector quantization and signal compression, Springer, 1992.
- (1992) Vector Quantization and Signal Compression
- Gersho, A.¹ Gray, R.M.²

19
- 12744264186
- A study on the use of CDHMM for large vocabulary off-line recognition of handwritten Chinese characters
- Y. Ge and Q. Huo, "A study on the use of CDHMM for large vocabulary off-line recognition of handwritten Chinese characters," in Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2002, pp. 334-338.
- (2002) Proceedings of International Workshop on Frontiers in Handwriting Recognition , pp. 334-338
- Ge, Y.¹ Huo, Q.²

20
- 0035280044
- Subspace distribution clustering hidden Markov model
- E. Bocchieri and B.-W. Mak, "Subspace distribution clustering hidden Markov model," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 3, pp. 264-275, 2001.
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.3 , pp. 264-275
- Bocchieri, E.¹ Mak, B.-W.²

21
- 0035339805
- Direct training of subspace distribution clustering hidden Markov model
- B.-W. Mak and E. Bocchieri, "Direct training of subspace distribution clustering hidden Markov model," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 4, pp. 378-387, 2001.
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.4 , pp. 378-387
- Mak, B.-W.¹ Bocchieri, E.²

22
- 44449172959
- Building compact MQDF classifier for large character set recognition by subspace distribution sharing
- T. Long and L. Jin, "Building compact MQDF classifier for large character set recognition by subspace distribution sharing," Pattern Recognition, vol. 41, no. 9, pp. 2916-2925, 2008.
- (2008) Pattern Recognition , vol.41 , Issue.9 , pp. 2916-2925
- Long, T.¹ Jin, L.²

23
- 0018918171
- An algorithm for vector quantizer design
- Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantizer design," IEEE Transactions on Communication, vol. 28, no. 1, pp. 84-95, 1980.
- (1980) IEEE Transactions on Communication , vol.28 , Issue.1 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.