메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 4984-4988

Small-footprint high-performance deep neural network-based speech recognition using split-VQ

Author keywords

DNN; model compression; on device speech recognition; split VQ

Indexed keywords

AUDIO SIGNAL PROCESSING; BACKPROPAGATION; DEEP NEURAL NETWORKS; MATRIX ALGEBRA; SPEECH; SPEECH COMMUNICATION; VECTOR QUANTIZATION;

EID: 84946014836     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178919     Document Type: Conference Paper
Times cited : (51)

References (23)
  • 2
    • 84878539964 scopus 로고    scopus 로고
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • N. Jaitly, P. Nguyen, A. Senior, and V. Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition," in Proceedings of Interspeech, 2012.
    • (2012) Proceedings of Interspeech
    • Jaitly, N.1    Nguyen, P.2    Senior, A.3    Vanhoucke, V.4
  • 3
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proceedings of Interspeech, 2011, pp. 437-440.
    • (2011) Proceedings of Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 4
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 6
    • 84906251664 scopus 로고    scopus 로고
    • Accurate and compact large vocabulary speech recognition on mobile devices
    • X. Lei, A. Senior, A. Gruenstein, and J. Sorensen, "Accurate and compact large vocabulary speech recognition on mobile devices," in Proceedings of Interspeech, 2013, pp. 662-665.
    • (2013) Proceedings of Interspeech , pp. 662-665
    • Lei, X.1    Senior, A.2    Gruenstein, A.3    Sorensen, J.4
  • 7
    • 84905252895 scopus 로고    scopus 로고
    • Small-footprint keyword spotting using deep neural networks
    • G. Chen, C. Parada, and G. Heiglod, "Small-footprint keyword spotting using deep neural networks," in Proceedings of ICASSP, 2014.
    • (2014) Proceedings of ICASSP
    • Chen, G.1    Parada, C.2    Heiglod, G.3
  • 8
    • 84910047185 scopus 로고    scopus 로고
    • Boundary contraction training for acoustic models based on discrete deep neural networks
    • R. Takeda, N. Kanda, and N. Nukaga, "Boundary contraction training for acoustic models based on discrete deep neural networks," in Proceedings of Interspeech, 2014.
    • (2014) Proceedings of Interspeech
    • Takeda, R.1    Kanda, N.2    Nukaga, N.3
  • 11
    • 0027662338 scopus 로고
    • Pruning algorithms: A survey
    • R. Reed, "Pruning algorithms: a survey," IEEE Transactions on Neural Networks, vol. 4, no. 5, pp. 740-747, 1993.
    • (1993) IEEE Transactions on Neural Networks , vol.4 , Issue.5 , pp. 740-747
    • Reed, R.1
  • 12
    • 84905224450 scopus 로고    scopus 로고
    • Reshaping deep neural network for fast decoding by node-pruning
    • T. He, Y. Fan, Y. Qian, T. Tan, and K. Yu, "Reshaping deep neural network for fast decoding by node-pruning," in Proceedings of ICASSP, 2014, pp. 245-249.
    • (2014) Proceedings of ICASSP , pp. 245-249
    • He, T.1    Fan, Y.2    Qian, Y.3    Tan, T.4    Yu, K.5
  • 13
    • 84867606668 scopus 로고    scopus 로고
    • Exploiting sparseness in deep neural networks for large vocabulary speech recognition
    • D. Yu, F. Seide, G. Li, and L. Deng, "Exploiting sparseness in deep neural networks for large vocabulary speech recognition," in Proceedings of ICASSP, 2012, pp. 4409-4412.
    • (2012) Proceedings of ICASSP , pp. 4409-4412
    • Yu, D.1    Seide, F.2    Li, G.3    Deng, L.4
  • 15
    • 84890454527 scopus 로고    scopus 로고
    • Low-rank matrix factorization for deep neural network training with high-dimensional output targets
    • T. N. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, "Low-rank matrix factorization for deep neural network training with high-dimensional output targets," in Proceedings of ICASSP, 2013, pp. 6655-6659.
    • (2013) Proceedings of ICASSP , pp. 6655-6659
    • Sainath, T.N.1    Kingsbury, B.2    Sindhwani, V.3    Arisoy, E.4    Ramabhadran, B.5
  • 16
    • 84906227589 scopus 로고    scopus 로고
    • Restructuring of deep neural network acoustic models with singular value decomposition
    • J. Xue, J. Li, and Y. Gong, "Restructuring of deep neural network acoustic models with singular value decomposition," in Proceedings of Interspeech, 2013, pp. 2365-2369.
    • (2013) Proceedings of Interspeech , pp. 2365-2369
    • Xue, J.1    Li, J.2    Gong, Y.3
  • 19
    • 12744264186 scopus 로고    scopus 로고
    • A study on the use of CDHMM for large vocabulary off-line recognition of handwritten Chinese characters
    • Y. Ge and Q. Huo, "A study on the use of CDHMM for large vocabulary off-line recognition of handwritten Chinese characters," in Proceedings of International Workshop on Frontiers in Handwriting Recognition, 2002, pp. 334-338.
    • (2002) Proceedings of International Workshop on Frontiers in Handwriting Recognition , pp. 334-338
    • Ge, Y.1    Huo, Q.2
  • 21
    • 0035339805 scopus 로고    scopus 로고
    • Direct training of subspace distribution clustering hidden Markov model
    • B.-W. Mak and E. Bocchieri, "Direct training of subspace distribution clustering hidden Markov model," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 4, pp. 378-387, 2001.
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.4 , pp. 378-387
    • Mak, B.-W.1    Bocchieri, E.2
  • 22
    • 44449172959 scopus 로고    scopus 로고
    • Building compact MQDF classifier for large character set recognition by subspace distribution sharing
    • T. Long and L. Jin, "Building compact MQDF classifier for large character set recognition by subspace distribution sharing," Pattern Recognition, vol. 41, no. 9, pp. 2916-2925, 2008.
    • (2008) Pattern Recognition , vol.41 , Issue.9 , pp. 2916-2925
    • Long, T.1    Jin, L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.