메뉴 건너뛰기




Volumn 2016-May, Issue , 2016, Pages 5955-5959

Personalized speech recognition on mobile devices

Author keywords

CTC; embedded speech recognition; LSTM; model compression; quantization

Indexed keywords


EID: 84973402464     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2016.7472820     Document Type: Conference Paper
Times cited : (182)

References (25)
  • 1
    • 84906251664 scopus 로고    scopus 로고
    • Accurate and compact large vocabulary speech recognition on mobile devices
    • ISCA
    • Xin Lei, Andrew Senior, Alexander Gruenstein, and Jeffrey Sorensen, "Accurate and compact large vocabulary speech recognition on mobile devices., " in INTERSPEECH. 2013, pp. 662-665, ISCA
    • (2013) INTERSPEECH. , pp. 662-665
    • Lei, X.1    Senior, A.2    Gruenstein, A.3    Sorensen, J.4
  • 2
    • 34250704813 scopus 로고    scopus 로고
    • Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks
    • Alex Graves, Santiago Fernández, Faustino Gomez, and Jürgen Schmidhuber, "Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks, " in ICML, 2006, pp. 369-376
    • (2006) ICML , pp. 369-376
    • Graves, A.1    Fernández, S.2    Gomez, F.3    Schmidhuber, J.4
  • 3
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • IEEE
    • Brian Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, " in ICASSP. 2009, pp. 3761-3764, IEEE
    • (2009) ICASSP , pp. 3761-3764
    • Kingsbury, B.1
  • 4
    • 84906227589 scopus 로고    scopus 로고
    • Restructuring of deep neural network acoustic models with singular value decomposition
    • Jian Xue, Jinyu Li, and Yifan Gong, "Restructuring of deep neural network acoustic models with singular value decomposition, " in INTERSPEECH, 2013, pp. 2365-2369
    • (2013) INTERSPEECH , pp. 2365-2369
    • Xue, J.1    Li, J.2    Gong, Y.3
  • 5
    • 84973402069 scopus 로고    scopus 로고
    • On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition
    • IEEE
    • Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, and Ian McGraw, "On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition, " in ICASSP. 2016, IEEE
    • (2016) ICASSP.
    • Prabhavalkar, R.1    Alsharif, O.2    Bruguier, A.3    McGraw, I.4
  • 6
    • 84959152523 scopus 로고    scopus 로고
    • Locallyconnected and convolutional neural networks for small footprint speaker recognition
    • ISCA
    • Yu-hsin Chen, Ignacio Lopez-Moreno, Tara N. Sainath, Mirkó Visontai, Raziel Alvarez, and Carolina Parada, "Locallyconnected and convolutional neural networks for small footprint speaker recognition, " in INTERSPEECH. 2015, pp. 1136-1140, ISCA
    • (2015) INTERSPEECH. , pp. 1136-1140
    • Chen, Y.1    Lopez-Moreno, I.2    Sainath, T.N.3    Visontai, M.4    Alvarez, R.5    Parada, C.6
  • 7
    • 84959104369 scopus 로고    scopus 로고
    • Compressing deep neural networks using a rank-constrained topology
    • ISCA
    • Preetum Nakkiran, Raziel Alvarez, Rohit Prabhavalkar, and Carolina Parada, "Compressing deep neural networks using a rank-constrained topology, " in INTERSPEECH. 2015, pp. 1473-1477, ISCA
    • (2015) INTERSPEECH. , pp. 1473-1477
    • Nakkiran, P.1    Alvarez, R.2    Prabhavalkar, R.3    Parada, C.4
  • 8
    • 84965177696 scopus 로고    scopus 로고
    • Structured transforms for small-footprint deep learning
    • Vikas Sindhwani, Tara N. Sainath, and Sanjiv Kumar, "Structured transforms for small-footprint deep learning, " in NIPS (to appear), 2015
    • (2015) NIPS (To Appear)
    • Sindhwani, V.1    Sainath, T.N.2    Kumar, S.3
  • 9
    • 84890454527 scopus 로고    scopus 로고
    • Low-rank matrix factorization for deep neural network training with high-dimensional output targets
    • IEEE
    • Tara N. Sainath, Brian Kingsbury, Vikas Sindhwani, Ebru Arisoy, and Bhuvana Ramabhadran, "Low-rank matrix factorization for deep neural network training with high-dimensional output targets, " in ICASSP. 2013, pp. 6655-6659, IEEE
    • (2013) ICASSP. , pp. 6655-6659
    • Sainath, T.N.1    Kingsbury, B.2    Sindhwani, V.3    Arisoy, E.4    Ramabhadran, B.5
  • 10
    • 84946014836 scopus 로고    scopus 로고
    • Small-footprint high-performance deep neural network-based speech recognition using split-VQ
    • IEEE
    • Yongqiang Wang, Jinyu Li, and Yifan Gong, "Small-footprint high-performance deep neural network-based speech recognition using split-VQ, " in ICASSP. 2015, pp. 4984-4988, IEEE
    • (2015) ICASSP. , pp. 4984-4988
    • Wang, Y.1    Li, J.2    Gong, Y.3
  • 11
    • 84959121080 scopus 로고    scopus 로고
    • Transferring knowledge from a RNN to a DNN
    • ISCA
    • William Chan, Nan Rosemary Ke, and Ian Lane, "Transferring knowledge from a RNN to a DNN, " in INTERSPEECH. 2015, ISCA
    • (2015) INTERSPEECH.
    • Chan, W.1    Rosemary Ke, N.2    Lane, I.3
  • 14
    • 84910046405 scopus 로고    scopus 로고
    • Long short-term memory recurrent neural network architectures for large scale acoustic modeling
    • ISCA
    • Haşim Sak, Andrew Senior, and Françoise Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling, " in INTERSPEECH. 2014, pp. 338-342, ISCA
    • (2014) INTERSPEECH. , pp. 338-342
    • Sak, H.1    Senior, A.2    Beaufays, F.3
  • 15
    • 84946084790 scopus 로고    scopus 로고
    • Learning acoustic frame labeling for speech recognition with recurrent neural networks
    • Haşim Sak, Andrew Senior, Kanishka Rao, Ozan Irsoy, Alex Graves, Françoise Beaufays, and Johan Schalkwyk, "Learning acoustic frame labeling for speech recognition with recurrent neural networks, " in ICASSP, 2015, pp. 4280-4284
    • (2015) ICASSP , pp. 4280-4284
    • Sak, H.1    Senior, A.2    Rao, K.3    Irsoy, O.4    Graves, A.5    Beaufays, F.6    Schalkwyk, J.7
  • 16
    • 84959112739 scopus 로고    scopus 로고
    • Fast and accurate recurrent neural network acoustic models for speech recognition
    • ISCA
    • Haşim Sak, Andrew Senior, Kanishka Rao, and Françoise Beaufays, "Fast and accurate recurrent neural network acoustic models for speech recognition, " in INTERSPEECH. 2015, pp. 1468-1472, ISCA
    • (2015) INTERSPEECH. , pp. 1468-1472
    • Sak, H.1    Senior, A.2    Rao, K.3    Beaufays, F.4
  • 17
    • 84865720088 scopus 로고    scopus 로고
    • Unary data structures for language models
    • ISCA
    • Jeffrey Sorensen and Cyril Allauzen, "Unary data structures for language models, " in INTERSPEECH. 2011, ISCA
    • (2011) INTERSPEECH.
    • Sorensen, J.1    Allauzen, C.2
  • 19
    • 84910072094 scopus 로고    scopus 로고
    • Sequence discriminative distributed training of long short-term memory recurrent neural networks
    • Haşim Sak, Oriol Vinyals, Georg Heigold, Andrew Senior, Erik McDermott, Rajat Monga, and Mark Mao, "Sequence discriminative distributed training of long short-term memory recurrent neural networks, " in INTERSPEECH, 2014, pp. 1209-1213
    • (2014) INTERSPEECH , pp. 1209-1213
    • Sak, H.1    Vinyals, O.2    Heigold, G.3    Senior, A.4    McDermott, E.5    Monga, R.6    Mao, M.7
  • 23
    • 84865772214 scopus 로고    scopus 로고
    • Bayesian language model interpolation for mobile speech input
    • Cyril Allauzen and Michael Riley, "Bayesian language model interpolation for mobile speech input, " in INTERSPEECH, 2011, pp. 1429-1432
    • (2011) INTERSPEECH , pp. 1429-1432
    • Allauzen, C.1    Riley, M.2
  • 24
    • 85075929453 scopus 로고    scopus 로고
    • Speech recognition with weighted finite-state transducers
    • Jacob Benesty, M. Sondhi, and Yiteng Huang, Eds., chapter 28 Springer
    • Mehryar Mohri, Fernando Pereira, and Michael Riley, "Speech recognition with weighted finite-state transducers, " in Handbook of Speech Processing, Jacob Benesty, M. Sondhi, and Yiteng Huang, Eds., chapter 28, pp. 559-582. Springer, 2008
    • (2008) Handbook of Speech Processing , pp. 559-582
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 25
    • 84946032010 scopus 로고    scopus 로고
    • Grapheme-to-phoneme conversion using long shortterm memory recurrent neural networks
    • Kanishka Rao, Fuchun Peng, Haşim Sak, and Françoise Beaufays, "Grapheme-to-phoneme conversion using long shortterm memory recurrent neural networks, " in ICASSP, 2015.
    • (2015) ICASSP
    • Rao, K.1    Peng, F.2    Sak, H.3    Beaufays, F.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.