-
1
-
-
0024610919
-
A tutorial on hidden Markov models and selectedapplications in speech recognition
-
L. Rabiner, "A tutorial on hidden Markov models and selectedapplications in speech recognition, " Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.1
-
2
-
-
85032751458
-
Deepneural networks for acoustic modeling in speech recognition: Theshared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deepneural networks for acoustic modeling in speech recognition: Theshared views of four research groups, " IEEE Signal ProcessingMagazine, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
IEEE Signal ProcessingMagazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
3
-
-
33947635130
-
Regularized adaptation of discriminativeclassifiers
-
X. Li and J. Bilmes, "Regularized adaptation of discriminativeclassifiers, " in Proc. ICASSP, vol. 1, 2006, pp. I-I.
-
(2006)
Proc. ICASSP
, vol.1
, pp. I-I
-
-
Li, X.1
Bilmes, J.2
-
4
-
-
84890542079
-
KL-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition
-
D. Yu, K. Yao, H. Su, G. Li, and F. Seide, "KL-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition, " in Proc. ICASSP, 2013, pp. 7893-7897.
-
(2013)
Proc. ICASSP
, pp. 7893-7897
-
-
Yu, D.1
Yao, K.2
Su, H.3
Li, G.4
Seide, F.5
-
6
-
-
84871387302
-
The deep tensor neural networkwith applications to large vocabulary speech recognition
-
D. Yu, L. Deng, and S. Seide, "The deep tensor neural networkwith applications to large vocabulary speech recognition, " IEEETrans. Audio, Speech, and Language Processing, vol. 21, no. 2, pp. 388-396, 2013.
-
(2013)
IEEETrans. Audio, Speech, and Language Processing
, vol.21
, Issue.2
, pp. 388-396
-
-
Yu, D.1
Deng, L.2
Seide, S.3
-
7
-
-
84937854847
-
Speaker-adaptation for hybridHMM-ANN continuous speech recognition system
-
J. Neto, L. Almeida, M. Hochberg, C. Martins, L. Nunes, S. Renals, and T. Robinson, "Speaker-adaptation for hybridHMM-ANN continuous speech recognition system, " in Proc. Eurospeech, 1995.
-
(1995)
Proc. Eurospeech
-
-
Neto, J.1
Almeida, L.2
Hochberg, M.3
Martins, C.4
Nunes, L.5
Renals, S.6
Robinson, T.7
-
8
-
-
34548012893
-
Linearhidden transformations for adaptation of hybrid ANN/HMMmodels
-
R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. D. Mori, "Linearhidden transformations for adaptation of hybrid ANN/HMMmodels, " Speech Communication, vol. 49, no. 10, pp. 827-835, 2007.
-
(2007)
Speech Communication
, vol.49
, Issue.10
, pp. 827-835
-
-
Gemello, R.1
Mana, F.2
Scanzio, S.3
Laface, P.4
Mori, R.D.5
-
9
-
-
84858976070
-
Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription, " in Proc. ASRU, 2011, pp. 24-29.
-
(2011)
Proc. ASRU
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
10
-
-
84874226579
-
Adaptationof context-dependent deep neural networks for automatic speechrecognition
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptationof context-dependent deep neural networks for automatic speechrecognition, " in Proc. Spoken Language Technology Workshop, 2012, pp. 366-369.
-
(2012)
Proc. Spoken Language Technology Workshop
, pp. 366-369
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
11
-
-
84893691530
-
Speaker adaptationof neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptationof neural network acoustic models using i-vectors, " in Proc. ASRU, 2013, pp. 55-59.
-
(2013)
Proc. ASRU
, pp. 55-59
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
12
-
-
84881054791
-
Hermitian polynomial forspeaker adaptation of connectionist speech recognition systems
-
S. M. Siniscalchi, J. Li, and C.-H. Lee, "Hermitian polynomial forspeaker adaptation of connectionist speech recognition systems, "IEEE Trans. Audio, Speech, and Language Processing, vol. 21, no. 10, pp. 2152-2161, 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Language Processing
, vol.21
, Issue.10
, pp. 2152-2161
-
-
Siniscalchi, S.M.1
Li, J.2
Lee, C.-H.3
-
13
-
-
84906225505
-
Rapid and effective speaker adaptationof convolutional neural network based models for speechrecognition
-
O. Abdel-Hamid and H. Jiang, "Rapid and effective speaker adaptationof convolutional neural network based models for speechrecognition, " in Proc. INTERSPEECH, 2013, pp. 1248-1252.
-
(2013)
Proc. INTERSPEECH
, pp. 1248-1252
-
-
Abdel-Hamid, O.1
Jiang, H.2
-
14
-
-
84983119674
-
Learning hidden unit contributionsfor unsupervised speaker adaptation of neural networkacoustic models
-
P. Swietojanski and S. Renals, "Learning hidden unit contributionsfor unsupervised speaker adaptation of neural networkacoustic models, " in Proc. IEEE STL, 2014.
-
(2014)
Proc. IEEE STL
-
-
Swietojanski, P.1
Renals, S.2
-
15
-
-
84905262902
-
Factorized adaptation for deepneural network
-
J. Li, J.-T. Huang, and Y. Gong, "Factorized adaptation for deepneural network, " in Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Li, J.1
Huang, J.-T.2
Gong, Y.3
-
16
-
-
84890452886
-
Fast speaker adaptation of hybridNN/HMM model for speech recognition based on discriminativelearning of speaker code
-
O. Abdel-Hamid and H. Jiang, "Fast speaker adaptation of hybridNN/HMM model for speech recognition based on discriminativelearning of speaker code, " in Proc. ICASSP, 2013, pp. 7942-7946.
-
(2013)
Proc. ICASSP
, pp. 7942-7946
-
-
Abdel-Hamid, O.1
Jiang, H.2
-
17
-
-
84905284226
-
Direct adaptationof hybrid DNN/HMM model for fast speaker adaptationin LVCSR based on speaker code
-
S. Xue, O. Abdel-Hamid, H. Jiang, and L. Dai, "Direct adaptationof hybrid DNN/HMM model for fast speaker adaptationin LVCSR based on speaker code, " in Proc. ICASSP, 2014, pp. 6339-6343.
-
(2014)
Proc. ICASSP
, pp. 6339-6343
-
-
Xue, S.1
Abdel-Hamid, O.2
Jiang, H.3
Dai, L.4
-
18
-
-
84921731072
-
Fastadaptation of deep neural network based on discriminant codesfor speech recognition
-
S. Xue, O. Abdel-Hamid, H. Jiang, L. Dai, and Q. Liu, "Fastadaptation of deep neural network based on discriminant codesfor speech recognition, " IEEE/ACM Trans. on Audio, Speech and Lang. Proc., vol. 22, no. 12, pp. 1713-1725, 2014.
-
(2014)
IEEE/ACM Trans. on Audio, Speech and Lang. Proc
, vol.22
, Issue.12
, pp. 1713-1725
-
-
Xue, S.1
Abdel-Hamid, O.2
Jiang, H.3
Dai, L.4
Liu, Q.5
-
19
-
-
0027683813
-
Shared-distribution hiddenmarkov models for speech recognition
-
M.-Y. M.-Y. Hwang and X. Huang, "Shared-distribution hiddenmarkov models for speech recognition, " IEEE Trans. Speech and Audio Processing, vol. 1, no. 4, pp. 414-420, 1993.
-
(1993)
IEEE Trans. Speech and Audio Processing
, vol.1
, Issue.4
, pp. 414-420
-
-
Hwang, M.-Y.M.-Y.1
Huang, X.2
-
20
-
-
84938690750
-
Speaker adaptation of deepneural networks using a hierarchy of output layers
-
R. Price, I. Kenichi, and K. Shinoda, "Speaker adaptation of deepneural networks using a hierarchy of output layers, " in Proc. SLT, 2014.
-
(2014)
Proc. SLT
-
-
Price, R.1
Kenichi, I.2
Shinoda, K.3
-
21
-
-
84959161626
-
Maximum a posteriori adaptation of network parameters indeep models
-
Z. Huang, S. M. Siniscalchi, I.-F. Chen, J. Li, J. Wu, and C.-H. Lee, "Maximum a posteriori adaptation of network parameters indeep models, " 2015, submitted to INTERSPEECH.
-
(2015)
INTERSPEECH
-
-
Huang, Z.1
Siniscalchi, S.M.2
Chen, I.-F.3
Li, J.4
Wu, J.5
Lee, C.-H.6
-
22
-
-
85121045899
-
Multitask learning: A knowledge-based source of inductivebias
-
R. Caruna, "Multitask learning: A knowledge-based source of inductivebias, " in Proc. ICML, 1993, pp. 41-48.
-
(1993)
Proc. ICML
, pp. 41-48
-
-
Caruna, R.1
-
23
-
-
85009167968
-
Multitask learning in connectionistrobust asr using recurrent neural networks
-
S. Parveen and P. Green, "Multitask learning in connectionistrobust asr using recurrent neural networks. " in Proc. INTERSPEECH, 2003.
-
(2003)
Proc. INTERSPEECH
-
-
Parveen, S.1
Green, P.2
-
24
-
-
84890458846
-
Multitask learning in connectionist speech recognition
-
Y. Lu, F. Lu, S. Sehgal, S. Gupta, J. Du, C. Tham, P. Green, and V. Wan, "Multitask learning in connectionist speech recognition, "in Proc. Australian International Conference on Speech Scienceand Technology, 2004.
-
(2004)
Proc. Australian International Conference on Speech Scienceand Technology
-
-
Lu, Y.1
Lu, F.2
Sehgal, S.3
Gupta, S.4
Du, J.5
Tham, C.6
Green, P.7
Wan, V.8
-
25
-
-
84890545600
-
Multi-task learning in deep neuralnetworks for improved phoneme recognition
-
M. Seltzer and J. Droppo, "Multi-task learning in deep neuralnetworks for improved phoneme recognition, " in Proc. ICASSP, 2013, pp. 6965-6969.
-
(2013)
Proc. ICASSP
, pp. 6965-6969
-
-
Seltzer, M.1
Droppo, J.2
-
26
-
-
84976230656
-
Learning auxiliarycategorization for neural network based speech synthesis
-
Z. Wen, K. Li, Z. Huang, J. Tao, and C.-H. Lee, "Learning auxiliarycategorization for neural network based speech synthesis, "2015, submitted to INTERSPEECH.
-
(2015)
INTERSPEECH
-
-
Wen, Z.1
Li, K.2
Huang, Z.3
Tao, J.4
Lee, C.-H.5
-
27
-
-
84959100788
-
Multiobjectivelearning and mask-based post-processing for deep neuralnetwork based speech enhancement
-
Y. Xu, J. Du, Z. Huang, L.-R. Dai, and C.-H. Lee, "Multiobjectivelearning and mask-based post-processing for deep neuralnetwork based speech enhancement, " 2015, submitted to INTERSPEECH.
-
(2015)
INTERSPEECH
-
-
Xu, Y.1
Du, J.2
Huang, Z.3
Dai, L.-R.4
Lee, C.-H.5
-
28
-
-
84866054643
-
-
MIT Press, Cambridge, MA, USA
-
D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representationsby back-propagating errors. MIT Press, Cambridge, MA, USA, 1988.
-
(1988)
Learning Representationsby Back-propagating Errors
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Williams, R.J.3
-
29
-
-
80053446822
-
Optimaldistributed online prediction
-
O. Dekel, R. Gilad-Bachrach, O. Shamir, and L. Xiao, "Optimaldistributed online prediction, " in Proc. ICML, 2011, pp. 713-720.
-
(2011)
Proc. ICML
, pp. 713-720
-
-
Dekel, O.1
Gilad-Bachrach, R.2
Shamir, O.3
Xiao, L.4
-
30
-
-
33746600649
-
Reducing the dimensionalityof data with neural networks
-
G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionalityof data with neural networks, " Science, vol. 313, no. 5786, pp. 504-507, 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.R.2
-
31
-
-
85008035419
-
Equivalenceof generative and log-linear models
-
G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schluter, "Equivalenceof generative and log-linear models, " IEEE Trans. Audio, Speech & Language Processing, vol. 19, no. 5, pp. 1138-1148, 2011.
-
(2011)
IEEE Trans. Audio, Speech & Language Processing
, vol.19
, Issue.5
, pp. 1138-1148
-
-
Heigold, G.1
Ney, H.2
Lehnen, P.3
Gass, T.4
Schluter, R.5
-
32
-
-
84910035297
-
Learning small-sizednn with output-distribution-based criteria
-
J. Li, R. Zhao, J.-T. Huang, and Y. Gong, "Learning small-sizednn with output-distribution-based criteria, " in Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Li, J.1
Zhao, R.2
Huang, J.-T.3
Gong, Y.4
-
33
-
-
0029288633
-
Maximum likelihood linearregression for speaker adaptation of continuous density hiddenMarkov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linearregression for speaker adaptation of continuous density hiddenMarkov models, " Computer Speech & Language, vol. 9, no. 2, pp. 171-185, 1995.
-
(1995)
Computer Speech & Language
, vol.9
, Issue.2
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
35
-
-
84858953642
-
The Kaldi speech recognitiontoolkit
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi speech recognitiontoolkit, " in Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
Silovsky, J.11
Stemmer, G.12
Vesely, K.13
-
36
-
-
84890454527
-
Low-rank matrix factorization for deep neural networktraining with high-dimensional output targets
-
T. N. Sainath, B. Kingsbury, V. Sindhwani, E. Arisoy, and B. Ramabhadran, "Low-rank matrix factorization for deep neural networktraining with high-dimensional output targets, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE InternationalConference on. IEEE, 2013, pp. 6655-6659.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE InternationalConference On. IEEE
, pp. 6655-6659
-
-
Sainath, T.N.1
Kingsbury, B.2
Sindhwani, V.3
Arisoy, E.4
Ramabhadran, B.5
-
37
-
-
84906227589
-
Restructuring of deep neural networkacoustic models with singular value decomposition
-
J. Xue, J. Li, and Y. Gong, "Restructuring of deep neural networkacoustic models with singular value decomposition. " in INTERSPEECH, 2013, pp. 2365-2369.
-
(2013)
INTERSPEECH
, pp. 2365-2369
-
-
Xue, J.1
Li, J.2
Gong, Y.3
-
38
-
-
84905229915
-
Singular value decompositionbased low-footprint speaker adaptation and personalizationfor deep neural network
-
J. Xue, J. Li, D. Yu, M. Seltzer, and Y. Gong, "Singular value decompositionbased low-footprint speaker adaptation and personalizationfor deep neural network, " in Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Xue, J.1
Li, J.2
Yu, D.3
Seltzer, M.4
Gong, Y.5
-
39
-
-
84912109599
-
Speaker adaptation of hybridNN/HMM model for speech recognition based on singular valuedecomposition
-
S. Xue, H. Jiang, and L. Dai, "Speaker adaptation of hybridNN/HMM model for speech recognition based on singular valuedecomposition, " in Proc. ISCSLP, 2014.
-
(2014)
Proc. ISCSLP
-
-
Xue, S.1
Jiang, H.2
Dai, L.3
|