-
1
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
J. L. Gauvain and Chin-Hui Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 2, pp. 291-298, 1994.
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.L.1
Lee, C.2
-
2
-
-
0031177213
-
Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
-
S. M. Ahadi and P. C. Woodland, "Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models," Computer Speech & Language, vol. 11, no. 3, pp. 187-206, 1997.
-
(1997)
Computer Speech & Language
, vol.11
, Issue.3
, pp. 187-206
-
-
Ahadi, S.M.1
Woodland, P.C.2
-
3
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
Christopher Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech & Language, vol. 9, no. 2, pp. 171-185, 1995.
-
(1995)
Computer Speech & Language
, vol.9
, Issue.2
, pp. 171-185
-
-
Leggetter, C.C.1
Woodland, P.2
-
4
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
Mark J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech & Language, vol. 12, no. 2, pp. 75-98, 1998.
-
(1998)
Computer Speech & Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
5
-
-
0029375590
-
Speaker adaptation using constrained estimation of Gaussian mixtures
-
Vassilios V Digalakis, Dimitry Rtischev, and Leonardo G Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Transactions on Speech and Audio Processing, vol. 3, no. 5, pp. 357-366, 1995.
-
(1995)
IEEE Transactions on Speech and Audio Processing
, vol.3
, Issue.5
, pp. 357-366
-
-
Digalakis, V.V.1
Rtischev, D.2
Neumeyer, L.G.3
-
6
-
-
0029747183
-
Speaker normalization using efficient frequency warping procedures
-
IEEE
-
Li Lee and Richard C Rose, "Speaker normalization using efficient frequency warping procedures," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1996, vol. 1, pp. 353-356.
-
(1996)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
, vol.1
, pp. 353-356
-
-
Lee, L.1
Rose, R.C.2
-
7
-
-
0034848766
-
Hierarchical stochastic feature matching for robust speech recognition
-
IEEE
-
Hui Jiang, Frank Soong, and Chin-Hui Lee, "Hierarchical stochastic feature matching for robust speech recognition," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2001, vol. 1, pp. 217-220.
-
(2001)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
, vol.1
, pp. 217-220
-
-
Jiang, H.1
Soong, F.2
Lee, C.3
-
8
-
-
84937854847
-
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system
-
Joao Neto, Luís Almeida, Mike Hochberg, Ciro Martins, Luís Nunes, Steve Renals, and Tony Robinson, "Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system," in EUROSPEECH, 1995.
-
(1995)
EUROSPEECH
-
-
Neto, J.1
Almeida, L.2
Hochberg, M.3
Martins, C.4
Nunes, L.5
Renals, S.6
Robinson, T.7
-
9
-
-
34548012893
-
Linear hidden transformations for adaptation of hybrid ANN/HMM models
-
Roberto Gemello, Franco Mana, Stefano Scanzio, Pietro Laface, and Renato De Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models," Speech Communication, vol. 49, no. 10, pp. 827-835, 2007.
-
(2007)
Speech Communication
, vol.49
, Issue.10
, pp. 827-835
-
-
Gemello, R.1
Mana, F.2
Scanzio, S.3
Laface, P.4
De Mori, R.5
-
11
-
-
84881054791
-
Hermitian polynomial for speaker adaptation of connectionist speech recognition systems
-
Sabato Marco Siniscalchi, Jinyu Li, and C-H Lee, "Hermitian polynomial for speaker adaptation of connectionist speech recognition systems," IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 10, pp. 2152-2161, 2013.
-
(2013)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.21
, Issue.10
, pp. 2152-2161
-
-
Siniscalchi, S.M.1
Li, J.2
Lee, C.-H.3
-
12
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
Frank Seide, Gang Li, Xie Chen, and Dong Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in 2011 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2011.
-
(2011)
2011 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
13
-
-
84874226579
-
Adaptation of context-dependent deep neural networks for automatic speech recognition
-
IEEE
-
Kaisheng Yao, Dong Yu, Frank Seide, Hang Su, Li Deng, and Yifan Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition," in Spoken Language Technology Workshop (SLT). IEEE, 2012, pp. 366-369.
-
(2012)
Spoken Language Technology Workshop (SLT)
, pp. 366-369
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
14
-
-
84890542079
-
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
-
IEEE
-
Dong Yu, Kaisheng Yao, Hang Su, Gang Li, and Frank Seide, "KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2013, pp. 7893-7897.
-
(2013)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
, pp. 7893-7897
-
-
Yu, D.1
Yao, K.2
Su, H.3
Li, G.4
Seide, F.5
-
16
-
-
84890543571
-
Deep hierarchical bottleneck MRASTA features for LVCSR
-
IEEE
-
Zoltan Tuske, Ralf Schluter, and Hermann Ney, "Deep hierarchical bottleneck MRASTA features for LVCSR," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2013, pp. 6970-6974.
-
(2013)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
, pp. 6970-6974
-
-
Tuske, Z.1
Schluter, R.2
Ney, H.3
-
17
-
-
84890452886
-
Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
-
IEEE
-
Ossama Abdel-Hamid and Hui Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2013, pp. 7942-7946.
-
(2013)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
, pp. 7942-7946
-
-
Abdel-Hamid, O.1
Jiang, H.2
-
18
-
-
84906225505
-
Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
-
Ossama Abdel-Hamid and Hui Jiang, "Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition," in INTERSPEECH, 2013.
-
(2013)
INTERSPEECH
-
-
Abdel-Hamid, O.1
Jiang, H.2
-
19
-
-
84905284226
-
Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code
-
Shaofei Xue, Ossama Abdel-Hamid, Hui Jiang, and Lirong Dai, "Direct adaptation of hybrid DNN/HMM model for fast speaker adaptation in LVCSR based on speaker code," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP), 2014.
-
(2014)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
-
-
Xue, S.1
Abdel-Hamid, O.2
Jiang, H.3
Dai, L.4
-
20
-
-
84905268324
-
Speaker adaptation of deep neural network based on discriminant codes
-
Shaofei Xue, Ossama Abdel-Hamid, Hui Jiang, and Lirong Dai, "Speaker adaptation of deep neural network based on discriminant codes," submitted to IEEE Transactions on Acoustics, Speech and Signal Processing, Feb 2014.
-
(2014)
IEEE Transactions on Acoustics, Speech and Signal Processing
-
-
Xue, S.1
Abdel-Hamid, O.2
Jiang, H.3
Dai, L.4
-
21
-
-
84905229915
-
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network
-
Jian Xue, Jinyu Li, Dong Yu, Mike Seltzer, and Yifan Gong, "Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP), 2014.
-
(2014)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
-
-
Xue, J.1
Li, J.2
Yu, D.3
Seltzer, M.4
Gong, Y.5
-
22
-
-
84898971588
-
Predicting parameters in deep learning
-
Misha Denil, Babak Shakibi, Laurent Dinh, Nando de Freitas, et al., "Predicting parameters in deep learning," in Advances in Neural Information Processing Systems, 2013, pp. 2148-2156.
-
(2013)
Advances in Neural Information Processing Systems
, pp. 2148-2156
-
-
Denil, M.1
Shakibi, B.2
Dinh, L.3
De Freitas, N.4
-
23
-
-
84890454527
-
Low-rank matrix factorization for deep neural network training with high-dimensional output targets
-
IEEE
-
Tara N Sainath, Brian Kingsbury, Vikas Sindhwani, Ebru Arisoy, and Bhuvana Ramabhadran, "Low-rank matrix factorization for deep neural network training with high-dimensional output targets," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2013, pp. 6655-6659.
-
(2013)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
, pp. 6655-6659
-
-
Sainath, T.N.1
Kingsbury, B.2
Sindhwani, V.3
Arisoy, E.4
Ramabhadran, B.5
-
24
-
-
84906227589
-
Restructuring of deep neural network acoustic models with singular value decomposition
-
Jian Xue, Jinyu Li, and Yifan Gong, "Restructuring of deep neural network acoustic models with singular value decomposition," in INTERSPEECH, 2013.
-
(2013)
INTERSPEECH
-
-
Xue, J.1
Li, J.2
Gong, Y.3
-
25
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
K-F Lee and H-W Hon, "Speaker-independent phone recognition using hidden Markov models," IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 37, no. 11, pp. 1641-1648, 1989.
-
(1989)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.-F.1
Hon, H.-W.2
-
26
-
-
84874485803
-
Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMs in acoustic modeling
-
Jia Pan, Cong Liu, Zhiguo Wang, Yu Hu, and Hui Jiang, "Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMs in acoustic modeling," in 8th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2012, pp. 301-305.
-
(2012)
8th International Symposium on Chinese Spoken Language Processing (ISCSLP)
, pp. 301-305
-
-
Pan, J.1
Liu, C.2
Wang, Z.3
Hu, Y.4
Jiang, H.5
-
27
-
-
84876477729
-
Investigation on dimensionality reduction of concatenated features with deep neural network for LVCSR systems
-
Yebo Bao, Hui Jiang, Cong Liu, Yu Hu, and Lirong Dai, "Investigation on dimensionality reduction of concatenated features with deep neural network for LVCSR systems," in IEEE 11th International Conference on Signal Processing (ICSP), 2012, vol. 1, pp. 562-566.
-
(2012)
IEEE 11th International Conference on Signal Processing (ICSP)
, vol.1
, pp. 562-566
-
-
Bao, Y.1
Jiang, H.2
Liu, C.3
Hu, Y.4
Dai, L.5
-
28
-
-
84890445451
-
Incoherent training of deep neural networks to de-correlate bottleneck features for speech recognition
-
Yebo Bao, Hui Jiang, Lirong Dai, and Cong Liu, "Incoherent training of deep neural networks to de-correlate bottleneck features for speech recognition," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP), 2013.
-
(2013)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
-
-
Bao, Y.1
Jiang, H.2
Dai, L.3
Liu, C.4
-
29
-
-
84905252086
-
Improving deep neural networks for LVCSR using dropout and shrinking structure
-
Shiliang Zhang, Yebo Bao, Pan Zhou, Hui Jiang, and Lirong Dai, "Improving deep neural networks for LVCSR using dropout and shrinking structure," in IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP), 2014.
-
(2014)
IEEE International Conference of Acoustics, Speech and Signal Processing (ICASSP)
-
-
Zhang, S.1
Bao, Y.2
Zhou, P.3
Jiang, H.4
Dai, L.5