-
1
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " IEEE Signal Processing Magazine, vol. 29, pp. 82-97, 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
3
-
-
0033709098
-
Tandem connectionist feature extraction for conventionalHMMsystems
-
H. Hermansky, D. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventionalHMMsystems, " in In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2000.
-
(2000)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing
-
-
Hermansky, H.1
Ellis, D.2
Sharma, S.3
-
4
-
-
84858976070
-
Feature engineering in context dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context dependent deep neural networks for conversational speech transcription, " in In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2011.
-
(2011)
Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
5
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors, " in In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2013.
-
(2013)
Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
6
-
-
84886829539
-
Optimization techniques to improve training speed of deep neural networks for large speech tasks
-
T. Sainath, B. Kingsbury, H. Soltau, and B. Ramabhadran, "Optimization techniques to improve training speed of deep neural networks for large speech tasks, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, pp. 2267-2276, 2013.
-
(2013)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.21
, pp. 2267-2276
-
-
Sainath, T.1
Kingsbury, B.2
Soltau, H.3
Ramabhadran, B.4
-
8
-
-
84921731072
-
Fast adaptation of deep neural network based on discriminant codes for speech recognition
-
S. Xue, O. Abdel-Hamid, H. Jiang, L. Dai, and Q. Liu, "Fast adaptation of deep neural network based on discriminant codes for speech recognition, " ACM/IEEE Transactions on Audio, Speech, and Language Processing, vol. 22, pp. 1713-1725, 2014.
-
(2014)
ACM/IEEE Transactions on Audio, Speech, and Language Processing
, vol.22
, pp. 1713-1725
-
-
Xue, S.1
Abdel-Hamid, O.2
Jiang, H.3
Dai, L.4
Liu, Q.5
-
9
-
-
84874226579
-
Adaptation of context-dependent deep neural networks for automatic speech recognition
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition, " in In Proceedings of IEEE Spoken Language Technology (SLT) Workshop, 2012.
-
(2012)
Proceedings of IEEE Spoken Language Technology (SLT) Workshop
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
12
-
-
84959142471
-
Robust i-vector based adaptation of DNN acoustic model for speech recognition
-
S. Garimella, A. Mandal, N. Strom, B. Hoffmeister, S. Matsoukas, and S. H. K. Parthasarathi, "Robust i-vector based adaptation of DNN acoustic model for speech recognition, " in In Proceedings of Interspeech, 2015.
-
(2015)
Proceedings of Interspeech
-
-
Garimella, S.1
Mandal, A.2
Strom, N.3
Hoffmeister, B.4
Matsoukas, S.5
Parthasarathi, S.H.K.6
-
15
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models, " Computer Speech and Language, no. 9, 1995.
-
(1995)
Computer Speech and Language
, Issue.9
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
16
-
-
0030263447
-
Mean and variance adaptation within the MLLR framework
-
M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework, " Computer Speech and Language, vol. 10, pp. 249-264, 1996.
-
(1996)
Computer Speech and Language
, vol.10
, pp. 249-264
-
-
Gales, M.J.F.1
Woodland, P.C.2
-
17
-
-
0031647824
-
A frequency warping approach to speaker normalization
-
L. Lee and R. Rose, "A frequency warping approach to speaker normalization, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 6, pp. 49-60, 1998.
-
(1998)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.6
, pp. 49-60
-
-
Lee, L.1
Rose, R.2
-
18
-
-
0029764708
-
Speaker normalization on conversational telephone speech
-
S. Wegmann, D. McAllaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech, " in In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 1996.
-
(1996)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing
-
-
Wegmann, S.1
McAllaster, D.2
Orloff, J.3
Peskin, B.4
-
19
-
-
84890465724
-
The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions
-
S. H. K. Parthasarathi, S. Y. Chang, J. Cohen, N. Morgan, and S. Wegmann, "The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions, " in In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2013.
-
(2013)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing
-
-
Parthasarathi, S.H.K.1
Chang, S.Y.2
Cohen, J.3
Morgan, N.4
Wegmann, S.5
-
20
-
-
33646759965
-
Adaptive training using simple target models
-
G. Stemmer, F. Brugnara, and D. Giuliani, "Adaptive training using simple target models, " in In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
-
(2005)
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing
-
-
Stemmer, G.1
Brugnara, F.2
Giuliani, D.3
-
21
-
-
0032021555
-
On combining classifiers
-
J. Kittler, M. Hatef, R. P. W. Duin, and J. Matas, "On combining classifiers. " IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, pp. 226-239, 1998.
-
(1998)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.20
, pp. 226-239
-
-
Kittler, J.1
Hatef, M.2
Duin, R.P.W.3
Matas, J.4
|