-
1
-
-
0028419019
-
Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains
-
J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains, " Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 2, pp. 291-298, 1994.
-
(1994)
Speech and Audio Processing, IEEE Transactions on
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
2
-
-
0030263447
-
Mean and variance adaptation within the mllr framework
-
M. J. Gales and P. Woodland, "Mean and variance adaptation within the mllr framework, " Computer Speech & Language, vol. 10, no. 4, pp. 249-264, 1996.
-
(1996)
Computer Speech & Language
, vol.10
, Issue.4
, pp. 249-264
-
-
Gales, M.J.1
Woodland, P.2
-
3
-
-
80051654263
-
Deep belief networks using discriminative features for phone recognition
-
IEEE
-
A.-R. Mohamed, T. N. Sainath, G. Dahl, B. Ramabhadran, G. E. Hinton, and M. A. Picheny, "Deep belief networks using discriminative features for phone recognition, " in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. IEEE, 2011, pp. 5060-5063.
-
(2011)
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
, pp. 5060-5063
-
-
Mohamed, A.-R.1
Sainath, T.N.2
Dahl, G.3
Ramabhadran, B.4
Hinton, G.E.5
Picheny, M.A.6
-
4
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
5
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks." in Interspeech, 2011, pp. 437-440.
-
(2011)
Interspeech
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
6
-
-
84874226579
-
Adaptation of context-dependent deep neural networks for automatic speech recognition
-
IEEE
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition, " in Spoken Language Technology Workshop (SLT), 2012 IEEE. IEEE, 2012, pp. 366-369.
-
(2012)
Spoken Language Technology Workshop (SLT), 2012 IEEE
, pp. 366-369
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
7
-
-
84890542079
-
Kl-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
-
IEEE
-
D. Yu, K. Yao, H. Su, G. Li, and F. Seide, "Kl-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7893-7897.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
, pp. 7893-7897
-
-
Yu, D.1
Yao, K.2
Su, H.3
Li, G.4
Seide, F.5
-
8
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using i-vectors
-
IEEE
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors, " in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 55-59.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
, pp. 55-59
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
9
-
-
50249170027
-
Joint factor analysis versus eigenchannels in speaker recognition
-
P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, no. 4, pp. 1435-1447, 2007.
-
(2007)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.15
, Issue.4
, pp. 1435-1447
-
-
Kenny, P.1
Boulianne, G.2
Ouellet, P.3
Dumouchel, P.4
-
10
-
-
70450180849
-
Support vector machines versus fast scoring in the lowdimensional total variability space for speaker verification
-
N. Dehak, R. Dehak, P. Kenny, N. Brümmer, P. Ouellet, and P. Dumouchel, "Support vector machines versus fast scoring in the lowdimensional total variability space for speaker verification." in INTERSPEECH, vol. 9, 2009, pp. 1559-1562.
-
(2009)
INTERSPEECH
, vol.9
, pp. 1559-1562
-
-
Dehak, N.1
Dehak, R.2
Kenny, P.3
Brümmer, N.4
Ouellet, P.5
Dumouchel, P.6
-
11
-
-
84865753339
-
Intersession compensation and scoring methods in the i-vectors space for speaker recognition
-
P.-M. Bousquet, D. Matrouf, and J.-F. Bonastre, "Intersession compensation and scoring methods in the i-vectors space for speaker recognition." in InterSpeech, 2011, pp. 485-488.
-
(2011)
InterSpeech
, pp. 485-488
-
-
Bousquet, P.-M.1
Matrouf, D.2
Bonastre, J.-F.3
-
12
-
-
84865733857
-
Analysis of i-vector length normalization in speaker recognition systems
-
D. Garcia-Romero and C. Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems." in Interspeech, 2011, pp. 249-252.
-
(2011)
Interspeech
, pp. 249-252
-
-
Garcia-Romero, D.1
Espy-Wilson, C.Y.2
-
14
-
-
84906274473
-
An open-source state-of-the-art toolbox for broadcast news diarization
-
M. Rouvier, G. Dupuy, P. Gay, E. Khoury, T. Merlin, and S. Meignier, "An open-source state-of-the-art toolbox for broadcast news diarization." in InterSpeech, 2013.
-
(2013)
InterSpeech
-
-
Rouvier, M.1
Dupuy, G.2
Gay, P.3
Khoury, E.4
Merlin, T.5
Meignier, S.6
-
15
-
-
84858953642
-
The kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz et al., "The kaldi speech recognition toolkit, " in Proc. ASRU, 2011, pp. 1-4.
-
(2011)
Proc. ASRU
, pp. 1-4
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
-
16
-
-
70450180496
-
The ester 2 evaluation campaign for the rich transcription of french radio broadcasts
-
S. Galliano, G. Gravier, and L. Chaubard, "The ester 2 evaluation campaign for the rich transcription of french radio broadcasts." in Interspeech, vol. 9, 2009, pp. 2583-2586.
-
(2009)
Interspeech
, vol.9
, pp. 2583-2586
-
-
Galliano, S.1
Gravier, G.2
Chaubard, L.3
-
17
-
-
85016241152
-
The epac corpus: Manual and automatic annotations of conversational speech in french broadcast news
-
Y. Esteve, T. Bazillon, J.-Y. Antoine, F. Béchet, and J. Farinas, "The epac corpus: Manual and automatic annotations of conversational speech in french broadcast news." in LREC, 2010.
-
(2010)
LREC
-
-
Esteve, Y.1
Bazillon, T.2
Antoine, J.-Y.3
Béchet, F.4
Farinas, J.5
-
19
-
-
84907937611
-
Srilm-an extensible language modeling toolkit
-
A. Stolcke et al., "Srilm-an extensible language modeling toolkit." in InterSpeech, 2002.
-
(2002)
InterSpeech
-
-
Stolcke, A.1
-
20
-
-
85073229756
-
Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis
-
P.-M. Bousquet, A. Larcher, D. Matrouf, J.-F. Bonastre, and O. Plchot, "Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis, " in Speaker and Language Recognition Workshop (IEEE Odyssey), 2012.
-
(2012)
Speaker and Language Recognition Workshop (IEEE Odyssey)
-
-
Bousquet, P.-M.1
Larcher, A.2
Matrouf, D.3
Bonastre, J.-F.4
Plchot, O.5
-
22
-
-
84865783736
-
Mixture of plda models in i-vector space for genderindependent speaker recognition
-
M. Senoussaoui, P. Kenny, N. Brümmer, E. De Villiers, and P. Dumouchel, "Mixture of plda models in i-vector space for genderindependent speaker recognition." in InterSpeech, 2011, pp. 25- 28.
-
(2011)
InterSpeech
, pp. 25-28
-
-
Senoussaoui, M.1
Kenny, P.2
Brümmer, N.3
De Villiers, E.4
Dumouchel, P.5
|