-
1
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors," in Proc. ASRU, 2013, pp. 55-59.
-
(2013)
Proc. ASRU
, pp. 55-59
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
2
-
-
84905259138
-
Improving DNN speaker independence with i-vector inputs
-
A. Senior and I. Lopez-Moreno, "Improving DNN speaker independence with i-vector inputs," in Proc. ICASSP, 2014, pp. 225-229.
-
(2014)
Proc. ICASSP
, pp. 225-229
-
-
Senior, A.1
Lopez-Moreno, I.2
-
3
-
-
84910068089
-
Adaptation of deep neural network acoustic models using factorised i-vectors
-
P. Karanasou, Y. Wang, M. J. F. Gales, and P. C. Woodland, "Adaptation of deep neural network acoustic models using factorised i-vectors," in Proc. Interspeech, 2014, pp. 2180-2184.
-
(2014)
Proc. Interspeech
, pp. 2180-2184
-
-
Karanasou, P.1
Wang, Y.2
Gales, M.J.F.3
Woodland, P.C.4
-
4
-
-
84905225575
-
iVector-based acoustic data selection
-
O. Siohan and M. Bacchiani, "iVector-based acoustic data selection," in Proc. Interspeech, 2013, pp. 657-661.
-
(2013)
Proc. Interspeech
, pp. 657-661
-
-
Siohan, O.1
Bacchiani, M.2
-
5
-
-
84865718184
-
i-Vector based speaker recognition on short utterances
-
A. Kanagasundaram, R. Vogt, D. Dean, S. Sridharan, and M. Mason, "i-Vector based speaker recognition on short utterances," in Proc. Interspeech, 2011, pp. 2341-2344.
-
(2011)
Proc. Interspeech
, pp. 2341-2344
-
-
Kanagasundaram, A.1
Vogt, R.2
Dean, D.3
Sridharan, S.4
Mason, M.5
-
7
-
-
80051652767
-
Bayesian speaker verification with heavy-tailed priors
-
P. Kenny, "Bayesian speaker verification with heavy-tailed priors," in Proc. Odyssey-10, 2010.
-
(2010)
Proc. Odyssey-10
-
-
Kenny, P.1
-
8
-
-
84865733857
-
Analysis of i-vector length normalization in speaker recognition systems
-
D. Garcia-Romero and C. Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems," in Proc. Interspeech, 2011, pp. 249-252.
-
(2011)
Proc. Interspeech
, pp. 249-252
-
-
Garcia-Romero, D.1
Espy-Wilson, C.Y.2
-
9
-
-
84910028543
-
Modified-prior i-vector estimation for language identification of short duration utterances
-
R. Travadi, M. V. Segbroeck, and S. Narayanan, "Modified-prior i-vector estimation for language identification of short duration utterances," in Proc. Interspeech, 2014, pp. 3037-3041.
-
(2014)
Proc. Interspeech
, pp. 3037-3041
-
-
Travadi, R.1
Segbroeck, M.V.2
Narayanan, S.3
-
10
-
-
85135187845
-
Transformation smoothing for speaker and environmental adaptation
-
M. J. F. Gales, "Transformation smoothing for speaker and environmental adaptation," in Proc. Eurospeech, 1997.
-
(1997)
Proc. Eurospeech
-
-
Gales, M.J.F.1
-
11
-
-
79959841091
-
Prior information for rapid speaker adaptation
-
C. Breslin, K. Chin, M. Gales, K. Knill, and H. Xu, "Prior information for rapid speaker adaptation," in Proc. Interspeech, 2010, pp. 1644-1647.
-
(2010)
Proc. Interspeech
, pp. 1644-1647
-
-
Breslin, C.1
Chin, K.2
Gales, M.3
Knill, K.4
Xu, H.5
-
12
-
-
84878397811
-
Exploring rich expressive information from audiobook data using cluster adaptive training
-
L. Chen, M. J. F. Gales, V. Wan, J. Latorre, and M. Akamine, "Exploring rich expressive information from audiobook data using cluster adaptive training," in Proc. Interspeech, 2012, pp. 959-962.
-
(2012)
Proc. Interspeech
, pp. 959-962
-
-
Chen, L.1
Gales, M.J.F.2
Wan, V.3
Latorre, J.4
Akamine, M.5
-
14
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, no. 1-3, pp. 19-41, 2000.
-
(2000)
Digital Signal Processing
, vol.10
, Issue.1-3
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
15
-
-
0002992867
-
The 1996 broadcast news speech and language-model corpus
-
D. Graff, "The 1996 broadcast news speech and language-model corpus," in Proc. 1997 DARPA Speech Recognition Workshop, 1997, pp. 11-14.
-
(1997)
Proc. 1997 DARPA Speech Recognition Workshop
, pp. 11-14
-
-
Graff, D.1
-
16
-
-
0003064806
-
1997 broadcast news benchmark test results: English and non-English
-
D. S. Pallett, J. G. Fiscus, A. Martin, and M. A. Przybocki, "1997 broadcast news benchmark test results: English and non-English," in Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop, 1998, pp. 5-11.
-
(1998)
Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop
, pp. 5-11
-
-
Pallett, D.S.1
Fiscus, J.G.2
Martin, A.3
Przybocki, M.A.4
-
17
-
-
33745219648
-
The development of the Cambridge University RT-04 diarisation system
-
S. E. Tranter, M. J. F. Gales, R. Sinha, S. Umesh, and P. C. Woodland, "The development of the Cambridge University RT-04 diarisation system," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), 2004.
-
(2004)
Proc. Fall 2004 Rich Transcription Workshop (RT-04)
-
-
Tranter, S.E.1
Gales, M.J.F.2
Sinha, R.3
Umesh, S.4
Woodland, P.C.5
-
18
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
19
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, pp. 75-98
-
-
Gales, M.1
-
20
-
-
84055222005
-
Context-dependent pretrained deep neural networks for large-vocabulary speech recognition
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
21
-
-
84893712779
-
-
D. Johnson, "Quicknet," www1.icsi.berkeley.edu/Speech/qn.html.
-
Quicknet
-
-
Johnson, D.1
-
22
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011, pp. 24-29.
-
(2011)
Proc. ASRU
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4