SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn 2015-January, Issue , 2015, Pages 2872-2876

I-vector estimation using informative priors for adaptation of deep neural networks

(3) Karanasou, Penny a Gales, Mark a Woodland, Philip a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

Deep neural networks; I vectors; Prior information; Speaker adaptation

Indexed keywords

SPEECH COMMUNICATION; SPEECH RECOGNITION; VECTORS;

DEEP NEURAL NETWORKS; I VECTORS; INFORMATIVE PRIORS; LOW-DIMENSIONAL REPRESENTATION; PRIOR INFORMATION; RELATIVE REDUCTION; SPEAKER ADAPTATION; STATE OF THE ART;

VECTOR SPACES;

EID: 84959162419 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (22)

1
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors, " in Proc. ASRU, 2013, pp. 55-59.
- (2013) Proc. ASRU , pp. 55-59
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

2
- 84905259138
- Improving DNN speaker independence with i-vector inputs
- A. Senior and I. Lopez-Moreno, "Improving DNN speaker independence with i-vector inputs, " in Proc. ICASSP, 2014, pp. 225-229.
- (2014) Proc. ICASSP , pp. 225-229
- Senior, A.¹ Lopez-Moreno, I.²

3
- 84910068089
- Adaptation of deep neural network acoustic models using factorised i-vectors
- P. Karanasou, Y. Wang, M. J. F. Gales, and P. C. Woodland, "Adaptation of deep neural network acoustic models using factorised i-vectors, " in Proc. Interspeech, 2014, pp. 2180-2184.
- (2014) Proc. Interspeech , pp. 2180-2184
- Karanasou, P.¹ Wang, Y.² Gales, M.J.F.³ Woodland, P.C.⁴

4
- 84905225575
- IVector-based acoustic data selection
- O. Siohan and M. Bacchiani, "iVector-based acoustic data selection, " in Proc. Interspeech, 2013, pp. 657-661.
- (2013) Proc. Interspeech , pp. 657-661
- Siohan, O.¹ Bacchiani, M.²

5
- 84865718184
- I-Vector based speaker recognition on short utterances
- A. Kanagasundaram, R. Vogt, D. Dean, S. Sridharan, and M. Mason, "i-Vector based speaker recognition on short utterances, " in Proc. Interspeech, 2011, pp. 2341-2344.
- (2011) Proc. Interspeech , pp. 2341-2344
- Kanagasundaram, A.¹ Vogt, R.² Dean, D.³ Sridharan, S.⁴ Mason, M.⁵

6
- 33947637189
- Technical Report CRIM-06/08-14
- P. Kenny, "Joint factor analysis of speaker and session variability: Theory and algorithms, " in Technical Report CRIM-06/08-14, 2006.
- (2006) Joint Factor Analysis of Speaker and Session Variability: Theory and Algorithms
- Kenny, P.¹

7
- 80051652767
- Bayesian speaker verification with heavy-tailed priors
- -, "Bayesian speaker verification with heavy-tailed priors, " in Proc. Odyssey-10, 2010.
- (2010) Proc. Odyssey-10
- Kenny, P.¹

8
- 84865733857
- Analysis of i-vector length normalization in speaker recognition systems
- D. Garcia-Romero and C. Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems, " in Proc. Interspeech, 2011, pp. 249-252.
- (2011) Proc. Interspeech , pp. 249-252
- Garcia-Romero, D.¹ Espy-Wilson, C.Y.²

9
- 84910028543
- Modified-prior i-vector estimation for language identification of short duration utterances
- R. Travadi, M. V. Segbroeck, and S. Narayanan, "Modified-prior i-vector estimation for language identification of short duration utterances, " in Proc. Interspeech, 2014, pp. 3037-3041.
- (2014) Proc. Interspeech , pp. 3037-3041
- Travadi, R.¹ Segbroeck, M.V.² Narayanan, S.³

10
- 85135187845
- Transformation smoothing for speaker and environmental adaptation
- M. J. F. Gales, "Transformation smoothing for speaker and environmental adaptation, " in Proc. Eurospeech, 1997.
- (1997) Proc. Eurospeech
- Gales, M.J.F.¹

11
- 79959841091
- Prior information for rapid speaker adaptation
- C. Breslin, K. Chin, M. Gales, K. Knill, and H. Xu, "Prior information for rapid speaker adaptation, " in Proc. Interspeech, 2010, pp. 1644-1647.
- (2010) Proc. Interspeech , pp. 1644-1647
- Breslin, C.¹ Chin, K.² Gales, M.³ Knill, K.⁴ Xu, H.⁵

12
- 84878397811
- Exploring rich expressive information from audiobook data using cluster adaptive training
- L. Chen, M. J. F. Gales, V. Wan, J. Latorre, and M. Akamine, "Exploring rich expressive information from audiobook data using cluster adaptive training, " in Proc. Interspeech, 2012, pp. 959-962.
- (2012) Proc. Interspeech , pp. 959-962
- Chen, L.¹ Gales, M.J.F.² Wan, V.³ Latorre, J.⁴ Akamine, M.⁵

13
- 0034227757
- Cluster adaptive training of hidden Markov models
- M. J. F. Gales, "Cluster adaptive training of hidden Markov models, " IEEE Transactions on Speech and Audio Processing, vol. 8, pp. 417-428, 1999.
- (1999) IEEE Transactions on Speech and Audio Processing , vol.8 , pp. 417-428
- Gales, M.J.F.¹

14
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models, " Digital Signal Processing, vol. 10, no. 1-3, pp. 19-41, 2000.
- (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

15
- 0002992867
- The 1996 broadcast news speech and language-model corpus
- D. Graff, "The 1996 broadcast news speech and language-model corpus, " in Proc. 1997 DARPA Speech Recognition Workshop, 1997, pp. 11-14.
- (1997) Proc. 1997 DARPA Speech Recognition Workshop , pp. 11-14
- Graff, D.¹

16
- 0003064806
- 1997 broadcast news benchmark test results: English and non-english
- D. S. Pallett, J. G. Fiscus, A. Martin, and M. A. Przybocki, "1997 broadcast news benchmark test results: English and non-english, " in Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop, 1998, pp. 5-11.
- (1998) Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop , pp. 5-11
- Pallett, D.S.¹ Fiscus, J.G.² Martin, A.³ Przybocki, M.A.⁴

17
- 33745219648
- The development of the Cambridge University RT-04 diarisation system
- S. E. Tranter, M. J. F. Gales, R. Sinha, S. Umesh, and P. C. Woodland, "The development of the Cambridge University RT-04 diarisation system, " in Proc. Fall 2004 Rich Transcription Workshop (RT-04), 2004.
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Tranter, S.E.¹ Gales, M.J.F.² Sinha, R.³ Umesh, S.⁴ Woodland, P.C.⁵

18
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition, " IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012.
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

19
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition, " Computer Speech and Language, vol. 12, no. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.75-98
- Gales, M.¹

20
- 84055222005
- Context-dependent pretrained deep neural networks for large-vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large-vocabulary speech recognition, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

21
- 84893712779
- D. Johnson, "Quicknet, " www1. icsi. berkeley. edu/Speech/qn. html.
- Quicknet
- Johnson, D.¹

22
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. ASRU, 2011, pp. 24-29.
- (2011) Proc. ASRU , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.