SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume 2015-August, 2015, Pages 4610-4613

An investigation of augmenting speaker representations to improve speaker normalisation for DNN-based speech recognition

Authors (2): Huang, Hengguan a; Sim, Khe Chai a

a NATIONAL UNIVERSITY OF SINGAPORE (Singapore)

Author keywords

augmented speaker representation; deep neural network; speaker normalisation; speech recognition

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; LOUDSPEAKERS; SPEECH; SPEECH COMMUNICATION; VECTOR SPACES;

AUGMENTED SPEAKER REPRESENTATION; AUTOMATIC SPEECH RECOGNITION; BOTTLENECK FEATURES; FINE GRAINS; PERFORMANCE GAIN; POSTERIOR PROBABILITY; SPEAKER INDEPENDENTS; SPEAKER NORMALISATION;

SPEECH RECOGNITION;

EID: 84946035423 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178844 Document Type: Conference Paper

Times cited: 53

References (12)

1
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury, Deep neural networks for acoustic modeling in speech recognition, IEEE Signal Processing Magazine, vol. 29, pp. 82-97, 2012
- (2012) IEEE Signal Processing Magazine , vol.29 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kingsbury, B.¹¹

2
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, Speaker adaptation of neural network acoustic models using i-vectors, in ASRU, 2013
- (2013) ASRU
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

3
- 84910031119
- Towards speaker adaptive training of deep neural network acoustic models
- Y. Miao, H. Zhang, and F. Metze, Towards speaker adaptive training of deep neural network acoustic models, in Proc. Interspeech, 2014
- (2014) Proc. Interspeech
- Miao, Y.¹ Zhang, H.² Metze, F.³

4
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, Speaker and session variability in GMM-based speaker verification, IEEE Transactions on Audio, Speech & Language Processing, vol. 15, no. 4, pp. 1448-1460, 2007
- (2007) IEEE Transactions on Audio, Speech & Language Processing , vol.15 , Issue.4 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

5
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, Computer Speech & Language, vol. 9, no. 2, 1995
- (1995) Computer Speech & Language , vol.9 , Issue.2
- Leggetter, C.J.¹ Woodland, P.C.²

6
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, Maximum likelihood linear transformations for HMM-based speech recognition, Computer Speech & Language, vol. 12, 1998
- (1998) Computer Speech & Language , vol.12
- Gales, M.J.F.¹

7
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- Frank Seide, Gang Li, Xie Chen, and Dong Yu, Feature engineering in context-dependent deep neural networks for conversational speech transcription, in ASRU, 2011, pp. 24-29
- (2011) ASRU , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

8
- 84921731072
- Speaker and session variability in GMM-based speaker verification
- Shaofei Xue, O. Abdel-Hamid, Hui Jiang, Lirong Dai, and Qingfeng Liu, Speaker and session variability in GMM-based speaker verification, IEEE/ACM Transactions on Audio, Speech & Language Processing, vol. 22, no. 12, pp. 1713-1725, 2014
- (2014) IEEE/ACM Transactions on Audio, Speech & Language Processing , vol.22 , Issue.12 , pp. 1713-1725
- Xue, S.¹ Abdel-Hamid, O.² Jiang, H.³ Dai, L.⁴ Liu, Q.⁵

9
- 84890509526
- MLP-based factor analysis for tandem speech recognition
- M. Ferras and H. Bourlard, MLP-based factor analysis for tandem speech recognition, in IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, IEEE
- (2013) IEEE International Conference on Acoustics, Speech and Signal Processing
- Ferras, M.¹ Bourlard, H.²

10
- 33745220290
- Modeling intra-speaker variability for speaker recognition
- Hagai Aronowitz, Dror Irony, and David Burstein, Modeling intra-speaker variability for speaker recognition, in Proc. Interspeech, 2005
- (2005) Proc. Interspeech
- Aronowitz, H.¹ Irony, D.² Burstein, D.³

11
- 84901456587
- Bottleneck features for speaker recognition
- Sibel Yaman, Jason Pelecanos, and Ruhi Sarikaya, Bottleneck features for speaker recognition, in Proc. Odyssey, 2012, vol. 12
- (2012) Proc. Odyssey , vol.12
- Yaman, S.¹ Pelecanos, J.² Sarikaya, R.³

12
- 85079095310
- The design for the Wall Street Journal-based CSR corpus
- D. B. Paul and J. M. Baker, The design for the Wall Street Journal-based CSR corpus, in Proc. ICSLP, 1992
- (1992) Proc. ICSLP
- Paul, D.B.¹ Baker, J.M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.