SCOPUS 정보 검색 플랫폼

IEEE/ACM Transactions on Audio Speech and Language Processing

Volumn 25, Issue 1, 2017, Pages 60-71

Bayesian unsupervised batch and online speaker adaptation of activation function parameters in deep models for automatic speech recognition

(3) Huang, Zhen a Siniscalchi, Sabato Marco b,c Lee, Chin Hui a

a GEORGIA INSTITUTE OF TECHNOLOGY (United States)

b KORE UNIVERSITY OF ENNA (Italy)

c GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Automatic speech recognition; Bayesian learning; deep neural networks; online adaptation; prior evolution; transfer learning; unsupervised speaker adaptation

Indexed keywords

BAYESIAN NETWORKS; BENCHMARKING; CHEMICAL ACTIVATION; COMPUTATIONAL EFFICIENCY; DIGITAL STORAGE; HIDDEN MARKOV MODELS; LEARNING ALGORITHMS; MARKOV PROCESSES; MAXIMUM LIKELIHOOD; MAXIMUM LIKELIHOOD ESTIMATION;

AUTOMATIC SPEECH RECOGNITION; BAYESIAN LEARNING; DEEP NEURAL NETWORKS; ON-LINE ADAPTATION; PRIOR EVOLUTION; TRANSFER LEARNING; UNSUPERVISED SPEAKER ADAPTATION;

SPEECH RECOGNITION;

EID: 85002900398 PISSN: 23299290 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2016.2621669 Document Type: Article

Times cited : (16)

References (60)

1
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- Nov
- G. Hinton et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹

2
- 0003573244
- Norwell, MA, USA: Kluwer
- H. Bourlard and N. Morgan, Connectionist Speech Recognition: A Hybrid Approach. Norwell, MA, USA: Kluwer, 1994.
- (1994) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

3
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb
- L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.¹

4
- 0032140546
- On stochastic feature and model compensation approaches to robust speech recognition
- C.-H. Lee, "On stochastic feature and model compensation approaches to robust speech recognition," Speech Commun., vol. 25, no. 1-3, pp. 29-47, 1998.
- (1998) Speech Commun. , vol.25 , Issue.1-3 , pp. 29-47
- Lee, C.-H.¹

5
- 84862931515
- Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
- Mar
- S. M. Siniscalchi, D.-C. Lyu, T. Svendsen, and C.-H. Lee, "Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 875-887, Mar. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.3 , pp. 875-887
- Siniscalchi, S.M.¹ Lyu, D.-C.² Svendsen, T.³ Lee, C.-H.⁴

6
- 84890542079
- KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
- D. Yu, K. Yao, H. Su, G. Li, and F. Seide, "KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 7893-7897.
- (2013) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 7893-7897
- Yu, D.¹ Yao, K.² Su, H.³ Li, G.⁴ Seide, F.⁵

7
- 84890452886
- Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
- O. Abdel-Hamid and H. Jiang, "Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 7942-7946.
- (2013) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 7942-7946
- Abdel-Hamid, O.¹ Jiang, H.²

8
- 84938690750
- Speaker adaptation of deep neural networks using a hierarchy of output layers
- R. Price, K.-I. Iso, and K. Shinoda, "Speaker adaptation of deep neural networks using a hierarchy of output layers," in Proc. Spoken Lang. Technol. Workshop, 2014, pp. 153-158.
- (2014) Proc. Spoken Lang. Technol. Workshop , pp. 153-158
- Price, R.¹ Iso, K.-I.² Shinoda, K.³

9
- 84959169347
- Rapid adaptation for deep neural networks through multi-task learning
- Z. Huang, J. Li, S.M. Siniscalchi, I.-F. Chen, J. Wu, and C.-H. Lee, "Rapid adaptation for deep neural networks through multi-task learning," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2015, pp. 3625-3629.
- (2015) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 3625-3629
- Huang, Z.¹ Li, J.² Siniscalchi, S.M.³ Chen, I.-F.⁴ Wu, J.⁵ Lee, C.-H.⁶

10
- 84976435936
- Learning hidden unit contributions for unsupervised acoustic model adaptation
- Aug
- P. Swietojanski, J. Li, and S. Renals, "Learning hidden unit contributions for unsupervised acoustic model adaptation," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 24, no. 8, pp. 1450-1463, Aug. 2016.
- (2016) IEEE/ACM Trans. Audio, Speech, Lang. Process. , vol.24 , Issue.8 , pp. 1450-1463
- Swietojanski, P.¹ Li, J.² Renals, S.³

11
- 84858976070
- Feature engineering in contextdependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in contextdependent deep neural networks for conversational speech transcription," in Proc. IEEE Workshop Automat. Speech Recogn. Understanding, 2011, pp. 24-29.
- (2011) Proc. IEEE Workshop Automat. Speech Recogn. Understanding , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

12
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors," in Proc. IEEE Workshop Autom. Speech Recogn. Understanding, 2013, pp. 55-59.
- (2013) Proc. IEEE Workshop Autom. Speech Recogn. Understanding , pp. 55-59
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

13
- 84881054791
- Hermitian polynomial for speaker adaptation of connectionist speech recognition systems
- Oct
- S. M. Siniscalchi, J. Li, and C.-H. Lee, "Hermitian polynomial for speaker adaptation of connectionist speech recognition systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 10, pp. 2152-2161, Oct. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.10 , pp. 2152-2161
- Siniscalchi, S.M.¹ Li, J.² Lee, C.-H.³

14
- 84905262902
- Factorized adaptation for deep neural network
- J. Li, J.-T. Huang, and Y. Gong, "Factorized adaptation for deep neural network," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2014, pp. 5537-5541.
- (2014) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 5537-5541
- Li, J.¹ Huang, J.-T.² Gong, Y.³

15
- 84946054484
- Multi-basis adaptive neural network for rapid adaptation in speech recognition
- C. Wu and M. Gales, "Multi-basis adaptive neural network for rapid adaptation in speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2015, pp. 4315-4319.
- (2015) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 4315-4319
- Wu, C.¹ Gales, M.²

16
- 84929376602
- Bounded conditional mean imputation with observation uncertainties and acoustic model adaptation
- U. Remes, A. R. López, and D. Palomäki, "Bounded conditional mean imputation with observation uncertainties and acoustic model adaptation," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 23, no. 7, pp. 1198-1208, 2015.
- (2015) IEEE/ACM Trans. Audio, Speech, Lang. Process. , vol.23 , Issue.7 , pp. 1198-1208
- Remes, U.¹ López, A.R.² Palomäki, D.³

17
- 84991404490
- Factorized hidden layer adaptation for deep neural network based acoustic modeling
- Dec
- L. Samarakoon and K. C. Sim, "Factorized hidden layer adaptation for deep neural network based acoustic modeling," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 24, no. 12, pp. 2241-2250, Dec. 2016.
- (2016) IEEE/ACM Trans. Audio, Speech, Lang. Process. , vol.24 , Issue.12 , pp. 2241-2250
- Samarakoon, L.¹ Sim, K.C.²

18
- 84986193646
- Differentiable pooling for unsupervised acoustic model adaptation
- Oct
- P. Swietojanski and S. Renals, "Differentiable pooling for unsupervised acoustic model adaptation," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 24, no. 10, pp. 1773-1784, Oct. 2016.
- (2016) IEEE/ACM Trans. Audio, Speech, Lang. Process. , vol.24 , Issue.10 , pp. 1773-1784
- Swietojanski, P.¹ Renals, S.²

19
- 0027683813
- Shared-distribution hidden Markov models for speech recognition
- Oct
- M.-Y. Hwang and X. Huang, "Shared-distribution hidden Markov models for speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 4, pp. 414-420, Oct. 1993.
- (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.4 , pp. 414-420
- Hwang, M.-Y.¹ Huang, X.²

20
- 0032923221
- Catastrophic forgetting in connectionist networks: Causes, consequences and solutions
- R. M. French, "Catastrophic forgetting in connectionist networks: Causes, consequences and solutions," Trends Cogn. Sci., vol. 3, 1994, pp. 128-135.
- (1994) Trends Cogn. Sci. , vol.3 , pp. 128-135
- French, R.M.¹

21
- 34548012893
- Linear hidden transformations for adaptation of hybrid ANN/HMM models
- R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. D. Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models," Speech Commun., vol. 49, no. 10, pp. 827-835, 2007.
- (2007) Speech Commun. , vol.49 , Issue.10 , pp. 827-835
- Gemello, R.¹ Mana, F.² Scanzio, S.³ Laface, P.⁴ Mori, R.D.⁵

22
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

23
- 0035279111
- A structural Bayes approach to speaker adaptation
- Mar
- K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276-287, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 276-287
- Shinoda, K.¹ Lee, C.-H.²

24
- 84959161626
- Maximum a posteriori adaptation of network parameters in deep models
- Z. Huang, S. M. Siniscalchi, I.-F. Chen, J. Li, J. Wu, and C.-H. Lee, "Maximum a posteriori adaptation of network parameters in deep models," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2015, pp. 1076-1080.
- (2015) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 1076-1080
- Huang, Z.¹ Siniscalchi, S.M.² Chen, I.-F.³ Li, J.⁴ Wu, J.⁵ Lee, C.-H.⁶

25
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- Aug
- C.-H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
- Lee, C.-H.¹ Huo, Q.²

26
- 84876672166
- Machine learning paradigms for speech recognition: An overview
- May
- L. Deng and X. Li, "Machine learning paradigms for speech recognition: An overview," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 5, pp. 1060-1089, May 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.5 , pp. 1060-1089
- Deng, L.¹ Li, X.²

27
- 85141919230
- Unsupervised word sense disambiguation rivaling supervised methods
- 198-196
- D. Yarowsky, "Unsupervised word sense disambiguation rivaling supervised methods," in Proc. 33rd Annu. Meeting Assoc. Comput. Linguistics, 1995, pp. 198-196.
- (1995) Proc. 33rd Annu. Meeting Assoc. Comput. Linguistics
- Yarowsky, D.¹

28
- 33749625988
- Philadelphia, PA, USA: Linguistic Data Consortium
- J. J. Godfrey and E. Holliman, "Switchboard-1 release 2," Philadelphia, PA, USA: Linguistic Data Consortium, 1997.
- (1997) Switchboard-1 Release 2
- Godfrey, J.J.¹ Holliman, E.²

29
- 70349213445
- Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
- B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2009, pp. 3761-3764.
- (2009) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 3761-3764
- Kingsbury, B.¹

30
- 84906274730
- Sequence-discriminative training of deep neural networks
- K. Veseý, A. Ghoshal, L. Burget, and D. Povey, "Sequence-discriminative training of deep neural networks," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2013, pp. 2345-2349.
- (2013) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 2345-2349
- Veseý, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

31
- 84994372786
- Compact feedforward sequential memory networks for large vocabulary continuous speech recognition
- Sep
- S. Zhang, H. Jiang, S. Xiong, S. Wei and L. Dai, "Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition," in Proc. Interspeech, Sep. 2016, pp. 3389-3393.
- (2016) Proc. Interspeech , pp. 3389-3393
- Zhang, S.¹ Jiang, H.² Xiong, S.³ Wei, S.⁴ Dai, L.⁵

32
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Comput., vol. 9, pp. 1735-1780, 1997.
- (1997) Neural Comput. , vol.9 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

33
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A. Mohamed, and G. E. Hinton, "Speech recognition with deep recurrent neural networks," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 6645-6649.
- (2013) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 6645-6649
- Graves, A.¹ Mohamed, A.² Hinton, G.E.³

34
- 84910046405
- Long short-term memory recurrent neural network architectures for large scale acoustic modeling
- H. Sak, A. Senior, and F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2014, pp. 338-342.
- (2014) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 338-342
- Sak, H.¹ Senior, A.² Beaufays, F.³

35
- 84890525984
- Deep convolutional neural networks for LVCSR
- T. N. Sainath, A. M. B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 8614-8618.
- (2013) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 8614-8618
- Sainath, T.N.¹ Kingsbury, A.M.B.² Ramabhadran, B.³

36
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- Sep.
- V. V. Digalakis, D. Rtischev, and L. G. Neumeye, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no 4, 357-366, Sep. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.4 , pp. 357-366
- Digalakis, V.V.¹ Rtischev, D.² Neumeye, L.G.³

37
- 0032050110
- Maximum likelihood linear transformations for HMMbased speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition," Comput., Speech, Lang., vol. 12, pp. 75-98, 1998.
- (1998) Comput., Speech, Lang. , vol.12 , pp. 75-98
- Gales, M.J.F.¹

38
- 79951609039
- Front-end factor analysis for speaker verification
- May
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 788-798, May 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

39
- 84938688160
- Speaker adaptive training of deep neural network acoustic models using i-vectors
- Nov
- Y. Miao, H. Zhang, and F. Metze, "Speaker adaptive training of deep neural network acoustic models using i-vectors," IEEE/ACMTrans. Audio, Speech, Lang. Process., vol. 23, no. 11, pp. 1938-1949, Nov. 2015.
- (2015) IEEE/ACMTrans. Audio, Speech, Lang. Process. , vol.23 , Issue.11 , pp. 1938-1949
- Miao, Y.¹ Zhang, H.² Metze, F.³

40
- 84921731072
- Fast adaptation of deep neural network based on discriminant codes for speech recognition
- Dec
- S. Xue, O. Abdel-Hamid, H. Jiang, L. Dai, and Q. Liu, "Fast adaptation of deep neural network based on discriminant codes for speech recognition," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 22, no. 12, pp. 1713-1725, Dec. 2014.
- (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process. , vol.22 , Issue.12 , pp. 1713-1725
- Xue, S.¹ Abdel-Hamid, O.² Jiang, H.³ Dai, L.⁴ Liu, Q.⁵

41
- 80051654263
- Deep belief networks using discriminative features for phone recognition
- A. Mohamed, T. Sainath, G. D. B. Ramabhadran, G. E. Hinton, and M. Picheny, "Deep belief networks using discriminative features for phone recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2011, pp. 5060-5063.
- (2011) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 5060-5063
- Mohamed, A.¹ Sainath, T.² Ramabhadran, G.D.B.³ Hinton, G.E.⁴ Picheny, M.⁵

42
- 84937854847
- Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system
- J. Neto et al., "Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system," in Proc. Eur. Conf. SpeechCommun. Technol., 1995, pp. 2171-2174.
- (1995) Proc. Eur. Conf. SpeechCommun. Technol , pp. 2171-2174
- Neto, J.¹

43
- 84874226579
- Adaptation of context-dependent deep neural networks for automatic speech recognition
- K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition," in Proc. Spoken Lang. Technol. Workshop, 2012, pp. 366-369.
- (2012) Proc. Spoken Lang. Technol. Workshop , pp. 366-369
- Yao, K.¹ Yu, D.² Seide, F.³ Su, H.⁴ Deng, L.⁵ Gong, Y.⁶

44
- 79959849500
- Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems
- B. Li and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2010, pp. 526-529.
- (2010) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 526-529
- Li, B.¹ Sim, K.C.²

45
- 84912109599
- Speaker adaptation of hybrid NN/HMM model for speech recognition based on singular value decomposition
- S. Xue, H. Jiang, and L. Dai, "Speaker adaptation of hybrid NN/HMM model for speech recognition based on singular value decomposition," in Proc. 9th Int. Symp. Chin. Spoken Lang. Process., 2014, pp. 1-5.
- (2014) Proc. 9th Int. Symp. Chin. Spoken Lang. Process , pp. 1-5
- Xue, S.¹ Jiang, H.² Dai, L.³

46
- 84905229915
- Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network
- J. Xue, J. Li, D. Yu, M. Seltzer, and Y. Gong, "Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2014, pp. 6359-6363.
- (2014) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 6359-6363
- Xue, J.¹ Li, J.² Yu, D.³ Seltzer, M.⁴ Gong, Y.⁵

47
- 84906227589
- Restructuring of deep neural network acoustic models with singular value decomposition
- J. Xue, J. Li, and Y. Gong, "Restructuring of deep neural network acoustic models with singular value decomposition," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2013, pp. 2365-2369.
- (2013) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 2365-2369
- Xue, J.¹ Li, J.² Gong, Y.³

48
- 84946061232
- Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data
- Y. Zhao, J. Li, J. Xue, and Y. Gong, "Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2015, pp. 4310-4314.
- (2015) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 4310-4314
- Zhao, Y.¹ Li, J.² Xue, J.³ Gong, Y.⁴

49
- 84973321190
- DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions
- C. Zhang and P. C. Woodland, "DNN speaker adaptation using parameterised sigmoid and ReLU hidden activation functions," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2016, pp. 5300-5304.
- (2016) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 5300-5304
- Zhang, C.¹ Woodland, P.C.²

50
- 84858984756
- IVectorbased discriminative adaptation for automatic speech recognition
- M. Karafiát, L. Burget, P. Matejka, O. Glembek, and J. Cernocky, "iVectorbased discriminative adaptation for automatic speech recognition," in Proc. IEEE Workshop Automat. Speech Recogn. Understanding, 2011, pp. 152-157.
- (2011) Proc. IEEE Workshop Automat. Speech Recogn. Understanding , pp. 152-157
- Karafiát, M.¹ Burget, L.² Matejka, P.³ Glembek, O.⁴ Cernocky, J.⁵

51
- 84910068089
- Adaptation of deep neural network acoustic models using factorised i-vectors
- P. Karanasou, Y. Wang, M. Gales, and P. Woodland, "Adaptation of deep neural network acoustic models using factorised i-vectors," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2014, pp. 2180-2184.
- (2014) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 2180-2184
- Karanasou, P.¹ Wang, Y.² Gales, M.³ Woodland, P.⁴

52
- 0003413187
- New York, NY, USA: Macmillan
- S. Haykin, Neural Networks: A Comprehensive Foundation. New York, NY, USA: Macmillan, 1994.
- (1994) Neural Networks: A Comprehensive Foundation
- Haykin, S.¹

53
- 84871614543
- A novel loss function for the overall risk criterion based discriminative training of HMMmodels
- J. Kaiser, B. Horvat, and Z. Kacic, "A novel loss function for the overall risk criterion based discriminative training of HMMmodels," in Proc. 6th Int. Conf. Spoken Lang. Process., 2000, pp. 887-890.
- (2000) Proc. 6th Int. Conf. Spoken Lang. Process , pp. 887-890
- Kaiser, J.¹ Horvat, B.² Kacic, Z.³

54
- 44949182698
- Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition
- M. Gibson and T. Hain, "Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2006, pp. 2406-2409.
- (2006) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 2406-2409
- Gibson, M.¹ Hain, T.²

55
- 33746600649
- Reducing the dimensionality of data with neural networks
- G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, pp. 504-507, 2006.
- (2006) Science , vol.313 , pp. 504-507
- Hinton, G.E.¹ Salakhutdinov, R.R.²

56
- 0002533801
- The empirical Bayes approach to statistical decision problems
- H. Robbins, "The empirical Bayes approach to statistical decision problems," Ann. Math. Statist., vol. 35, no. 1, 1964.
- (1964) Ann. Math. Statist. , vol.35 , Issue.1
- Robbins, H.¹

57
- 0003759417
- New York, NY, USA: McGraw-Hill
- M. H. DeGroot, Optimal Statistical Decisions. New York, NY, USA: McGraw-Hill, 1970.
- (1970) Optimal Statistical Decisions
- DeGroot, M.H.¹

58
- 84858953642
- The Kaldi speech recognition toolkit
- D. Povey et al., "The Kaldi speech recognition toolkit," in Proc. IEEE Workshop Automat. Speech Recogn. Understanding, 2011. [Online]. Availabel: http://kaldi-asr.org/doc/about.html
- (2011) Proc. IEEE Workshop Automat. Speech Recogn. Understanding
- Povey, D.¹

59
- 84910084579
- 2000 NIST evaluation of conversational speech recognition over the telephone: English and Mandarin performance results
- J. Fiscus, W. M. Fisher, A. F. Martin, M. A. Przybocki, and D. S. Pallett, "2000 NIST evaluation of conversational speech recognition over the telephone: English and Mandarin performance results," in Proc. Speech Transcription Workshop, 2000, pp. 1-5.
- (2000) Proc. Speech Transcription Workshop , pp. 1-5
- Fiscus, J.¹ Fisher, W.M.² Martin, A.F.³ Przybocki, M.A.⁴ Pallett, D.S.⁵

60
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Annu. Conf. Int. Speech Commun. Assoc., 2011, pp. 437-440.
- (2011) Proc. Annu. Conf. Int. Speech Commun. Assoc , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.