SCOPUS 정보 검색 플랫폼

2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Volumn , Issue , 2016, Pages 639-646

Cambridge university transcription systems for the multi-genre broadcast challenge

(8) Woodland, P C a Liu, X a Qian, Y a Zhang, C a Gales, M J F a Karanasou, P a Lanchantin, P a Wang, L a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

broadcast transcription; deep neural networks; HTK; Kaldi; Speech recognition

Indexed keywords

COMPUTATIONAL LINGUISTICS; RECURRENT NEURAL NETWORKS; TRANSCRIPTION;

ACTIVATION FUNCTIONS; CAMBRIDGE UNIVERSITY; COMBINED SYSTEM; DEEP NEURAL NETWORKS; JOINT DECODING; KALDI; LANGUAGE MODEL; SEGMENTATION SYSTEM;

SPEECH RECOGNITION;

EID: 84964475976 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2015.7404856 Document Type: Conference Paper

Times cited : (31)

References (55)

1
- 84964518874
- http://htk.eng.cam.ac.uk

2
- 84867605836
- Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition
- Kyoto
- O. Abdel-Hamid, A. Mohamed, H. Jiang, &G. Penn, "Applying convolutional neural networks concepts to hybrid NNHMM model for speech recognition", Proc. ICASSP, Kyoto, 2012
- (2012) Proc. ICASSP
- Abdel-Hamid, O.¹ Mohamed, A.² Jiang, H.³ Penn, G.⁴

3
- 0030362995
- A compact model for speaker adaptive training
- Philadelphia
- T. Anastasakos, J. McDonough, R. Schwartz, &J. Makhoul, "A compact model for speaker adaptive training", Proc. ICSLP, Philadelphia, 1996
- (1996) Proc. ICSLP
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

4
- 85010742974
- The MGB challenge: Evaluating multi-genre broadcast media transcription
- Scottsdale
- P. Bell, M.J.F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M.Wester &P.C.Woodland. "The MGB challenge: Evaluating multi-genre broadcast media transcription", Proc. ASRU Workshop, Scottsdale, 2015
- (2015) Proc. ASRU Workshop
- Bell, P.¹ Gales, M.J.F.² Hain, T.³ Kilgour, J.⁴ Lanchantin, P.⁵ Liu, X.⁶ McParland, A.⁷ Renals, S.⁸ Saz, O.⁹ Wester, M.¹⁰ Woodland, P.C.¹¹

5
- 0028392483
- Learning long-term dependencies with gradient descent is difficult
- Y. Bengio, P. Simard, &P. Frasconi, "Learning long-term dependencies with gradient descent is difficult", IEEE Transactions on Neural Networks, vol. 5, pp. 157-166, 1994
- (1994) IEEE Transactions on Neural Networks , vol.5 , pp. 157-166
- Bengio, Y.¹ Simard, P.² Frasconi, P.³

6
- 41049105254
- Joint-sequence models for graphemeto-phoneme conversion
- M. Bisani &H. Ney, "Joint-sequence models for graphemeto-phoneme conversion, Speech Communication, vol. 50, no. 5, 2008
- (2008) Speech Communication , vol.50 , Issue.5
- Bisani, M.¹ Ney, H.²

7
- 0141607824
- Latent dirichlet allocation
- D.M. Blei, A. Ng, &M.I. Jordan, "Latent Dirichlet allocation", Journal of Machine Learning Research, vol. 3, pp. 99-1022, 2003
- (2003) Journal of Machine Learning Research , vol.3 , pp. 99-1022
- Blei, D.M.¹ Ng, A.² Jordan, M.I.³

8
- 4544253838
- Improving broadcast news transcription by lightly supervised discriminative training
- Montreal
- H.Y. Chan &P.C.Woodland, "Improving broadcast news transcription by lightly supervised discriminative training", Proc. ICASSP, Montreal, 2004
- (2004) Proc. ICASSP
- Chan, H.Y.¹ Woodland, P.C.²

9
- 84910067710
- Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch
- Singapore
- X. Chen, Y. Wang, X. Liu, M.J.F. Gales, &P.C. Woodland, "Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch", Proc. Interspeech, Singapore, 2014
- (2014) Proc. Interspeech
- Chen, X.¹ Wang, Y.² Liu, X.³ Gales, M.J.F.⁴ Woodland, P.C.⁵

10
- 84959155988
- Recurrent neural network language model adaptation for multi-genre broadcast speech recognition
- Dresden
- X. Chen, T. Tan, X. Liu, P. Lanchantin, M.J.F. Gales &P.C. Woodland, "Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition", Proc. Interspeech, Dresden, 2015
- (2015) Proc. Interspeech
- Chen, X.¹ Tan, T.² Liu, X.³ Lanchantin, P.⁴ Gales, M.J.F.⁵ Woodland, P.C.⁶

11
- 4544253834
- Posterior probability decoding, confidence estimation and system combination
- College Park, MD
- G. Evermann &P.C. Woodland, "Posterior probability decoding, confidence estimation and system combination", Proc. Speech Transcription Workshop, College Park, MD, 2000
- (2000) Proc. Speech Transcription Workshop
- Evermann, G.¹ Woodland, P.C.²

12
- 0030638031
- A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (ROVER)
- Santa Barbara
- J. Fiscus, "A post-processing system to yield reduced word error rates: recogniser output voting error reduction (ROVER), iProc. ASRU Workshop, Santa Barbara, 1997
- (1997) IProc. ASRU Workshop
- Fiscus, J.¹

13
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition", Computer Speech and Langauge, vol. 12, pp. 75-98, 1997
- (1997) Computer Speech and Langauge , vol.12 , pp. 75-98
- Gales, M.J.F.¹

14
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- M.J.F. Gales, "Semi-tied covariance matrices for hidden Markov models", IEEE Transactions on Speech and Audio Processing, vol. 7, no. 3, pp. 272-281, 1999
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.3 , pp. 272-281
- Gales, M.J.F.¹

15
- 34047266379
- Progress in the CU-HTK broadcast news transcription system
- M.J.F. Gales, D.Y. Kim, P.C. Woodland, H.Y. Chan, D. Mrva, R. Sinha, &S.E. Tranter, "Progress in the CU-HTK broadcast news transcription system", IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 5, pp. 1513-1525, 2006
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.5 , pp. 1513-1525
- Gales, M.J.F.¹ Kim, D.Y.² Woodland, P.C.³ Chan, H.Y.⁴ Mrva, D.⁵ Sinha, R.⁶ Tranter, S.E.⁷

16
- 0036567851
- The LIMSI broadcast news transcription system
- J.L. Gauvain, L. Lamel, &G. Adda "The LIMSI broadcast news transcription system" Speech communication, vol. 37, no. 1, pp. 89-108, 2002
- (2002) Speech Communication , vol.37 , Issue.1 , pp. 89-108
- Gauvain, J.L.¹ Lamel, L.² Adda, G.³

17
- 84905252790
- A pitch extraction algorithm tuned for automatic speech recognition
- Florence
- P. Ghahremani, B. BabaAli, D. Povey, K. Riedhammer, J. Trmal, &S. Khudanpur, "A pitch extraction algorithm tuned for automatic speech recognition", Proc. ICASSP, Florence, 2014
- (2014) Proc. ICASSP
- Ghahremani, P.¹ Babaali, B.² Povey, D.³ Riedhammer, K.⁴ Trmal, J.⁵ Khudanpur, S.⁶

18
- 51449103447
- Optimizing bottle-neck features for LVCSR
- Las Vegas
- F. Grezl &P. Fousek, "Optimizing bottle-neck features for LVCSR", Proc. ICASSP, Las Vegas, 2008
- (2008) Proc. ICASSP
- Grezl, F.¹ Fousek, P.²

19
- 78650474133
- Technical Report, UTML TR 2010-003, Department of Computer Science, University of Toronto
- G.E. Hinton, "A Practical Guide to Training Restricted Boltzmann Machines", Technical Report, UTML TR 2010-003, Department of Computer Science, University of Toronto, 2010
- (2010) A Practical Guide to Training Restricted Boltzmann Machines
- Hinton, G.E.¹

20
- 84959162419
- I-vector estimation using informative priors for adaptation of deep neural networks
- Dresden
- P. Karanasou, M.J.F. Gales &P.C. Woodland, "I-vector estimation using informative priors for adaptation of deep neural networks", Proc. Interspeech, Dresden, 2015
- (2015) Proc. Interspeech
- Karanasou, P.¹ Gales, M.J.F.² Woodland, P.C.³

21
- 84964556678
- Speaker diarisation and longitudinal linking in multi-genre broadcast data
- P. Karanasou, M.J.F. Gales, P. Lanchantin, X. Liu, Y. Qian, L. Wang, P.C. Woodland &C. Zhang, "Speaker diarisation and longitudinal linking in multi-genre broadcast data", Proc. ASRU Workshop, Scottsdale, 2015
- (2015) Proc. ASRU Workshop, Scottsdale
- Karanasou, P.¹ Gales, M.J.F.² Lanchantin, P.³ Liu, X.⁴ Qian, Y.⁵ Wang, L.⁶ Woodland, P.C.⁷ Zhang, C.⁸

22
- 70349213445
- Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
- Taipei
- B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, Proc. ICASSP, Taipei, 2009
- (2009) Proc. ICASSP
- Kingsbury, B.¹

23
- 84893668957
- Investigation of multilingual deep neural networks for spoken term detection
- Olomouc
- K.M. Knill, M.J.F. Gales, S.P. Rath, P.C. Woodland, C. Zhang, &S.-X. Zhang, "Investigation of multilingual deep neural networks for spoken term detection", Proc. ASRU Workshop, Olomouc, 2013
- (2013) Proc. ASRU Workshop
- Knill, K.M.¹ Gales, M.J.F.² Rath, S.P.³ Woodland, P.C.⁴ Zhang, C.⁵ Zhang, S.-X.⁶

24
- 0036460908
- Lightly supervised and unsupervised acoustic model training
- L. Lamel, J.L. Gauvain, &G. Adda, "Lightly supervised and unsupervised acoustic model training", Computer Speech &Language, vol. 16, no. 1, pp. 115-129, 2002
- (2002) Computer Speech &Language , vol.16 , Issue.1 , pp. 115-129
- Lamel, L.¹ Gauvain, J.L.² Adda, G.³

25
- 84964513580
- The development of the Cambridge university alignment systems for the multi-genre broadcast challenge
- Scottsdale
- P. Lanchantin, M.J.F. Gales, P. Karanasou, X. Liu, Y. Qian, L. Wang, P.C. Woodland &C. Zhang, "The development of the Cambridge University alignment systems for the Multi-Genre Broadcast challenge", Proc. ASRU Workshop, Scottsdale, 2015
- (2015) Proc. ASRU Workshop
- Lanchantin, P.¹ Gales, M.J.F.² Karanasou, P.³ Liu, X.⁴ Qian, Y.⁵ Wang, L.⁶ Woodland, P.C.⁷ Zhang, C.⁸

26
- 0141703325
- Automatic complexity control for HLDA systems
- Hong Kong
- X. Liu, M.J.F. Gales, &P.C. Woodland, "Automatic complexity control for HLDA systems", Proc. ICASSP, Hong Kong, 2003
- (2003) Proc. ICASSP
- Liu, X.¹ Gales, M.J.F.² Woodland, P.C.³

27
- 84905240726
- Efficient lattice rescoring using recurrent neural network language models
- Florence
- X. Liu, Y. Wang, X. Chen, M.J.F. Gales, &P.C. Woodland, "Efficient lattice rescoring using recurrent neural network language models", Proc. ICASSP, Florence, 2014
- (2014) Proc. ICASSP
- Liu, X.¹ Wang, Y.² Chen, X.³ Gales, M.J.F.⁴ Woodland, P.C.⁵

28
- 84959109976
- The Cambridge university 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation
- Dresden
- X. Liu, F. Flego, L. Wang, C. Zhang, M.J.F. Gales, &P.C. Woodland, "The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation", Proc. Interspeech, Dresden, 2015
- (2015) Proc. Interspeech
- Liu, X.¹ Flego, F.² Wang, L.³ Zhang, C.⁴ Gales, M.J.F.⁵ Woodland, P.C.⁶

29
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- L. Mangu, E. Brill, A. Stolcke, "Finding consensus in speech recognition: word error minimization and other applications of confusion networks", Computer Speech and Language, Vol. 14, No. 4, pp. 373-400, 2000
- (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

30
- 79959829092
- Recurrent neural network based language model
- Makuhari, Japan
- T. Mikolov, M. Karafiat, L. Burget, J. Cernocky, &S. Khudanpur, "Recurrent neural network based language model", Proc. Interspeech, Makuhari, Japan, 2010
- (2010) Proc. Interspeech
- Mikolov, T.¹ Karafiat, M.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

31
- 80051643236
- Extensions of recurrent neural network language model
- Prague
- T. Mikolov, S. Kombrink, L. Burget, J. H. Cernocky, &S. Khudanpur, "Extensions of recurrent neural network language model", Proc. ICASSP, Prague, 2011
- (2011) Proc. ICASSP
- Mikolov, T.¹ Kombrink, S.² Burget, L.³ Cernocky, J.H.⁴ Khudanpur, S.⁵

32
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- Orlando
- D. Povey &P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training", Proc. ICASSP, Orlando, 2002
- (2002) Proc. ICASSP
- Povey, D.¹ Woodland, P.C.²

33
- 84858953642
- The Kaldi speech recognition toolkit
- Hawaii
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M.Hannemann, P. Motlíček, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, &K. Vesely, "The Kaldi speech recognition toolkit", Proc. ASRU Workshop, Hawaii, 2011
- (2011) Proc. ASRU Workshop
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlíček, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsky, J.¹¹ Stemmer, G.¹² Vesely, K.¹³

34
- 70450180978
- Robust LTS rules with the Combilex speech technology lexicon
- Brighton
- K. Richmond, R. Clark &S. Fitt, "Robust LTS rules with the Combilex speech technology lexicon", Proc. Interspeech, Brighton, 2009
- (2009) Proc. Interspeech
- Richmond, K.¹ Clark, R.² Fitt, S.³

35
- 79959836077
- On generating Combilex pronunciations via morphological analysis
- Makuhari, Japan
- K. Richmond, R. Clark &S. Fitt, "On generating Combilex pronunciations via morphological analysis", Proc. Interspeech, Makuhari, Japan, 2010
- (2010) Proc. Interspeech
- Richmond, K.¹ Clark, R.² Fitt, S.³

36
- 84910046405
- Long short-term memory recurrent neural network architectures for large scale acoustic modeling
- Singapore
- H. Sak, A. Senior, &F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling", Proc. Interspeech, Singapore, 2014
- (2014) Proc. Interspeech
- Sak, H.¹ Senior, A.² Beaufays, F.³

37
- 84893688455
- Learning filter banks within a deep neural network framework
- Olomouc
- T. N. Sainath, B. Kingsbury, A. Mohamed, &B. Ramabhadran, "Learning filter banks within a deep neural network framework", Proc. ASRU Workshop, Olomouc, 2013
- (2013) Proc. ASRU Workshop
- Sainath, T.N.¹ Kingsbury, B.² Mohamed, A.³ Ramabhadran, B.⁴

38
- 84946037134
- Convolutional, long short-term memory, fully connected deep neural networks
- Brisbane
- T.N. Sainath, O. Vinyals, A. Senior, &Hasim Sak, "Convolutional, long short-term memory, fully connected deep neural networks", Proc. ICASSP, Brisbane, 2015
- (2015) Proc. ICASSP
- Sainath, T.N.¹ Vinyals, O.² Senior, A.³ Sak, H.⁴

39
- 84890446559
- Feature engineering in context-dependent deep neural networks
- Hawaii
- F. Seide, G. Li, X. Chen, &D. Yu, "Feature engineering in context-dependent deep neural networks", Proc. ASRU Workshop, Hawaii, 2011
- (2011) Proc. ASRU Workshop
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

40
- 84906240855
- Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system
- Lyon
- Y. Si, Q. Zhang, T. Li, J. Pan, &Y. Yan, "Prefix tree based n-best list re-scoring for recurrent neural network language model used in speech recognition system", Proc. Interspeech, Lyon, 2013
- (2013) Proc. Interspeech
- Si, Y.¹ Zhang, Q.² Li, T.³ Pan, J.⁴ Yan, Y.⁵

41
- 33646357306
- The Cambridge University March 2005 speaker diarisation system
- R. Sinha, S.E. Tranter, M.J.F. Gales, &P.C. Woodland, "The Cambridge University March 2005 speaker diarisation system", Proc. Interspeech, 2005
- (2005) Proc. Interspeech
- Sinha, R.¹ Tranter, S.E.² Gales, M.J.F.³ Woodland, P.C.⁴

42
- 84881054791
- Hermitian polynomial for speaker adaptation of connectionist speech recognition systems
- S.M. Siniscalchi, J.-Y. Li, &C.-H. Lee, "Hermitian polynomial for speaker adaptation of connectionist speech recognition systems", IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, pp. 2152-2161, 2013
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , pp. 2152-2161
- Siniscalchi, S.M.¹ Li, J.-Y.² Lee, C.-H.³

43
- 84891308106
- SRILM: An extensible language modeling toolkit
- Denver
- A. Stolcke, "SRILM an extensible language modeling toolkit", Proc. ICSLP, Denver, 2002
- (2002) Proc. ICSLP
- Stolcke, A.¹

44
- 84890492591
- Revisiting hybrid and GMM-HMM system combination techniques
- Vancouver
- P. Swietojanski, A. Ghoshal, &S. Renals, "Revisiting hybrid and GMM-HMM system combination techniques", Proc. ICASSP, Vancouver, 2013
- (2013) Proc. ICASSP
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

45
- 84983119674
- Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models
- Lake Tahoe
- P. Swietojanski &S. Renals, "Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models", Proc. IWSLT, Lake Tahoe, 2014
- (2014) Proc. IWSLT
- Swietojanski, P.¹ Renals, S.²

46
- 84946032695
- Differentiable pooling for unsupervised speaker adaptation
- Brisbane
- P. Swietojanski &S. Renals, "Differentiable pooling for unsupervised speaker adaptation", Proc. ICASSP, Brisbane, 2015
- (2015) Proc. ICASSP
- Swietojanski, P.¹ Renals, S.²

47
- 84858971297
- Convolutive bottleneck network features for LVCSR
- Hawaii
- K. Veseĺy, M. Karafiát, &F. Grézl, "Convolutive Bottleneck Network Features for LVCSR", Proc. ASRU Workshop, Hawaii, 2011
- (2011) Proc. ASRU Workshop
- Veseĺy, K.¹ Karafiát, M.² Grézl, F.³

48
- 84906274730
- Sequencediscriminative training of deep neural networks
- Lyon
- K. Vesely, A. Ghoshal, L. Burget, &D. Povey, "Sequencediscriminative training of deep neural networks", Proc. Interspeech, Lyon, 2013
- (2013) Proc. Interspeech
- Vesely, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

49
- 0036567794
- The development of the HTK broadcast news transcription system: An overview
- P.C.Woodland, "The development of the HTK broadcast news transcription system: An overview", Speech Communication, vol. 37, no. 1, pp. 47-67, 2002
- (2002) Speech Communication , vol.37 , Issue.1 , pp. 47-67
- Woodland, P.C.¹

50
- 79953250475
- Minimum Bayes risk decoding and system combination based on a recursion for edit distance
- H. Xu, D. Povey, L. Mangu, &J. Zhu, "Minimum Bayes risk decoding and system combination based on a recursion for edit distance", Computer Speech &Language, vol. 25, no. 4, pp. 802-828, 2011
- (2011) Computer Speech &Language , vol.25 , Issue.4 , pp. 802-828
- Xu, H.¹ Povey, D.² Mangu, L.³ Zhu, J.⁴

51
- 0003571976
- Cambridge University Engineering Department
- S.J. Young, G. Evermann, M.J.F. Gales, T. Hain., D. Kershaw, X. Liu, G. Moore, J.J. Odell, D. Ollason, D. Povey, V. Valtchev, and P.C. Woodland, The HTK book (for HTK version 3.4). Cambridge University Engineering Department, 2006
- (2006) The HTK Book (For HTK Version 3.4)
- Young, S.J.¹ Evermann, G.² Gales, M.J.F.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.⁶ Moore, G.⁷ Odell, J.J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.C.¹²

52
- 84923929378
- Fuse deep neural network and Gaussian mixture model systems
- Springer, London
- D. Yu &L. Deng, "Fuse deep neural network and Gaussian mixture model systems", Automatic Speech Recognition: A Deep Learning Approach, pp. 177-191. Springer, London, 2015
- (2015) Automatic Speech Recognition: A Deep Learning Approach , pp. 177-191
- Yu, D.¹ Deng, L.²

53
- 84959142742
- A general artificial neural network extension for HTK
- Dresden
- C. Zhang &P.C. Woodland, "A general artificial neural network extension for HTK", Proc. Interspeech, Dresden, 2015
- (2015) Proc. Interspeech
- Zhang, C.¹ Woodland, P.C.²

54
- 84959174678
- Parameterised sigmoid and ReLU hidden activation functions for DNN acoustic modelling
- Dresden
- C. Zhang &P.C. Woodland, "Parameterised sigmoid and ReLU hidden activation functions for DNN acoustic modelling", Proc. Interspeech, Dresden, 2015
- (2015) Proc. Interspeech
- Zhang, C.¹ Woodland, P.C.²

55
- 84946061232
- Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data
- Brisbane
- Y. Zhao, J.-Y. Li, J. Xue, &Y.-F. Gong, "Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data", Proc. ICASSP, Brisbane, 2015
- (2015) Proc. ICASSP
- Zhao, Y.¹ Li, J.-Y.² Xue, J.³ Gong, Y.-F.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.