SCOPUS 정보 검색 플랫폼

2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings

Volumn , Issue , 2012, Pages 246-251

Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR

(3) Swietojanski, Pawel a Ghoshal, Arnab a Renals, Steve a

a UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Cross lingual ASR; Deep Neural Networks; GlobalPhone; RBM pretraining

Indexed keywords

ACOUSTIC DATA; ACOUSTIC MODEL; CROSS-LINGUAL; DEEP NEURAL NETWORKS; EMISSION DENSITY; GAUSSIAN MIXTURE MODEL (GMMS); GLOBALPHONE; HIDDEN MARKOV MODEL(HMM); HYBRID CONFIGURATIONS; KNOWLEDGE TRANSFER; PRE-TRAINING; RESTRICTED BOLTZMANN MACHINE; SWEDISHS; TANDEM CONFIGURATION; TRAINING DATA;

HIDDEN MARKOV MODELS; KNOWLEDGE MANAGEMENT;

NEURAL NETWORKS;

EID: 84874278045 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SLT.2012.6424230 Document Type: Conference Paper

Times cited : (125)

References (28)

1
- 0035426931
- Language independent and language adaptive acoustic modeling for speech recognition
- T Schultz and AWaibel, "Language independent and language adaptive acoustic modeling for speech recognition," Speech Communication, vol. 35, pp. 31-51, 2001.
- (2001) Speech Communication , vol.35 , pp. 31-51
- Schultz, T.¹ Waibel, A.²

2
- 85009101138
- Experiments on cross-language acoustic modeling
- T Schultz and A Waibel, "Experiments on cross-language acoustic modeling," in Proc. Eurospeech, 2001.
- (2001) Proc. Eurospeech
- Schultz, T.¹ Waibel, A.²

3
- 84858955616
- Study of probabilistic and bottle-neck features in multilingual environment
- F Gŕezl, M Karafíat, and M Janda, "Study of probabilistic and bottle-neck features in multilingual environment," in Proc. IEEE ASRU, 2011.
- (2011) Proc IEEE ASRU
- Gŕezl, F.¹ Karafíat, M.² Janda, M.³

4
- 84867606552
- Multilingual MLP features for low-resource LVCSR systems
- S Thomas, S Ganapathy, and H Hermansky, "Multilingual MLP features for low-resource LVCSR systems," in Proc. IEEE ICASSP, 2012.
- (2012) Proc IEEE ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

5
- 84867616349
- Using KL-divergence and multilingual information to improve ASR for underresourced languages
- D Imseng, H Bourlard, and PN Garner, "Using KL-divergence and multilingual information to improve ASR for underresourced languages," in Proc. IEEE ICASSP, 2012.
- (2012) Proc IEEE ICASSP
- Imseng, D.¹ Bourlard, H.² Garner, P.N.³

6
- 78049394188
- Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models
- L Burget, P Schwarz, M Agarwal, P Akyaz, K Feng, A Ghoshal, O Glembek, N Goel, M Karafíat, D Povey, A Rastrow, RC Rose, and S Thomas, "Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models," in Proc. IEEE ICASSP, 2010.
- (2010) Proc IEEE ICASSP
- Burget, L.¹ Schwarz, P.² Agarwal, M.³ Akyaz, P.⁴ Feng, K.⁵ Ghoshal, A.⁶ Glembek, O.⁷ Goel, N.⁸ Karafíat, M.⁹ Povey, D.¹⁰ Rastrow, A.¹¹ Rose, R.C.¹² Thomas, S.¹³

7
- 84858952433
- Regularized subspace Gaussian mixture models for cross-lingual speech recognition
- L Lu, A Ghoshal, and S Renals, "Regularized subspace Gaussian mixture models for cross-lingual speech recognition," in Proc. IEEE ASRU, 2011.
- (2011) Proc IEEE ASRU
- Lu, L.¹ Ghoshal, A.² Renals, S.³

8
- 80051617867
- Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil
- NT Vu, F Kraus, and T Schultz, "Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil," in Proc. IEEE ICASSP, 2011.
- (2011) Proc IEEE ICASSP
- Vu, N.T.¹ Kraus, F.² Schultz, T.³

9
- 33745805403
- A fast learning algorithm for deep belief nets
- G Hinton, S Osindero, and Y Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, pp. 1527-1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.¹ Osindero, S.² Teh, Y.³

10
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- H Hermansky, DPW Ellis, and S Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE ICASSP, 2000.
- (2000) Proc IEEE ICASSP
- Hermansky, H.¹ Ellis, D.² Sharma, S.³

11
- 34547548235
- Probabilistic and bottle-neck features for LVCSR of meetings
- F Grézl, M Karafiát, S Kontár, and J C? ernocký, "Probabilistic and bottle-neck features for LVCSR of meetings," in Proc. IEEE ICASSP, 2007.
- (2007) Proc IEEE ICASSP
- Grézl, F.¹ Karafiát, M.² Kontár, S.³ Cernocký, J.⁴

12
- 0003573244
- Kluwer Academic
- H Bourlard and N Morgan, Connectionist Speech Recognition-A Hybrid Approach, Kluwer Academic, 1994.
- (1994) Connectionist Speech Recognition-A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

13
- 0028194709
- Connectionist probability estimators in HMM speech recognition
- S Renals, N Morgan, H Bourlard, M Cohen, and H Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.1 , pp. 161-174
- Renals, S.¹ Morgan, N.² Bourlard, H.³ Cohen, M.⁴ Franco, H.⁵

14
- 0028392167
- An application of recurrent nets to phone probability estimation
- AJ Robinson, "An application of recurrent nets to phone probability estimation," IEEE Transactions on Neural Networks, vol. 5, no. 2, pp. 298-305, 1994.
- (1994) IEEE Transactions on Neural Networks , vol.5 , Issue.2 , pp. 298-305
- Robinson, A.J.¹

15
- 84055211743
- Acoustic modeling using deep belief networks
- A Mohamed, GE Dahl, and G Hinton, "Acoustic modeling using deep belief networks," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

16
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- GE Dahl, D Yu, L Deng, and A Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech & Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech & Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

17
- 33947619591
- Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
- A Stolcke, F Gŕezl, M-Y Hwang, X Lei, N Morgan, and D Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in Proc. IEEE ICASSP, 2006.
- (2006) Proc IEEE ICASSP
- Stolcke, A.¹ Gŕezl, F.² Hwang, M.-Y.³ Lei, X.⁴ Morgan, N.⁵ Vergyri, D.⁶

18
- 79959819891
- Cross-lingual and multistream posterior features for low resource LVCSR systems
- S Thomas and H Hermansky, "Cross-lingual and multistream posterior features for low resource LVCSR systems," in Proc. Interspeech, 2010.
- (2010) Proc. Interspeech
- Thomas, S.¹ Hermansky, H.²

19
- 44849132075
- Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs
- O Ç etin, M Magimai-Doss, K Livescu, A Kantor, S King, C Bartels, and J Frankel, "Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs," in Proc IEEE ASRU, 2007.
- (2007) Proc IEEE ASRU
- Çetin, O.¹ Magimai-Doss, M.² Livescu, K.³ Kantor, A.⁴ King, S.⁵ Bartels, C.⁶ Frankel, J.⁷

20
- 84864073449
- Greedy layer-wise training of deep networks
- MIT Press
- Y Bengio, P Lamblin, D Popovici, and H Larochelle, "Greedy layer-wise training of deep networks," in Advances in Neural Information Processing Systems 19 (NIPS'06), pp. 153-160. MIT Press, 2007.
- (2007) Advances in Neural Information Processing Systems 19 (NIPS'06) , pp. 153-160
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴

21
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F Seide, G Li, X Chen, and D Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE ASRU, 2011.
- (2011) Proc IEEE ASRU
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

22
- 84905257065
- GlobalPhone: A multilingual speech and text database developed at Karlsruhe University
- T Schultz, "GlobalPhone: a multilingual speech and text database developed at Karlsruhe University," in Proc. ICLSP, 2002.
- (2002) Proc. ICLSP
- Schultz, T.¹

23
- 84858965424
- Ph.D. thesis The University of Edinburgh
- P Lal, Cross-Lingual Automatic Speech Recognition using Tandem Features, Ph.D. thesis, The University of Edinburgh, 2011.
- (2011) Cross-Lingual Automatic Speech Recognition Using Tandem Features
- Lal, P.¹

24
- 84874276847
- The Kaldi speech recognition toolkit
- December
- D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, M Hannemann, P Motĺcek, Y Qian, P Schwarz, J Silovsḱy, G Stemmer, and K Veseĺy, "The Kaldi speech recognition toolkit," in Proc. IEEE ASRU, December 2011.
- (2011) Proc. IEEE ASRU
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motĺcek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsḱy, J.¹¹ Stemmer, G.¹² Veseĺy, K.¹³

25
- 84892187452
- Maximum likelihood modeling with Gaussian distributions for classification
- May
- R Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. IEEE ICASSP, May 1998, vol. 2, pp. 661-664.
- (1998) Proc IEEE ICASSP , vol.2 , pp. 661-664
- Gopinath, R.¹

26
- 51449120120
- Boosted MMI for model and featurespace discriminative training
- D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, and K Visweswariah, "Boosted MMI for model and featurespace discriminative training," in Proc. IEEE ICASSP, 2008, pp. 4057-4060.
- (2008) Proc IEEE ICASSP , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

27
- 84873443879
- Theano: A CPU and GPU math expression compiler
- J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, J Turian, D Warde-Farley, and Y Bengio, "Theano: a CPU and GPU math expression compiler," in Proc. SciPy, 2010.
- (2010) Proc. SciPy
- Bergstra, J.¹ Breuleux, O.² Bastien, F.³ Lamblin, P.⁴ Pascanu, R.⁵ Desjardins, G.⁶ Turian, J.⁷ Warde-Farley, D.⁸ Bengio, Y.⁹

28
- 77949522811
- Why does unsupervised pre-training help deep learning?
- February
- D Erhan, Y Bengio, A Courville, P-A Manzagol, P Vincent, and S Bengio, "Why does unsupervised pre-training help deep learning?," Journal of Machine Learning Research, vol. 11, pp. 625-660, February 2010.
- (2010) Journal of Machine Learning Research , vol.11 , pp. 625-660
- Erhan, D.¹ Bengio, Y.² Courville, A.³ Manzagol, P.-A.⁴ Vincent, P.⁵ Bengio, S.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.