SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2013, Pages 6975-6979

Multi-level adaptive networks in tandem and hybrid ASR systems

(3) Bell, Peter (a); Swietojanski, Pawel (a); Renals, Steve (a)

(a) UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

BBC; deep neural networks; hybrid; MLAN; tandem; TED

Indexed keywords

BBC; DEEP NEURAL NETWORKS; HYBRID; MLAN; TANDEM; TED;

SIGNAL PROCESSING; SPEECH RECOGNITION;

EXPERIMENTS;

EID: 84890537527 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6639014 Document Type: Conference Paper

Times cited : (38)

References (34)

1
- 0003573244
- Kluwer Academic Publishers
- H. Bourlard and N. Morgan, Connectionist Speech Recognition: A Hybrid Approach, Kluwer Academic Publishers, 1994
- (1994) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

2
- 0028194709
- Connectionist probability estimators in HMM speech recognition
- S. Renals, N. Morgan, H. Bourlard, M. Cohen, and H. Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.1 , pp. 161-174
- Renals, S.¹ Morgan, N.² Bourlard, H.³ Cohen, M.⁴ Franco, H.⁵

3
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- H. Hermansky, D.P.W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. ICASSP, 2000, pp. 1635-1638
- (2000) Proc. ICASSP , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.P.W.² Sharma, S.³

4
- 33745528628
- Using MLP features in SRI's conversational speech recognition system
- Q. Zhu, A. Stolcke, B.Y. Chen, and N. Morgan, "Using MLP features in SRI's conversational speech recognition system," in Proc. Interspeech, 2005
- (2005) Proc. Interspeech
- Zhu, Q.¹ Stolcke, A.² Chen, B.Y.³ Morgan, N.⁴

5
- 34547548235
- Probabilistic and bottleneck features for LVCSR of meetings
- F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottleneck features for LVCSR of meetings," in Proc. ICASSP, 2007
- (2007) Proc. ICASSP
- Grezl, F.¹ Karafiat, M.² Kontar, S.³ Cernocky, J.⁴

6
- 0028996853
- Recent improvements to the ABBOT large vocabulary CSR system
- M. M. Hochberg, S. J. Renals, A. J. Robinson, and G. D. Cook, "Recent improvements to the ABBOT large vocabulary CSR system," in Proc. IEEE ICASSP, 1995, pp. 69-72
- (1995) Proc. IEEE ICASSP , pp. 69-72
- Hochberg, M.M.¹ Renals, S.J.² Robinson, A.J.³ Cook, G.D.⁴

7
- 0036567797
- Connectionist speech recognition of broadcast news
- A. J. Robinson, G. D. Cook, D. P. W. Ellis, E. Fosler-Lussier, S. J. Renals, and D. A. G. Williams, "Connectionist speech recognition of broadcast news," Speech Communication, vol. 37, no. 1-2, pp. 27-45, 2002
- (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 27-45
- Robinson, A.J.¹ Cook, G.D.² Ellis, D.P.W.³ Fosler-Lussier, E.⁴ Renals, S.J.⁵ Williams, D.A.G.⁶

8
- 0028530231
- State clustering in hidden Markov model-based continuous speech recognition
- S. J. Young and P. C. Woodland, "State clustering in hidden Markov model-based continuous speech recognition," Computer Speech & Language, vol. 8, no. 4, pp. 369-383, 1994
- (1994) Computer Speech & Language , vol.8 , Issue.4 , pp. 369-383
- Young, S.J.¹ Woodland, P.C.²

9
- 0032050110
- Maximum likelihood linear transforms for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transforms for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998
- (1998) Computer Speech and Language , vol.12 , pp. 75-98

10
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP. IEEE, 2002, vol. I, pp. 105-108
- (2002) Proc. ICASSP. IEEE , vol.1 , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

11
- 33646788786
- FMPE: Discriminatively trained features for speech recognition
- D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig, "FMPE: Discriminatively trained features for speech recognition," in Proc ICASSP, 2005
- (2005) Proc ICASSP
- Povey, D.¹ Kingsbury, B.² Mangu, L.³ Saon, G.⁴ Soltau, H.⁵ Zweig, G.⁶

12
- 85008520364
- Transcribing meetings with the AMIDA systems
- T. Hain, L. Burget, J. Dines, P.N. Garner, F. Grezl, A.E. Hannani, M. Huijbregts, M. Karafiat, M. Lincoln, and V. Wan, "Transcribing meetings with the AMIDA systems," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 2, pp. 486-498, 2012
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.2 , pp. 486-498
- Hain, T.¹ Burget, L.² Dines, J.³ Garner, P.N.⁴ Grezl, F.⁵ Hannani, A.E.⁶ Huijbregts, M.⁷ Karafiat, M.⁸ Lincoln, M.⁹ Wan, V.¹⁰

13
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G.E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012
- (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

14
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011
- (2011) Proc. ASRU
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

15
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G.E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 14-22, 2012
- (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

16
- 85009078709
- CDNN: A context dependent neural network for continuous speech recognition
- H. Bourlard, N. Morgan, C. Wooters, and S. Renals, "CDNN: A context dependent neural network for continuous speech recognition," in Proc. ICASSP, 1992, vol. 2, pp. 349-352
- (1992) Proc. ICASSP , vol.2 , pp. 349-352
- Bourlard, H.¹ Morgan, N.² Wooters, C.³ Renals, S.⁴

17
- 0030371791
- The 1995 ABBOT LVCSR system for multiple unknown microphones
- D. Kershaw, T. Robinson, and S. Renals, "The 1995 ABBOT LVCSR system for multiple unknown microphones," in Proc. ICSLP, 1996, pp. 1325-1328
- (1996) Proc. ICSLP , pp. 1325-1328
- Kershaw, D.¹ Robinson, T.² Renals, S.³

18
- 33745805403
- A fast learning algorithm for deep belief nets
- G. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, pp. 1527-1554, 2006
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.¹ Osindero, S.² Teh, Y.³

19
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech, 2011
- (2011) Proc. Interspeech
- Seide, F.¹ Li, G.² Yu, D.³

20
- 84858972572
- Making deep belief networks effective for large vocabulary continuous speech recognition
- T. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in Proc. ASRU, 2011
- (2011) Proc. ASRU
- Sainath, T.¹ Kingsbury, B.² Ramabhadran, B.³ Fousek, P.⁴ Novak, P.⁵ Mohamed, A.⁶

21
- 84867593213
- Auto-encoder bottleneck features using deep belief networks
- T. Sainath, B. Kingsbury, and B. Ramabhadran, "Auto-encoder bottleneck features using deep belief networks," in Proc ICASSP, 2012
- (2012) Proc ICASSP
- Sainath, T.¹ Kingsbury, B.² Ramabhadran, B.³

22
- 84878392008
- Data-driven posterior features for low resource speech recognition applications
- S. Thomas, S. Ganapathy, A. Jansen, and H. Hermansky, "Data-driven posterior features for low resource speech recognition applications," in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Thomas, S.¹ Ganapathy, S.² Jansen, A.³ Hermansky, H.⁴

23
- 84858955616
- Study of probabilistic and bottle-neck features in multilingual environment
- F. Grezl, M. Karafiat, and M. Janda, "Study of probabilistic and bottle-neck features in multilingual environment," in Proc. ASRU, 2011
- (2011) Proc. ASRU
- Grezl, F.¹ Karafiat, M.² Janda, M.³

24
- 84867606552
- Multilingual MLP features for low-resource LVCSR systems
- S. Thomas, S. Ganapathy, and H. Hermansky, "Multilingual MLP features for low-resource LVCSR systems," in Proc. ICASSP, 2012
- (2012) Proc. ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

25
- 4544236237
- On use of task independent training data in tandem feature extraction
- S. Sivadas and H. Hermansky, "On use of task independent training data in tandem feature extraction," in Proc. ICASSP, 2004
- (2004) Proc. ICASSP
- Sivadas, S.¹ Hermansky, H.²

26
- 33947619591
- Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons
- A. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri, "Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons," in Proc. ICASSP, 2006
- (2006) Proc. ICASSP
- Stolcke, A.¹ Grezl, F.² Hwang, M.-Y.³ Lei, X.⁴ Morgan, N.⁵ Vergyri, D.⁶

27
- 78049384951
- Multi-style MLP features for BN transcription
- V.-B. Le, L. Lamel, and J.-L. Gauvain, "Multi-style MLP features for BN transcription," in Proc. ICASSP, 2010, pp. 4866-4869
- (2010) Proc. ICASSP , pp. 4866-4869
- Le, V.-B.¹ Lamel, L.² Gauvain, J.-L.³

28
- 77949374930
- MLP based hierarchical system for task adaptation in ASR
- J. Pinto, M. Magimai-Doss, and H. Bourlard, "MLP based hierarchical system for task adaptation in ASR," in Proc. ASRU, 2009
- (2009) Proc. ASRU
- Pinto, J.¹ Magimai-Doss, M.² Bourlard, H.³

29
- 84874245054
- Transcription of multi-genre media archives using out-of-domain data
- P.J. Bell, M.J.F. Gales, P. Lanchantin, X. Liu, Y. Long, S. Renals, P. Swietojanski, and P.C. Woodland, "Transcription of multi-genre media archives using out-of-domain data," in Proc. IEEE Workshop on Spoken Language Technology, Dec. 2012
- (2012) Proc. IEEE Workshop on Spoken Language Technology
- Bell, P.J.¹ Gales, M.J.F.² Lanchantin, P.³ Liu, X.⁴ Long, Y.⁵ Renals, S.⁶ Swietojanski, P.⁷ Woodland, P.C.⁸

30
- 33947620115
- Hierarchical structures of neural networks for phoneme recognition
- P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," in Proc. ICASSP, 2006
- (2006) Proc. ICASSP
- Schwarz, P.¹ Matejka, P.² Cernocky, J.³

31
- 84873443879
- Theano: A CPU and GPU math expression compiler
- J. Bergstra, O. Breuleux, F. Bastien, P. Lamblin, R. Pascanu, G. Desjardins, J. Turian, D. Warde-Farley, and Y. Bengio, "Theano: A CPU and GPU math expression compiler," in Proc. SciPy, 2010
- (2010) Proc. SciPy
- Bergstra, J.¹ Breuleux, O.² Bastien, F.³ Lamblin, P.⁴ Pascanu, R.⁵ Desjardins, G.⁶ Turian, J.⁷ Warde-Farley, D.⁸ Bengio, Y.⁹

32
- 85045373614
- Overview of the IWSLT 2012 evaluation campaign
- M. Federico, M. Cettolo, L. Bentivogli, M. Paul, and S. Stuker, "Overview of the IWSLT 2012 evaluation campaign," in Proc. of the International Workshop on Spoken Language Translation, Hong Kong, HK, December 2012
- (2012) Proc. of the International Workshop on Spoken Language Translation, Hong Kong
- Federico, M.¹ Cettolo, M.² Bentivogli, L.³ Paul, M.⁴ Stuker, S.⁵

33
- 85001124710
- Wit3: Web inventory of transcribed and translated talks
- M. Cettolo, C. Girardi, and M. Federico, "Wit3: Web inventory of transcribed and translated talks," in Proceedings of the 16th Conference of the European Association for Machine Translation (EAMT), Trento, Italy, May 2012, pp. 261-268
- (2012) Proceedings of the 16th Conference of the European Association for Machine Translation (EAMT) , pp. 261-268
- Cettolo, M.¹ Girardi, C.² Federico, M.³

34
- 84890543632
- The UEDIN systems for the IWSLT 2012 evaluation
- E. Hasler, P. Bell, A. Ghoshal, B. Haddow, P. Koehn, F. McInnes, S. Renals, and P. Swietojanski, "The UEDIN systems for the IWSLT 2012 evaluation," in Proc. IWSLT, 2012.
- (2012) Proc. IWSLT
- Hasler, E.¹ Bell, P.² Ghoshal, A.³ Haddow, B.⁴ Koehn, P.⁵ McInnes, F.⁶ Renals, S.⁷ Swietojanski, P.⁸

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.