SCOPUS 정보 검색 플랫폼

2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Volumn , Issue , 2016, Pages 681-686

CRIM and LIUM approaches for multi-genre broadcast media transcription

(6) Gupta, Vishwa a Deleglise, Paul b Boulianne, Gilles a Esteve, Yannick b Meignier, Sylvain b Rousseau, Anthony b

a Cent de Recherche Informatique de Montreal (Canada)

b UNIVERSITÉ DU MAINE (France)

Author keywords

automatic transcription; change point detection; Deep Neural Networks; DNN; multi genre broadcast transcription

Indexed keywords

MERGING; TRANSCRIPTION;

AUTOMATIC TRANSCRIPTION; BASELINE SYSTEMS; BROADCAST MEDIA; CHANGE POINT DETECTION; DEEP NEURAL NETWORKS; SPEAKER DIARIZATION; TRAINING SCENARIO; WORD ERROR RATE;

SPEECH RECOGNITION;

EID: 84964540334 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2015.7404862 Document Type: Conference Paper

Times cited : (5)

References (17)

1
- 85010742974
- The MGB challenge evaluating multigenre broadcast media transcription
- P. Bell et. al., "The MGB challenge: Evaluating multigenre broadcast media transcription", in Proc. ASRU 2015
- (2015) Proc. ASRU
- Bell, P.¹

2
- 84858953642
- The kaldi speech recognition toolkit
- D. Povey et. al., "The Kaldi Speech Recognition Toolkit", in Proc. ASRU 2011
- (2011) Proc. ASRU
- Povey, D.¹

3
- 84905223329
- Multilingual deep neural network based acoustic modeling for rapid language adaptation
- N. Vu, D. Imseng, D. Povey, P. Motlicek, T. Schultz, H. Bourlard, "Multilingual deep neural network based acoustic modeling for rapid language adaptation", in Proc. ICASSP 2014, pp. 7689-7693
- (2014) Proc. ICASSP , pp. 7689-7693
- Vu, N.¹ Imseng, D.² Povey, D.³ Motlicek, P.⁴ Schultz, T.⁵ Bourlard, H.⁶

4
- 78049260392
- Doctoral Thesis, dept. Computer Graphics &Multimedia, Brno Univ of Technology, Brno
- F. Grézl, "TRAP-based Probabilistic Features for Automatic Speech Recognition", Doctoral Thesis, dept. Computer Graphics &Multimedia, Brno Univ of Technology, Brno 2007
- (2007) TRAP-based Probabilistic Features for Automatic Speech Recognition
- Grézl, F.¹

5
- 33745219648
- The development of the Cambridge university rt-04 diarisation system
- S. Tranter, M. Gales, R. Sinha, S. Umesh, P. Woodland, "The development of the Cambridge University RT-04 diarisation system," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), 2004
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Tranter, S.¹ Gales, M.² Sinha, R.³ Umesh, S.⁴ Woodland, P.⁵

6
- 84905239342
- Improving deep neural network acoustic models using generalized maxout networks
- X. Zhang, J. Trmal, D. Povey, S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks", in Proc. ICASSP 2014, pp. 215-219
- (2014) Proc. ICASSP , pp. 215-219
- Zhang, X.¹ Trmal, J.² Povey, D.³ Khudanpur, S.⁴

7
- 84905259145
- I-vectorbased speaker adaptation of deep neural networks for French broadcast audio transcription
- Florence, Italy
- V. Gupta, P. Kenny, P. Ouellet, T. Stafylakis, "I-vectorbased speaker adaptation of deep neural networks for French broadcast audio transcription", in Proc. ICASSP 2014, Florence, Italy
- (2014) Proc. ICASSP
- Gupta, V.¹ Kenny, P.² Ouellet, P.³ Stafylakis, T.⁴

8
- 84893691530
- Speaker adaptation of neural network acoustic models using i-vectors
- G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors", in Proc. ASRU 2013, pp. 55-59
- (2013) Proc. ASRU , pp. 55-59
- Saon, G.¹ Soltau, H.² Nahamoo, D.³ Picheny, M.⁴

9
- 84905259138
- Improving DNN speaker independence with i-vector inputs
- A. Senior, I. Moreno, "Improving DNN speaker independence with i-vector inputs", in Proc. ICASSP 2014
- (2014) Proc. ICASSP
- Senior, A.¹ Moreno, I.²

10
- 78650898482
- LIUM SPKDIARIZATION: An open source toolkit for diarization
- Dallas, Tx
- S. Meignier and T. Merlin, "LIUM SPKDIARIZATION: An open source toolkit for diarization", in CMU SPUD workshop, Dallas, Tx, 2010
- (2010) CMU SPUD Workshop
- Meignier, S.¹ Merlin, T.²

11
- 84910063870
- The UEDIN english ASR system for the IWSLT 2013 evaluation
- P.J. Bell, F. McInnes, S. Gangireddy, M. Sinclair, A. Birch, and S. Renals, "The UEDIN english ASR system for the IWSLT 2013 evaluation", in Proc. International Workshop on Spoken Language Translation, 2013
- (2013) Proc. International Workshop on Spoken Language Translation
- Bell, P.J.¹ McInnes, F.² Gangireddy, S.³ Sinclair, M.⁴ Birch, A.⁵ Renals, S.⁶

12
- 84906261494
- CSLM-A modular Open-Source Continuous Space Language Modeling Toolkit
- Lyon, France
- H. Schwenk, "CSLM-A modular Open-Source Continuous Space Language Modeling Toolkit", in Proc. Interspeech 2013, Lyon, France
- (2013) Proc. Interspeech
- Schwenk, H.¹

13
- 51449094642
- Speaker diarization of French broadcast news
- V. Gupta, G. Boulianne, P. Kenny, P. Ouellet, and P. Dumouchel, "Speaker Diarization of French Broadcast News", in Proc. ICASSP 2008, pp. 4365-4368
- (2008) Proc. ICASSP , pp. 4365-4368
- Gupta, V.¹ Boulianne, G.² Kenny, P.³ Ouellet, P.⁴ Dumouchel, P.⁵

14
- 84906274730
- Sequencediscriminative training of deep neural networks
- Lyon, France
- K. Veseĺy, A. Ghoshal, L. Burget, D. Povey, "Sequencediscriminative Training of Deep Neural Networks", in Proc. Interspeech 2013, Lyon, France
- (2013) Proc. Interspeech
- Veseĺy, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

15
- 70450190028
- Improvements to the LIUM French ASR system based on CMU Sphinx: What helps to significantly reduce the word error rate?
- Brighton, UK
- P. Deléglise, Y. Esteve, S. Meignier, T. Merlin, "Improvements to the LIUM French ASR system based on CMU Sphinx: what helps to significantly reduce the word error rate?", in Proc. Interspeech 2009, Brighton, UK
- (2009) Proc. Interspeech
- Deléglise, P.¹ Esteve, Y.² Meignier, S.³ Merlin, T.⁴

16
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- L. Mangu, E. Brilland, A. Stolcke, "Finding Consensus in Speech Recognition: Word Error Minimization and other Applications of Confusion Networks", in Computer Speech and Language, vol. 14, number 4, pp 373-400, 2000
- (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brilland, E.² Stolcke, A.³

17
- 84964532533
- LIUM and CRIM ASR system combination for the REPERE evaluation campaign
- Brno, Czech Republic
- A. Rousseau, G. Boulianne, P. Deléglise, Y. Esteve, V. Gupta, S. Meignier, "LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign", in Proc. Text, Speech and Dialogue, Brno, Czech Republic, 2014
- (2014) Proc. Text, Speech and Dialogue
- Rousseau, A.¹ Boulianne, G.² Deléglise, P.³ Esteve, Y.⁴ Gupta, V.⁵ Meignier, S.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.