메뉴 건너뛰기




Volumn , Issue , 2016, Pages 681-686

CRIM and LIUM approaches for multi-genre broadcast media transcription

Author keywords

automatic transcription; change point detection; Deep Neural Networks; DNN; multi genre broadcast transcription

Indexed keywords

MERGING; TRANSCRIPTION;

EID: 84964540334     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2015.7404862     Document Type: Conference Paper
Times cited : (5)

References (17)
  • 1
    • 85010742974 scopus 로고    scopus 로고
    • The MGB challenge evaluating multigenre broadcast media transcription
    • P. Bell et. al., "The MGB challenge: Evaluating multigenre broadcast media transcription", in Proc. ASRU 2015
    • (2015) Proc. ASRU
    • Bell, P.1
  • 2
    • 84858953642 scopus 로고    scopus 로고
    • The kaldi speech recognition toolkit
    • D. Povey et. al., "The Kaldi Speech Recognition Toolkit", in Proc. ASRU 2011
    • (2011) Proc. ASRU
    • Povey, D.1
  • 3
    • 84905223329 scopus 로고    scopus 로고
    • Multilingual deep neural network based acoustic modeling for rapid language adaptation
    • N. Vu, D. Imseng, D. Povey, P. Motlicek, T. Schultz, H. Bourlard, "Multilingual deep neural network based acoustic modeling for rapid language adaptation", in Proc. ICASSP 2014, pp. 7689-7693
    • (2014) Proc. ICASSP , pp. 7689-7693
    • Vu, N.1    Imseng, D.2    Povey, D.3    Motlicek, P.4    Schultz, T.5    Bourlard, H.6
  • 6
    • 84905239342 scopus 로고    scopus 로고
    • Improving deep neural network acoustic models using generalized maxout networks
    • X. Zhang, J. Trmal, D. Povey, S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks", in Proc. ICASSP 2014, pp. 215-219
    • (2014) Proc. ICASSP , pp. 215-219
    • Zhang, X.1    Trmal, J.2    Povey, D.3    Khudanpur, S.4
  • 7
    • 84905259145 scopus 로고    scopus 로고
    • I-vectorbased speaker adaptation of deep neural networks for French broadcast audio transcription
    • Florence, Italy
    • V. Gupta, P. Kenny, P. Ouellet, T. Stafylakis, "I-vectorbased speaker adaptation of deep neural networks for French broadcast audio transcription", in Proc. ICASSP 2014, Florence, Italy
    • (2014) Proc. ICASSP
    • Gupta, V.1    Kenny, P.2    Ouellet, P.3    Stafylakis, T.4
  • 8
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors", in Proc. ASRU 2013, pp. 55-59
    • (2013) Proc. ASRU , pp. 55-59
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 9
    • 84905259138 scopus 로고    scopus 로고
    • Improving DNN speaker independence with i-vector inputs
    • A. Senior, I. Moreno, "Improving DNN speaker independence with i-vector inputs", in Proc. ICASSP 2014
    • (2014) Proc. ICASSP
    • Senior, A.1    Moreno, I.2
  • 10
    • 78650898482 scopus 로고    scopus 로고
    • LIUM SPKDIARIZATION: An open source toolkit for diarization
    • Dallas, Tx
    • S. Meignier and T. Merlin, "LIUM SPKDIARIZATION: An open source toolkit for diarization", in CMU SPUD workshop, Dallas, Tx, 2010
    • (2010) CMU SPUD Workshop
    • Meignier, S.1    Merlin, T.2
  • 12
    • 84906261494 scopus 로고    scopus 로고
    • CSLM-A modular Open-Source Continuous Space Language Modeling Toolkit
    • Lyon, France
    • H. Schwenk, "CSLM-A modular Open-Source Continuous Space Language Modeling Toolkit", in Proc. Interspeech 2013, Lyon, France
    • (2013) Proc. Interspeech
    • Schwenk, H.1
  • 14
    • 84906274730 scopus 로고    scopus 로고
    • Sequencediscriminative training of deep neural networks
    • Lyon, France
    • K. Veseĺy, A. Ghoshal, L. Burget, D. Povey, "Sequencediscriminative Training of Deep Neural Networks", in Proc. Interspeech 2013, Lyon, France
    • (2013) Proc. Interspeech
    • Veseĺy, K.1    Ghoshal, A.2    Burget, L.3    Povey, D.4
  • 15
    • 70450190028 scopus 로고    scopus 로고
    • Improvements to the LIUM French ASR system based on CMU Sphinx: What helps to significantly reduce the word error rate?
    • Brighton, UK
    • P. Deléglise, Y. Esteve, S. Meignier, T. Merlin, "Improvements to the LIUM French ASR system based on CMU Sphinx: what helps to significantly reduce the word error rate?", in Proc. Interspeech 2009, Brighton, UK
    • (2009) Proc. Interspeech
    • Deléglise, P.1    Esteve, Y.2    Meignier, S.3    Merlin, T.4
  • 16
    • 0034296009 scopus 로고    scopus 로고
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • L. Mangu, E. Brilland, A. Stolcke, "Finding Consensus in Speech Recognition: Word Error Minimization and other Applications of Confusion Networks", in Computer Speech and Language, vol. 14, number 4, pp 373-400, 2000
    • (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brilland, E.2    Stolcke, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.