메뉴 건너뛰기




Volumn , Issue , 2011, Pages 95-100

Leveraging large amounts of loosely transcribed corporate videos for acoustic model training

Author keywords

automatic speech recognition; lightly supervised acoustic model training; LVCSR

Indexed keywords

ACOUSTIC MODEL; ADDITIONAL COSTS; AUTOMATIC SPEECH RECOGNITION; COST SAVING; LIGHTLY SUPERVISED ACOUSTIC MODEL TRAINING; LVCSR; STATE OF THE ART; TRAINING PROCESS; WORD ERROR RATE;

EID: 84858951500     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2011.6163912     Document Type: Conference Paper
Times cited : (5)

References (22)
  • 1
    • 0032659923 scopus 로고    scopus 로고
    • Improving acoustic models with captioned multimedia speech
    • June
    • P. J. Jang and A. G. Hauptmann, "Improving Acoustic Models with Captioned Multimedia Speech." in Proc. of ICMCS, vol. 2, June 1999, pp. 767-771.
    • (1999) Proc. of ICMCS , vol.2 , pp. 767-771
    • Jang, P.J.1    Hauptmann, A.G.2
  • 3
    • 0034841730 scopus 로고    scopus 로고
    • Investigating lightly supervised acoustic model training
    • Salt Lake City, USA, May
    • L. Lamel, J. Gauvain, and G. Adda, "Investigating Lightly Supervised Acoustic Model Training," in Proc. of ICASSP, Salt Lake City, USA, May 2001.
    • (2001) Proc. of ICASSP
    • Lamel, L.1    Gauvain, J.2    Adda, G.3
  • 4
    • 4544315111 scopus 로고    scopus 로고
    • Lightly supervised acoustic model training using consensus networks
    • L. Chen, L. Lamel, and J.-L. Gauvain, "Lightly supervised acoustic model training using consensus networks," in Proc. ICASSP, 2004.
    • (2004) Proc. ICASSP
    • Chen, L.1    Lamel, L.2    Gauvain, J.-L.3
  • 5
    • 4544273245 scopus 로고    scopus 로고
    • Light supervision in acoustic model training
    • L. Nguyen and B. Xiang, "Light Supervision in Acoustic Model Training," in Proc. ICASSP, 2004.
    • (2004) Proc. ICASSP
    • Nguyen, L.1    Xiang, B.2
  • 6
    • 4544253838 scopus 로고    scopus 로고
    • Improving broadcast news transcription by lightly supervised discriminative training
    • Montreal, Canada, May
    • H. Chan and P. Woodland, "Improving Broadcast News Transcription by Lightly Supervised Discriminative Training," in Proc. ICASSP, Montreal, Canada, May 2004.
    • (2004) Proc. ICASSP
    • Chan, H.1    Woodland, P.2
  • 7
    • 84867216798 scopus 로고    scopus 로고
    • Lightly supervised acoustic model training on EPPS recordings
    • Brisbane, Australia, September
    • M. Paulik and A. Waibel, "Lightly Supervised Acoustic Model Training on EPPS Recordings," in Proc. Interspeech, Brisbane, Australia, September 2008.
    • (2008) Proc. Interspeech
    • Paulik, M.1    Waibel, A.2
  • 8
    • 79851498679 scopus 로고    scopus 로고
    • Automatic transcription of parliamentary meetings and classroom lectures - A sustainable approach and real system evaluations
    • Tainan, Taiwan, November
    • T. Kawahara, "Automatic transcription of parliamentary meetings and classroom lectures - A sustainable approach and real system evaluations," in Proc. Chinese Spoken Language Processing, Tainan, Taiwan, November 2010.
    • (2010) Proc. Chinese Spoken Language Processing
    • Kawahara, T.1
  • 9
    • 0003571407 scopus 로고    scopus 로고
    • University of Edinburgh, Scotland, Tech. Rep.
    • A. Black and P. Taylor, "The Festival Speech Synthesis System," University of Edinburgh, Scotland, Tech. Rep., 1997, http://www.cstr.ed.ac.uk/ projects/festival.html.
    • (1997) The Festival Speech Synthesis System
    • Black, A.1    Taylor, P.2
  • 11
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky, "Perceptual Linear Predictive (PLP) Analysis of Speech," The Journal of Acoustical Society of America, vol. 87(4), pp. 1738-1752, 1990. (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 12
    • 84891308106 scopus 로고    scopus 로고
    • SRILM - An extensible language modeling toolkit
    • Denver, USA, September
    • A. Stolcke, "SRILM - An extensible language modeling toolkit." in Proc. of ICSLP, Denver, USA, September 2002.
    • (2002) Proc. of ICSLP
    • Stolcke, A.1
  • 13
    • 0028996876 scopus 로고
    • Improved backing-off for n-gram language modeling
    • Detroit, USA, May
    • R. Kneser and H. Ney, "Improved backing-off for n-gram language modeling." in Proc. of ICASSP, Detroit, USA, May 1995.
    • (1995) Proc. of ICASSP
    • Kneser, R.1    Ney, H.2
  • 14
    • 0003396042 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Harvard University, Tech. Rep., 1998.
    • (1998) Harvard University, Tech. Rep.
    • Chen, S.1    Goodman, J.2
  • 18
    • 4544339437 scopus 로고    scopus 로고
    • A generalized construction of integrated speech recognition transducers
    • Montreal, Canada, May
    • C. Allauzen, M. Mohri, M. Riley, and B. Roar., "A Generalized Construction of Integrated Speech Recognition Transducers." in Proc. of ICASSP, Montreal, Canada, May 2004.
    • (2004) Proc. of ICASSP
    • Allauzen, C.1    Mohri, M.2    Riley, M.3    Roar, B.4
  • 19
    • 79959851726 scopus 로고    scopus 로고
    • An empirical comparison of the T3, juicer, HDecode and sphinx3 decoders
    • Makuhari, Japan, September
    • J. R. Novak, P. Dixon, and S. Furui, "An Empirical Comparison of the T3, Juicer, HDecode and Sphinx3 Decoders." in Proc. of Interspeech, Makuhari, Japan, September 2010.
    • (2010) Proc. of Interspeech
    • Novak, J.R.1    Dixon, P.2    Furui, S.3
  • 20
    • 70450180978 scopus 로고    scopus 로고
    • Robust LTS rules with the combilex speech technology lexicon
    • Brighton, UK, September
    • K. Richmond, R. A. J. Clark, and S. Fitt, "Robust LTS rules with the Combilex speech technology lexicon," in Proc. of Interspeech, Brighton, UK, September 2009.
    • (2009) Proc. of Interspeech
    • Richmond, K.1    Clark, R.A.J.2    Fitt, S.3
  • 21
    • 0042879653 scopus 로고    scopus 로고
    • A systematic comparison of various statistical alignment models
    • DOI 10.1162/089120103321337421
    • F. Och and H. Ney, "A Systematic Comparison of Various Statistical Alignment Models," Computational Linguistics, vol. 29(1), pp. 19-51, 2003. (Pubitemid 37049767)
    • (2003) Computational Linguistics , vol.29 , Issue.1 , pp. 19-51
    • Och, F.J.1    Ney, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.