SCOPUS 정보 검색 플랫폼

2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings

Volumn , Issue , 2011, Pages 95-100

Leveraging large amounts of loosely transcribed corporate videos for acoustic model training

(2) Paulik, Matthias a Panchapagesan, Panchi a

a CISCO SYSTEMS (United States)

Author keywords

automatic speech recognition; lightly supervised acoustic model training; LVCSR

Indexed keywords

ACOUSTIC MODEL; ADDITIONAL COSTS; AUTOMATIC SPEECH RECOGNITION; COST SAVING; LIGHTLY SUPERVISED ACOUSTIC MODEL TRAINING; LVCSR; STATE OF THE ART; TRAINING PROCESS; WORD ERROR RATE;

SPEECH RECOGNITION;

TRANSCRIPTION;

EID: 84858951500 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2011.6163912 Document Type: Conference Paper

Times cited : (5)

References (22)

1
- 0032659923
- Improving acoustic models with captioned multimedia speech
- June
- P. J. Jang and A. G. Hauptmann, "Improving Acoustic Models with Captioned Multimedia Speech." in Proc. of ICMCS, vol. 2, June 1999, pp. 767-771.
- (1999) Proc. of ICMCS , vol.2 , pp. 767-771
- Jang, P.J.¹ Hauptmann, A.G.²

2
- 0002450218
- Lightly supervised acoustic model training
- L. Lamel, J. luc Gauvain, and G. Adda, "Lightly supervised acoustic model training," in Proc. ISCA ITRW ASR2000, 2000, pp. 150-154.
- (2000) Proc. ISCA ITRW ASR2000 , pp. 150-154
- Lamel, L.¹ Luc Gauvain, J.² Adda, G.³

3
- 0034841730
- Investigating lightly supervised acoustic model training
- Salt Lake City, USA, May
- L. Lamel, J. Gauvain, and G. Adda, "Investigating Lightly Supervised Acoustic Model Training," in Proc. of ICASSP, Salt Lake City, USA, May 2001.
- (2001) Proc. of ICASSP
- Lamel, L.¹ Gauvain, J.² Adda, G.³

4
- 4544315111
- Lightly supervised acoustic model training using consensus networks
- L. Chen, L. Lamel, and J.-L. Gauvain, "Lightly supervised acoustic model training using consensus networks," in Proc. ICASSP, 2004.
- (2004) Proc. ICASSP
- Chen, L.¹ Lamel, L.² Gauvain, J.-L.³

5
- 4544273245
- Light supervision in acoustic model training
- L. Nguyen and B. Xiang, "Light Supervision in Acoustic Model Training," in Proc. ICASSP, 2004.
- (2004) Proc. ICASSP
- Nguyen, L.¹ Xiang, B.²

6
- 4544253838
- Improving broadcast news transcription by lightly supervised discriminative training
- Montreal, Canada, May
- H. Chan and P. Woodland, "Improving Broadcast News Transcription by Lightly Supervised Discriminative Training," in Proc. ICASSP, Montreal, Canada, May 2004.
- (2004) Proc. ICASSP
- Chan, H.¹ Woodland, P.²

7
- 84867216798
- Lightly supervised acoustic model training on EPPS recordings
- Brisbane, Australia, September
- M. Paulik and A. Waibel, "Lightly Supervised Acoustic Model Training on EPPS Recordings," in Proc. Interspeech, Brisbane, Australia, September 2008.
- (2008) Proc. Interspeech
- Paulik, M.¹ Waibel, A.²

8
- 79851498679
- Automatic transcription of parliamentary meetings and classroom lectures - A sustainable approach and real system evaluations
- Tainan, Taiwan, November
- T. Kawahara, "Automatic transcription of parliamentary meetings and classroom lectures - A sustainable approach and real system evaluations," in Proc. Chinese Spoken Language Processing, Tainan, Taiwan, November 2010.
- (2010) Proc. Chinese Spoken Language Processing
- Kawahara, T.¹

9
- 0003571407
- University of Edinburgh, Scotland, Tech. Rep.
- A. Black and P. Taylor, "The Festival Speech Synthesis System," University of Edinburgh, Scotland, Tech. Rep., 1997, http://www.cstr.ed.ac.uk/ projects/festival.html.
- (1997) The Festival Speech Synthesis System
- Black, A.¹ Taylor, P.²

10
- 0003822743
- The HTK book
- S. Young, G. Everman, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The HTK Book," Cambridge University, Tech. Rep., 2006.
- (2006) Cambridge University, Tech. Rep.
- Young, S.¹ Everman, G.² Kershaw, D.³ Moore, G.⁴ Odell, J.⁵ Ollason, D.⁶ Valtchev, V.⁷ Woodland, P.⁸

11
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- DOI 10.1121/1.399423
- H. Hermansky, "Perceptual Linear Predictive (PLP) Analysis of Speech," The Journal of Acoustical Society of America, vol. 87(4), pp. 1738-1752, 1990. (Pubitemid 20256470)
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

12
- 84891308106
- SRILM - An extensible language modeling toolkit
- Denver, USA, September
- A. Stolcke, "SRILM - An extensible language modeling toolkit." in Proc. of ICSLP, Denver, USA, September 2002.
- (2002) Proc. of ICSLP
- Stolcke, A.¹

13
- 0028996876
- Improved backing-off for n-gram language modeling
- Detroit, USA, May
- R. Kneser and H. Ney, "Improved backing-off for n-gram language modeling." in Proc. of ICASSP, Detroit, USA, May 1995.
- (1995) Proc. of ICASSP
- Kneser, R.¹ Ney, H.²

14
- 0003396042
- An empirical study of smoothing techniques for language modeling
- S. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Harvard University, Tech. Rep., 1998.
- (1998) Harvard University, Tech. Rep.
- Chen, S.¹ Goodman, J.²

15
- 79959857712
- Juicer: A weighted finite state transducer speech decoder
- Lisbon, Portugal, April
- D. Moore, J. Dines, M. M. Doss, O. Vepa, O. Cheng, and T. Hain, "Juicer: A Weighted Finite State Transducer Speech Decoder." in Proc. of Interspeech, Lisbon, Portugal, April 2005.
- (2005) Proc. of Interspeech
- Moore, D.¹ Dines, J.² Doss, M.M.³ Vepa, O.⁴ Cheng, O.⁵ Hain, T.⁶

16
- 38149133882
- OpenFst: A general and efficient weighted finite-state transducer library
- Springer
- C. Allauzen, M. Riley, J. Schalkwyk, W. Skut, and M. Mohri, "OpenFst: A General and Efficient Weighted Finite-State Transducer Library," in CIAA 2007, ser. Lecture Notes in Computer Science, vol. 4783. Springer, 2007, pp. 11-23, http://www.openfst.org.
- (2007) CIAA 2007, Ser. Lecture Notes in Computer Science , vol.4783 , pp. 11-23
- Allauzen, C.¹ Riley, M.² Schalkwyk, J.³ Skut, W.⁴ Mohri, M.⁵

17
- 84858973164
- Transducersaurus - Tools for generating WFST-based ASR cascades. [Online]. Available: http://code.google.com/p/transducersaurus
- Transducersaurus - Tools for Generating WFST-based ASR Cascades

18
- 4544339437
- A generalized construction of integrated speech recognition transducers
- Montreal, Canada, May
- C. Allauzen, M. Mohri, M. Riley, and B. Roar., "A Generalized Construction of Integrated Speech Recognition Transducers." in Proc. of ICASSP, Montreal, Canada, May 2004.
- (2004) Proc. of ICASSP
- Allauzen, C.¹ Mohri, M.² Riley, M.³ Roar, B.⁴

19
- 79959851726
- An empirical comparison of the T3, juicer, HDecode and sphinx3 decoders
- Makuhari, Japan, September
- J. R. Novak, P. Dixon, and S. Furui, "An Empirical Comparison of the T3, Juicer, HDecode and Sphinx3 Decoders." in Proc. of Interspeech, Makuhari, Japan, September 2010.
- (2010) Proc. of Interspeech
- Novak, J.R.¹ Dixon, P.² Furui, S.³

20
- 70450180978
- Robust LTS rules with the combilex speech technology lexicon
- Brighton, UK, September
- K. Richmond, R. A. J. Clark, and S. Fitt, "Robust LTS rules with the Combilex speech technology lexicon," in Proc. of Interspeech, Brighton, UK, September 2009.
- (2009) Proc. of Interspeech
- Richmond, K.¹ Clark, R.A.J.² Fitt, S.³

21
- 0042879653
- A systematic comparison of various statistical alignment models
- DOI 10.1162/089120103321337421
- F. Och and H. Ney, "A Systematic Comparison of Various Statistical Alignment Models," Computational Linguistics, vol. 29(1), pp. 19-51, 2003. (Pubitemid 37049767)
- (2003) Computational Linguistics , vol.29 , Issue.1 , pp. 19-51
- Och, F.J.¹ Ney, H.²

22
- 85110867932
- Moses: Open source toolkit for statistical machine translation
- Prague, Czech Republic, June
- P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, and E. Herbst, "Moses: Open Source Toolkit for Statistical Machine Translation." in Proc. of ACL, Prague, Czech Republic, June 2007.
- (2007) Proc. of ACL
- Koehn, P.¹ Hoang, H.² Birch, A.³ Callison-Burch, C.⁴ Federico, M.⁵ Bertoldi, N.⁶ Cowan, B.⁷ Shen, W.⁸ Moran, C.⁹ Zens, R.¹⁰ Dyer, C.¹¹ Bojar, O.¹² Constantin, A.¹³ Herbst, E.¹⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.