메뉴 건너뛰기




Volumn 25, Issue 3, 2011, Pages 519-534

The efficient incorporation of MLP features into automatic speech recognition systems

Author keywords

Acoustic modelling; Automatic speech recognition; MLP feature; Speaker adaptation

Indexed keywords

ACOUSTIC ENVIRONMENT; ACOUSTIC FEATURES; ACOUSTIC MODEL; ACOUSTIC MODELLING; ADAPTATION SCHEME; AUTOMATIC SPEECH RECOGNITION; AUTOMATIC SPEECH RECOGNITION SYSTEM; BROADCAST CONVERSATION; BROADCAST NEWS; CONSISTENT PERFORMANCE; DESIGN DECISIONS; DISCRIMINATIVE TRAINING; LARGE AMOUNTS OF DATA; LARGE VOCABULARY SPEECH RECOGNITION; MLP FEATURE; MULTI LAYER PERCEPTRON; MULTI-PASS; NETWORK ADAPTATION; PERFORMANCE GAIN; SPEAKER ADAPTATION; SPEECH RECOGNITION SYSTEMS; SPEED-UPS; SUB-SYSTEMS; TEST DATA; TRAINING CORPUS;

EID: 79251574977     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2010.07.005     Document Type: Article
Times cited : (24)

References (43)
  • 4
    • 0030351194 scopus 로고    scopus 로고
    • Boosting the performance of connectionist large vocabulary speech recognition
    • G. Cook, and A. Robinson Boosting the performance of connectionist large vocabulary speech recognition Proc. ICSLP 1996
    • (1996) Proc. ICSLP
    • Cook, G.1    Robinson, A.2
  • 5
    • 79952617990 scopus 로고    scopus 로고
    • David, J.; 2004. ICSI QuickNet Software Package. http://www.icsi. berkeley.edu/Speech/qn.html.
    • (2004)
    • David, J.1
  • 8
    • 0033676943 scopus 로고    scopus 로고
    • Large vocabulary decoding and confidence estimation using word posterior probabilities
    • G. Evermann, and P.C. Woodland Large vocabulary decoding and confidence estimation using word posterior probabilities Proc. ICASSP 2000 2366 2369
    • (2000) Proc. ICASSP , pp. 2366-2369
    • Evermann, G.1    Woodland, P.C.2
  • 11
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • J.G. Fiscus A post-processing system to yield reduced word error rates: recognizer output voting error reduction (ROVER) Proc. IEEE Workshop: Automatic Speech Recognition and Understanding 1997 347 354
    • (1997) Proc. IEEE Workshop: Automatic Speech Recognition and Understanding , pp. 347-354
    • Fiscus, J.G.1
  • 12
    • 85065669950 scopus 로고    scopus 로고
    • Nonlinear discriminant analysis for improved speech recognition
    • V. Fontaine, C. Ris, and J.M. Boite Nonlinear discriminant analysis for improved speech recognition Proc. EUROSPEECH 1997
    • (1997) Proc. EUROSPEECH
    • Fontaine, V.1    Ris, C.2    Boite, J.M.3
  • 13
    • 53049104569 scopus 로고    scopus 로고
    • On the use of MLP features for broadcast news transcription
    • Springer Verlag
    • Fousek, P.; Lamel, L.; Gauvain, J.-L.; 2008a. On the use of MLP features for broadcast news transcription. In: Lecture Notes in Computer Science. Springer Verlag, pp. 303-310.
    • (2008) Lecture Notes in Computer Science , pp. 303-310
    • Fousek, P.1    Lamel, L.2    Gauvain, J.-L.3
  • 19
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • DOI 10.1006/csla.1996.0013
    • M.J.F. Gales, and P.C. Woodland Mean and variance adaptation within the MLLR framework Computer Speech & Language 10 1996 249 264 (Pubitemid 126374488)
    • (1996) Computer Speech and Language , vol.10 , Issue.4 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 21
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales Maximum likelihood linear transformations for HMM based speech recognition Computer Speech & Language 12 1998 75 98 (Pubitemid 128383747)
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 23
    • 0031640333 scopus 로고    scopus 로고
    • Linear Input Network based speaker adaptation in the dialogos system
    • R. Gemello, F. Mana, and D. Albesano Linear Input Network based speaker adaptation in the dialogos system Proc. of IJCNN 1998 2190 2195
    • (1998) Proc. of IJCNN , pp. 2190-2195
    • Gemello, R.1    Mana, F.2    Albesano, D.3
  • 26
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermansky, D.P.W. Ellis, and S. Sharma Tandem connectionist feature extraction for conventional HMM systems Proc. of ICASSP 2000
    • (2000) Proc. of ICASSP
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 27
    • 27144439262 scopus 로고    scopus 로고
    • Data-derived nonlinear mapping for feature extraction in HMM
    • H. Hermansky, S. Sharma, and P. Jain Data-derived nonlinear mapping for feature extraction in HMM Proc. ASRU 1999
    • (1999) Proc. ASRU
    • Hermansky, H.1    Sharma, S.2    Jain, P.3
  • 28
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky Perceptual linear prediction (PLP) analysis for speech The Journal of the Acoustical Society of America 87 April 1990 1738 1752 (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 29
    • 77249135915 scopus 로고    scopus 로고
    • Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition
    • Springer Verlag
    • Karafiát, M.; Grézl, F.; Schwarz, P.; Burget, L.; Černocký, J.; 2006. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. In: Lecture Notes in Computer Science. Springer Verlag, pp. 275-284.
    • (2006) Lecture Notes in Computer Science , pp. 275-284
    • Karafiát, M.1    Grézl, F.2    Schwarz, P.3    Burget, L.4    Černocký, J.5
  • 31
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Computer Speech & Language 9 1995 171 185
    • (1995) Computer Speech & Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 33
    • 84923190448 scopus 로고    scopus 로고
    • Combination of acoustic models in continuous speech recognition hybrid systems
    • H. Meinedo, and J.P. Neto Combination of acoustic models in continuous speech recognition hybrid systems Proc. ICSLP 2000 931 934
    • (2000) Proc. ICSLP , pp. 931-934
    • Meinedo, H.1    Neto, J.P.2
  • 34
    • 0025680226 scopus 로고
    • Tools for the analysis of benchmark speech recognition tests
    • D.S. Pallet, W.M. Fisher, and J.G. Fiscus Tools for the analysis of benchmark speech recognition tests Proc. ICASSP 1990 97 100
    • (1990) Proc. ICASSP , pp. 97-100
    • Pallet, D.S.1    Fisher, W.M.2    Fiscus, J.G.3
  • 36
    • 0141480019 scopus 로고    scopus 로고
    • Discriminative MAP for acoustic model adaptation
    • D. Povey, P.C. Woodland, and M.J.F. Gales Discriminative MAP for acoustic model adaptation Proc. ICASSP 2003 312 315
    • (2003) Proc. ICASSP , pp. 312-315
    • Povey, D.1    Woodland, P.C.2    Gales, M.J.F.3
  • 37
    • 0036296863 scopus 로고    scopus 로고
    • Minimum Phone Error and I-smoothing for improved discriminative training
    • D. Povey, and P.C. Woodland Minimum Phone Error and I-smoothing for improved discriminative training Proc. ICASSP 2002
    • (2002) Proc. ICASSP
    • Povey, D.1    Woodland, P.C.2
  • 40
    • 0036461035 scopus 로고    scopus 로고
    • Large scale discriminative training of hidden Markov models for speech recognition
    • P.C. Woodland, and D. Povey Large scale discriminative training of hidden Markov models for speech recognition Computer Speech & Language 16 2002 25 47
    • (2002) Computer Speech & Language , vol.16 , pp. 25-47
    • Woodland, P.C.1    Povey, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.