메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1257-1260

Front-end compensation methods for LVCSR under Lombard effect

Author keywords

Bottleneck features; Histogram equalization; Lombard effect; Quantile based cepstral distribution normalization; Speech recognition; UT Scope database

Indexed keywords

BACKGROUND NOISE; BACKGROUND VARIATION; BAND PASS FILTERING; BOTTLENECK FEATURES; CEPSTRAL; CEPSTRAL MEAN SUBTRACTION; COMPENSATION METHOD; CRITICAL BANDS; FEATURE DISTRIBUTION; HISTOGRAM EQUALIZATIONS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; LOMBARD EFFECT; TRAINING FEATURES; TRANSIENT EFFECT;

EID: 84865772156     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (26)
  • 1
    • 0027465491 scopus 로고
    • The Lombard reflex and its role on human listeners and automatic speech recognizers
    • J.-C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," JASA, vol. 93, no. 1, pp. 510-524, 1993.
    • (1993) JASA , vol.93 , Issue.1 , pp. 510-524
    • Junqua, J.-C.1
  • 2
    • 0030283741 scopus 로고    scopus 로고
    • Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    • J. H. L. Hansen, "Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition," Speech Comm., vol. 20, no. 1-2, pp. 151-173, 1996.
    • (1996) Speech Comm. , vol.20 , Issue.1-2 , pp. 151-173
    • Hansen, J.H.L.1
  • 4
    • 56749169816 scopus 로고    scopus 로고
    • Speech production modifications produced by competing talkers, babble and stationary noise
    • Y. Lu and M. Cooke, "Speech production modifications produced by competing talkers, babble and stationary noise," JASA, vol. 124, no. 5, pp. 3261-3275, 2008.
    • (2008) JASA , vol.124 , Issue.5 , pp. 3261-3275
    • Lu, Y.1    Cooke, M.2
  • 6
    • 77955734646 scopus 로고    scopus 로고
    • Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments
    • August
    • H. Bořil and J. H. L. Hansen, "Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments," IEEE Trans. on ASLP, vol. 18, no. 6, pp. 1379-1393, August 2010.
    • (2010) IEEE Trans. on ASLP , vol.18 , Issue.6 , pp. 1379-1393
    • Bořil, H.1    Hansen, J.H.L.2
  • 7
    • 0034229795 scopus 로고    scopus 로고
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • S. E. Bou-Ghazale and J. H. L. Hansen, "A comparative study of traditional and newly proposed features for recognition of speech under stress," IEEE Trans. on SAP, vol. 8, no. 4, pp. 429-442, 2000.
    • (2000) IEEE Trans. on SAP , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 8
    • 44949251671 scopus 로고    scopus 로고
    • Data-driven design of front-end filter bank for Lombard speech recognition
    • Pittsburgh, Pennsylvania
    • H. Bořil, P. Fousek, and P. Pollák, "Data-driven design of front-end filter bank for Lombard speech recognition," in Proc. ICSLP'06, Pittsburgh, Pennsylvania, 2006, pp. 381-384.
    • (2006) Proc. ICSLP'06 , pp. 381-384
    • Bořil, H.1    Fousek, P.2    Pollák, P.3
  • 9
    • 80051656187 scopus 로고    scopus 로고
    • UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background
    • Prague, Czech
    • H. Bořil and J. H. L. Hansen, "UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background," in Proc. IEEE ICASSP'11, Prague, Czech, 2011, pp. 4472-4475.
    • (2011) Proc. IEEE ICASSP'11 , pp. 4472-4475
    • Bořil, H.1    Hansen, J.H.L.2
  • 10
    • 0028517164 scopus 로고
    • RASTA processing of speech
    • Oct.
    • H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Transactions on SAP, vol. 2, no. 4, pp. 578 -589, Oct. 1994.
    • (1994) IEEE Transactions on SAP , vol.2 , Issue.4 , pp. 578-589
    • Hermansky, H.1    Morgan, N.2
  • 11
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. ASSP, vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Trans. ASSP , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 12
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," JASA, vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) JASA , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 13
    • 37649022051 scopus 로고    scopus 로고
    • A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
    • U. H. Yapanel and J. H. L. Hansen, "A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition," Speech Commun., vol. 50, no. 2, pp. 142-152, 2008.
    • (2008) Speech Commun. , vol.50 , Issue.2 , pp. 142-152
    • Yapanel, U.H.1    Hansen, J.H.L.2
  • 14
    • 34547548235 scopus 로고    scopus 로고
    • Probabilistic and bottle-neck features for LVCSR of meetings
    • Apr
    • F. Grézl et al., "Probabilistic and bottle-neck features for LVCSR of meetings," in Proc. IEEE ICASSP'07, Apr 2007, pp. 757-760.
    • (2007) Proc. IEEE ICASSP'07 , pp. 757-760
    • Grézl, F.1
  • 15
    • 4544282392 scopus 로고    scopus 로고
    • Cepstral gain normalization for noise robust speech recognition
    • May
    • S. Yoshizawa, N. Hayasaka, N. Wada, and Y. Miyanaga, "Cepstral gain normalization for noise robust speech recognition," in Proc. IEEE ICASSP'04, vol. 1, May 2004, pp. 209-212.
    • (2004) Proc. IEEE ICASSP'04 , vol.1 , pp. 209-212
    • Yoshizawa, S.1    Hayasaka, N.2    Wada, N.3    Miyanaga, Y.4
  • 16
    • 85073258179 scopus 로고    scopus 로고
    • Feature warping for robust speaker verification
    • Crete, Greece
    • J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in In ODYSSEY-2001, Crete, Greece, 2001, pp. 213-218.
    • (2001) ODYSSEY-2001 , pp. 213-218
    • Pelecanos, J.1    Sridharan, S.2
  • 17
    • 85009074785 scopus 로고    scopus 로고
    • A nonlinear unsupervised adaptation technique for speech recognition
    • S. Dharanipragada and M. Padmanabha, "A nonlinear unsupervised adaptation technique for speech recognition," in ICSLP, 2000, pp. 556-559.
    • (2000) ICSLP , pp. 556-559
    • Dharanipragada, S.1    Padmanabha, M.2
  • 18
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermansky et al., "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE ICASSP'00, 2000.
    • (2000) Proc. IEEE ICASSP'00
    • Hermansky, H.1
  • 19
    • 85009110188 scopus 로고    scopus 로고
    • Learning long-term temporal features in LVCSR using neural networks
    • B. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks," in Proc. ICSLP'04, 2004.
    • (2004) Proc. ICSLP'04
    • Chen, B.1    Zhu, Q.2    Morgan, N.3
  • 20
    • 70249086510 scopus 로고    scopus 로고
    • Robust ASR front-end using spectral-based and discriminant features: Experiments on the AURORA task
    • Aalborg, Denmark, Sept.
    • C. Benitz et al., "Robust ASR front-end using spectral-based and discriminant features: experiments on the AURORA task," in Proc. Eurospeech' 01, Aalborg, Denmark, Sept. 2001.
    • (2001) Proc. Eurospeech' 01
    • Benitz, C.1
  • 21
    • 38049107590 scopus 로고    scopus 로고
    • Trap-based techniques for recognition of noisy speech
    • F. Grézl and J. Černocký, "Trap-based techniques for recognition of noisy speech," Lecture Notes in Comp. Science, vol. 2007, no. 9, pp. 270-277.
    • Lecture Notes in Comp. Science , vol.2007 , Issue.9 , pp. 270-277
    • Grézl, F.1    Černocký, J.2
  • 22
    • 51449103447 scopus 로고    scopus 로고
    • Optimizing bottle-neck features for LVCSR
    • Las Vegas, NV, April
    • F. Grézl and P. Fousek, "Optimizing bottle-neck features for LVCSR," in Proc. IEEE ICASSP'08, Las Vegas, NV, April 2008, pp. 4729-4732.
    • (2008) Proc. IEEE ICASSP'08 , pp. 4729-4732
    • Grézl, F.1    Fousek, P.2
  • 23
    • 84892187452 scopus 로고    scopus 로고
    • Maximum likelihood modeling with gaussian distributions for classification
    • R. A. Gopinath, "Maximum likelihood modeling with gaussian distributions for classification," in Proc. IEEE ICASSP'98, 1998, pp. 661-664.
    • (1998) Proc. IEEE ICASSP'98 , pp. 661-664
    • Gopinath, R.A.1
  • 24
    • 70350454918 scopus 로고    scopus 로고
    • Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition
    • Feb.
    • J. H. L. Hansen and V. Varadarajan, "Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition," IEEE Trans. ASLP, vol. 17, no. 2, pp. 366 -378, Feb. 2009.
    • (2009) IEEE Trans. ASLP , vol.17 , Issue.2 , pp. 366-378
    • Hansen, J.H.L.1    Varadarajan, V.2
  • 25
    • 0025477640 scopus 로고
    • Speech database development at MIT: TIMIT and beyond
    • V. Zue, S. Seneff, and J. Glass, "Speech database development at MIT: TIMIT and beyond," Speech Comm., vol. 9, no. 4, pp. 351 - 356, 1990.
    • (1990) Speech Comm. , vol.9 , Issue.4 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3
  • 26
    • 33745195684 scopus 로고    scopus 로고
    • Confronting HMM-based phone labelling with human evaluation of speech production
    • Lisbon, Portugal, Sept.
    • J. Volín, R. Skarnitzl, and P. Pollák, "Confronting HMM-based phone labelling with human evaluation of speech production," in Proc. of INTERSPEECH' 05, Lisbon, Portugal, Sept. 2005, pp. 1541-1544.
    • (2005) Proc. of INTERSPEECH' 05 , pp. 1541-1544
    • Volín, J.1    Skarnitzl, R.2    Pollák, P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.