SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2011, Pages 1257-1260

Front-end compensation methods for LVCSR under Lombard effect

(3) Bořil, Hynek (a); Grézl, František (b); Hansen, John H. L. (a)

(a) The University of Texas at Dallas (United States)

(b) Brno University of Technology (Czech Republic)

Author keywords

Bottleneck features; Histogram equalization; Lombard effect; Quantile based cepstral distribution normalization; Speech recognition; UT Scope database

Indexed keywords

BACKGROUND NOISE; BACKGROUND VARIATION; BAND PASS FILTERING; BOTTLENECK FEATURES; CEPSTRAL; CEPSTRAL MEAN SUBTRACTION; COMPENSATION METHOD; CRITICAL BANDS; FEATURE DISTRIBUTION; HISTOGRAM EQUALIZATIONS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; LOMBARD EFFECT; TRAINING FEATURES; TRANSIENT EFFECT;

CONTINUOUS SPEECH RECOGNITION; FEATURE EXTRACTION; GRAPHIC METHODS; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION;

NORMAL DISTRIBUTION;

EID: 84865772156
PISSN: None
EISSN: 19909772
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: (5)

References (26)

1
- 0027465491
- The Lombard reflex and its role on human listeners and automatic speech recognizers
- J.-C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," JASA, vol. 93, no. 1, pp. 510-524, 1993.
- (1993) JASA , vol.93 , Issue.1 , pp. 510-524
- Junqua, J.-C.¹

2
- 0030283741
- Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
- J. H. L. Hansen, "Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition," Speech Comm., vol. 20, no. 1-2, pp. 151-173, 1996.
- (1996) Speech Comm. , vol.20 , Issue.1-2 , pp. 151-173
- Hansen, J.H.L.¹

3
- 70349487580
- Ph.D. dissertation, CTU in Prague, Czech Rep.
- H. Bořil, "Robust speech recognition: Analysis and equalization of Lombard effect in Czech corpora," Ph.D. dissertation, CTU in Prague, Czech Rep., http://www.utdallas.edu/~hynek, 2008.
- (2008) Robust Speech Recognition: Analysis and Equalization of Lombard Effect in Czech Corpora
- Bořil, H.¹

4
- 56749169816
- Speech production modifications produced by competing talkers, babble and stationary noise
- Y. Lu and M. Cooke, "Speech production modifications produced by competing talkers, babble and stationary noise," JASA, vol. 124, no. 5, pp. 3261-3275, 2008.
- (2008) JASA , vol.124 , Issue.5 , pp. 3261-3275
- Lu, Y.¹ Cooke, M.²

5
- 80051706588
- Ph.D. dissertation, Univ. of Paris VI, France
- M. Garnier, "Communication in noisy environments: From adaptation to vocal straining," Ph.D. dissertation, Univ. of Paris VI, France, 2007.
- (2007) Communication in Noisy Environments: From Adaptation to Vocal Straining
- Garnier, M.¹

6
- 77955734646
- Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments
- August
- H. Bořil and J. H. L. Hansen, "Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments," IEEE Trans. on ASLP, vol. 18, no. 6, pp. 1379-1393, August 2010.
- (2010) IEEE Trans. on ASLP , vol.18 , Issue.6 , pp. 1379-1393
- Bořil, H.¹ Hansen, J.H.L.²

7
- 0034229795
- A comparative study of traditional and newly proposed features for recognition of speech under stress
- S. E. Bou-Ghazale and J. H. L. Hansen, "A comparative study of traditional and newly proposed features for recognition of speech under stress," IEEE Trans. on SAP, vol. 8, no. 4, pp. 429-442, 2000.
- (2000) IEEE Trans. on SAP , vol.8 , Issue.4 , pp. 429-442
- Bou-Ghazale, S.E.¹ Hansen, J.H.L.²

8
- 44949251671
- Data-driven design of front-end filter bank for Lombard speech recognition
- Pittsburgh, Pennsylvania
- H. Bořil, P. Fousek, and P. Pollák, "Data-driven design of front-end filter bank for Lombard speech recognition," in Proc. ICSLP'06, Pittsburgh, Pennsylvania, 2006, pp. 381-384.
- (2006) Proc. ICSLP'06 , pp. 381-384
- Bořil, H.¹ Fousek, P.² Pollák, P.³

9
- 80051656187
- UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background
- Prague, Czech Republic
- H. Bořil and J. H. L. Hansen, "UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background," in Proc. IEEE ICASSP'11, Prague, Czech Republic, 2011, pp. 4472-4475.
- (2011) Proc. IEEE ICASSP'11 , pp. 4472-4475
- Bořil, H.¹ Hansen, J.H.L.²

10
- 0028517164
- RASTA processing of speech
- Oct.
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Transactions on SAP, vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Transactions on SAP , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

11
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. ASSP, vol. 28, no. 4, pp. 357-366, 1980.
- (1980) IEEE Trans. ASSP , vol.28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

12
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," JASA, vol. 87, no. 4, pp. 1738-1752, 1990.
- (1990) JASA , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

13
- 37649022051
- A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
- U. H. Yapanel and J. H. L. Hansen, "A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition," Speech Commun., vol. 50, no. 2, pp. 142-152, 2008.
- (2008) Speech Commun. , vol.50 , Issue.2 , pp. 142-152
- Yapanel, U.H.¹ Hansen, J.H.L.²

14
- 34547548235
- Probabilistic and bottle-neck features for LVCSR of meetings
- Apr
- F. Grézl et al., "Probabilistic and bottle-neck features for LVCSR of meetings," in Proc. IEEE ICASSP'07, Apr 2007, pp. 757-760.
- (2007) Proc. IEEE ICASSP'07 , pp. 757-760
- Grézl, F.¹

15
- 4544282392
- Cepstral gain normalization for noise robust speech recognition
- May
- S. Yoshizawa, N. Hayasaka, N. Wada, and Y. Miyanaga, "Cepstral gain normalization for noise robust speech recognition," in Proc. IEEE ICASSP'04, vol. 1, May 2004, pp. 209-212.
- (2004) Proc. IEEE ICASSP'04 , vol.1 , pp. 209-212
- Yoshizawa, S.¹ Hayasaka, N.² Wada, N.³ Miyanaga, Y.⁴

16
- 85073258179
- Feature warping for robust speaker verification
- Crete, Greece
- J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in ODYSSEY-2001, Crete, Greece, 2001, pp. 213-218.
- (2001) ODYSSEY-2001 , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

17
- 85009074785
- A nonlinear unsupervised adaptation technique for speech recognition
- S. Dharanipragada and M. Padmanabhan, "A nonlinear unsupervised adaptation technique for speech recognition," in ICSLP, 2000, pp. 556-559.
- (2000) ICSLP , pp. 556-559
- Dharanipragada, S.¹ Padmanabhan, M.²

18
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- H. Hermansky et al., "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE ICASSP'00, 2000.
- (2000) Proc. IEEE ICASSP'00
- Hermansky, H.¹

19
- 85009110188
- Learning long-term temporal features in LVCSR using neural networks
- B. Chen, Q. Zhu, and N. Morgan, "Learning long-term temporal features in LVCSR using neural networks," in Proc. ICSLP'04, 2004.
- (2004) Proc. ICSLP'04
- Chen, B.¹ Zhu, Q.² Morgan, N.³

20
- 70249086510
- Robust ASR front-end using spectral-based and discriminant features: Experiments on the AURORA task
- Aalborg, Denmark, Sept.
- C. Benítez et al., "Robust ASR front-end using spectral-based and discriminant features: experiments on the AURORA task," in Proc. Eurospeech'01, Aalborg, Denmark, Sept. 2001.
- (2001) Proc. Eurospeech'01
- Benítez, C.¹

21
- 38049107590
- Trap-based techniques for recognition of noisy speech
- F. Grézl and J. Černocký, "Trap-based techniques for recognition of noisy speech," Lecture Notes in Comp. Science, vol. 2007, no. 9, pp. 270-277.
- Lecture Notes in Comp. Science , vol.2007 , Issue.9 , pp. 270-277
- Grézl, F.¹ Černocký, J.²

22
- 51449103447
- Optimizing bottle-neck features for LVCSR
- Las Vegas, NV, April
- F. Grézl and P. Fousek, "Optimizing bottle-neck features for LVCSR," in Proc. IEEE ICASSP'08, Las Vegas, NV, April 2008, pp. 4729-4732.
- (2008) Proc. IEEE ICASSP'08 , pp. 4729-4732
- Grézl, F.¹ Fousek, P.²

23
- 84892187452
- Maximum likelihood modeling with Gaussian distributions for classification
- R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. IEEE ICASSP'98, 1998, pp. 661-664.
- (1998) Proc. IEEE ICASSP'98 , pp. 661-664
- Gopinath, R.A.¹

24
- 70350454918
- Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition
- Feb.
- J. H. L. Hansen and V. Varadarajan, "Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition," IEEE Trans. ASLP, vol. 17, no. 2, pp. 366-378, Feb. 2009.
- (2009) IEEE Trans. ASLP , vol.17 , Issue.2 , pp. 366-378
- Hansen, J.H.L.¹ Varadarajan, V.²

25
- 0025477640
- Speech database development at MIT: TIMIT and beyond
- V. Zue, S. Seneff, and J. Glass, "Speech database development at MIT: TIMIT and beyond," Speech Comm., vol. 9, no. 4, pp. 351-356, 1990.
- (1990) Speech Comm. , vol.9 , Issue.4 , pp. 351-356
- Zue, V.¹ Seneff, S.² Glass, J.³

26
- 33745195684
- Confronting HMM-based phone labelling with human evaluation of speech production
- Lisbon, Portugal, Sept.
- J. Volín, R. Skarnitzl, and P. Pollák, "Confronting HMM-based phone labelling with human evaluation of speech production," in Proc. of INTERSPEECH'05, Lisbon, Portugal, Sept. 2005, pp. 1541-1544.
- (2005) Proc. of INTERSPEECH'05 , pp. 1541-1544
- Volín, J.¹ Skarnitzl, R.² Pollák, P.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.