SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 1749-1753

Medium-duration modulation cepstral feature for robust speech recognition

(4) Mitra, Vikramjit a Franco, Horacio a Graciarena, Martin a Vergyri, Dimitra a

a SRI INTERNATIONAL (United States)

Author keywords

large vocabulary continuous speech recognition; modulation features; noise robust speech recognition

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; MODULATION; SPEECH; SPEECH TRANSMISSION; VOCABULARY CONTROL;

AUTOMATIC SPEECH RECOGNITION SYSTEM; DEFENSE ADVANCE RESEARCH PROJECTS AGENCIES; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; MODULATION FEATURES; NOISE ROBUST SPEECH RECOGNITION; PERFORMANCE DEGRADATION; ROBUST SPEECH RECOGNITION; SPEECH RECOGNITION PERFORMANCE;

ACOUSTIC NOISE;

EID: 84905269267 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6853898 Document Type: Conference Paper

Times cited : (35)

References (24)

1
- 33745225159
- Auditory Teager energy cepstrum coefficients for robust speech recognition
- D. Dimitriadis, P. Maragos, and A. Potamianos, "Auditory Teager energy cepstrum coefficients for robust speech recognition", in Proc. of Interspeech, pp. 3013-3016, 2005.
- (2005) Proc. of Interspeech , pp. 3013-3016
- Dimitriadis, D.¹ Maragos, P.² Potamianos, A.³

2
- 0033097443
- Single channel speech enhancement based on masking properties of the human auditory system
- N. Virag, "Single channel speech enhancement based on masking properties of the human auditory system", IEEE Trans. Speech Audio Process., 7(2), pp. 126-137, 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.2 , pp. 126-137
- Virag, N.¹

3
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition", IEEE Trans Audio, Speech, Lang. Process., 15(7), pp. 2130-2140, 2007.
- (2007) IEEE Trans Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

4
- 84893214804
- Speech Processing, Transmission and Quality Aspects (STQ)
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Adv. Front-end Feature Extraction Algorithm; Compression Algorithms, ETSI ES 202 050 Ver. 1.1.5, 2007.
- (2007) Distributed Speech Recognition; Adv. Front-end Feature Extraction Algorithm; Compression Algorithms, ETSI ES 202 050 Ver. 1.1.5

5
- 78049398950
- Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring
- C. Kim and R. M. Stern, "Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring", in Proc. ICASSP, pp. 4574-4577, 2010.
- (2010) Proc. ICASSP , pp. 4574-4577
- Kim, C.¹ Stern, R.M.²

6
- 84867613224
- Fepstrum features: Design and application to conversational speech recognition
- V. Tyagi, "Fepstrum features: Design and application to conversational speech recognition", IBM Research Report, 11009, 2011.
- (2011) IBM Research Report , pp. 11009
- Tyagi, V.¹

7
- 84867589420
- Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
- Japan
- V. Mitra, H. Franco, M. Graciarena and A. Mandal, "Normalized amplitude modulation features for large vocabulary noise-robust speech recognition", in Proc. of ICASSP, pp. 4117-4120, Japan, 2012.
- (2012) Proc. of ICASSP , pp. 4117-4120
- Mitra, V.¹ Franco, H.² Graciarena, M.³ Mandal, A.⁴

8
- 0028287770
- Effect of reducing slow temporal modulations on speech reception
- R. Drullman, J. M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception", J. Acoust. Soc. of Am., 95(5), pp. 2670-2680, 1994.
- (1994) J. Acoust. Soc. of Am. , vol.95 , Issue.5 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

9
- 0034844903
- On the upper cutoff frequency of auditory criticalband envelope detectors in the context of speech perception
- O. Ghitza, "On the upper cutoff frequency of auditory criticalband envelope detectors in the context of speech perception", J. Acoust. Soc. of Am., 110(3), pp. 1628-1640, 2001.
- (2001) J. Acoust. Soc. of Am. , vol.110 , Issue.3 , pp. 1628-1640
- Ghitza, O.¹

10
- 33745225159
- Auditory Teager energy cepstrum coefficients for robust speech recognition
- D. Dimitriadis, P. Maragos, and A. Potamianos, "Auditory Teager energy cepstrum coefficients for robust speech recognition", in Proc of Interspeech, pp. 3013-3016, 2005.
- (2005) Proc of Interspeech , pp. 3013-3016
- Dimitriadis, D.¹ Maragos, P.² Potamianos, A.³

11
- 0035278964
- Time-frequency distributions for automatic speech recognition
- A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition", IEEE Trans. Speech & Audio Proc., 9(3), pp. 196-200, 2001.
- (2001) IEEE Trans. Speech & Audio Proc. , vol.9 , Issue.3 , pp. 196-200
- Potamianos, A.¹ Maragos, P.²

12
- 0033328948
- Teager energy based feature parameters for speech recognition in car noise
- F. Jabloun, A. E. Cetin, and E. Erzin, "Teager energy based feature parameters for speech recognition in car noise", IEEE Sig. Proc. Letters, 6(10), pp. 259-261, 1999.
- (1999) IEEE Sig. Proc. Letters , vol.6 , Issue.10 , pp. 259-261
- Jabloun, F.¹ Cetin, A.E.² Erzin, E.³

13
- 0032030556
- A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
- J.H.L. Hansen, L. Gavidia-Ceballos, and J.F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment", IEEE Trans. Biomedical Engineering, 45(3), pp. 300-313, 1998.
- (1998) IEEE Trans. Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
- Hansen, J.H.L.¹ Gavidia-Ceballos, L.² Kaiser, J.F.³

14
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B.R. Glasberg and B.C.J. Moore, "Derivation of auditory filter shapes from notched-noise data", Hearing Research, 47, pp.103-138, 1990.
- (1990) Hearing Research , vol.47 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

15
- 0019075685
- Some observations on oral air flow during phonation
- H. Teager, "Some observations on oral air flow during phonation", IEEE Trans. ASSP, pp. 599-601, 1980.
- (1980) IEEE Trans. ASSP , pp. 599-601
- Teager, H.¹

16
- 0027210171
- Some useful properties of the Teager's energy operator
- J.F. Kaiser, "Some useful properties of the Teager's energy operator", in Proc of IEEE, Iss. III, pp. 149-152, 1993.
- (1993) Proc of IEEE, Iss. , vol.3 , pp. 149-152
- Kaiser, J.F.¹

17
- 0027676955
- Energy separation in signal modulations with application to speech analysis
- P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis", IEEE Trans. Signal Processing, 41, pp. 3024-3051, 1993.
- (1993) IEEE Trans. Signal Processing , vol.41 , pp. 3024-3051
- Maragos, P.¹ Kaiser, J.² Quatieri, T.³

18
- 0032030556
- A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
- J.H.L. Hansen, L. Gavidia-Ceballos, and J.F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment", IEEE Trans. Biomedical Engineering, 45(3), pp. 300-313, 1998.
- (1998) IEEE Trans. Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
- Hansen, J.H.L.¹ Gavidia-Ceballos, L.² Kaiser, J.F.³

19
- 33646677283
- Experimental framework for the performance evaluation of speech recognition front-ends on a large vocabulary task
- June 4
- G. Hirsch, "Experimental framework for the performance evaluation of speech recognition front-ends on a large vocabulary task", ETSI STQ-Aurora DSR Working Group, June 4, 2001.
- (2001) ETSI STQ-Aurora DSR Working Group
- Hirsch, G.¹

20
- 84873310339
- The RATS radio traffic collection system
- Odyssey
- K.Walker and S. Strassel, "The RATS radio traffic collection system," in Proc. of ISCA, Odyssey, 2012.
- (2012) Proc. of ISCA
- Walker, K.¹ Strassel, S.²

21
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B.R. Glasberg and B.C.J. Moore, "Derivation of auditory filter shapes from notched-noise data", Hearing Research, 47, pp.103-138, 1990.
- (1990) Hearing Research , vol.47 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

22
- 33947688089
- Development of a conversational telephone speech recognizer for Levantine Arabic
- D. Vergyri, K. Kirchhoff, R. Gadde, A. Stolcke, and J. Zheng, "Development of a conversational telephone speech recognizer for Levantine Arabic," in Proc. Interspeech, 2005.
- (2005) Proc. Interspeech
- Vergyri, D.¹ Kirchhoff, K.² Gadde, R.³ Stolcke, A.⁴ Zheng, J.⁵

23
- 84905282720
- Exploring Hilbert envelope based acoustic features in i-vector speaker verification using HT-PLDA
- Atlanta, GA, USA
- J.-W. Suh, S. O. Sadjadi, G. Liu, T. Hasan, K. W. Godin and J. H. L. Hansen, "Exploring Hilbert envelope based acoustic features in i-vector speaker verification using HT-PLDA", Proc. of NIST 2011 Speaker Recognition Evaluation Workshop, Atlanta, GA, USA, 2011.
- (2011) Proc. of NIST 2011 Speaker Recognition Evaluation Workshop
- Suh, J.-W.¹ Sadjadi, S.O.² Liu, G.³ Hasan, T.⁴ Godin, K.W.⁵ Hansen, J.H.L.⁶

24
- 84906246749
- Modulation features for noise robust speaker identification
- Lyon
- V. Mitra, M. McLaren, H. Franco, M. Graciarena, N. Scheffer, "Modulation Features for Noise Robust Speaker Identification," Proc. of Interspeech, pp. 3703-3707, Lyon, 2013.
- (2013) Proc. of Interspeech , pp. 3703-3707
- Mitra, V.¹ McLaren, M.² Franco, H.³ Graciarena, M.⁴ Scheffer, N.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.