SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2012, Pages 4117-4120

Normalized amplitude modulation features for large vocabulary noise-robust speech recognition

(4) Mitra, Vikramjit ᵃ; Franco, Horacio ᵃ; Graciarena, Martin ᵃ; Mandal, Arindam ᵃ

ᵃ SRI INTERNATIONAL (United States)

Author keywords

Large Vocabulary Speech Recognition; Modulation Features; Noise Robust Speech Recognition

Indexed keywords

AUTOMATED SYSTEMS; AUTOMATIC SPEECH RECOGNITION SYSTEM; BACKGROUND NOISE; CEPSTRAL COEFFICIENTS; CHANNEL DEGRADATIONS; DIGIT RECOGNITION; ENERGY OPERATORS; FEATURE SETS; HUMAN AUDITORY SYSTEM; HUMAN SPEECH; LARGE VOCABULARY; LARGE VOCABULARY SPEECH RECOGNITION; NOISE ROBUST SPEECH RECOGNITION; NOISE ROBUSTNESS; SPEECH RECOGNITION SYSTEMS; WALL STREET JOURNAL;

ACOUSTIC NOISE; AMPLITUDE MODULATION; AUDITION; AUTOMATION; COSINE TRANSFORMS; EXPERIMENTS; SIGNAL PROCESSING; VOCABULARY CONTROL;

SPEECH RECOGNITION;

EID: 84867589420; PISSN: 15206149; EISSN: None; Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6288824; Document Type: Conference Paper

Times cited : (92)

References (19)

1
- 0033097443
- Single channel speech enhancement based on masking properties of the human auditory system
- N. Virag, "Single channel speech enhancement based on masking properties of the human auditory system", IEEE Trans. Speech Audio Process. vol.7, no.2, pp. 126-137, 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.2 , pp. 126-137
- Virag, N.¹

2
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition", IEEE Trans Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, 2007.
- (2007) IEEE Trans Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

3
- 0442317754
- ETSI ES 202 050 Ver. 1.1.5
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Adv. Front-end Feature Extraction Algorithm; Compression Algorithms, ETSI ES 202 050 Ver. 1.1.5, 2007.
- (2007) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Adv. Front-end Feature Extraction Algorithm; Compression Algorithms

4
- 78049398950
- Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring
- C. Kim and R. M. Stern, "Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring", in Proc. ICASSP, pp. 4574-4577, 2010.
- (2010) Proc. ICASSP , pp. 4574-4577
- Kim, C.¹ Stern, R.M.²

5
- 84867613224
- Fepstrum features: Design and application to conversational speech recognition
- V. Tyagi, "Fepstrum features: Design and application to conversational speech recognition", IBM Research Report, 11009, 2011.
- (2011) IBM Research Report
- Tyagi, V.¹

6
- 37649022051
- A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
- U. H. Yapanel and J. H. L. Hansen, "A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition", Speech Comm., vol.50, iss. 2, pp. 142-152, 2008.
- (2008) Speech Comm. , vol.50 , Issue.2 , pp. 142-152
- Yapanel, U.H.¹ Hansen, J.H.L.²

7
- 33745225159
- Auditory Teager energy cepstrum coefficients for robust speech recognition
- D. Dimitriadis, P. Maragos, and A. Potamianos, "Auditory Teager energy cepstrum coefficients for robust speech recognition", in Proc of Interspeech, pp. 3013-3016, 2005.
- (2005) Proc of Interspeech , pp. 3013-3016
- Dimitriadis, D.¹ Maragos, P.² Potamianos, A.³

8
- 0028287770
- Effect of reducing slow temporal modulations on speech reception
- R. Drullman, J. M. Festen, and R. Plomp, "Effect of reducing slow temporal modulations on speech reception", J. Acoust. Soc. of Am., vol. 95, no. 5, pp. 2670-2680, 1994.
- (1994) J. Acoust. Soc. of Am. , vol.95 , Issue.5 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

9
- 0034844903
- On the upper cutoff frequency of auditory critical-band envelope detectors in the context of speech perception
- O. Ghitza, "On the upper cutoff frequency of auditory critical-band envelope detectors in the context of speech perception", J. Acoust. Soc. of America, vol. 110, no. 3, pp. 1628-1640, 2001.
- (2001) J. Acoust. Soc. of America , vol.110 , Issue.3 , pp. 1628-1640
- Ghitza, O.¹

10
- 0035278964
- Time-frequency distributions for automatic speech recognition
- A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition", IEEE Trans. Speech & Audio Proc., vol. 9, no. 3, pp. 196-200, 2001.
- (2001) IEEE Trans. Speech & Audio Proc. , vol.9 , Issue.3 , pp. 196-200
- Potamianos, A.¹ Maragos, P.²

11
- 0027676955
- Energy separation in signal modulations with application to speech analysis
- P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis", IEEE Trans. Signal Processing, vol.41, pp. 3024-3051, 1993.
- (1993) IEEE Trans. Signal Processing , vol.41 , pp. 3024-3051
- Maragos, P.¹ Kaiser, J.² Quatieri, T.³

12
- 0033328948
- Teager energy based feature parameters for speech recognition in car noise
- F. Jabloun, A. E. Cetin, and E. Erzin, "Teager energy based feature parameters for speech recognition in car noise", IEEE Sig. Proc. Letters, vol. 6, no. 10, pp. 259-261, 1999.
- (1999) IEEE Sig. Proc. Letters , vol.6 , Issue.10 , pp. 259-261
- Jabloun, F.¹ Cetin, A.E.² Erzin, E.³

13
- 0025110885
- Derivation of auditory filter shapes from notched-noise data
- B.R. Glasberg and B.C.J. Moore, "Derivation of auditory filter shapes from notched-noise data", Hearing Research, vol. 47, pp.103-138, 1990.
- (1990) Hearing Research , vol.47 , pp. 103-138
- Glasberg, B.R.¹ Moore, B.C.J.²

14
- 84867613230
- http://labrosa.ee.columbia.edu/projects/renoiser/create-wsj.html

15
- 0019075685
- Some observations on oral air flow during phonation
- H. Teager, "Some observations on oral air flow during phonation", IEEE Trans. ASSP, pp. 599-601, 1980.
- (1980) IEEE Trans. ASSP , pp. 599-601
- Teager, H.¹

16
- 0032030556
- A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
- J.H.L. Hansen, L. Gavidia-Ceballos, and J.F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment", IEEE Trans. Biomedical Engineering, vol. 45, no. 3, pp. 300-313, 1998.
- (1998) IEEE Trans. Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
- Hansen, J.H.L.¹ Gavidia-Ceballos, L.² Kaiser, J.F.³

17
- 84987702417
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- D. Pearce and H.G. Hirsch, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", in Proc. ICSLP, Beijing, China, 2000.
- Proc. ICSLP, Beijing, China, 2000
- Pearce, D.¹ Hirsch, H.G.²

18
- 70349217249
- Recent advances in SRI's IraqComm(tm) Iraqi Arabic-English speech-to-speech translation system
- M. Akbacak, H. Franco, M. Frandsen, S. Hasan, H. Jameel, A. Kathol, S. Khadivi, X. Lei, A. Mandal, S. Mansour, K. Precoda, C. Richey, D. Vergyri, W. Wang, M. Yang, and J. Zheng, "Recent advances in SRI's IraqComm(tm) Iraqi Arabic-English speech-to-speech translation system", in Proc. IEEE ICASSP (Taipei), pp. 4809-4813, April 2009.
- (2009) Proc. IEEE ICASSP (Taipei) , pp. 4809-4813
- Akbacak, M.¹ Franco, H.² Frandsen, M.³ Hasan, S.⁴ Jameel, H.⁵ Kathol, A.⁶ Khadivi, S.⁷ Lei, X.⁸ Mandal, A.⁹ Mansour, S.¹⁰ Precoda, K.¹¹ Richey, C.¹² Vergyri, D.¹³ Wang, W.¹⁴ Yang, M.¹⁵ Zheng, J.¹⁶

19
- 85061661435
- Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing
- S. Ravindran, D. V. Anderson, and M. Slaney, "Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing", in SAPA, Pittsburgh, PA, September 2006.
- SAPA, Pittsburgh, PA, September 2006
- Ravindran, S.¹ Anderson, D.V.² Slaney, M.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.