Volume 25, Issue 3, 2011, Pages 571-584

Sub-band temporal modulation envelopes and their normalization for automatic speech recognition in reverberant environments

Author keywords

Automatic speech recognition; Speech reverberation; Sub-band temporal modulation envelope; Temporal modulation

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; BAND PASS; CONSTANT BANDWIDTH; EXTRACTION METHOD; HILBERT TRANSFORM; INVERSE FOURIER TRANSFORMS; LOW-PASS FILTERING; MEAN AND VARIANCE NORMALIZATIONS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODULATION SPECTRUM; MODULATION TRANSFER; MODULATION TRANSFER FUNCTION; NORMALIZATION METHODS; PHASE INFORMATION; RECOGNITION PERFORMANCE; RECORDING ENVIRONMENT; REVERBERANT ENVIRONMENT; REVERBERANT ROOM; SUB-BAND TEMPORAL MODULATION ENVELOPE; SUB-BANDS; TEMPORAL FILTERING; TEMPORAL MODULATION; TEMPORAL MODULATIONS;

EID: 79952625580     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2010.10.002     Document Type: Article
Times cited: 10

References (39)
  • 1
    • 0017659025
    • Multimicrophone signal-processing technique to remove room reverberation from speech signals
    • DOI 10.1121/1.381621
    • J.B. Allen, D.A. Berkley, and J. Blauert Multi-microphone signal-processing technique to remove room reverberation from speech signals Journal of the Acoustical Society of America 62 4 1977 912 915 (Pubitemid 8199278)
    • (1977) Journal of the Acoustical Society of America , vol.62 , Issue.4 , pp. 912-915
    • Allen, J.B.1    Berkley, D.A.2    Blauert, J.3
  • 2
    • 79952626638
    • http://sp.shinshu-u.ac.jp/CENSREC/. AURORA-2J database.
  • 7
    • 0028287770
    • Effect of reducing slow temporal modulations on speech reception
    • DOI 10.1121/1.409836
    • R. Drullman, J.M. Festen, and R. Plomp Effect of reducing slow temporal modulations on speech reception Journal of the Acoustical Society of America 95 5 1994 2670 2680 (Pubitemid 24152861)
    • (1994) Journal of the Acoustical Society of America , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 8
    • 0021892216
    • Speech enhancement using a minimum mean square error log-spectral amplitude estimator
    • Y. Ephraim, and D. Malah Speech enhancement using a minimum mean square error log-spectral amplitude estimator IEEE Transactions on Acoustics, Speech and Signal Processing 33 2 1985 443 445
    • (1985) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 11
    • 0027166410
    • Recognition of speech in additive and convolutional noise based on RASTA spectral processing
    • H. Hermansky, N. Morgan, and H.G. Hirsch Recognition of speech in additive and convolutional noise based on RASTA spectral processing Proc. ICASSP'93 1993 83 86
    • (1993) Proc. ICASSP'93 , pp. 83-86
    • Hermansky, H.1    Morgan, N.2    Hirsch, H.G.3
  • 12
    • 0141587024
    • Speech waveform recovery from a reverberant speech signal using inverse filtering of the power envelope transfer function
    • S. Hirobayashi, H. Nomura, T. Koike, and M. Tohyama Speech waveform recovery from a reverberant speech signal using inverse filtering of the power envelope transfer function. IEICE Transactions A J81-A 10 1998 1323 1330
    • (1998) IEICE Transactions A , vol.J81-A , Issue.10 , pp. 1323-1330
    • Hirobayashi, S.1    Nomura, H.2    Koike, T.3    Tohyama, M.4
  • 13
    • 4344705227
    • Validation of blind dereverberation using power envelope inverse filtering and filter banks
    • S. Hirobayashi, and T. Yamabuchi Validation of blind dereverberation using power envelope inverse filtering and filter banks IEICE Transactions A J83-A 8 2000 1029 1033
    • (2000) IEICE Transactions A , vol.J83-A , Issue.8 , pp. 1029-1033
    • Hirobayashi, S.1    Yamabuchi, T.2
  • 14
    • 0015553712
    • The modulation transfer function in room acoustics as a predictor of speech intelligibility
    • T. Houtgast, and H.J.M. Steeneken The modulation transfer function in room acoustics as a predictor of speech intelligibility Acustica 28 1973 66 73
    • (1973) Acustica , vol.28 , pp. 66-73
    • Houtgast, T.1    Steeneken, H.J.M.2
  • 15
    • 84873312246
    • A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
    • T. Houtgast, and H.J.M. Steeneken A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria Journal of the Acoustical Society of America 77 3 1985 1069 1077
    • (1985) Journal of the Acoustical Society of America , vol.77 , Issue.3 , pp. 1069-1077
    • Houtgast, T.1    Steeneken, H.J.M.2
  • 16
    • 0032676337
    • On the relative importance of various components of the modulation spectrum for automatic speech recognition
    • N. Kanedera, T. Arai, H. Hermansky, and M. Pavel On the relative importance of various components of the modulation spectrum for automatic speech recognition Speech Communication 28 1 1999 43 55
    • (1999) Speech Communication , vol.28 , Issue.1 , pp. 43-55
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 17
    • 33947694356
    • Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation
    • K. Kinoshita, T. Nakatani, and M. Miyoshi Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation Proc. ICASSP'06, I 2006 817 820
    • (2006) Proc. ICASSP'06, I , pp. 817-820
    • Kinoshita, K.1    Nakatani, T.2    Miyoshi, M.3
  • 19
    • 44949119574
    • A robust feature extraction based on the MTF concept for speech recognition in reverberant environment
    • X. Lu, M. Unoki, and M. Akagi A robust feature extraction based on the MTF concept for speech recognition in reverberant environment Proc. ICSLP'06 2006 2546 2549
    • (2006) Proc. ICSLP'06 , pp. 2546-2549
    • Lu, X.1    Unoki, M.2    Akagi, M.3
  • 20
    • 56549098616
    • Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
    • X. Lu, M. Unoki, and M. Akagi Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems Acoustical Science and Technology 29 6 2008 351 361
    • (2008) Acoustical Science and Technology , vol.29 , Issue.6 , pp. 351-361
    • Lu, X.1    Unoki, M.2    Akagi, M.3
  • 21
    • 84867218794
    • Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement
    • J.G. Lyons, and K.K. Paliwal Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement Proc. INTERSPEECH'08 2008 387 390
    • (2008) Proc. INTERSPEECH'08 , pp. 387-390
    • Lyons, J.G.1    Paliwal, K.K.2
  • 23
    • 0038238630
    • A survey on automatic speech recognition
    • S. Nakagawa A survey on automatic speech recognition IEICE Transactions D-II J83-D-II 2 2000 433 457
    • (2000) IEICE Transactions D-II , vol.J83-D-II , Issue.2 , pp. 433-457
    • Nakagawa, S.1
  • 25
    • 0141830958
    • Blind dereverberation of single channel speech signal based on harmonic structure
    • T. Nakatani, and M. Miyoshi Blind dereverberation of single channel speech signal based on harmonic structure Proc. ICASSP'03, 1 2003 92 95
    • (2003) Proc. ICASSP'03, 1 , pp. 92-95
    • Nakatani, T.1    Miyoshi, M.2
  • 26
    • 70450139534
    • Blind dereverberation of monaural speech signals based on harmonic structure
    • T. Nakatani, M. Miyoshi, and K. Kinoshita Blind dereverberation of monaural speech signals based on harmonic structure IEICE D-II J88-D-II 3 2005 509 520
    • (2005) IEICE D-II , vol.J88-D-II , Issue.3 , pp. 509-520
    • Nakatani, T.1    Miyoshi, M.2    Kinoshita, K.3
  • 28
    • 0019635319
    • Modulation transfer function: definition and measurement
    • M.R. Schroeder Modulation transfer function: definition and measurement Acustica 49 1981 179 182 (Pubitemid 12508801)
    • (1981) Acustica , vol.49 , Issue.3 , pp. 179-182
    • Schroeder, M.R.1
  • 31
    • 0010516808
    • Hands-free speech recognition by HMM composition in noisy reverberant environments
    • T. Takiguchi, S. Nakamura, and K. Shikano Hands-free speech recognition by HMM composition in noisy reverberant environments IEICE Transactions D-II J79-D-II 12 1996 2047 2053
    • (1996) IEICE Transactions D-II , vol.J79-D-II , Issue.12 , pp. 2047-2053
    • Takiguchi, T.1    Nakamura, S.2    Shikano, K.3
  • 32
    • 0003822743
    • The HTK Book (version 3.2)
    • The HTK Book (version 3.2), 2002. Cambridge University Engineering Department.
    • (2002) The HTK Book
  • 33
    • 4344685385
    • An improved method based on the MTF concept for restoring the power envelope from a reverberant signal
    • M. Unoki, M. Furukawa, K. Sakata, and M. Akagi An improved method based on the MTF concept for restoring the power envelope from a reverberant signal Acoustical Science and Technology 25 4 2004 232 242
    • (2004) Acoustical Science and Technology , vol.25 , Issue.4 , pp. 232-242
    • Unoki, M.1    Furukawa, M.2    Sakata, K.3    Akagi, M.4
  • 34
    • 51449100217
    • Comparative evaluations of robust and accurate F0 estimates in reverberant environments
    • M. Unoki, T. Hosorogiya, and Y. Ishimoto Comparative evaluations of robust and accurate F0 estimates in reverberant environments Proc. ICASSP'08 2008 4569 4572
    • (2008) Proc. ICASSP'08 , pp. 4569-4572
    • Unoki, M.1    Hosorogiya, T.2    Ishimoto, Y.3
  • 35
    • 4344573437
    • A speech dereverberation method based on the MTF concept in power envelope restoration
    • M. Unoki, K. Sakata, M. Furukawa, and M. Akagi A speech dereverberation method based on the MTF concept in power envelope restoration Acoustical Science and Technology 25 4 2004 243 254
    • (2004) Acoustical Science and Technology , vol.25 , Issue.4 , pp. 243-254
    • Unoki, M.1    Sakata, K.2    Furukawa, M.3    Akagi, M.4
  • 37
    • 0141957802
    • Efficient alternatives to the Ephraim and Malah suppression rule for audio signal enhancement
    • P.J. Wolfe, and S.J. Godsill Efficient alternatives to the Ephraim and Malah suppression rule for audio signal enhancement EURASIP Journal on Applied Signal Processing 10 2003 1043 1051
    • (2003) EURASIP Journal on Applied Signal Processing , vol.10 , pp. 1043-1051
    • Wolfe, P.J.1    Godsill, S.J.2
  • 38
    • 34347376319
    • Temporal structure normalization of speech feature for robust speech recognition
    • DOI 10.1109/LSP.2006.891341
    • X. Xiao, E.S. Chng, and H. Li Temporal structure normalization of speech feature for robust speech recognition IEEE Signal Processing Letters 14 7 2007 500 503 (Pubitemid 47018924)
    • (2007) IEEE Signal Processing Letters , vol.14 , Issue.7 , pp. 500-503
    • Xiao, X.1    Chng, E.S.2    Li, H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.