SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2012, Pages 4029-4032

LogMax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise

(5) Nakatani, Tomohiro a Yoshioka, Takuya a Araki, Shoko a Delcroix, Marc a Fujimoto, Masakiyo a

a NTT Corporation (Japan)

Author keywords

automatic speech recognition; mel frequency cepstral coefficients; model based approach; Speech enhancement

Indexed keywords

AMBIENT NOISE; AUTOMATIC SPEECH RECOGNITION; FUNDAMENTAL LIMITATIONS; GAUSSIAN MIXTURE MODEL; HIGH POTENTIAL; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODEL BASED APPROACH; NONSTATIONARY; NONSTATIONARY NOISE; OBSERVATION MODEL; SOURCE LOCATION; SPEECH QUALITY;

ACOUSTIC NOISE; HIDDEN MARKOV MODELS; SIGNAL PROCESSING; SPEECH ENHANCEMENT;

SPEECH RECOGNITION;

EID: 84867591985 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6288802 Document Type: Conference Paper

Times cited : (14)

References (8)

1
- 0029725301
- A vector Taylor series approach for environment-independent speech recognition
- P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," ICASSP-96, vol. II, pp. 733-736, 1996.
- (1996) ICASSP-96 , vol.2 , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

2
- 85032751986
- Single-channel multitalker speech recognition
- Nov.
- S.J. Rennie, J.R. Hershey, and P.A. Olsen, "Single-channel multitalker speech recognition," IEEE SP Magazine, pp. 66-80, Nov. 2010.
- (2010) IEEE SP Magazine , pp. 66-80
- Rennie, S.J.¹ Hershey, J.R.² Olsen, P.A.³

3
- 80051602796
- Joint unsupervized learning of hidden Markov source models and source location models for multichannel source separation
- T. Nakatani, S. Araki, T. Yoshioka, and M. Fujimoto, "Joint unsupervized learning of hidden Markov source models and source location models for multichannel source separation,"Proc. ICASSP-2011, 2011.
- Proc. ICASSP-2011, 2011
- Nakatani, T.¹ Araki, S.² Yoshioka, T.³ Fujimoto, M.⁴

4
- 84865754161
- Reduction of highly nonstationary ambient noise by integrating spectral and locational characteristics of speech and noise for robust ASR
- T. Nakatani, S. Araki, M. Delcroix, T. Yoshioka, and M. Fujimoto, "Reduction of highly nonstationary ambient noise by integrating spectral and locational characteristics of speech and noise for robust ASR," Proc. Interspeech-2011, 2011.
- Proc. Interspeech-2011, 2011
- Nakatani, T.¹ Araki, S.² Delcroix, M.³ Yoshioka, T.⁴ Fujimoto, M.⁵

5
- 84865745605
- J. Barker, H. Christensen, N. Ma, P. Green, and E. Vincent, the PASCAL CHiME speech separation challenge website. http://www.dcs.shef.ac.uk/spandh/ chime/challenge.html
- The PASCAL CHiME Speech Separation Challenge Website
- Barker, J.¹ Christensen, H.² Ma, N.³ Green, P.⁴ Vincent, E.⁵

6
- 50249118229
- A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures
- H. Sawada, S. Araki, and S. Makino, "A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures," Proc. WASPAA-2007, pp. 139-142, 2007.
- (2007) Proc. WASPAA-2007 , pp. 139-142
- Sawada, H.¹ Araki, S.² Makino, S.³

7
- 45849093239
- Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
- T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEE Trans. SAP, vol. 15, no. 4, pp. 1352-1365, 2007.
- (2007) IEEE Trans. SAP , vol.15 , Issue.4 , pp. 1352-1365
- Hori, T.¹ Hori, C.² Minami, Y.³ Nakamura, A.⁴

8
- 84873898784
- Speech recognition in the presence of highly nonstationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation
- M. Delcroix et al., "Speech recognition in the presence of highly nonstationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation," Proc. CHiME workshop, 2011.
- Proc. CHiME Workshop, 2011
- Delcroix, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.