Volume , Issue , 2012, Pages 4029-4032

LogMax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise

Author keywords

automatic speech recognition; mel frequency cepstral coefficients; model based approach; speech enhancement

Indexed keywords

AMBIENT NOISE; AUTOMATIC SPEECH RECOGNITION; FUNDAMENTAL LIMITATIONS; GAUSSIAN MIXTURE MODEL; HIGH POTENTIAL; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MODEL BASED APPROACH; NONSTATIONARY; NONSTATIONARY NOISE; OBSERVATION MODEL; SOURCE LOCATION; SPEECH QUALITY;

EID: 84867591985     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6288802     Document Type: Conference Paper
Times cited: (14)

References (8)
  • 1
    • 0029725301
    • A vector Taylor series approach for environment-independent speech recognition
    • P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," ICASSP-96, vol. II, pp. 733-736, 1996.
    • (1996) ICASSP-96 , vol.2 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 2
    • 85032751986
    • Single-channel multitalker speech recognition
    • S.J. Rennie, J.R. Hershey, and P.A. Olsen, "Single-channel multitalker speech recognition," IEEE SP Magazine, pp. 66-80, Nov. 2010.
    • (2010) IEEE SP Magazine , pp. 66-80
    • Rennie, S.J.1    Hershey, J.R.2    Olsen, P.A.3
  • 3
    • 80051602796
    • Joint unsupervized learning of hidden Markov source models and source location models for multichannel source separation
    • T. Nakatani, S. Araki, T. Yoshioka, and M. Fujimoto, "Joint unsupervized learning of hidden Markov source models and source location models for multichannel source separation," Proc. ICASSP-2011, 2011.
    • Proc. ICASSP-2011, 2011
    • Nakatani, T.1    Araki, S.2    Yoshioka, T.3    Fujimoto, M.4
  • 4
    • 84865754161
    • Reduction of highly nonstationary ambient noise by integrating spectral and locational characteristics of speech and noise for robust ASR
    • T. Nakatani, S. Araki, M. Delcroix, T. Yoshioka, and M. Fujimoto, "Reduction of highly nonstationary ambient noise by integrating spectral and locational characteristics of speech and noise for robust ASR," Proc. Interspeech-2011, 2011.
    • Proc. Interspeech-2011, 2011
    • Nakatani, T.1    Araki, S.2    Delcroix, M.3    Yoshioka, T.4    Fujimoto, M.5
  • 6
    • 50249118229
    • A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures
    • H. Sawada, S. Araki, and S. Makino, "A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures," Proc. WASPAA-2007, pp. 139-142, 2007.
    • (2007) Proc. WASPAA-2007 , pp. 139-142
    • Sawada, H.1    Araki, S.2    Makino, S.3
  • 7
    • 45849093239
    • Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition
    • T. Hori, C. Hori, Y. Minami, and A. Nakamura, "Efficient WFST-based one-pass decoding with on-the-fly hypothesis rescoring in extremely large vocabulary continuous speech recognition," IEEE Trans. SAP, vol. 15, no. 4, pp. 1352-1365, 2007.
    • (2007) IEEE Trans. SAP , vol.15 , Issue.4 , pp. 1352-1365
    • Hori, T.1    Hori, C.2    Minami, Y.3    Nakamura, A.4
  • 8
    • 84873898784
    • Speech recognition in the presence of highly nonstationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation
    • M. Delcroix et al., "Speech recognition in the presence of highly nonstationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation," Proc. CHiME workshop, 2011.
    • Proc. CHiME Workshop, 2011
    • Delcroix, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.