메뉴 건너뛰기




Volumn 20, Issue 9, 2012, Pages 2518-2527

A CASA-based system for long-term SNR estimation

Author keywords

broadband SNR; Computational auditory scene analysis (CASA); ideal binary mask (IBM); signal to noise ratio (SNR); subband SNR

Indexed keywords

BROADBAND SNR; COMPUTATIONAL AUDITORY SCENE ANALYSIS; IDEAL BINARY MASK; SIGNALTONOISE RATIO (SNR); SUBBANDS;

EID: 84865682906     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2205242     Document Type: Article
Times cited : (36)

References (32)
  • 1
    • 84865686073 scopus 로고
    • NIST Speech Quality Assurance (SPQA) Package V2.3 [Online]. Available:
    • NIST Speech Quality Assurance (SPQA) Package v2.3, 1994 [Online]. Available: http://www.itl.nist.gov/iad/mig/tools
    • (1994)
  • 3
    • 51449107956 scopus 로고    scopus 로고
    • A novel a priori snr estimation approach based on selective cepstro-temporal smoothing
    • C. Breithaupt, T. Gerkmann, and R. Martin, "A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing," in Proc. IEEE ICASSP, 2008, pp. 4897-4900.
    • (2008) Proc. IEEE ICASSP , pp. 4897-4900
    • Breithaupt, C.1    Gerkmann, T.2    Martin, R.3
  • 4
    • 32644447834 scopus 로고    scopus 로고
    • Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models, "
    • I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models, " Signal Process., vol. 86, no. 4, pp. 698-709, 2005.
    • (2005) Signal Process. , vol.86 , Issue.4 , pp. 698-709
    • Cohen, I.1
  • 5
    • 33750380834 scopus 로고    scopus 로고
    • On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement
    • DOI 10.1016/j.specom.2006.06.009, PII S016763930600080X
    • T. H. Dat, K. Takeda, and F. Itakura, "On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement," Speech Commun., vol. 48, pp. 1515-1527, 2006. (Pubitemid 44634771)
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1515-1527
    • Dat, T.H.1    Takeda, K.2    Itakura, F.3
  • 6
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 7
    • 51449104842 scopus 로고    scopus 로고
    • Minimum meansquare error estimation of discrete fourier coefficients with generalized gamma priors
    • Dec
    • J. Erkelens, R. Hendriks, R. Heusdens, and J. Jensen, "Minimum meansquare error estimation of discrete Fourier coefficients with generalized gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Dec. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
    • Erkelens, J.1    Hendriks, R.2    Heusdens, R.3    Jensen, J.4
  • 10
    • 78049364397 scopus 로고    scopus 로고
    • Mmse based noise psd tracking with low complexity
    • R. Hendriks, R. Heusdens, and J. Jensen, "MMSE based noise PSD tracking with low complexity," in Proc. IEEE ICASSP, 2010, pp. 4266-4269.
    • (2010) Proc. IEEE ICASSP , pp. 4266-4269
    • Hendriks, R.1    Heusdens, R.2    Jensen, J.3
  • 12
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Sep
    • G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural Netw. , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.L.2
  • 13
    • 77955695149 scopus 로고    scopus 로고
    • A tandem algorithm for pitch estimation and voiced speech segregation
    • Nov
    • G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 2067-2079
    • Hu, G.1    Wang, D.L.2
  • 14
    • 49249107353 scopus 로고    scopus 로고
    • Segregation of unvoiced speech from nonspeech interference
    • G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.124 , pp. 1306-1319
    • Hu, G.1    Wang, D.L.2
  • 15
    • 85008054377 scopus 로고    scopus 로고
    • Unvoiced speech segregation from nonspeech interference via casa and spectral subtraction
    • Aug
    • K. Hu and D. L.Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1600-1609, Aug. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.6 , pp. 1600-1609
    • Hu, K.1    Wang, D.L.2
  • 16
    • 85008581724 scopus 로고    scopus 로고
    • Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions
    • Jan
    • J. Jensen and R. Hendriks, "Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 92-102, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 92-102
    • Jensen, J.1    Hendriks, R.2
  • 17
    • 84867201503 scopus 로고    scopus 로고
    • Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis
    • C. Kim and R. Stern, "Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis," in Proc. Interspeech, 2008, pp. 2598-2601.
    • (2008) Proc. Interspeech , pp. 2598-2601
    • Kim, C.1    Stern, R.2
  • 18
    • 0037211087 scopus 로고    scopus 로고
    • Sub-band snr estimation using auditory feature processing
    • M. Kleinschmidt and V. Hohmann, "Sub-band SNR estimation using auditory feature processing," Speech Commun., vol. 39, pp. 47-64, 2003.
    • (2003) Speech Commun. , vol.39 , pp. 47-64
    • Kleinschmidt, M.1    Hohmann, V.2
  • 19
    • 0343249636 scopus 로고    scopus 로고
    • Robust estimation of the snr of noisy speech signals for the quality evaluation of speech databases
    • A. Korthauer, "Robust estimation of the SNR of noisy speech signals for the quality evaluation of speech databases," in Proc. ROBUST'99 Workshop, 1999, pp. 123-126.
    • (1999) Proc. ROBUST'99 Workshop , pp. 123-126
    • Korthauer, A.1
  • 20
    • 58149196390 scopus 로고    scopus 로고
    • On the optimality of ideal binary time-frequency masks
    • Y. Li and D. L. Wang, "On the optimality of ideal binary time-frequency masks," Speech Commun., vol. 51, pp. 230-239, 2009.
    • (2009) Speech Commun. , vol.51 , pp. 230-239
    • Li, Y.1    Wang, D.L.2
  • 22
    • 85008013225 scopus 로고    scopus 로고
    • Estimators of the magnitude-squared spectrum and methods for incorporating snr uncertainty
    • Jul
    • Y. Lu and P. Loizou, "Estimators of the magnitude-squared spectrum and methods for incorporating SNR uncertainty," IEEE Trans. Audio, Speech, Lang. Process, vol. 19, no. 5, pp. 1123-1137, Jul. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.5 , pp. 1123-1137
    • Lu, Y.1    Loizou, P.2
  • 23
    • 85135379452 scopus 로고
    • An efficient algorithm to estimate the instantaneous snr of speech signals
    • R. Martin, "An efficient algorithm to estimate the instantaneous SNR of speech signals," in Proc. Eurospeech, 1993, pp. 1093-1096.
    • (1993) Proc. Eurospeech , pp. 1093-1096
    • Martin, R.1
  • 24
    • 84865687067 scopus 로고    scopus 로고
    • A casa based system for snr estimation
    • The Ohio State Univ., Columbus, OH, Tech. Rep. OSU-CISRC-11/11-TR36, 2011 [Online]. Available: ftp://ftp.cse.ohio-state.edu/pub/tech-report/2011
    • A. Narayanan and D. L. Wang, "A CASA based system for SNR estimation,' Dept. Comput. Sci. and Eng., The Ohio State Univ., Columbus, OH, Tech. Rep. OSU-CISRC-11/11-TR36, 2011 [Online]. Available: ftp://ftp.cse.ohio- state.edu/pub/tech-report/2011
    • Dept. Comput. Sci. and Eng
    • Narayanan, A.1    Wang, D.L.2
  • 25
    • 0032665180 scopus 로고    scopus 로고
    • Snr estimation of speech signals using subbands and fourth-order statistics
    • Jul
    • E. Nemer, R. Goubran, and S. Mahmoud, "SNR estimation of speech signals using subbands and fourth-order statistics," IEEE Signal Process. Lett., vol. 6, no. 7, pp. 504-512, Jul. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.7 , pp. 504-512
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 26
    • 0034832359 scopus 로고    scopus 로고
    • Assessing local noise level estimation methods: Application to noise robust ASR
    • DOI 10.1016/S0167-6393(00)00051-0
    • C. Ris and S. Dupont, "Assessing local noise level estimation methods: Application to noise robust ASR," Speech Commun., vol. 34, pp. 141-158, 2001. (Pubitemid 32874674)
    • (2001) Speech Communication , vol.34 , Issue.1-2 , pp. 141-158
    • Ris, C.1    Dupont, S.2
  • 27
    • 0038712550 scopus 로고    scopus 로고
    • Snr estimation based on amplitude modulation analysis with applications to noise suppression
    • May
    • J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Trans. Audio, Speech, Signal Process., vol. 11, no. 3, pp. 184-192, May 2003.
    • (2003) IEEE Trans. Audio, Speech, Signal Process. , vol.11 , Issue.3 , pp. 184-192
    • Tchorz, J.1    Kollmeier, B.2
  • 28
    • 0006923547 scopus 로고
    • Noise adaptation in a hidden markov model speech recognition system
    • D. van Compernolle, "Noise adaptation in a hidden Markov model speech recognition system," Comput. Speech Lang., vol. 3, pp. 151-168, 1989.
    • (1989) Comput. Speech Lang. , vol.3 , pp. 151-168
    • Van Compernolle, D.1
  • 29
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: Ii. Noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H. J. M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, pp. 247-251, 1993.
    • (1993) Speech Commun. , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 30
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary masks as the computational goal of auditory scene analysis
    • P. Divenyi, Ed. Boston, MA: Kluwer
    • D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Boston, MA: Kluwer, 2005, pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 32
    • 80051602840 scopus 로고    scopus 로고
    • Robust speaker identification using a casa front-end
    • X. Zhao, Y. Shao, and D. L.Wang, "Robust speaker identification using a CASA front-end," in Proc. IEEE ICASSP, 2011, pp. 5468-5471.
    • (2011) Proc. IEEE ICASSP , pp. 5468-5471
    • Zhao, X.1    Shao, Y.2    Wang, D.L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.