SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 9, 2012, Pages 2518-2527

A CASA-based system for long-term SNR estimation

Author keywords

broadband SNR; Computational auditory scene analysis (CASA); ideal binary mask (IBM); signal to noise ratio (SNR); subband SNR

Indexed keywords

BROADBAND SNR; COMPUTATIONAL AUDITORY SCENE ANALYSIS; IDEAL BINARY MASK; SIGNALTONOISE RATIO (SNR); SUBBANDS;

ALGORITHMS; ESTIMATION; SPEECH PROCESSING;

SIGNAL TO NOISE RATIO;

EID: 84865682906 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2205242 Document Type: Article

Times cited : (36)

References (32)

1
- 84865686073
- NIST Speech Quality Assurance (SPQA) Package V2.3 [Online]. Available:
- NIST Speech Quality Assurance (SPQA) Package v2.3, 1994 [Online]. Available: http://www.itl.nist.gov/iad/mig/tools
- (1994)

2
- 0018320733
- M. Berouti, R. Schwartz, and R. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE ICASSP, 1979, pp. 208-211. (Pubitemid 9454996)
- (1979) Enhancement of speech corrupted by acoustic noise , pp. 208-211
- Berouti, M.¹ Schwartz, R.² Makhoul, J.³

3
- 51449107956
- A novel a priori snr estimation approach based on selective cepstro-temporal smoothing
- C. Breithaupt, T. Gerkmann, and R. Martin, "A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing," in Proc. IEEE ICASSP, 2008, pp. 4897-4900.
- (2008) Proc. IEEE ICASSP , pp. 4897-4900
- Breithaupt, C.¹ Gerkmann, T.² Martin, R.³

4
- 32644447834
- Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models, "
- I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models, " Signal Process., vol. 86, no. 4, pp. 698-709, 2005.
- (2005) Signal Process. , vol.86 , Issue.4 , pp. 698-709
- Cohen, I.¹

5
- 33750380834
- On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement
- DOI 10.1016/j.specom.2006.06.009, PII S016763930600080X
- T. H. Dat, K. Takeda, and F. Itakura, "On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement," Speech Commun., vol. 48, pp. 1515-1527, 2006. (Pubitemid 44634771)
- (2006) Speech Communication , vol.48 , Issue.11 , pp. 1515-1527
- Dat, T.H.¹ Takeda, K.² Itakura, F.³

6
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- Dec
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.32 , Issue.6 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

7
- 51449104842
- Minimum meansquare error estimation of discrete fourier coefficients with generalized gamma priors
- Dec
- J. Erkelens, R. Hendriks, R. Heusdens, and J. Jensen, "Minimum meansquare error estimation of discrete Fourier coefficients with generalized gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Dec. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1741-1752
- Erkelens, J.¹ Hendriks, R.² Heusdens, R.³ Jensen, J.⁴

8
- 0004072715
- 2nd ed. New York: Marcel Dekker
- S. Furui, Digital Speech Processing, Synthesis, and Recognition, 2nd ed. New York: Marcel Dekker, 2000.
- (2000) Digital Speech Processing Synthesis and Recognition
- Furui, S.¹

9
- 0003548585
- [Online]. Available:
- J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus, 1993, [Online]. Available: http://www.ldc.upenn.edu/Catalog/LDC93S1.html
- (1993) DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶

10
- 78049364397
- Mmse based noise psd tracking with low complexity
- R. Hendriks, R. Heusdens, and J. Jensen, "MMSE based noise PSD tracking with low complexity," in Proc. IEEE ICASSP, 2010, pp. 4266-4269.
- (2010) Proc. IEEE ICASSP , pp. 4266-4269
- Hendriks, R.¹ Heusdens, R.² Jensen, J.³

11
- 0004055099
- Estimation of noise spectrum and its applications to snr-estimation and speech enhancement
- H. G. Hirsch, "Estimation of noise spectrum and its applications to SNR-estimation and speech enhancement," Int. Comput. Sci. Inst., Berkeley, CA, Tech. Rep. TR-93-012, 1993.
- (1993) Int. Comput. Sci. Inst., Berkeley, CA, Tech. Rep. TR-93-012
- Hirsch, H.G.¹

12
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Sep
- G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
- (2004) IEEE Trans. Neural Netw. , vol.15 , Issue.5 , pp. 1135-1150
- Hu, G.¹ Wang, D.L.²

13
- 77955695149
- A tandem algorithm for pitch estimation and voiced speech segregation
- Nov
- G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 2067-2079
- Hu, G.¹ Wang, D.L.²

14
- 49249107353
- Segregation of unvoiced speech from nonspeech interference
- G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
- (2008) J. Acoust. Soc. Amer. , vol.124 , pp. 1306-1319
- Hu, G.¹ Wang, D.L.²

15
- 85008054377
- Unvoiced speech segregation from nonspeech interference via casa and spectral subtraction
- Aug
- K. Hu and D. L.Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1600-1609, Aug. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.6 , pp. 1600-1609
- Hu, K.¹ Wang, D.L.²

16
- 85008581724
- Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions
- Jan
- J. Jensen and R. Hendriks, "Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 92-102, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 92-102
- Jensen, J.¹ Hendriks, R.²

17
- 84867201503
- Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis
- C. Kim and R. Stern, "Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis," in Proc. Interspeech, 2008, pp. 2598-2601.
- (2008) Proc. Interspeech , pp. 2598-2601
- Kim, C.¹ Stern, R.²

18
- 0037211087
- Sub-band snr estimation using auditory feature processing
- M. Kleinschmidt and V. Hohmann, "Sub-band SNR estimation using auditory feature processing," Speech Commun., vol. 39, pp. 47-64, 2003.
- (2003) Speech Commun. , vol.39 , pp. 47-64
- Kleinschmidt, M.¹ Hohmann, V.²

19
- 0343249636
- Robust estimation of the snr of noisy speech signals for the quality evaluation of speech databases
- A. Korthauer, "Robust estimation of the SNR of noisy speech signals for the quality evaluation of speech databases," in Proc. ROBUST'99 Workshop, 1999, pp. 123-126.
- (1999) Proc. ROBUST'99 Workshop , pp. 123-126
- Korthauer, A.¹

20
- 58149196390
- On the optimality of ideal binary time-frequency masks
- Y. Li and D. L. Wang, "On the optimality of ideal binary time-frequency masks," Speech Commun., vol. 51, pp. 230-239, 2009.
- (2009) Speech Commun. , vol.51 , pp. 230-239
- Li, Y.¹ Wang, D.L.²

21
- 34447100796
- Boca Raton FL: CRC
- P. C. Loizou, Speech Enhancement: Theory and Practice. Boca Raton, FL: CRC, 2007.
- (2007) Speech Enhancement: Theory And Practice
- Loizou, P.C.¹

22
- 85008013225
- Estimators of the magnitude-squared spectrum and methods for incorporating snr uncertainty
- Jul
- Y. Lu and P. Loizou, "Estimators of the magnitude-squared spectrum and methods for incorporating SNR uncertainty," IEEE Trans. Audio, Speech, Lang. Process, vol. 19, no. 5, pp. 1123-1137, Jul. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.5 , pp. 1123-1137
- Lu, Y.¹ Loizou, P.²

23
- 85135379452
- An efficient algorithm to estimate the instantaneous snr of speech signals
- R. Martin, "An efficient algorithm to estimate the instantaneous SNR of speech signals," in Proc. Eurospeech, 1993, pp. 1093-1096.
- (1993) Proc. Eurospeech , pp. 1093-1096
- Martin, R.¹

24
- 84865687067
- A casa based system for snr estimation
- The Ohio State Univ., Columbus, OH, Tech. Rep. OSU-CISRC-11/11-TR36, 2011 [Online]. Available: ftp://ftp.cse.ohio-state.edu/pub/tech-report/2011
- A. Narayanan and D. L. Wang, "A CASA based system for SNR estimation,' Dept. Comput. Sci. and Eng., The Ohio State Univ., Columbus, OH, Tech. Rep. OSU-CISRC-11/11-TR36, 2011 [Online]. Available: ftp://ftp.cse.ohio- state.edu/pub/tech-report/2011
- Dept. Comput. Sci. and Eng
- Narayanan, A.¹ Wang, D.L.²

25
- 0032665180
- Snr estimation of speech signals using subbands and fourth-order statistics
- Jul
- E. Nemer, R. Goubran, and S. Mahmoud, "SNR estimation of speech signals using subbands and fourth-order statistics," IEEE Signal Process. Lett., vol. 6, no. 7, pp. 504-512, Jul. 1999.
- (1999) IEEE Signal Process. Lett. , vol.6 , Issue.7 , pp. 504-512
- Nemer, E.¹ Goubran, R.² Mahmoud, S.³

26
- 0034832359
- Assessing local noise level estimation methods: Application to noise robust ASR
- DOI 10.1016/S0167-6393(00)00051-0
- C. Ris and S. Dupont, "Assessing local noise level estimation methods: Application to noise robust ASR," Speech Commun., vol. 34, pp. 141-158, 2001. (Pubitemid 32874674)
- (2001) Speech Communication , vol.34 , Issue.1-2 , pp. 141-158
- Ris, C.¹ Dupont, S.²

27
- 0038712550
- Snr estimation based on amplitude modulation analysis with applications to noise suppression
- May
- J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Trans. Audio, Speech, Signal Process., vol. 11, no. 3, pp. 184-192, May 2003.
- (2003) IEEE Trans. Audio, Speech, Signal Process. , vol.11 , Issue.3 , pp. 184-192
- Tchorz, J.¹ Kollmeier, B.²

28
- 0006923547
- Noise adaptation in a hidden markov model speech recognition system
- D. van Compernolle, "Noise adaptation in a hidden Markov model speech recognition system," Comput. Speech Lang., vol. 3, pp. 151-168, 1989.
- (1989) Comput. Speech Lang. , vol.3 , pp. 151-168
- Van Compernolle, D.¹

29
- 0027623210
- Assessment for automatic speech recognition: Ii. Noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- A. Varga and H. J. M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, pp. 247-251, 1993.
- (1993) Speech Commun. , vol.12 , pp. 247-251
- Varga, A.¹ Steeneken, H.J.M.²

30
- 84892233308
- On ideal binary masks as the computational goal of auditory scene analysis
- P. Divenyi, Ed. Boston, MA: Kluwer
- D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Boston, MA: Kluwer, 2005, pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

31
- 82255178542
- Hoboken, NJ: Wiley/IEEE Press
- ], D. L. Wang and G. J. Brown, Eds., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. Hoboken, NJ: Wiley/IEEE Press, 2006.
- (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
- Wang, D.L.¹ Brown, G.J.²

32
- 80051602840
- Robust speaker identification using a casa front-end
- X. Zhao, Y. Shao, and D. L.Wang, "Robust speaker identification using a CASA front-end," in Proc. IEEE ICASSP, 2011, pp. 5468-5471.
- (2011) Proc. IEEE ICASSP , pp. 5468-5471
- Zhao, X.¹ Shao, Y.² Wang, D.L.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.