-
1
-
-
84865686073
-
-
NIST Speech Quality Assurance (SPQA) Package V2.3 [Online]. Available:
-
NIST Speech Quality Assurance (SPQA) Package v2.3, 1994 [Online]. Available: http://www.itl.nist.gov/iad/mig/tools
-
(1994)
-
-
-
2
-
-
0018320733
-
-
M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE ICASSP, 1979, pp. 208-211. (Pubitemid 9454996)
-
(1979)
Enhancement of speech corrupted by acoustic noise
, pp. 208-211
-
-
Berouti, M.1
Schwartz, R.2
Makhoul, J.3
-
3
-
-
51449107956
-
A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing
-
C. Breithaupt, T. Gerkmann, and R. Martin, "A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing," in Proc. IEEE ICASSP, 2008, pp. 4897-4900.
-
(2008)
Proc. IEEE ICASSP
, pp. 4897-4900
-
-
Breithaupt, C.1
Gerkmann, T.2
Martin, R.3
-
4
-
-
32644447834
-
Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
-
I. Cohen, "Speech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models," Signal Process., vol. 86, no. 4, pp. 698-709, 2005.
-
(2005)
Signal Process.
, vol.86
, Issue.4
, pp. 698-709
-
-
Cohen, I.1
-
5
-
-
33750380834
-
On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement
-
DOI 10.1016/j.specom.2006.06.009, PII S016763930600080X
-
T. H. Dat, K. Takeda, and F. Itakura, "On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement," Speech Commun., vol. 48, pp. 1515-1527, 2006. (Pubitemid 44634771)
-
(2006)
Speech Communication
, vol.48
, Issue.11
, pp. 1515-1527
-
-
Dat, T.H.1
Takeda, K.2
Itakura, F.3
-
6
-
-
0021645331
-
Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
-
Dec
-
Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, Dec. 1984.
-
(1984)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.32
, Issue.6
, pp. 1109-1121
-
-
Ephraim, Y.1
Malah, D.2
-
7
-
-
51449104842
-
Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors
-
Dec
-
J. Erkelens, R. Hendriks, R. Heusdens, and J. Jensen, "Minimum mean-square error estimation of discrete Fourier coefficients with generalized gamma priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1741-1752, Dec. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.6
, pp. 1741-1752
-
-
Erkelens, J.1
Hendriks, R.2
Heusdens, R.3
Jensen, J.4
-
9
-
-
0003548585
-
-
[Online]. Available:
-
J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus, 1993, [Online]. Available: http://www.ldc.upenn.edu/Catalog/LDC93S1.html
-
(1993)
DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
Dahlgren, N.L.6
-
10
-
-
78049364397
-
MMSE based noise PSD tracking with low complexity
-
R. Hendriks, R. Heusdens, and J. Jensen, "MMSE based noise PSD tracking with low complexity," in Proc. IEEE ICASSP, 2010, pp. 4266-4269.
-
(2010)
Proc. IEEE ICASSP
, pp. 4266-4269
-
-
Hendriks, R.1
Heusdens, R.2
Jensen, J.3
-
11
-
-
0004055099
-
Estimation of noise spectrum and its applications to SNR-estimation and speech enhancement
-
H. G. Hirsch, "Estimation of noise spectrum and its applications to SNR-estimation and speech enhancement," Int. Comput. Sci. Inst., Berkeley, CA, Tech. Rep. TR-93-012, 1993.
-
(1993)
Int. Comput. Sci. Inst., Berkeley, CA, Tech. Rep. TR-93-012
-
-
Hirsch, H.G.1
-
12
-
-
4644265990
-
Monaural speech segregation based on pitch tracking and amplitude modulation
-
Sep
-
G. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
-
(2004)
IEEE Trans. Neural Netw.
, vol.15
, Issue.5
, pp. 1135-1150
-
-
Hu, G.1
Wang, D.L.2
-
13
-
-
77955695149
-
A tandem algorithm for pitch estimation and voiced speech segregation
-
Nov
-
G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.8
, pp. 2067-2079
-
-
Hu, G.1
Wang, D.L.2
-
14
-
-
49249107353
-
Segregation of unvoiced speech from nonspeech interference
-
G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol. 124, pp. 1306-1319, 2008.
-
(2008)
J. Acoust. Soc. Amer.
, vol.124
, pp. 1306-1319
-
-
Hu, G.1
Wang, D.L.2
-
15
-
-
85008054377
-
Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction
-
Aug
-
K. Hu and D. L. Wang, "Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 6, pp. 1600-1609, Aug. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.6
, pp. 1600-1609
-
-
Hu, K.1
Wang, D.L.2
-
16
-
-
85008581724
-
Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions
-
Jan
-
J. Jensen and R. Hendriks, "Spectral magnitude minimum mean-square error estimation using binary and continuous gain functions," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 92-102, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.1
, pp. 92-102
-
-
Jensen, J.1
Hendriks, R.2
-
17
-
-
84867201503
-
Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis
-
C. Kim and R. Stern, "Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis," in Proc. Interspeech, 2008, pp. 2598-2601.
-
(2008)
Proc. Interspeech
, pp. 2598-2601
-
-
Kim, C.1
Stern, R.2
-
18
-
-
0037211087
-
Sub-band SNR estimation using auditory feature processing
-
M. Kleinschmidt and V. Hohmann, "Sub-band SNR estimation using auditory feature processing," Speech Commun., vol. 39, pp. 47-64, 2003.
-
(2003)
Speech Commun.
, vol.39
, pp. 47-64
-
-
Kleinschmidt, M.1
Hohmann, V.2
-
19
-
-
0343249636
-
Robust estimation of the SNR of noisy speech signals for the quality evaluation of speech databases
-
A. Korthauer, "Robust estimation of the SNR of noisy speech signals for the quality evaluation of speech databases," in Proc. ROBUST'99 Workshop, 1999, pp. 123-126.
-
(1999)
Proc. ROBUST'99 Workshop
, pp. 123-126
-
-
Korthauer, A.1
-
20
-
-
58149196390
-
On the optimality of ideal binary time-frequency masks
-
Y. Li and D. L. Wang, "On the optimality of ideal binary time-frequency masks," Speech Commun., vol. 51, pp. 230-239, 2009.
-
(2009)
Speech Commun.
, vol.51
, pp. 230-239
-
-
Li, Y.1
Wang, D.L.2
-
22
-
-
85008013225
-
Estimators of the magnitude-squared spectrum and methods for incorporating SNR uncertainty
-
Jul
-
Y. Lu and P. Loizou, "Estimators of the magnitude-squared spectrum and methods for incorporating SNR uncertainty," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1123-1137, Jul. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.5
, pp. 1123-1137
-
-
Lu, Y.1
Loizou, P.2
-
23
-
-
85135379452
-
An efficient algorithm to estimate the instantaneous SNR of speech signals
-
R. Martin, "An efficient algorithm to estimate the instantaneous SNR of speech signals," in Proc. Eurospeech, 1993, pp. 1093-1096.
-
(1993)
Proc. Eurospeech
, pp. 1093-1096
-
-
Martin, R.1
-
24
-
-
84865687067
-
A CASA based system for SNR estimation
-
The Ohio State Univ., Columbus, OH, Tech. Rep. OSU-CISRC-11/11-TR36, 2011 [Online]. Available: ftp://ftp.cse.ohio-state.edu/pub/tech-report/2011
-
A. Narayanan and D. L. Wang, "A CASA based system for SNR estimation," Dept. Comput. Sci. and Eng., The Ohio State Univ., Columbus, OH, Tech. Rep. OSU-CISRC-11/11-TR36, 2011 [Online]. Available: ftp://ftp.cse.ohio-state.edu/pub/tech-report/2011
-
Dept. Comput. Sci. and Eng
-
-
Narayanan, A.1
Wang, D.L.2
-
25
-
-
0032665180
-
SNR estimation of speech signals using subbands and fourth-order statistics
-
Jul
-
E. Nemer, R. Goubran, and S. Mahmoud, "SNR estimation of speech signals using subbands and fourth-order statistics," IEEE Signal Process. Lett., vol. 6, no. 7, pp. 504-512, Jul. 1999.
-
(1999)
IEEE Signal Process. Lett.
, vol.6
, Issue.7
, pp. 504-512
-
-
Nemer, E.1
Goubran, R.2
Mahmoud, S.3
-
26
-
-
0034832359
-
Assessing local noise level estimation methods: Application to noise robust ASR
-
DOI 10.1016/S0167-6393(00)00051-0
-
C. Ris and S. Dupont, "Assessing local noise level estimation methods: Application to noise robust ASR," Speech Commun., vol. 34, pp. 141-158, 2001. (Pubitemid 32874674)
-
(2001)
Speech Communication
, vol.34
, Issue.1-2
, pp. 141-158
-
-
Ris, C.1
Dupont, S.2
-
27
-
-
0038712550
-
SNR estimation based on amplitude modulation analysis with applications to noise suppression
-
May
-
J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 184-192, May 2003.
-
(2003)
IEEE Trans. Speech Audio Process.
, vol.11
, Issue.3
, pp. 184-192
-
-
Tchorz, J.1
Kollmeier, B.2
-
28
-
-
0006923547
-
Noise adaptation in a hidden Markov model speech recognition system
-
D. van Compernolle, "Noise adaptation in a hidden Markov model speech recognition system," Comput. Speech Lang., vol. 3, pp. 151-168, 1989.
-
(1989)
Comput. Speech Lang.
, vol.3
, pp. 151-168
-
-
Van Compernolle, D.1
-
29
-
-
0027623210
-
Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
-
A. Varga and H. J. M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, pp. 247-251, 1993.
-
(1993)
Speech Commun.
, vol.12
, pp. 247-251
-
-
Varga, A.1
Steeneken, H.J.M.2
-
30
-
-
84892233308
-
On ideal binary masks as the computational goal of auditory scene analysis
-
P. Divenyi, Ed. Boston, MA: Kluwer
-
D. L. Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Boston, MA: Kluwer, 2005, pp. 181-197.
-
(2005)
Speech Separation by Humans and Machines
, pp. 181-197
-
-
Wang, D.L.1
-
31
-
-
82255178542
-
-
Hoboken, NJ: Wiley/IEEE Press
-
D. L. Wang and G. J. Brown, Eds., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. Hoboken, NJ: Wiley/IEEE Press, 2006.
-
(2006)
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
-
-
Wang, D.L.1
Brown, G.J.2
-
32
-
-
80051602840
-
Robust speaker identification using a CASA front-end
-
X. Zhao, Y. Shao, and D. L. Wang, "Robust speaker identification using a CASA front-end," in Proc. IEEE ICASSP, 2011, pp. 5468-5471.
-
(2011)
Proc. IEEE ICASSP
, pp. 5468-5471
-
-
Zhao, X.1
Shao, Y.2
Wang, D.L.3