-
2
-
-
0036805081
-
Perceptual evaluation of speech quality (PESQ): The new ITU standard for end-to-end speech quality assessment. Part II - Psychoacoustic model
-
Beerends, J. G., Hekstra, A. P., Rix, A. W., and Hollier, M. P. (2002). Perceptual evaluation of speech quality (PESQ): The new ITU standard for end-to-end speech quality assessment part II-psychoacoustic model., J. Audio Eng. Soc. 50, 765-778. (Pubitemid 35296264)
-
(2002)
AES: Journal of the Audio Engineering Society
, vol.50
, Issue.10
, pp. 765-778
-
-
Beerends, J.G.1
Hekstra, A.P.2
Rix, A.W.3
Hollier, M.P.4
-
3
-
-
65549150744
-
Measurement of speech intelligibility based on the PESQ approach
-
in
-
Beerends, J. G., Larsen, E., Iyer, N., and van Vugt, J. M. (2004). Measurement of speech intelligibility based on the PESQ approach., in Proceedings of the Workshop Measurement of Speech and Audio Quality in Networks.
-
(2004)
Proceedings of the Workshop Measurement of Speech and Audio Quality in Networks
-
-
Beerends, J.G.1
Larsen, E.2
Iyer, N.3
Van Vugt, J.M.4
-
4
-
-
78649295196
-
Extension of ITU-T recommendation P. 862 PESQ towards measuring speech intelligibility with vocoders
-
Beerends, J. G., van Wijngaarden, S., and van Buuren, R. (2005). Extension of ITU-T recommendation P. 862 PESQ towards measuring speech intelligibility with vocoders., TNO Technical Report.
-
(2005)
TNO Technical Report
-
-
Beerends, J.G.1
Van Wijngaarden, S.2
Van Buuren, R.3
-
5
-
-
84863763285
-
A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation
-
in
-
Boldt, J. B., and Ellis, D. P. W. (2009). A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation., in Proceedings of EUSIPCO, pp. 1849-1853.
-
(2009)
Proceedings of EUSIPCO
, pp. 1849-1853
-
-
Boldt, J.B.1
Ellis, D.P.W.2
-
6
-
-
33845354768
-
Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
-
DOI 10.1121/1.2363929
-
Brungart, D. S., Chang, P. S., Simpson, B. D., and Wang, D. L. (2006). Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation., J. Acoust. Soc. Am. 120, 4007-4018. 10.1121/1.2363929 (Pubitemid 44888096)
-
(2006)
Journal of the Acoustical Society of America
, vol.120
, Issue.6
, pp. 4007-4018
-
-
Brungart, D.S.1
Chang, P.S.2
Simpson, B.D.3
Wang, D.4
-
7
-
-
79952871923
-
Prediction of speech intelligibility based on an auditory preprocessing model
-
10.1016/j.specom.2010.03.004
-
Christiansen, C., Pedersen, M. S., and Dau, T. (2010). Prediction of speech intelligibility based on an auditory preprocessing model., Speech Commun. 52, 678-692. 10.1016/j.specom.2010.03.004
-
(2010)
Speech Commun.
, vol.52
, pp. 678-692
-
-
Christiansen, C.1
Pedersen, M.S.2
Dau, T.3
-
8
-
-
0029952425
-
A quantitative model of the 'effective' signal processing in the auditory system. I. Model structure
-
DOI 10.1121/1.414959
-
Dau, T., Pschel, D., and Kohlrausch, A. (1996). A quantitative model of the effective signal processing in the auditory system. I. Model structure., J. Acoust. Soc. Am. 99, 3615-3622. 10.1121/1.414959 (Pubitemid 26190250)
-
(1996)
Journal of the Acoustical Society of America
, vol.99
, Issue.6
, pp. 3615-3622
-
-
Dau, T.1
Puschel, D.2
Kohlrausch, A.3
-
9
-
-
0003424145
-
-
(Prentice Hall PTR, Upper Saddle River, NJ)
-
Deller, Jr., J., Proakis, J., and Hansen, J. (1993). Discrete Time Processing Of Speech Signals (Prentice Hall PTR, Upper Saddle River, NJ), pp. 580-593.
-
(1993)
Discrete Time Processing of Speech Signals
, pp. 580-593
-
-
Deller Jr., J.1
Proakis, J.2
Hansen, J.3
-
10
-
-
60049084444
-
The concept of signal-to-noise ratio in the modulation domain and speech intelligibility
-
10.1121/1.3001713
-
Dubbelboer, F., and Houtgast, T. (2008). The concept of signal-to-noise ratio in the modulation domain and speech intelligibility., J. Acoust. Soc. Am. 124, 3937-3946. 10.1121/1.3001713
-
(2008)
J. Acoust. Soc. Am.
, vol.124
, pp. 3937-3946
-
-
Dubbelboer, F.1
Houtgast, T.2
-
11
-
-
0038711696
-
A spectro-temporal modulation index (STMI) for assessment of speech intelligibility
-
10.1016/S0167-6393(02)00134-6
-
Elhilali, M., Chi, T., and Shamma, S. (2003). A spectro-temporal modulation index (STMI) for assessment of speech intelligibility., Speech Commun. 41, 331-348. 10.1016/S0167-6393(02)00134-6
-
(2003)
Speech Commun.
, vol.41
, pp. 331-348
-
-
Elhilali, M.1
Chi, T.2
Shamma, S.3
-
12
-
-
0021645331
-
Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
-
DOI 10.1109/TASSP.1984.1164453
-
Ephraim, Y., and Malah, D. (1984). Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator., IEEE Trans. Acoust. Speech Signal Process. 32, 1109-1121. 10.1109/TASSP.1984.1164453 (Pubitemid 15159457)
-
(1984)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.32
, Issue.6
, pp. 1109-1121
-
-
Ephraim, Y.1
Malah, D.2
-
13
-
-
51449104842
-
Minimum mean-square error estimation of discrete fourier coefficients with generalized gamma priors
-
10.1109/TASL.2007.899233
-
Erkelens, J. S., Hendriks, R. C., Heusdens, R., and Jensen, J. (2007). Minimum mean-square error estimation of discrete fourier coefficients with generalized gamma priors., IEEE Trans. Audio Speech Lang. Process. 15, 1741-1752. 10.1109/TASL.2007.899233
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, pp. 1741-1752
-
-
Erkelens, J.S.1
Hendriks, R.C.2
Heusdens, R.3
Jensen, J.4
-
14
-
-
84953657538
-
Factors governing the intelligibility of speech sounds
-
10.1121/1.1916407
-
French, N. R., and Steinberg, J. C. (1947). Factors governing the intelligibility of speech sounds., J. Acoust. Soc. Am. 19, 90-119. 10.1121/1.1916407
-
(1947)
J. Acoust. Soc. Am.
, vol.19
, pp. 90-119
-
-
French, N.R.1
Steinberg, J.C.2
-
15
-
-
11144348189
-
Analysis of speech-based speech transmission index methods with implications for nonlinear operations
-
DOI 10.1121/1.1804628
-
Goldsworthy, R. L., and Greenberg, J. E. (2004). Analysis of speech-based speech transmission index methods with implications for nonlinear operations., J. Acoust. Soc. Am. 116, 3679-3689. 10.1121/1.1804628 (Pubitemid 40029948)
-
(2004)
Journal of the Acoustical Society of America
, vol.116
, Issue.6
, pp. 3679-3689
-
-
Goldsworthy, R.L.1
Greenberg, J.E.2
-
16
-
-
0017097474
-
Distance measures for speech processing
-
Gray, Jr., A. H., and Markel, J. D. (1976). Distance measures for speech processing., IEEE Trans. Acoust. Speech Signal Process. 24, 380-391. 10.1109/TASSP.1976.1162849 (Pubitemid 8091024)
-
(1976)
IEEE TRANS.ACOUST.SPEECH SIGN.PROC.
, vol.24
, Issue.5
, pp. 380-391
-
-
Gray Jr., A.H.1
Markel, J.D.2
-
18
-
-
78049364397
-
MMSE based noise PSD tracking with low complexity
-
in
-
Hendriks, R. C., Heusdens, R., and Jensen, J. (2010). MMSE based noise PSD tracking with low complexity., in IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4266-4269.
-
(2010)
IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 4266-4269
-
-
Hendriks, R.C.1
Heusdens, R.2
Jensen, J.3
-
19
-
-
35248891610
-
A comparative intelligibility study of single-microphone noise reduction algorithms
-
DOI 10.1121/1.2766778
-
Hu, Y., and Loizou, P. C. (2007a). A comparative intelligibility study of single-microphone noise reduction algorithms., J. Acoust. Soc. Am. 122, 1777-1786. 10.1121/1.2766778 (Pubitemid 47560539)
-
(2007)
Journal of the Acoustical Society of America
, vol.122
, Issue.3
, pp. 1777-1786
-
-
Hu, Y.1
Loizou, P.C.2
-
20
-
-
34447092407
-
Subjective comparison and evaluation of speech enhancement algorithms
-
DOI 10.1016/j.specom.2006.12.006, PII S0167639306001920
-
Hu, Y., and Loizou, P. C. (2007b). Subjective comparison and evaluation of speech enhancement algorithms., Speech Commun. 49, 588-601. 10.1016/j.specom.2006.12.006 (Pubitemid 47031352)
-
(2007)
Speech Communication
, vol.49
, Issue.7-8
, pp. 588-601
-
-
Hu, Y.1
Loizou, P.C.2
-
21
-
-
44149106061
-
Evaluation of objective quality measures for speech enhancement
-
10.1109/TASL.2007.911054
-
Hu, Y., and Loizou, P. C. (2008). Evaluation of objective quality measures for speech enhancement., IEEE Trans. Audio Speech Lang. Process. 16, 229-238. 10.1109/TASL.2007.911054
-
(2008)
IEEE Trans. Audio Speech Lang. Process.
, vol.16
, pp. 229-238
-
-
Hu, Y.1
Loizou, P.C.2
-
22
-
-
0014704814
-
A statistical method for estimation of speech spectral density and formant frequencies
-
Itakura, F., and Saito, S. (1970). A statistical method for estimation of speech spectral density and formant frequencies., Electron. Commun. Jpn. 53, 36-43.
-
(1970)
Electron. Commun. Jpn.
, vol.53
, pp. 36-43
-
-
Itakura, F.1
Saito, S.2
-
23
-
-
17644399140
-
Coherence and the speech intelligibility index
-
DOI 10.1121/1.1862575
-
Kates, J. M., and Arehart, K. H. (2005). Coherence and the speech intelligibility index., J. Acoust. Soc. Am. 117, 2224-2237. 10.1121/1.1862575 (Pubitemid 40570480)
-
(2005)
Journal of the Acoustical Society of America
, vol.117
, Issue.4
, pp. 2224-2237
-
-
Kates, J.M.1
Arehart, K.H.2
-
25
-
-
70349161218
-
Role of mask pattern in intelligibility of ideal binary-masked noisy speech
-
10.1121/1.3179673
-
Kjems, U., Boldt, J. B., Pedersen, M. S., Lunner, T., and Wang, D. (2009). Role of mask pattern in intelligibility of ideal binary-masked noisy speech., J. Acoust. Soc. Am. 126, 1415-1426. 10.1121/1.3179673
-
(2009)
J. Acoust. Soc. Am.
, vol.126
, pp. 1415-1426
-
-
Kjems, U.1
Boldt, J.B.2
Pedersen, M.S.3
Lunner, T.4
Wang, D.5
-
28
-
-
84889381426
-
Methods for the calculation and use of the articulation index
-
10.1121/1.1909094
-
Kryter, K. D. (1962). Methods for the calculation and use of the articulation index., J. Acoust. Soc. Am. 34, 1689-1697. 10.1121/1.1909094
-
(1962)
J. Acoust. Soc. Am.
, vol.34
, pp. 1689-1697
-
-
Kryter, K.D.1
-
29
-
-
84867193517
-
Assessment of objective quality measures for speech intelligibility
-
in
-
Liu, W. M., Jellyman, K. A., Evans, N. W. D., and Mason, J. S. D. (2008). Assessment of objective quality measures for speech intelligibility., in Proceedings of Interspeech, pp. 699-702.
-
(2008)
Proceedings of Interspeech
, pp. 699-702
-
-
Liu, W.M.1
Jellyman, K.A.2
Evans, N.W.D.3
Mason, J.S.D.4
-
30
-
-
0032166975
-
Mimicking the human ear
-
Loizou, P. (1998). Mimicking the human ear., IEEE Sign. Process. Mag. 15, 101-130. 10.1109/79.708543 (Pubitemid 128634179)
-
(1998)
IEEE Signal Processing Magazine
, vol.15
, Issue.5
, pp. 101-130
-
-
Loizou, P.C.1
-
32
-
-
0027868016
-
Evaluation of a noise reduction method - Comparison between observed scores and scores predicted from STI
-
Ludvigsen, C., Elberling, C., and Keidser, G. (1993). Evaluation of a noise reduction method-Comparison between observed scores and scores predicted from STI., Scand. Audiol. Suppl. 38, 50-55. (Pubitemid 23362792)
-
(1993)
Scandinavian Audiology, Supplement
, vol.22
, Issue.38
, pp. 50-55
-
-
Ludvigsen, C.1
Elberling, C.2
Keidser, G.3
-
33
-
-
65549157071
-
Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions
-
10.1121/1.3097493
-
Ma, J., Hu, Y., and Loizou, P. (2009). Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions., J. Acoust. Soc. Am. 125, 3387-3405. 10.1121/1.3097493
-
(2009)
J. Acoust. Soc. Am.
, vol.125
, pp. 3387-3405
-
-
Ma, J.1
Hu, Y.2
Loizou, P.3
-
34
-
-
0035396555
-
Noise power spectral density estimation based on optimal smoothing and minimum statistics
-
DOI 10.1109/89.928915, PII S106366760104980X
-
Martin, R. (2001). Noise power spectral density estimation based on optimal smoothing and minimum statistics., IEEE Trans. Speech Audio Process. 9, 504-512. 10.1109/89.928915 (Pubitemid 32631178)
-
(2001)
IEEE Transactions on Speech and Audio Processing
, vol.9
, Issue.5
, pp. 504-512
-
-
Martin, R.1
-
35
-
-
85009100883
-
Usefulness of phase spectrum in human speech perception
-
in
-
Paliwal, K. K., and Alsteris, L. (2003). Usefulness of phase spectrum in human speech perception., in Proceedings of Interspeech, pp. 2117-2120.
-
(2003)
Proceedings of Interspeech
, pp. 2117-2120
-
-
Paliwal, K.K.1
Alsteris, L.2
-
36
-
-
0000460671
-
Complex sounds and auditory images
-
Vol
-
Patterson, R. D., Robinson, K., Holdsworth, J., McKeown, D., Zhang, C., and Allerhand, M. (1992). Complex sounds and auditory images., Auditory Physiology and Perception-Proceedings of the 9th International Symposium on Hearing, Vol. 83, pp. 429-446.
-
(1992)
Auditory Physiology and Perception-Proceedings of the 9th International Symposium on Hearing
, vol.83
, pp. 429-446
-
-
Patterson, R.D.1
Robinson, K.2
Holdsworth, J.3
McKeown, D.4
Zhang, C.5
Allerhand, M.6
-
37
-
-
0029007678
-
Quantifying the relation between speech quality and speech intelligibility
-
Preminger, J., and Tasell, D. (1995). Quantifying the relation between speech quality and speech intelligibility., J. Speech Lang. Hear. Res. 38, 714.
-
(1995)
J. Speech Lang. Hear. Res.
, vol.38
, pp. 714
-
-
Preminger, J.1
Tasell, D.2
-
38
-
-
0003560513
-
-
(Prentice-Hall, Englewood Cliffs, NJ)
-
Quackenbush, S. R., Barnwell, T. P., and Clements, M. A. (1988). Objective Measures of Speech Quality (Prentice-Hall, Englewood Cliffs, NJ), pp. 1-377.
-
(1988)
Objective Measures of Speech Quality
, pp. 1-377
-
-
Quackenbush, S.R.1
Barnwell, T.P.2
Clements, M.A.3
-
39
-
-
17644371385
-
A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners
-
DOI 10.1121/1.1861713
-
Rhebergen, K. S., and Versfeld, N. J. (2005). A speech intelligibility index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners., J. Acoust. Soc. Am. 117, 2181-2192. 10.1121/1.1861713 (Pubitemid 40570476)
-
(2005)
Journal of the Acoustical Society of America
, vol.117
, Issue.4
, pp. 2181-2192
-
-
Rhebergen, K.S.1
Versfeld, N.J.2
-
40
-
-
0028823541
-
Speech recognition with primarily temporal cues
-
10.1126/science.270.5234.303
-
Shannon, R., Zeng, F., Kamath, V., Wygonski, J., and Ekelid, M. (1995). Speech recognition with primarily temporal cues., Science 270, 303. 10.1126/science.270.5234.303
-
(1995)
Science
, vol.270
, pp. 303
-
-
Shannon, R.1
Zeng, F.2
Kamath, V.3
Wygonski, J.4
Ekelid, M.5
-
42
-
-
0018906941
-
A physical method for measuring speech-transmission quality
-
Steeneken, H. J. M., and Houtgast, T. (1980). A physical method for measuring speech-transmission quality., J. Acoust. Soc. Am. 67, 318-326. 10.1121/1.384464 (Pubitemid 10136600)
-
(1980)
Journal of the Acoustical Society of America
, vol.67
, Issue.1
, pp. 318-326
-
-
Steeneken, H.J.M.1
Houtgast, T.2
-
43
-
-
85008536355
-
On predicting the difference in intelligibility before and after single-channel noise reduction
-
(Tel Aviv, Israel)
-
Taal, C. H., Hendriks, R. C., Heusdens, R., and Jensen, J. (2010). On predicting the difference in intelligibility before and after single-channel noise reduction., in International Workshop on Acoustic Echo and Noise Control (Tel Aviv, Israel).
-
(2010)
International Workshop on Acoustic Echo and Noise Control
-
-
Taal, C.H.1
Hendriks, R.C.2
Heusdens, R.3
Jensen, J.4
-
44
-
-
70450161547
-
An evaluation of objective quality measures for speech intelligibility prediction
-
in
-
Taal, C. H., Hendriks, R. C., Heusdens, R., Jensen, J., and Kjems, U. (2009). An evaluation of objective quality measures for speech intelligibility prediction., in Proceedings of Interspeech, pp. 1947-1950.
-
(2009)
Proceedings of Interspeech
, pp. 1947-1950
-
-
Taal, C.H.1
Hendriks, R.C.2
Heusdens, R.3
Jensen, J.4
Kjems, U.5
-
45
-
-
0017787719
-
A study of complexity and quality of speech waveform coders
-
Vol
-
Tribolet, J. M., Noll, P., McDermott, B. J., and Crochiere, R. E. (1978). A study of complexity and quality of speech waveform coders., in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 3, pp. 586-590.
-
(1978)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, vol.3
, pp. 586-590
-
-
Tribolet, J.M.1
Noll, P.2
McDermott, B.J.3
Crochiere, R.E.4
-
46
-
-
27844508054
-
A perceptual model for sinusoidal audio coding based on spectral integration
-
DOI 10.1155/ASP.2005.1292
-
van de Par, S., Kohlrausch, A., Heusdens, R., Jensen, J., and Jensen, S. (2005). A perceptual model for sinusoidal audio coding based on spectral integration., EURASIP J. Appl. Sign. Process. 2005, 1292-1304. 10.1155/ASP.2005.1292 (Pubitemid 41639223)
-
(2005)
Eurasip Journal on Applied Signal Processing
, vol.2005
, Issue.9
, pp. 1292-1304
-
-
Van De Par, S.1
Kohlrausch, A.2
Heusdens, R.3
Jensen, J.4
Jensen, S.H.5
-
47
-
-
0037504237
-
Design, optimization and evaluation of a Danish sentence test in noise
-
Wagener, K., Josvassen, J. L., and Ardenkjaer, R. (2003). Design, optimization and evaluation of a Danish sentence test in noise., Int. J. Audiol. 42, 10-17. 10.3109/14992020309056080 (Pubitemid 37372682)
-
(2003)
International Journal of Audiology
, vol.42
, Issue.1
, pp. 10-17
-
-
Wagener, K.1
Josvassen, J.L.2
Ardenkjaer, R.3
-
48
-
-
84892233308
-
On ideal binary mask as the computational goal of auditory scene analysis
-
edited by P. Divenyi (Springer, New York)
-
Wang, D. (2005). On ideal binary mask as the computational goal of auditory scene analysis., in Speech Separation by Humans and Machines, edited by, P. Divenyi, (Springer, New York), pp. 181-197.
-
(2005)
Speech Separation by Humans and Machines
, pp. 181-197
-
-
Wang, D.1
-
49
-
-
44949234533
-
Word intelligibility estimation of noise-reduced speech
-
in
-
Yamada, T., Kumakura, M., and Kitawaki, N. (2006). Word intelligibility estimation of noise-reduced speech., in Proceeding of Interspeech (ISCA), pp. 169-172.
-
(2006)
Proceeding of Interspeech (ISCA)
, pp. 169-172
-
-
Yamada, T.1
Kumakura, M.2
Kitawaki, N.3
|