-
2
-
-
0037381121
-
Segmental intelligibility of four currently used text-to-speech synthesis methods
-
H. Venkatagiri, "Segmental intelligibility of four currently used text-to-speech synthesis methods," J. Acoust. Soc. Am., vol. 113, pp. 2095-2104, 2003.
-
(2003)
J. Acoust. Soc. Am.
, vol.113
, pp. 2095-2104
-
-
Venkatagiri, H.1
-
3
-
-
33646819304
-
Improving the understandability of speech synthesis by modeling speech in noise
-
B. Langner and A. W. Black, "Improving the understandability of speech synthesis by modeling speech in noise," in Proc. ICASSP, vol. 1, 2005, pp. 265-268.
-
(2005)
Proc. ICASSP
, vol.1
, pp. 265-268
-
-
Langner, B.1
Black, A.W.2
-
4
-
-
84937998754
-
Audio dynamic range compression for minimum perceived distortion
-
B. A. Blesser, "Audio dynamic range compression for minimum perceived distortion," IEEE Trans. on Audio and Electroacoustics, vol. 17, no. 1, 1969.
-
(1969)
IEEE Trans. on Audio and Electroacoustics
, vol.17
, Issue.1
-
-
Blesser, B.A.1
-
5
-
-
0016990909
-
The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression
-
R. J. Niederjohn and J. H. Grotelueschen, "The enhancement of speech intelligibility in high noise levels by high-pass filtering followed by rapid amplitude compression," IEEE Trans. on Acoustics, Speech and Signal Processing, vol. 24, no. 4, pp. 277-282, 1976.
-
(1976)
IEEE Trans. on Acoustics, Speech and Signal Processing
, vol.24
, Issue.4
, pp. 277-282
-
-
Niederjohn, R.J.1
Grotelueschen, J.H.2
-
6
-
-
0031356549
-
LSP-based speech modification for intelligibility enhancement
-
Santorini, Greece
-
I. V. McLoughlin and R. J. Chance, "LSP-based speech modification for intelligibility enhancement," in Proc. Digital Signal Processing, vol. 2, Santorini, Greece, 1997, pp. 591-594.
-
(1997)
Proc. Digital Signal Processing
, vol.2
, pp. 591-594
-
-
McLoughlin, I.V.1
Chance, R.J.2
-
7
-
-
33947613677
-
Near end listening enhancement: Speech intelligibility improvement in noisy environments
-
Toulouse, France, May
-
B. Sauert and P. Vary, "Near end listening enhancement: Speech intelligibility improvement in noisy environments," in Proc. ICASSP, Toulouse, France, May 2006, pp. 493-496.
-
(2006)
Proc. ICASSP
, pp. 493-496
-
-
Sauert, B.1
Vary, P.2
-
8
-
-
33645998440
-
Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments
-
M. D. Skowronski and J. G. Harris, "Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments," Speech Communication, vol. 48, no. 5, pp. 549-558, 2006.
-
(2006)
Speech Communication
, vol.48
, Issue.5
, pp. 549-558
-
-
Skowronski, M.D.1
Harris, J.G.2
-
9
-
-
34547618996
-
Speech signal modification to increase intelligibility in noisy environments
-
Aug
-
S. D. Yoo, J. R. Boston, A. El-Jaroudi, C.-C. Li, J. D. Durrant, K. Kovacyk, and S. Shaiman, "Speech signal modification to increase intelligibility in noisy environments," J. Acoust. Soc. Am., vol. 122, no. 2, pp. 1138-1149, Aug. 2007.
-
(2007)
J. Acoust. Soc. Am.
, vol.122
, Issue.2
, pp. 1138-1149
-
-
Yoo, S.D.1
Boston, J.R.2
El-Jaroudi, A.3
Li, C.-C.4
Durrant, J.D.5
Kovacyk, K.6
Shaiman, S.7
-
10
-
-
84867212676
-
Time and frequency dependent amplification for speech intelligibility enhancement in noisy environments
-
H. Brouckxon, W. Verhelst, and B. D. Schuymer, "Time and frequency dependent amplification for speech intelligibility enhancement in noisy environments," in Proc. Interspeech, 2008, pp. 557-560.
-
(2008)
Proc. Interspeech
, pp. 557-560
-
-
Brouckxon, H.1
Verhelst, W.2
Schuymer, B.D.3
-
11
-
-
79959812739
-
Energy reallocation strategies for speech enhancement in known noise conditions
-
Y. Tang and M. Cooke, "Energy reallocation strategies for speech enhancement in known noise conditions," in Proc. Interspeech, 2010, pp. 1636-1639.
-
(2010)
Proc. Interspeech
, pp. 1636-1639
-
-
Tang, Y.1
Cooke, M.2
-
12
-
-
84875226067
-
Reactive speech synthesis: Actively managing phonetic contrast along an H&H continuum
-
Hong Kong, China
-
R. K. Moore and M. Nicolao, "Reactive speech synthesis: Actively managing phonetic contrast along an H&H continuum," in ICPhS 2011, Hong Kong, China, 2011, pp. 1422-1425.
-
(2011)
ICPhS 2011
, pp. 1422-1425
-
-
Moore, R.K.1
Nicolao, M.2
-
13
-
-
84875224029
-
Near end listening enhancement considering thermal limit of mobile phone loudspeakers
-
Aachen, Germany
-
B. Sauert and P. Vary, "Near end listening enhancement considering thermal limit of mobile phone loudspeakers," in Proc. Conf. on Elektronische Sprachsignalverarbeitung (ESSV), vol. 61, Aachen, Germany, 2011, pp. 333-340.
-
(2011)
Proc. Conf. on Elektronische Sprachsignalverarbeitung (ESSV)
, vol.61
, pp. 333-340
-
-
Sauert, B.1
Vary, P.2
-
14
-
-
84867593799
-
A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure
-
C. H. Taal, R. C. Hendriks, and R. Heusdens, "A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure," in Proc. ICASSP, 2012, pp. 4061-4064.
-
(2012)
Proc. ICASSP
, pp. 4061-4064
-
-
Taal, C.H.1
Hendriks, R.C.2
Heusdens, R.3
-
15
-
-
84878419232
-
Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression
-
Portland, USA
-
T. C. Zorilă, V. Kandia, and Y. Stylianou, "Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression," in Proc. Interspeech, Portland, USA, 2012.
-
(2012)
Proc. Interspeech
-
-
Zorila, T.C.1
Kandia, V.2
Stylianou, Y.3
-
16
-
-
84875231469
-
Evaluating the intelligibility benefit of speech modifications in known noise conditions
-
M. Cooke, C. Mayo, C. Valentini-Botinhao, Y. Stylianou, B. Sauert, and Y. Tang, "Evaluating the intelligibility benefit of speech modifications in known noise conditions," Speech Communication, vol. 55, pp. 572-585, 2013.
-
(2013)
Speech Communication
, vol.55
, pp. 572-585
-
-
Cooke, M.1
Mayo, C.2
Valentini-Botinhao, C.3
Stylianou, Y.4
Sauert, B.5
Tang, Y.6
-
17
-
-
0014568991
-
IEEE recommended practice for speech quality measurements
-
E. H. Rothauser, W. D. Chapman, N. Guttman, H. R. Silbiger, M. H. L. Hecker, G. E. Urbanek, K. S. Nordby, and M. Weinstock, "IEEE Recommended practice for speech quality measurements," IEEE Trans. on Audio and Electroacoustics, vol. 17, pp. 225-246, 1969.
-
(1969)
IEEE Trans. on Audio and Electroacoustics
, vol.17
, pp. 225-246
-
-
Rothauser, E.H.1
Chapman, W.D.2
Guttman, N.3
Silbiger, H.R.4
Hecker, M.H.L.5
Urbanek, G.E.6
Nordby, K.S.7
Weinstock, M.8
-
18
-
-
34547181924
-
Psychoacoustic speech tests: A modified rhyme test
-
A. S. House, C. Williams, M. H. L. Hecker, and K. D. Kryter, "Psychoacoustic speech tests: A modified rhyme test," J. Acoust. Soc. Am., vol. 35, no. 11, pp. 1899-1899, 1963.
-
(1963)
J. Acoust. Soc. Am.
, vol.35
, Issue.11
, pp. 1899-1899
-
-
House, A.S.1
Williams, C.2
Hecker, M.H.L.3
Kryter, K.D.4
-
19
-
-
0034920512
-
ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for hearing aid assessment
-
W. A. Dreschler, H. Verschuure, C. Ludvigsen, and S. Westermann, "ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for hearing aid assessment," Audiology, vol. 40, pp. 148-157, 2001.
-
(2001)
Audiology
, vol.40
, pp. 148-157
-
-
Dreschler, W.A.1
Verschuure, H.2
Ludvigsen, C.3
Westermann, S.4
-
20
-
-
33644661135
-
A glimpsing model of speech perception in noise
-
M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Am., vol. 119, no. 3, pp. 1562-1573, 2006.
-
(2006)
J. Acoust. Soc. Am.
, vol.119
, Issue.3
, pp. 1562-1573
-
-
Cooke, M.1
-
21
-
-
77955464549
-
Cochlea-scaled entropy, not consonants, vowels, or time, best predicts speech intelligibility
-
C. Stilp and K. Kluender, "Cochlea-scaled entropy, not consonants, vowels, or time, best predicts speech intelligibility," Proceedings of the National Academy of Sciences, vol. 107, no. 27, pp. 12387-12392, 2010.
-
(2010)
Proceedings of the National Academy of Sciences
, vol.107
, Issue.27
, pp. 12387-12392
-
-
Stilp, C.1
Kluender, K.2
-
22
-
-
84906253654
-
Speech enhancement using EMD-based adaptive soft-thresholding (EMDADT)
-
June
-
M. E. Hamid, S. Das, K. Hirose, and M. K. I. Molla, "Speech enhancement using EMD-based adaptive soft-thresholding (EMDADT)," International Journal of Signal Processing, Image Processing and Pattern Recognition, vol. 5, no. 2, June 2012.
-
(2012)
International Journal of Signal Processing, Image Processing and Pattern Recognition
, vol.5
, Issue.2
-
-
Hamid, M.E.1
Das, S.2
Hirose, K.3
Molla, M.K.I.4
-
23
-
-
84869752883
-
Detection of stop consonants in continuous noisy speech based on an extrapolation technique
-
R. Dokku and R. Martin, "Detection of stop consonants in continuous noisy speech based on an extrapolation technique," in Proc. EUSIPCO, 2012, pp. 2338-2342.
-
(2012)
Proc. EUSIPCO
, pp. 2338-2342
-
-
Dokku, R.1
Martin, R.2
-
24
-
-
84873396291
-
On optimal linear filtering of speech for near-end listening enhancement
-
IEEE
-
C. Taal, J. Jensen, and A. Leijon, "On optimal linear filtering of speech for near-end listening enhancement," Signal Processing Letters, IEEE, vol. 20, no. 3, pp. 225-228, 2013.
-
(2013)
Signal Processing Letters
, vol.20
, Issue.3
, pp. 225-228
-
-
Taal, C.1
Jensen, J.2
Leijon, A.3
-
25
-
-
0012330750
-
The design for the Wall Street Journal-based CSR corpus
-
D. B. Paul and J. M. Baker, "The design for the Wall Street Journal-based CSR corpus," in Proc. Workshop Speech Natural Lang., 1992, pp. 357-362.
-
(1992)
Proc. Workshop Speech Natural Lang.
, pp. 357-362
-
-
Paul, D.B.1
Baker, J.M.2
-
26
-
-
84873926312
-
Maximizing phoneme recognition accuracy for enhanced speech intelligibility in noise
-
P. N. Petkov, G. E. Henter, and W. B. Kleijn, "Maximizing phoneme recognition accuracy for enhanced speech intelligibility in noise," IEEE Trans. Audio, Speech and Lang. Proc., vol. 21, no. 5, pp. 1035-1045, 2013.
-
(2013)
IEEE Trans. Audio, Speech and Lang. Proc.
, vol.21
, Issue.5
, pp. 1035-1045
-
-
Petkov, P.N.1
Henter, G.E.2
Kleijn, W.B.3
-
27
-
-
84906254679
-
Efficient non-uniform time-scaling of speech with WSOLA
-
Stellenbosch, South Africa
-
M. Demol, W. Verhelst, K. Struyve, and P. Verhoeve, "Efficient non-uniform time-scaling of speech with WSOLA," in Proc. ISCA-ITRW Multiling 2006, Stellenbosch, South Africa, 2006.
-
(2006)
Proc. ISCA-ITRW Multiling 2006
-
-
Demol, M.1
Verhelst, W.2
Struyve, K.3
Verhoeve, P.4
-
28
-
-
33745080279
-
Improving syllable identification by a preprocessing method reducing overlap-masking in reverberant environments
-
N. Hodoshima, T. Arai, A. Kusumoto, and K. Kinoshita, "Improving syllable identification by a preprocessing method reducing overlap-masking in reverberant environments," J. Acoust. Soc. Am., vol. 119, pp. 4055-4064, 2006.
-
(2006)
J. Acoust. Soc. Am.
, vol.119
, pp. 4055-4064
-
-
Hodoshima, N.1
Arai, T.2
Kusumoto, A.3
Kinoshita, K.4
-
29
-
-
84878385645
-
Mel cepstral coefficient modification based on the glimpse proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise
-
Portland, USA
-
C. Valentini-Botinhao, J. Yamagishi, and S. King, "Mel cepstral coefficient modification based on the Glimpse Proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise," in Proc. Interspeech, Portland, USA, 2012.
-
(2012)
Proc. Interspeech
-
-
Valentini-Botinhao, C.1
Yamagishi, J.2
King, S.3
-
30
-
-
84878397412
-
C2H: A computational model of H&H-based phonetic contrast in synthetic speech
-
Portland, USA
-
M. Nicolao, J. Latorre, and R. K. Moore, "C2H: A computational model of H&H-based phonetic contrast in synthetic speech," in Proc. Interspeech, Portland, USA, 2012.
-
(2012)
Proc. Interspeech
-
-
Nicolao, M.1
Latorre, J.2
Moore, R.K.3
-
31
-
-
77957744515
-
HMM-based speech synthesis utilizing glottal inverse filtering
-
T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku, "HMM-based speech synthesis utilizing glottal inverse filtering," IEEE Trans. on Audio, Speech, and Lang. Proc., vol. 19, no. 1, pp. 153-165, 2011.
-
(2011)
IEEE Trans. on Audio, Speech, and Lang. Proc.
, vol.19
, Issue.1
, pp. 153-165
-
-
Raitio, T.1
Suni, A.2
Yamagishi, J.3
Pulakka, H.4
Nurminen, J.5
Vainio, M.6
Alku, P.7