SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 4488-4492

An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement

(5) Tanaka, Kou a Toda, Tomoki a Neubig, Graham a Sakti, Sakriani a Nakamura, Satoshi a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

electrolaryngeal speech; hybrid approach; speaking aid; statistical excitation prediction; unvoiced voiced information

Indexed keywords

ACOUSTIC NOISE; NOISE ABATEMENT; SIGNAL FILTERING AND PREDICTION; SPEECH; SPEECH ENHANCEMENT;

ELECTROLARYNGEAL SPEECH; EXCITATION PARAMETERS; HYBRID APPROACH; MECHANICAL EXCITATIONS; NOISE REDUCTION METHODS; SPEAKING AID; SPECTRAL PARAMETERS; UNVOICED/VOICED INFORMATION;

FORECASTING;

EID: 84905252904 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854451 Document Type: Conference Paper

Times cited : (8)

References (19)

1
- 33646358742
- Enhancement of electrolarynx speech based on auditory masking
- May
- H. Liu, Q. Zhao, M.X. Wan, and S.P. Wang, "Enhancement of electrolarynx speech based on auditory masking, " IEEE Trans. Biomedical Engineering, vol. 53, no. 5, pp. 865-874, May 2006.
- (2006) IEEE Trans. Biomedical Engineering , vol.53 , Issue.5 , pp. 865-874
- Liu, H.¹ Zhao, Q.² Wan, M.X.³ Wang, S.P.⁴

2
- 84860822493
- Real-time enhancement of electrolaryngeal speech by spectral subtraction
- Feb
- S.K. Basha and P.C. Pandey, "Real-Time Enhancement of Electrolaryngeal Speech by Spectral Subtraction, " Proc. NCC, 1569507449, pp. 516-520, Feb, 2012.
- (2012) Proc. NCC, 1569507449 , pp. 516-520
- Basha, S.K.¹ Pandey, P.C.²

3
- 80052698826
- Speakingaid systems using GMM-based voice conversion for electrolaryngeal speech
- Jan
- K. Nakamura, T. Toda, H. Saruwatari, K. Shikano, "Speakingaid systems using GMM-based voice conversion for electrolaryngeal speech, " SPECOM, vol. 54, no. 1, pp. 134-146, Jan 2012.
- (2012) SPECOM , vol.54 , Issue.1 , pp. 134-146
- Nakamura, K.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

4
- 80051642767
- An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques
- May
- H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques, " Proc. ICASSP, pp. 5136-5139, May 2011.
- (2011) Proc. ICASSP , pp. 5136-5139
- Doi, H.¹ Nakamura, K.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

5
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr
- S.F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, Apr 1979.
- (1979) IEEE Trans. Acoustics, Speech and Signal Processing , vol.27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

6
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, Mar 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

7
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, and Language, Vol. 15, No. 8, pp. 2222-2235, Nov 2007.
- (2007) IEEE Trans. Audio, Speech, and Language , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

8
- 77956916135
- Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec
- Oct
- H.R. Sharifzadeh, I.V. McLoughlin, and F. Ahmadi, "Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec, " IEEE Trans. Biomedical Engineering, vol. 57, no. 10, pp. 2448-2458, Oct 2010.
- (2010) IEEE Trans. Biomedical Engineering , vol.57 , Issue.10 , pp. 2448-2458
- Sharifzadeh, H.R.¹ McLoughlin, I.V.² Ahmadi, F.³

9
- 84905244240
- A Hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion
- Aug
- K. Tanaka, T. Toda, G.Neubig, S. Sakti, and S. Nakamura, "A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Spectral Subtraction and Statistical Voice Conversion, " Proc. INTERSPEECH, pp.3067-3071, Aug. 2013.
- (2013) Proc. INTERSPEECH , pp. 3067-3071
- Tanaka, K.¹ Toda, T.² Neubig, G.³ Sakti, S.⁴ Nakamura, S.⁵

10
- 84865698185
- Statistical voice conversion techniques for body-conducted unvoiced speech enhancement
- Nov
- T. Toda, M. Nakagiri, and K. Shikano, "Statistical voice conversion techniques for body-conducted unvoiced speech enhancement, " IEEE Trans. Audio, Speech, and Language, Vol. 20, No. 9, pp. 2505-2517, Nov 2012.
- (2012) IEEE Trans. Audio, Speech, and Language , vol.20 , Issue.9 , pp. 2505-2517
- Toda, T.¹ Nakagiri, M.² Shikano, K.³

11
- 85008023596
- Continuous F0 modelling for HMM based statistical parametric speech synthesis
- Jul
- K. Yu and S. Young, "Continuous F0 modelling for HMM based statistical parametric speech synthesis, " IEEE Trans. Audio, Speech, and Language, Vol. 19, No. 5, pp. 1071-1079, Jul 2011.
- (2011) IEEE Trans. Audio, Speech, and Language , vol.19 , Issue.5 , pp. 1071-1079
- Yu, K.¹ Young, S.²

12
- 0032123832
- A parametric formulation of the generalized spectral subtraction method
- Jul
- B.L. Sim, Y.C. Tong, J.S. Chang, and C.T. Tan, "A parametric formulation of the generalized spectral subtraction method, " IEEE Trans. Speech and Audio Processing, Vol. 6, No. 4, pp. 328-337, Jul 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.4 , pp. 328-337
- Sim, B.L.¹ Tong, Y.C.² Chang, J.S.³ Tan, C.T.⁴

13
- 0031623661
- Spectral voice conversion for textto-speech synthesis
- May
- A. Kain and M.W. Macon, "Spectral voice conversion for textto-speech synthesis, " Proc. ICASSP, pp. 285-288, May 1998.
- (1998) Proc. ICASSP , pp. 285-288
- Kain, A.¹ MacOn, M.W.²

14
- 84874199000
- Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system STRAIGHT
- Sep
- H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system STRAIGHT, " Proc. 2nd MAVEBA, Sep 2001.
- (2001) Proc. 2nd MAVEBA
- Kawahara, H.¹ Estill, J.² Fujimura, O.³

15
- 0033708106
- Speech parameter generation algorithms for HMMbased speech synthesis
- June
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMMbased speech synthesis, " Proc. ICASSP, pp. 1315-1318, June 2000.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

16
- 44949143155
- Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
- Sep
- Y. Ohtani, T. Toda, H. Saruwatari, K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation, " Proc. Interspeech, pp. 2266-2269, Sep 2006.
- (2006) Proc. Interspeech , pp. 2266-2269
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

17
- 0030359773
- Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours
- Oct
- A. Sakurai and K. Hirose, "Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours, " Proc. ICSLP, Vol. 2, pp. 817-820, Oct 1996.
- (1996) Proc. ICSLP , vol.2 , pp. 817-820
- Sakurai, A.¹ Hirose, K.²

18
- 0032673049
- Restructuring speech representations using a pitch-Adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- Apr
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-Adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " SPECOM, Vol. 27, No. 3-4, pp. 187-207, Apr 1999.
- (1999) SPECOM , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigne, A.³

19
- 84905280343
- Augmented speech production beyond physical constraints using statistical voice conversion-Alaryngeal speech enhancement and singing voice quality control
- NAIST-IS-DD1061014, Mar
- H. Doi., "Augmented speech production beyond physical constraints using statistical voice conversion-Alaryngeal speech enhancement and singing voice quality control-, " NAIST Doctoral Dissertation, NAIST-IS-DD1061014, Mar 2013
- (2013) NAIST Doctoral Dissertation
- Doi, H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.