메뉴 건너뛰기




Volumn , Issue , 2014, Pages 4488-4492

An evaluation of excitation feature prediction in a hybrid approach to electrolaryngeal speech enhancement

Author keywords

electrolaryngeal speech; hybrid approach; speaking aid; statistical excitation prediction; unvoiced voiced information

Indexed keywords

ACOUSTIC NOISE; NOISE ABATEMENT; SIGNAL FILTERING AND PREDICTION; SPEECH; SPEECH ENHANCEMENT;

EID: 84905252904     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854451     Document Type: Conference Paper
Times cited : (8)

References (19)
  • 1
    • 33646358742 scopus 로고    scopus 로고
    • Enhancement of electrolarynx speech based on auditory masking
    • May
    • H. Liu, Q. Zhao, M.X. Wan, and S.P. Wang, "Enhancement of electrolarynx speech based on auditory masking, " IEEE Trans. Biomedical Engineering, vol. 53, no. 5, pp. 865-874, May 2006.
    • (2006) IEEE Trans. Biomedical Engineering , vol.53 , Issue.5 , pp. 865-874
    • Liu, H.1    Zhao, Q.2    Wan, M.X.3    Wang, S.P.4
  • 2
    • 84860822493 scopus 로고    scopus 로고
    • Real-time enhancement of electrolaryngeal speech by spectral subtraction
    • Feb
    • S.K. Basha and P.C. Pandey, "Real-Time Enhancement of Electrolaryngeal Speech by Spectral Subtraction, " Proc. NCC, 1569507449, pp. 516-520, Feb, 2012.
    • (2012) Proc. NCC, 1569507449 , pp. 516-520
    • Basha, S.K.1    Pandey, P.C.2
  • 3
    • 80052698826 scopus 로고    scopus 로고
    • Speakingaid systems using GMM-based voice conversion for electrolaryngeal speech
    • Jan
    • K. Nakamura, T. Toda, H. Saruwatari, K. Shikano, "Speakingaid systems using GMM-based voice conversion for electrolaryngeal speech, " SPECOM, vol. 54, no. 1, pp. 134-146, Jan 2012.
    • (2012) SPECOM , vol.54 , Issue.1 , pp. 134-146
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 4
    • 80051642767 scopus 로고    scopus 로고
    • An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques
    • May
    • H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques, " Proc. ICASSP, pp. 5136-5139, May 2011.
    • (2011) Proc. ICASSP , pp. 5136-5139
    • Doi, H.1    Nakamura, K.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 5
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S.F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, Apr 1979.
    • (1979) IEEE Trans. Acoustics, Speech and Signal Processing , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 7
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, and Language, Vol. 15, No. 8, pp. 2222-2235, Nov 2007.
    • (2007) IEEE Trans. Audio, Speech, and Language , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 8
    • 77956916135 scopus 로고    scopus 로고
    • Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec
    • Oct
    • H.R. Sharifzadeh, I.V. McLoughlin, and F. Ahmadi, "Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec, " IEEE Trans. Biomedical Engineering, vol. 57, no. 10, pp. 2448-2458, Oct 2010.
    • (2010) IEEE Trans. Biomedical Engineering , vol.57 , Issue.10 , pp. 2448-2458
    • Sharifzadeh, H.R.1    McLoughlin, I.V.2    Ahmadi, F.3
  • 9
    • 84905244240 scopus 로고    scopus 로고
    • A Hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion
    • Aug
    • K. Tanaka, T. Toda, G.Neubig, S. Sakti, and S. Nakamura, "A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Spectral Subtraction and Statistical Voice Conversion, " Proc. INTERSPEECH, pp.3067-3071, Aug. 2013.
    • (2013) Proc. INTERSPEECH , pp. 3067-3071
    • Tanaka, K.1    Toda, T.2    Neubig, G.3    Sakti, S.4    Nakamura, S.5
  • 10
    • 84865698185 scopus 로고    scopus 로고
    • Statistical voice conversion techniques for body-conducted unvoiced speech enhancement
    • Nov
    • T. Toda, M. Nakagiri, and K. Shikano, "Statistical voice conversion techniques for body-conducted unvoiced speech enhancement, " IEEE Trans. Audio, Speech, and Language, Vol. 20, No. 9, pp. 2505-2517, Nov 2012.
    • (2012) IEEE Trans. Audio, Speech, and Language , vol.20 , Issue.9 , pp. 2505-2517
    • Toda, T.1    Nakagiri, M.2    Shikano, K.3
  • 11
    • 85008023596 scopus 로고    scopus 로고
    • Continuous F0 modelling for HMM based statistical parametric speech synthesis
    • Jul
    • K. Yu and S. Young, "Continuous F0 modelling for HMM based statistical parametric speech synthesis, " IEEE Trans. Audio, Speech, and Language, Vol. 19, No. 5, pp. 1071-1079, Jul 2011.
    • (2011) IEEE Trans. Audio, Speech, and Language , vol.19 , Issue.5 , pp. 1071-1079
    • Yu, K.1    Young, S.2
  • 12
    • 0032123832 scopus 로고    scopus 로고
    • A parametric formulation of the generalized spectral subtraction method
    • Jul
    • B.L. Sim, Y.C. Tong, J.S. Chang, and C.T. Tan, "A parametric formulation of the generalized spectral subtraction method, " IEEE Trans. Speech and Audio Processing, Vol. 6, No. 4, pp. 328-337, Jul 1998.
    • (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.4 , pp. 328-337
    • Sim, B.L.1    Tong, Y.C.2    Chang, J.S.3    Tan, C.T.4
  • 13
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for textto-speech synthesis
    • May
    • A. Kain and M.W. Macon, "Spectral voice conversion for textto-speech synthesis, " Proc. ICASSP, pp. 285-288, May 1998.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    MacOn, M.W.2
  • 14
    • 84874199000 scopus 로고    scopus 로고
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system STRAIGHT
    • Sep
    • H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system STRAIGHT, " Proc. 2nd MAVEBA, Sep 2001.
    • (2001) Proc. 2nd MAVEBA
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3
  • 15
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMMbased speech synthesis
    • June
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMMbased speech synthesis, " Proc. ICASSP, pp. 1315-1318, June 2000.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 16
    • 44949143155 scopus 로고    scopus 로고
    • Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
    • Sep
    • Y. Ohtani, T. Toda, H. Saruwatari, K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation, " Proc. Interspeech, pp. 2266-2269, Sep 2006.
    • (2006) Proc. Interspeech , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 17
    • 0030359773 scopus 로고    scopus 로고
    • Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours
    • Oct
    • A. Sakurai and K. Hirose, "Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours, " Proc. ICSLP, Vol. 2, pp. 817-820, Oct 1996.
    • (1996) Proc. ICSLP , vol.2 , pp. 817-820
    • Sakurai, A.1    Hirose, K.2
  • 18
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-Adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • Apr
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-Adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " SPECOM, Vol. 27, No. 3-4, pp. 187-207, Apr 1999.
    • (1999) SPECOM , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 19
    • 84905280343 scopus 로고    scopus 로고
    • Augmented speech production beyond physical constraints using statistical voice conversion-Alaryngeal speech enhancement and singing voice quality control
    • NAIST-IS-DD1061014, Mar
    • H. Doi., "Augmented speech production beyond physical constraints using statistical voice conversion-Alaryngeal speech enhancement and singing voice quality control-, " NAIST Doctoral Dissertation, NAIST-IS-DD1061014, Mar 2013
    • (2013) NAIST Doctoral Dissertation
    • Doi, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.