Volume 22, Issue 1, 2014, Pages 172-183

Alaryngeal speech enhancement based on one-to-many eigenvoice conversion

Author keywords

Alaryngeal speech; Eigenvoice conversion; Laryngectomees; Speech enhancement; Voice conversion

Indexed keywords

HANDICAPPED PERSONS; SPEECH ENHANCEMENT; SPEECH PRODUCTION AIDS; SPEECH PROCESSING;

EID: 84897939966     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2013.2286917     Document Type: Article
Times cited: 59

References (29)
  • 1
    • 77956027630
    • Evaluation of extremely small sound source signals used in speaking-aid system with statistical voice conversion
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Evaluation of extremely small sound source signals used in speaking-aid system with statistical voice conversion," IEICE Trans. Inf. Syst., vol. E93-D, no. 7, pp. 1909-1917, Jul. 2010.
    • (2010) IEICE Trans. Inf. Syst. , vol.E93-D , Issue.7 , pp. 1909-1917
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 3
    • 79959833617
    • The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion," in Proc. Interspeech, Sep. 2010, pp. 1628-1631.
    • Proc. Interspeech, Sep. 2010 , pp. 1628-1631
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 4
    • 33751234295
    • Real-time clarification of esophageal speech using a comb filter
    • A. Hisada and H. Sawada, "Real-time clarification of esophageal speech using a comb filter," in Proc. ICDVRAT, Sep. 2002, pp. 39-46.
    • Proc. ICDVRAT, Sep. 2002 , pp. 39-46
    • Hisada, A.1    Sawada, H.2
  • 6
    • 77956916135
    • Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec
    • H. R. Sharifzadeh, I. V. McLoughlin, and F. Ahmadi, "Reconstruction of normal sounding speech for laryngectomy patients through a modified CELP codec," IEEE Trans. Biomed. Eng., vol. 57, no. 10, pp. 2448-2458, Oct. 2010.
    • (2010) IEEE Trans. Biomed. Eng. , vol.57 , Issue.10 , pp. 2448-2458
    • Sharifzadeh, H.R.1    McLoughlin, I.V.2    Ahmadi, F.3
  • 7
    • 33646358742
    • Enhancement of electrolarynx speech based on auditory masking
    • DOI 10.1109/TBME.2006.872821, 1621138
    • H. Liu, Q. Zhao, M. Wan, and S. Wang, "Enhancement of electrolarynx speech based on auditory masking," IEEE Trans. Biomed. Eng., vol. 53, no. 5, pp. 865-874, May 2006. (Pubitemid 43667746)
    • (2006) IEEE Transactions on Biomedical Engineering , vol.53 , Issue.5 , pp. 865-874
    • Liu, H.1    Zhao, Q.2    Wan, M.3    Wang, S.4
  • 9
    • 80051642767
    • An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques
    • H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques," in Proc. ICASSP, May 2011, pp. 5136-5139.
    • Proc. ICASSP, May 2011 , pp. 5136-5139
    • Doi, H.1    Nakamura, K.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 12
    • 34547496175
    • One-to-many and many-to-one voice conversion based on eigenvoices
    • T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices," in Proc. ICASSP, Apr. 2007, pp. 1249-1252.
    • Proc. ICASSP, Apr. 2007 , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 13
    • 77956795483
    • Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models
    • H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models," IEICE Trans. Inf. Syst., vol. E93-D, no. 9, pp. 2472-2482, Sep. 2010.
    • (2010) IEICE Trans. Inf. Syst. , vol.E93-D , Issue.9 , pp. 2472-2482
    • Doi, H.1    Nakamura, K.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 14
    • 84928118106
    • Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
    • H. Kawahara, H. Katayose, A. Cheveigne, and R. D. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity," in Proc. EUROSPEECH, Sep. 1999, pp. 2781-2784.
    • Proc. EUROSPEECH, Sep. 1999 , pp. 2781-2784
    • Kawahara, H.1    Katayose, H.2    Cheveigne, A.3    Patterson, R.D.4
  • 15
    • 44949143155
    • Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," Interspeech '06-ICSLP, pp. 2266-2269, Sep. 2006.
    • (2006) Interspeech '06-ICSLP , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 16
    • 84874199000
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
    • H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT," in Proc. MAVEBA, Sep. 2001.
    • Proc. MAVEBA, Sep. 2001
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3
  • 17
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
    • (1999) Speech Commun , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 18
    • 76849086888
    • Silent-speech enhancement using body-conducted vocal-tract resonance signals
    • T. Hirahara, M. Otani, S. Shimizu, T. Toda, and K. Nakamura, "Silent-speech enhancement using body-conducted vocal-tract resonance signals," Speech Commun., vol. 52, no. 4, pp. 301-313, Apr. 2010.
    • (2010) Speech Commun , vol.52 , Issue.4 , pp. 301-313
    • Hirahara, T.1    Otani, M.2    Shimizu, S.3    Toda, T.4    Nakamura, K.5
  • 19
    • 57749193836
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 20
    • 0031623661
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," in Proc. ICASSP, May 1998, pp. 285-288.
    • Proc. ICASSP, May 1998 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 23
    • 38049064114
    • Prediction of fundamental frequency and voicing from mel-frequency cepstral coefficients for unconstrained speech reconstruction
    • B. Milner and X. Shao, "Prediction of fundamental frequency and voicing from mel-frequency cepstral coefficients for unconstrained speech reconstruction," IEEE Trans. Speech Audio Process., vol. 15, no. 1, pp. 24-33, Dec. 2007.
    • (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.1 , pp. 24-33
    • Milner, B.1    Shao, X.2
  • 24
    • 60049098432
    • Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures
    • J. Darch, B. Milner, and S. Vaseghi, "Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures," J. Acoust. Soc. Amer., vol. 124, no. 6, pp. 3989-4000, Dec. 2008.
    • (2008) J. Acoust. Soc. Amer. , vol.124 , Issue.6 , pp. 3989-4000
    • Darch, J.1    Milner, B.2    Vaseghi, S.3
  • 25
    • 38649140222
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • DOI 10.1016/j.specom.2007.09.001, PII S0167639307001495
    • T. Toda, A. W. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model," Speech Commun., vol. 50, no. 3, pp. 215-227, Mar. 2008. (Pubitemid 351172471)
    • (2008) Speech Communication , vol.50 , Issue.3 , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 26
    • 84865698185
    • Statistical voice conversion techniques for body-conducted unvoiced speech enhancement
    • T. Toda, M. Nakagiri, and K. Shikano, "Statistical voice conversion techniques for body-conducted unvoiced speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 9, pp. 2505-2517, Sep. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.9 , pp. 2505-2517
    • Toda, T.1    Nakagiri, M.2    Shikano, K.3
  • 28
    • 77952978184
    • Adaptive training for voice conversion based on eigenvoices
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Adaptive training for voice conversion based on eigenvoices," IEICE Trans. Inf. Syst., vol. E93-D, no. 6, pp. 1589-1598, Jun. 2010.
    • (2010) IEICE Trans. Inf. Syst. , vol.E93-D , Issue.6 , pp. 1589-1598
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 29
    • 85131821539
    • Mel-generalized cepstral analysis - A unified approach to speech spectral estimation
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis - a unified approach to speech spectral estimation," in Proc. ICSLP, Sep. 1994, pp. 1043-1045.
    • Proc. ICSLP, Sep. 1994 , pp. 1043-1045
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.