메뉴 건너뛰기




Volumn , Issue , 2009, Pages 3601-3604

Voice conversion for various types of body transmitted speech

Author keywords

Body transmitted speech; Noise robust speech communication; Silent speech communication; Speaking aid; Voice conversion

Indexed keywords

BODY TRANSMITTED SPEECH; NOISE ROBUST SPEECH COMMUNICATION; SILENT SPEECH COMMUNICATION; SPEAKING AID; VOICE CONVERSION;

EID: 70349200844     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2009.4960405     Document Type: Conference Paper
Times cited : (33)

References (19)
  • 1
    • 38649090114 scopus 로고    scopus 로고
    • Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling
    • A. Subramanya, Z. Zhang, Z. Liu, A. Acero. Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling. Speech Communication, Vol. 50, No. 3, pp. 228-243, 2008.
    • (2008) Speech Communication , vol.50 , Issue.3 , pp. 228-243
    • Subramanya, A.1    Zhang, Z.2    Liu, Z.3    Acero, A.4
  • 2
    • 84890487256 scopus 로고    scopus 로고
    • Adaptation for soft whisper recognition using a throat microphone
    • Jeju Island, Korea
    • S-C. Jou, T. Schultz, and A.Waibel. Adaptation for soft whisper recognition using a throat microphone. Proc. INTERSPEECH, pp. 1493-1496, Jeju Island, Korea, 2004.
    • (2004) Proc. INTERSPEECH , pp. 1493-1496
    • Jou, S.-C.1    Schultz, T.2    Waibel, A.3
  • 3
    • 33846185567 scopus 로고    scopus 로고
    • Session independent non-audible speech recognition using surface electromyography
    • San Juan, Puerto Rico, Nov
    • L. Maier-Hein, F. Metze, T. Schultz, and A. Waibel. Session independent non-audible speech recognition using surface electromyography. Proc. ASRU, pp. 331-336, San Juan, Puerto Rico, Nov. 2005.
    • (2005) Proc. ASRU , pp. 331-336
    • Maier-Hein, L.1    Metze, F.2    Schultz, T.3    Waibel, A.4
  • 4
    • 67650558346 scopus 로고    scopus 로고
    • Continuousspeech phone recognition from ultrasound and optical images of the tongue and lips
    • Antwerp, Belgium, Aug
    • T. Hueber, G. Chollet, B. Denby, G. Dreyfus, M. Stone. Continuousspeech phone recognition from ultrasound and optical images of the tongue and lips. Proc. Interspeech, pp. 658-661, Antwerp, Belgium, Aug. 2007.
    • (2007) Proc. Interspeech , pp. 658-661
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 6
    • 0029256373 scopus 로고
    • Acoustic characteristics of speaker individuality: Control and conversion
    • H. Kuwabara and Y. Sagisaka. Acoustic characteristics of speaker individuality: control and conversion. Speech Communication, Vol. 16, No. 2, pp. 165-173, 1995.
    • (1995) Speech Communication , vol.16 , Issue.2 , pp. 165-173
    • Kuwabara, H.1    Sagisaka, Y.2
  • 8
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A.W. Black, and K. Tokuda. Voice conversion based on maximum likelihood estimation of spectral parameter trajectory. IEEE Trans. ASLP, Vol. 15, No. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 9
    • 33745214435 scopus 로고    scopus 로고
    • NAM-to-speech conversion with Gaussian mixture models
    • Lisbon, Portugal, Sep
    • T. Toda and K. Shikano. NAM-to-speech conversion with Gaussian mixture models. Proc. INTERSPEECH, pp. 1957-1960, Lisbon, Portugal, Sep. 2005.
    • (2005) Proc. INTERSPEECH , pp. 1957-1960
    • Toda, T.1    Shikano, K.2
  • 10
    • 44949187612 scopus 로고    scopus 로고
    • Improving body transmitted unvoiced speech with statistical voice conversion
    • Pittsburgh, USA, Sep
    • M. Nakagiri, T. Toda, H. Saruwatari, and K. Shikano. Improving body transmitted unvoiced speech with statistical voice conversion. Proc. INTERSPEECH, pp. 2270-2273, Pittsburgh, USA, Sep. 2006.
    • (2006) Proc. INTERSPEECH , pp. 2270-2273
    • Nakagiri, M.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 11
    • 44949265538 scopus 로고    scopus 로고
    • Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech
    • Pittsburgh, USA, Sep
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano. Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. Proc. INTERSPEECH, pp. 1395-1398, Pittsburgh, USA, Sep. 2006.
    • (2006) Proc. INTERSPEECH , pp. 1395-1398
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 12
    • 33745217604 scopus 로고    scopus 로고
    • Remodeling of the sensor for non-audible murmur (NAM)
    • Lisbon, Portugal, Sep
    • Y. Nakajima, H. Kashioka, K. Shikano, and N. Campbell. Remodeling of the sensor for non-audible murmur (NAM). Proc. INTERSPEECH, pp. 389-392, Lisbon, Portugal, Sep. 2005.
    • (2005) Proc. INTERSPEECH , pp. 389-392
    • Nakajima, Y.1    Kashioka, H.2    Shikano, K.3    Campbell, N.4
  • 13
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • Seattle, USA, May
    • A. Kain and M.W. Macon. Spectral voice conversion for text-to-speech synthesis. Proc. ICASSP, pp. 285-288, Seattle, USA, May 1998.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 14
    • 44949143155 scopus 로고    scopus 로고
    • Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
    • Pittsburgh, USA, Sep
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano. Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. Proc. INTERSPEECH, pp. 2266-2269, Pittsburgh, USA, Sep. 2006.
    • (2006) Proc. INTERSPEECH , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 15
    • 70349195617 scopus 로고    scopus 로고
    • Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees
    • Antwerp, Belgium, Aug
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano. Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. Proc. INTERSPEECH, pp. 2517-2520, Antwerp, Belgium, Aug. 2007.
    • (2007) Proc. INTERSPEECH , pp. 2517-2520
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 16
    • 84867208056 scopus 로고    scopus 로고
    • Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments
    • Brisbane, Australia, Sep
    • K. Nakamura, T. Toda, Y. Nakajima, H. Saruwatari, and K. Shikano. Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. Proc. INTERSPEECH, pp. 2209-2212, Brisbane, Australia, Sep. 2008.
    • (2008) Proc. INTERSPEECH , pp. 2209-2212
    • Nakamura, K.1    Toda, T.2    Nakajima, Y.3    Saruwatari, H.4    Shikano, K.5
  • 17
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds. Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 18
    • 84928118106 scopus 로고    scopus 로고
    • Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimat T. Todaion of F0 and periodicity
    • Budapest, Hungary, Sep
    • H. Kawahara, H. Katayose, A.de Cheveigné, and R.D. Patterson. Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimat T. Todaion of F0 and periodicity. Proc. EUROSPEECH, pp. 2781-2784, Budapest, Hungary, Sep. 1999.
    • (1999) Proc. EUROSPEECH , pp. 2781-2784
    • Kawahara, H.1    Katayose, H.2    de Cheveigné, A.3    Patterson, R.D.4
  • 19
    • 84867211725 scopus 로고    scopus 로고
    • Lowdelay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Brisbane, Australia, Sep
    • T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano. Lowdelay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. Proc. INTERSPEECH, pp. 1076-1079, Brisbane, Australia, Sep. 2008.
    • (2008) Proc. INTERSPEECH , pp. 1076-1079
    • Muramatsu, T.1    Ohtani, Y.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.