메뉴 건너뛰기




Volumn 52, Issue 4, 2010, Pages 314-326

Improvement to a NAM-captured whisper-to-speech system

Author keywords

Audiovisual voice conversion; Non audible murmur; Silent speech interface; Whispered speech

Indexed keywords

COMPUTER-MEDIATED COMMUNICATION; DIMENSIONALITY REDUCTION; INPUT AND OUTPUTS; LINEAR DISCRIMINANT ANALYSIS; NON-AUDIBLE MURMUR; SPECTRAL ENVELOPES; SPEECH INTERFACE; SPEECH SYSTEMS; SUBJECTIVE EVALUATIONS; SUBJECTIVE TESTS; SYNTHESIZED SPEECH; TIME WINDOWS; UNVOICED SPEECH; VOICE CONVERSION; VOICED SEGMENT; VOICING DECISION; WHISPERED SPEECH;

EID: 76849105528     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2009.11.005     Document Type: Article
Times cited : (39)

References (34)
  • 1
    • 0036656541 scopus 로고    scopus 로고
    • Three-dimensional linear articulatory modeling of tongue, lips and face based on MRI and video images
    • Badin P., Bailly G., Revéret L., Baciu M., Segebarth C., and Savariaux C. Three-dimensional linear articulatory modeling of tongue, lips and face based on MRI and video images. J. Phonetics 30 3 (2002) 533-553
    • (2002) J. Phonetics , vol.30 , Issue.3 , pp. 533-553
    • Badin, P.1    Bailly, G.2    Revéret, L.3    Baciu, M.4    Segebarth, C.5    Savariaux, C.6
  • 3
    • 84925684753 scopus 로고    scopus 로고
    • Speaking with Smile or Disgust: Data and Models
    • Australia, pp
    • Bailly, G., Bégault, A., Elisei, F., Badin, P., 2008. Speaking with Smile or Disgust: Data and Models. AVSP, Tangalooma, Australia, pp. 111-116.
    • (2008) AVSP, Tangalooma , pp. 111-116
    • Bailly, G.1    Bégault, A.2    Elisei, F.3    Badin, P.4
  • 9
    • 0029972858 scopus 로고    scopus 로고
    • Perceived pitch of whispered vowels - relationship with formant frequencies: a preliminary study
    • Higashikawa M., Nakai K., Sakakura A., and Takahashi H. Perceived pitch of whispered vowels - relationship with formant frequencies: a preliminary study. J. Voice 10 2 (1996) 155-158
    • (1996) J. Voice , vol.10 , Issue.2 , pp. 155-158
    • Higashikawa, M.1    Nakai, K.2    Sakakura, A.3    Takahashi, H.4
  • 11
    • 67650558346 scopus 로고    scopus 로고
    • Continuous-speech Phone Recognition from Ultrasound and Optical Images of the Tongue and Lips
    • Antwerp, Belgium, pp
    • Hueber, T., Chollet, G., Denby, B., Dreyfus, G., Stone, M., 2007b. Continuous-speech Phone Recognition from Ultrasound and Optical Images of the Tongue and Lips. InterSpeech, Antwerp, Belgium, pp. 658-661.
    • (2007) InterSpeech , pp. 658-661
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 12
    • 76849099671 scopus 로고    scopus 로고
    • Hueber, T., Chollet, G., Denby, B., Stone, M., Zouari, L., 2007c. Ouisper: corpus-based synthesis driven by articulatory data. In: Internat. Cong. of Phonetic Sciences, Saarbrücken, Germany, pp. 2193-2196.
    • Hueber, T., Chollet, G., Denby, B., Stone, M., Zouari, L., 2007c. Ouisper: corpus-based synthesis driven by articulatory data. In: Internat. Cong. of Phonetic Sciences, Saarbrücken, Germany, pp. 2193-2196.
  • 13
    • 84867208175 scopus 로고    scopus 로고
    • Towards a Segmental Vocoder Driven by Ultrasound and Optical Images of the Tongue and Lips
    • Brisbane, Australia, pp
    • Hueber, T., Chollet, G., Denby, B., Dreyfus, G., Stone, M., 2008a. Towards a Segmental Vocoder Driven by Ultrasound and Optical Images of the Tongue and Lips. InterSpeech, Brisbane, Australia, pp. 2028-2031.
    • (2008) InterSpeech , pp. 2028-2031
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 14
    • 84867195703 scopus 로고    scopus 로고
    • Phone Recognition from Ultrasound and Optical Video Sequences for a Silent Speech Interface
    • Brisbane, Australia, pp
    • Hueber, T., Chollet, G., Denby, B., Dreyfus, G., Stone, M., 2008b. Phone Recognition from Ultrasound and Optical Video Sequences for a Silent Speech Interface. InterSpeech, Brisbane, Australia, pp. 2032-2035.
    • (2008) InterSpeech , pp. 2032-2035
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 15
    • 0014887733 scopus 로고
    • The electromyographic study of verbal hallucinations
    • Inouye T., and Shimizu A. The electromyographic study of verbal hallucinations. J. Nerv. Mental Dis. 151 (1970) 415-422
    • (1970) J. Nerv. Mental Dis. , vol.151 , pp. 415-422
    • Inouye, T.1    Shimizu, A.2
  • 19
    • 44949187612 scopus 로고    scopus 로고
    • Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion
    • Nakagiri, M., Toda, T., Kashioka, H., Shikano, K., 2006. Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion. InterSpeech, Pittsburgh, PE, pp. 2270-2273.
    • (2006) InterSpeech, Pittsburgh, PE , pp. 2270-2273
    • Nakagiri, M.1    Toda, T.2    Kashioka, H.3    Shikano, K.4
  • 22
    • 84870292720 scopus 로고    scopus 로고
    • MOTHER: A new generation of talking heads providing a flexible articulatory control for video-realistic speech animation
    • Beijing, China, pp
    • Revéret, L., Bailly, G., Badin, P., 2000. MOTHER: a new generation of talking heads providing a flexible articulatory control for video-realistic speech animation. In: Internat. Conf. on Speech and Language Processing, Beijing, China, pp. 755-758.
    • (2000) Internat. Conf. on Speech and Language Processing , pp. 755-758
    • Revéret, L.1    Bailly, G.2    Badin, P.3
  • 23
    • 10444247388 scopus 로고    scopus 로고
    • Developing an audio-visual speech source separation algorithm
    • Sodoyer D., Girin L., Jutten C., and Schwartz J.-L. Developing an audio-visual speech source separation algorithm. Speech Comm. 44 1-4 (2004) 113-125
    • (2004) Speech Comm. , vol.44 , Issue.1-4 , pp. 113-125
    • Sodoyer, D.1    Girin, L.2    Jutten, C.3    Schwartz, J.-L.4
  • 24
    • 0018701386 scopus 로고
    • Use of visual information for phonetic perception
    • Summerfield Q. Use of visual information for phonetic perception. Phonetica 36 (1979) 314-331
    • (1979) Phonetica , vol.36 , pp. 314-331
    • Summerfield, Q.1
  • 25
    • 0002955163 scopus 로고
    • Lips, teeth, and the benefits of lipreading
    • Young A.W., and Ellis H.D. (Eds), Elsevier Science Publishers, Amsterdam
    • Summerfield A., MacLeod A., McGrath M., and Brooke M. Lips, teeth, and the benefits of lipreading. In: Young A.W., and Ellis H.D. (Eds). Handbook of Research on Face Processing (1989), Elsevier Science Publishers, Amsterdam 223-233
    • (1989) Handbook of Research on Face Processing , pp. 223-233
    • Summerfield, A.1    MacLeod, A.2    McGrath, M.3    Brooke, M.4
  • 28
    • 70349200844 scopus 로고    scopus 로고
    • Voice conversion for various types of body transmitted speech
    • Taipei, Taiwan, pp
    • Toda, T., Nakamura, K., Sekimoto, H., Shikano, K., 2009. Voice conversion for various types of body transmitted speech. In: Proc. ICASSP. Taipei, Taiwan, pp. 3601-3604.
    • (2009) Proc. ICASSP , pp. 3601-3604
    • Toda, T.1    Nakamura, K.2    Sekimoto, H.3    Shikano, K.4
  • 33
    • 76849102588 scopus 로고    scopus 로고
    • Zen, H, Nose, T, Yamagishi, J, Sako, S, Masuko, T, Black, A, Tokuda, K, 2007. The HMM-based Speech Synthesis System Version 2.0. Speech Synthesis Workshop, Bonn, Germany, pp. 294-299
    • Zen, H., Nose, T., Yamagishi, J., Sako, S., Masuko, T., Black, A., Tokuda, K., 2007. The HMM-based Speech Synthesis System Version 2.0. Speech Synthesis Workshop, Bonn, Germany, pp. 294-299.
  • 34
    • 33745222547 scopus 로고    scopus 로고
    • Physiological Study of Whispered Speech in Moroccan Arabic
    • Lisbon, pp
    • Zeroual, C., Esling, J., Crevier-Buchman, L., 2005. Physiological Study of Whispered Speech in Moroccan Arabic. In: Proc. InterSpeech. Lisbon, pp. 1069-1072.
    • (2005) Proc. InterSpeech , pp. 1069-1072
    • Zeroual, C.1    Esling, J.2    Crevier-Buchman, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.