메뉴 건너뛰기




Volumn , Issue , 2008, Pages 107-110

Predicting F0 and voicing from NAM-captured whispered speech

Author keywords

Neural network; Non audible murmur; Voice conversion; Whispered speech

Indexed keywords

ESTIMATION; NEURAL NETWORKS; SPEECH PROCESSING;

EID: 77949907441     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (6)

References (14)
  • 1
    • 33745214435 scopus 로고    scopus 로고
    • NAM-to-Speech Conversion with Gaussian Mixture Models
    • Lisboa
    • Toda, T.; Shikano, K., 2005. NAM-to-Speech Conversion with Gaussian Mixture Models. In Proc. Interspeech. Lisboa, 1957-1960.
    • (2005) In Proc. Interspeech , pp. 1957-1960
    • Toda, T.1    Shikano, K.2
  • 2
    • 57749193836 scopus 로고    scopus 로고
    • Voice Conversion Based on Maximum Likelihood Estimation of Spectral Parameter Trajectory
    • Toda, T.; Black, A.W.; Tokuda, K., 2007. Voice Conversion Based on Maximum Likelihood Estimation of Spectral Parameter Trajectory. In IEEE Transactions on Audio, Speech and Language Processing. Vol. 15, No. 8, 2222-2235.
    • (2007) In IEEE Transactions On Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 3
    • 44949143155 scopus 로고    scopus 로고
    • Maximum Likelihood Voice Conversion Based on GMM with STRAIGHT Mixed Excitation
    • Pittsburgh, USA
    • Ohtani, Y.; Toda, T.; Sarawatari, H.; Shikano, K., 2006. Maximum Likelihood Voice Conversion Based on GMM with STRAIGHT Mixed Excitation. In Proc. Interspeech - ICSLP. Pittsburgh, USA. 2266-2269.
    • (2006) In Proc. Interspeech - ICSLP , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Sarawatari, H.3    Shikano, K.4
  • 4
    • 44949187612 scopus 로고    scopus 로고
    • Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion
    • Pittsburgh, USA
    • Nakagiri, M.; Toda, T.; Kashioka, H.; Shikano, K., 2006. Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion. In Proc. Interspeech- ICSLP. Pittsburgh, USA. 2270-2273.
    • (2006) In Proc. Interspeech- ICSLP , pp. 2270-2273
    • Nakagiri, M.1    Toda, T.2    Kashioka, H.3    Shikano, K.4
  • 6
    • 33745215083 scopus 로고    scopus 로고
    • Audible (normal) speech and inaudible murmur recognition using NAM microphone
    • Vienna, Austria
    • Heracleous, P.; Nakajima, Y., 2004. Audible (normal) speech and inaudible murmur recognition using NAM microphone. In EUSIPCO, Vienna, Austria.
    • (2004) In EUSIPCO
    • Heracleous, P.1    Nakajima, Y.2
  • 7
    • 13544263200 scopus 로고    scopus 로고
    • Analysis and recognition of whispered speech
    • Lisboa
    • Ito, T.; Takeda, K.; Itakura, F., 2005. Analysis and recognition of whispered speech. In Speech Communication. Lisboa. Vol. 45, Issue 2, 139-152.
    • (2005) In Speech Communication , vol.45 , Issue.2 , pp. 139-152
    • Ito, T.1    Takeda, K.2    Itakura, F.3
  • 8
    • 0029972858 scopus 로고    scopus 로고
    • Perceived Pitch of Whispered Vowels - Relationship with formant frequencies: A preliminary study
    • Higashikawa, M.; Nakai, K.; Sakakura, A; Takahashi, H., 1996. Perceived Pitch of Whispered Vowels - Relationship with formant frequencies: A preliminary study. Journal of Voice, 155-158.
    • (1996) Journal of Voice , pp. 155-158
    • Higashikawa, M.1    Nakai, K.2    Sakakura, A.3    Takahashi, H.4
  • 11
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • Seattle, U.S.A
    • Kain, A.; Macon M. W., Spectral voice conversion for text-to-speech synthesis. In Proc. ICASSP. Seattle, U.S.A. Vol 1, 285-288.
    • In Proc. ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 12
    • 56149100546 scopus 로고    scopus 로고
    • Continuous-Speech Phone Recognition from Ultrasound and Optical Images of the Tongue and Lips
    • Antwerp, Belgium
    • Hueber, T.; Chollet, G.; Denby, B.; Dreyfus G.; Stone M, 2007. Continuous-Speech Phone Recognition from Ultrasound and Optical Images of the Tongue and Lips. In Proc. Interspeech. Antwerp, Belgium,
    • (2007) In Proc. Interspeech
    • Hueber, T.1    Chollet, G.2    Denby, B.3    Dreyfus, G.4    Stone, M.5
  • 13
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitchadaptive time frequency smoothing and instantaneousfrequency- based F0 extraction: Possible role of a repetitive structure in sounds
    • Kawahara, H.; Masuda-Katsuse, I.; Cheveigné, A., 1999. Restructuring speech representations using a pitchadaptive time frequency smoothing and instantaneousfrequency- based F0 extraction: Possible role of a repetitive structure in sounds. In Speech Communication. Vol. 27, No. 3-4, 187-207.
    • (1999) In Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigné, A.3
  • 14
    • 84874199000 scopus 로고    scopus 로고
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
    • Firentze, Italy
    • Kawahara, H.; Estill, J.; Fujimura, O., 2001. Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT. MAVEBA, Firentze, Italy.
    • (2001) MAVEBA
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.