SCOPUS Information Search Platform

Proceedings of the 4th International Conference on Speech Prosody, SP 2008

Volume , Issue , 2008, Pages 107-110

Predicting F0 and voicing from NAM-captured whispered speech

(4) Tran, Viet Anh (a); Bailly, Gérard (a); Loevenbruck, Hélène (a); Toda, Tomoki (b)

(a) GIPSA LAB (France)

(b) NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

Neural network; Non audible murmur; Voice conversion; Whispered speech

Indexed keywords

ESTIMATION; NEURAL NETWORKS; SPEECH PROCESSING;

CONVERSION SYSTEMS; GAUSSIAN MIXTURE MODEL; NON-AUDIBLE MURMUR; ORIGINAL SYSTEMS; UNVOICED SPEECH; VOICE CONVERSION; VOICING DECISION; WHISPERED SPEECH;

SPEECH RECOGNITION;

EID: 77949907441; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (6)

References (14)

1
- 33745214435
- NAM-to-Speech Conversion with Gaussian Mixture Models
- Lisboa
- Toda, T.; Shikano, K., 2005. NAM-to-Speech Conversion with Gaussian Mixture Models. In Proc. Interspeech. Lisboa, 1957-1960.
- (2005) In Proc. Interspeech , pp. 1957-1960
- Toda, T.¹ Shikano, K.²

2
- 57749193836
- Voice Conversion Based on Maximum Likelihood Estimation of Spectral Parameter Trajectory
- Toda, T.; Black, A.W.; Tokuda, K., 2007. Voice Conversion Based on Maximum Likelihood Estimation of Spectral Parameter Trajectory. In IEEE Transactions on Audio, Speech and Language Processing. Vol. 15, No. 8, 2222-2235.
- (2007) In IEEE Transactions On Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

3
- 44949143155
- Maximum Likelihood Voice Conversion Based on GMM with STRAIGHT Mixed Excitation
- Pittsburgh, USA
- Ohtani, Y.; Toda, T.; Saruwatari, H.; Shikano, K., 2006. Maximum Likelihood Voice Conversion Based on GMM with STRAIGHT Mixed Excitation. In Proc. Interspeech - ICSLP. Pittsburgh, USA. 2266-2269.
- (2006) In Proc. Interspeech - ICSLP , pp. 2266-2269
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

4
- 44949187612
- Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion
- Pittsburgh, USA
- Nakagiri, M.; Toda, T.; Kashioka, H.; Shikano, K., 2006. Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion. In Proc. Interspeech - ICSLP. Pittsburgh, USA. 2270-2273.
- (2006) In Proc. Interspeech - ICSLP , pp. 2270-2273
- Nakagiri, M.¹ Toda, T.² Kashioka, H.³ Shikano, K.⁴

5
- 85009201649
- Non-audible murmur recognition
- Geneva, Switzerland
- Nakajima, Y.; Kashioka, H.; Shikano, K.; Campbell, N., 2003. Non-audible murmur recognition. In Proc. Interspeech (Eurospeech). Geneva, Switzerland, 2601-2604.
- (2003) In Proc. Interspeech (Eurospeech) , pp. 2601-2604
- Nakajima, Y.¹ Kashioka, H.² Shikano, K.³ Campbell, N.⁴

6
- 33745215083
- Audible (normal) speech and inaudible murmur recognition using NAM microphone
- Vienna, Austria
- Heracleous, P.; Nakajima, Y., 2004. Audible (normal) speech and inaudible murmur recognition using NAM microphone. In EUSIPCO, Vienna, Austria.
- (2004) In EUSIPCO
- Heracleous, P.¹ Nakajima, Y.²

7
- 13544263200
- Analysis and recognition of whispered speech
- Lisboa
- Ito, T.; Takeda, K.; Itakura, F., 2005. Analysis and recognition of whispered speech. In Speech Communication. Lisboa. Vol. 45, Issue 2, 139-152.
- (2005) In Speech Communication , vol.45 , Issue.2 , pp. 139-152
- Ito, T.¹ Takeda, K.² Itakura, F.³

8
- 0029972858
- Perceived Pitch of Whispered Vowels - Relationship with formant frequencies: A preliminary study
- Higashikawa, M.; Nakai, K.; Sakakura, A.; Takahashi, H., 1996. Perceived Pitch of Whispered Vowels - Relationship with formant frequencies: A preliminary study. Journal of Voice, 155-158.
- (1996) Journal of Voice , pp. 155-158
- Higashikawa, M.¹ Nakai, K.² Sakakura, A.³ Takahashi, H.⁴

9
- 0033065909
- Acoustical-perceptual correlates of whispered pitch in synthetically generated vowels
- Higashikawa, M.; Minifie, F.D., 1999. Acoustical-perceptual correlates of whispered pitch in synthetically generated vowels. In Journal of Speech, Language, and Hearing Research. Vol. 42, 583-591.
- (1999) In Journal of Speech, Language, and Hearing Research , vol.42 , pp. 583-591
- Higashikawa, M.¹ Minifie, F.D.²

10
- 0032026483
- Continuous probabilistic transform for voice conversion
- Stylianou, Y.; Cappé, O.; Moulines, E., 1998. Continuous probabilistic transform for voice conversion. In IEEE Trans. Speech and Audio Processing, Vol. 6, No. 2, 131-142.
- (1998) In IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

11
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Seattle, U.S.A
- Kain, A.; Macon, M.W. Spectral voice conversion for text-to-speech synthesis. In Proc. ICASSP. Seattle, U.S.A. Vol. 1, 285-288.
- In Proc. ICASSP , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

12
- 56149100546
- Continuous-Speech Phone Recognition from Ultrasound and Optical Images of the Tongue and Lips
- Antwerp, Belgium
- Hueber, T.; Chollet, G.; Denby, B.; Dreyfus, G.; Stone, M., 2007. Continuous-Speech Phone Recognition from Ultrasound and Optical Images of the Tongue and Lips. In Proc. Interspeech. Antwerp, Belgium.
- (2007) In Proc. Interspeech
- Hueber, T.¹ Chollet, G.² Denby, B.³ Dreyfus, G.⁴ Stone, M.⁵

13
- 0032673049
- Restructuring speech representations using a pitch-adaptive time frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- Kawahara, H.; Masuda-Katsuse, I.; Cheveigné, A., 1999. Restructuring speech representations using a pitch-adaptive time frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. In Speech Communication. Vol. 27, No. 3-4, 187-207.
- (1999) In Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigné, A.³

14
- 84874199000
- Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
- Firenze, Italy
- Kawahara, H.; Estill, J.; Fujimura, O., 2001. Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT. MAVEBA, Firenze, Italy.
- (2001) MAVEBA
- Kawahara, H.¹ Estill, J.² Fujimura, O.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.