SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2009, Pages 3601-3604

Voice conversion for various types of body transmitted speech

(4) Toda, Tomoki a Nakamura, Keigo a Sekimoto, Hidehiko a,b Shikano, Kiyohiro a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

b OMRON CORPORATION (Japan)

Author keywords

Body transmitted speech; Noise robust speech communication; Silent speech communication; Speaking aid; Voice conversion

Indexed keywords

BODY TRANSMITTED SPEECH; NOISE ROBUST SPEECH COMMUNICATION; SILENT SPEECH COMMUNICATION; SPEAKING AID; VOICE CONVERSION;

ACOUSTICS; SIGNAL PROCESSING; SPEECH ANALYSIS; SPEECH PROCESSING;

SPEECH COMMUNICATION;

EID: 70349200844 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2009.4960405 Document Type: Conference Paper

Times cited : (33)

References (19)

1
- 38649090114
- Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling
- A. Subramanya, Z. Zhang, Z. Liu, A. Acero. Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling. Speech Communication, Vol. 50, No. 3, pp. 228-243, 2008.
- (2008) Speech Communication , vol.50 , Issue.3 , pp. 228-243
- Subramanya, A.¹ Zhang, Z.² Liu, Z.³ Acero, A.⁴

2
- 84890487256
- Adaptation for soft whisper recognition using a throat microphone
- Jeju Island, Korea
- S-C. Jou, T. Schultz, and A.Waibel. Adaptation for soft whisper recognition using a throat microphone. Proc. INTERSPEECH, pp. 1493-1496, Jeju Island, Korea, 2004.
- (2004) Proc. INTERSPEECH , pp. 1493-1496
- Jou, S.-C.¹ Schultz, T.² Waibel, A.³

3
- 33846185567
- Session independent non-audible speech recognition using surface electromyography
- San Juan, Puerto Rico, Nov
- L. Maier-Hein, F. Metze, T. Schultz, and A. Waibel. Session independent non-audible speech recognition using surface electromyography. Proc. ASRU, pp. 331-336, San Juan, Puerto Rico, Nov. 2005.
- (2005) Proc. ASRU , pp. 331-336
- Maier-Hein, L.¹ Metze, F.² Schultz, T.³ Waibel, A.⁴

4
- 67650558346
- Continuousspeech phone recognition from ultrasound and optical images of the tongue and lips
- Antwerp, Belgium, Aug
- T. Hueber, G. Chollet, B. Denby, G. Dreyfus, M. Stone. Continuousspeech phone recognition from ultrasound and optical images of the tongue and lips. Proc. Interspeech, pp. 658-661, Antwerp, Belgium, Aug. 2007.
- (2007) Proc. Interspeech , pp. 658-661
- Hueber, T.¹ Chollet, G.² Denby, B.³ Dreyfus, G.⁴ Stone, M.⁵

5
- 32244438249
- Non-Audible Murmur (NAM) Recognition
- Y. Nakajima, H. Kashioka, N. Cambell, and K. Shikano. Non-Audible Murmur (NAM) Recognition. IEICE Trans. Information and Systems, Vol. E89-D, No. 1, pp. 1-8, 2006.
- (2006) IEICE Trans. Information and Systems , vol.E89-D , Issue.1 , pp. 1-8
- Nakajima, Y.¹ Kashioka, H.² Cambell, N.³ Shikano, K.⁴

6
- 0029256373
- Acoustic characteristics of speaker individuality: Control and conversion
- H. Kuwabara and Y. Sagisaka. Acoustic characteristics of speaker individuality: control and conversion. Speech Communication, Vol. 16, No. 2, pp. 165-173, 1995.
- (1995) Speech Communication , vol.16 , Issue.2 , pp. 165-173
- Kuwabara, H.¹ Sagisaka, Y.²

7
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines. Continuous probabilistic transform for voice conversion. IEEE Trans. Speech and Audio Processing, Vol. 6, No. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

8
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A.W. Black, and K. Tokuda. Voice conversion based on maximum likelihood estimation of spectral parameter trajectory. IEEE Trans. ASLP, Vol. 15, No. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

9
- 33745214435
- NAM-to-speech conversion with Gaussian mixture models
- Lisbon, Portugal, Sep
- T. Toda and K. Shikano. NAM-to-speech conversion with Gaussian mixture models. Proc. INTERSPEECH, pp. 1957-1960, Lisbon, Portugal, Sep. 2005.
- (2005) Proc. INTERSPEECH , pp. 1957-1960
- Toda, T.¹ Shikano, K.²

10
- 44949187612
- Improving body transmitted unvoiced speech with statistical voice conversion
- Pittsburgh, USA, Sep
- M. Nakagiri, T. Toda, H. Saruwatari, and K. Shikano. Improving body transmitted unvoiced speech with statistical voice conversion. Proc. INTERSPEECH, pp. 2270-2273, Pittsburgh, USA, Sep. 2006.
- (2006) Proc. INTERSPEECH , pp. 2270-2273
- Nakagiri, M.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

11
- 44949265538
- Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech
- Pittsburgh, USA, Sep
- K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano. Speaking aid system for total laryngectomees using voice conversion of body transmitted artificial speech. Proc. INTERSPEECH, pp. 1395-1398, Pittsburgh, USA, Sep. 2006.
- (2006) Proc. INTERSPEECH , pp. 1395-1398
- Nakamura, K.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

12
- 33745217604
- Remodeling of the sensor for non-audible murmur (NAM)
- Lisbon, Portugal, Sep
- Y. Nakajima, H. Kashioka, K. Shikano, and N. Campbell. Remodeling of the sensor for non-audible murmur (NAM). Proc. INTERSPEECH, pp. 389-392, Lisbon, Portugal, Sep. 2005.
- (2005) Proc. INTERSPEECH , pp. 389-392
- Nakajima, Y.¹ Kashioka, H.² Shikano, K.³ Campbell, N.⁴

13
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Seattle, USA, May
- A. Kain and M.W. Macon. Spectral voice conversion for text-to-speech synthesis. Proc. ICASSP, pp. 285-288, Seattle, USA, May 1998.
- (1998) Proc. ICASSP , pp. 285-288
- Kain, A.¹ Macon, M.W.²

14
- 44949143155
- Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
- Pittsburgh, USA, Sep
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano. Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. Proc. INTERSPEECH, pp. 2266-2269, Pittsburgh, USA, Sep. 2006.
- (2006) Proc. INTERSPEECH , pp. 2266-2269
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

15
- 70349195617
- Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees
- Antwerp, Belgium, Aug
- K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano. Impact of various small sound source signals on voice conversion accuracy in speech communication aid for laryngectomees. Proc. INTERSPEECH, pp. 2517-2520, Antwerp, Belgium, Aug. 2007.
- (2007) Proc. INTERSPEECH , pp. 2517-2520
- Nakamura, K.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

16
- 84867208056
- Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments
- Brisbane, Australia, Sep
- K. Nakamura, T. Toda, Y. Nakajima, H. Saruwatari, and K. Shikano. Evaluation of speaking-aid system with voice conversion for laryngectomees toward its use in practical environments. Proc. INTERSPEECH, pp. 2209-2212, Brisbane, Australia, Sep. 2008.
- (2008) Proc. INTERSPEECH , pp. 2209-2212
- Nakamura, K.¹ Toda, T.² Nakajima, Y.³ Saruwatari, H.⁴ Shikano, K.⁵

17
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds. Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

18
- 84928118106
- Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimat T. Todaion of F0 and periodicity
- Budapest, Hungary, Sep
- H. Kawahara, H. Katayose, A.de Cheveigné, and R.D. Patterson. Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimat T. Todaion of F0 and periodicity. Proc. EUROSPEECH, pp. 2781-2784, Budapest, Hungary, Sep. 1999.
- (1999) Proc. EUROSPEECH , pp. 2781-2784
- Kawahara, H.¹ Katayose, H.² de Cheveigné, A.³ Patterson, R.D.⁴

19
- 84867211725
- Lowdelay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Brisbane, Australia, Sep
- T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano. Lowdelay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. Proc. INTERSPEECH, pp. 1076-1079, Brisbane, Australia, Sep. 2008.
- (2008) Proc. INTERSPEECH , pp. 1076-1079
- Muramatsu, T.¹ Ohtani, Y.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.