SCOPUS 정보 검색 플랫폼

6th ISCA Workshop on Speech Synthesis, SSW 2007

Volumn , Issue , 2007, Pages 101-106

Regression Approaches to Voice Quality Control Based on One-to-Many Eigenvoice Conversion

(5) Ohta, Kumi a Ohtani, Yamato a Toda, Tomoki a Saruwatari, Hiroshi a Shikano, Kiyohiro a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

GAUSSIAN DISTRIBUTION; QUALITY ASSURANCE; QUALITY CONTROL;

DATA SET; EIGENVOICES; GAUSSIAN MIXTURE MODEL; LOW DIMENSIONAL; PARALLEL DATA; PHYSICAL MEANINGS; SINGLE SOURCE; TARGET SPEAKER; VOICE QUALITY; VOICE QUALITY CONTROLS;

EIGENVALUES AND EIGENFUNCTIONS;

EID: 78049390735 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (19)

1
- 0029256373
- Acoustic characteristics of speaker individuality: control and conversion
- H. Kuwabara and Y. Sagisaka, “Acoustic characteristics of speaker individuality: control and conversion,” Speech Communication, Vol. 16, No. 2, pp. 165-173, 1995.
- (1995) Speech Communication , vol.16 , Issue.2 , pp. 165-173
- Kuwabara, H.¹ Sagisaka, Y.²

2
- 0030374940
- Speech morphing by gradually changing spectrum parameter and fundamental frequency
- Oct
- M. Abe, “Speech morphing by gradually changing spectrum parameter and fundamental frequency,” in Proc. ICSLP 96, Oct. 1996, Vol. 4, pp. 2235-2238.
- (1996) Proc. ICSLP 96 , vol.4 , pp. 2235-2238
- Abe, M.¹

3
- 34247565129
- Implementation of realtime STRAIGHT speech manipulation system: Report on its implementation
- H. Banno, H. Hata, M. Morise, T. Takahashi, T. Irino, and H. Kawahara, “Implementation of realtime STRAIGHT speech manipulation system: Report on its implementation,” Acoust. Sci. & Tech., Vol. 28, No. 3, pp. 140-146, 2007.
- (2007) Acoust. Sci. & Tech , vol.28 , Issue.3 , pp. 140-146
- Banno, H.¹ Hata, H.² Morise, M.³ Takahashi, T.⁴ Irino, T.⁵ Kawahara, H.⁶

4
- 85004448479
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, “Voice conversion through vector quantization,” J. Acoust. Soc. Jpn. (E), Vol. 11, No. 2, pp. 71-76, 1990.
- (1990) J. Acoust. Soc. Jpn. (E) , vol.11 , Issue.2 , pp. 71-76
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

5
- 0032026483
- Continuous probabilistictransformfor voice conversion
- Y. Stylianou, O. Cappe, and E. Moulines, “Continuous probabilistictransformfor voice conversion,” IEEETrans. Speech and Audio Processing, Vol. 6, No. 2, pp. 131-142, 1998.
- (1998) IEEETrans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

6
- 33646779506
- Spectral conversion based on maximum likelihood estimationconsidering global variance of converted parameter
- Mar
- T. Toda, A.W. Black, and K. Tokuda, “Spectral conversion based on maximum likelihood estimationconsidering global variance of converted parameter,” in Proc. ICASSP 2005, Mar. 2005, Vol. 1, pp. 9-12.
- (2005) Proc. ICASSP 2005 , vol.1 , pp. 9-12
- Toda, T.¹ Black, A.W.² Tokuda, K.³

7
- 34547512822
- Eigenvoice conversion based on Gaussian mixture model
- Sep
- T. Toda, Y. Ohtani, and K. Shikano, “Eigenvoice conversion based on Gaussian mixture model,” in Proc. INTERSPEECH2006-ICSLP, Sep. 2006, pp. 2446-2449.
- (2006) Proc. INTERSPEECH2006-ICSLP , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

8
- 34547496175
- One-to-many and many-to-one voice conversion based on eigenvoices
- Apr
- T. Toda, Y. Ohtani, and K. Shikano, “One-to-many and many-to-one voice conversion based on eigenvoices,” in Proc. ICASSP 2007, Apr. 2007, pp. 1249-1252.
- (2007) Proc. ICASSP 2007 , pp. 1249-1252
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

9
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- R. Kuhn, J. Junqua, P. Nguyen, and N. Niedzielski,“Rapid speaker adaptation in eigenvoice space,” IEEE Trans. Speech and Audio Processing, Vol. 8, No. 6, pp. 695-707, 2000.
- (2000) IEEE Trans. Speech and Audio Processing , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.² Nguyen, P.³ Niedzielski, N.⁴

10
- 85009250381
- Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks
- Sep
- P. Kenny, G. Boulianne, and P. Dumouchel, “Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks,” in Proc. ICSLP 2002, Sep. 2002, pp. 57-60.
- (2002) Proc. ICSLP 2002 , pp. 57-60
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

11
- 85009257840
- Eigenvoices for HMM-based speech synthesis
- Sep
- K. Shichiri, A. Sawabe, T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Eigenvoices for HMM-based speech synthesis,” in Proc. ICSLP2002, Sep. 2002, pp. 1269-1272.
- (2002) Proc. ICSLP2002 , pp. 1269-1272
- Shichiri, K.¹ Sawabe, A.² Yoshimura, T.³ Tokuda, K.⁴ Masuko, T.⁵ Kobayashi, T.⁶ Kitamura, T.⁷

12
- 85009069226
- A style control technique for HMM-based speech synthesis
- Oct
- K. Miyanaga, T. Masuko, and T. Kobayashi, “A style control technique for HMM-based speech synthesis,” in Proc. INTERSPEECH2004-ICSLP, Oct. 2004, pp. 1437-1440.
- (2004) Proc. INTERSPEECH2004-ICSLP , pp. 1437-1440
- Miyanaga, K.¹ Masuko, T.² Kobayashi, T.³

13
- 44949155552
- A technique for controlling voice quality of synthetic speech using multiple regression HSMM
- Sep
- M. Tachibana, T. Nose, J. Yamagishi, and T. Kobayashi, “A technique for controlling voice quality of synthetic speech using multiple regression HSMM,” in Proc. INTERSPEECH2006-ICSLP, Sep. 2006, pp. 2438-2441.
- (2006) Proc. INTERSPEECH2006-ICSLP , pp. 2438-2441
- Tachibana, M.¹ Nose, T.² Yamagishi, J.³ Kobayashi, T.⁴

14
- 0030362995
- A compact model for speaker-adaptive training
- Oct
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, “A compact model for speaker-adaptive training,” in Proc. ICSLP 96, Oct. 1996, Vol. 2, pp. 1137-1140.
- (1996) Proc. ICSLP 96 , vol.2 , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

15
- 85133679946
- Speaker adaptive training for voice conversion based on eigenvoice
- SP2006-40, [in Japanese]
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, “Speaker adaptive training for voice conversion based on eigenvoice,” IEICE Tech. Rep., SP2006-40, pp. 31-36, 2006 [in Japanese].
- (2006) IEICE Tech. Rep , pp. 31-36
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

16
- 84876483227
- JNAS: Japanese Newspaper Article Sentences. http://www.mibel.cs.tsukuba.ac.jp/jnas/instruct.html
- JNAS: Japanese Newspaper Article Sentences

17
- 85009291324
- Extraction of everyday expression associated with voice quality of normal utterance
- [in Japanese]
- H. Kido, and H. Kasuya, “Extraction of everyday expression associated with voice quality of normal utterance,” J. Acoust. Soc. Jpn., Vol. 55, No. 6, pp. 405-411, 1999 [in Japanese].
- (1999) J. Acoust. Soc. Jpn , vol.55 , Issue.6 , pp. 405-411
- Kido, H.¹ Kasuya, H.²

18
- 60649119506
- Everyday expressions associated with voice quality of normal utterance —Extraction by perceptual evaluation
- [in Japanese]
- H. Kido, and H. Kasuya, “Everyday expressions associated with voice quality of normal utterance —Extraction by perceptual evaluation—,” J. Acoust. Soc. Jpn., Vol. 57, No. 5, pp. 337-344, 2001 [in Japanese].
- (2001) J. Acoust. Soc. Jpn , vol.57 , Issue.5 , pp. 337-344
- Kido, H.¹ Kasuya, H.²

19
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds,” Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.