SCOPUS Information Search Platform

6th ISCA Workshop on Speech Synthesis, SSW 2007

Volume , Issue , 2007, Pages 107-112

An Evaluation of Many-to-One Voice Conversion Algorithms with Pre-Stored Speaker Data Sets

(5) Tani, Daisuke a Ohtani, Yamato a Toda, Tomoki a Saruwatari, Hiroshi a Shikano, Kiyohiro a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

EVC; many to one VC; SAT; speaker selection; voice conversion

Indexed keywords

DATA SET; EIGENVOICE CONVERSION; EIGENVOICES; MANY-TO-ONE; MANY-TO-ONE VOICE CONVERSION; SPEAKER ADAPTIVE TRAININGS; SPEAKER SELECTION; VOICE CONVERSION; VOICE CONVERSION ALGORITHM

EID: 78649286010 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1)

References (18)

1
- 0029256373
- Acoustic characteristics of speaker individuality control and conversion
- H. Kuwabara, and Y. Sagisaka. Acoustic characteristics of speaker individuality control and conversion. Speech Communication, Vol. 16, No. 2, pp. 165-173, 1995.
- (1995) Speech Communication , vol.16 , Issue.2 , pp. 165-173
- Kuwabara, H.¹ Sagisaka, Y.²

2
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Seattle, USA, May
- A. Kain and M.W. Macon. Spectral voice conversion for text-to-speech synthesis. Proc. ICASSP, pp. 285-288, Seattle, USA, May 1998.
- (1998) Proc. ICASSP , pp. 285-288
- Kain, A.¹ Macon, M.W.²

3
- 0033692729
- Narrowband to wideband conversion of speech using GMM based transformation
- K-Y Park and H. S. Kim. Narrowband to wideband conversion of speech using GMM based transformation. Proc. ICASSP, pp. 1843-1846, 2000.
- (2000) Proc. ICASSP , pp. 1843-1846
- Park, K-Y¹ Kim, H. S.²

4
- 44949187612
- Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion
- Pittsburgh, USA, Sep
- M. Nakagiri, T. Toda, H. Kashioka and K. Shikano. Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion. Proc. INTERSPEECH2006-ICSLP, pp. 2270-2273, Pittsburgh, USA, Sep. 2006.
- (2006) Proc. INTERSPEECH2006-ICSLP , pp. 2270-2273
- Nakagiri, M.¹ Toda, T.² Kashioka, H.³ Shikano, K.⁴

5
- 85004448479
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara. Voice conversion through vector quantization. J. Acoust. Soc. Jpn. (E), Vol. 11, No. 2, pp. 71-76, 1990.
- (1990) J. Acoust. Soc. Jpn. (E) , vol.11 , Issue.2 , pp. 71-76
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

6
- 4544306344
- Cross-language voice conversion using bilingual database
- July
- M. Mashimo, T. Toda, H. Kawanami, K. Shikano and N. Campbell. Cross-language voice conversion using bilingual database. IPSJ Journal, Vol. 43, No. 7, pp. 2177-2185, July 2002.
- (2002) IPSJ Journal , vol.43 , Issue.7 , pp. 2177-2185
- Mashimo, M.¹ Toda, T.² Kawanami, H.³ Shikano, K.⁴ Campbell, N.⁵

7
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappe and E. Moulines. Continuous probabilistic transform for voice conversion. IEEE Trans. on Speech and Audio Processing, Vol. 6, No. 2, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

8
- 33646779506
- Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
- T. Toda, A.W. Black and K. Tokuda. Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter. Proc. ICASSP, pp. 9-12, 2005.
- (2005) Proc. ICASSP , pp. 9-12
- Toda, T.¹ Black, A.W.² Tokuda, K.³

9
- 34047254509
- Quality-enhanced voice morphing using maximum likelihood transformations
- H. Ye and S. Young. Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Trans. ASLP, Vol. 14, No. 4, pp. 1301-1312, 2006.
- (2006) IEEE Trans. ASLP , vol.14 , Issue.4 , pp. 1301-1312
- Ye, H.¹ Young, S.²

10
- 33947623206
- Text-independent voice conversion based on unit selection
- D. Sundermann, H. Hoge, A. Bonafonte, H. Ney, A. Black and S. Narayanan. Text-independent voice conversion based on unit selection. Proc. ICASSP, pp. 81-84, 2006.
- (2006) Proc. ICASSP , pp. 81-84
- Sundermann, D.¹ Hoge, H.² Bonafonte, A.³ Ney, H.⁴ Black, A.⁵ Narayanan, S.⁶

11
- 4544297119
- Non-parallel training for voice conversion by maximum likelihood constrained adaptation
- May
- A. Mouchtaris, J. Spiegel, and P. Mueller. Non-parallel training for voice conversion by maximum likelihood constrained adaptation. Proc. ICASSP, pp. 1-4, May 2004.
- (2004) Proc. ICASSP , pp. 1-4
- Mouchtaris, A.¹ Spiegel, J.² Mueller, P.³

12
- 85084062692
- MAP-based adaptation for speech conversion using adaptation data selection and nonparallel training
- Sept
- C.H. Lee and C.H. Wu. MAP-based adaptation for speech conversion using adaptation data selection and nonparallel training. Proc. ICSLP, pp. 1164-1167, Sept. 2006.
- (2006) Proc. ICSLP , pp. 1164-1167
- Lee, C.H.¹ Wu, C.H.²

13
- 34547512822
- Eigenvoice Conversion Based on Gaussian Mixture Model
- Sept
- T. Toda, Y. Ohtani and K. Shikano. Eigenvoice Conversion Based on Gaussian Mixture Model. Proc. ICSLP, pp. 2446-2449, Sept. 2006.
- (2006) Proc. ICSLP , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

14
- 34547496175
- One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices
- Apr
- T. Toda, Y. Ohtani and K. Shikano. One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices. Proc. ICASSP, Vol. 4, pp. 1249-1252, Apr. 2007.
- (2007) Proc. ICASSP , vol.4 , pp. 1249-1252
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

15
- 0034848875
- Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers
- May
- S. Yoshizawa, A. Baba, K. Matsunami, Y. Mera, M. Yamada and K. Shikano. Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers. Proc. ICASSP2001, pp. 341-344, May 2001.
- (2001) Proc. ICASSP2001 , pp. 341-344
- Yoshizawa, S.¹ Baba, A.² Matsunami, K.³ Mera, Y.⁴ Yamada, M.⁵ Shikano, K.⁶

16
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz and J. Makhoul. A compact model for speaker-adaptive training. Proc. ICSLP, Vol. 2, 1996.
- (1996) Proc. ICSLP , vol.2
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

17
- 84876483227
- JNAS: Japanese Newspaper Article Sentences http://www.milab.is.tsukuba.ac.jp/jnas/instruct.html
- JNAS: Japanese Newspaper Article Sentences

18
- 0032673049
- Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne. Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigne, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.