SCOPUS 정보 검색 플랫폼

Volumn 1, Issue , 2012, Pages 98-101

Effects of speaker adaptive training on tensor-based arbitrary speaker conversion

Author keywords

Eigenvoice; Gaussian mixture model; Speaker adaptive training; Tucker decomposition; Voice conversion

Indexed keywords

EIGENVOICES; GAUSSIAN MIXTURE MODEL; SPEAKER ADAPTIVE TRAININGS; TUCKER DECOMPOSITIONS; VOICE CONVERSION;

SPEECH PROCESSING; TENSORS;

SPEECH RECOGNITION;

EID: 84878378722 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (16)

1
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," Proc. ICASSP, pp. 655-658, 1988.
- (1988) Proc. ICASSP , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, vol. 1, pp. 285-288, 1998.
- (1998) Proc. ICASSP , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

3
- 0034855352
- High-performance robust speech recognition using stereo training data
- L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "High-performance robust speech recognition using stereo training data," Proc. ICASSP, pp. 301-304, 2001.
- (2001) Proc. ICASSP , pp. 301-304
- Deng, L.¹ Acero, A.² Jiang, L.³ Droppo, J.⁴ Huang, X.⁵

4
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

5
- 44949210554
- Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
- C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training," Proc. INTERSPEECH, pp. 2254-2257, 2006.
- (2006) Proc. INTERSPEECH , pp. 2254-2257
- Lee, C.H.¹ Wu, C.H.²

6
- 34547512822
- Eigenvoice conversion based on Gaussian mixture model
- T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model," Proc. INTERSPEECH, pp. 2446-2449, 2006.
- (2006) Proc. INTERSPEECH , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

7
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- R. Kuhn, J-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in Eigenvoice space," IEEE Trans. on Speech and Audio Processing, vol. 8, no. 6, pp. 695-707, 2000.
- (2000) IEEE Trans. on Speech and Audio Processing , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

8
- 58349106697
- A study of interspeaker variability in speaker verification
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of interspeaker variability in speaker verification," IEEE Trans. on Audio, Speech, and Language Processing, vol. 16, no. 5, pp. 980-988, 2008.
- (2008) IEEE Trans. on Audio, Speech, and Language Processing , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

9
- 84865798483
- One-tomany voice conversion based on tensor representation of speaker space
- D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomany voice conversion based on tensor representation of speaker space," Proc. INTERSPEECH, pp. 653-656, 2011.
- (2011) Proc. INTERSPEECH , pp. 653-656
- Saito, D.¹ Yamamoto, K.² Minematsu, N.³ Hirose, K.⁴

10
- 0030362995
- A compact model for speaker adaptive training
- T. Anastasakos, J. McDonough, R. Schwarts and J. Makhoul, "A compact model for speaker adaptive training," Proc. ICSLP, vol. 2, pp. 1137-1140, 1996.
- (1996) Proc. ICSLP , vol.2 , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwarts, R.³ Makhoul, J.⁴

11
- 70450182468
- Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model," Proc. INTERSPEECH, pp. 1981-1984, 2007.
- (2007) Proc. INTERSPEECH , pp. 1981-1984
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

12
- 0013953617
- Some mathematical notes on three-mode factor analysis
- L. R. Tucker, "Some mathematical notes on three-mode factor analysis," Psychometrika, vol. 31, no. 3, pp. 279-311, 1966.
- (1966) Psychometrika , vol.31 , Issue.3 , pp. 279-311
- Tucker, L.R.¹

13
- 78049396810
- Speaker adaptation based on the multilinear decomposition of training speaker models
- Y. Jeong, "Speaker adaptation based on the multilinear decomposition of training speaker models," Proc. ICASSP, pp. 4870-4873, 2010.
- (2010) Proc. ICASSP , pp. 4870-4873
- Jeong, Y.¹

14
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis," Speech Communication, vol.9, pp.357-363, 1990.
- (1990) Speech Communication , vol.9 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kuwabara, H.⁵ Shikano, K.⁶

15
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol.27, pp.187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

16
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
- (2007) IEEE Trans. on Audio, Speech, and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.