SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 7909-7913

Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion

(3) Benisty, H a Malah, D a Crammer, K a

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

Author keywords

Gaussian Mixture Model (GMM); INCA; Non Parallel Voice Conversion; Spectral Distance

Indexed keywords

CODES (SYMBOLS); ITERATIVE METHODS; QUALITY CONTROL; SIGNAL PROCESSING;

GAUSSIAN MIXTURE MODEL; INCA; ITERATIVE ESTIMATION; MINIMIZATION PROBLEMS; NEAREST NEIGHBOR SEARCH; SPECTRAL DISTANCES; SUBJECTIVE EVALUATIONS; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84905234183 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6855140 Document Type: Conference Paper

Times cited : (21)

References (19)

1
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

2
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis, " in Proc. ICASSP, IEEE, 1998, vol. 1, pp. 285-288.
- (1998) Proc. ICASSP, IEEE , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

3
- 0034841948
- Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
- A. Kain and M. W. Macon, "Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction, " in Proc. ICASSP, IEEE, 2001, vol. 2, pp. 813-816.
- (2001) Proc. ICASSP, IEEE , vol.2 , pp. 813-816
- Kain, A.¹ Macon, M.W.²

4
- 0034842552
- Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
- T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum, " in Proc. ICASSP, IEEE, 2001, vol. 2, pp. 841-844.
- (2001) Proc. ICASSP, IEEE , vol.2 , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

5
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

6
- 77953727123
- Voice conversion based on weighted frequency warping
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping, " IEEE Trans. Audio, Speech and Language Processing, vol. 18, no. 5, pp. 922-931, 2010.
- (2010) IEEE Trans. Audio, Speech and Language Processing , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

7
- 77953712499
- Voice conversion using partial least squares regression
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression, " IEEE Trans. Audio, Speech and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
- (2010) IEEE Trans. Audio, Speech and Language Processing , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

8
- 51449121435
- Textindependent voice conversion based on state mapped codebook
- M. Zhang, J. Tao, J. Tian, and X. Wang, "Textindependent voice conversion based on state mapped codebook, " in Proc. ICASSP, IEEE, 2008, pp. 4605-4608.
- (2008) Proc. ICASSP, IEEE , pp. 4605-4608
- Zhang, M.¹ Tao, J.² Tian, J.³ Wang, X.⁴

9
- 84890484652
- Non-parallel training for voice conversion based on adaptation method
- P. Song, W. Zheng, and L. Zhao, "Non-parallel training for voice conversion based on adaptation method, " in Proc. ICASSP, IEEE, 2013.
- (2013) Proc. ICASSP, IEEE
- Song, P.¹ Zheng, W.² Zhao, L.³

10
- 4544297119
- Nonparallel training for voice conversion by maximum likelihood constrained adaptation
- A. Mouchtaris, J. Van der Spiegel, and P. Mueller, "Nonparallel training for voice conversion by maximum likelihood constrained adaptation, " in Proc. ICASSP, IEEE, 2004, vol. 1, pp. I-1.
- (2004) Proc. ICASSP, IEEE , vol.1
- Mouchtaris, A.¹ Spiegel Der J.Van² Mueller, P.³

11
- 34547512822
- Eigenvoice conversion based on Gaussian mixture model
- T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model, " in Proc. ICSLP, pp. 2446-2449.
- Proc. ICSLP , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

12
- 77953725318
- INCA algorithm for training voice conversion systems from nonparallel corpora
- D. Erro, A. Moreno, and A. Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora, " IEEE Trans. Audio, Speech and Language Processing, vol. 18, no. 5, pp. 944-953, 2010.
- (2010) IEEE Trans. Audio, Speech and Language Processing , vol.18 , Issue.5 , pp. 944-953
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

13
- 84867198185
- On the impact of alignment on voice conversion performance
- E. Helander, J. Schwarz, J. Nurminen, H. Silen, and M. Gabbouj, "On the impact of alignment on voice conversion performance., " in Proc. INTERSPEECH, 2008, pp. 1453-1456.
- (2008) Proc. INTERSPEECH , pp. 1453-1456
- Helander, E.¹ Schwarz, J.² Nurminen, J.³ Silen, H.⁴ Gabbouj, M.⁵

14
- 0001560954
- Information geometry and alternatingminimization procedures
- I. Csiszar and G. Tusnady, "Information geometry and alternatingminimization procedures, " Statistics and Decisions, vol. 1, pp. 205-237, 1984.
- (1984) Statistics and Decisions , vol.1 , pp. 205-237
- Csiszar, I.¹ Tusnady, G.²

15
- 33646773080
- J. Kominek and A. W. Black, "CMU ARCTIC databases for speech synthesis, " 2003.
- (2003) CMU ARCTIC Databases for Speech Synthesis
- Kominek, J.¹ Black, A.W.²

16
- 84905247158
- http://aholab. ehu. es/users/derro/software. html.

17
- 85135177301
- Highquality speech modifcation based on a harmonic + noise model
- Y. Stylianou, J. Laroche, and E. Moulines, "Highquality speech modifcation based on a harmonic + noise model, " in Proc. EUROSPEECH, 1995.
- (1995) Proc. EUROSPEECH
- Stylianou, Y.¹ Laroche, J.² Moulines, E.³

18
- 0035127703
- Applying the harmonic plus noise model in concatenative speech synthesis
- Y. Stylianou, "Applying the harmonic plus noise model in concatenative speech synthesis, " IEEE Trans. Speech and Audio Processing, vol. 9, no. 1, pp. 21-29, 2001.
- (2001) IEEE Trans. Speech and Audio Processing , vol.9 , Issue.1 , pp. 21-29
- Stylianou, Y.¹

19
- 51449107658
- LSF mapping for voice conversion with very small training sets
- E. Helander, J. Nurminen, and M. Gabbouj, "LSF mapping for voice conversion with very small training sets, " in Proc. ICASSP, IEEE, 2008, pp. 4669-4672.
- (2008) Proc. ICASSP, IEEE , pp. 4669-4672
- Helander, E.¹ Nurminen, J.² Gabbouj, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.