SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2016-May, Issue , 2016, Pages 5645-5649

The matching-minimization algorithm, the INCA algorithm and a mathematical framework for voice conversion with unaligned corpora

(1) Agiomyrgiannakis, Yannis a

a GOOGLE INC (United States)

Author keywords

INCA; matching minimization; nearest neighbour; voice conversion; voice transformation

Indexed keywords

EID: 84973373608 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2016.7472758 Document Type: Conference Paper

Times cited : (11)

References (25)

1
- 84905234183
- Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion
- Hadas Benisty, David Malah, and Koby Crammer, "Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion., " in ICASSP, 2014
- (2014) ICASSP
- Benisty, H.¹ Malah, D.² Crammer, K.³

2
- 84930664922
- Vocaine the vocoder and applications in speech synthesis
- Yannis Agiomyrgiannakis, "Vocaine the vocoder and applications in speech synthesis, " in ICASSP, 2015
- (2015) ICASSP
- Agiomyrgiannakis, Y.¹

3
- 0026880275
- Voice transformation using PSOLA technique
- H Valbret, Eric Moulines, and Jean-Pierre Tubach, "Voice transformation using PSOLA technique, " Speech Communication, vol. 11, no. 2, pp. 175-187, 1992
- (1992) Speech Communication , vol.11 , Issue.2 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J.³

4
- 0032026483
- Continuous probabilistic transform for voice conversion
- Yannis Stylianou and Eric Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Transactions on Speech and Audio Processing, vol. 6, pp. 131-142, 1998
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , pp. 131-142
- Stylianou, Y.¹ Moulines, E.²

5
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Alexander Kain and MichaelWMacon, "Spectral voice conversion for text-to-speech synthesis, " in ICASSP. IEEE, 1998, vol. 1, pp. 285-288
- (1998) ICASSP. IEEE , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

6
- 85009224898
- Perceptually weighted linear transformations for voice conversion
- Hui Ye and Steve Young, "Perceptually weighted linear transformations for voice conversion, " in Proc. of the Eurospeech'03, 2003
- (2003) Proc. of the Eurospeech'03
- Ye, H.¹ Young, S.²

7
- 85009084358
- A first step towards text-independent voice conversion
- David Sündermann, A. Bonafonte, Hermann Ney, and Harald Höge, "A first step towards text-independent voice conversion, " in Proc. of the ICSLP'04, 2004
- (2004) Proc. of the ICSLP'04
- Sündermann, D.¹ Bonafonte, A.² Ney, H.³ Höge, H.⁴

8
- 84908466787
- Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts
- Arun Kumar and Ashish Verma, "Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts, " in Multimedia and Expo, 2003. ICME'03. Proceedings. 2003 International Conference on. IEEE, 2003, vol. 1, pp. I-393
- (2003) Multimedia and Expo, 2003. ICME'03. Proceedings. 2003 International Conference On. IEEE , vol.1 , pp. I-393
- Kumar, A.¹ Verma, A.²

9
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- Vassilios V Digalakis, Dimitry Rtischev, and Leonardo G Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures, " Speech and Audio Processing, IEEE Transactions on, vol. 3, no. 5, pp. 357-366, 1995
- (1995) Speech and Audio Processing, IEEE Transactions on , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.V.¹ Rtischev, D.² Neumeyer, L.G.³

10
- 0033100038
- Maximumlikelihood stochastic-transformation adaptation of hidden markov models
- Vassilis D Diakoloukas and Vassilios V Digalakis, "Maximumlikelihood stochastic-transformation adaptation of hidden markov models, " Speech and Audio Processing, IEEE Transactions on, vol. 7, no. 2, pp. 177-187, 1999
- (1999) Speech and Audio Processing, IEEE Transactions on , vol.7 , Issue.2 , pp. 177-187
- Diakoloukas, V.D.¹ Digalakis, V.V.²

11
- 4544297119
- Nonparallel training for voice conversion by maximum likelihood constrained adaptation
- Athanasios Mouchtaris, Jan Van der Spiegel, and Paul Mueller, "Nonparallel training for voice conversion by maximum likelihood constrained adaptation, " in ICASSP. IEEE, 2004, vol. 1, pp. I-1
- (2004) ICASSP. IEEE , vol.1 , pp. I-1
- Mouchtaris, A.¹ Van Der Spiegel, J.² Mueller, P.³

12
- 0034842740
- Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
- Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, and Takao Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR, " in ICASSP. IEEE, 2001, vol. 2, pp. 805-808
- (2001) ICASSP. IEEE , vol.2 , pp. 805-808
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

13
- 84966348891
- An HMM-based speech synthesis system applied to english
- IEEE
- Keiichi Tokuda, Heiga Zen, and Alan W Black, "An HMM-based speech synthesis system applied to english, " in Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on. IEEE, 2002, pp. 227-230
- (2002) Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on , pp. 227-230
- Tokuda, K.¹ Zen, H.² Black, A.W.³

14
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " in Proc. Eurospeech, 1999, pp. 2347-2350
- (1999) Proc. Eurospeech , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

15
- 84973338782
- Speech synthesis with neural networks
- cs. NE/9811031
- Orhan Karaali, Gerald Corrigan, and Ira A. Gerson, "Speech synthesis with neural networks, " CoRR, vol. cs. NE/9811031, 1998
- (1998) CoRR
- Karaali, O.¹ Corrigan, G.² Gerson, I.A.³

16
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- Heiga Zen, Andrew Senior, and Mike Schuster, "Statistical parametric speech synthesis using deep neural networks, " in ICASSP. IEEE, 2013, pp. 7962-7966
- (2013) ICASSP. IEEE , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

17
- 84959112868
- A study of speaker adaptation for DNN-based speech synthesis
- Date of Acceptance: 01/06/2015
- Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Stephen Renals, and Simon King, A study of speaker adaptation for DNN-based speech synthesis, International Speech Communication Association, 2015, Date of Acceptance: 01/06/2015
- (2015) International Speech Communication Association
- Wu, Z.¹ Swietojanski, P.² Veaux, C.³ Renals, S.⁴ King, S.⁵

18
- 84906280857
- Voice conversion in high-order eigen space using deep belief nets
- ISCA
- Toru Nakashika, Ryoichi Takashima, Tetsuya Takiguchi, and Yasuo Ariki, "Voice conversion in high-order eigen space using deep belief nets., " in Interspeech. 2013, pp. 369-372, ISCA
- (2013) Interspeech. , pp. 369-372
- Nakashika, T.¹ Takashima, R.² Takiguchi, T.³ Ariki, Y.⁴

19
- 84946051934
- Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
- April
- Yuchen Fan, Yao Qian, F. K. Soong, and Lei He, "Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis, " in ICASSP, April 2015, pp. 4475-4479
- (2015) ICASSP , pp. 4475-4479
- Fan, Y.¹ Qian, Y.² Soong, F.K.³ He, L.⁴

20
- 77953725318
- INCA algorithm for training voice conversion systems from nonparallel corpora
- Daniel Erro, Asuncíon Moreno, and A. Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 18, no. 5, pp. 944-953, 2010
- (2010) Audio, Speech, and Language Processing, IEEE Transactions on , vol.18 , Issue.5 , pp. 944-953
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

21
- 56149106209
- Frame alignment method for cross-lingual voice conversion
- Daniel Erro and Asuncíon Moreno, "Frame alignment method for cross-lingual voice conversion, " in Interspeech, 2007
- (2007) Interspeech
- Erro, D.¹ Moreno, A.²

22
- 29144534131
- Convergence theorems for generalized alternating minimization procedures
- December
- Asela Gunawardana and William Byrne, "Convergence theorems for generalized alternating minimization procedures, " Journal of Machine Learning Research, December 2005
- (2005) Journal of Machine Learning Research
- Gunawardana, A.¹ Byrne, W.²

23
- 0032202775
- Deterministic annealing for clustering, compression, classification, regression, and related optimization problems
- Kenneth Rose, "Deterministic annealing for clustering, compression, classification, regression, and related optimization problems, " Proceedings of the IEEE, vol. 86, no. 11, pp. 2210-2239, 1998
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2210-2239
- Rose, K.¹

24
- 84906257669
- Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression
- Hanna Silén, Jani Nurminen, Elina Helander, and Moncef Gabbouj, "Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression, " in Interspeech, 2013
- (2013) Interspeech
- Silén, H.¹ Nurminen, J.² Helander, E.³ Gabbouj, M.⁴

25
- 84973359606
- Voice morphing that improves TTS quality using an optimal dynamic frequency warping-and-weighting transform
- Yannis Agiomyrgiannakis, "Voice Morphing that improves TTS quality using an Optimal Dynamic Frequency Warping-and-Weighting transform, " in ICASSP, 2016.
- (2016) ICASSP
- Agiomyrgiannakis, Y.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.