메뉴 건너뛰기




Volumn 2016-May, Issue , 2016, Pages 5645-5649

The matching-minimization algorithm, the INCA algorithm and a mathematical framework for voice conversion with unaligned corpora

Author keywords

INCA; matching minimization; nearest neighbour; voice conversion; voice transformation

Indexed keywords


EID: 84973373608     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2016.7472758     Document Type: Conference Paper
Times cited : (11)

References (25)
  • 1
    • 84905234183 scopus 로고    scopus 로고
    • Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion
    • Hadas Benisty, David Malah, and Koby Crammer, "Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion., " in ICASSP, 2014
    • (2014) ICASSP
    • Benisty, H.1    Malah, D.2    Crammer, K.3
  • 2
    • 84930664922 scopus 로고    scopus 로고
    • Vocaine the vocoder and applications in speech synthesis
    • Yannis Agiomyrgiannakis, "Vocaine the vocoder and applications in speech synthesis, " in ICASSP, 2015
    • (2015) ICASSP
    • Agiomyrgiannakis, Y.1
  • 3
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA technique
    • H Valbret, Eric Moulines, and Jean-Pierre Tubach, "Voice transformation using PSOLA technique, " Speech Communication, vol. 11, no. 2, pp. 175-187, 1992
    • (1992) Speech Communication , vol.11 , Issue.2 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.3
  • 5
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • Alexander Kain and MichaelWMacon, "Spectral voice conversion for text-to-speech synthesis, " in ICASSP. IEEE, 1998, vol. 1, pp. 285-288
    • (1998) ICASSP. IEEE , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 6
    • 85009224898 scopus 로고    scopus 로고
    • Perceptually weighted linear transformations for voice conversion
    • Hui Ye and Steve Young, "Perceptually weighted linear transformations for voice conversion, " in Proc. of the Eurospeech'03, 2003
    • (2003) Proc. of the Eurospeech'03
    • Ye, H.1    Young, S.2
  • 10
    • 0033100038 scopus 로고    scopus 로고
    • Maximumlikelihood stochastic-transformation adaptation of hidden markov models
    • Vassilis D Diakoloukas and Vassilios V Digalakis, "Maximumlikelihood stochastic-transformation adaptation of hidden markov models, " Speech and Audio Processing, IEEE Transactions on, vol. 7, no. 2, pp. 177-187, 1999
    • (1999) Speech and Audio Processing, IEEE Transactions on , vol.7 , Issue.2 , pp. 177-187
    • Diakoloukas, V.D.1    Digalakis, V.V.2
  • 11
    • 4544297119 scopus 로고    scopus 로고
    • Nonparallel training for voice conversion by maximum likelihood constrained adaptation
    • Athanasios Mouchtaris, Jan Van der Spiegel, and Paul Mueller, "Nonparallel training for voice conversion by maximum likelihood constrained adaptation, " in ICASSP. IEEE, 2004, vol. 1, pp. I-1
    • (2004) ICASSP. IEEE , vol.1 , pp. I-1
    • Mouchtaris, A.1    Van Der Spiegel, J.2    Mueller, P.3
  • 12
    • 0034842740 scopus 로고    scopus 로고
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
    • Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, and Takao Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR, " in ICASSP. IEEE, 2001, vol. 2, pp. 805-808
    • (2001) ICASSP. IEEE , vol.2 , pp. 805-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 14
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " in Proc. Eurospeech, 1999, pp. 2347-2350
    • (1999) Proc. Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 15
    • 84973338782 scopus 로고    scopus 로고
    • Speech synthesis with neural networks
    • cs. NE/9811031
    • Orhan Karaali, Gerald Corrigan, and Ira A. Gerson, "Speech synthesis with neural networks, " CoRR, vol. cs. NE/9811031, 1998
    • (1998) CoRR
    • Karaali, O.1    Corrigan, G.2    Gerson, I.A.3
  • 16
    • 84890490547 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis using deep neural networks
    • Heiga Zen, Andrew Senior, and Mike Schuster, "Statistical parametric speech synthesis using deep neural networks, " in ICASSP. IEEE, 2013, pp. 7962-7966
    • (2013) ICASSP. IEEE , pp. 7962-7966
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 18
    • 84906280857 scopus 로고    scopus 로고
    • Voice conversion in high-order eigen space using deep belief nets
    • ISCA
    • Toru Nakashika, Ryoichi Takashima, Tetsuya Takiguchi, and Yasuo Ariki, "Voice conversion in high-order eigen space using deep belief nets., " in Interspeech. 2013, pp. 369-372, ISCA
    • (2013) Interspeech. , pp. 369-372
    • Nakashika, T.1    Takashima, R.2    Takiguchi, T.3    Ariki, Y.4
  • 19
    • 84946051934 scopus 로고    scopus 로고
    • Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
    • April
    • Yuchen Fan, Yao Qian, F. K. Soong, and Lei He, "Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis, " in ICASSP, April 2015, pp. 4475-4479
    • (2015) ICASSP , pp. 4475-4479
    • Fan, Y.1    Qian, Y.2    Soong, F.K.3    He, L.4
  • 21
    • 56149106209 scopus 로고    scopus 로고
    • Frame alignment method for cross-lingual voice conversion
    • Daniel Erro and Asuncíon Moreno, "Frame alignment method for cross-lingual voice conversion, " in Interspeech, 2007
    • (2007) Interspeech
    • Erro, D.1    Moreno, A.2
  • 22
    • 29144534131 scopus 로고    scopus 로고
    • Convergence theorems for generalized alternating minimization procedures
    • December
    • Asela Gunawardana and William Byrne, "Convergence theorems for generalized alternating minimization procedures, " Journal of Machine Learning Research, December 2005
    • (2005) Journal of Machine Learning Research
    • Gunawardana, A.1    Byrne, W.2
  • 23
    • 0032202775 scopus 로고    scopus 로고
    • Deterministic annealing for clustering, compression, classification, regression, and related optimization problems
    • Kenneth Rose, "Deterministic annealing for clustering, compression, classification, regression, and related optimization problems, " Proceedings of the IEEE, vol. 86, no. 11, pp. 2210-2239, 1998
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2210-2239
    • Rose, K.1
  • 24
    • 84906257669 scopus 로고    scopus 로고
    • Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression
    • Hanna Silén, Jani Nurminen, Elina Helander, and Moncef Gabbouj, "Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression, " in Interspeech, 2013
    • (2013) Interspeech
    • Silén, H.1    Nurminen, J.2    Helander, E.3    Gabbouj, M.4
  • 25
    • 84973359606 scopus 로고    scopus 로고
    • Voice morphing that improves TTS quality using an optimal dynamic frequency warping-and-weighting transform
    • Yannis Agiomyrgiannakis, "Voice Morphing that improves TTS quality using an Optimal Dynamic Frequency Warping-and-Weighting transform, " in ICASSP, 2016.
    • (2016) ICASSP
    • Agiomyrgiannakis, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.