Volume 08-12-September-2016, 2016, Pages 1652-1656

Locally linear embedding for exemplar-based spectral conversion

Author keywords

Exemplar; Locally linear embedding; Voice conversion; Voice conversion challenge

Indexed keywords

FEATURE EXTRACTION; IMAGE PROCESSING; LEARNING ALGORITHMS; MAXIMUM LIKELIHOOD; QUALITY CONTROL; SPEECH COMMUNICATION

EID: 84994247053     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: 10.21437/Interspeech.2016-567     Document Type: Conference Paper
Times cited: 42

References (27)
  • 1
    • 0032026483
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, Mar. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 2
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 3
    • 84946033919
    • Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion
    • S. Takamichi, T. Toda, A. W. Black, and S. Nakamura, "Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion," Proc. ICASSP, 2015.
    • (2015) Proc. ICASSP
    • Takamichi, S.1    Toda, T.2    Black, A.W.3    Nakamura, S.4
  • 4
    • 84893234191
    • Incorporating global variance in the training phase of GMM-based voice conversion
    • H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Incorporating global variance in the training phase of GMM-based voice conversion," Proc. APSIPA, 2013.
    • (2013) Proc. APSIPA
    • Hwang, H.T.1    Tsao, Y.2    Wang, H.M.3    Wang, Y.R.4    Chen, S.H.5
  • 5
    • 77953727123
    • Voice conversion based on weighted frequency warping
    • D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 922-931, July 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.5 , pp. 922-931
    • Erro, D.1    Moreno, A.2    Bonafonte, A.3
  • 6
    • 84857498745
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1313-1323, May 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.4 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 8
    • 84921735339
    • Voice conversion using deep neural networks with layer-wise generative training
    • L. H. Chen, Z. H. Ling, L. J. Liu, and L. R. Dai, "Voice conversion using deep neural networks with layer-wise generative training," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 22, no. 12, pp. 1859-1872, 2014.
    • (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process , vol.22 , Issue.12 , pp. 1859-1872
    • Chen, L.H.1    Ling, Z.H.2    Liu, L.J.3    Dai, L.R.4
  • 10
    • 84986185211
    • A probabilistic interpretation for artificial neural network-based voice conversion
    • H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," Proc. APSIPA, 2015.
    • (2015) Proc. APSIPA
    • Hwang, H.T.1    Tsao, Y.2    Wang, H.M.3    Wang, Y.R.4    Chen, S.H.5
  • 13
    • 84911369131
    • Exemplar-based sparse representation with residual compensation for voice conversion
    • Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplar-based sparse representation with residual compensation for voice conversion," IEEE/ACM Trans. Audio, Speech, Lang. Process., vol. 22, no. 10, pp. 1506-1521, 2014.
    • (2014) IEEE/ACM Trans. Audio, Speech, Lang. Process , vol.22 , Issue.10 , pp. 1506-1521
    • Wu, Z.1    Virtanen, T.2    Chng, E.S.3    Li, H.4
  • 17
    • 0034704229
    • A global geometric framework for nonlinear dimensionality reduction
    • J.B. Tenenbaum, V. De Silva, and J.C. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol. 290, no. 5500, pp. 2319-2323, 2000.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
    • Tenenbaum, J.B.1    De Silva, V.2    Langford, J.C.3
  • 18
    • 0043278893
    • Laplacian eigenmaps and spectral techniques for embedding and clustering
    • M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering," Advances in neural information processing systems, vol. 14, pp. 585-591, 2001.
    • (2001) Advances in Neural Information Processing Systems , vol.14 , pp. 585-591
    • Belkin, M.1    Niyogi, P.2
  • 19
    • 0034704222
    • Nonlinear dimensionality reduction by locally linear embedding
    • S.T. Roweis and L.K. Saul, "Nonlinear dimensionality reduction by locally linear embedding," Science, vol. 290, no. 5500, pp. 2323-2326, 2000.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
    • Roweis, S.T.1    Saul, L.K.2
  • 20
    • 5044219639
    • Super-resolution through neighbor embedding
    • H. Chang, D.Y. Yeung, and Y. Xiong, "Super-resolution through neighbor embedding," Proc. CVPR, 2004.
    • (2004) Proc. CVPR
    • Chang, H.1    Yeung, D.Y.2    Xiong, Y.3
  • 24
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, 1999.
    • (1999) Speech Commun , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 25
    • 84994338928
    • Festvox. Available: http://www.festvox.org/download.html.
    • Festvox
  • 27
    • 84994351528
    • Analysis of the voice conversion challenge 2016 evaluation results
    • M. Wester, Z. Wu and J. Yamagishi, "Analysis of the Voice Conversion Challenge 2016 Evaluation Results," Proc. INTERSPEECH, 2016.
    • (2016) Proc. INTERSPEECH
    • Wester, M.1    Wu, Z.2    Yamagishi, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.