SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn 08-12-September-2016, Issue , 2016, Pages 1652-1656

Locally linear embedding for exemplar-based spectral conversion

(5) Wu, Yi Chiao a Hwang, Hsin Te a Hsu, Chin Cheng a Tsao, Yu b Wang, Hsin Min a

a INSTITUTE OF INFORMATION SCIENCE (Taiwan)

b RESEARCH CENTER FOR INFORMATION TECHNOLOGY INNOVATION (Taiwan)

Author keywords

Exemplar; Locally linear embedding; Voice conversion; Voice conversion challenge

Indexed keywords

FEATURE EXTRACTION; IMAGE PROCESSING; LEARNING ALGORITHMS; MAXIMUM LIKELIHOOD; QUALITY CONTROL; SPEECH COMMUNICATION;

COMPENSATION METHOD; EXEMPLAR; LOCALLY LINEAR EMBEDDING; LOCALLY LINEAR EMBEDDING ALGORITHMS; MANIFOLD LEARNING ALGORITHM; SPECTRAL CONVERSION; SUBJECTIVE EVALUATIONS; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84994247053 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: 10.21437/Interspeech.2016-567 Document Type: Conference Paper

Times cited : (42)

References (27)

1
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp.131-142, Mar. 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

2
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang., Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang., Process , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

3
- 84946033919
- Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion
- S. Takamichi, T. Toda, A. W. Black, and S. Nakamura, "Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion," Proc. ICASSP, 2015.
- (2015) Proc. ICASSP
- Takamichi, S.¹ Toda, T.² Black, A.W.³ Nakamura, S.⁴

4
- 84893234191
- Incorporating global variance in the training phase of GMMbased voice conversion
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Incorporating global variance in the training phase of GMMbased voice conversion," Proc. APSIPA, 2013.
- (2013) Proc. APSIPA
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

5
- 77953727123
- Voice conversion based on weighted frequency warping
- July
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang., Process., vol. 18, no. 5, pp. 922-931, July. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang., Process , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

6
- 84857498745
- Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
- May
- E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang., Process, vol. 20, no. 4, pp. 1313-1323, May. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang., Process , vol.20 , Issue.4 , pp. 1313-1323
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

7
- 77953707533
- Spectral mapping using artificial neural networks for voice conversion
- S. Desai, A. W. Black, B. Yegnanarayana, and K. Prahallad, "Spectral mapping using artificial neural networks for voice conversion," IEEE Trans. Audio, Speech, Lang., Process., vol. 18, no. 5, pp. 954-964, 2010.
- (2010) IEEE Trans. Audio, Speech, Lang., Process , vol.18 , Issue.5 , pp. 954-964
- Desai, S.¹ Black, A.W.² Yegnanarayana, B.³ Prahallad, K.⁴

8
- 84921735339
- Voice conversion using deep neural networks with layer-wise generative training
- L. H. Chen, Z. H. Ling, L. J. Liu, and L. R. Dai, "Voice conversion using deep neural networks with layer-wise generative training," IEEE/ACM Trans. Audio, Speech, Lang., Process., vol. 22, no. 12, pp.1859-1872, 2014.
- (2014) IEEE/ACM Trans. Audio, Speech, Lang., Process , vol.22 , Issue.12 , pp. 1859-1872
- Chen, L.H.¹ Ling, Z.H.² Liu, L.J.³ Dai, L.R.⁴

9
- 84906280857
- Voice conversion in high-order eigen space using deep belief nets
- T. Nakashika, R. Takashima, T. Takiguchi, and Y. Ariki, "Voice conversion in high-order eigen space using deep belief nets," Proc. INTERSPEEH, 2013.
- (2013) Proc. INTERSPEEH
- Nakashika, T.¹ Takashima, R.² Takiguchi, T.³ Ariki, Y.⁴

10
- 84986185211
- A probabilistic interpretation for artificial neural network-based voice conversion
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," Proc. APSIPA, 2015.
- (2015) Proc. APSIPA
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

11
- 84874248255
- Exemplar-based voice conversion in noisy environment
- R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," Proc. Spoken Language Technology Workshop (SLT), 2012.
- (2012) Proc. Spoken Language Technology Workshop (SLT
- Takashima, R.¹ Takiguchi, T.² Ariki, Y.³

12
- 84901803470
- Exemplar based voice conversion using non-negative spectrogram deconvolution
- Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar based voice conversion using non-negative spectrogram deconvolution," Proc. 8th ISCA Speech Synth. Workshop (SSW8), 2013.
- (2013) Proc. 8th ISCA Speech Synth. Workshop (SSW8
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

13
- 84911369131
- Exemplar-based sparse representation with residual compensation for voice conversion
- Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplar-based sparse representation with residual compensation for voice conversion," IEEE/ACM Trans. Audio, Speech, Lang., Process., vol. 22, no. 10, pp.1506-1521, 2014.
- (2014) IEEE/ACM Trans. Audio, Speech, Lang., Process , vol.22 , Issue.10 , pp. 1506-1521
- Wu, Z.¹ Virtanen, T.² Chng, E.S.³ Li, H.⁴

14
- 70350487954
- Dimensionality reduction: A comparative review
- L. J. P. van der Maaten, E. O. Postma, and H. J. van den Herik. "Dimensionality reduction: A comparative review." Journal of Machine Learning Research 10.1-41 (2009): 66-71.
- (2009) Journal of Machine Learning Research 10 , vol.1 , Issue.41 , pp. 66-71
- Maaten Der Van, P.L.J.¹ Postma, E.O.² Herik Den Van, H.J.³

15
- 84879854889
- Representation learning: A review and new perspectives
- Y. Bengio, A. Courville, and P. Vincent, P. "Representation learning: A review and new perspectives," Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(8), pp. 1798-1828, 2013.
- (2013) Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.35 , Issue.8 , pp. 1798-1828
- Bengio, Y.¹ Courville, A.² Vincent P, P.³

16
- 85032996208
- Stochastic neighbor embedding
- G. Hinton and S. Roweis, "Stochastic neighbor embedding," Advances in neural information processing systems, vol. 15, pp. 833-840, 2002.
- (2002) Advances in Neural Information Processing Systems , vol.15 , pp. 833-840
- Hinton, G.¹ Roweis, S.²

17
- 0034704229
- A global geometric framework for nonlinear dimensionality reduction
- J.B. Tenenbaum, V. De Silva, and J.C. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol. 290, no. 5500, pp. 2319-2323, 2000.
- (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
- Tenenbaum, J.B.¹ De Silva, V.² Langford, J.C.³

18
- 0043278893
- Laplacian eigenmaps and spectral techniques for embedding and clustering
- M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering," Advances in neural information processing systems, vol. 14, pp. 585-591, 2001.
- (2001) Advances in Neural Information Processing Systems , vol.14 , pp. 585-591
- Belkin, M.¹ Niyogi, P.²

19
- 0034704222
- Nonlinear dimensionality reduction by locally linear embedding
- S.T. Roweis and L.K. Saul, "Nonlinear dimensionality reduction by locally linear embedding," Science, vol. 290, no. 5500, pp. 2323-2326, 2000.
- (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
- Roweis, S.T.¹ Saul, L.K.²

20
- 5044219639
- Super-resolution through neighbor embedding
- H. Chang, D.Y. Yeung, and Y. Xiong, "Super-resolution through neighbor embedding," Proc. CVPR, 2004.
- (2004) Proc. CVPR
- Chang, H.¹ Yeung, D.Y.² Xiong, Y.³

21
- 0033708106
- Speech parameter generation algorithms for HMMbased speech synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. kitamura, "Speech parameter generation algorithms for HMMbased speech synthesis," Proc. ICASSP, 2000.
- (2000) Proc. ICASSP
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

22
- 84878384520
- Ways to implement global variance in statistical speech synthesis
- H. Silén, E. Helander, J. Nurminen, M. Gabbouj, "Ways to implement global variance in statistical speech synthesis," Proc. INTERSPEECH, 2012.
- (2012) Proc. INTERSPEECH
- Silén, H.¹ Helander, E.² Nurminen, J.³ Gabbouj, M.⁴

23
- 0345410356
- L.K. Saul and S.T. Roweis, "An introduction to locally linear embedding," (2001) Available from https://www.cs.nyu.edu/~roweis/lle/papers/lleintro.pdf.
- (2001) An Introduction to Locally Linear Embedding
- Saul, L.K.¹ Roweis, S.T.²

24
- 0032673049
- Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp.187-207, 1999.
- (1999) Speech Commun , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

25
- 84994338928
- Festvox. Available: http://www.festvox.org/download.html.
- Festvox

26
- 84994361374
- The voice conversion challenge 2016
- T. Toda, L. H. Chen, D. Saito, F. Villavicencio, M. Wester, Z. Wu and J. Yamagishi, "The Voice Conversion Challenge 2016," Proc. INTERSPEECH, 2016.
- (2016) Proc. INTERSPEECH
- Toda, T.¹ Chen, L.H.² Saito, D.³ Villavicencio, F.⁴ Wester, M.⁵ Wu, Z.⁶ Yamagishi, J.⁷

27
- 84994351528
- Analysis of the voice conversion challenge 2016 evaluation results
- M. Wester, Z. Wu and J. Yamagishi, "Analysis of the Voice Conversion Challenge 2016 Evaluation Results," Proc. INTERSPEECH, 2016.
- (2016) Proc. INTERSPEECH
- Wester, M.¹ Wu, Z.² Yamagishi, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.