메뉴 건너뛰기




Volumn , Issue , 2014, Pages 197-200

Pitch transformation in neural network based voice conversion

Author keywords

neural network; pitch; voice conversion

Indexed keywords

WAVELET DECOMPOSITION;

EID: 84912078522     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISCSLP.2014.6936599     Document Type: Conference Paper
Times cited : (9)

References (24)
  • 2
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. Black, and K. Tokuda,"Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. on Audio Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Trans. on Audio Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.2    Tokuda, K.3
  • 3
    • 0029254176 scopus 로고
    • Transformation of formants for voice conversion using artificial neural networks
    • M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol. 16, no. 2, pp. 207-216, 1995.
    • (1995) Speech Commun. , vol.16 , Issue.2 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 5
    • 84910068272 scopus 로고    scopus 로고
    • Continuous wavelet transform for analysis of speech prosody
    • M. Vainio, A. Suni, and D. Aalto, "Continuous wavelet transform for analysis of speech prosody," TRASP, pp, 78-81, 2013.
    • (2013) TRASP , pp. 78-81
    • Vainio, M.1    Suni, A.2    Aalto, D.3
  • 7
    • 84867198266 scopus 로고    scopus 로고
    • Incorporating durational modification in voice transformation
    • A. R. Toth and A. W. Black, "Incorporating durational modification in voice transformation", in Proc. Interspeech, pp. 1088-1091, 2008.
    • (2008) Proc. Interspeech , pp. 1088-1091
    • Toth, A.R.1    Black, A.W.2
  • 8
    • 85010815133 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," in Proc. ICASPP, pp. 145-148, 1992.
    • (1992) Proc. ICASPP , pp. 145-148
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 10
    • 84863934040 scopus 로고    scopus 로고
    • Duration modelling in voice conversion using artificial neural networks
    • R. Srikanth, B. Bajibabu, K. Prahallad, "Duration modelling in voice conversion using artificial neural networks," in Proc. IWSSIP, pp. 556-559, 2012.
    • (2012) Proc. IWSSIP , pp. 556-559
    • Srikanth, R.1    Bajibabu, B.2    Prahallad, K.3
  • 11
    • 56149097756 scopus 로고    scopus 로고
    • F0 transformation within the voice convesrion framework
    • Z. Hanzlicek, J. Matousek, "F0 transformation within the voice convesrion framework," in Proc. Interspeech, pp. 1961-1964, 2007.
    • (2007) Proc. Interspeech , pp. 1961-1964
    • Hanzlicek, Z.1    Matousek, J.2
  • 12
    • 60849084576 scopus 로고    scopus 로고
    • Multi-layer F0 modelling for HMM-based speech synthesis
    • C. Wang, Z. Ling, B. Zhang, and L. Dai, "Multi-layer F0 modelling for HMM-based speech synthesis." in Proc. ISCSLP, pp. 129-132, 2008.
    • (2008) Proc. ISCSLP , pp. 129-132
    • Wang, C.1    Ling, Z.2    Zhang, B.3    Dai, L.4
  • 13
  • 14
    • 44049085520 scopus 로고    scopus 로고
    • High Quality voice convesrion through phoneme based linear mapping functions with STRAIGHT for mandarin
    • K. Liu, J. Zhang, and Y. Yan. "High Quality voice convesrion through phoneme based linear mapping functions with STRAIGHT for mandarin", in Proc. 4th Int. Conf. Fuzzy Syst. Knowl. Discovery (FSKD 2007), 2007, vol.4, pp. 410-414.
    • (2007) Proc. 4th Int. Conf. Fuzzy Syst. Knowl. Discovery (FSKD 2007) , vol.4 , pp. 410-414
    • Liu, K.1    Zhang, J.2    Yan, Y.3
  • 15
    • 84906225084 scopus 로고    scopus 로고
    • Joint spectral distribution modeling using restricted boltzmann machines for voice conversion
    • L. Chen, Z. Ling, Y. Song, L. Dai, "Joint Spectral Distribution Modeling Using Restricted Boltzmann Machines for Voice Conversion," in Proc. Interspeech, pp. 3053-3056, 2013.
    • (2013) Proc. Interspeech , pp. 3053-3056
    • Chen, L.1    Ling, Z.2    Song, Y.3    Dai, L.4
  • 16
    • 0027887751 scopus 로고
    • Stochastic gradient techniques for the efficient simulation of high-speed networks using importance sampling
    • M. Devetsikiotis, W. A. Al-Qaq, J. A. Freebersyser, J. K. Townsend, "Stochastic gradient techniques for the efficient simulation of high-speed networks using importance sampling," in Proc. GLOBECOM, pp. 751-756, 1993.
    • (1993) Proc. GLOBECOM , pp. 751-756
    • Devetsikiotis, M.1    Al-Qaq, W.A.2    Freebersyser, J.A.3    Townsend, J.K.4
  • 17
    • 33646773080 scopus 로고    scopus 로고
    • The CMU ARCTIC databases for speech synthesis
    • Language Technologies Institute, Carnegie Mellon University
    • J. Kominek and A. Black, "The CMU ARCTIC databases for speech synthesis," Tech. Rep. CMU-LTI-03-177, Language Technologies Institute, Carnegie Mellon University, 2003.
    • (2003) Tech. Rep. CMU-LTI-03-177
    • Kominek, J.1    Black, A.2
  • 18
    • 84865725683 scopus 로고    scopus 로고
    • Growing a spoken language interface on amazon mechanical turk
    • I. McGraw, J. Glass and S. Seneff, "Growing a Spoken Language Interface on Amazon Mechanical Turk", in Proc. of Interspeech, 2011.
    • (2011) Proc. of Interspeech
    • McGraw, I.1    Glass, J.2    Seneff, S.3
  • 19
    • 84910090364 scopus 로고    scopus 로고
    • Line spectral pairs based voice conversion using radial basis function
    • J. H. Nirmal, S. Patnaik, Mukesh A. Zaveri, "Line Spectral Pairs Based Voice Conversion Using Radial Basis Function ", Int. J. on Signal & Image Processing, Vol. 4, No.2, pp. 26-33, 2013.
    • (2013) Int. J. on Signal & Image Processing , vol.4 , Issue.2 , pp. 26-33
    • Nirmal, J.H.1    Patnaik, S.2    Zaveri, M.A.3
  • 21
    • 84867199771 scopus 로고    scopus 로고
    • Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching
    • K. Yutani, Y. Utoi, Y. Nankaku, T. Toda, K. Tokuda, "Simultaneous Conversion of Duration and Spectrum Based on Statistical Models Including Time-Sequence Matching," in Proc. INTERSPEECH, pp. 1072-1075, 2008.
    • (2008) Proc. INTERSPEECH , pp. 1072-1075
    • Yutani, K.1    Utoi, Y.2    Nankaku, Y.3    Toda, T.4    Tokuda, K.5
  • 24
    • 84910087395 scopus 로고    scopus 로고
    • Sequence Error (SE) minimization training of neural network for voice conversion
    • Accepted
    • F-L. Xie, Y. Qian, F. K. Soong, H. Li, "Sequence Error (SE) Minimization Training of Neural Network for Voice Conversion," in Proc. Interspeech, 2014(Accepted).
    • (2014) Proc. Interspeech
    • Xie, F.-L.1    Qian, Y.2    Soong, F.K.3    Li, H.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.