메뉴 건너뛰기




Volumn , Issue , 2014, Pages 19-23

Voice conversion using deep neural networks with speaker-independent pre-training

Author keywords

Autoencoder; Deep neural network; Pre training; Voice conversion

Indexed keywords

BACKPROPAGATION; LEARNING SYSTEMS; NEURAL NETWORKS;

EID: 84946685887     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SLT.2014.7078543     Document Type: Conference Paper
Times cited : (113)

References (30)
  • 3
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-tospeech synthesis
    • May
    • A. Kain and M. Macon. Spectral voice conversion for text-tospeech synthesis. In Proceedings of ICASSP, volume 1, pages 285-299, May 1998.
    • (1998) Proceedings of ICASSP , vol.1 , pp. 285-299
    • Kain, A.1    Macon, M.2
  • 5
    • 0029254176 scopus 로고
    • Transformation of formants for voice conversion using artificial neural networks
    • M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana. Transformation of formants for voice conversion using artificial neural networks. Speech communication, 16(2): 207-216, 1995.
    • (1995) Speech Communication , vol.16 , Issue.2 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 8
    • 84906281619 scopus 로고    scopus 로고
    • Real-time voice conversion using artificial neural networks with rectified linear units
    • E. Azarov, M. Vashkevich, D. Likhachov, and A. Petrovsky. Real-time voice conversion using artificial neural networks with rectified linear units. In INTERSPEECH, pages 1032-1036, 2013.
    • (2013) INTERSPEECH , pp. 1032-1036
    • Azarov, E.1    Vashkevich, M.2    Likhachov, D.3    Petrovsky, A.4
  • 11
    • 84906225084 scopus 로고    scopus 로고
    • Joint spectral distribution modeling using restricted boltzmann machines for voice conversion
    • L. H. Chen, Z. H. Ling, Y. Song, and L. R. Dai. Joint spectral distribution modeling using restricted boltzmann machines for voice conversion. In INTERSPEECH, 2013.
    • (2013) INTERSPEECH
    • Chen, L.H.1    Ling, Z.H.2    Song, Y.3    Dai, L.R.4
  • 13
    • 84906280857 scopus 로고    scopus 로고
    • Voice conversion in high-order eigen space using deep belief nets
    • T. Nakashika, R. Takashima, T. Takiguchi, and Y. Ariki. Voice conversion in high-order eigen space using deep belief nets. In INTERSPEECH, pages 369-372, 2013.
    • (2013) INTERSPEECH , pp. 369-372
    • Nakashika, T.1    Takashima, R.2    Takiguchi, T.3    Ariki, Y.4
  • 15
    • 0024880831 scopus 로고
    • Multilayer feedforward networks are universal approximators
    • K. Hornik, M. Stinchcombe, and H. White. Multilayer feedforward networks are universal approximators. Neural networks, 2(5):359-366, 1989.
    • (1989) Neural Networks , vol.2 , Issue.5 , pp. 359-366
    • Hornik, K.1    Stinchcombe, M.2    White, H.3
  • 19
    • 84929157442 scopus 로고    scopus 로고
    • Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
    • Barcelona, Spain, August
    • H. Lu, S. King, and O. Watts. Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis. In 8th ISCAWorkshop on Speech Synthesis, pages 281-285, Barcelona, Spain, August 2013.
    • (2013) 8th ISCAWorkshop on Speech Synthesis , pp. 281-285
    • Lu, H.1    King, S.2    Watts, O.3
  • 20
    • 84901237776 scopus 로고    scopus 로고
    • Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z. H. Ling, L. Deng, and D. Yu. Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis. Audio, Speech, and Language Processing, IEEE Transactions on, 21 (10):2129-2139, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
    • Ling, Z.H.1    Deng, L.2    Yu., D.3
  • 22
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786): 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 23
    • 79551480483 scopus 로고    scopus 로고
    • Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion
    • P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. The Journal of Machine Learning Research, 11:3371-3408, 2010.
    • (2010) The Journal of Machine Learning Research , vol.11 , pp. 3371-3408
    • Vincent, P.1    Larochelle, H.2    Lajoie, I.3    Bengio, Y.4    Manzagol, P.-A.5
  • 24
    • 84928144072 scopus 로고    scopus 로고
    • Speech signal processing toolkit (sptk)
    • Speech signal processing toolkit (sptk). URL http://sp-tk. sourceforge. net/.
  • 28
    • 79960392344 scopus 로고    scopus 로고
    • Amazon's mechanical turk-A new source of inexpensive, yet high-quality, data?
    • January
    • M. Buhrmester, T. Kwang, and S. D. Gosling. Amazon's mechanical turk-a new source of inexpensive, yet high-quality, data? Perspectives on Psychological Science, 6(1):3-5, January 2011.
    • (2011) Perspectives on Psychological Science , vol.6 , Issue.1 , pp. 3-5
    • Buhrmester, M.1    Kwang, T.2    Gosling, S.D.3
  • 29
    • 4444285698 scopus 로고    scopus 로고
    • PhD thesis, OGI School of Science & Engineering at Oregon Health & Science University
    • A. Kain. High Resolution Voice Transformation. PhD thesis, OGI School of Science & Engineering at Oregon Health & Science University, 2001.
    • (2001) High Resolution Voice Transformation
    • Kain, A.1
  • 30
    • 0002322469 scopus 로고
    • On a test of whether one of two random variables is stochastically larger than the other
    • H. B. Mann and D. R. Whitney. On a test of whether one of two random variables is stochastically larger than the other. The annals of mathematical statistics, pages 50-60, 1947.
    • (1947) The Annals of Mathematical Statistics , pp. 50-60
    • Mann, H.B.1    Whitney, D.R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.