메뉴 건너뛰기




Volumn 8, Issue 3, 2005, Pages 227-245

Parametric formant modelling and transformation in voice conversion

Author keywords

Formant; hmms; Morphing; Voice conversion

Indexed keywords

FORMANT; HMMS; MORPHING; VOICE CONVERSION;

EID: 33744930096     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-006-5692-y     Document Type: Article
Times cited : (6)

References (37)
  • 2
    • 85135264071 scopus 로고    scopus 로고
    • Formant analysis and synthesis using hidden markov models
    • Acero, A. (1999). Formant analysis and synthesis using hidden markov models, In Proc of the Eurospeech Conference, Volume 3, Page 1047-1050.
    • (1999) Proc of the Eurospeech Conference , vol.3 , pp. 1047-1050
    • Acero, A.1
  • 4
    • 84863268465 scopus 로고    scopus 로고
    • Voice Conversion by codebook mapping of line spectral frequencies and excitation spectrum
    • Arslan L.M. and Talkin, D. (1997). Voice Conversion by codebook mapping of line spectral frequencies and excitation spectrum, EUROSPEECH 1997 Proceedings.
    • (1997) EUROSPEECH 1997 Proceedings
    • Arslan, L.M.1    Talkin, D.2
  • 5
    • 0141814630 scopus 로고    scopus 로고
    • An expectation maximazation approach for formant tracking using a parameter-free non-linear predictor
    • Bazzi, I., Acero, A., and Deng, Li. (2003). An expectation maximazation approach for Formant Tracking Using a Parameter-free Non-Linear Predictor. In Proc. ICASSP 2003, pp. 464-467.
    • (2003) Proc. ICASSP 2003 , pp. 464-467
    • Bazzi, I.1    Acero, A.2    Deng, L.3
  • 6
    • 0002515370 scopus 로고
    • The generation of affect in synthesized speech
    • Cahn, J.E. (1990). The generation of affect in synthesized speech, Journal of the American Voice I/O Society, 8(July): 1-19.
    • (1990) Journal of the American Voice I/O Society , vol.8 , Issue.JULY , pp. 1-19
    • Cahn, J.E.1
  • 7
    • 0026372714 scopus 로고
    • Experiments with voice modelling in speech synthesis
    • Carlson, R., Granstrom, B., and Karlsson, I. (1991). Experiments with voice modelling in speech synthesis. Speech Communication, 10: 481-489.
    • (1991) Speech Communication , vol.10 , pp. 481-489
    • Carlson, R.1    Granstrom, B.2    Karlsson, I.3
  • 8
    • 33744926439 scopus 로고    scopus 로고
    • Data-driven formant synthesis
    • Fonetik 2002
    • Carlson, R., Sigvardson, T. and Arvid, Sjolander. (2002). Data-driven formant synthesis, TMH-QPSR Vol.44 - Fonetik 2002.
    • (2002) TMH-QPSR , vol.44
    • Carlson, R.1    Sigvardson, T.2    Arvid, S.3
  • 9
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothed gmm and map adaptation
    • Chen, Y., Chu, M., Chang, E., Liu, J., and Liu, R. (2003). Voice conversion with smoothed gmm and map adaptation, In Proc. Eurospeech 2003, pp. 2413-2416.
    • (2003) Proc. Eurospeech 2003 , pp. 2413-2416
    • Chen, Y.1    Chu, M.2    Chang, E.3    Liu, J.4    Liu, R.5
  • 12
    • 84928451959 scopus 로고
    • Glottal flow: Models and interaction
    • Fant G. (1986), Glottal flow: Models and interaction, Journal of Phonetics, 14: 393-399.
    • (1986) Journal of Phonetics , vol.14 , pp. 393-399
    • Fant, G.1
  • 14
    • 0141740106 scopus 로고    scopus 로고
    • Formant model estimation and transformation for voice morphing
    • Ho, C.H., Rentzos, D. Vaseghi, and S. (2002). Formant model estimation and transformation for voice morphing. In Proc. ICSLP, pp. 2149-2152.
    • (2002) Proc. ICSLP , pp. 2149-2152
    • Ho, C.H.1    Rentzos, D.2    Vaseghi, S.3
  • 15
    • 85032644657 scopus 로고    scopus 로고
    • Using formant frequencies in speech recognition
    • Holmes, J. Holmes, W. and Garner, P. (1997). Using formant frequencies in speech recognition. In Proc. Eurospeech-97, vol. 4, pp. 2083-2086.
    • (1997) Proc. Eurospeech-97 , vol.4 , pp. 2083-2086
    • Holmes, J.1    Holmes, W.2    Garner, P.3
  • 18
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • Kain, A and Macon, M.W. (1998). Spectral voice conversion for text-to-speech synthesis. Proceedings of ICASSP, vol. 1, pp. 285-288.
    • (1998) Proceedings of ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 19
    • 4544367684 scopus 로고
    • Formant tracking using hidden Markov models and vector quantisation
    • Kopec, D.H. (1986). Formant tracking using hidden Markov models and vector quantisation. IEEE Trans on Acoust., Speech, Signal Processing, Vol. ASSP-34, No 4, pp. 709-729.
    • (1986) IEEE Trans on Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.4 , pp. 709-729
    • Kopec, D.H.1
  • 20
    • 0029256373 scopus 로고
    • Acoustic characteristics of speaker individuality
    • Feb.
    • Kuwabara, H. and Sagisaka, Y. (1995). Acoustic characteristics of speaker individuality: Control and Conversion. 16:165-173, Feb.
    • (1995) Control and Conversion , vol.16 , pp. 165-173
    • Kuwabara, H.1    Sagisaka, Y.2
  • 22
    • 0001935942 scopus 로고
    • Sinusoidal coding, in speech coding and synthesis
    • W.B. Kleijn and K.K. Paliwal, (Eds.) Elsevier Science
    • McAulay, R.J. and Quatieri, T.F. (1995). Sinusoidal coding, in speech coding and synthesis. In W.B. Kleijn and K.K. Paliwal, (Eds.) Elsevier Science, Hoi, 4, pp. 121-173.
    • (1995) Hoi , vol.4 , pp. 121-173
    • McAulay, R.J.1    Quatieri, T.F.2
  • 23
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • Moulines, E. and Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Communication, 9: 453-467.
    • (1990) Speech Communication , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 24
    • 0000330384 scopus 로고    scopus 로고
    • On decomposing speech into modulated components
    • Rao, A. and Kumaresan, R. (2000), On decomposing speech into modulated components. IEEE Trans. Speech and Audio Proc. 8(3): 240-254.
    • (2000) IEEE Trans. Speech and Audio Proc. , vol.8 , Issue.3 , pp. 240-254
    • Rao, A.1    Kumaresan, R.2
  • 31
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA techniques
    • Valbret H., Moulines, E. and Tubach, J.P. (1992). Voice transformation using PSOLA techniques, Speech Communication, vol. 11, pp. 175-187.
    • (1992) Speech Communication , vol.11 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 34
    • 85009135405 scopus 로고    scopus 로고
    • A new strategy of formant tracking based on dynamic programming
    • Oct. 2000
    • Xia, K. and Espy-Wilson, C. (2000). A new strategy of formant tracking based on dynamic programming. Intern. Conf. on Spoken Language Processing, Oct. 2000, pp. III 55-58.
    • (2000) Intern. Conf. on Spoken Language Processing
    • Xia, K.1    Espy-Wilson, C.2
  • 35
    • 85009200814 scopus 로고    scopus 로고
    • Comparative analysis and synthesis of formant trajectories of british and broad australian accents
    • Yan, Q., Vaseghi, S., Ho, C.H., Rentzos, D., Turajlic, E. (2003). Comparative analysis and synthesis of formant trajectories of british and broad australian accents. Proceedings of Eurospeech 2003, pp. 2941-2944.
    • (2003) Proceedings of Eurospeech 2003 , pp. 2941-2944
    • Yan, Q.1    Vaseghi, S.2    Ho, C.H.3    Rentzos, D.4    Turajlic, E.5
  • 37
    • 0030705337 scopus 로고    scopus 로고
    • Speaker normalisation based on frequency warping
    • Zhan P. & Westphal, M. (1997). Speaker normalisation based on frequency warping in proceedings of ICASSP 1997, pp. 1039-1042.
    • (1997) Proceedings of ICASSP 1997 , pp. 1039-1042
    • Zhan, P.1    Westphal, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.