SCOPUS 정보 검색 플랫폼

International Journal of Speech Technology

Volumn 8, Issue 3, 2005, Pages 227-245

Parametric formant modelling and transformation in voice conversion

(4) Rentzos, Dimitrios a Vaseghi, Saeed a Yan, Qin a Ho, Ching Hsiang b

a BRUNEL UNIVERSITY (United Kingdom)

b FORTUNE INSTITUTE OF TECHNOLOGY (Taiwan)

Author keywords

Formant; hmms; Morphing; Voice conversion

Indexed keywords

FORMANT; HMMS; MORPHING; VOICE CONVERSION;

COMPUTATIONAL METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; STATISTICAL METHODS;

SPEECH PROCESSING;

EID: 33744930096 PISSN: 13812416 EISSN: 15728110 Source Type: Journal
DOI: 10.1007/s10772-006-5692-y Document Type: Article

Times cited : (6)

References (37)

1
- 0023739214
- Voice conversion through vector quantization
- Abe, M., Nakamura, S., Shikano, K., and Kuwabara, H. (1988). Voice conversion through vector quantization, In Proceedings of ICASSP 1998, pp. 565-568.
- (1988) Proceedings of ICASSP 1998 , pp. 565-568
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 85135264071
- Formant analysis and synthesis using hidden markov models
- Acero, A. (1999). Formant analysis and synthesis using hidden markov models, In Proc of the Eurospeech Conference, Volume 3, Page 1047-1050.
- (1999) Proc of the Eurospeech Conference , vol.3 , pp. 1047-1050
- Acero, A.¹

3
- 0003724033
- Cambridge, Cambridge University Press
- Allen, J. Hunnicutt, S. Klatt, D. (1987). From Text to Speech: The MITalk System. Cambridge, Cambridge University Press.
- (1987) From Text to Speech: The MITalk System
- Allen, J.¹ Hunnicutt, S.² Klatt, D.³

4
- 84863268465
- Voice Conversion by codebook mapping of line spectral frequencies and excitation spectrum
- Arslan L.M. and Talkin, D. (1997). Voice Conversion by codebook mapping of line spectral frequencies and excitation spectrum, EUROSPEECH 1997 Proceedings.
- (1997) EUROSPEECH 1997 Proceedings
- Arslan, L.M.¹ Talkin, D.²

5
- 0141814630
- An expectation maximazation approach for formant tracking using a parameter-free non-linear predictor
- Bazzi, I., Acero, A., and Deng, Li. (2003). An expectation maximazation approach for Formant Tracking Using a Parameter-free Non-Linear Predictor. In Proc. ICASSP 2003, pp. 464-467.
- (2003) Proc. ICASSP 2003 , pp. 464-467
- Bazzi, I.¹ Acero, A.² Deng, L.³

6
- 0002515370
- The generation of affect in synthesized speech
- Cahn, J.E. (1990). The generation of affect in synthesized speech, Journal of the American Voice I/O Society, 8(July): 1-19.
- (1990) Journal of the American Voice I/O Society , vol.8 , Issue.JULY , pp. 1-19
- Cahn, J.E.¹

7
- 0026372714
- Experiments with voice modelling in speech synthesis
- Carlson, R., Granstrom, B., and Karlsson, I. (1991). Experiments with voice modelling in speech synthesis. Speech Communication, 10: 481-489.
- (1991) Speech Communication , vol.10 , pp. 481-489
- Carlson, R.¹ Granstrom, B.² Karlsson, I.³

8
- 33744926439
- Data-driven formant synthesis
- Fonetik 2002
- Carlson, R., Sigvardson, T. and Arvid, Sjolander. (2002). Data-driven formant synthesis, TMH-QPSR Vol.44 - Fonetik 2002.
- (2002) TMH-QPSR , vol.44
- Carlson, R.¹ Sigvardson, T.² Arvid, S.³

9
- 84905560807
- Voice conversion with smoothed gmm and map adaptation
- Chen, Y., Chu, M., Chang, E., Liu, J., and Liu, R. (2003). Voice conversion with smoothed gmm and map adaptation, In Proc. Eurospeech 2003, pp. 2413-2416.
- (2003) Proc. Eurospeech 2003 , pp. 2413-2416
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

10
- 0003641574
- Springer-Verlag
- De Boor, C. (1978). A Practical Guide to Splines, Springer-Verlag.
- (1978) A Practical Guide to Splines
- De Boor, C.¹

11
- 33744928386
- Chapman & Hall, London, UK
- Edrington, M. Lowry, A. Jackson, P. Breen, A. Minnis, S. (1998), Overview of Current Text-to-Speech Techniques: Part II - Prosody and Speech Generation, in Speech Technology for Telecommunications, Chapman & Hall, London, UK.
- (1998) Overview of Current Text-to-Speech Techniques: Part II - Prosody and Speech Generation, in Speech Technology for Telecommunications
- Edrington, M.L.¹ Jackson, A.² Breen, P.³ Minnis, A.S.⁴

12
- 84928451959
- Glottal flow: Models and interaction
- Fant G. (1986), Glottal flow: Models and interaction, Journal of Phonetics, 14: 393-399.
- (1986) Journal of Phonetics , vol.14 , pp. 393-399
- Fant, G.¹

13
- 0004072715
- Marcel Dekker, New York
- Furui, S. (1989). Digital Speech Processing, Synthesis, and Recognition, Marcel Dekker, New York.
- (1989) Digital Speech Processing, Synthesis, and Recognition
- Furui, S.¹

14
- 0141740106
- Formant model estimation and transformation for voice morphing
- Ho, C.H., Rentzos, D. Vaseghi, and S. (2002). Formant model estimation and transformation for voice morphing. In Proc. ICSLP, pp. 2149-2152.
- (2002) Proc. ICSLP , pp. 2149-2152
- Ho, C.H.¹ Rentzos, D.² Vaseghi, S.³

15
- 85032644657
- Using formant frequencies in speech recognition
- Holmes, J. Holmes, W. and Garner, P. (1997). Using formant frequencies in speech recognition. In Proc. Eurospeech-97, vol. 4, pp. 2083-2086.
- (1997) Proc. Eurospeech-97 , vol.4 , pp. 2083-2086
- Holmes, J.¹ Holmes, W.² Garner, P.³

16
- 84858905516
- Horne, M. (ed). Kluwer Academic Publishers, Dordrecht
- Horne, M. (ed). (2000), Prosody: Theory and Experiment. Studies Presented to Gösta Bruce. Kluwer Academic Publishers, Dordrecht.
- (2000) Prosody: Theory and Experiment. Studies Presented to Gösta Bruce

17
- 85064715894
- Speech Spectrum transformation by speaker interpolation
- Iwahashi N. and Sagisaka, Y. (1994). Speech Spectrum transformation by speaker interpolation, In Proceedings IEEE Int. Conference Acoustics, Speech Signal Processing.
- (1994) Proceedings IEEE Int. Conference Acoustics, Speech Signal Processing
- Iwahashi, N.¹ Sagisaka, Y.²

18
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Kain, A and Macon, M.W. (1998). Spectral voice conversion for text-to-speech synthesis. Proceedings of ICASSP, vol. 1, pp. 285-288.
- (1998) Proceedings of ICASSP , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

19
- 4544367684
- Formant tracking using hidden Markov models and vector quantisation
- Kopec, D.H. (1986). Formant tracking using hidden Markov models and vector quantisation. IEEE Trans on Acoust., Speech, Signal Processing, Vol. ASSP-34, No 4, pp. 709-729.
- (1986) IEEE Trans on Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.4 , pp. 709-729
- Kopec, D.H.¹

20
- 0029256373
- Acoustic characteristics of speaker individuality
- Feb.
- Kuwabara, H. and Sagisaka, Y. (1995). Acoustic characteristics of speaker individuality: Control and Conversion. 16:165-173, Feb.
- (1995) Control and Conversion , vol.16 , pp. 165-173
- Kuwabara, H.¹ Sagisaka, Y.²

21
- 33645604824
- Formant tracking using segmental phonemic information
- Lee, M. van Santen, J. Mobius, B. Olive, J. (1999). Formant tracking using segmental phonemic information" In Proceedings of the Eurospeech 1999, vol. 6, 2789-2792.
- (1999) Proceedings of the Eurospeech 1999 , vol.6 , pp. 2789-2792
- Lee, M.¹ Van Santen, J.² Mobius, B.³ Olive, J.⁴

22
- 0001935942
- Sinusoidal coding, in speech coding and synthesis
- W.B. Kleijn and K.K. Paliwal, (Eds.) Elsevier Science
- McAulay, R.J. and Quatieri, T.F. (1995). Sinusoidal coding, in speech coding and synthesis. In W.B. Kleijn and K.K. Paliwal, (Eds.) Elsevier Science, Hoi, 4, pp. 121-173.
- (1995) Hoi , vol.4 , pp. 121-173
- McAulay, R.J.¹ Quatieri, T.F.²

23
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Moulines, E. and Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Communication, 9: 453-467.
- (1990) Speech Communication , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

24
- 0000330384
- On decomposing speech into modulated components
- Rao, A. and Kumaresan, R. (2000), On decomposing speech into modulated components. IEEE Trans. Speech and Audio Proc. 8(3): 240-254.
- (2000) IEEE Trans. Speech and Audio Proc. , vol.8 , Issue.3 , pp. 240-254
- Rao, A.¹ Kumaresan, R.²

25
- 0004244302
- Prentice Hall, Englewood Cliffs
- Rabiner L, Juang BH. (1993). Fundamentals of speech recognition, Prentice Hall, Englewood Cliffs.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.H.²

26
- 0029764982
- Automatic audio morphing
- Slaney, M., Covell, M., and Lassiter, B. (1996). Automatic audio morphing, In Proceedings of the 1996 ICASSP, Vol. 2 pp. 1001-1004.
- (1996) Proceedings of the 1996 ICASSP , vol.2 , pp. 1001-1004
- Slaney, M.¹ Covell, M.² Lassiter, B.³

27
- 0141700295
- Formant synthesis
- E. Keller (Ed.). Wiley
- Styger, T and Keller E. (1994). Formant synthesis. In E. Keller (Ed.), Fundamentals in Speech Synthesis and Speech Recognition, pp. 109-128. Wiley.
- (1994) Fundamentals in Speech Synthesis and Speech Recognition , pp. 109-128
- Styger, T.¹ Keller, E.²

28
- 0032026483
- Continuous probabilistic transform for voice conversion
- Stylianou, Y., Cappe, O., and Moulines, E. (1998). Continuous Probabilistic Transform for Voice Conversion, IEEE transactions on speech & audio processing, Vol.6, No.2, pp. 131-142.
- (1998) IEEE Transactions on Speech & Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

29
- 85009136689
- Voice transformations: From speech synthesis to mammalian vocalizations
- Denmark 2001
- Tang, M., C. Wang, and S. Seneff, (2001). Voice transformations: from speech synthesis to mammalian vocalizations. In Proceedings of the 7th European Conference on Speech Communication and Technology, Denmark 2001.
- (2001) Proceedings of the 7th European Conference on Speech Communication and Technology
- Tang, M.¹ Wang, C.² Seneff, S.³

30
- 85009250849
- Subband based voice conversion
- Turk, O. and Arslan, L.M. (2002). Subband based voice conversion, In Proceedings of the 2002 International Conference on Spoken Language Processing, pp. 289-292.
- (2002) Proceedings of the 2002 International Conference on Spoken Language Processing , pp. 289-292
- Turk, O.¹ Arslan, L.M.²

31
- 0026880275
- Voice transformation using PSOLA techniques
- Valbret H., Moulines, E. and Tubach, J.P. (1992). Voice transformation using PSOLA techniques, Speech Communication, vol. 11, pp. 175-187.
- (1992) Speech Communication , vol.11 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

32
- 0038697764
- Robust speech recognition and feature extraction using HMM2
- Weber K., Ikbal S., Bengio S., and Bourlard H., (2003). Robust speech recognition and feature extraction using HMM2, Computer Speech and Language 17, pp. 195-211.
- (2003) Computer Speech and Language , vol.17 , pp. 195-211
- Weber, K.¹ Ikbal, S.² Bengio, S.³ Bourlard, H.⁴

33
- 0003078259
- The HTK continuous speech recogniser
- Woodland, P.C. and Young, S.J. (1993). The HTK Continuous Speech Recogniser. Proceedings Eurospeech 1993, pp. 2207-2219.
- (1993) Proceedings Eurospeech 1993 , pp. 2207-2219
- Woodland, P.C.¹ Young, S.J.²

34
- 85009135405
- A new strategy of formant tracking based on dynamic programming
- Oct. 2000
- Xia, K. and Espy-Wilson, C. (2000). A new strategy of formant tracking based on dynamic programming. Intern. Conf. on Spoken Language Processing, Oct. 2000, pp. III 55-58.
- (2000) Intern. Conf. on Spoken Language Processing
- Xia, K.¹ Espy-Wilson, C.²

35
- 85009200814
- Comparative analysis and synthesis of formant trajectories of british and broad australian accents
- Yan, Q., Vaseghi, S., Ho, C.H., Rentzos, D., Turajlic, E. (2003). Comparative analysis and synthesis of formant trajectories of british and broad australian accents. Proceedings of Eurospeech 2003, pp. 2941-2944.
- (2003) Proceedings of Eurospeech 2003 , pp. 2941-2944
- Yan, Q.¹ Vaseghi, S.² Ho, C.H.³ Rentzos, D.⁴ Turajlic, E.⁵

36
- 0032121729
- Extraction of vocal-tract system characteristics from speech signal
- Yegnanarayana, B. and Veldhuis R.N.J.(1998). Extraction of vocal-tract system characteristics from speech signal. IEEE Trans. On Speech and Audio Processing, vol. 6, pp. 313-327.
- (1998) IEEE Trans. on Speech and Audio Processing , vol.6 , pp. 313-327
- Yegnanarayana, B.¹ Veldhuis, R.N.J.²

37
- 0030705337
- Speaker normalisation based on frequency warping
- Zhan P. & Westphal, M. (1997). Speaker normalisation based on frequency warping in proceedings of ICASSP 1997, pp. 1039-1042.
- (1997) Proceedings of ICASSP 1997 , pp. 1039-1042
- Zhan, P.¹ Westphal, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.