SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 2, 2007, Pages 641-651

Statistical approach for voice personality transformation

(1) Lee, Ki Seung a

a Konkuk University (South Korea)

Author keywords

Maximum likelihood (ML) estimation; Prosody modification; Voice conversion

Indexed keywords

ACOUSTIC FEATURES; CEPSTRUM; CROSS CORRELATIONS; EXCITATION SPECTRUM; INFORMAL LISTENING; MAXIMUM LIKELIHOOD (ML) ESTIMATION; MODIFICATION FACTORS; NON-LINEAR RELATIONSHIPS; PITCH PERIODS; PROBABILISTIC CLASSIFICATIONS; PROBABILISTIC MODELS; PROSODY MODIFICATION; SPEAKING RATES; SPECTRAL CONTOURS; SPEECH SIGNALS; STATISTICAL APPROACHES; TARGET SPEAKERS; TIME-SCALE MODIFICATIONS; TRAINING DATUM; TRANSFORMATION METHODS; VOICE CONVERSION;

BLOCK CODES; PROBABILITY DENSITY FUNCTION; SPEECH PROCESSING; VECTOR QUANTIZATION;

MAXIMUM LIKELIHOOD ESTIMATION;

EID: 38149065136 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.876760 Document Type: Article

Times cited : (38)

References (26)

1
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1988, vol. 1, pp. 565-568.
- (1988) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 565-568
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 0001503040
- Voice personality transformation
- M. Savic and I. H. Nam, "Voice personality transformation," Digital Signal Process., vol. 4, pp. 107-110, 1991.
- (1991) Digital Signal Process , vol.4 , pp. 107-110
- Savic, M.¹ Nam, I.H.²

3
- 0026880275
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Commun., vol. 11, pp. 175-187, 1992.
- (1992) Speech Commun , vol.11 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

4
- 0029256372
- Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectral tilt
- H. Mizuno and M. Abe, "Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectral tilt," Speech Commun., vol. 16, no. 2, pp. 153-164, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 153-164
- Mizuno, H.¹ Abe, M.²

5
- 0029254176
- Transformation of formants of voice conversion using artificial neural networks
- M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants of voice conversion using artificial neural networks," Speech Commun., vol. 16, no. 2, pp. 207-216, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

6
- 0029251946
- Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks
- N. Iwahashi and Y. Sagisaka, "Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks," Speech Commun., vol. 16, no. 2, pp. 139-152, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 139-152
- Iwahashi, N.¹ Sagisaka, Y.²

7
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Acoust., Speech, Signal Process., vol. 6, no. 2, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. Acoust., Speech, Signal Process , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

8
- 0031104132
- Application of speech conversion to alaryngeal speech enhancement
- Mar
- N. Bi and Y. Qi, "Application of speech conversion to alaryngeal speech enhancement," IEEE Trans. Acoust., Speech, Signal Process., vol. 5, no. 2, pp. 97-105, Mar. 1997.
- (1997) IEEE Trans. Acoust., Speech, Signal Process , vol.5 , Issue.2 , pp. 97-105
- Bi, N.¹ Qi, Y.²

9
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- L. M. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., vol. 28, pp. 211-226, 1999.
- (1999) Speech Commun , vol.28 , pp. 211-226
- Arslan, L.M.¹

10
- 0030365550
- A new voice personality transformation based on both linear and nonlinear prediction analysis
- K. S. Lee, D. H. Youn, and I. W. Cha, "A new voice personality transformation based on both linear and nonlinear prediction analysis," in Proc. Int. Conf. Spoken Language Process., 1996, pp. 1401-1404.
- (1996) Proc. Int. Conf. Spoken Language Process , pp. 1401-1404
- Lee, K.S.¹ Youn, D.H.² Cha, I.W.³

11
- 0036670960
- Voice conversion using a low dimensional vector mapping
- Aug
- --, "Voice conversion using a low dimensional vector mapping," IEICE Trans. Inform. Syst., vol. E85D, no. 8, pp. 1297-1305, Aug. 2002.
- (2002) IEICE Trans. Inform. Syst , vol.E85D , Issue.8 , pp. 1297-1305
- Lee, K.S.¹ Youn, D.H.² Cha, I.W.³

12
- 0024940640
- Unsupervised speaker adaptation by probabilistic spectrum fitting
- S. J. Cox and J. S. Bridle, "Unsupervised speaker adaptation by probabilistic spectrum fitting," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1989, vol. 1, pp. 294-297.
- (1989) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 294-297
- Cox, S.J.¹ Bridle, J.S.²

13
- 0020272849
- Helium speech enhancement using the short-time fourier transform
- Dec
- M. A. Richards, "Helium speech enhancement using the short-time fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol. 30, no. 6, pp. 841-853, Dec. 1982.
- (1982) IEEE Trans. Acoust., Speech, Signal Process , vol.30 , Issue.6 , pp. 841-853
- Richards, M.A.¹

14
- 0025543906
- Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines and F. Charpentier, "Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol. 9, no. 5/6, pp. 453-467, 1990.
- (1990) Speech Commun , vol.9 , Issue.5-6 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

15
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1987.
- (1987) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

16
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

17
- 4143120860
- Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming
- Apr
- G. M. White and R. B. Neely, "Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 2, pp. 183-188, Apr. 1976.
- (1976) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-24 , Issue.2 , pp. 183-188
- White, G.M.¹ Neely, R.B.²

18
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Acoust., Speech, Signal Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Acoust., Speech, Signal Process , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

19
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

20
- 0003462953
- New York: Wiley
- H. L. Van Trees, Detection, Estimation and Modulation Theory, (Part I). New York: Wiley, 1968.
- (1968) Detection, Estimation and Modulation Theory, (Part I)
- Van Trees, H.L.¹

21
- 0018918171
- An algorithm for vector quantizer design
- Jan
- Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. 28, no. 1, pp. 84-95, Jan. 1980.
- (1980) IEEE Trans. Commun , vol.28 , Issue.1 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

22
- 0022203520
- Voice conversion: Factors responsible for quality
- D. G. Childers, B. Yegnanarayana, and K.Wu, "Voice conversion: Factors responsible for quality," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1985, vol. 1, pp. 748-751.
- (1985) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 748-751
- Childers, D.G.¹ Yegnanarayana, B.² Wu, K.³

23
- 84892173311
- Estimating the speaking rate by vowel detection
- T. Pfau and G. Ruske, "Estimating the speaking rate by vowel detection," in Proc. ICASSP, 1998, pp. 945-948.
- (1998) Proc. ICASSP , pp. 945-948
- Pfau, T.¹ Ruske, G.²

24
- 0029254163
- Non-parametric techniques for pitchscale and time-scale modification of speech
- E. Moulines and J. Laroche, "Non-parametric techniques for pitchscale and time-scale modification of speech," Speech Commun., vol. 16, no. 2, pp. 175-206, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 175-206
- Moulines, E.¹ Laroche, J.²

25
- 0022249911
- High quality time-scale modification for speech
- S. Roucos and A. M. Wilgus, "High quality time-scale modification for speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1985, pp. 493-469.
- (1985) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 493-469
- Roucos, S.¹ Wilgus, A.M.²

26
- 0028997012
- Spectral dynamics is more important than spectral distortion
- H. P. Knagenhjelm and W. B. Kleijn, "Spectral dynamics is more important than spectral distortion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1995, pp. 732-735.
- (1995) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 732-735
- Knagenhjelm, H.P.¹ Kleijn, W.B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.