메뉴 건너뛰기




Volumn 15, Issue 2, 2007, Pages 641-651

Statistical approach for voice personality transformation

Author keywords

Maximum likelihood (ML) estimation; Prosody modification; Voice conversion

Indexed keywords

ACOUSTIC FEATURES; CEPSTRUM; CROSS CORRELATIONS; EXCITATION SPECTRUM; INFORMAL LISTENING; MAXIMUM LIKELIHOOD (ML) ESTIMATION; MODIFICATION FACTORS; NON-LINEAR RELATIONSHIPS; PITCH PERIODS; PROBABILISTIC CLASSIFICATIONS; PROBABILISTIC MODELS; PROSODY MODIFICATION; SPEAKING RATES; SPECTRAL CONTOURS; SPEECH SIGNALS; STATISTICAL APPROACHES; TARGET SPEAKERS; TIME-SCALE MODIFICATIONS; TRAINING DATUM; TRANSFORMATION METHODS; VOICE CONVERSION;

EID: 38149065136     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876760     Document Type: Article
Times cited : (38)

References (26)
  • 2
    • 0001503040 scopus 로고
    • Voice personality transformation
    • M. Savic and I. H. Nam, "Voice personality transformation," Digital Signal Process., vol. 4, pp. 107-110, 1991.
    • (1991) Digital Signal Process , vol.4 , pp. 107-110
    • Savic, M.1    Nam, I.H.2
  • 3
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Commun., vol. 11, pp. 175-187, 1992.
    • (1992) Speech Commun , vol.11 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 4
    • 0029256372 scopus 로고
    • Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectral tilt
    • H. Mizuno and M. Abe, "Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectral tilt," Speech Commun., vol. 16, no. 2, pp. 153-164, 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 153-164
    • Mizuno, H.1    Abe, M.2
  • 5
    • 0029254176 scopus 로고
    • Transformation of formants of voice conversion using artificial neural networks
    • M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants of voice conversion using artificial neural networks," Speech Commun., vol. 16, no. 2, pp. 207-216, 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 6
    • 0029251946 scopus 로고
    • Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks
    • N. Iwahashi and Y. Sagisaka, "Speech spectrum conversion based on speaker interpolation and multi-functional representation with weighting by radial basis function networks," Speech Commun., vol. 16, no. 2, pp. 139-152, 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 139-152
    • Iwahashi, N.1    Sagisaka, Y.2
  • 8
    • 0031104132 scopus 로고    scopus 로고
    • Application of speech conversion to alaryngeal speech enhancement
    • Mar
    • N. Bi and Y. Qi, "Application of speech conversion to alaryngeal speech enhancement," IEEE Trans. Acoust., Speech, Signal Process., vol. 5, no. 2, pp. 97-105, Mar. 1997.
    • (1997) IEEE Trans. Acoust., Speech, Signal Process , vol.5 , Issue.2 , pp. 97-105
    • Bi, N.1    Qi, Y.2
  • 9
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • L. M. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., vol. 28, pp. 211-226, 1999.
    • (1999) Speech Commun , vol.28 , pp. 211-226
    • Arslan, L.M.1
  • 10
    • 0030365550 scopus 로고    scopus 로고
    • A new voice personality transformation based on both linear and nonlinear prediction analysis
    • K. S. Lee, D. H. Youn, and I. W. Cha, "A new voice personality transformation based on both linear and nonlinear prediction analysis," in Proc. Int. Conf. Spoken Language Process., 1996, pp. 1401-1404.
    • (1996) Proc. Int. Conf. Spoken Language Process , pp. 1401-1404
    • Lee, K.S.1    Youn, D.H.2    Cha, I.W.3
  • 11
    • 0036670960 scopus 로고    scopus 로고
    • Voice conversion using a low dimensional vector mapping
    • Aug
    • --, "Voice conversion using a low dimensional vector mapping," IEICE Trans. Inform. Syst., vol. E85D, no. 8, pp. 1297-1305, Aug. 2002.
    • (2002) IEICE Trans. Inform. Syst , vol.E85D , Issue.8 , pp. 1297-1305
    • Lee, K.S.1    Youn, D.H.2    Cha, I.W.3
  • 13
    • 0020272849 scopus 로고
    • Helium speech enhancement using the short-time fourier transform
    • Dec
    • M. A. Richards, "Helium speech enhancement using the short-time fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol. 30, no. 6, pp. 841-853, Dec. 1982.
    • (1982) IEEE Trans. Acoust., Speech, Signal Process , vol.30 , Issue.6 , pp. 841-853
    • Richards, M.A.1
  • 14
    • 0025543906 scopus 로고
    • Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier, "Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol. 9, no. 5/6, pp. 453-467, 1990.
    • (1990) Speech Commun , vol.9 , Issue.5-6 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 17
    • 4143120860 scopus 로고
    • Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming
    • Apr
    • G. M. White and R. B. Neely, "Speech recognition experiments with linear prediction, bandpass filtering, and dynamic programming," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 2, pp. 183-188, Apr. 1976.
    • (1976) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-24 , Issue.2 , pp. 183-188
    • White, G.M.1    Neely, R.B.2
  • 18
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Acoust., Speech, Signal Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Acoust., Speech, Signal Process , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 19
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 21
    • 0018918171 scopus 로고
    • An algorithm for vector quantizer design
    • Jan
    • Y. Linde, A. Buzo, and R. M. Gray, "An algorithm for vector quantizer design," IEEE Trans. Commun., vol. 28, no. 1, pp. 84-95, Jan. 1980.
    • (1980) IEEE Trans. Commun , vol.28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3
  • 23
    • 84892173311 scopus 로고    scopus 로고
    • Estimating the speaking rate by vowel detection
    • T. Pfau and G. Ruske, "Estimating the speaking rate by vowel detection," in Proc. ICASSP, 1998, pp. 945-948.
    • (1998) Proc. ICASSP , pp. 945-948
    • Pfau, T.1    Ruske, G.2
  • 24
    • 0029254163 scopus 로고
    • Non-parametric techniques for pitchscale and time-scale modification of speech
    • E. Moulines and J. Laroche, "Non-parametric techniques for pitchscale and time-scale modification of speech," Speech Commun., vol. 16, no. 2, pp. 175-206, 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 175-206
    • Moulines, E.1    Laroche, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.