메뉴 건너뛰기




Volumn , Issue , 2008, Pages 489-504

Voice Transformation

Author keywords

Dynamic Time Warping; Gaussian Mixture Model; Spectral Envelope; Speech Signal; Vocal Tract

Indexed keywords


EID: 85075914413     PISSN: 25228692     EISSN: 25228706     Source Type: Book Series    
DOI: 10.1007/978-3-540-49127-9_24     Document Type: Chapter
Times cited : (8)

References (42)
  • 3
    • 84989426403 scopus 로고
    • A new model of LPC excitation for producing natural-sounding speech at low bit rates
    • Vol.,) pp
    • B. Atal, J. Remde: A new model of LPC excitation for producing natural-sounding speech at low bit rates, Proc. IEEE ICASSP, Vol. 7 (1982) pp. 614–617
    • (1982) Proc. IEEE ICASSP , vol.7 , pp. 614-617
    • Atal, B.1    Remde, J.2
  • 4
    • 0022219187 scopus 로고
    • Code-excited linear prediction (CELP): High-quality speech at very low bit rates
    • Vol.,) pp
    • M.R. Schroeder, B.S. Atal: Code-excited linear prediction (CELP): High-quality speech at very low bit rates, Proc. IEEE ICASSP, Vol. 10 (1985) pp. 937–940
    • (1985) Proc. IEEE ICASSP , vol.10 , pp. 937-940
    • Schroeder, M.R.1    Atal, B.S.2
  • 5
    • 84863772450 scopus 로고
    • Speech analysis/synthesis based on a sinusoidal representation
    • R.J. McAulay, T.F. Quatieri: Speech analysis/synthesis based on a sinusoidal representation, IEEE ICASSP 34, 744–754 (1986)
    • (1986) IEEE ICASSP , vol.34 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 6
    • 50349095287 scopus 로고    scopus 로고
    • Modeling speech based on harmonic plus noise models
    • ed. by G. Chellot, A. Espos-ito, M. Faundez (Springer, Berlin, Heidelberg,) pp
    • Y. Stylianou: Modeling speech based on harmonic plus noise models. In: Nonlinear Speech Modeling and Applications, ed. by G. Chellot, A. Espos-ito, M. Faundez (Springer, Berlin, Heidelberg 2005) pp. 375–383
    • (2005) Nonlinear Speech Modeling and Applications , pp. 375-383
    • Stylianou, Y.1
  • 8
    • 0019606564 scopus 로고
    • The spectral envelope estimation vocoder
    • D.B. Paul: The spectral envelope estimation vocoder, IEEE ICASSP 29, 786–794 (1981)
    • (1981) IEEE ICASSP , vol.29 , pp. 786-794
    • Paul, D.B.1
  • 9
    • 0030127119 scopus 로고    scopus 로고
    • Regularization techniques for discrete cepstrum estimation
    • O. Cappé, E. Moulines: Regularization techniques for discrete cepstrum estimation, IEEE Signal Process. Lett. 3(4), 100–102 (1996)
    • (1996) IEEE Signal Process. Lett. , vol.3 , Issue.4 , pp. 100-102
    • Cappé, O.1    Moulines, E.2
  • 11
    • 85135177301 scopus 로고
    • High-quality speech modification based on a harmonic + noise model
    • Vol.,) pp
    • Y. Stylianou, J. Laroche, E. Moulines: High-quality speech modification based on a harmonic + noise model, Proc. Eurospeech, Vol. 95 (1995) pp. 451– 454
    • (1995) Proc. Eurospeech , vol.95 , pp. 451-454
    • Stylianou, Y.1    Laroche, J.2    Moulines, E.3
  • 12
    • 0026830163 scopus 로고
    • Shape invariant time-scale and pitch modification of speech
    • T.F. Quatieri, R.J. McAulay: Shape invariant time-scale and pitch modification of speech, IEEE ICASSP 40, 497–510 (1992)
    • (1992) IEEE ICASSP , vol.40 , pp. 497-510
    • Quatieri, T.F.1    McAulay, R.J.2
  • 13
    • 0003515694 scopus 로고
    • Low-rate speech coding based on the sinusoidal model
    • ed. by S. Furui, M. Sondhi (Marcel Dekker, New York, Chap. 6
    • R.J. McAulay, T.F. Quatieri: Low-rate speech coding based on the sinusoidal model. In: Advances in Speech Signal Processing, ed. by S. Furui, M. Sondhi (Marcel Dekker, New York 1991) pp. 165–208, Chap. 6
    • (1991) Advances in Speech Signal Processing , pp. 165-208
    • McAulay, R.J.1    Quatieri, T.F.2
  • 14
    • 0008598948 scopus 로고
    • Variable-frequency synthesis: An improved harmonic coding scheme
    • Vol.,) pp
    • L. Almeida, F. Silva: Variable-frequency synthesis: An improved harmonic coding scheme., Proc. IEEE ICASSP, Vol. 9 (1984) pp. 437–440
    • (1984) Proc. IEEE ICASSP , vol.9 , pp. 437-440
    • Almeida, L.1    Silva, F.2
  • 15
    • 0029254163 scopus 로고
    • Techniques for pitch-scale and time-scale transformation of speech. Part I. Non parametric methods
    • E. Moulines, J. Laroche: Techniques for pitch-scale and time-scale transformation of speech. Part I. Non parametric methods, Speech Commun. 16, 175–205 (1995)
    • (1995) Speech Commun , vol.16 , pp. 175-205
    • Moulines, E.1    Laroche, J.2
  • 16
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines, F. Charpentier: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun. 9, 453–467 (1990)
    • (1990) Speech Commun , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 17
    • 0022249911 scopus 로고
    • High-quality time-scale modification of speech
    • pp
    • S. Roucos, A. Wilgus: High-quality time-scale modification of speech, Proc. IEEE ICASSP (1985) pp. 493–496
    • (1985) Proc. IEEE ICASSP , pp. 493-496
    • Roucos, S.1    Wilgus, A.2
  • 18
    • 0027252181 scopus 로고
    • An overlap-add technique based on waveform similarity (Wsola) for high quality time-scale modification of speech
    • Vol.,) pp
    • W. Verhelst, M. Roelands: An overlap-add technique based on waveform similarity (wsola) for high quality time-scale modification of speech, Proc. IEEE ICASSP, Vol. 2 (1993) pp. 554– 557
    • (1993) Proc. IEEE ICASSP , vol.2 , pp. 554-557
    • Verhelst, W.1    Roelands, M.2
  • 19
  • 20
    • 0022906142 scopus 로고
    • Speaker adaptation through vector quantization
    • Vol.,) pp
    • K. Shikano, K. Lee, R. Reddy: Speaker adaptation through vector quantization, Proc. IEEE ICASSP, Vol. 11 (1986) pp. 2643–2646
    • (1986) Proc. IEEE ICASSP , vol.11 , pp. 2643-2646
    • Shikano, K.1    Lee, K.2    Reddy, R.3
  • 21
    • 0029256373 scopus 로고
    • Acoustic characteristics of speaker individuality: Control and conversion
    • H. Kuwabara, Y. Sagisaka: Acoustic characteristics of speaker individuality: Control and conversion, Speech Commun. 16(2), 165–173 (1995)
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 165-173
    • Kuwabara, H.1    Sagisaka, Y.2
  • 22
    • 85064715894 scopus 로고
    • Speech spectrum transformation based on speaker interpolation
    • Vol.,) pp
    • N. Iwahashi, Y. Sagisaka: Speech spectrum transformation based on speaker interpolation, Proc. IEEE ICASSP, Vol. 1 (1994) pp. 461–464
    • (1994) Proc. IEEE ICASSP , vol.1 , pp. 461-464
    • Iwahashi, N.1    Sagisaka, Y.2
  • 23
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA techinques
    • H. Valbret, E. Moulines, J. Tubach: Voice transformation using PSOLA techinques, Speech Commun. 11(2-3), 175–187 (1992)
    • (1992) Speech Commun , vol.11 , Issue.2-3 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.3
  • 24
    • 0029256372 scopus 로고
    • Voice conversion algorithm based on piecewise linear conversion rule of formant frequency and spectrum tilt
    • H. Mizuno, M. Abe: Voice conversion algorithm based on piecewise linear conversion rule of formant frequency and spectrum tilt, Speech Commun. 16, 153–164 (1995)
    • (1995) Speech Commun , vol.16 , pp. 153-164
    • Mizuno, H.1    Abe, M.2
  • 25
    • 85135175982 scopus 로고
    • Statistical methods for voice quality transformation
    • Vol.,) pp
    • Y. Stylianou, O. Cappé, E. Moulines: Statistical methods for voice quality transformation, Proc. Eurospeech, Vol. 95 (1995) pp. 447–450
    • (1995) Proc. Eurospeech , vol.95 , pp. 447-450
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 27
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • Vol.,) pp
    • A. Kain, M. Macon: Spectral voice conversion for text-to-speech synthesis, Proc. IEEE ICASSP, Vol. 5 (1998) pp. 285–288
    • (1998) Proc. IEEE ICASSP , vol.5 , pp. 285-288
    • Kain, A.1    Macon, M.2
  • 31
    • 0025682333 scopus 로고
    • Text independent speaker identification using automatic acoustic segmentation
    • Vol.,) pp
    • R.C. Rose, D.A. Reynolds: Text independent speaker identification using automatic acoustic segmentation, Proc. IEEE ICASSP, Vol. 1 (1990) pp. 293–296
    • (1990) Proc. IEEE ICASSP , vol.1 , pp. 293-296
    • Rose, R.C.1    Reynolds, D.A.2
  • 32
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm (Methodological)
    • A.P. Dempster, N.M. Laird, D.B. Rubin: Maximum likelihood from incomplete data via the EM algorithm (methodological), J. R. Stat. Soc. B 39(1), 1–22 (1977)
    • (1977) J. R. Stat. Soc. B , vol.39 , Issue.1 , pp. 1-22
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 33
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm (Discussion)
    • A.P. Dempster, N.M. Laird, D.B. Rubin: Maximum likelihood from incomplete data via the EM algorithm (discussion), J. R. Stat. Soc. B 39(1), 22–38 (1977)
    • (1977) J. R. Stat. Soc. B , vol.39 , Issue.1 , pp. 22-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 35
    • 0003837293 scopus 로고
    • Fundamentals of Statistical Signal Processing: Estimation Theory
    • Prentice Hall, Upper Saddle River
    • S.M. Kay: Fundamentals of Statistical Signal Processing: Estimation Theory, PH Signal Process. Ser. (Prentice Hall, Upper Saddle River 1993)
    • (1993) PH Signal Process. Ser.
    • Kay, S.M.1
  • 41
    • 84878417011 scopus 로고    scopus 로고
    • Detection of non-stationarity in speech signals and its application to time-scaling
    • Vol.,) pp
    • D. Kapilow, Y. Stylianou, J. Schroeter: Detection of non-stationarity in speech signals and its application to time-scaling, Proc. Eurospeech, Vol. 99 (1999) pp. 2307–2310
    • (1999) Proc. Eurospeech , vol.99 , pp. 2307-2310
    • Kapilow, D.1    Stylianou, Y.2    Schroeter, J.3
  • 42
    • 0033693289 scopus 로고    scopus 로고
    • Stochastic modeling of spectral adjustment for high quality pitch modification
    • Vol.,) pp
    • A. Kain, Y. Stylianou: Stochastic modeling of spectral adjustment for high quality pitch modification, Proc. IEEE ICASSP, Vol. 2 (2000) pp. 949–952
    • (2000) Proc. IEEE ICASSP , vol.2 , pp. 949-952
    • Kain, A.1    Stylianou, Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.