SCOPUS Information Search Platform - View Paper

Springer Handbooks

2008, Pages 489-504

Voice Transformation

(1) Stylianou, Yannis a

a INSTITUTE OF COMPUTER SCIENCE (Greece)

Author keywords

Dynamic Time Warping; Gaussian Mixture Model; Spectral Envelope; Speech Signal; Vocal Tract

EID: 85075914413 PISSN: 25228692 EISSN: 25228706 Source Type: Book Series
DOI: 10.1007/978-3-540-49127-9_24 Document Type: Chapter

Times cited : (8)

References (42)

1
- 0003834176
- Kluwer Academic, Dordrecht
- T. Dutoit: An Introduction to Text-to-Speech Synthesis (Kluwer Academic, Dordrecht 1997)
- (1997) An Introduction to Text-To-Speech Synthesis
- Dutoit, T.¹

2
- 0003874959
- Springer, Berlin, Heidelberg
- J.D. Markel, A.M. Gray: Linear Prediction of Speech (Springer, Berlin, Heidelberg 1976)
- (1976) Linear Prediction of Speech
- Markel, J.D.¹ Gray, A.M.²

3
- 84989426403
- A new model of LPC excitation for producing natural-sounding speech at low bit rates
- B. Atal, J. Remde: A new model of LPC excitation for producing natural-sounding speech at low bit rates, Proc. IEEE ICASSP, Vol. 7 (1982) pp. 614–617
- (1982) Proc. IEEE ICASSP , vol.7 , pp. 614-617
- Atal, B.¹ Remde, J.²

4
- 0022219187
- Code-excited linear prediction (CELP): High-quality speech at very low bit rates
- M.R. Schroeder, B.S. Atal: Code-excited linear prediction (CELP): High-quality speech at very low bit rates, Proc. IEEE ICASSP, Vol. 10 (1985) pp. 937–940
- (1985) Proc. IEEE ICASSP , vol.10 , pp. 937-940
- Schroeder, M.R.¹ Atal, B.S.²

5
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- R.J. McAulay, T.F. Quatieri: Speech analysis/synthesis based on a sinusoidal representation, IEEE Trans. Acoust. Speech Signal Process. 34, 744–754 (1986)
- (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , pp. 744-754
- McAulay, R.J.¹ Quatieri, T.F.²

6
- 50349095287
- Modeling speech based on harmonic plus noise models
- Y. Stylianou: Modeling speech based on harmonic plus noise models. In: Nonlinear Speech Modeling and Applications, ed. by G. Chollet, A. Esposito, M. Faundez (Springer, Berlin, Heidelberg 2005) pp. 375–383
- (2005) Nonlinear Speech Modeling and Applications , pp. 375-383
- Stylianou, Y.¹

7
- 0000250293
- Homomorphic analysis of speech
- A.V. Oppenheim, R.W. Schafer: Homomorphic analysis of speech, IEEE Trans. Audio Electroacoust. 16, 221–228 (1968)
- (1968) IEEE Trans. Audio Electroacoust. , vol.16 , pp. 221-228
- Oppenheim, A.V.¹ Schafer, R.W.²

8
- 0019606564
- The spectral envelope estimation vocoder
- D.B. Paul: The spectral envelope estimation vocoder, IEEE Trans. Acoust. Speech Signal Process. 29, 786–794 (1981)
- (1981) IEEE Trans. Acoust. Speech Signal Process. , vol.29 , pp. 786-794
- Paul, D.B.¹

9
- 0030127119
- Regularization techniques for discrete cepstrum estimation
- O. Cappé, E. Moulines: Regularization techniques for discrete cepstrum estimation, IEEE Signal Process. Lett. 3(4), 100–102 (1996)
- (1996) IEEE Signal Process. Lett. , vol.3 , Issue.4 , pp. 100-102
- Cappé, O.¹ Moulines, E.²

10
- 0003513556
- Prentice Hall, Englewood Cliffs
- A.V. Oppenheim, R.W. Schafer: Discrete-Time Signal Processing (Prentice Hall, Englewood Cliffs 1989)
- (1989) Discrete-Time Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.²

11
- 85135177301
- High-quality speech modification based on a harmonic + noise model
- Y. Stylianou, J. Laroche, E. Moulines: High-quality speech modification based on a harmonic + noise model, Proc. Eurospeech, Vol. 95 (1995) pp. 451–454
- (1995) Proc. Eurospeech , vol.95 , pp. 451-454
- Stylianou, Y.¹ Laroche, J.² Moulines, E.³

12
- 0026830163
- Shape invariant time-scale and pitch modification of speech
- T.F. Quatieri, R.J. McAulay: Shape invariant time-scale and pitch modification of speech, IEEE Trans. Signal Process. 40, 497–510 (1992)
- (1992) IEEE Trans. Signal Process. , vol.40 , pp. 497-510
- Quatieri, T.F.¹ McAulay, R.J.²

13
- 0003515694
- Low-rate speech coding based on the sinusoidal model
- R.J. McAulay, T.F. Quatieri: Low-rate speech coding based on the sinusoidal model. In: Advances in Speech Signal Processing, ed. by S. Furui, M. Sondhi (Marcel Dekker, New York 1991) pp. 165–208, Chap. 6
- (1991) Advances in Speech Signal Processing , pp. 165-208
- McAulay, R.J.¹ Quatieri, T.F.²

14
- 0008598948
- Variable-frequency synthesis: An improved harmonic coding scheme
- L. Almeida, F. Silva: Variable-frequency synthesis: An improved harmonic coding scheme, Proc. IEEE ICASSP, Vol. 9 (1984) pp. 437–440
- (1984) Proc. IEEE ICASSP , vol.9 , pp. 437-440
- Almeida, L.¹ Silva, F.²

15
- 0029254163
- Techniques for pitch-scale and time-scale transformation of speech. Part I. Non parametric methods
- E. Moulines, J. Laroche: Techniques for pitch-scale and time-scale transformation of speech. Part I. Non parametric methods, Speech Commun. 16, 175–205 (1995)
- (1995) Speech Commun , vol.16 , pp. 175-205
- Moulines, E.¹ Laroche, J.²

16
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines, F. Charpentier: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun. 9, 453–467 (1990)
- (1990) Speech Commun , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

17
- 0022249911
- High-quality time-scale modification of speech
- S. Roucos, A. Wilgus: High-quality time-scale modification of speech, Proc. IEEE ICASSP (1985) pp. 493–496
- (1985) Proc. IEEE ICASSP , pp. 493-496
- Roucos, S.¹ Wilgus, A.²

18
- 0027252181
- An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech
- W. Verhelst, M. Roelands: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech, Proc. IEEE ICASSP, Vol. 2 (1993) pp. 554–557
- (1993) Proc. IEEE ICASSP , vol.2 , pp. 554-557
- Verhelst, W.¹ Roelands, M.²

19
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, H. Kuwabara: Voice conversion through vector quantization, Proc. IEEE ICASSP, Vol. 1 (1988) pp. 655–658
- (1988) Proc. IEEE ICASSP , vol.1 , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

20
- 0022906142
- Speaker adaptation through vector quantization
- K. Shikano, K. Lee, R. Reddy: Speaker adaptation through vector quantization, Proc. IEEE ICASSP, Vol. 11 (1986) pp. 2643–2646
- (1986) Proc. IEEE ICASSP , vol.11 , pp. 2643-2646
- Shikano, K.¹ Lee, K.² Reddy, R.³

21
- 0029256373
- Acoustic characteristics of speaker individuality: Control and conversion
- H. Kuwabara, Y. Sagisaka: Acoustic characteristics of speaker individuality: Control and conversion, Speech Commun. 16(2), 165–173 (1995)
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 165-173
- Kuwabara, H.¹ Sagisaka, Y.²

22
- 85064715894
- Speech spectrum transformation based on speaker interpolation
- N. Iwahashi, Y. Sagisaka: Speech spectrum transformation based on speaker interpolation, Proc. IEEE ICASSP, Vol. 1 (1994) pp. 461–464
- (1994) Proc. IEEE ICASSP , vol.1 , pp. 461-464
- Iwahashi, N.¹ Sagisaka, Y.²

23
- 0026880275
- Voice transformation using PSOLA techniques
- H. Valbret, E. Moulines, J. Tubach: Voice transformation using PSOLA techniques, Speech Commun. 11(2-3), 175–187 (1992)
- (1992) Speech Commun , vol.11 , Issue.2-3 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J.³

24
- 0029256372
- Voice conversion algorithm based on piecewise linear conversion rule of formant frequency and spectrum tilt
- H. Mizuno, M. Abe: Voice conversion algorithm based on piecewise linear conversion rule of formant frequency and spectrum tilt, Speech Commun. 16, 153–164 (1995)
- (1995) Speech Commun , vol.16 , pp. 153-164
- Mizuno, H.¹ Abe, M.²

25
- 85135175982
- Statistical methods for voice quality transformation
- Y. Stylianou, O. Cappé, E. Moulines: Statistical methods for voice quality transformation, Proc. Eurospeech, Vol. 95 (1995) pp. 447–450
- (1995) Proc. Eurospeech , vol.95 , pp. 447-450
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

26
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, E. Moulines: Continuous probabilistic transform for voice conversion, IEEE Trans. Speech Audio Process. 6(2), 131–142 (1998)
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

27
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain, M. Macon: Spectral voice conversion for text-to-speech synthesis, Proc. IEEE ICASSP, Vol. 5 (1998) pp. 285–288
- (1998) Proc. IEEE ICASSP , vol.5 , pp. 285-288
- Kain, A.¹ Macon, M.²

28
- 34047245444
- Non parallel training for voice conversion based on a parameter adaptation
- A. Mouchtaris, J. Van der Spiegel, P. Mueller: Non parallel training for voice conversion based on a parameter adaptation, IEEE Trans. Audio Speech Language Process. 14(3), 952–963 (2006)
- (2006) IEEE Trans. Audio Speech Language Process. , vol.14 , Issue.3 , pp. 952-963
- Mouchtaris, A.¹ Van der Spiegel, J.² Mueller, P.³

29
- 33947623206
- Text-independent voice conversion based on unit selection
- D. Suendermann, H. Hoege, A. Bonafonte, H. Ney, A. Black, S. Narayanan: Text-independent voice conversion based on unit selection, Proc. IEEE ICASSP, Vol. 1 (2006) pp. 81–84
- (2006) Proc. IEEE ICASSP , vol.1 , pp. 81-84
- Suendermann, D.¹ Hoege, H.² Bonafonte, A.³ Ney, H.⁴ Black, A.⁵ Narayanan, S.⁶

30
- 0003472470
- Wiley, New York
- R.O. Duda, P.E. Hart: Pattern Classification and Scene Analysis (Wiley, New York 1973)
- (1973) Pattern Classification and Scene Analysis
- Duda, R.O.¹ Hart, P.E.²

31
- 0025682333
- Text independent speaker identification using automatic acoustic segmentation
- R.C. Rose, D.A. Reynolds: Text independent speaker identification using automatic acoustic segmentation, Proc. IEEE ICASSP, Vol. 1 (1990) pp. 293–296
- (1990) Proc. IEEE ICASSP , vol.1 , pp. 293-296
- Rose, R.C.¹ Reynolds, D.A.²

32
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm (Methodological)
- A.P. Dempster, N.M. Laird, D.B. Rubin: Maximum likelihood from incomplete data via the EM algorithm (methodological), J. R. Stat. Soc. B 39(1), 1–22 (1977)
- (1977) J. R. Stat. Soc. B , vol.39 , Issue.1 , pp. 1-22
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

33
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm (Discussion)
- A.P. Dempster, N.M. Laird, D.B. Rubin: Maximum likelihood from incomplete data via the EM algorithm (discussion), J. R. Stat. Soc. B 39(1), 22–38 (1977)
- (1977) J. R. Stat. Soc. B , vol.39 , Issue.1 , pp. 22-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

34
- 0004244302
- Prentice Hall, Upper Saddle River
- L.R. Rabiner, B.-H. Juang: Fundamentals of Speech Recognition (Prentice Hall, Upper Saddle River 1993)
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.-H.²

35
- 0003837293
- Fundamentals of Statistical Signal Processing: Estimation Theory
- Prentice Hall, Upper Saddle River
- S.M. Kay: Fundamentals of Statistical Signal Processing: Estimation Theory, PH Signal Process. Ser. (Prentice Hall, Upper Saddle River 1993)
- (1993) PH Signal Process. Ser.
- Kay, S.M.¹

36
- 0003515753
- Chapman Hall, Boca Raton
- C. Chatfield, A.J. Collins: Introduction to Multivariate Analysis (Chapman Hall, Boca Raton 1980)
- (1980) Introduction to Multivariate Analysis
- Chatfield, C.¹ Collins, A.J.²

37
- 0003447548
- Ph.D. Thesis, Ecole Nationale Supérieure des Télécommunications, Paris
- Y. Stylianou: Harmonic plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification, Ph.D. Thesis (Ecole Nationale Supérieure des Télécommunications, Paris 1996)
- (1996) Harmonic plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification
- Stylianou, Y.¹

38
- 0003927842
- Prentice Hall, Englewood Cliffs
- T.F. Quatieri: Discrete-Time Speech Signal Processing (Prentice Hall, Englewood Cliffs 2002)
- (2002) Discrete-Time Speech Signal Processing
- Quatieri, T.F.¹

39
- 0003719446
- Pro-Ed, Austin
- J.M. Pickett: The Sounds of Speech Communication (Pro-Ed, Austin 1980)
- (1980) The Sounds of Speech Communication
- Pickett, J.M.¹

40
- 0344557324
- Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge
- C. Jankowski: Fine Structure Features for Speaker Identification, Ph.D. Thesis (Massachusetts Institute of Technology, Cambridge 1996)
- (1996) Fine Structure Features for Speaker Identification
- Jankowski, C.¹

41
- 84878417011
- Detection of non-stationarity in speech signals and its application to time-scaling
- D. Kapilow, Y. Stylianou, J. Schroeter: Detection of non-stationarity in speech signals and its application to time-scaling, Proc. Eurospeech, Vol. 99 (1999) pp. 2307–2310
- (1999) Proc. Eurospeech , vol.99 , pp. 2307-2310
- Kapilow, D.¹ Stylianou, Y.² Schroeter, J.³

42
- 0033693289
- Stochastic modeling of spectral adjustment for high quality pitch modification
- A. Kain, Y. Stylianou: Stochastic modeling of spectral adjustment for high quality pitch modification, Proc. IEEE ICASSP, Vol. 2 (2000) pp. 949–952
- (2000) Proc. IEEE ICASSP , vol.2 , pp. 949-952
- Kain, A.¹ Stylianou, Y.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.