SCOPUS Information Retrieval Platform

IEEE Transactions on Speech and Audio Processing

Volume 9, Issue 1, 2001, Pages 30-38

Control of spectral dynamics in concatenative speech synthesis

(2) Wouters, Johan a Macon, Michael W a

a OREGON HEALTH AND SCIENCE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; DATABASE SYSTEMS; LINGUISTICS; MARKOV PROCESSES; PARAMETER ESTIMATION; SIGNAL PROCESSING; SPEECH ANALYSIS; SPEECH INTELLIGIBILITY; TRANSFER FUNCTIONS;

ARTICULATION EFFORT; CONCATENATIVE SPEECH SYNTHESIS; SPECTRAL DYNAMICS; SPEECH MODIFICATION;

SPEECH SYNTHESIS;

EID: 0035124445 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.890069 Document Type: Article

Times cited : (45)

References (30)

1
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- A. J. Hunt and A. W. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Int. Conf. Acoustics, Speech, Signal Processing'96, 1996, pp. 373-376.
- (1996) Int. Conf. Acoustics, Speech, Signal Processing'96 , pp. 373-376
- Hunt, A.J.¹ Black, A.W.²

2
- 84944962517
- The IBM trainable speech synthesis system
- Dec.
- R. Donovan, "The IBM trainable speech synthesis system," Int. Conf. Speech Language Processing, vol. 5, pp. 1703-1706, Dec. 1998.
- (1998) Int. Conf. Speech Language Processing , vol.5 , pp. 1703-1706
- Donovan, R.¹

3
- 85133503504
- Diphone synthesis using unit selection
- Nov.
- M. Beutnagel, A. Conkie, and A. Syrdal, "Diphone synthesis using unit selection," in Proc. 3rd ESCA/COCOSDA Workshop Speech Synthesis, Nov. 1998, pp. 185-190.
- (1998) Proc. 3rd ESCA/COCOSDA Workshop Speech Synthesis , pp. 185-190
- Beutnagel, M.¹ Conkie, A.² Syrdal, A.³

4
- 85021282610
- Non-uniform unit selection and the similarity metric within BT's Laureate TTS system
- Nov.
- A. P. Breen and P. Jackson, "Non-uniform unit selection and the similarity metric within BT's Laureate TTS system," in Proc. 3rd ESCA/COCOSDA Workshop Speech Synthesis, Nov. 1998, pp. 201-206.
- (1998) Proc. 3rd ESCA/COCOSDA Workshop Speech Synthesis , pp. 201-206
- Breen, A.P.¹ Jackson, P.²

5
- 0000665734
- Explaining phonetic variation: A sketch of the H&H theory
- W. J. Hardcastle and A. Marchal, Eds. Norwell, MA: Kluwer
- B. Lindblom, "Explaining phonetic variation: A sketch of the H&H theory," in Speech Production and Speech Modeling, W. J. Hardcastle and A. Marchal, Eds. Norwell, MA: Kluwer, 1990, pp. 403-439.
- (1990) Speech Production and Speech Modeling , pp. 403-439
- Lindblom, B.¹

6
- 0033676753
- Spectral modification for concatenative speech synthesis
- June
- J. Wouters and M. W. Macon, "Spectral modification for concatenative speech synthesis," in Int. Conf. Acoustics Speech Signal Processing, June 2000, pp. II.941-II.944.
- (2000) Int. Conf. Acoustics Speech Signal Processing , pp. 941-944
- Wouters, J.¹ Macon, M.W.²

7
- 81155152572
- A perceptual evaluation of distance measures for concatenative speech synthesis
- Nov. 1998
- J. Wouters and M. W. Macon, "A perceptual evaluation of distance measures for concatenative speech synthesis," in Int. Conf. Speech Language Processing, vol. 6, Nov. 1998, pp. 2747-2750.
- (1998) Int. Conf. Speech Language Processing , vol.6 , pp. 2747-2750
- Wouters, J.¹ Macon, M.W.²

8
- 81155150210
- On the reduction of concatenation artefacts in diphone synthesis
- Nov. 1998
- E. Klabbers and R. Veldhuis, "On the reduction of concatenation artefacts in diphone synthesis," in Int. Conf. Speech Language Processing, vol. 6, Nov. 1998, pp. 2759-2762.
- (1998) Int. Conf. Speech Language Processing , vol.6 , pp. 2759-2762
- Klabbers, E.¹ Veldhuis, R.²

9
- 0007969066
- On the ability of various speech models to smooth segment discontinuities in the context of text-to-speech synthesis by concatenation
- T. Dutoit and H. Leich, "On the ability of various speech models to smooth segment discontinuities in the context of text-to-speech synthesis by concatenation," in Proc. EUSIPCO, vol. 1, 1994, pp. 8-12.
- (1994) Proc. EUSIPCO , vol.1 , pp. 8-12
- Dutoit, T.¹ Leich, H.²

10
- 2142655909
- Interpolation properties of linear prediction parametric representations
- K. K. Paliwal, "Interpolation properties of linear prediction parametric representations," in Proc. Eurospeech: ESCA, 1995, pp. 1029-1032.
- (1995) Proc. Eurospeech: ESCA , pp. 1029-1032
- Paliwal, K.K.¹

11
- 0003162919
- HMM-based smoothing for concatenative speech synthesis
- M. Plumpe, A. Acero, H. Hon, and X. Huang, "HMM-based smoothing for concatenative speech synthesis," in Int. Conf. Speech Language Processing, Dec. 1998, pp. 2751-2754.
- (1998) Int. Conf. Speech Language Processing , pp. 2751-2754
- Plumpe, M.¹ Acero, A.² Hon, H.³ Huang, X.⁴

12
- 0025543906
- Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Dec.
- E. Moulines and F. Charpentier, "Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol. 9, no. 5/6, pp. 453-467, Dec. 1990.
- (1990) Speech Commun. , vol.9 , Issue.5-6 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

13
- 33744624029
- International Telecommunication Union. (1996) Methods for subjective determination of transmission quality. [Online]. Available: http://www.itu.int
- (1996)
- International Telecommunication Union¹

14
- 84942397864
- Spectrographic study of vowel reduction
- Nov.
- B. Lindblom, "Spectrographic study of vowel reduction," J. Acoust. Soc. Amer., vol. 35, pp. 1773-1781, Nov. 1963.
- (1963) J. Acoust. Soc. Amer. , vol.35 , pp. 1773-1781
- Lindblom, B.¹

15
- 0014374997
- Effect of speaking rate on diphthong formant movements
- T. Gay, "Effect of speaking rate on diphthong formant movements," J. Acoust. Soc. Amer., vol. 44, no. 6, pp. 1570-1573, 1968.
- (1968) J. Acoust. Soc. Amer. , vol.44 , Issue.6 , pp. 1570-1573
- Gay, T.¹

16
- 0026090950
- Tempo, stress and vowel reduction in American English
- Oct.
- M. Fourakis, "Tempo, stress and vowel reduction in American English," J. Acoust. Soc. Amer., vol. 90, pp. 1816-1827, Oct. 1991.
- (1991) J. Acoust. Soc. Amer. , vol.90 , pp. 1816-1827
- Fourakis, M.¹

17
- 0027554395
- Acoustic vowel reduction as a function of sentence accent, word stress and word class
- Mar.
- D. R. van Bergem, "Acoustic vowel reduction as a function of sentence accent, word stress and word class," Speech Commun., vol. 12, pp. 1-23, Mar. 1993.
- (1993) Speech Commun. , vol.12 , pp. 1-23
- Van Bergem, D.R.¹

18
- 0023407575
- Review of text-to-speech conversion for English
- Sept.
- D. H. Klatt, "Review of text-to-speech conversion for English," J. Acoust. Soc. Amer., vol. 82, pp. 737-793, Sept. 1987.
- (1987) J. Acoust. Soc. Amer. , vol.82 , pp. 737-793
- Klatt, D.H.¹

19
- 0002646675
- Segmental reduction in connected speech in German: Phonological facts and phonetic explanations
- W. J. Hardcastle and A. Marchal, Eds. Norwell, MA: Kluwer
- K. J. Kohler, "Segmental reduction in connected speech in German: Phonological facts and phonetic explanations," in Speech Production and Speech Modeling, W. J. Hardcastle and A. Marchal, Eds. Norwell, MA: Kluwer, 1990, pp. 69-92.
- (1990) Speech Production and Speech Modeling , pp. 69-92
- Kohler, K.J.¹

20
- 0026940107
- The use of speech synthesis in exploring different speaking styles
- Oct.
- B. Granström, "The use of speech synthesis in exploring different speaking styles," Speech Commun., vol. 11, pp. 347-355, Oct. 1992.
- (1992) Speech Commun. , vol.11 , pp. 347-355
- Granström, B.¹

21
- 0016495091
- Linear prediction: A tutorial review
- Apr.
- J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, pp. 561-580, Apr. 1975.
- (1975) Proc. IEEE , vol.63 , pp. 561-580
- Makhoul, J.¹

22
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- Aug.
- R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Processing, vol. 34, pp. 744-754, Aug. 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.34 , pp. 744-754
- McAulay, R.J.¹ Quatieri, T.F.²

23
- 0005500345
- Ph.D. dissertation, Georgia Inst. Technol., Atlanta, Oct.
- M. W. Macon, "Speech synthesis based on sinusoidal modeling," Ph.D. dissertation, Georgia Inst. Technol., Atlanta, Oct. 1996.
- (1996) Speech Synthesis Based on Sinusoidal Modeling
- Macon, M.W.¹

24
- 0027268967
- HNS: Speech modification based on a harmonic + noise model
- J. Laroche, Y. Stylianou, and E. Moulines, "HNS: Speech modification based on a harmonic + noise model," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1993, pp. 550-553.
- (1993) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 550-553
- Laroche, J.¹ Stylianou, Y.² Moulines, E.³

25
- 0003447548
- Ph.D. dissertation, Ecole Nationale Supérieure des Télécommunications, Paris, France, Jan.
- Y. Stylianou, "Harmonic plus noise models for speech, combined with statistical methods for speech and speaker modification," Ph.D. dissertation, Ecole Nationale Supérieure des Télécommunications, Paris, France, Jan. 1996.
- (1996) Harmonic plus Noise Models for Speech, Combined with Statistical Methods for Speech and Speaker Modification
- Stylianou, Y.¹

26
- 85135175982
- Statistical methods for voice quality transformation
- Sept.
- Y. Stylianou, O. Cappé, and E. Moulines, "Statistical methods for voice quality transformation," in Proc. Eurospeech, Sept. 1995, pp. 447-450.
- (1995) Proc. Eurospeech , pp. 447-450
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

27
- 0021124704
- Spectral envelope sampling and interpolation in linear predictive analysis of speech
- H. Hermansky, H. Fujisaki, and Y. Sato, "Spectral envelope sampling and interpolation in linear predictive analysis of speech," in Proc. Int. Conf. Acoustics, Speech, Signal Processing'84, 1984, pp. 2.2.1-2.2.4.
- (1984) Proc. Int. Conf. Acoustics, Speech, Signal Processing'84 , pp. 2.2.1-2.2.4
- Hermansky, H.¹ Fujisaki, H.² Sato, Y.³

28
- 0021411482
- Maximum likelihood spectral estimation and its application to narrow-band speech coding
- Aug.
- R. J. McAulay, "Maximum likelihood spectral estimation and its application to narrow-band speech coding," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 744-754, Aug. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 744-754
- McAulay, R.J.¹

29
- 0001935942
- Sinusoidal coding
- W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier
- R. J. McAulay and T. F. Quatieri, "Sinusoidal coding," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier, 1995, pp. 121-173.
- (1995) Speech Coding and Synthesis , pp. 121-173
- McAulay, R.J.¹ Quatieri, T.F.²

30
- 0026204672
- Formant extraction from group delay function
- H. A. Murthy and B. Yegnanarayana, "Formant extraction from group delay function," Speech Commun., vol. 10, no. 3, pp. 209-221, 1991.
- (1991) Speech Commun. , vol.10 , Issue.3 , pp. 209-221
- Murthy, H.A.¹ Yegnanarayana, B.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.