메뉴 건너뛰기




Volumn 9, Issue 1, 2001, Pages 30-38

Control of spectral dynamics in concatenative speech synthesis

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; DATABASE SYSTEMS; LINGUISTICS; MARKOV PROCESSES; PARAMETER ESTIMATION; SIGNAL PROCESSING; SPEECH ANALYSIS; SPEECH INTELLIGIBILITY; TRANSFER FUNCTIONS;

EID: 0035124445     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.890069     Document Type: Article
Times cited : (45)

References (30)
  • 1
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenate speech synthesis system using a large speech database
    • A. J. Hunt and A. W. Black, "Unit selection in a concatenate speech synthesis system using a large speech database," in Int. Conf. Acoustics, Speech, Signal Processing'96, 1996, pp. 373-376.
    • (1996) Int. Conf. Acoustics, Speech, Signal Processing' , vol.96 , pp. 373-376
    • Hunt, A.J.1    Black, A.W.2
  • 2
    • 84944962517 scopus 로고    scopus 로고
    • The IBM trainable speech synthesis system
    • Dec.
    • R. Donovan, "The IBM trainable speech synthesis system," Int. Conf. Speech Language Processing, vol. 5, pp. 1703-1706, Dec. 1998.
    • (1998) Int. Conf. Speech Language Processing , vol.5 , pp. 1703-1706
    • Donovan, R.1
  • 4
    • 85021282610 scopus 로고    scopus 로고
    • Non-uniform unit selection and the similarity metric within BT's Laureate TTS system
    • Nov.
    • A. P. Breen and P. Jackson, "Non-uniform unit selection and the similarity metric within BT's Laureate TTS system," in Proc. 3rd ESCA/COCOSDA Workshop Speech Synthesis, Nov. 1998, pp. 201-206.
    • (1998) Proc. 3rd ESCA/COCOSDA Workshop Speech Synthesis , pp. 201-206
    • Breen, A.P.1    Jackson, P.2
  • 5
    • 0000665734 scopus 로고    scopus 로고
    • Explaining phonetic variation: A sketch of the H&H theory
    • W. J. Hardcastle and A. Marchai, Eds. Norwell, MA: Kluwer
    • B. Lindblom, "Explaining phonetic variation: A sketch of the H&H theory," in Speech Production and Speech Modeling, W. J. Hardcastle and A. Marchai, Eds. Norwell, MA: Kluwer, 1990, pp. 403-439.
    • Speech Production and Speech Modeling , vol.1990 , pp. 403-439
    • Lindblom, B.1
  • 7
    • 81155152572 scopus 로고    scopus 로고
    • A perceptual evaluation of distance measures for concatenative speech synthesis
    • Nov. 1998
    • J. Wouters and M. W. Macon, "A perceptual evaluation of distance measures for concatenative speech synthesis," in Int. Conf. Speech Language Processing, vol. 6, Nov. 1998, pp. 2747-2750.
    • Int. Conf. Speech Language Processing , vol.6 , pp. 2747-2750
    • Wouters, J.1    Macon, M.W.2
  • 8
    • 81155150210 scopus 로고    scopus 로고
    • On the reduction of concatenation artefacts in diphone synthesis
    • Nov. 1998
    • |8] E. Klabbers and R. Veldhuis, "On the reduction of concatenation artefacts in diphone synthesis," in Int. Conf. Speech Language Processing, vol. 6, Nov. 1998, pp. 2759-2762.
    • Int. Conf. Speech Language Processing , vol.6 , pp. 2759-2762
    • Klabbers, E.1    Veldhuis, R.2
  • 9
    • 0007969066 scopus 로고
    • On the ability of various speech models to smooth segment discontinuities in the context of text-to-speech synthesis by concatenation
    • T. Dutoit and H. Leich, "On the ability of various speech models to smooth segment discontinuities in the context of text-to-speech synthesis by concatenation," in Proc. EUSIPCO, vol. 1, 1994, pp. 8-12.
    • (1994) Proc. EUSIPCO , vol.1 , pp. 8-12
    • Dutoit, T.1    Leich, H.2
  • 10
    • 2142655909 scopus 로고    scopus 로고
    • Interpolation properties of linear prediction parametric representations
    • K. K. Paliwal, "Interpolation properties of linear prediction parametric representations," in Proc. Enrospeech: ESCA, 1995, pp. 1029-1032.
    • Proc. Enrospeech: ESCA , vol.1995 , pp. 1029-1032
    • Paliwal, K.K.1
  • 12
    • 0025543906 scopus 로고
    • Pitch synchronous waveform processing techniques for text-to:speech synthesis using diphones,SpcfC/i
    • Dec.
    • E. Moulines and F. Charpentier, "Pitch synchronous waveform processing techniques for text-to:speech synthesis using diphones,"SpcfC/i Commun., vol. 9, no. 5/6, pp. 453-467, Dec. 1990.
    • (1990) Commun. , vol.9 , Issue.5-6 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 13
    • 33744624029 scopus 로고    scopus 로고
    • International Telecommunication Union. (1996) Methods for subjective determination of transmission quality. [Online]. Available: http://www.itu.int
    • , vol.1996
    • Union, I.T.1
  • 14
    • 84942397864 scopus 로고
    • Spectrographic study of vowel reduction,J
    • Nov.
    • B. Lindblom, "Spectrographic study of vowel reduction,"J. Acoust. Soc. Amer., vol. 35, pp. 1773-1781, Nov. 1963.
    • (1963) Acoust. Soc. Amer. , vol.35 , pp. 1773-1781
    • Lindblom, B.1
  • 15
    • 0014374997 scopus 로고
    • Effect of speaking rate on diphthong formant movements
    • T. Gay, "Effect of speaking rate on diphthong formant movements," J. Acoust. Soc. Amer., vol. 44, no. 6, pp. 1570-1573, 1968.
    • (1968) J. Acoust. Soc. Amer. , vol.44 , Issue.6 , pp. 1570-1573
    • Gay, T.1
  • 16
    • 0026090950 scopus 로고
    • Tempo, stress and vowel reduction in American English
    • Oct.
    • M. Fourakis, "Tempo, stress and vowel reduction in American English," J. Acoust. Soc. Amer., vol. 90, pp. 1816-1827, Oct. 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.90 , pp. 1816-1827
    • Fourakis, M.1
  • 17
    • 0027554395 scopus 로고
    • Acoustic vowel reduction as a function of sentence accent, word stress and word class
    • Mar.
    • D. R. van Bergem, "Acoustic vowel reduction as a function of sentence accent, word stress and word class," Speech Commun., vol. 12, pp. 1-23, Mar. 1993.
    • (1993) Speech Commun. , vol.12 , pp. 1-23
    • Van Bergem, D.R.1
  • 18
    • 0023407575 scopus 로고
    • Review of text-to-speech conversion for English
    • Sept.
    • D. H. Klatt, "Review of text-to-speech conversion for English," J. Acoust. Soc. Amer., vol. 82, pp. 737-793, Sept. 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , pp. 737-793
    • Klatt, D.H.1
  • 19
    • 0002646675 scopus 로고    scopus 로고
    • Segmental reduction in connected speech in German: Phonological facts and phonetic explanations
    • W. J. Hardcastle and A. Marchai, Eds. Norwell, MA: Kluwer
    • K. J. Kohler, "Segmental reduction in connected speech in German: Phonological facts and phonetic explanations," in Speech Production and Speech Modeling, W. J. Hardcastle and A. Marchai, Eds. Norwell, MA: Kluwer, 1990, pp. 69-92.
    • Speech Production and Speech Modeling , vol.1990 , pp. 69-92
    • Kohler, K.J.1
  • 20
    • 0026940107 scopus 로고
    • The use of speech synthesis in exploring different speaking styles
    • Oct.
    • B. Granström, "The use of speech synthesis in exploring different speaking styles," Speech Commun., vol. 11, pp. 347-355, Oct. 1992.
    • (1992) Speech Commun. , vol.11 , pp. 347-355
    • Granström, B.1
  • 21
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • Apr
    • J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, pp. 561-580, Apr. 1975.
    • (1975) Proc. IEEE , vol.63 , pp. 561-580
    • Makhoul, J.1
  • 26
    • 85135175982 scopus 로고
    • Statistical methods for voice quality transformation
    • Sept.
    • Y. Stylianou, O. Cappé, and E. Moulines, "Statistical methods for voice quality transformation," in Proc. Eurospeech, Sept. 1995, pp. 447-450.
    • (1995) Proc. Eurospeech , pp. 447-450
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 28
    • 0021411482 scopus 로고
    • Maximum likelihood spectral estimation and its application to narrow-band speech coding
    • Aug
    • R. J. McAulay, "Maximum likelihood spectral estimation and its application to narrow-band speech coding," IEEE Trans. Aconst., Speech, Signal Processing, vol. ASSP-34, pp. 744-754, Aug. 1984.
    • (1984) IEEE Trans. Aconst., Speech, Signal Processing, Vol. ASSP , vol.34 , pp. 744-754
    • McAulay, R.J.1
  • 29
    • 0001935942 scopus 로고
    • Sinusoidal coding
    • W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier
    • R. J. McAulay and T. F. Quatieri, "Sinusoidal coding," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier, 1995, pp. 121-173.
    • (1995) Speech Coding and Synthesis , pp. 121-173
    • McAulay, R.J.1    Quatieri, T.F.2
  • 30
    • 0026204672 scopus 로고
    • Formant extraction from group delay function
    • H. A. Murthy and B. Yegnanarayana, "Formant extraction from group delay function," Speech Commun., vol. 10, no. 3, pp. 209-221, 1991.
    • (1991) Speech Commun. , vol.10 , Issue.3 , pp. 209-221
    • Murthy, H.A.1    Yegnanarayana, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.