메뉴 건너뛰기




Volumn 6, Issue 3, 1998, Pages 201-216

HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress

Author keywords

Lombard effect; Robust speech recognition; Speech synthesis; Speech under stress

Indexed keywords

MARKOV PROCESSES; MATHEMATICAL MODELS; SPECTRUM ANALYSIS; SPEECH ANALYSIS; SPEECH SYNTHESIS;

EID: 0032069798     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.668815     Document Type: Article
Times cited : (47)

References (28)
  • 1
    • 0023168987 scopus 로고
    • Cepstral domain stress compensation for robust speech recognition
    • Dallas, TX, Apr.
    • Y. Chen, "Cepstral domain stress compensation for robust speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing. Dallas, TX, Apr. 1987, 717-720.
    • (1987) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 717-720
    • Chen, Y.1
  • 8
    • 0029325035 scopus 로고
    • Implementation and testing of a system for producing emotion-by-rule in synthetic speech
    • I. R. Murray and J. L. Amott, "Implementation and testing of a system for producing emotion-by-rule in synthetic speech," Speech Commun., vol. 16, pp. 369-390, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 369-390
    • Murray, I.R.1    Amott, J.L.2
  • 10
    • 0030285967 scopus 로고    scopus 로고
    • Generating stressed speech from neutral speech using a modified CELP vocoder
    • Nov.
    • S. E. Bou-Ghazale and J. Hansen, "Generating stressed speech from neutral speech using a modified CELP vocoder," Speech Commun., vol. 20, pp. 93-110, Nov. 1996.
    • (1996) Speech Commun. , vol.20 , pp. 93-110
    • Bou-Ghazale, S.E.1    Hansen, J.2
  • 12
    • 85032661370 scopus 로고
    • Duration and spectral based stress token generation for HMM speech recognition under stress
    • Adelaide, Australia, Apr.
    • S. E. Bou-Ghazale and J. H. L. Hansen, "Duration and spectral based stress token generation for HMM speech recognition under stress," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Adelaide, Australia, Apr. 1994, pp. 413-416.
    • (1994) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 413-416
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 13
    • 0028516405 scopus 로고
    • Morphological constrained enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect
    • Oct.
    • J. H. L. Hansen, "Morphological constrained enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect," IEEE Trans. Speech Audio Processing, vol. 2, pp. 598-614, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 598-614
    • Hansen, J.H.L.1
  • 14
    • 85106119047 scopus 로고
    • Lombard effect compensation for robust automatic speech recognition in noise
    • Kobe, Japan, Nov.
    • J. H. L: Hansen and O. N. Bria, "Lombard effect compensation for robust automatic speech recognition in noise," in Proc. Int. Conf. Spoken Language Processing, Kobe, Japan, Nov. 1990, pp. 1125-1128.
    • (1990) Proc. Int. Conf. Spoken Language Processing , pp. 1125-1128
    • Hansen, J.H.L.1    Bria, O.N.2
  • 15
    • 0027465491 scopus 로고
    • The Lombard reflex and its role on human listeners and automatic speech recognizers
    • Jan.
    • J. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," J. Acoust. Soc. Amer., vol. 93, pp. 510-523, Jan. 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 510-523
    • Junqua, J.1
  • 16
    • 0030283946 scopus 로고    scopus 로고
    • Classification of speech under stress using target driven features
    • Nov.
    • B. D. Womack and J. H. L. Hansen, "Classification of speech under stress using target driven features," Speech Commun., vol. 20, pp. 131-150, Nov. 1996.
    • (1996) Speech Commun. , vol.20 , pp. 131-150
    • Womack, B.D.1    Hansen, J.H.L.2
  • 17
    • 0029375589 scopus 로고
    • Robust speech recognition training via duration and spectral-based stress token generation
    • Sept.
    • J. H. L. Hansen and S. E. Bou-Ghazale, "Robust speech recognition training via duration and spectral-based stress token generation," IEEE Trans. Speech Audio Processing, vol. 3, pp. 415-121, Sept. 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 415-1121
    • Hansen, J.H.L.1    Bou-Ghazale, S.E.2
  • 20
    • 0022796218 scopus 로고
    • Synthesis of natural sounding pitch contours in isolated utterances using hidden Markov models
    • Oct.
    • A. Ljolje and F. Fallside, "Synthesis of natural sounding pitch contours in isolated utterances using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1074-1080, Oct. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 1074-1080
    • Ljolje, A.1    Fallside, F.2
  • 22
    • 0022685753 scopus 로고
    • Continuously variable duration hidden Markov models for automatic speech recognition
    • S. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol. 1, pp. 29-45, 1986.
    • (1986) Comput. Speech Lang. , vol.1 , pp. 29-45
    • Levinson, S.1
  • 23
    • 0022234383 scopus 로고
    • Explicit modeling of state occupancy in hidden Markov models for automatic speech recognition
    • Tampa, FL, Mar.
    • M. J. Russell and R. K. Moore, "Explicit modeling of state occupancy in hidden Markov models for automatic speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Tampa, FL, Mar. 1985, pp. 5-8.
    • (1985) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 5-8
    • Russell, M.J.1    Moore, R.K.2
  • 25
    • 0015409613 scopus 로고
    • Emotions and speech: Some acoustical correlates
    • C. E. Williams and K. N. Stevens, "Emotions and speech: Some acoustical correlates," J. Acoust. Soc. Amer., vol. 52, pp. 1238-1250, 1972.
    • (1972) J. Acoust. Soc. Amer. , vol.52 , pp. 1238-1250
    • Williams, C.E.1    Stevens, K.N.2
  • 26
    • 0028630509 scopus 로고
    • Nonlinear analysis and detection of speech under stressed conditions
    • D. A. Cairns and J. H. L. Hansen, "Nonlinear analysis and detection of speech under stressed conditions," J. Acoust. Soc. Amer., vol. 96, pp. 3392-2400, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.96 , pp. 3392-12400
    • Cairns, D.A.1    Hansen, J.H.L.2
  • 27
    • 0030196359 scopus 로고    scopus 로고
    • Feature analysis and neural network based classification of speech under stress
    • July
    • J. H. L. Hansen and B. D. Womack, "Feature analysis and neural network based classification of speech under stress," IEEE Trans. Speech Audio Processing, vol. 4, pp. 307-13, July 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 307-313
    • Hansen, J.H.L.1    Womack, B.D.2
  • 28
    • 0015476226 scopus 로고
    • Automatic speaker recognition based on pitch contours
    • B. S. Atal, "Automatic speaker recognition based on pitch contours," J. Acoust. Soc. Amer., vol. 52, pp. 1687-1697, 1972.
    • (1972) J. Acoust. Soc. Amer. , vol.52 , pp. 1687-1697
    • Atal, B.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.