SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 6, Issue 3, 1998, Pages 201-216

HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress

(2) Bou Ghazale, Sahar E a,b Hansen, John H L a

a DUKE UNIVERSITY (United States)

b Rockwell Semiconductor Systems (United States)

Author keywords

Lombard effect; Robust speech recognition; Speech synthesis; Speech under stress

Indexed keywords

MARKOV PROCESSES; MATHEMATICAL MODELS; SPECTRUM ANALYSIS; SPEECH ANALYSIS; SPEECH SYNTHESIS;

STRESSED SPEECH;

SPEECH RECOGNITION;

EID: 0032069798 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.668815 Document Type: Article

Times cited : (47)

References (28)

1
- 0023168987
- Cepstral domain stress compensation for robust speech recognition
- Dallas, TX, Apr.
- Y. Chen, "Cepstral domain stress compensation for robust speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing. Dallas, TX, Apr. 1987, 717-720.
- (1987) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 717-720
- Chen, Y.¹

2
- 0005411393
- Ph.D. dissertation, Georgia Inst. Technol., Atlanta, July
- J. H. L. Hansen, "Analysis and compensation of stressed and noisy speech with application to robust automatic recognition," Ph.D. dissertation, Georgia Inst. Technol., Atlanta, July 1988.
- (1988) Analysis and Compensation of Stressed and Noisy Speech with Application to Robust Automatic Recognition
- Hansen, J.H.L.¹

3
- 0024932747
- Stress compensation and noise reduction algorithms for robust speech recognition
- J. H. L: Hansen and M. A. Clements, "Stress compensation and noise reduction algorithms for robust speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Glasgow, U.K., May 1989, pp. 266-269.
- (1989) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Glasgow, U.K., May , pp. 266-269
- Hansen, J.H.L.¹ Clements, M.A.²

4
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- Dallas, TXApr.
- R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Dallas, TX, "Apr. 1987, pp. 705-708.
- (1987) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

5
- 0000874053
- Le signe de l'élévation de la voix
- E. Lombard, "Le signe de l'élévation de la voix," Ann. Maladies Oreille, Larynx, Nez, Pharynx, vol. 37, pp. 101-119, 1911.
- (1911) Ann. Maladies Oreille, Larynx, Nez, Pharynx , vol.37 , pp. 101-119
- Lombard, E.¹

6
- 0012886312
- Applying analysis of human emotional speech to enhance synthetic speech
- Berlin, Germany
- A. Abadjieva, I. R. Murray, and J. L. Arnott, "Applying analysis of human emotional speech to enhance synthetic speech," in Proc. Europ. Conf. Speech Communication and Technology, Berlin, Germany, 1993, pp. 909-912.
- (1993) Proc. Europ. Conf. Speech Communication and Technology , pp. 909-912
- Abadjieva, A.¹ Murray, I.R.² Arnott, J.L.³

7
- 0002515370
- Generation of affect in synthesized speech
- J. E. Cahn, "Generation of affect in synthesized speech," in Proc. AV1OS, Meet. American Voice Input/Output Soc., 1989.
- (1989) Proc. AV1OS, Meet. American Voice Input/Output Soc.
- Cahn, J.E.¹

8
- 0029325035
- Implementation and testing of a system for producing emotion-by-rule in synthetic speech
- I. R. Murray and J. L. Amott, "Implementation and testing of a system for producing emotion-by-rule in synthetic speech," Speech Commun., vol. 16, pp. 369-390, 1995.
- (1995) Speech Commun. , vol.16 , pp. 369-390
- Murray, I.R.¹ Amott, J.L.²

9
- 0028996985
- Synthesizing styled speech using the Klatt synthesizer
- J. C. Rutledge, K. E. Cummings, D. A. Lambert, and M. A. Clements, "Synthesizing styled speech using the Klatt synthesizer," in Proc. IEEE Int. Con}. Acoustics, Speech, and Signal Processing, Detroit, MI, May 1995, pp. 648-651.
- (1995) Proc. IEEE Int. Con}. Acoustics, Speech, and Signal Processing, Detroit, MI, May , pp. 648-651
- Rutledge, J.C.¹ Cummings, K.E.² Lambert, D.A.³ Clements, M.A.⁴

10
- 0030285967
- Generating stressed speech from neutral speech using a modified CELP vocoder
- Nov.
- S. E. Bou-Ghazale and J. Hansen, "Generating stressed speech from neutral speech using a modified CELP vocoder," Speech Commun., vol. 20, pp. 93-110, Nov. 1996.
- (1996) Speech Commun. , vol.20 , pp. 93-110
- Bou-Ghazale, S.E.¹ Hansen, J.²

11
- 30244522975
- Master's thesis. Duke Univ., Durham, NC, June
- S. E. Bou-Ghazale, "Duration and spectral based stress token generation for keyword recognition using hidden Markov models," Master's thesis. Duke Univ., Durham, NC, June 1993.
- (1993) Duration and Spectral Based Stress Token Generation for Keyword Recognition Using Hidden Markov Models
- Bou-Ghazale, S.E.¹

12
- 85032661370
- Duration and spectral based stress token generation for HMM speech recognition under stress
- Adelaide, Australia, Apr.
- S. E. Bou-Ghazale and J. H. L. Hansen, "Duration and spectral based stress token generation for HMM speech recognition under stress," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Adelaide, Australia, Apr. 1994, pp. 413-416.
- (1994) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 413-416
- Bou-Ghazale, S.E.¹ Hansen, J.H.L.²

13
- 0028516405
- Morphological constrained enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect
- Oct.
- J. H. L. Hansen, "Morphological constrained enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect," IEEE Trans. Speech Audio Processing, vol. 2, pp. 598-614, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 598-614
- Hansen, J.H.L.¹

14
- 85106119047
- Lombard effect compensation for robust automatic speech recognition in noise
- Kobe, Japan, Nov.
- J. H. L: Hansen and O. N. Bria, "Lombard effect compensation for robust automatic speech recognition in noise," in Proc. Int. Conf. Spoken Language Processing, Kobe, Japan, Nov. 1990, pp. 1125-1128.
- (1990) Proc. Int. Conf. Spoken Language Processing , pp. 1125-1128
- Hansen, J.H.L.¹ Bria, O.N.²

15
- 0027465491
- The Lombard reflex and its role on human listeners and automatic speech recognizers
- Jan.
- J. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," J. Acoust. Soc. Amer., vol. 93, pp. 510-523, Jan. 1993.
- (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 510-523
- Junqua, J.¹

16
- 0030283946
- Classification of speech under stress using target driven features
- Nov.
- B. D. Womack and J. H. L. Hansen, "Classification of speech under stress using target driven features," Speech Commun., vol. 20, pp. 131-150, Nov. 1996.
- (1996) Speech Commun. , vol.20 , pp. 131-150
- Womack, B.D.¹ Hansen, J.H.L.²

17
- 0029375589
- Robust speech recognition training via duration and spectral-based stress token generation
- Sept.
- J. H. L. Hansen and S. E. Bou-Ghazale, "Robust speech recognition training via duration and spectral-based stress token generation," IEEE Trans. Speech Audio Processing, vol. 3, pp. 415-121, Sept. 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 415-1121
- Hansen, J.H.L.¹ Bou-Ghazale, S.E.²

18
- 33646923949
- Ph.D. dissertation, Duke Univ., Durham, NC, Nov.
- S. E. Bou-Ghazale, "Analysis, modeling, and perturbation of speech under stress with applications to speech synthesis and recognition," Ph.D. dissertation, Duke Univ., Durham, NC, Nov. 1996.
- (1996) Analysis, Modeling, and Perturbation of Speech under Stress with Applications to Speech Synthesis and Recognition
- Bou-Ghazale, S.E.¹

19
- 21844469662
- A study on pitch pattern generation using HMM-based statistical information
- T. Fukada, Y. Komori, T. Aso, and Y. Ohora, "A study on pitch pattern generation using HMM-based statistical information," in Proc. Int. Conf. Spoken Language Processing, Yokohama, Japan, 1994, vol. 2, pp. 723-726.
- (1994) Proc. Int. Conf. Spoken Language Processing, Yokohama, Japan , vol.2 , pp. 723-726
- Fukada, T.¹ Komori, Y.² Aso, T.³ Ohora, Y.⁴

20
- 0022796218
- Synthesis of natural sounding pitch contours in isolated utterances using hidden Markov models
- Oct.
- A. Ljolje and F. Fallside, "Synthesis of natural sounding pitch contours in isolated utterances using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1074-1080, Oct. 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 1074-1080
- Ljolje, A.¹ Fallside, F.²

21
- 0002585974
- Variable duration models for speech
- Princeton, NJ
- J. D. Ferguson, "Variable duration models for speech," in Proc. Symp. Applications of Hidden Markov Models to Text and Speech, Princeton, NJ, 1980, pp. 143-179.
- (1980) Proc. Symp. Applications of Hidden Markov Models to Text and Speech , pp. 143-179
- Ferguson, J.D.¹

22
- 0022685753
- Continuously variable duration hidden Markov models for automatic speech recognition
- S. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol. 1, pp. 29-45, 1986.
- (1986) Comput. Speech Lang. , vol.1 , pp. 29-45
- Levinson, S.¹

23
- 0022234383
- Explicit modeling of state occupancy in hidden Markov models for automatic speech recognition
- Tampa, FL, Mar.
- M. J. Russell and R. K. Moore, "Explicit modeling of state occupancy in hidden Markov models for automatic speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Tampa, FL, Mar. 1985, pp. 5-8.
- (1985) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 5-8
- Russell, M.J.¹ Moore, R.K.²

24
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.H.²

25
- 0015409613
- Emotions and speech: Some acoustical correlates
- C. E. Williams and K. N. Stevens, "Emotions and speech: Some acoustical correlates," J. Acoust. Soc. Amer., vol. 52, pp. 1238-1250, 1972.
- (1972) J. Acoust. Soc. Amer. , vol.52 , pp. 1238-1250
- Williams, C.E.¹ Stevens, K.N.²

26
- 0028630509
- Nonlinear analysis and detection of speech under stressed conditions
- D. A. Cairns and J. H. L. Hansen, "Nonlinear analysis and detection of speech under stressed conditions," J. Acoust. Soc. Amer., vol. 96, pp. 3392-2400, 1994.
- (1994) J. Acoust. Soc. Amer. , vol.96 , pp. 3392-12400
- Cairns, D.A.¹ Hansen, J.H.L.²

27
- 0030196359
- Feature analysis and neural network based classification of speech under stress
- July
- J. H. L. Hansen and B. D. Womack, "Feature analysis and neural network based classification of speech under stress," IEEE Trans. Speech Audio Processing, vol. 4, pp. 307-13, July 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 307-313
- Hansen, J.H.L.¹ Womack, B.D.²

28
- 0015476226
- Automatic speaker recognition based on pitch contours
- B. S. Atal, "Automatic speaker recognition based on pitch contours," J. Acoust. Soc. Amer., vol. 52, pp. 1687-1697, 1972.
- (1972) J. Acoust. Soc. Amer. , vol.52 , pp. 1687-1697
- Atal, B.S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.