메뉴 건너뛰기




Volumn 4343 LNAI, Issue , 2007, Pages 108-137

Speech under stress: Analysis, modeling and recognition

Author keywords

Hidden Markov models; Lombard effect; Pitch contours; Robustness in speech recognition; Speech technology; Stress classification; Teager energy operator

Indexed keywords

ALGORITHMS; HIDDEN MARKOV MODELS; HUMAN COMPUTER INTERACTION; SPEECH RECOGNITION;

EID: 36248972553     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-74200-5_6     Document Type: Article
Times cited : (129)

References (89)
  • 1
    • 80053247759 scopus 로고    scopus 로고
    • Alm, C.O., Roth, D., Sproat, R.: Emotions from Text: Machine Learning for Textbased Emotion Prediction. In: Proceedings of HLT/EMNLP 05, Vancouver (2005)
    • Alm, C.O., Roth, D., Sproat, R.: Emotions from Text: Machine Learning for Textbased Emotion Prediction. In: Proceedings of HLT/EMNLP 05, Vancouver (2005)
  • 4
    • 36248989950 scopus 로고
    • Speech Variability Effects on Recognition Accuracy Associated With Concurrent Task Performance by Pilots
    • Technical report, Psycho-Linguistic Research Associates
    • Simpson, C.A.: Speech Variability Effects on Recognition Accuracy Associated With Concurrent Task Performance by Pilots. Technical report, Psycho-Linguistic Research Associates (1985)
    • (1985)
    • Simpson, C.A.1
  • 9
    • 0028630509 scopus 로고
    • Nonlinear Analysis and Detection of Speech under Stressed Conditions
    • Cairns, D.A., Hansen, J.H.L.: Nonlinear Analysis and Detection of Speech under Stressed Conditions. Journal of the Acoustic Society of America 96(6), 3392-3400 (1994)
    • (1994) Journal of the Acoustic Society of America , vol.96 , Issue.6 , pp. 3392-3400
    • Cairns, D.A.1    Hansen, J.H.L.2
  • 11
  • 13
    • 0029375589 scopus 로고
    • Duration and Spectral Based Stress Token Generation for Keyword Recognition under Hidden Markov Models. IEEE Transactions on Speech k
    • Hansen, J.H.L., Bou-Ghazale, S.E.: Duration and Spectral Based Stress Token Generation for Keyword Recognition under Hidden Markov Models. IEEE Transactions on Speech k Audio Processing 3(5), 415-421 (1995)
    • (1995) Audio Processing , vol.3 , Issue.5 , pp. 415-421
    • Hansen, J.H.L.1    Bou-Ghazale, S.E.2
  • 14
    • 0027465491 scopus 로고
    • The Lombard Reflex and its Role on Human Listeners and Automatic Speech Recognition
    • Junqua, J.C.: The Lombard Reflex and its Role on Human Listeners and Automatic Speech Recognition. Journal of the Acoustic Society of America 93(1), 510-524 (1993)
    • (1993) Journal of the Acoustic Society of America , vol.93 , Issue.1 , pp. 510-524
    • Junqua, J.C.1
  • 15
    • 0030291067 scopus 로고    scopus 로고
    • The Influence of Acoustics on Speech Production: A Noise-Induced Stress Phenomenon known as the Lombard Effect
    • Junqua, J.C.: The Influence of Acoustics on Speech Production: a Noise-Induced Stress Phenomenon known as the Lombard Effect. Speech Communication 20, 13-22 (1996)
    • (1996) Speech Communication , vol.20 , pp. 13-22
    • Junqua, J.C.1
  • 17
    • 36248989404 scopus 로고    scopus 로고
    • Hansen, J.H.L., Swail, C., South, A.J., Moore, R.K., Steeneken, H., Cupples, E.J., Anderson, T., Vloeberghs, C.R.A., Trancoso, I., Verlinde, P.: The Impact of Speech Under 'Stress' on Military Speech Technology. In: NATO RTO-TR-10, AC/323(IST)TP/5 IST/TG-01 (2000)
    • Hansen, J.H.L., Swail, C., South, A.J., Moore, R.K., Steeneken, H., Cupples, E.J., Anderson, T., Vloeberghs, C.R.A., Trancoso, I., Verlinde, P.: The Impact of Speech Under 'Stress' on Military Speech Technology. In: NATO RTO-TR-10, AC/323(IST)TP/5 IST/TG-01 (2000)
  • 18
    • 0030283484 scopus 로고    scopus 로고
    • Towards a Definition and Working Model of Stress and its Effects on Speech
    • Murray, I.R., Baber, C., South, A.: Towards a Definition and Working Model of Stress and its Effects on Speech Speech Communication 20, 3-12 (1996)
    • (1996) Speech Communication , vol.20 , pp. 3-12
    • Murray, I.R.1    Baber, C.2    South, A.3
  • 21
    • 0024914051 scopus 로고
    • Evaluation of Acoustic Correlates of Speech Under Stress for Robust Speech Recognition
    • Boston, pp
    • Hansen, J.H.L.: Evaluation of Acoustic Correlates of Speech Under Stress for Robust Speech Recognition. In: IEEE Proceedings of the 15th Northeast Bioengineering Conference, Boston, pp. 31-32 (1989)
    • (1989) IEEE Proceedings of the 15th Northeast Bioengineering Conference , pp. 31-32
    • Hansen, J.H.L.1
  • 22
    • 0023246158 scopus 로고    scopus 로고
    • Paul, D.B.: A Speaker-Stress Resistant HMM Isolated Word Recognizer. In: Proceedings of the 12th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '87), Dallas, pp. 713-716 (1987)
    • Paul, D.B.: A Speaker-Stress Resistant HMM Isolated Word Recognizer. In: Proceedings of the 12th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '87), Dallas, pp. 713-716 (1987)
  • 25
    • 0030283741 scopus 로고    scopus 로고
    • Analysis and Compensation of Speech under Stress and Noise for Environmental Robustness in Speech Recognition. Speech Communications
    • Hansen, J.H.L.: Analysis and Compensation of Speech under Stress and Noise for Environmental Robustness in Speech Recognition. Speech Communications, Special Issue on Speech Under Stress 20(2), 151-170 (1996)
    • (1996) Special Issue on Speech Under Stress , vol.20 , Issue.2 , pp. 151-170
    • Hansen, J.H.L.1
  • 29
    • 0029324926 scopus 로고
    • ICARUS: Source Generator based Real-Time Recognition of Speech in Noisy Stressful and Lombard Effect Environments
    • Hansen, J.H.L., Cairns, D.A.: ICARUS: Source Generator based Real-Time Recognition of Speech in Noisy Stressful and Lombard Effect Environments. Speech Communications 16(4), 391-422 (1995)
    • (1995) Speech Communications , vol.16 , Issue.4 , pp. 391-422
    • Hansen, J.H.L.1    Cairns, D.A.2
  • 30
    • 0030196359 scopus 로고    scopus 로고
    • Feature Analysis and Neural Network based Classification of Speech under Stress
    • Hansen, J.H.L., Womack, B.: Feature Analysis and Neural Network based Classification of Speech under Stress. IEEE Transactions on Speech & Audio Processing 4(4), 307-313 (1996)
    • (1996) IEEE Transactions on Speech & Audio Processing , vol.4 , Issue.4 , pp. 307-313
    • Hansen, J.H.L.1    Womack, B.2
  • 31
    • 0030283946 scopus 로고    scopus 로고
    • Classification of Speech Under Stress using Target Driven Features. Speech Communication
    • Womack, B.D., Hansen, J.H.L.: Classification of Speech Under Stress using Target Driven Features. Speech Communication, Special Issue on Speech Under Stress 20(1), 131-150 (1996)
    • (1996) Special Issue on Speech Under Stress , vol.20 , Issue.1 , pp. 131-150
    • Womack, B.D.1    Hansen, J.H.L.2
  • 33
    • 0028516405 scopus 로고
    • Morphological Constrained Enhancement with Adaptive Cepstral Compensation (MCE-ACC) for Speech Recognition in Noise and Lombard Effect
    • Hansen, J.H.L.: Morphological Constrained Enhancement with Adaptive Cepstral Compensation (MCE-ACC) for Speech Recognition in Noise and Lombard Effect. IEEE Transactions on Speech & Audio Proc (SPECIAL ISSUE: Robust Speech Recognition) 2(4), 598-614 (1994)
    • (1994) IEEE Transactions on Speech & Audio Proc (SPECIAL ISSUE: Robust Speech Recognition) , vol.2 , Issue.4 , pp. 598-614
    • Hansen, J.H.L.1
  • 36
    • 0034229795 scopus 로고    scopus 로고
    • A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress
    • Bou-Ghazale, S.E., Hansen, J.H.L.: A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress. IEEE Transactions on Speech & Audio Processing 8(4), 429-442 (2000)
    • (2000) IEEE Transactions on Speech & Audio Processing , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 37
    • 0026135903 scopus 로고
    • Constrained Iterative Speech Enhancement with Application to Speech Recognition
    • Hansen, J.H.L., Clements, M.A.: Constrained Iterative Speech Enhancement with Application to Speech Recognition. IEEE Transactions on Signal Processing 39(4), 795-805 (1991)
    • (1991) IEEE Transactions on Signal Processing , vol.39 , Issue.4 , pp. 795-805
    • Hansen, J.H.L.1    Clements, M.A.2
  • 38
    • 0026984766 scopus 로고
    • Feature Enhancement for Multi-layer Perceptron and Semi-Continuous Hidden Markov Model Based Classifiers using Neural Networks
    • Clary, G., Hansen, J.H.L.: Feature Enhancement for Multi-layer Perceptron and Semi-Continuous Hidden Markov Model Based Classifiers using Neural Networks. In: Neural and Stochastic Methods in Image and Signal Processing, Proceedings of the SPIE, vol. 1766, pp. 529-540 (1992)
    • (1992) Neural and Stochastic Methods in Image and Signal Processing, Proceedings of the SPIE , vol.1766 , pp. 529-540
    • Clary, G.1    Hansen, J.H.L.2
  • 39
    • 79955042828 scopus 로고
    • A Comparison between Decision Accuracy Rates obtained using the Polygraph Instrument and Computer Voice Stress Analyzer (CVSA) in the absence of Jeopardy
    • Technical report, DOD Polygraph Inst
    • Cestaro, V.L.: A Comparison between Decision Accuracy Rates obtained using the Polygraph Instrument and Computer Voice Stress Analyzer (CVSA) in the absence of Jeopardy. Technical report, DOD Polygraph Inst. (1995)
    • (1995)
    • Cestaro, V.L.1
  • 44
    • 2942564603 scopus 로고    scopus 로고
    • N-Channel Hidden Markov Models for Combined Stress Speech Classification and Recognition
    • Womack, B.D., Hansen, J.H.L.: N-Channel Hidden Markov Models for Combined Stress Speech Classification and Recognition. IEEE Transactions on Speech and Audio Processing 7(6), 668-677 (1999)
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , Issue.6 , pp. 668-677
    • Womack, B.D.1    Hansen, J.H.L.2
  • 45
    • 0001059592 scopus 로고
    • Some Observations on Vocal Tract Operation from a Fluid Flow Point of View
    • Titze, I.R, Scherer, R.C, eds, Denver Center for the Performing Arts, Denver, pp
    • Kaiser, J.F.: Some Observations on Vocal Tract Operation from a Fluid Flow Point of View. In: Titze, I.R., Scherer, R.C. (eds.) Vocal Fold Physiology: Biomechanics, Acoustics, and Phonatory Control. Denver Center for the Performing Arts, Denver, pp. 358-386 (1983)
    • (1983) Vocal Fold Physiology: Biomechanics, Acoustics, and Phonatory Control , pp. 358-386
    • Kaiser, J.F.1
  • 47
    • 0005506875 scopus 로고
    • A Phenomenological Model for Vowel Production in the Vocal Tract
    • Teager, H.M., Teager, S.M.: A Phenomenological Model for Vowel Production in the Vocal Tract. In: Speech Science: Recent Advances, pp. 72-100 (1982)
    • (1982) Speech Science: Recent Advances , pp. 72-100
    • Teager, H.M.1    Teager, S.M.2
  • 48
    • 36249025434 scopus 로고    scopus 로고
    • Teager, H.M., Teager, S.: Evidence for Nonlinear Production Mechanisms in the Vocal Tract. In: NATO Advanced Study Inst. On Speech Production and Speech Modeling, Bonas, France, 55, pp. 241-261. Kluwer Academic Publishers, Boston (1989)
    • Teager, H.M., Teager, S.: Evidence for Nonlinear Production Mechanisms in the Vocal Tract. In: NATO Advanced Study Inst. On Speech Production and Speech Modeling, Bonas, France, vol. 55, pp. 241-261. Kluwer Academic Publishers, Boston (1989)
  • 49
    • 85011603315 scopus 로고
    • A Finite Element Model of Fluid Flow in the Vocal Tract
    • Thomas, T.J.: A Finite Element Model of Fluid Flow in the Vocal Tract. Computer Speech Language 1, 131-151 (1986)
    • (1986) Computer Speech Language , vol.1 , pp. 131-151
    • Thomas, T.J.1
  • 50
    • 0032030556 scopus 로고    scopus 로고
    • A Nonlinear based Speech Feature Analysis Method with Application to Vocal Fold Pathology Assessment
    • Hansen, J.H.L., Gavidia-Ceballos, L., Kaiser, J.F.: A Nonlinear based Speech Feature Analysis Method with Application to Vocal Fold Pathology Assessment. IEEE Transactions on Biomedical Engineering 45(3), 300-313 (1998)
    • (1998) IEEE Transactions on Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 55
    • 0032069798 scopus 로고    scopus 로고
    • Stress Perturbation of Neutral Speech for Synthesis based on Hidden Markov Models
    • Bou-Ghazale, S.E., Hansen, J.H.L.: Stress Perturbation of Neutral Speech for Synthesis based on Hidden Markov Models. IEEE Transactions on Speech & Audio Processing 6(3), 201-216 (1998)
    • (1998) IEEE Transactions on Speech & Audio Processing , vol.6 , Issue.3 , pp. 201-216
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 56
  • 58
    • 0029325035 scopus 로고
    • Implementation and Testing of a System for Producing Emotion-by-Rule in Synthetic Speech
    • Murray, I.R., Arnott, J.L.: Implementation and Testing of a System for Producing Emotion-by-Rule in Synthetic Speech. Speech Communication 16, 369-390 (1995)
    • (1995) Speech Communication , vol.16 , pp. 369-390
    • Murray, I.R.1    Arnott, J.L.2
  • 60
    • 36248959511 scopus 로고    scopus 로고
    • Multilingual Speech Synthesis
    • Schultz, T, Kirchhoff, K, eds, Elsevier, Academic Press
    • Black, A.: Multilingual Speech Synthesis. In: Schultz, T., Kirchhoff, K. (eds.) Multilingual Speech Processing. Elsevier, Academic Press (2006)
    • (2006) Multilingual Speech Processing
    • Black, A.1
  • 61
    • 0036472515 scopus 로고    scopus 로고
    • Computers that Recognize and Respond to User Emotion: Theoretical and Practical Implications
    • Picard, R.W., Klein, J.: Computers that Recognize and Respond to User Emotion: Theoretical and Practical Implications. Interacting with Computers 14(2), 141-169 (2002)
    • (2002) Interacting with Computers , vol.14 , Issue.2 , pp. 141-169
    • Picard, R.W.1    Klein, J.2
  • 62
    • 36248988291 scopus 로고    scopus 로고
    • Sproat, R. (ed.): Multilingual Text-to-Speech Synthesis: The Bell Labs Approach. Kluwer Academic Publishers, Boston (1997)
    • Sproat, R. (ed.): Multilingual Text-to-Speech Synthesis: The Bell Labs Approach. Kluwer Academic Publishers, Boston (1997)
  • 64
    • 36249012962 scopus 로고
    • Speech and its Potential for Stress Monitoring: Monitoring Vital Signs in the Divers
    • Technical report, Naval Medical Research Institute
    • Bachrach, A.J.: Speech and its Potential for Stress Monitoring: Monitoring Vital Signs in the Divers. Technical report, Naval Medical Research Institute (1979)
    • (1979)
    • Bachrach, A.J.1
  • 65
    • 0023925221 scopus 로고
    • Cepstral Domain Talker Stress Compensation for Robust Speech Recognition
    • Chen, Y.: Cepstral Domain Talker Stress Compensation for Robust Speech Recognition. IEEE Transactions on Acoustic Speech Signal Process. 36, 433-439 (1988)
    • (1988) IEEE Transactions on Acoustic Speech Signal Process , vol.36 , pp. 433-439
    • Chen, Y.1
  • 67
    • 30244533875 scopus 로고
    • Medical Research Committee, London
    • Flack, M.: Flying Stress. Medical Research Committee, London (1918)
    • (1918) Flying Stress
    • Flack, M.1
  • 68
    • 0342663341 scopus 로고
    • Analysis and Compensation of Noisy Stressful Speech for Environmental Robustness in Speech Recognition (invited tutorial)
    • Lisbon, Portugal, pp
    • Hansen, J.H.L.: Analysis and Compensation of Noisy Stressful Speech for Environmental Robustness in Speech Recognition (invited tutorial). In: NATO-ESCA Proc. Inter. Tutorial & Research Workshop on Speech Under Stress, Lisbon, Portugal, pp. 91-98 (1995)
    • (1995) NATO-ESCA Proc. Inter. Tutorial & Research Workshop on Speech Under Stress , pp. 91-98
    • Hansen, J.H.L.1
  • 71
    • 0032030556 scopus 로고    scopus 로고
    • A Nonlinear based Speech Feature Analysis Method with Application to Vocal Fold Pathology Assessment
    • Hansen, J.H.L., Gavidia-Ceballos, L., Kaiser, J.F.: A Nonlinear based Speech Feature Analysis Method with Application to Vocal Fold Pathology Assessment. IEEE Transactions on Biomedical Engineering 45(3), 300-313 (1998)
    • (1998) IEEE Transactions on Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 78
    • 36248947781 scopus 로고    scopus 로고
    • Malkin, F.J., Christ, K.A.: Human Factors Engineering Assessment of Voice Technology for the Light Helicopter Family. Technical Report 1-20, U. S. Armu Human Engineering Lab. (June 1985)
    • Malkin, F.J., Christ, K.A.: Human Factors Engineering Assessment of Voice Technology for the Light Helicopter Family. Technical Report 1-20, U. S. Armu Human Engineering Lab. (June 1985)
  • 79
    • 0027576027 scopus 로고    scopus 로고
    • Maragos, P., Kaiser, J.F., Quatieri, T.F.: On Amplitude and Frequency Demodulation using Energy Operators. IEEE Transactions on Signal Processing 41, 15321550 (1993)
    • Maragos, P., Kaiser, J.F., Quatieri, T.F.: On Amplitude and Frequency Demodulation using Energy Operators. IEEE Transactions on Signal Processing 41, 15321550 (1993)
  • 80
    • 15844429048 scopus 로고
    • Effect of Operator Mental Loading on Voice Recognition System Performance
    • Technical report, Naval Postgraduate School
    • Poock, G.K., Armstrong, J.W.: Effect of Operator Mental Loading on Voice Recognition System Performance. Technical report, Naval Postgraduate School (1981)
    • (1981)
    • Poock, G.K.1    Armstrong, J.W.2
  • 81
    • 15844415367 scopus 로고
    • Effect of Task Duration on Voice Recognition System Performance
    • Technical report, Naval Postgraduate School September
    • Poock, G.K., Armstrong, J.W.: Effect of Task Duration on Voice Recognition System Performance. Technical report, Naval Postgraduate School (September 1981)
    • (1981)
    • Poock, G.K.1    Armstrong, J.W.2
  • 83
    • 0017325694 scopus 로고
    • Analysis of the Human Voice as a Method of Controlling Emotional State: Achievements and Goals
    • Simonov, P.V., Frolov, M.V.: Analysis of the Human Voice as a Method of Controlling Emotional State: Achievements and Goals. Aviation, Space, and Environmental Sciences 23-25 (1977)
    • (1977) Aviation, Space, and Environmental Sciences , pp. 23-25
    • Simonov, P.V.1    Frolov, M.V.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.