메뉴 건너뛰기




Volumn 15, Issue 3, 2012, Pages 295-311

TEO-based speaker stress assessment using hybrid classification and tracking schemes

Author keywords

FLETC Corpus; Stress assessment from speech; TEO operator

Indexed keywords

ACOUSTIC MODEL; ASSESSMENT SCHEME; AUDIO DATA; BINARY DECISION; BIOMETRIC DATA; CLASSIFICATION TASKS; CURRENT STRESS; EUCLIDEAN DISTANCE METRICS; FEATURE SPACE; FLETC CORPUS; HYBRID CLASSIFICATION; LANGUAGE RECOGNITION; NEAREST NEIGHBOR CLUSTERING; NORMALIZATION STRATEGIES; SENTENCE LEVEL; SPEAKER MODEL; SPEAKER RECOGNITION; SPEAKER VARIABILITY; SPEECH DATA; SPEECH PRODUCTION; SPEECH SYSTEMS; STRESS ASSESSMENT; STRESS LEVELS; STRESS-DEPENDENT; STRESS-INDUCED; TEO OPERATOR; TRACKING CAPABILITY; TRACKING SCHEME; TRAINING SCENARIO;

EID: 84864605091     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-012-9165-1     Document Type: Article
Times cited : (10)

References (36)
  • 1
    • 0001835850 scopus 로고
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • Universiteit Van Amsterdam
    • Boersma, P. (1993). Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. Proceedings-Instituut Voor Fonetische Wetenschappen, Universiteit Van Amsterdam, 17, 97-110.
    • (1993) Proceedings-Instituut voor Fonetische Wetenschappen , vol.17 , pp. 97-110
    • Boersma, P.1
  • 3
    • 77955734646 scopus 로고    scopus 로고
    • Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments
    • Boril, H., & Hansen, J. H. L. (2010). Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments. IEEE Transactions on Audio, Speech, and Language Processing, 18, 1379-1393.
    • (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , pp. 1379-1393
    • Boril, H.1    Hansen, J.H.L.2
  • 4
    • 79959827405 scopus 로고    scopus 로고
    • Analysis and detection of cognitive load and frustration in drivers' speech
    • Makuhari, Chiba, Japan
    • Boril, H., Sadjadi, O., Kleinschmidt, T., & Hansen, J. H. L. (2010). Analysis and detection of cognitive load and frustration in drivers' speech. In Interspeech'10, Makuhari, Chiba, Japan (pp. 502-505).
    • (2010) Interspeech'10 , pp. 502-505
    • Boril, H.1    Sadjadi, O.2    Kleinschmidt, T.3    Hansen, J.H.L.4
  • 5
    • 84897584246 scopus 로고    scopus 로고
    • Towards multi-modal driver's stress detection
    • New York: Springer J. Hansen, P. Boyraz, K. Takeda & H. Abut (Eds.)
    • Boril, H., Boyraz, P., & Hansen, J. H. L. (2012). Towards multi-modal driver's stress detection. In J. Hansen, P. Boyraz, K. Takeda & H. Abut (Eds.), Digital signal processing for in-vehicle systems and safety (pp. 3-20). New York: Springer.
    • (2012) Digital Signal Processing for In-vehicle Systems and Safety , pp. 3-20
    • Boril, H.1    Boyraz, P.2    Hansen, J.H.L.3
  • 6
    • 0032069798 scopus 로고    scopus 로고
    • HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress
    • PII S106366769802896X
    • Bou-Ghazale, S., & Hansen, J. (1998). HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress. IEEE Transactions on Speech and Audio Processing, 6, 201-216. (Pubitemid 128720647)
    • (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.3 , pp. 201-216
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 8
    • 0028630509 scopus 로고
    • Nonlinear analysis and classification of speech under stressed conditions
    • DOI 10.1121/1.410601
    • Cairns, D. A., & Hansen, J. H. L. (1994). Nonlinear analysis and classification of speech under stressed conditions. The Journal of the Acoustical Society of America, 96, 3392-3400. (Pubitemid 24376418)
    • (1994) Journal of the Acoustical Society of America , vol.96 , Issue.6 , pp. 3392-3400
    • Cairns, D.A.1    Hansen, J.H.L.2
  • 9
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis, S. B., & Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing, 28, 357-366. (Pubitemid 11464930)
    • (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis Steven, B.1    Mermelstein Paul2
  • 11
    • 0030283741 scopus 로고    scopus 로고
    • Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    • PII S0167639396000507
    • Hansen, J. H. L. (1996). Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition. Speech Communication, 20, 151-173. (Pubitemid 126371283)
    • (1996) Speech Communication , vol.20 , Issue.1-2 , pp. 151-173
    • Hansen, J.H.L.1
  • 12
    • 70350454918 scopus 로고    scopus 로고
    • Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition
    • Hansen, J., & Varadarajan, V. (2009). Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition. IEEE Transactions on Audio, Speech, and Language Processing, 17, 366-378.
    • (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , pp. 366-378
    • Hansen, J.1    Varadarajan, V.2
  • 13
    • 84864587169 scopus 로고    scopus 로고
    • Speech under stress and Lombard effect: Impact and solutions for forensic speaker recognition
    • H. Patil & A. Neustein (Eds.) New York: Springer
    • Hansen, J., Sangwan, A., & Kim, W. (2012). Speech under stress and Lombard effect: impact and solutions for forensic speaker recognition. In H. Patil & A. Neustein (Eds.), Forensic speaker recognition: law enforcement and counter-terrorism (pp. 103-123). New York: Springer.
    • (2012) Forensic Speaker Recognition: Law Enforcement and Counter-terrorism , pp. 103-123
    • Hansen, J.1    Sangwan, A.2    Kim, W.3
  • 16
    • 0345940399 scopus 로고
    • On Teager's energy algorithm and its generalization to continuous signals
    • Mohonk, NY
    • Kaiser, J. (1990b). On Teager's energy algorithm and its generalization to continuous signals. In Proc. 4th IEEE digital signal processing workshop, Mohonk, NY.
    • (1990) Proc. 4th IEEE Digital Signal Processing Workshop
    • Kaiser, J.1
  • 17
    • 0029879770 scopus 로고    scopus 로고
    • Stress- and treatment-induced elevations of cortisol levels associated with impaired declarative memory in healthy adults
    • DOI 10.1016/0024-3205(96)00118-X
    • Kirschbaum, C., Wolf, O. T., May, M., Wippich, W., & Hellhammer, D. H. (1996). Stress-and treatment-induced elevations of cortisol levels associated with impaired declarative memory in healthy adults. Life Sciences, 58, 1475-1483. (Pubitemid 26122725)
    • (1996) Life Sciences , vol.58 , Issue.17 , pp. 1475-1483
    • Kirschbaum, C.1    Wolf, O.T.2    May, M.3    Wippich, W.4    Hellhammer, D.H.5
  • 18
    • 0023902773 scopus 로고    scopus 로고
    • Psychological stress increases plasma levels of prolactin, cortisol and POMC-derived peptides in man
    • Meyerhoff, J. L., Oleshansky, M. A., & Mougey, E. H. (1998). Psychological stress increases plasma levels of prolactin, cortisol and POMC-derived peptides in man. Psychosomatische Medizin, 50, 295-303.
    • (1998) Psychosomatische Medizin , vol.50 , pp. 295-303
    • Meyerhoff, J.L.1    Oleshansky, M.A.2    Mougey, E.H.3
  • 21
    • 76849101651 scopus 로고    scopus 로고
    • The physiological microphone (PMIC): A competitive alternative for speaker assessment in stress detection and speaker verification
    • Patil, S. A., & Hansen, J. H. (2010). The physiological microphone (PMIC): a competitive alternative for speaker assessment in stress detection and speaker verification. Speech Communication, 327-340.
    • (2010) Speech Communication , pp. 327-340
    • Patil, S.A.1    Hansen, J.H.2
  • 22
    • 33947692126 scopus 로고    scopus 로고
    • Frequency band analysis for stress detection using a Teager energy operator-based feature
    • Denver, CO
    • Rahurkar, M., Hansen, J., Oleshansky, M., Meyerhoff, J., & Koenig, M. (2002). Frequency band analysis for stress detection using a Teager energy operator-based feature. In ICSLP-02, Denver, CO (pp. 2021-2024).
    • (2002) ICSLP-02 , pp. 2021-2024
    • Rahurkar, M.1    Hansen, J.2    Oleshansky, M.3    Meyerhoff, J.4    Koenig, M.5
  • 27
    • 0000665811 scopus 로고
    • Critical bands
    • V. Tobias (Ed.) New York: Academic Press
    • Scharf, B. (1970). Critical bands. In V. Tobias (Ed.), Foundation of modern auditory theory. New York: Academic Press.
    • (1970) Foundation of Modern Auditory Theory
    • Scharf, B.1
  • 30
    • 0001455934 scopus 로고
    • A robust algorithm for pitch tracking (RAPT)
    • W. B. Kleijn & K. K. Paliwal (Eds.) Amsterdam: Elsevier
    • Talkin, D. (1995). A robust algorithm for pitch tracking (RAPT). In W. B. Kleijn & K. K. Paliwal (Eds.), Speech coding and synthesis (pp. 495-518). Amsterdam: Elsevier.
    • (1995) Speech Coding and Synthesis , pp. 495-518
    • Talkin, D.1
  • 32
    • 0003236089 scopus 로고
    • Evidence for nonlinear production mechanisms in the vocal tract
    • Teager, H., & Teager, S. (1989). Evidence for nonlinear production mechanisms in the vocal tract. Speech Production and Speech Modelling, 55, 241-261.
    • (1989) Speech Production and Speech Modelling , vol.55 , pp. 241-261
    • Teager, H.1    Teager, S.2
  • 34
    • 2942564603 scopus 로고    scopus 로고
    • N-channel hidden Markov models for combined stress speech classification and recognition
    • Womack, B., & Hansen, J. (1999). N-channel hidden Markov models for combined stress speech classification and recognition. IEEE Transactions on Speech and Audio Processing, 7, 668-677.
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 668-677
    • Womack, B.1    Hansen, J.2
  • 36
    • 0035278948 scopus 로고    scopus 로고
    • Nonlinear feature based classification of speech under stress
    • DOI 10.1109/89.905995, PII S1063667601013232
    • Zhou, G., Hansen, J., & Kaiser, J. (2001). Nonlinear feature-based classification of speech under stress. IEEE Transactions on Speech and Audio Processing, 9, 201-216. (Pubitemid 32286594)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.3 , pp. 201-216
    • Zhou, G.1    Hansen, J.H.L.2    Kaiser, J.F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.