메뉴 건너뛰기




Volumn 17, Issue 2, 2009, Pages 366-378

Analysis and compensation of lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition

Author keywords

Lombard effect; Speaker recognition and characterization; Speech analysis; Speech in noise; Speech under stress

Indexed keywords

ENERGY HISTOGRAM; EQUAL ERROR RATE; GAUSSIAN MIXTURE MODEL; LOMBARD EFFECT; NOISE LEVELS; NOISE TYPES; SPEAKER MODEL; SPEAKER RECOGNITION; SPEAKER RECOGNITION AND CHARACTERIZATION; SPECTRAL TILT; SPEECH DATA; SPEECH PRODUCTION; SPEECH SYSTEMS; TESTING CONDITIONS; TRAINING AND TESTING; UNDER MATCHED;

EID: 70350454918     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2009019     Document Type: Article
Times cited : (86)

References (32)
  • 1
    • 0000874053 scopus 로고
    • Le signe de l'elevation de la voix, annals maladiers oreille
    • E. Lombard, "Le signe de l'elevation de la voix, annals maladiers oreille, " Larynx, Nez, Pharynx, vol. 37, pp. 101-119, 1911.
    • (1911) Larynx, Nez, Pharynx , vol.37 , pp. 101-119
    • Lombard, E.1
  • 2
    • 70350505426 scopus 로고
    • Regulation of voice communication by sensory dynamics
    • H. L. Lane, B. Tranel, and C. Sisson, "Regulation of voice communication by sensory dynamics, " J. Acoust. Soc. Amer., vol. 32, pp. 451-454, 1970.
    • (1970) J. Acoust. Soc. Amer. , vol.32 , pp. 451-454
    • Lane, H.L.1    Tranel, B.2    Sisson, C.3
  • 3
    • 0001653589 scopus 로고
    • The lombard sign and the role of hearing in speech
    • H. L. Lane and B. Tranel, "The Lombard sign and the role of hearing in speech, " J. Speech Hear. Res., vol. 14, pp. 677-709, 1971.
    • (1971) J. Speech Hear. Res. , vol.14 , pp. 677-709
    • Lane, H.L.1    Tranel, B.2
  • 5
    • 85089273681 scopus 로고    scopus 로고
    • Getting started with susas: A speech under simulated and actual stress database
    • Rhodes, Greece, Sep.
    • J. H. L. Hansen and S. Bou-Ghazale, "Getting started with SUSAS: A speech under simulated and actual stress database, " in Proc. EUROSPEECH' 97, Rhodes, Greece, Sep. 1997, vol. 4, pp. 1743-1746.
    • (1997) Proc. EUROSPEECH' 97 , vol.4 , pp. 1743-1746
    • Hansen, J.H.L.1    Bou-Ghazale, S.2
  • 6
    • 0023849345 scopus 로고
    • Acoustic-phonetic analysis of loud and lombard speech in simulated cockpit conditions
    • B. J. Stanton, L. H. Jamieson, and G. D. Allen, "Acoustic-phonetic analysis of loud and Lombard speech in simulated cockpit conditions, " in Proc. ICASSP, 1988, pp. 331-334.
    • (1988) Proc. ICASSP , pp. 331-334
    • Stanton, B.J.1    Jamieson, L.H.2    Allen, G.D.3
  • 8
    • 0027465491 scopus 로고
    • The lombard reflex and its role on human listeners and automatic speech recognizers
    • Jan.
    • J. C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers, " J. Acoust. Soc. Amer., vol. 93, pp. 510-524, Jan. 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 510-524
    • Junqua, J.C.1
  • 9
    • 0000142633 scopus 로고
    • Effects of vocal force on the intelligibilty of speech sounds
    • Sep.
    • J. M. Pickett, "Effects of vocal force on the intelligibilty of speech sounds, " J. Acous. Soc. Amer., pp. 902-905, Sep. 1956.
    • (1956) J. Acous. Soc. Amer. , pp. 902-905
    • Pickett, J.M.1
  • 10
    • 0041538788 scopus 로고
    • Effects of ambient noise on speaker intelligibility for words and phrases
    • Dec.
    • J. Dreher and J. Neil, "Effects of ambient noise on speaker intelligibility for words and phrases, " J. Acous. Soc. Amer., pp. 1320-1323, Dec. 1957.
    • (1957) J. Acous. Soc. Amer. , pp. 1320-1323
    • Dreher, J.1    Neil, J.2
  • 13
    • 0034229795 scopus 로고    scopus 로고
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • Jul.
    • S. E. Bou-Ghazale and J. H. L. Hansen, "A comparative study of traditional and newly proposed features for recognition of speech under stress, " IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 429-442, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 14
    • 84877483026 scopus 로고    scopus 로고
    • Speech under stress conditions: Overview of the effect on speech production and on system performance
    • Phoenix, AZ, Mar. 1999
    • H. Steeneken and J. H. L. Hansen, "Speech under stress conditions: overview of the effect on speech production and on system performance, " in Proc. IEEE ICASSP'99, Phoenix, AZ, Mar. 1999, vol. 4, pp. 2079-2082.
    • Proc. IEEE ICASSP'99 , vol.4 , pp. 2079-2082
    • Steeneken, H.1    Hansen, J.H.L.2
  • 15
    • 0030283741 scopus 로고    scopus 로고
    • Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    • Nov.
    • J. H. L. Hansen, "Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition, " Speech Commun., Special Iss. Speech Under Stress, vol. 20, no. 2, pp. 151-170, Nov. 1996.
    • (1996) Speech Commun., Special Iss. Speech under Stress , vol.20 , Issue.2 , pp. 151-170
    • Hansen, J.H.L.1
  • 16
    • 0029376021 scopus 로고
    • Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress
    • Sep.
    • J. H. L. Hansen and M. Clements, "Source generator equalization and enhancement of spectral properties for robust speech recognition in noise and stress, " IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 407-415, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 407-415
    • Hansen, J.H.L.1    Clements, M.2
  • 17
    • 0028516405 scopus 로고
    • Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and lombard effect
    • Oct.
    • J. H. L. Hansen, "Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard Effect, " IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 598-614, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 598-614
    • Hansen, J.H.L.1
  • 18
    • 0023925221 scopus 로고
    • Cepstral domain talker stress compensation for robust speech recognition
    • Apr.
    • Y. Chen, "Cepstral domain talker stress compensation for robust speech recognition, " IEEE Trans. Acoust. Speech, Signal Process., vol. 36, no. 4, pp. 433-439, Apr. 1988.
    • (1988) IEEE Trans. Acoust. Speech, Signal Process. , vol.36 , Issue.4 , pp. 433-439
    • Chen, Y.1
  • 19
    • 0024928729 scopus 로고
    • Robust recognition of loud and lombard speech in the fighter cockpit environment
    • Glasgow, U.K. May
    • B. J. Stanton, L. H. Jamieson, and G. D. Allen, "Robust recognition of loud and Lombard speech in the fighter cockpit environment, " in Proc. IEEE ICASSP '89, Glasgow, U.K., May 1989, pp. 675-678.
    • (1989) Proc. IEEE ICASSP '89 , pp. 675-678
    • Stanton, B.J.1    Jamieson, L.H.2    Allen, G.D.3
  • 20
    • 70350472136 scopus 로고    scopus 로고
    • Variability of lombard effects under different noise conditions
    • Philadelphia, PA, Oct.
    • A. Wakao, K. Takeda, and F. Itakura, "Variability of Lombard effects under different noise conditions, " in Proc. ICSLP 96, Philadelphia, PA, Oct. 1996, pp. 418-421.
    • (1996) Proc. ICSLP 96 , pp. 418-421
    • Wakao, A.1    Takeda, K.2    Itakura, F.3
  • 21
    • 0029375589 scopus 로고
    • Robust speech recognition training via duration and spectral-based stress token generation
    • Sep.
    • S. E. Bou-Ghazale and J. H. L. Hansen, "Robust speech recognition training via duration and spectral-based stress token generation, " IEEE Trans. Speech Audio Process., vol. 3, pp. 415-421, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 415-421
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 22
    • 36248936922 scopus 로고    scopus 로고
    • UT-scope - a corpus for speech under cognitive/physical task stress and emotion
    • Genoa, Italy, May
    • V. Varadarajan, J. H. L. Hansen, and A. Ikeno, "UT-SCOPE - A corpus for speech under cognitive/physical task stress and emotion, " in Proc. LREC Workshop on Speech Under Emotion, Genoa, Italy, May 2006, pp. 72-75.
    • (2006) Proc. LREC Workshop on Speech under Emotion , pp. 72-75
    • Varadarajan, V.1    Hansen, J.H.L.2    Ikeno, A.3
  • 23
    • 44949221428 scopus 로고    scopus 로고
    • Analysis of the lombard effect under different types and levels of noise with application to in-set speaker ID systems
    • Sep.
    • V. Varadarajan and J. H. L. Hansen, "Analysis of the Lombard effect under different types and levels of noise with application to in-set speaker ID systems, " in Proc. Interspeech'06, Sep. 2006, pp. 937-940.
    • (2006) Proc. Interspeech'06 , pp. 937-940
    • Varadarajan, V.1    Hansen, J.H.L.2
  • 24
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models, " Digital Signal Process., vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Process. , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 25
    • 85009112385 scopus 로고    scopus 로고
    • Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification
    • Jeju Island, Korea, p. Thc1604p
    • P. Angkititrakul, J. H. L. Hansen, and S. Baghaii, "Cluster- dependent modeling and confidence measure processing for in-set/out-of-set speaker identification, " in Proc. Interspeech 2004/ICSLP 2004, Jeju Island, Korea, p. Thc1604p, 15(1-4).
    • Proc. Interspeech 2004/ICSLP 2004 , vol.15 , Issue.1-4
    • Angkititrakul, P.1    Hansen, J.H.L.2    Baghaii, S.3
  • 26
    • 85009207995 scopus 로고    scopus 로고
    • Score normalization applied to open-set, text-independent speaker identification
    • Geneva, Switzerland, Sep
    • P. Sivakumaran, J. Fortuna, and A. M. Ariyaeeinia, "Score normalization applied to open-set, text-independent speaker identification, " in Eurospeech'03, Geneva, Switzerland, Sep. 2003, pp. 2669-2672.
    • (2003) Eurospeech'03 , pp. 2669-2672
    • Sivakumaran, P.1    Fortuna, J.2    Ariyaeeinia, A.M.3
  • 27
    • 33947694367 scopus 로고    scopus 로고
    • Stress level classification of speech using euclidean distance metrics in a novel hybrid multi-dimensional feature space
    • Toulouse, France, May
    • E. Ruzanski, J. H. L. Hansen, J. Meyerhoff, G. Saviolakis, W. Norris, and T. Wollert, "Stress level classification of speech using Euclidean distance metrics in a novel hybrid multi-dimensional feature space, " in Proc. IEEE ICASSP, Toulouse, France, May 2006, vol. 1, pp. I-425-I-428.
    • (2006) Proc. IEEE ICASSP , vol.1
    • Ruzanski, E.1    Hansen, J.H.L.2    Meyerhoff, J.3    Saviolakis, G.4    Norris, W.5    Wollert, T.6
  • 28
  • 30
    • 0035278948 scopus 로고    scopus 로고
    • Nonlinear feature based classification of speech under stress
    • Mar.
    • G. Zhou, J. H. L. Hansen, and J. F. Kaiser, "Nonlinear feature based classification of speech under stress, " IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 201-216, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 201-216
    • Zhou, G.1    Hansen, J.H.L.2    Kaiser, J.F.3
  • 32
    • 70350470780 scopus 로고    scopus 로고
    • Susas-speech under simulated and actual stress database
    • Philadelphia, PA [Online]. Available
    • "SUSAS-Speech Under Simulated and Actual Stress database, " Linguistics Data Consortium (LDC), Philadelphia, PA [Online]. Available: http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC99S78.
    • Linguistics Data Consortium (LDC)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.