메뉴 건너뛰기




Volumn 13, Issue 3, 2010, Pages 141-161

Speaker recognition under stressed condition

Author keywords

Speaker recognition; Stress compensation; Stressed speech

Indexed keywords

GAUSSIAN DISTRIBUTION; REFLECTION; SPEECH; SPEECH COMMUNICATION;

EID: 79952896080     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-010-9075-z     Document Type: Article
Times cited : (31)

References (54)
  • 1
    • 0016939145 scopus 로고
    • Automatic recognition of speakers from their voices
    • 10.1109/PROC.1976.10155
    • B. S. Atal 1976 Automatic recognition of speakers from their voices Proceedings of the IEEE 64 4 460 476 10.1109/PROC.1976.10155
    • (1976) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 460-476
    • Atal, B.S.1
  • 4
    • 0032069798 scopus 로고    scopus 로고
    • Stress perturbation of neutral speech for synthesis based on hidden Markov models
    • 10.1109/89.668815
    • S. E. Bou-Ghazale J. H. L. Hansen 1998 Stress perturbation of neutral speech for synthesis based on hidden Markov models IEEE Transactions on Speech and Audio Processing 6 201 216 10.1109/89.668815
    • (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , pp. 201-216
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 5
    • 0028630509 scopus 로고
    • Nonlinear analysis and classification of speech under stressed conditions
    • DOI 10.1121/1.410601
    • D. A. Cairns J. H. L. Hansen 1994 Nonlinear analysis and classification of speech under stressed conditions The Journal of the Acoustical Society of America 96 6 3392 3400 10.1121/1.410601 (Pubitemid 24376418)
    • (1994) Journal of the Acoustical Society of America , vol.96 , Issue.6 , pp. 3392-3400
    • Cairns, D.A.1    Hansen, J.H.L.2
  • 6
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • 10.1109/5.628714
    • J. Campbell 1997 Speaker recognition: A tutorial Proceedings of the IEEE 85 9 1437 1462 10.1109/5.628714
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.1
  • 7
    • 0036754056 scopus 로고    scopus 로고
    • Application of time-frequency principal component analysis to text-independent speaker identification
    • DOI 10.1109/TSA.2002.800557
    • I. M. Chagnolleau G. Durou 2002 Application of time-frequency principal component analysis to text-independent speaker identification IEEE Transactions on Speech and Audio Processing 10 6 371 378 10.1109/TSA.2002.800557 (Pubitemid 35311931)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.6 , pp. 371-378
    • Magrin-Chagnolleau, I.1    Durou, G.2    Bimbot, F.3
  • 9
    • 0028016265 scopus 로고
    • Measuring and modeling vocal source-tract interaction
    • DOI 10.1109/10.301733
    • D. G. Childers C. F. Wong 1994 Measuring and modeling vocal source-tract interaction IEEE Transactions on Biomedical Engineering 41 7 663 671 10.1109/10.301733 (Pubitemid 24299313)
    • (1994) IEEE Transactions on Biomedical Engineering , vol.41 , Issue.7 , pp. 663-671
    • Childers, D.G.1    Wong, C.-F.2
  • 10
    • 0033738539 scopus 로고    scopus 로고
    • The NIST speaker recognition evaluation-overview, methodology, systems, results, perspective
    • 10.1016/S0167-6393(99)00080-1
    • G. R. Doddington A. Martin M. A. Przybockin D. Reynolds 2000 The NIST speaker recognition evaluation-overview, methodology, systems, results, perspective Speech Communication 31 2-3 225 254 10.1016/S0167-6393(99)00080-1
    • (2000) Speech Communication , vol.31 , Issue.23 , pp. 225-254
    • Doddington, G.R.1    Martin, A.2    Przybockin, M.A.3    Reynolds, D.4
  • 11
    • 0033707063 scopus 로고    scopus 로고
    • Speech coding with an analysis-by-synthesis sinusoidal model
    • Etemoglu, C. O.; Cuperman, V.; & Gersho, A. (2000). Speech coding with an analysis-by-synthesis sinusoidal model. In ICASSP (Vol.3, pp. 1371-1374).
    • (2000) ICASSP , vol.3 , pp. 1371-1374
    • Etemoglu, C.O.1    Cuperman, V.2    Gersho, A.3
  • 12
    • 0029289458 scopus 로고
    • Estimation of amplitude and phase parameters of multicomponent signals
    • 10.1109/78.376844
    • B. Friedlander J. Francos 1995 Estimation of amplitude and phase parameters of multicomponent signals IEEE Transactions on Signal Processing 43 4 917 926 10.1109/78.376844
    • (1995) IEEE Transactions on Signal Processing , vol.43 , Issue.4 , pp. 917-926
    • Friedlander, B.1    Francos, J.2
  • 13
    • 0019583902 scopus 로고
    • Comparison of speaker recognition methods using statistical features and dynamic features
    • 10.1109/TASSP.1981.1163605
    • S. Furui 1981 Comparison of speaker recognition methods using statistical features and dynamic features IEEE Transactions on Acoustics, Speech, and Signal Processing 29 3 342 350 10.1109/TASSP.1981.1163605
    • (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.29 , Issue.3 , pp. 342-350
    • Furui, S.1
  • 14
    • 0031232722 scopus 로고    scopus 로고
    • Speech analysis/synthesis and modification using an analysis-by- synthesis/overlap-add sinusoidal model
    • 10.1109/89.622558
    • E. George M. Smith 1997 Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model IEEE Transactions on Speech and Audio Processing 5 5 389 406 10.1109/89.622558
    • (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.5 , pp. 389-406
    • George, E.1    Smith, M.2
  • 15
    • 0034229795 scopus 로고    scopus 로고
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • 10.1109/89.848224
    • S. Ghazale J. H. L. Hansen 2000 A comparative study of traditional and newly proposed features for recognition of speech under stress IEEE Transactions on Speech and Audio Processing 8 4 429 442 10.1109/89.848224
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.4 , pp. 429-442
    • Ghazale, S.1    Hansen, J.H.L.2
  • 16
    • 0030283741 scopus 로고    scopus 로고
    • Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    • 10.1016/S0167-6393(96)00050-7
    • J. H. L. Hansen 1996 Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition Speech Communication 20 151 173 10.1016/S0167-6393(96)00050-7
    • (1996) Speech Communication , vol.20 , pp. 151-173
    • Hansen, J.H.L.1
  • 17
    • 0029324926 scopus 로고
    • ICARUS: Source generator based real-time recognition of speech in noisy stressful and Lombard effect environments
    • 10.1016/0167-6393(95)00007-B
    • J. H. L. Hansen D. A. Cairns 1995 ICARUS: Source generator based real-time recognition of speech in noisy stressful and Lombard effect environments Speech Communication 16 4 391 422 10.1016/0167-6393(95)00007-B
    • (1995) Speech Communication , vol.16 , Issue.4 , pp. 391-422
    • Hansen, J.H.L.1    Cairns, D.A.2
  • 18
    • 0030196359 scopus 로고    scopus 로고
    • Feature analysis and neural network-based classification of speech under stress
    • 10.1109/89.506935
    • J. H. L. Hansen B. D. Womack 1996 Feature analysis and neural network-based classification of speech under stress IEEE Transactions on Speech and Audio Processing 4 307 313 10.1109/89.506935
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , pp. 307-313
    • Hansen, J.H.L.1    Womack, B.D.2
  • 21
    • 0020588276 scopus 로고
    • Further experiments in text-independent speaker recognition over communications channels
    • Hunt, M. (1983). Further experiments in text-independent speaker recognition over communications channels. In Proc. IEEE intern. conf. ASSP (pp. 563-566).
    • (1983) Proc. IEEE Intern. Conf. ASSP , pp. 563-566
    • Hunt, M.1
  • 22
    • 0031224204 scopus 로고    scopus 로고
    • A study of harmonic features for the speaker recognition
    • 10.1016/S0167-6393(97)00053-8
    • B. Imperl Z. Kačič B. Horvat 1997 A study of harmonic features for the speaker recognition Speech Communication 22 4 385 402 10.1016/S0167-6393(97)00053-8
    • (1997) Speech Communication , vol.22 , Issue.4 , pp. 385-402
    • Imperl, B.1    Kačič, Z.2    Horvat, B.3
  • 24
    • 0035472866 scopus 로고    scopus 로고
    • Speech enhancement using a constrained iterative sinusoidal model
    • DOI 10.1109/89.952491, PII S1063667601082360
    • J. Jensen J. H. L. Hansen 2001 Speech enhancement using a constrained iterative sinusoidal model IEEE Transactions on Speech and Audio Processing 9 7 731 740 10.1109/89.952491 (Pubitemid 32992837)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.7 , pp. 731-740
    • Jensen, J.1    Hansen, J.H.L.2
  • 25
    • 0035509915 scopus 로고    scopus 로고
    • A Bayesian approach to the verification problem: Applications to speaker verification
    • H. Jiang L. Deng 2001 A Bayesian approach to the verification problem: Applications to speaker verification IEEE Transactions on Speech Audio Process 9 8 883 884
    • (2001) IEEE Transactions on Speech Audio Process , vol.9 , Issue.8 , pp. 883-884
    • Jiang, H.1    Deng, L.2
  • 29
    • 77956518812 scopus 로고    scopus 로고
    • Speaker-specific mapping for text-independent speaker recognition
    • H. Misra S. Ikbal B. Yegnanarayana 2003 Speaker-specific mapping for text-independent speaker recognition Speech Communication 24 193 209
    • (2003) Speech Communication , vol.24 , pp. 193-209
    • Misra, H.1    Ikbal, S.2    Yegnanarayana, B.3
  • 30
    • 30444446629 scopus 로고    scopus 로고
    • Combining evidence from residual phase and MFCC features for speaker recognition
    • DOI 10.1109/LSP.2005.860538
    • K. S. R. Murty B. Yegnanarayana 2006 Combining evidence from residual phase and MFCC features for speaker recognition IEEE Signal Processing Letters 13 1 52 55 10.1109/LSP.2005.860538 (Pubitemid 43072461)
    • (2006) IEEE Signal Processing Letters , vol.13 , Issue.1 , pp. 52-55
    • Sri Rama Murty, K.1    Yegnanarayana, B.2
  • 31
    • 0032207163 scopus 로고    scopus 로고
    • An efficient scoring algorithm for gaussian mixture model based speaker identification
    • 10.1109/97.728467
    • B. L. Pellom J. Hansen 1998 An efficient scoring algorithm for gaussian mixture model based speaker identification IEEE Signal Processing Letters 5 11 281 284 10.1109/97.728467
    • (1998) IEEE Signal Processing Letters , vol.5 , Issue.11 , pp. 281-284
    • Pellom, B.L.1    Hansen, J.2
  • 32
    • 0027659197 scopus 로고
    • Signal modeling techniques in speech recognition
    • 10.1109/5.237532
    • J. Picone 1993 Signal modeling techniques in speech recognition Proceedings of the IEEE 81 9 1215 1247 10.1109/5.237532
    • (1993) Proceedings of the IEEE , vol.81 , Issue.9 , pp. 1215-1247
    • Picone, J.1
  • 38
    • 77956018749 scopus 로고    scopus 로고
    • Performance of selective speech features for speaker identification
    • Raja, G. S.; & Dandapat, S. (2008). Performance of selective speech features for speaker identification. IE(I) Journal-CP (pp. 38-46).
    • (2008) IE(I) Journal-CP , pp. 38-46
    • Raja, G.S.1    Dandapat, S.2
  • 39
    • 34047256081 scopus 로고    scopus 로고
    • Sinusoidal model-based analysis and classification of stressed speech
    • DOI 10.1109/TSA.2005.858071
    • S. Ramamohan S. Dandapat 2006 Sinusoidal model-based analysis and classification of stressed speech IEEE Transactions on Audio Speech and Language Processing 14 3 737 746 10.1109/TSA.2005.858071 (Pubitemid 46547638)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 737-746
    • Ramamohan, S.1    Dandapat, S.2
  • 41
    • 0029355999 scopus 로고
    • Speaker identification and verification using gaussian mixture speaker models
    • 10.1016/0167-6393(95)00009-D
    • D. Reynolds 1995 Speaker identification and verification using gaussian mixture speaker models Speech Communication 17 1-2 91 108 10.1016/0167-6393(95) 00009-D
    • (1995) Speech Communication , vol.17 , Issue.12 , pp. 91-108
    • Reynolds, D.1
  • 42
    • 85075924869 scopus 로고    scopus 로고
    • Comparison of background normalization methods for text-independent speaker verification
    • Rhodes, Greece, 966
    • Reynolds, D. (1997). Comparison of background normalization methods for text-independent speaker verification. In European conference on speech processing, Rhodes, Greece (pp. 963, 966).
    • (1997) European Conference on Speech Processing , pp. 963
    • Reynolds, D.1
  • 44
    • 0016939165 scopus 로고
    • Automatic speaker verification: A review
    • 10.1109/PROC.1976.10156
    • A. E. Rosenberg 1976 Automatic speaker verification: A review IEEE Proceedings of the IEEE 64 4 475 487 10.1109/PROC.1976.10156
    • (1976) IEEE Proceedings of the IEEE , vol.64 , Issue.4 , pp. 475-487
    • Rosenberg, A.E.1
  • 45
    • 0000592562 scopus 로고
    • Evaluation of a vector quantization talker recognition system in text independent and text dependent modes
    • 10.1016/0885-2308(87)90005-2
    • A. E. Rosenberg F. K. Soong 1987 Evaluation of a vector quantization talker recognition system in text independent and text dependent modes Computer Speech and Language 2 3-4 143 157 10.1016/0885-2308(87)90005-2
    • (1987) Computer Speech and Language , vol.2 , Issue.34 , pp. 143-157
    • Rosenberg, A.E.1    Soong, F.K.2
  • 46
    • 0033688848 scopus 로고    scopus 로고
    • High resolution speech feature parametrization for monophone-based stressed speech recognition
    • 10.1109/97.847363
    • R. Sarikaya J. H. L. Hansen 2000 High resolution speech feature parametrization for monophone-based stressed speech recognition IEEE Signal Processing Letters 7 7 182 185 10.1109/97.847363
    • (2000) IEEE Signal Processing Letters , vol.7 , Issue.7 , pp. 182-185
    • Sarikaya, R.1    Hansen, J.H.L.2
  • 47
    • 33745403627 scopus 로고    scopus 로고
    • Enhancing speaker identification performance under the shouted talking condition using second-order circular hidden Markov models
    • DOI 10.1016/j.specom.2006.01.005, PII S0167639306000082
    • I. Shahin 2006 Enhancing speaker identification performance under the shouted talking condition using second-order circular hidden Markov models Speech Communication 48 8 1047 1055 10.1016/j.specom.2006.01.005 (Pubitemid 43947274)
    • (2006) Speech Communication , vol.48 , Issue.8 , pp. 1047-1055
    • Shahin, I.1
  • 48
    • 0022794148 scopus 로고
    • Speaker recognition
    • 10.1109/MASSP.1986.1165388
    • D. O. Shaughnessy 1986 Speaker recognition IEEE ASSP Magazine 3 4 4 17 10.1109/MASSP.1986.1165388
    • (1986) IEEE ASSP Magazine , vol.3 , Issue.4 , pp. 4-17
    • Shaughnessy, D.O.1
  • 50
    • 85008050158 scopus 로고    scopus 로고
    • A simple and fast way of generating a harmonic signal
    • 10.1109/97.841155
    • Y. Stylianou 2000 A simple and fast way of generating a harmonic signal IEEE Signal Processing Letters 7 5 111 113 10.1109/97.841155
    • (2000) IEEE Signal Processing Letters , vol.7 , Issue.5 , pp. 111-113
    • Stylianou, Y.1
  • 51
    • 34047263010 scopus 로고    scopus 로고
    • Prosody conversion from neutral speech to emotional speech
    • DOI 10.1109/TASL.2006.876113
    • J. Tao Y. Kang A. Li 2006 Prosody conversion from neutral speech to emotional speech IEEE Transactions on Audio, Speech and Language Processing 14 1145 1154 10.1109/TASL.2006.876113 (Pubitemid 46547612)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1145-1153
    • Tao, J.1    Kang, Y.2    Li, A.3
  • 53
    • 22544440896 scopus 로고    scopus 로고
    • Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
    • DOI 10.1109/TSA.2005.848892
    • B. Yegnanarayana J. Zachariah S. R. M. Prasanna C. Gupta 2005 Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system IEEE Transactions on Speech and Audio Processing 13 4 575 582 10.1109/TSA.2005.848892 (Pubitemid 41013160)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.4 , pp. 575-582
    • Yegnanarayana, B.1    Prasanna, S.R.M.2    Zachariah, J.M.3    Gupta, C.S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.