메뉴 건너뛰기




Volumn 129, Issue 1, 2011, Pages 388-403

Effect of speech-intrinsic variations on human and automatic recognition of spoken phonemes

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC RECOGNITION; AUTOMATIC SPEECH RECOGNITION SYSTEM; ERROR RATE; HUMAN LISTENERS; HUMAN PERFORMANCE; HUMAN-MACHINE; INTRINSIC VARIABILITIES; NOISY ENVIRONMENT; PHONEME RECOGNITION; RECOGNITION PERFORMANCE; RECOGNITION RATES; SIGNAL TO NOISE; SPEAKING RATE; TEMPORAL CUES; TEMPORAL DYNAMICS;

EID: 79551679242     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3514525     Document Type: Article
Times cited : (39)

References (34)
  • 1
    • 34247568840 scopus 로고    scopus 로고
    • Modelling speaker intelligibility in noise
    • DOI 10.1016/j.specom.2006.11.003, PII S0167639306001701, Bridging the Gap between Human and Automatic Speech Recognition
    • Barker, J., and Cooke, M. (2007). Modelling speaker intelligibility in noise., Speech Commun. 49, 402-417. 10.1016/j.specom.2006.11.003 (Pubitemid 46670363)
    • (2007) Speech Communication , vol.49 , Issue.5 , pp. 402-417
    • Barker, J.1    Cooke, M.2
  • 3
    • 0027465489 scopus 로고
    • A model for context effects in speech recognition
    • , 10.1121/1.406844
    • Bronkhorst, A. W., Bosman, A. J., and Smoorenburg, G. F. (1993). A model for context effects in speech recognition., J. Acoust. Soc. Am. 93, 499-509. 10.1121/1.406844
    • (1993) J. Acoust. Soc. Am. , vol.93 , pp. 499-509
    • Bronkhorst, A.W.1    Bosman, A.J.2    Smoorenburg, G.F.3
  • 4
    • 33745213565 scopus 로고    scopus 로고
    • A speech similarity distance weighting for robust recognition
    • in, Lisbon, Portugal
    • Carey, M. J., and Quang, T. P. (2005). A speech similarity distance weighting for robust recognition., in Proceedings of Interspeech, Lisbon, Portugal, pp. 1257-1260.
    • (2005) Proceedings of Interspeech , pp. 1257-1260
    • Carey, M.J.1    Quang, T.P.2
  • 5
    • 33644661135 scopus 로고    scopus 로고
    • A glimpsing model of speech perception in noise
    • 10.1121/1.2166600
    • Cooke, M. (2005). A glimpsing model of speech perception in noise., J. Acoust. Soc. Am. 119, 1562-1573. 10.1121/1.2166600
    • (2005) J. Acoust. Soc. Am. , vol.119 , pp. 1562-1573
    • Cooke, M.1
  • 6
    • 70450178921 scopus 로고    scopus 로고
    • The Interspeech 2008 consonant challenge
    • in
    • Cooke, M., and Scharenborg, O. (2008). The Interspeech 2008 consonant challenge., in Proceedings of Interspeech, pp. 1781-1784.
    • (2008) Proceedings of Interspeech , pp. 1781-1784
    • Cooke, M.1    Scharenborg, O.2
  • 7
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • 10.1109/TASSP.1980.1163420
    • Davis, S., and Mermelstein, P. (1980). Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences., IEEE Trans. Acoust. Speech. Signal Process. 28, 357-366. 10.1109/TASSP.1980.1163420
    • (1980) IEEE Trans. Acoust. Speech. Signal Process. , vol.28 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 9
    • 0034920512 scopus 로고    scopus 로고
    • ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for hearing instrument assessment
    • , 10.3109/00206090109073110
    • Dreschler, W. A., Verschuure, H., Ludvigson, C, and Westermann, S. (2001). ICRA noises: Artificial noise signals with speech-like spectral and temporal properties for hearing instrument assessment., Int. J. Audiol. 40, 148-157. 10.3109/00206090109073110
    • (2001) Int. J. Audiol. , vol.40 , pp. 148-157
    • Dreschler, W.A.1    Verschuure, H.2    Ludvigson, C.3    Westermann, S.4
  • 11
  • 12
    • 0345443169 scopus 로고    scopus 로고
    • Roles and representations of systematic fine phonetic detail in speech understanding
    • DOI 10.1016/j.wocn.2003.09.006
    • Hawkins, S. (2003). Roles and representations of systematic fine phonetic detail in speech understanding., J. Phonetics 31, 373-405. 10.1016/j.wocn.2003. 09.006 (Pubitemid 37495914)
    • (2003) Journal of Phonetics , vol.31 , Issue.3-4 , pp. 373-405
    • Hawkins, S.1
  • 14
    • 0029060033 scopus 로고
    • Acoustic characteristics of American English vowels
    • , 10.1121/1.411872
    • Hillenbrand, J., Getty, L., Clark, M., and Wheeler, K. (1995). Acoustic characteristics of American English vowels., J. Acoust. Soc. Am. 97, 3099-3111. 10.1121/1.411872
    • (1995) J. Acoust. Soc. Am. , vol.97 , pp. 3099-3111
    • Hillenbrand, J.1    Getty, L.2    Clark, M.3    Wheeler, K.4
  • 15
    • 0027465491 scopus 로고
    • The Lombard reflex and its role on human listeners and automatic speech recognizers
    • 10.1121/1.405631
    • Junqua, J. -C. (1993). The Lombard reflex and its role on human listeners and automatic speech recognizers., J. Acoust. Soc. Am. 93, 510-524. 10.1121/1.405631
    • (1993) J. Acoust. Soc. Am. , vol.93 , pp. 510-524
    • Junqua, J.-C.1
  • 16
    • 79551666547 scopus 로고    scopus 로고
    • Predicting consonant recognition in quiet for listeners with normal hearing and hearing impairment using an auditory model (A)
    • Jrgens, T., Brand, T., and Kollmeier, B. (2009). Predicting consonant recognition in quiet for listeners with normal hearing and hearing impairment using an auditory model (A)., J. Acoust. Soc. Am. 125, 2533.
    • (2009) J. Acoust. Soc. Am. , vol.125 , pp. 2533
    • Jrgens, T.1    Brand, T.2    Kollmeier, B.3
  • 19
    • 0242358162 scopus 로고
    • The effects of speaking rate on the intelligibility of speech for various speaking modes (A)
    • 10.1121/1.413900
    • Krause, J. C., and Braida, L. D. (1995). The effects of speaking rate on the intelligibility of speech for various speaking modes (A)., J. Acoust. Soc. Am. 98, 2982. 10.1121/1.413900
    • (1995) J. Acoust. Soc. Am. , vol.98 , pp. 2982
    • Krause, J.C.1    Braida, L.D.2
  • 20
    • 1642499127 scopus 로고    scopus 로고
    • Acoustic properties of naturally produced clear speech at normal speaking rates
    • 10.1121/1.1635842
    • Krause, J. C., and Braida, L. D. (2003). Acoustic properties of naturally produced clear speech at normal speaking rates., J. Acoust. Soc. Am. 115, 362-378. 10.1121/1.1635842
    • (2003) J. Acoust. Soc. Am. , vol.115 , pp. 362-378
    • Krause, J.C.1    Braida, L.D.2
  • 22
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • 10.1016/S0167-6393(97)00021-6
    • Lippmann, R. (1997). Speech recognition by machines and humans., Speech Commun. 22, 1-15. 10.1016/S0167-6393(97)00021-6
    • (1997) Speech Commun. , vol.22 , pp. 1-15
    • Lippmann, R.1
  • 23
    • 56149102452 scopus 로고    scopus 로고
    • A human-machine comparison in speech recognition based on a logatome corpus
    • in
    • Meyer, B., and Wesker, T. (2006). A human-machine comparison in speech recognition based on a logatome corpus., in Workshop on Speech-Intrinsic Variation, pp. 95-100.
    • (2006) Workshop on Speech-Intrinsic Variation , pp. 95-100
    • Meyer, B.1    Wesker, T.2
  • 24
    • 84933250500 scopus 로고
    • The intelligibility of interrupted speech
    • 10.1121/1.1906584
    • Miller, G. A., and Licklider, J. (1950). The intelligibility of interrupted speech., J. Acoust. Soc. Am. 22, 167-173. 10.1121/1.1906584
    • (1950) J. Acoust. Soc. Am. , vol.22 , pp. 167-173
    • Miller, G.A.1    Licklider, J.2
  • 25
    • 84955023511 scopus 로고
    • An analysis of perceptual confusions among some English consonants
    • 10.1121/1.1907526
    • Miller, G. A., and Nicely, P. E. (1955). An analysis of perceptual confusions among some English consonants., J. Acoust. Soc. Am. 27, 338-352. 10.1121/1.1907526
    • (1955) J. Acoust. Soc. Am. , vol.27 , pp. 338-352
    • Miller, G.A.1    Nicely, P.E.2
  • 27
    • 34047247534 scopus 로고    scopus 로고
    • Consonant and vowel confusions in speech-weighted noise
    • DOI 10.1121/1.2642397
    • Phatak, S. A., and Allen, J. B. (2007). Consonant and vowel confusions in speech-weighted noise., J. Acoust. Soc. Am. 121, 2312-2326. 10.1121/1.2642397 (Pubitemid 46548430)
    • (2007) Journal of the Acoustical Society of America , vol.121 , Issue.4 , pp. 2312-2326
    • Phatak, S.A.1    Allen, J.B.2
  • 28
    • 34247580087 scopus 로고    scopus 로고
    • Reaching over the gap: A review of efforts to link human and automatic speech recognition research
    • DOI 10.1016/j.specom.2007.01.009, PII S0167639307000106, Bridging the Gap between Human and Automatic Speech Recognition
    • Scharenborg, O. (2007). Reaching over the gap: A review of efforts to link human and automatic speech recognition research., Speech Commun. 49, 336-347. 10.1016/j.specom.2007.01.009 (Pubitemid 46670364)
    • (2007) Speech Communication , vol.49 , Issue.5 , pp. 336-347
    • Scharenborg, O.1
  • 30
    • 84867215557 scopus 로고    scopus 로고
    • Two protocols comparing human and machine phonetic recognition performance in conversational speech
    • in
    • Shen, W., Olive, J., and Jones, D. (2008). Two protocols comparing human and machine phonetic recognition performance in conversational speech., in Proceedings of Interspeech, pp. 1630-1633.
    • (2008) Proceedings of Interspeech , pp. 1630-1633
    • Shen, W.1    Olive, J.2    Jones, D.3
  • 32
    • 15844428932 scopus 로고    scopus 로고
    • Human and machine consonant recognition
    • DOI 10.1016/j.specom.2004.11.009, PII S0167639304001499
    • Sroka, J. J., and Braida, L. D. (2005). Human and machine consonant recognition., Speech Commun. 45, 401-423. 10.1016/j.specom.2004.11.009 (Pubitemid 40423287)
    • (2005) Speech Communication , vol.45 , Issue.4 , pp. 401-423
    • Sroka, J.J.1    Braida, L.D.2
  • 33
    • 0002788784 scopus 로고    scopus 로고
    • Signal processing for robust speech recognition
    • in, edited by C. -H. Lee, F. K. Soong, and K. K. Paliwal (Springer, Berlin)
    • Stern, R., Acero, A., Liu, F. H., and Ohshima, Y. (1996). Signal processing for robust speech recognition., in Automatic Speech and Speaker Recognition, edited by, C. -H. Lee, F. K. Soong, and, K. K. Paliwal, (Springer, Berlin), pp. 357-384.
    • (1996) Automatic Speech and Speaker Recognition , pp. 357-384
    • Stern, R.1    Acero, A.2    Liu, F.H.3    Ohshima, Y.4
  • 34
    • 33745183789 scopus 로고    scopus 로고
    • Oldenburg Logatome Speech Corpus (OLLO) for speech recognition experiments with humans and machines
    • in
    • Wesker, T., Meyer, B., Wagener, K., Anemueller, J., Mertins, A., and Kollmeier, B. (2005). Oldenburg Logatome Speech Corpus (OLLO) for speech recognition experiments with humans and machines., in Proceedings of Interspeech, pp. 1273-1276.
    • (2005) Proceedings of Interspeech , pp. 1273-1276
    • Wesker, T.1    Meyer, B.2    Wagener, K.3    Anemueller, J.4    Mertins, A.5    Kollmeier, B.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.