메뉴 건너뛰기




Volumn 128, Issue 5, 2010, Pages 3126-3141

Human phoneme recognition depending on speech-intrinsic variability

Author keywords

[No Author keywords available]

Indexed keywords

ARTICULATORY FEATURES; AUTOMATIC SPEECH RECOGNIZERS; HUMAN SPEECH PERCEPTION; INTRINSIC VARIABILITIES; LONG-TERM SPECTRUM; MASKING NOISE; PHONEME RECOGNITION; RECOGNITION RATES; SPEAKING RATE; SPECTRAL LEVELS; SPEECH DATABASE; SPEECH SEGMENTS;

EID: 78649601853     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.3493450     Document Type: Article
Times cited : (39)

References (49)
  • 1
    • 0028516073 scopus 로고
    • How do human process and recognize speech?
    • 1063-6676,. 10.1109/89.326615
    • Allen, J. B. (1994). " How do human process and recognize speech?," IEEE Trans. Speech Audio Process. 1063-6676 2, 567-577. 10.1109/89.326615
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 567-577
    • Allen, J.B.1
  • 2
    • 34247568840 scopus 로고    scopus 로고
    • Modelling speaker intelligibility in noise
    • 0167-6393,. 10.1016/j.specom.2006.11.003
    • Barker, J., and Cooke, M. (2007). " Modelling speaker intelligibility in noise.," Speech Commun. 0167-6393 49, 402-417. 10.1016/j.specom.2006.11.003
    • (2007) Speech Commun. , vol.49 , pp. 402-417
    • Barker, J.1    Cooke, M.2
  • 3
    • 0027465489 scopus 로고
    • A model for context effects in speech recognition
    • " 0001-4966,. 10.1121/1.406844
    • Bronkhorst, A. W., Bosman, A. J., and Smoorenburg, G. G. (1993). " A model for context effects in speech recognition.," J. Acoust. Soc. Am. 0001-4966 93, 499-509. 10.1121/1.406844
    • (1993) J. Acoust. Soc. Am. , vol.93 , pp. 499-509
    • Bronkhorst, A.W.1    Bosman, A.J.2    Smoorenburg, G.G.3
  • 4
    • 26444619785 scopus 로고    scopus 로고
    • An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language
    • " 0167-6393,. 10.1016/j.specom.2005.01.006
    • Chang, S., Wester, M., and Greenberg, S. (2005). " An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language.," Speech Commun. 0167-6393 47, 290-311. 10.1016/j.specom.2005.01.006
    • (2005) Speech Commun. , vol.47 , pp. 290-311
    • Chang, S.1    Wester, M.2    Greenberg, S.3
  • 6
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and uncertain acoustic data
    • " 0167-6393,. 10.1016/S0167-6393(00)00034-0
    • Cooke, M. P., Green, P. D., Josifovski, L. B., and Vizinho, A. (2001). " Robust automatic speech recognition with missing and uncertain acoustic data.," Speech Commun. 0167-6393 34, 267-285. 10.1016/S0167-6393(00)00034-0
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.D.2    Josifovski, L.B.3    Vizinho, A.4
  • 7
    • 0034920512 scopus 로고    scopus 로고
    • ICRA noises: Artificial noise signals with speechlike spectral and temporal properties for hearing instrument assessment
    • " 0020-6091,. 10.3109/00206090109073110
    • Dreschler, W. A., Ludvigson, C., and Westermann, S. (2001). " ICRA noises: Artificial noise signals with speechlike spectral and temporal properties for hearing instrument assessment.," Audiology 0020-6091 40, 148-157. 10.3109/00206090109073110
    • (2001) Audiology , vol.40 , pp. 148-157
    • Dreschler, W.A.1    Ludvigson, C.2    Westermann, S.3
  • 8
    • 0019353058 scopus 로고
    • Predicting consonant confusions from acoustic analysis
    • 0001-4966,. 10.1121/1.385345
    • Dubno, J. R., and Levitt, H. (1981). " Predicting consonant confusions from acoustic analysis.," J. Acoust. Soc. Am. 0001-4966 69, 249-261. 10.1121/1.385345
    • (1981) J. Acoust. Soc. Am. , vol.69 , pp. 249-261
    • Dubno, J.R.1    Levitt, H.2
  • 9
    • 34547941599 scopus 로고    scopus 로고
    • Automatic speech recognition and speech variability: A review
    • " 0167-6393,. 10.1016/j.specom.2007.02.006
    • Fissore, L., Mertins, A., Ris, A., Rose, R., Tyagi, V., and Wellekens, C., (2007). " Automatic speech recognition and speech variability: A review.," Speech Commun. 0167-6393 49, 763-786. 10.1016/j.specom.2007.02. 006
    • (2007) Speech Commun. , vol.49 , pp. 763-786
    • Fissore, L.1    Mertins, A.2    Ris, A.3    Rose, R.4    Tyagi, V.5    Wellekens, C.6
  • 10
    • 0038188722 scopus 로고    scopus 로고
    • Interaction between the native and second language phonetic subsystems
    • " 0167-6393,. 10.1016/S0167-6393(02)00128-0
    • Flege, J. E., Schirru, C., and MacKay, I. R. A. (2003). " Interaction between the native and second language phonetic subsystems.," Speech Commun. 0167-6393 40, 467-491. 10.1016/S0167-6393(02)00128-0
    • (2003) Speech Commun. , vol.40 , pp. 467-491
    • Flege, J.E.1    Schirru, C.2    MacKay, I.R.A.3
  • 11
    • 0033321442 scopus 로고    scopus 로고
    • Effects of speaking rate and word frequency on conversational pronunciations
    • 0167-6393,. 10.1016/S0167-6393(99)00035-7
    • Fosler-Lussier, E., and Morgan, N. (1999). " Effects of speaking rate and word frequency on conversational pronunciations.," Speech Commun. 0167-6393 29, 137-158. 10.1016/S0167-6393(99)00035-7
    • (1999) Speech Commun. , vol.29 , pp. 137-158
    • Fosler-Lussier, E.1    Morgan, N.2
  • 12
    • 84953657538 scopus 로고
    • Factors governing the intelligibility of speech sounds
    • 0001-4966,. 10.1121/1.1916407
    • French, N. R., and Steinberg, J. C. (1947). " Factors governing the intelligibility of speech sounds.," J. Acoust. Soc. Am. 0001-4966 19, 90-119. 10.1121/1.1916407
    • (1947) J. Acoust. Soc. Am. , vol.19 , pp. 90-119
    • French, N.R.1    Steinberg, J.C.2
  • 13
    • 0034902190 scopus 로고    scopus 로고
    • Speech recognition in noise as a function of the number of spectral channels: Comparison of acoustic hearing and cochlear implants
    • " 0001-4966,. 10.1121/1.1381538
    • Friesen, L. M., Shannon, R. V., Baskent, D., and Wang, X. (2001). " Speech recognition in noise as a function of the number of spectral channels: Comparison of acoustic hearing and cochlear implants.," J. Acoust. Soc. Am. 0001-4966 110, 1150-1163. 10.1121/1.1381538
    • (2001) J. Acoust. Soc. Am. , vol.110 , pp. 1150-1163
    • Friesen, L.M.1    Shannon, R.V.2    Baskent, D.3    Wang, X.4
  • 14
    • 0022348813 scopus 로고
    • Consonant recognition in quiet as a function of aging among normal hearing subjects
    • " 0001-4966,. 10.1121/1.392888
    • Gelfand, S., Piper, N., and Silman, S. (1985). " Consonant recognition in quiet as a function of aging among normal hearing subjects.," J. Acoust. Soc. Am. 0001-4966 78, 1198-1206. 10.1121/1.392888
    • (1985) J. Acoust. Soc. Am. , vol.78 , pp. 1198-1206
    • Gelfand, S.1    Piper, N.2    Silman, S.3
  • 15
    • 0029816778 scopus 로고    scopus 로고
    • Evaluating the articulation index for auditory-visual consonant recognition
    • 0001-4966,. 10.1121/1.417950
    • Grant, K. W., and Walden, B. E. (1996). " Evaluating the articulation index for auditory-visual consonant recognition.," J. Acoust. Soc. Am. 0001-4966 100, 2415-2424. 10.1121/1.417950
    • (1996) J. Acoust. Soc. Am. , vol.100 , pp. 2415-2424
    • Grant, K.W.1    Walden, B.E.2
  • 16
    • 9644287935 scopus 로고    scopus 로고
    • Acoustic-phonetic correlates of talker intelligibility for adults and children
    • 0001-4966,. 10.1121/1.1806826
    • Hazan, V., and Markham, D. (2004). " Acoustic-phonetic correlates of talker intelligibility for adults and children.," J. Acoust. Soc. Am. 0001-4966 116, 3108-3118. 10.1121/1.1806826
    • (2004) J. Acoust. Soc. Am. , vol.116 , pp. 3108-3118
    • Hazan, V.1    Markham, D.2
  • 17
    • 0028517164 scopus 로고
    • RASTA processing of speech
    • 1063-6676,. 10.1109/89.326616
    • Hermansky, H., and Morgan, H. (1994). " RASTA processing of speech.," IEEE Trans. Speech Audio Process. 1063-6676 2, 578-589. 10.1109/89.326616
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 578-589
    • Hermansky, H.1    Morgan, H.2
  • 24
    • 0030829791 scopus 로고    scopus 로고
    • Development and evaluation of a German sentence test for objective and subjective speech intelligibility assessment
    • " 0001-4966,. 10.1121/1.419624
    • Kollmeier, B., Kliem, K., and Wesselkamp, M. (1997). " Development and evaluation of a German sentence test for objective and subjective speech intelligibility assessment.," J. Acoust. Soc. Am. 0001-4966 102, 2412-2421. 10.1121/1.419624
    • (1997) J. Acoust. Soc. Am. , vol.102 , pp. 2412-2421
    • Kollmeier, B.1    Kliem, K.2    Wesselkamp, M.3
  • 25
    • 85040875724 scopus 로고
    • Sprachverständlichkeitsmessungen für die Audiologie mit einem Reimtest in deutscher Sprache: Erstellung und Evaluation von Testlisten (Speech intelligibility measurements for audiology based on a German rhyme test: Preparation and evaluation of test lists)
    • Kollmeier, B., and Wallenberg, E. -L. (1989). " Sprachverstä ndlichkeitsmessungen für die Audiologie mit einem Reimtest in deutscher Sprache: Erstellung und Evaluation von Testlisten (Speech intelligibility measurements for audiology based on a German rhyme test: Preparation and evaluation of test lists).," Audiologische Akustik 28, 50-65.
    • (1989) Audiologische Akustik , vol.28 , pp. 50-65
    • Kollmeier, B.1    Wallenberg, E.-L.2
  • 26
    • 0004138963 scopus 로고
    • Master's thesis, Dept. of Electrical Engineering, Massachusetts Institute of Technology, Cambridge, MA.
    • Krause, J. C. (1993). " The effects of speaking rate and speaking mode on intelligibility.," Master's thesis, Dept. of Electrical Engineering, Massachusetts Institute of Technology, Cambridge, MA.
    • (1993) The Effects of Speaking Rate and Speaking Mode on Intelligibility
    • Krause, J.C.1
  • 27
    • 0036841359 scopus 로고    scopus 로고
    • Investigating alternative forms of clear speech: The effects of speaking rate and speaking mode on intelligibility
    • 0001-4966,. 10.1121/1.1509432
    • Krause, J. C., and Braida, L. D. (2002). " Investigating alternative forms of clear speech: The effects of speaking rate and speaking mode on intelligibility.," J. Acoust. Soc. Am. 0001-4966 112, 2165-2172. 10.1121/1.1509432
    • (2002) J. Acoust. Soc. Am. , vol.112 , pp. 2165-2172
    • Krause, J.C.1    Braida, L.D.2
  • 28
    • 1642499127 scopus 로고    scopus 로고
    • Acoustic properties of naturally produced clear speech at normal speaking rates
    • 0001-4966,. 10.1121/1.1635842
    • Krause, J. C., and Braida, L. D. (2004). " Acoustic properties of naturally produced clear speech at normal speaking rates.," J. Acoust. Soc. Am. 0001-4966 115, 362-378. 10.1121/1.1635842
    • (2004) J. Acoust. Soc. Am. , vol.115 , pp. 362-378
    • Krause, J.C.1    Braida, L.D.2
  • 29
    • 0042068306 scopus 로고    scopus 로고
    • Accent, intelligibility, and comprehensibility in the perception of foreign-accented Lombard speech
    • 0001-4966,. 10.1121/1.1593060
    • Li, C. -n. (2003). " Accent, intelligibility, and comprehensibility in the perception of foreign-accented Lombard speech.," J. Acoust. Soc. Am. 0001-4966 114, 2364. 10.1121/1.1593060
    • (2003) J. Acoust. Soc. Am. , vol.114 , pp. 2364
    • Li, C.-N.1
  • 30
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • 0167-6393,. 10.1016/S0167-6393(97)00021-6
    • Lippmann, R. (1997). " Speech recognition by machines and humans.," Speech Commun. 0167-6393 22, 1-15. 10.1016/S0167-6393(97)00021-6
    • (1997) Speech Commun. , vol.22 , pp. 1-15
    • Lippmann, R.1
  • 34
    • 84955023511 scopus 로고
    • An analysis of perceptual confusions among some english consonants
    • 0001-4966,. 10.1121/1.1907526
    • Miller, G., and Nicely, P. (1955). " An analysis of perceptual confusions among some english consonants.," J. Acoust. Soc. Am. 0001-4966 27, 338-352. 10.1121/1.1907526
    • (1955) J. Acoust. Soc. Am. , vol.27 , pp. 338-352
    • Miller, G.1    Nicely, P.2
  • 35
    • 54149118777 scopus 로고    scopus 로고
    • Development of a speaker discrimination test for cochlear implant users based on the OLLO logatome corpus
    • " 0301-1569,. 10.1159/000165170
    • Mühler, R., Ziese, M., and Rostalski, D. (2009). " Development of a speaker discrimination test for cochlear implant users based on the OLLO logatome corpus.," ORL 0301-1569 71, 14-20. 10.1159/000165170
    • (2009) ORL , vol.71 , pp. 14-20
    • Mühler, R.1    Ziese, M.2    Rostalski, D.3
  • 37
    • 34047247534 scopus 로고    scopus 로고
    • Consonant and vowel confusions in speech-weighted noise
    • 0001-4966,. 10.1121/1.2642397
    • Phatak, S., and Allen, J. B. (2007). " Consonant and vowel confusions in speech-weighted noise.," J. Acoust. Soc. Am. 0001-4966 121, 2312-2326. 10.1121/1.2642397
    • (2007) J. Acoust. Soc. Am. , vol.121 , pp. 2312-2326
    • Phatak, S.1    Allen, J.B.2
  • 38
    • 77953557323 scopus 로고    scopus 로고
    • Modeling the use of durational information in human spoken-word recognition
    • 0001-4966,. 10.1121/1.3377050
    • Scharenborg, O. (2010). " Modeling the use of durational information in human spoken-word recognition.," J. Acoust. Soc. Am. 0001-4966 127, 3758-3770. 10.1121/1.3377050
    • (2010) J. Acoust. Soc. Am. , vol.127 , pp. 3758-3770
    • Scharenborg, O.1
  • 39
    • 0021143595 scopus 로고
    • A procedure for phonetic transcription by consensus
    • " 0022-4685.
    • Schriberg, L. D., Kwiatkowski, J., and Hoffmann, K. (1984). " A procedure for phonetic transcription by consensus.," J. Speech Hear. Res. 0022-4685 27, 456-465.
    • (1984) J. Speech Hear. Res. , vol.27 , pp. 456-465
    • Schriberg, L.D.1    Kwiatkowski, J.2    Hoffmann, K.3
  • 42
    • 15844428932 scopus 로고    scopus 로고
    • Human and machine consonant recognition
    • 0167-6393,. 10.1016/j.specom.2004.11.009
    • Sroka, J. J., and Braida, L. D. (2005). " Human and machine consonant recognition.," Speech Commun. 0167-6393 45, 401-423. 10.1016/j.specom.2004.11.009
    • (2005) Speech Commun. , vol.45 , pp. 401-423
    • Sroka, J.J.1    Braida, L.D.2
  • 43
    • 0002788784 scopus 로고    scopus 로고
    • Signal processing for robust speech recognition
    • ", edited by C. -H. Lee, F. K. Soong, and K. K. Paliwal (Springer, Berlin), Cha.
    • Stern, R., Acero, A., Liu, F. H., and Ohshima, Y. (1996). " Signal processing for robust speech recognition.," Automatic Speech and Speaker Recognition, edited by, C. -H. Lee, F. K. Soong, and, K. K. Paliwal, (Springer, Berlin), Chap.
    • (1996) Automatic Speech and Speaker Recognition
    • Stern, R.1    Acero, A.2    Liu, F.H.3    Ohshima, Y.4
  • 44
    • 0022118936 scopus 로고
    • A rationalized' arcsine transform
    • 0022-4685.
    • Studebaker, G. A. (1985). " A rationalized' arcsine transform.," J. Speech Hear. Res. 0022-4685 28, 455-462.
    • (1985) J. Speech Hear. Res. , vol.28 , pp. 455-462
    • Studebaker, G.A.1
  • 45
    • 0032828464 scopus 로고    scopus 로고
    • A model of auditory perception as front end for automatic speech recognition
    • 0001-4966,. 10.1121/1.427950
    • Tchorz, J., and Kollmeier, B. (1999). " A model of auditory perception as front end for automatic speech recognition.," J. Acoust. Soc. Am. 0001-4966 106, 2040-2050. 10.1121/1.427950
    • (1999) J. Acoust. Soc. Am. , vol.106 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 46
    • 34247844467 scopus 로고    scopus 로고
    • Bridging the gap between human and automatic speech recognition
    • 0167-6393,. 10.1016/j.specom.2007.03.001
    • ten Bosch, L., and Kirchhoff, K. (2007). " Bridging the gap between human and automatic speech recognition.," Speech Commun. 0167-6393 49, 331-335. 10.1016/j.specom.2007.03.001
    • (2007) Speech Commun. , vol.49 , pp. 331-335
    • Ten Bosch, L.1    Kirchhoff, K.2
  • 47
    • 0015749654 scopus 로고
    • Consonant confusions in noise: A study of perceptual features
    • 0001-4966,. 10.1121/1.1914417
    • Wang, M., and Bilger, R. (1973). " Consonant confusions in noise: A study of perceptual features.," J. Acoust. Soc. Am. 0001-4966 54, 1248-1266. 10.1121/1.1914417
    • (1973) J. Acoust. Soc. Am. , vol.54 , pp. 1248-1266
    • Wang, M.1    Bilger, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.