-
1
-
-
0028516073
-
How do human process and recognize speech?
-
1063-6676,. 10.1109/89.326615
-
Allen, J. B. (1994). " How do human process and recognize speech?," IEEE Trans. Speech Audio Process. 1063-6676 2, 567-577. 10.1109/89.326615
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, pp. 567-577
-
-
Allen, J.B.1
-
2
-
-
34247568840
-
Modelling speaker intelligibility in noise
-
0167-6393,. 10.1016/j.specom.2006.11.003
-
Barker, J., and Cooke, M. (2007). " Modelling speaker intelligibility in noise.," Speech Commun. 0167-6393 49, 402-417. 10.1016/j.specom.2006.11.003
-
(2007)
Speech Commun.
, vol.49
, pp. 402-417
-
-
Barker, J.1
Cooke, M.2
-
3
-
-
0027465489
-
A model for context effects in speech recognition
-
" 0001-4966,. 10.1121/1.406844
-
Bronkhorst, A. W., Bosman, A. J., and Smoorenburg, G. G. (1993). " A model for context effects in speech recognition.," J. Acoust. Soc. Am. 0001-4966 93, 499-509. 10.1121/1.406844
-
(1993)
J. Acoust. Soc. Am.
, vol.93
, pp. 499-509
-
-
Bronkhorst, A.W.1
Bosman, A.J.2
Smoorenburg, G.G.3
-
4
-
-
26444619785
-
An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language
-
" 0167-6393,. 10.1016/j.specom.2005.01.006
-
Chang, S., Wester, M., and Greenberg, S. (2005). " An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language.," Speech Commun. 0167-6393 47, 290-311. 10.1016/j.specom.2005.01.006
-
(2005)
Speech Commun.
, vol.47
, pp. 290-311
-
-
Chang, S.1
Wester, M.2
Greenberg, S.3
-
6
-
-
0035342414
-
Robust automatic speech recognition with missing and uncertain acoustic data
-
" 0167-6393,. 10.1016/S0167-6393(00)00034-0
-
Cooke, M. P., Green, P. D., Josifovski, L. B., and Vizinho, A. (2001). " Robust automatic speech recognition with missing and uncertain acoustic data.," Speech Commun. 0167-6393 34, 267-285. 10.1016/S0167-6393(00)00034-0
-
(2001)
Speech Commun.
, vol.34
, pp. 267-285
-
-
Cooke, M.P.1
Green, P.D.2
Josifovski, L.B.3
Vizinho, A.4
-
7
-
-
0034920512
-
ICRA noises: Artificial noise signals with speechlike spectral and temporal properties for hearing instrument assessment
-
" 0020-6091,. 10.3109/00206090109073110
-
Dreschler, W. A., Ludvigson, C., and Westermann, S. (2001). " ICRA noises: Artificial noise signals with speechlike spectral and temporal properties for hearing instrument assessment.," Audiology 0020-6091 40, 148-157. 10.3109/00206090109073110
-
(2001)
Audiology
, vol.40
, pp. 148-157
-
-
Dreschler, W.A.1
Ludvigson, C.2
Westermann, S.3
-
8
-
-
0019353058
-
Predicting consonant confusions from acoustic analysis
-
0001-4966,. 10.1121/1.385345
-
Dubno, J. R., and Levitt, H. (1981). " Predicting consonant confusions from acoustic analysis.," J. Acoust. Soc. Am. 0001-4966 69, 249-261. 10.1121/1.385345
-
(1981)
J. Acoust. Soc. Am.
, vol.69
, pp. 249-261
-
-
Dubno, J.R.1
Levitt, H.2
-
9
-
-
34547941599
-
Automatic speech recognition and speech variability: A review
-
" 0167-6393,. 10.1016/j.specom.2007.02.006
-
Fissore, L., Mertins, A., Ris, A., Rose, R., Tyagi, V., and Wellekens, C., (2007). " Automatic speech recognition and speech variability: A review.," Speech Commun. 0167-6393 49, 763-786. 10.1016/j.specom.2007.02. 006
-
(2007)
Speech Commun.
, vol.49
, pp. 763-786
-
-
Fissore, L.1
Mertins, A.2
Ris, A.3
Rose, R.4
Tyagi, V.5
Wellekens, C.6
-
10
-
-
0038188722
-
Interaction between the native and second language phonetic subsystems
-
" 0167-6393,. 10.1016/S0167-6393(02)00128-0
-
Flege, J. E., Schirru, C., and MacKay, I. R. A. (2003). " Interaction between the native and second language phonetic subsystems.," Speech Commun. 0167-6393 40, 467-491. 10.1016/S0167-6393(02)00128-0
-
(2003)
Speech Commun.
, vol.40
, pp. 467-491
-
-
Flege, J.E.1
Schirru, C.2
MacKay, I.R.A.3
-
11
-
-
0033321442
-
Effects of speaking rate and word frequency on conversational pronunciations
-
0167-6393,. 10.1016/S0167-6393(99)00035-7
-
Fosler-Lussier, E., and Morgan, N. (1999). " Effects of speaking rate and word frequency on conversational pronunciations.," Speech Commun. 0167-6393 29, 137-158. 10.1016/S0167-6393(99)00035-7
-
(1999)
Speech Commun.
, vol.29
, pp. 137-158
-
-
Fosler-Lussier, E.1
Morgan, N.2
-
12
-
-
84953657538
-
Factors governing the intelligibility of speech sounds
-
0001-4966,. 10.1121/1.1916407
-
French, N. R., and Steinberg, J. C. (1947). " Factors governing the intelligibility of speech sounds.," J. Acoust. Soc. Am. 0001-4966 19, 90-119. 10.1121/1.1916407
-
(1947)
J. Acoust. Soc. Am.
, vol.19
, pp. 90-119
-
-
French, N.R.1
Steinberg, J.C.2
-
13
-
-
0034902190
-
Speech recognition in noise as a function of the number of spectral channels: Comparison of acoustic hearing and cochlear implants
-
" 0001-4966,. 10.1121/1.1381538
-
Friesen, L. M., Shannon, R. V., Baskent, D., and Wang, X. (2001). " Speech recognition in noise as a function of the number of spectral channels: Comparison of acoustic hearing and cochlear implants.," J. Acoust. Soc. Am. 0001-4966 110, 1150-1163. 10.1121/1.1381538
-
(2001)
J. Acoust. Soc. Am.
, vol.110
, pp. 1150-1163
-
-
Friesen, L.M.1
Shannon, R.V.2
Baskent, D.3
Wang, X.4
-
14
-
-
0022348813
-
Consonant recognition in quiet as a function of aging among normal hearing subjects
-
" 0001-4966,. 10.1121/1.392888
-
Gelfand, S., Piper, N., and Silman, S. (1985). " Consonant recognition in quiet as a function of aging among normal hearing subjects.," J. Acoust. Soc. Am. 0001-4966 78, 1198-1206. 10.1121/1.392888
-
(1985)
J. Acoust. Soc. Am.
, vol.78
, pp. 1198-1206
-
-
Gelfand, S.1
Piper, N.2
Silman, S.3
-
15
-
-
0029816778
-
Evaluating the articulation index for auditory-visual consonant recognition
-
0001-4966,. 10.1121/1.417950
-
Grant, K. W., and Walden, B. E. (1996). " Evaluating the articulation index for auditory-visual consonant recognition.," J. Acoust. Soc. Am. 0001-4966 100, 2415-2424. 10.1121/1.417950
-
(1996)
J. Acoust. Soc. Am.
, vol.100
, pp. 2415-2424
-
-
Grant, K.W.1
Walden, B.E.2
-
16
-
-
9644287935
-
Acoustic-phonetic correlates of talker intelligibility for adults and children
-
0001-4966,. 10.1121/1.1806826
-
Hazan, V., and Markham, D. (2004). " Acoustic-phonetic correlates of talker intelligibility for adults and children.," J. Acoust. Soc. Am. 0001-4966 116, 3108-3118. 10.1121/1.1806826
-
(2004)
J. Acoust. Soc. Am.
, vol.116
, pp. 3108-3118
-
-
Hazan, V.1
Markham, D.2
-
17
-
-
0028517164
-
RASTA processing of speech
-
1063-6676,. 10.1109/89.326616
-
Hermansky, H., and Morgan, H. (1994). " RASTA processing of speech.," IEEE Trans. Speech Audio Process. 1063-6676 2, 578-589. 10.1109/89.326616
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, pp. 578-589
-
-
Hermansky, H.1
Morgan, H.2
-
18
-
-
84906215978
-
-
" in Proceedings of Interspeech
-
Jürgens, T., Brand, T., and Kollmeier, B. (2007). " Modelling the human-machine gap in speech reception: Microscopic speech intelligibility prediction for normal-hearing subjects with an auditory model.," in Proceedings of Interspeech, pp. 410-413.
-
(2007)
Modelling the Human-machine Gap in Speech Reception: Microscopic Speech Intelligibility Prediction for Normal-hearing Subjects with An Auditory Model
, pp. 410-413
-
-
Jürgens, T.1
Brand, T.2
Kollmeier, B.3
-
19
-
-
0030362970
-
-
" in Proceedings of the International Conference on Spoken Language Processing (ICSLP)
-
Kipp, A., Wesenick, M. -B., and Schiel, F. (1996). " Automatic detection and segmentation of pronunciation variants in German speech corpora.," in Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp. 106-109.
-
(1996)
Automatic Detection and Segmentation of Pronunciation Variants in German Speech Corpora
, pp. 106-109
-
-
Kipp, A.1
Wesenick, M.-B.2
Schiel, F.3
-
23
-
-
0039329242
-
-
Habilitation thesis, University of Göttingen, Fachbereich Physik, Göttingen.
-
Kollmeier, B. (1990). " MeΒmethodik, Modellierung und Verbesserung der Verständlichkeit von Sprache (Measurement, modeling and improvement of speech intelligibility).," Habilitation thesis, University of Göttingen, Fachbereich Physik, Göttingen.
-
(1990)
MeΒmethodik, Modellierung und Verbesserung der Verständlichkeit von Sprache (Measurement, Modeling and Improvement of Speech Intelligibility)
-
-
Kollmeier, B.1
-
24
-
-
0030829791
-
Development and evaluation of a German sentence test for objective and subjective speech intelligibility assessment
-
" 0001-4966,. 10.1121/1.419624
-
Kollmeier, B., Kliem, K., and Wesselkamp, M. (1997). " Development and evaluation of a German sentence test for objective and subjective speech intelligibility assessment.," J. Acoust. Soc. Am. 0001-4966 102, 2412-2421. 10.1121/1.419624
-
(1997)
J. Acoust. Soc. Am.
, vol.102
, pp. 2412-2421
-
-
Kollmeier, B.1
Kliem, K.2
Wesselkamp, M.3
-
25
-
-
85040875724
-
Sprachverständlichkeitsmessungen für die Audiologie mit einem Reimtest in deutscher Sprache: Erstellung und Evaluation von Testlisten (Speech intelligibility measurements for audiology based on a German rhyme test: Preparation and evaluation of test lists)
-
Kollmeier, B., and Wallenberg, E. -L. (1989). " Sprachverstä ndlichkeitsmessungen für die Audiologie mit einem Reimtest in deutscher Sprache: Erstellung und Evaluation von Testlisten (Speech intelligibility measurements for audiology based on a German rhyme test: Preparation and evaluation of test lists).," Audiologische Akustik 28, 50-65.
-
(1989)
Audiologische Akustik
, vol.28
, pp. 50-65
-
-
Kollmeier, B.1
Wallenberg, E.-L.2
-
26
-
-
0004138963
-
-
Master's thesis, Dept. of Electrical Engineering, Massachusetts Institute of Technology, Cambridge, MA.
-
Krause, J. C. (1993). " The effects of speaking rate and speaking mode on intelligibility.," Master's thesis, Dept. of Electrical Engineering, Massachusetts Institute of Technology, Cambridge, MA.
-
(1993)
The Effects of Speaking Rate and Speaking Mode on Intelligibility
-
-
Krause, J.C.1
-
27
-
-
0036841359
-
Investigating alternative forms of clear speech: The effects of speaking rate and speaking mode on intelligibility
-
0001-4966,. 10.1121/1.1509432
-
Krause, J. C., and Braida, L. D. (2002). " Investigating alternative forms of clear speech: The effects of speaking rate and speaking mode on intelligibility.," J. Acoust. Soc. Am. 0001-4966 112, 2165-2172. 10.1121/1.1509432
-
(2002)
J. Acoust. Soc. Am.
, vol.112
, pp. 2165-2172
-
-
Krause, J.C.1
Braida, L.D.2
-
28
-
-
1642499127
-
Acoustic properties of naturally produced clear speech at normal speaking rates
-
0001-4966,. 10.1121/1.1635842
-
Krause, J. C., and Braida, L. D. (2004). " Acoustic properties of naturally produced clear speech at normal speaking rates.," J. Acoust. Soc. Am. 0001-4966 115, 362-378. 10.1121/1.1635842
-
(2004)
J. Acoust. Soc. Am.
, vol.115
, pp. 362-378
-
-
Krause, J.C.1
Braida, L.D.2
-
29
-
-
0042068306
-
Accent, intelligibility, and comprehensibility in the perception of foreign-accented Lombard speech
-
0001-4966,. 10.1121/1.1593060
-
Li, C. -n. (2003). " Accent, intelligibility, and comprehensibility in the perception of foreign-accented Lombard speech.," J. Acoust. Soc. Am. 0001-4966 114, 2364. 10.1121/1.1593060
-
(2003)
J. Acoust. Soc. Am.
, vol.114
, pp. 2364
-
-
Li, C.-N.1
-
30
-
-
0031187171
-
Speech recognition by machines and humans
-
0167-6393,. 10.1016/S0167-6393(97)00021-6
-
Lippmann, R. (1997). " Speech recognition by machines and humans.," Speech Commun. 0167-6393 22, 1-15. 10.1016/S0167-6393(97)00021-6
-
(1997)
Speech Commun.
, vol.22
, pp. 1-15
-
-
Lippmann, R.1
-
32
-
-
78649541496
-
-
" in Proceedings of Interspeech
-
Meyer, B. T., Wächter, M., Brand, T., and Kollmeier, B. (2007). " Phoneme confusions in human and automatic speech recognition.," in Proceedings of Interspeech, pp. 1485-1488.
-
(2007)
Phoneme Confusions in Human and Automatic Speech Recognition
, pp. 1485-1488
-
-
Meyer, B.T.1
Wächter, M.2
Brand, T.3
Kollmeier, B.4
-
33
-
-
78649625648
-
-
" in Proceedings of the Workshoon Speech Recognition and Intrinsic Variation
-
Meyer, B. T., Wesker, T., Brand, T., Mertins, A., and Kollmeier, B. (2006). " A human-machine comparison in speech recognition based on a logatome corpus.," in Proceedings of the Workshop on Speech Recognition and Intrinsic Variation, pp. 95-101.
-
(2006)
A Human-machine Comparison in Speech Recognition Based on A Logatome Corpus
, pp. 95-101
-
-
Meyer, B.T.1
Wesker, T.2
Brand, T.3
Mertins, A.4
Kollmeier, B.5
-
34
-
-
84955023511
-
An analysis of perceptual confusions among some english consonants
-
0001-4966,. 10.1121/1.1907526
-
Miller, G., and Nicely, P. (1955). " An analysis of perceptual confusions among some english consonants.," J. Acoust. Soc. Am. 0001-4966 27, 338-352. 10.1121/1.1907526
-
(1955)
J. Acoust. Soc. Am.
, vol.27
, pp. 338-352
-
-
Miller, G.1
Nicely, P.2
-
35
-
-
54149118777
-
Development of a speaker discrimination test for cochlear implant users based on the OLLO logatome corpus
-
" 0301-1569,. 10.1159/000165170
-
Mühler, R., Ziese, M., and Rostalski, D. (2009). " Development of a speaker discrimination test for cochlear implant users based on the OLLO logatome corpus.," ORL 0301-1569 71, 14-20. 10.1159/000165170
-
(2009)
ORL
, vol.71
, pp. 14-20
-
-
Mühler, R.1
Ziese, M.2
Rostalski, D.3
-
37
-
-
34047247534
-
Consonant and vowel confusions in speech-weighted noise
-
0001-4966,. 10.1121/1.2642397
-
Phatak, S., and Allen, J. B. (2007). " Consonant and vowel confusions in speech-weighted noise.," J. Acoust. Soc. Am. 0001-4966 121, 2312-2326. 10.1121/1.2642397
-
(2007)
J. Acoust. Soc. Am.
, vol.121
, pp. 2312-2326
-
-
Phatak, S.1
Allen, J.B.2
-
38
-
-
77953557323
-
Modeling the use of durational information in human spoken-word recognition
-
0001-4966,. 10.1121/1.3377050
-
Scharenborg, O. (2010). " Modeling the use of durational information in human spoken-word recognition.," J. Acoust. Soc. Am. 0001-4966 127, 3758-3770. 10.1121/1.3377050
-
(2010)
J. Acoust. Soc. Am.
, vol.127
, pp. 3758-3770
-
-
Scharenborg, O.1
-
39
-
-
0021143595
-
A procedure for phonetic transcription by consensus
-
" 0022-4685.
-
Schriberg, L. D., Kwiatkowski, J., and Hoffmann, K. (1984). " A procedure for phonetic transcription by consensus.," J. Speech Hear. Res. 0022-4685 27, 456-465.
-
(1984)
J. Speech Hear. Res.
, vol.27
, pp. 456-465
-
-
Schriberg, L.D.1
Kwiatkowski, J.2
Hoffmann, K.3
-
41
-
-
51449112452
-
-
" in Proceedings of ICASSP
-
Siniscalchi, S. M., Svendsen, T., and Lee, C. -H. (2008). " Towards a detector-based universal phone recognizer.," in Proceedings of ICASSP, pp. 4261-4264.
-
(2008)
Towards A Detector-based Universal Phone Recognizer
, pp. 4261-4264
-
-
Siniscalchi, S.M.1
Svendsen, T.2
Lee, C.-H.3
-
42
-
-
15844428932
-
Human and machine consonant recognition
-
0167-6393,. 10.1016/j.specom.2004.11.009
-
Sroka, J. J., and Braida, L. D. (2005). " Human and machine consonant recognition.," Speech Commun. 0167-6393 45, 401-423. 10.1016/j.specom.2004.11.009
-
(2005)
Speech Commun.
, vol.45
, pp. 401-423
-
-
Sroka, J.J.1
Braida, L.D.2
-
43
-
-
0002788784
-
Signal processing for robust speech recognition
-
", edited by C. -H. Lee, F. K. Soong, and K. K. Paliwal (Springer, Berlin), Cha.
-
Stern, R., Acero, A., Liu, F. H., and Ohshima, Y. (1996). " Signal processing for robust speech recognition.," Automatic Speech and Speaker Recognition, edited by, C. -H. Lee, F. K. Soong, and, K. K. Paliwal, (Springer, Berlin), Chap.
-
(1996)
Automatic Speech and Speaker Recognition
-
-
Stern, R.1
Acero, A.2
Liu, F.H.3
Ohshima, Y.4
-
44
-
-
0022118936
-
A rationalized' arcsine transform
-
0022-4685.
-
Studebaker, G. A. (1985). " A rationalized' arcsine transform.," J. Speech Hear. Res. 0022-4685 28, 455-462.
-
(1985)
J. Speech Hear. Res.
, vol.28
, pp. 455-462
-
-
Studebaker, G.A.1
-
45
-
-
0032828464
-
A model of auditory perception as front end for automatic speech recognition
-
0001-4966,. 10.1121/1.427950
-
Tchorz, J., and Kollmeier, B. (1999). " A model of auditory perception as front end for automatic speech recognition.," J. Acoust. Soc. Am. 0001-4966 106, 2040-2050. 10.1121/1.427950
-
(1999)
J. Acoust. Soc. Am.
, vol.106
, pp. 2040-2050
-
-
Tchorz, J.1
Kollmeier, B.2
-
46
-
-
34247844467
-
Bridging the gap between human and automatic speech recognition
-
0167-6393,. 10.1016/j.specom.2007.03.001
-
ten Bosch, L., and Kirchhoff, K. (2007). " Bridging the gap between human and automatic speech recognition.," Speech Commun. 0167-6393 49, 331-335. 10.1016/j.specom.2007.03.001
-
(2007)
Speech Commun.
, vol.49
, pp. 331-335
-
-
Ten Bosch, L.1
Kirchhoff, K.2
-
47
-
-
0015749654
-
Consonant confusions in noise: A study of perceptual features
-
0001-4966,. 10.1121/1.1914417
-
Wang, M., and Bilger, R. (1973). " Consonant confusions in noise: A study of perceptual features.," J. Acoust. Soc. Am. 0001-4966 54, 1248-1266. 10.1121/1.1914417
-
(1973)
J. Acoust. Soc. Am.
, vol.54
, pp. 1248-1266
-
-
Wang, M.1
Bilger, R.2
-
48
-
-
78649566125
-
-
" in Proceedings of the Addendum of ICSLP
-
Weintraub, M., Taussig, K., Hunicke-Smith, K., and Snodgrass, A. (1996). " Effect of speaking style on LVCSR performance.," in Proceedings of the Addendum of ICSLP, pp. 1457-1460.
-
(1996)
Effect of Speaking Style on LVCSR Performance
, pp. 1457-1460
-
-
Weintraub, M.1
Taussig, K.2
Hunicke-Smith, K.3
Snodgrass, A.4
-
49
-
-
33745183789
-
-
" in Proceedings of Interspeech.
-
Wesker, T., Meyer, B., Wagener, K., Anemüller, J., Mertins, A., and Kollmeier, B. (2005). " Oldenburg logatome speech corpus (OLLO) for speech recognition experiments with humans and machines.," in Proceedings of Interspeech, 1273-1276.
-
(2005)
Oldenburg Logatome Speech Corpus (OLLO) for Speech Recognition Experiments with Humans and Machines
, pp. 1273-1276
-
-
Wesker, T.1
Meyer, B.2
Wagener, K.3
Anemüller, J.4
Mertins, A.5
Kollmeier, B.6
|