메뉴 건너뛰기




Volumn 22, Issue 1, 1997, Pages 1-15

Speech recognition by machines and humans

Author keywords

Automatic speech recognition; Machine recognition; Noise; Nonsense sentences; Nonsense syllables; Perception; Performance; Speech; Speech perception; Speech recognition

Indexed keywords

ACOUSTIC NOISE; HUMAN COMPUTER INTERACTION; MAN MACHINE SYSTEMS; SPEECH ANALYSIS;

EID: 0031187171     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(97)00021-6     Document Type: Article
Times cited : (430)

References (44)
  • 1
    • 0030142722 scopus 로고    scopus 로고
    • Towards increasing speech recognition error rates
    • Bourlard, H., Hermansky, H., Morgan, N., 1996. Towards increasing speech recognition error rates. Speech Communication 18 (3), 205-231.
    • (1996) Speech Communication , vol.18 , Issue.3 , pp. 205-231
    • Bourlard, H.1    Hermansky, H.2    Morgan, N.3
  • 4
    • 85027136924 scopus 로고
    • Minimum error rate training of inter-word context dependent acoustic model units in speech recognition
    • Yokohama, Japan
    • Chou, W., Lee, C.H., Juang, B.H., 1994. Minimum error rate training of inter-word context dependent acoustic model units in speech recognition. Proc. Internat. Conf. on Spoken Language Processing, Yokohama, Japan, pp. 509:3.1-3.4.
    • (1994) Proc. Internat. Conf. on Spoken Language Processing , pp. 509
    • Chou, W.1    Lee, C.H.2    Juang, B.H.3
  • 6
    • 0004698056 scopus 로고    scopus 로고
    • Summary of session 7 - conversational and multi-lingual speech recognition
    • Morgan Kaufmann, Harriman, NY
    • Culhane, C., 1996. Summary of session 7 - Conversational and multi-lingual speech recognition. Proc. DARPA Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 143-144.
    • (1996) Proc. DARPA Speech Recognition Workshop , pp. 143-144
    • Culhane, C.1
  • 9
    • 15844378911 scopus 로고
    • Human speech recognition performance on the 1994 CSR spoke 10 corpus
    • Morgan Kaufmann, Austin, TX
    • Ebel, W.J., Picone, J., 1995. Human speech recognition performance on the 1994 CSR Spoke 10 corpus. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 53-59.
    • (1995) Proc. Spoken Language Systems Technology Workshop , pp. 53-59
    • Ebel, W.J.1    Picone, J.2
  • 15
    • 0022150487 scopus 로고
    • The development of an experimental discrete dictation recognizer
    • Jelinek, F., 1985. The development of an experimental discrete dictation recognizer. Proc. IEEE 73, 1616-1624.
    • (1985) Proc. IEEE , vol.73 , pp. 1616-1624
    • Jelinek, F.1
  • 16
    • 0041107346 scopus 로고
    • Adaptability to differences between talkers in Japanese monosyllabic perception
    • Tohkura, Y., Vatikiotis-Bateson, E., Sagisaka, Y. (Eds.), IOS Press, Amsterdam
    • Kakehi, K., 1992. Adaptability to differences between talkers in Japanese monosyllabic perception. In: Tohkura, Y., Vatikiotis-Bateson, E., Sagisaka, Y. (Eds.), Speech Perception, Production and Linguistic Structure. IOS Press, Amsterdam, pp. 135-142.
    • (1992) Speech Perception, Production and Linguistic Structure , pp. 135-142
    • Kakehi, K.1
  • 17
    • 0001760126 scopus 로고
    • Speech bandwidth compression through spectrum selection
    • Kryter, K.D., 1960. Speech bandwidth compression through spectrum selection. J. Acoust. Soc. Amer. 32, 547-556.
    • (1960) J. Acoust. Soc. Amer. , vol.32 , pp. 547-556
    • Kryter, K.D.1
  • 18
    • 0343125611 scopus 로고
    • Design of the 1994 CSR Benchmark Tests
    • Morgan Kaufmann, Austin, TX
    • Kubala, F., 1995. Design of the 1994 CSR Benchmark Tests. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 41-46.
    • (1995) Proc. Spoken Language Systems Technology Workshop , pp. 41-46
    • Kubala, F.1
  • 19
    • 30244476594 scopus 로고
    • SWITCHBOARD: A user's manual, catalog number LDC94s7
    • University of Pennsylvania, Philadelphia, PA
    • LDC, 1995. SWITCHBOARD: A User's Manual, Catalog Number LDC94S7. Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA.
    • (1995) Linguistic Data Consortium
  • 22
    • 0012778209 scopus 로고
    • Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech
    • Licklider, J.C.R., Pollack, I., 1948. Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech. J. Acoust. Soc. Amer. 20, 42-51.
    • (1948) J. Acoust. Soc. Amer. , vol.20 , pp. 42-51
    • Licklider, J.C.R.1    Pollack, I.2
  • 23
    • 0029754956 scopus 로고    scopus 로고
    • Accurate consonant perception without mid-frequency speech energy
    • Lippmann, R.P., 1996. Accurate consonant perception without mid-frequency speech energy. IEEE Trans. Speech Audio Process. 4, 66-69.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 66-69
    • Lippmann, R.P.1
  • 25
    • 0019533542 scopus 로고
    • A study of multichannel amplitude compression and linear amplification for persons with sensorineural hearing loss
    • Lippmann, R.P., Braida, L.D., Durlach, N.I., 1981. A study of multichannel amplitude compression and linear amplification for persons with sensorineural hearing loss. J. Acoust. Soc. Amer. 69, 524-534.
    • (1981) J. Acoust. Soc. Amer. , vol.69 , pp. 524-534
    • Lippmann, R.P.1    Braida, L.D.2    Durlach, N.I.3
  • 27
    • 30244488488 scopus 로고    scopus 로고
    • Personal communication
    • A. Martin, 1996. Personal communication.
    • (1996)
    • Martin, A.1
  • 30
    • 6744229564 scopus 로고
    • DARPA resource management and ATIS benchmark test poster session
    • Morgan Kaufmann, Austin, TX
    • Pallett, D.S., 1991. DARPA resource management and ATIS benchmark test poster session. Proc. DARPA Speech and Natural Language Workshop. Morgan Kaufmann, Austin, TX, pp. 49-58.
    • (1991) Proc. DARPA Speech and Natural Language Workshop , pp. 49-58
    • Pallett, D.S.1
  • 31
    • 0012316245 scopus 로고
    • 1994 benchmark tests for the ARPA spoken language program
    • Morgan Kaufmann, Austin, TX
    • Pallett, D.S., Fiscus, J.G., et al., 1995. 1994 benchmark tests for the ARPA Spoken Language Program. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 5-36.
    • (1995) Proc. Spoken Language Systems Technology Workshop , pp. 5-36
    • Pallett, D.S.1    Fiscus, J.G.2
  • 32
    • 0012330750 scopus 로고
    • The design for the Wall Street Journal-based CSR corpus
    • Morgan Kaufmann, Austin, TX
    • Paul, D., Baker, J., 1992. The design for the Wall Street Journal-based CSR corpus. Proc. DARPA Speech and Natural Language Workshop. Morgan Kaufmann, Austin, TX, pp. 357-360.
    • (1992) Proc. DARPA Speech and Natural Language Workshop , pp. 357-360
    • Paul, D.1    Baker, J.2
  • 34
    • 84964176674 scopus 로고
    • The intelligibility of excerpts from conversation
    • Pollack, I., Pickett, J.M., 1963. The intelligibility of excerpts from conversation. Language and Speech 6, 165-171.
    • (1963) Language and Speech , vol.6 , pp. 165-171
    • Pollack, I.1    Pickett, J.M.2
  • 37
    • 0012327349 scopus 로고    scopus 로고
    • Specification of the 1995 ARPA Hub 3 evaluation: Unlimited vocabulary NAB news baseline
    • Morgan Kaufmann, Harriman, NY
    • Stern, R.M., 1996. Specification of the 1995 ARPA Hub 3 evaluation: Unlimited vocabulary NAB news baseline. Proc. Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 5-7.
    • (1996) Proc. Speech Recognition Workshop , pp. 5-7
    • Stern, R.M.1
  • 38
    • 0040262071 scopus 로고
    • Human benchmarks for speaker independent large vocabulary recognition performance
    • Madrid
    • Van Leeuwen, D.A., Van den Berg, L.G., Steeneken, H.J.M., 1995. Human benchmarks for speaker independent large vocabulary recognition performance. Eurospeech, Madrid, pp. 1461-1464.
    • (1995) Eurospeech , pp. 1461-1464
    • Van Leeuwen, D.A.1    Van Den Berg, L.G.2    Steeneken, H.J.M.3
  • 39
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • Varga, A., Steeneken, H.J.M., 1993. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication 12 (3), 247-251.
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 40
    • 0014346711 scopus 로고
    • Relation between intelligibility scores for four test methods and three types of speech distortion
    • Williams, C.E., Hecker, M.H.L., 1968. Relation between intelligibility scores for four test methods and three types of speech distortion. J. Acoust. Soc. Amer. 44 (4), 1002-1006.
    • (1968) J. Acoust. Soc. Amer. , vol.44 , Issue.4 , pp. 1002-1006
    • Williams, C.E.1    Hecker, M.H.L.2
  • 41
    • 0002452931 scopus 로고    scopus 로고
    • The HTK large vocabulary recognition system for the 1995 ARPA H3 task
    • Morgan Kaufmann, Harriman, NY
    • Woodland, P.C., Gales, M.J.F., Pye, D., Valtchev, V., 1996. The HTK large vocabulary recognition system for the 1995 ARPA H3 Task. Proc. Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 99-104.
    • (1996) Proc. Speech Recognition Workshop , pp. 99-104
    • Woodland, P.C.1    Gales, M.J.F.2    Pye, D.3    Valtchev, V.4
  • 42
    • 0030244826 scopus 로고    scopus 로고
    • A review of large-vocabulary continuous-speech recognition
    • Young, S.J., 1996. A review of large-vocabulary continuous-speech recognition. IEEE Signal Process. Mag. 13, 45-57.
    • (1996) IEEE Signal Process. Mag. , vol.13 , pp. 45-57
    • Young, S.J.1
  • 43
    • 0028516923 scopus 로고
    • Spontaneous speech recognition for the credit card corpus using the HTK toolkit
    • Young, S.J., Woodland, P.C., Byrne, W.J., 1994. Spontaneous speech recognition for the credit card corpus using the HTK toolkit. IEEE Trans. Speech Audio Process. 2 (4), 615-621.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 615-621
    • Young, S.J.1    Woodland, P.C.2    Byrne, W.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.