SCOPUS 정보 검색 플랫폼

Volumn 22, Issue 1, 1997, Pages 1-15

Speech recognition by machines and humans

a MIT LINCOLN LABORATORY (United States)

Author keywords

Automatic speech recognition; Machine recognition; Noise; Nonsense sentences; Nonsense syllables; Perception; Performance; Speech; Speech perception; Speech recognition

Indexed keywords

ACOUSTIC NOISE; HUMAN COMPUTER INTERACTION; MAN MACHINE SYSTEMS; SPEECH ANALYSIS;

AUTOMATIC SPEECH RECOGNITION; MACHINE RECOGNITION;

SPEECH RECOGNITION;

EID: 0031187171 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(97)00021-6 Document Type: Article

Times cited : (440)

References (44)

1
- 0030142722
- Towards increasing speech recognition error rates
- Bourlard, H., Hermansky, H., Morgan, N., 1996. Towards increasing speech recognition error rates. Speech Communication 18 (3), 205-231.
- (1996) Speech Communication , vol.18 , Issue.3 , pp. 205-231
- Bourlard, H.¹ Hermansky, H.² Morgan, N.³

2
- 0028531926
- Computational auditory scene analysis
- Brown, G.J., Cooke, M.P., 1994. Computational auditory scene analysis. Computer Speech and Language 8, 297-336.
- (1994) Computer Speech and Language , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.P.²

3
- 0029770992
- Improving wordspotting performance with artificially generated data
- Chang, E., Lippmann, R., 1996. Improving wordspotting performance with artificially generated data. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 526-529.
- (1996) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. , pp. 526-529
- Chang, E.¹ Lippmann, R.²

4
- 85027136924
- Minimum error rate training of inter-word context dependent acoustic model units in speech recognition
- Yokohama, Japan
- Chou, W., Lee, C.H., Juang, B.H., 1994. Minimum error rate training of inter-word context dependent acoustic model units in speech recognition. Proc. Internat. Conf. on Spoken Language Processing, Yokohama, Japan, pp. 509:3.1-3.4.
- (1994) Proc. Internat. Conf. on Spoken Language Processing , pp. 509
- Chou, W.¹ Lee, C.H.² Juang, B.H.³

5
- 0025557590
- Speaker-independent recognition of spoken english letters
- San Diego, CA
- Cole, R., Fanty, M., Muthusamy, Y., Gopalakrishnan, M., 1990. Speaker-independent recognition of spoken english letters. Proc. 1990 IEEE INNS Internat. Joint Conf. on Neural Networks, San Diego, CA, Vol. 2, pp. 45-51.
- (1990) Proc. 1990 IEEE INNS Internat. Joint Conf. on Neural Networks , vol.2 , pp. 45-51
- Cole, R.¹ Fanty, M.² Muthusamy, Y.³ Gopalakrishnan, M.⁴

6
- 0004698056
- Summary of session 7 - conversational and multi-lingual speech recognition
- Morgan Kaufmann, Harriman, NY
- Culhane, C., 1996. Summary of session 7 - Conversational and multi-lingual speech recognition. Proc. DARPA Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 143-144.
- (1996) Proc. DARPA Speech Recognition Workshop , pp. 143-144
- Culhane, C.¹

7
- 30244544754
- Master's Thesis, Massachusetts Institute of Technology
- Daly, N., 1987. Recognition of words from their spellings: Integration of multiple knowledge sources. Master's Thesis, Massachusetts Institute of Technology.
- (1987) Recognition of Words from Their Spellings: Integration of Multiple Knowledge Sources
- Daly, N.¹

8
- 30244578066
- Human speech recognition performance on the 1995 CSR Hub-3 corpus
- Morgan Kaufmann, Harriman, NY
- Deshmukh, N., Ganapathiraju, A., Duncan, R.J., Picone, J., 1996. Human speech recognition performance on the 1995 CSR Hub-3 corpus. Proc. DARPA Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 129-134.
- (1996) Proc. DARPA Speech Recognition Workshop , pp. 129-134
- Deshmukh, N.¹ Ganapathiraju, A.² Duncan, R.J.³ Picone, J.⁴

9
- 15844378911
- Human speech recognition performance on the 1994 CSR spoke 10 corpus
- Morgan Kaufmann, Austin, TX
- Ebel, W.J., Picone, J., 1995. Human speech recognition performance on the 1994 CSR Spoke 10 corpus. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 53-59.
- (1995) Proc. Spoken Language Systems Technology Workshop , pp. 53-59
- Ebel, W.J.¹ Picone, J.²

10
- 0001298245
- Articulation testing methods
- Fletcher, H., Steinberg, J.C., 1929. Articulation testing methods. Bell System Technical J. 8, 806-854.
- (1929) Bell System Technical J. , vol.8 , pp. 806-854
- Fletcher, H.¹ Steinberg, J.C.²

11
- 85016587886
- SWITCHBOARD: Telephone speech corpus for research and development
- Godfrey, J.J., Holliman, E.G., McDaniel, J., 1992. SWITCHBOARD: Telephone speech corpus for research and development. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 517-520.
- (1992) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. , pp. 517-520
- Godfrey, J.J.¹ Holliman, E.G.² McDaniel, J.³

12
- 0346126988
- Robust speech recognition in noise - performance of the IBM continuous speech recognizer on the ARPA noise spoke task
- Morgan Kaufmann, Austin, TX
- Gopinath, R.A., Gales, M., Gopalakrishnan, P.S., Balakrishnan-Aiyer, S., Picheny, M.A., 1995. Robust speech recognition in noise - Performance of the IBM Continuous Speech Recognizer on the ARPA noise spoke task. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 127-130.
- (1995) Proc. Spoken Language Systems Technology Workshop , pp. 127-130
- Gopinath, R.A.¹ Gales, M.² Gopalakrishnan, P.S.³ Balakrishnan-Aiyer, S.⁴ Picheny, M.A.⁵

13
- 0028320180
- An experiment in spoken language acquisition
- Gorin, A.L., Levinson, S.E., Sankar, A., 1994. An experiment in spoken language acquisition. IEEE Trans. Speech Audio Process. 2, 224-240.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 224-240
- Gorin, A.L.¹ Levinson, S.E.² Sankar, A.³

14
- 0026374868
- Improved acoustic modeling with the sphinx speech recognition system
- Huang, X.D., Lee, K.F., Hon, H.W., Hwang, M.Y., 1991. Improved acoustic modeling with the SPHINX Speech Recognition System. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 345-348.
- (1991) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. , pp. 345-348
- Huang, X.D.¹ Lee, K.F.² Hon, H.W.³ Hwang, M.Y.⁴

15
- 0022150487
- The development of an experimental discrete dictation recognizer
- Jelinek, F., 1985. The development of an experimental discrete dictation recognizer. Proc. IEEE 73, 1616-1624.
- (1985) Proc. IEEE , vol.73 , pp. 1616-1624
- Jelinek, F.¹

16
- 0041107346
- Adaptability to differences between talkers in Japanese monosyllabic perception
- Tohkura, Y., Vatikiotis-Bateson, E., Sagisaka, Y. (Eds.), IOS Press, Amsterdam
- Kakehi, K., 1992. Adaptability to differences between talkers in Japanese monosyllabic perception. In: Tohkura, Y., Vatikiotis-Bateson, E., Sagisaka, Y. (Eds.), Speech Perception, Production and Linguistic Structure. IOS Press, Amsterdam, pp. 135-142.
- (1992) Speech Perception, Production and Linguistic Structure , pp. 135-142
- Kakehi, K.¹

17
- 0001760126
- Speech bandwidth compression through spectrum selection
- Kryter, K.D., 1960. Speech bandwidth compression through spectrum selection. J. Acoust. Soc. Amer. 32, 547-556.
- (1960) J. Acoust. Soc. Amer. , vol.32 , pp. 547-556
- Kryter, K.D.¹

18
- 0343125611
- Design of the 1994 CSR Benchmark Tests
- Morgan Kaufmann, Austin, TX
- Kubala, F., 1995. Design of the 1994 CSR Benchmark Tests. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 41-46.
- (1995) Proc. Spoken Language Systems Technology Workshop , pp. 41-46
- Kubala, F.¹

19
- 30244476594
- SWITCHBOARD: A user's manual, catalog number LDC94s7
- University of Pennsylvania, Philadelphia, PA
- LDC, 1995. SWITCHBOARD: A User's Manual, Catalog Number LDC94S7. Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA.
- (1995) Linguistic Data Consortium

20
- 0003770715
- Kluwer, Boston, MA
- Lee, K.F., 1989. Automatic Speech Recognition. Kluwer, Boston, MA.
- (1989) Automatic Speech Recognition
- Lee, K.F.¹

21
- 0021226391
- A database for speaker-independent digit recognition
- Leonard, R.G., 1984. A database for speaker-independent digit recognition, Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 42.11.1-42.11.4.
- (1984) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. , pp. 42111-42114
- Leonard, R.G.¹

22
- 0012778209
- Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech
- Licklider, J.C.R., Pollack, I., 1948. Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech. J. Acoust. Soc. Amer. 20, 42-51.
- (1948) J. Acoust. Soc. Amer. , vol.20 , pp. 42-51
- Licklider, J.C.R.¹ Pollack, I.²

23
- 0029754956
- Accurate consonant perception without mid-frequency speech energy
- Lippmann, R.P., 1996. Accurate consonant perception without mid-frequency speech energy. IEEE Trans. Speech Audio Process. 4, 66-69.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 66-69
- Lippmann, R.P.¹

24
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- Lippmann, R.P., Martin, E.A., 1987. Multi-style training for robust isolated-word speech recognition. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 705-708.
- (1987) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process. , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.²

25
- 0019533542
- A study of multichannel amplitude compression and linear amplification for persons with sensorineural hearing loss
- Lippmann, R.P., Braida, L.D., Durlach, N.I., 1981. A study of multichannel amplitude compression and linear amplification for persons with sensorineural hearing loss. J. Acoust. Soc. Amer. 69, 524-534.
- (1981) J. Acoust. Soc. Amer. , vol.69 , pp. 524-534
- Lippmann, R.P.¹ Braida, L.D.² Durlach, N.I.³

26
- 0029748334
- Speech recognition on mandarin call home: A large-vocabulary, conversational, and telephone speech corpus
- Liu, F.-H., Picheny, M., Srinivasa, P., Monkowski, M., Chen, J., 1996. Speech recognition on Mandarin Call Home: A large-vocabulary, conversational, and telephone speech corpus. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 157-160.
- (1996) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process , pp. 157-160
- Liu, F.-H.¹ Picheny, M.² Srinivasa, P.³ Monkowski, M.⁴ Chen, J.⁵

27
- 30244488488
- Personal communication
- A. Martin, 1996. Personal communication.
- (1996)
- Martin, A.¹

28
- 0002834586
- Decision units in the perception of speech
- Miller, G.A., 1962. Decision units in the perception of speech. Institute of Radio Engineers Transactions on Information Theory 8, 81-83.
- (1962) Institute of Radio Engineers Transactions on Information Theory , vol.8 , pp. 81-83
- Miller, G.A.¹

29
- 0004173371
- Freeman, New York
- Miller, G.A., 1991. The Science of Words. Freeman, New York.
- (1991) The Science of Words
- Miller, G.A.¹

30
- 6744229564
- DARPA resource management and ATIS benchmark test poster session
- Morgan Kaufmann, Austin, TX
- Pallett, D.S., 1991. DARPA resource management and ATIS benchmark test poster session. Proc. DARPA Speech and Natural Language Workshop. Morgan Kaufmann, Austin, TX, pp. 49-58.
- (1991) Proc. DARPA Speech and Natural Language Workshop , pp. 49-58
- Pallett, D.S.¹

31
- 0012316245
- 1994 benchmark tests for the ARPA spoken language program
- Morgan Kaufmann, Austin, TX
- Pallett, D.S., Fiscus, J.G., et al., 1995. 1994 benchmark tests for the ARPA Spoken Language Program. Proc. Spoken Language Systems Technology Workshop. Morgan Kaufmann, Austin, TX, pp. 5-36.
- (1995) Proc. Spoken Language Systems Technology Workshop , pp. 5-36
- Pallett, D.S.¹ Fiscus, J.G.²

32
- 0012330750
- The design for the Wall Street Journal-based CSR corpus
- Morgan Kaufmann, Austin, TX
- Paul, D., Baker, J., 1992. The design for the Wall Street Journal-based CSR corpus. Proc. DARPA Speech and Natural Language Workshop. Morgan Kaufmann, Austin, TX, pp. 357-360.
- (1992) Proc. DARPA Speech and Natural Language Workshop , pp. 357-360
- Paul, D.¹ Baker, J.²

33
- 0029725921
- Improvements in switchboard recognition and topic identification
- Peskin, B., Connolly, S., Gillick, L., Lowe, S., McAllaster, D., Nagesha, V., Van Mulbregt, P., Wegmann, S., 1996. Improvements in Switchboard recognition and topic identification. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 303-306.
- (1996) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process , pp. 303-306
- Peskin, B.¹ Connolly, S.² Gillick, L.³ Lowe, S.⁴ McAllaster, D.⁵ Nagesha, V.⁶ Van Mulbregt, P.⁷ Wegmann, S.⁸

34
- 84964176674
- The intelligibility of excerpts from conversation
- Pollack, I., Pickett, J.M., 1963. The intelligibility of excerpts from conversation. Language and Speech 6, 165-171.
- (1963) Language and Speech , vol.6 , pp. 165-171
- Pollack, I.¹ Pickett, J.M.²

35
- 30244448312
- How humans perform on a connected-digits data base
- Pols, L.C.W., 1982. How humans perform on a connected-digits data base. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 867-870.
- (1982) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process , pp. 867-870
- Pols, L.C.W.¹

36
- 0023776398
- The DARPA 1000-word resource management database for continuous speech recognition
- Price, P., Fisher, W.M., Bernstein, 3., Pallett, D.S., 1988. The DARPA 1000-word resource management database for continuous speech recognition. Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 651-654.
- (1988) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process , pp. 651-654
- Price, P.¹ Fisher, W.M.² Bernstein, B.³ Pallett, D.S.⁴

37
- 0012327349
- Specification of the 1995 ARPA Hub 3 evaluation: Unlimited vocabulary NAB news baseline
- Morgan Kaufmann, Harriman, NY
- Stern, R.M., 1996. Specification of the 1995 ARPA Hub 3 evaluation: Unlimited vocabulary NAB news baseline. Proc. Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 5-7.
- (1996) Proc. Speech Recognition Workshop , pp. 5-7
- Stern, R.M.¹

38
- 0040262071
- Human benchmarks for speaker independent large vocabulary recognition performance
- Madrid
- Van Leeuwen, D.A., Van den Berg, L.G., Steeneken, H.J.M., 1995. Human benchmarks for speaker independent large vocabulary recognition performance. Eurospeech, Madrid, pp. 1461-1464.
- (1995) Eurospeech , pp. 1461-1464
- Van Leeuwen, D.A.¹ Van Den Berg, L.G.² Steeneken, H.J.M.³

39
- 0027623210
- Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- Varga, A., Steeneken, H.J.M., 1993. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication 12 (3), 247-251.
- (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-251
- Varga, A.¹ Steeneken, H.J.M.²

40
- 0014346711
- Relation between intelligibility scores for four test methods and three types of speech distortion
- Williams, C.E., Hecker, M.H.L., 1968. Relation between intelligibility scores for four test methods and three types of speech distortion. J. Acoust. Soc. Amer. 44 (4), 1002-1006.
- (1968) J. Acoust. Soc. Amer. , vol.44 , Issue.4 , pp. 1002-1006
- Williams, C.E.¹ Hecker, M.H.L.²

41
- 0002452931
- The HTK large vocabulary recognition system for the 1995 ARPA H3 task
- Morgan Kaufmann, Harriman, NY
- Woodland, P.C., Gales, M.J.F., Pye, D., Valtchev, V., 1996. The HTK large vocabulary recognition system for the 1995 ARPA H3 Task. Proc. Speech Recognition Workshop. Morgan Kaufmann, Harriman, NY, pp. 99-104.
- (1996) Proc. Speech Recognition Workshop , pp. 99-104
- Woodland, P.C.¹ Gales, M.J.F.² Pye, D.³ Valtchev, V.⁴

42
- 0030244826
- A review of large-vocabulary continuous-speech recognition
- Young, S.J., 1996. A review of large-vocabulary continuous-speech recognition. IEEE Signal Process. Mag. 13, 45-57.
- (1996) IEEE Signal Process. Mag. , vol.13 , pp. 45-57
- Young, S.J.¹

43
- 0028516923
- Spontaneous speech recognition for the credit card corpus using the HTK toolkit
- Young, S.J., Woodland, P.C., Byrne, W.J., 1994. Spontaneous speech recognition for the credit card corpus using the HTK toolkit. IEEE Trans. Speech Audio Process. 2 (4), 615-621.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 615-621
- Young, S.J.¹ Woodland, P.C.² Byrne, W.J.³

44
- 85121123643
- The MIT SUMMIT speech recognition system: A progress report
- Morgan Kaufmann, Philadelphia, PA
- Zue, V., Glass, J., Phillips, M., Seneff, S., 1989. The MIT SUMMIT speech recognition system: A progress report. Proc. DARPA Speech and Natural Language Workshop. Morgan Kaufmann, Philadelphia, PA, pp. 179-189.
- (1989) Proc. DARPA Speech and Natural Language Workshop , pp. 179-189
- Zue, V.¹ Glass, J.² Phillips, M.³ Seneff, S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.