SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 18, Issue 3, 1996, Pages 205-231

Towards increasing speech recognition error rates

(3) Bourlard, Hervé a,b Hermansky, Hynek b,d Morgan, Nelson a,c

a INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

b FACULTÉ POLYTECHNIQUE DE MONS (Belgium)

c UNIVERSITY OF CALIFORNIA (United States)

d OREGON HEALTH AND SCIENCE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AUDITION; DECODING; ERROR CORRECTION; FEATURE EXTRACTION; SPEECH PROCESSING;

AUTOMATIC SPEECH RECOGNITION; WORD ERROR RATE;

SPEECH RECOGNITION;

EID: 0030142722 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/0167-6393(96)00003-9 Document Type: Article

Times cited : (79)

References (106)

1
- 0028996998
- Rapid speaker adaptation using model prediction
- S.M. Ahadi and P.C. Woodland (1995), "Rapid speaker adaptation using model prediction", Proc. IEEE Internat. Conf. Acoust. Speech, Detroit, MI, pp. 684-687.
- (1995) Proc. IEEE Internat. Conf. Acoust. Speech, Detroit, Mi , pp. 684-687
- Ahadi, S.M.¹ Woodland, P.C.²

2
- 0013864705
- Click-evoked response patterns of single units in the medial geniculate body of the cat
- L. Aitkin, C. Dunlop and W. Webster (1966), "Click-evoked response patterns of single units in the medial geniculate body of the cat", J. Neurophysiology, Vol. 29, pp. 109-123.
- (1966) J. Neurophysiology , vol.29 , pp. 109-123
- Aitkin, L.¹ Dunlop, C.² Webster, W.³

3
- 0027167185
- A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
- K. Aikawa, H. Singer, H. Kawahara and Y. Tohkura (1993), "A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Minneapolis, MN, pp. II-668-671.
- (1993) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Minneapolis, Mn
- Aikawa, K.¹ Singer, H.² Kawahara, H.³ Tohkura, Y.⁴

4
- 0028516073
- How do humans process and recognize speech?
- J.B. Allen (1994), "How do humans process and recognize speech?", IEEE Trans. Speech Audio Process., Vol. 2, No. 4, pp. 567-577.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 567-577
- Allen, J.B.¹

5
- 85079105100
- Adaptation to new microphones using tied-mixture normalization
- A. Anastasakos, F. Kubala, J. Makhoul and R. Schwartz (1994), "Adaptation to new microphones using tied-mixture normalization", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Adelaide, Australia, pp. I-433-437.
- (1994) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Adelaide, Australia
- Anastasakos, A.¹ Kubala, F.² Makhoul, J.³ Schwartz, R.⁴

6
- 0026368826
- Regression features for recognition of speech in quiet and in noise
- T.H. Applebaum and B.A. Hanson (1991), "Regression features for recognition of speech in quiet and in noise", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Toronto, Canada, pp. 985-989.
- (1991) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Toronto, Canada , pp. 985-989
- Applebaum, T.H.¹ Hanson, B.A.²

7
- 0025692780
- Automatic detection of new words in a large vocabulary continuous speech recognition system
- A. Asadi, R. Schwartz and J. Makhoul (1990), "Automatic detection of new words in a large vocabulary continuous speech recognition system", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM, pp. 125-128.
- (1990) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Albuquerque, Nm , pp. 125-128
- Asadi, A.¹ Schwartz, R.² Makhoul, J.³

8
- 0012267743
- Improving state-of-the-art continuous speech recognition systems using the N-best paradigm with neural networks
- Morgan Kaufmann, Los Altos, CA
- S. Austin, G. Zavaliagkos, J. Makhoul and J. Schwartz (1992), "Improving state-of-the-art continuous speech recognition systems using the N-best paradigm with neural networks", Proc. DARPA Speech and Natural Language Workshop, Harriman, NY (Morgan Kaufmann, Los Altos, CA), pp. 180-184.
- (1992) Proc. DARPA Speech and Natural Language Workshop, Harriman, Ny , pp. 180-184
- Austin, S.¹ Zavaliagkos, G.² Makhoul, J.³ Schwartz, J.⁴

9
- 0023776396
- A new algorithm for the estimation of hidden Markov models
- L.R. Bahl, P.P. Brown, P.V. de Souza and R.L. Mercer (1988), "A new algorithm for the estimation of hidden Markov models", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., New York, NY, pp. 493-496.
- (1988) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., New York, Ny , pp. 493-496
- Bahl, L.R.¹ Brown, P.P.² De Souza, P.V.³ Mercer, R.L.⁴

10
- 0016663359
- The Dragon system - An overview
- J.K. Baker (1975), "The Dragon system - An overview", IEEE Trans. Acoust. Speech Signal Process., Vol. ASSP-23, No. 1, pp. 24-29.
- (1975) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-23 , Issue.1 , pp. 24-29
- Baker, J.K.¹

11
- 0026835134
- Global optimization of a neural-hidden Markov model hybrid
- Y.R. Bengio, R. De Mori, G. Flammia and R. Kompe (1992), "Global optimization of a neural-hidden Markov model hybrid", IEEE Trans. Neural Networks, Vol. 3, pp. 252-258.
- (1992) IEEE Trans. Neural Networks , vol.3 , pp. 252-258
- Bengio, Y.R.¹ De Mori, R.² Flammia, G.³ Kompe, R.⁴

12
- 0000971250
- An input output HMM architecture
- ed. by G. Tesauro, D. Touretzky and T. Leen, MIT Press, Cambridge, MA
- Y. Bengio and P. Frasconi (1995), "An input output HMM architecture", in Advances in Neural Information Processing Systems, ed. by G. Tesauro, D. Touretzky and T. Leen, Vol. 7 (MIT Press, Cambridge, MA).
- (1995) Advances in Neural Information Processing Systems , vol.7
- Bengio, Y.¹ Frasconi, P.²

13
- 30244436756
- Personal communication
- A. Bounds (1995), Personal communication.
- (1995)
- Bounds, A.¹

14
- 0025547193
- Links between Markov models and multilayer perceptrons
- H. Bourlard and C.J. Wellekens (1990), "Links between Markov models and multilayer perceptrons", IEEE Trans. Pattern Anal. Machine Intell., Vol. 12, No. 12, pp. 1167-1178.
- (1990) IEEE Trans. Pattern Anal. Machine Intell. , vol.12 , Issue.12 , pp. 1167-1178
- Bourlard, H.¹ Wellekens, C.J.²

15
- 0003573244
- Kluwer Academic Publishers, Dordrecht
- H. Bourlard and N. Morgan (1994), Connectionist Speech Recognition - A Hybrid Approach (Kluwer Academic Publishers, Dordrecht).
- (1994) Connectionist Speech Recognition - A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

16
- 0343975592
- REMAP: Recursive estimation and maximization of a posteriori probabilities - Application to transition-based connectionist speech recognition
- Internat. Computer Science Institute, CA
- H. Bourlard, Y. Konig and N. Morgan (1994), REMAP: Recursive estimation and maximization of a posteriori probabilities - Application to transition-based connectionist speech recognition, ICSI Technical Report TR94-064, Internat. Computer Science Institute, CA.
- (1994) ICSI Technical Report TR94-064
- Bourlard, H.¹ Konig, Y.² Morgan, N.³

17
- 85102488792
- REMAP: Recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition
- H. Bourlard, Y. Konig and N. Morgan (1995), "REMAP: recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition", Proc. Eurospeech '95, Madrid, Spain.
- (1995) Proc. Eurospeech '95, Madrid, Spain
- Bourlard, H.¹ Konig, Y.² Morgan, N.³

18
- 0347387977
- An experimental automatic word recognition system
- Ruislip, England: Joint Speech Research Unit
- J.S. Bridle and M.D. Brown (1974), An experimental automatic word recognition system, JSRU Report No. 1003, Ruislip, England: Joint Speech Research Unit.
- (1974) JSRU Report No. 1003 , vol.1003
- Bridle, J.S.¹ Brown, M.D.²

19
- 30244499021
- Personal communication
- J.S. Bridle (1995), Personal communication.
- (1995)
- Bridle, J.S.¹

20
- 0003572996
- PhD Thesis, Computer Science Department, Carnegie Mellon University
- P. Brown, The acoustic-modelling problem in automatic speech recognition, PhD Thesis, Computer Science Department, Carnegie Mellon University.
- The Acoustic-modelling Problem in Automatic Speech Recognition
- Brown, P.¹

21
- 0024392496
- Application of an auditory model to speech recognition
- J.R. Cohen (1989), "Application of an auditory model to speech recognition", J. Acoust. Soc. Amer., Vol. 85, No. 6, pp. 2623-2629.
- (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.6 , pp. 2623-2629
- Cohen, J.R.¹

22
- 30244466999
- Informal communication
- J.R. Cohen (1995), Informal communication.
- (1995)
- Cohen, J.R.¹

23
- 30244444358
- Hybrid neural network/hidden Markov model continuous speech recognition
- M. Cohen, H. Franco, N. Morgan, D. Rumelhart and V. Abrash (1992), "Hybrid neural network/hidden Markov model continuous speech recognition", Proc. Internat. Conf. Speech Language Processing, Banff, Canada, pp. 915-918.
- (1992) Proc. Internat. Conf. Speech Language Processing, Banff, Canada , pp. 915-918
- Cohen, M.¹ Franco, H.² Morgan, N.³ Rumelhart, D.⁴ Abrash, V.⁵

24
- 0021906779
- Central auditory processing of peripheral vowel spectra
- L.A. Chistovich (1985), "Central auditory processing of peripheral vowel spectra", J. Acoust. Soc. Amer., Vol. 77, pp. 789-805.
- (1985) J. Acoust. Soc. Amer. , vol.77 , pp. 789-805
- Chistovich, L.A.¹

25
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S.B. Davis and P. Mermelstein (1980), "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences", IEEE Trans. Acoust. Speech Signal Process., Vol. 28, No. 4, pp. 357-366.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

26
- 30244481757
- Incorporating the time correlation between successive observations in an acoustic-phonetic hidden Markov model for continuous speech recognition
- P. de La Noue, S. Levinson and M. Sondhi (1989), Incorporating the time correlation between successive observations in an acoustic-phonetic hidden Markov model for continuous speech recognition, AT&T Technical Memorandum No. 11226.
- (1989) AT&T Technical Memorandum No. 11226
- De La Noue, P.¹ Levinson, S.² Sondhi, M.³

27
- 30244553472
- Adaptive language modelling using minimum discriminant estimation
- S. Della Pietra, V. Della Pietra, R.L. Mercer and S. Roukos (1992), "Adaptive language modelling using minimum discriminant estimation", Proc. DARPA Speech and Natural Language Workshop, Harriman, NY, pp. 103-106.
- (1992) Proc. DARPA Speech and Natural Language Workshop, Harriman, NY , pp. 103-106
- Pietra, S.D.¹ Della Pietra, V.² Mercer, R.L.³ Roukos, S.⁴

28
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A.P. Dempster, N.M. Laird and D.B. Rubin (1977), "Maximum likelihood from incomplete data via the EM algorithm", J. Roy. Statist. Soc., Vol. 39, pp. 1-38.
- (1977) J. Roy. Statist. Soc. , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

29
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Aksmanovic, X. Sun and C. Wu (1994), "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states", IEEE Trans. Speech Audio Process., Vol. 2, No. 4, pp. 507-520.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, X.³ Wu, C.⁴

30
- 0024905808
- Phonetically sensitive discriminants for improved speech recognition
- G. Doddington (1989), "Phonetically sensitive discriminants for improved speech recognition", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Glasgow, Scotland, pp. 556-559.
- (1989) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Glasgow, Scotland , pp. 556-559
- Doddington, G.¹

31
- 0012082151
- PhD Thesis, MIT
- P. Duchnowski (1993), A new structure for automatic speech recognition, PhD Thesis, MIT.
- (1993) A New Structure for Automatic Speech Recognition
- Duchnowski, P.¹

32
- 0003772896
- Effects of emphasizing transitional or stationary parts of the speech signal in a discrete utterance recognition system
- K. Elenius and M. Blomberg (1982), "Effects of emphasizing transitional or stationary parts of the speech signal in a discrete utterance recognition system", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Paris, France, pp. 535-537.
- (1982) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Paris, France , pp. 535-537
- Elenius, K.¹ Blomberg, M.²

33
- 0027266580
- City name recognition over the telephone
- M. Fanty, P. Schmid and R. Cole (1993), "City name recognition over the telephone", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Minneapolis, MN, pp. I-549-552.
- (1993) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Minneapolis, MN
- Fanty, M.¹ Schmid, P.² Cole, R.³

34
- 0025594074
- Connectionist Viterbi training: A new hybrid method for continuous speech recognition
- M.A. Franzini, K.F. Lee and A. Waibel (1990), "Connectionist Viterbi training: A new hybrid method for continuous speech recognition", IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM, pp. 425-428.
- (1990) IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM , pp. 425-428
- Franzini, M.A.¹ Lee, K.F.² Waibel, A.³

35
- 0003549684
- Krieger, New York
- H. Fletcher (1953), Speech and Hearing in Communication (Krieger, New York).
- (1953) Speech and Hearing in Communication
- Fletcher, H.¹

36
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- S. Furui (1981), "Cepstral analysis technique for automatic speaker verification", IEEE Trans. Acoust. Speech Signal Process., Vol. 29, pp. 254-272.
- (1981) IEEE Trans. Acoust. Speech Signal Process. , vol.29 , pp. 254-272
- Furui, S.¹

37
- 0022667694
- Speaker independent isolated word recognizer using dynamic features of speech spectrum
- S. Furui (1986), "Speaker independent isolated word recognizer using dynamic features of speech spectrum", IEEE Trans. Acoust. Speech Signal Process., Vol. 34, No. 1, pp. 52-59.
- (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , Issue.1 , pp. 52-59
- Furui, S.¹

38
- 0027578207
- Hidden Markov-models with templates as non-stationary states: An application to speech recognition
- O. Ghitza and M.M. Sondhi (1993), "Hidden Markov-models with templates as non-stationary states: An application to speech recognition", Computer Speech and Language, Vol. 2, pp. 101-119.
- (1993) Computer Speech and Language , vol.2 , pp. 101-119
- Ghitza, O.¹ Sondhi, M.M.²

39
- 0025671510
- A probabilistic approach to the understanding and training of neural network classifiers
- H. Gish (1990), "A probabilistic approach to the understanding and training of neural network classifiers", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM, pp. 1361-1364.
- (1990) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM , pp. 1361-1364
- Gish, H.¹

40
- 0001373576
- Maximum entropy for hypothesis formulation, especially for multidimensional contingency tables
- I.J. Good (1963), "Maximum entropy for hypothesis formulation, especially for multidimensional contingency tables", Ann. Math. Statist., Vol. 34, pp. 911-934.
- (1963) Ann. Math. Statist. , vol.34 , pp. 911-934
- Good, I.J.¹

41
- 0004412839
- PhD Thesis, Oxford, UK
- G.G.R. Green (1976), Temporal aspects of audition, PhD Thesis, Oxford, UK.
- (1976) Temporal Aspects of Audition
- Green, G.G.R.¹

42
- 28844440746
- The representation of speech in the auditory periphery
- S. Greenberg (1988), "The representation of speech in the auditory periphery", J. Phonetics, Vol. 16, pp. 1-151.
- (1988) J. Phonetics , vol.16 , pp. 1-151
- Greenberg, S.¹

43
- 0023829165
- Decoder selection based on cross-entropies
- P.S. Gopalakrishnan, D. Kanecsky, A. Nada, D. Nahamoo and M.A. Picheny (1988), "Decoder selection based on cross-entropies", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., New York, NY, pp. 20-23.
- (1988) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., New York, NY , pp. 20-23
- Gopalakrishnan, P.S.¹ Kanecsky, D.² Nada, A.³ Nahamoo, D.⁴ Picheny, M.A.⁵

44
- 0010603079
- PhD Thesis, MIT
- W.D. Goldenthal (1994), Statistical trajectory models for phonetic recognition, PhD Thesis, MIT.
- (1994) Statistical Trajectory Models for Phonetic Recognition
- Goldenthal, W.D.¹

45
- 0027239233
- Improvements in connected digit recognition using linear discriminant analysis and mixture densities
- R. Haeb-Umbach, D. Geller and H. Ney (1994), "Improvements in connected digit recognition using linear discriminant analysis and mixture densities", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Adelaide, Australia, pp. II-239-242.
- (1994) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Adelaide, Australia
- Haeb-Umbach, R.¹ Geller, D.² Ney, H.³

46
- 0028996932
- Methods for improved speech recognition over telephone line
- A. Hauenstein and E. Marschall (1995), "Methods for improved speech recognition over telephone line", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI. pp. 425-428.
- (1995) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI. , pp. 425-428
- Hauenstein, A.¹ Marschall, E.²

47
- 0021122763
- The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence in interfering speech
- B. Hanson and D. Wong (1984), "The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence in interfering speech", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., pp. 18.A.5.1-18.A.5.4.
- (1984) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process.
- Hanson, B.¹ Wong, D.²

48
- 0023167028
- An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception
- H. Hermansky (1987), "An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Dallas, TX, pp. 1159-1162.
- (1987) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Dallas, TX , pp. 1159-1162
- Hermansky, H.¹

49
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- H. Hermansky (1990), "Perceptual linear predictive (PLP) analysis of speech", J. Acoust. Soc. Amer., Vol. 87, No. 4, pp. 1738-1752.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

50
- 0348018654
- Exploring temporal domain for robustness in speech recognition
- H. Hermansky (1995), "Exploring temporal domain for robustness in speech recognition", Proc. 15th Internat. Congress on Acoustics, Trondheim, Norway, Vol. II., pp. 61-64.
- (1995) Proc. 15th Internat. Congress on Acoustics, Trondheim, Norway , vol.2 , pp. 61-64
- Hermansky, H.¹

51
- 0020542318
- Analysis and synthesis of speech based on spectral transform linear predictive method
- H. Hermansky, H. Fujisaki and Y. Sato (1983), "Analysis and synthesis of speech based on spectral transform linear predictive method", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Boston, MA, pp. 777-780,
- (1983) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Boston, MA , pp. 777-780
- Hermansky, H.¹ Fujisaki, H.² Sato, Y.³

52
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan (1994), "'RASTA processing of speech", IEEE Trans. Speech Audio Process., Vol. 2, No. 4, pp. 578-589.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

53
- 0028996922
- Speech enhancement based on temporal processing
- H. Hermansky, E. Wan and C. Avendano (1995), "Speech enhancement based on temporal processing", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI, pp. 405-408.
- (1995) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI , pp. 405-408
- Hermansky, H.¹ Wan, E.² Avendano, C.³

54
- 0028996853
- Recent improvements to the ABBOT large vocabulary CSR system
- M.M. Hochberg, S.J. Renais, A.J. Robinson and G.D. Cook (1995), "Recent improvements to the ABBOT large vocabulary CSR system", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI, pp. 69-72.
- (1995) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI , pp. 69-72
- Hochberg, M.M.¹ Renais, S.J.² Robinson, A.J.³ Cook, G.D.⁴

55
- 0024905238
- A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
- M. Hunt and C. Lefebvre (1989), "A comparison of several acoustic representations for speech recognition with degraded and undegraded speech", Internat. Conf. Acoust. Speech Signal Process., Glasgow, Scotland, pp. 262-265.
- (1989) Internat. Conf. Acoust. Speech Signal Process., Glasgow, Scotland , pp. 262-265
- Hunt, M.¹ Lefebvre, C.²

56
- 0037662539
- Automatic formant extraction utilizing mel scale and equal loudness contour
- S. Itahashi and S. Yokoyama (1976). "Automatic formant extraction utilizing mel scale and equal loudness contour", Internat. Conf. Acoust. Speech Signal Process., Philadelphia, PA, pp. 310-313.
- (1976) Internat. Conf. Acoust. Speech Signal Process., Philadelphia, PA , pp. 310-313
- Itahashi, S.¹ Yokoyama, S.²

57
- 0026626584
- Speaker independent phonetic classification in continuous English letters
- R.D.T. Janseen, M. Fanty and R.A. Cole (1991), "Speaker independent phonetic classification in continuous English letters", Proc, Internat. Joint Conf. on Neural Networks, Seattle, WA, pp. II-801-808.
- (1991) Proc, Internat. Joint Conf. on Neural Networks, Seattle, WA
- Janseen, R.D.T.¹ Fanty, M.² Cole, R.A.³

58
- 0016507833
- Design of a linguistic statistical decoder for the recognition of continuous speech
- F. Jelinek, L.R. Bahl and R.L. Mercer (1975), "Design of a linguistic statistical decoder for the recognition of continuous speech", IEEE Trans. Information Theory, Vol. IT-21, pp. 250-256.
- (1975) IEEE Trans. Information Theory , vol.IT-21 , pp. 250-256
- Jelinek, F.¹ Bahl, L.R.² Mercer, R.L.³

59
- 0016939124
- Continuous speech recognition by statistical methods
- F. Jelinek (1976), "Continuous speech recognition by statistical methods", IEEE Proc., Vol. 64, No. 4, pp. 532-556.
- (1976) IEEE Proc. , vol.64 , Issue.4 , pp. 532-556
- Jelinek, F.¹

60
- 0012357341
- A dynamic language model for speech recognition
- F. Jelinek, B. Merialdo, S. Roukos, and M. Strauss (1991), "A dynamic language model for speech recognition", Proc. DARPA Speech and Nautral Language Workshop, Pacific Grove, CA, pp. 293-295.
- (1991) Proc. DARPA Speech and Nautral Language Workshop, Pacific Grove, CA , pp. 293-295
- Jelinek, F.¹ Merialdo, B.² Roukos, S.³ Strauss, M.⁴

61
- 0000262562
- Hierarchical mixtures of experts and the EM algorithm
- M.I. Jordan and R.A. Jacobs (1994), "Hierarchical mixtures of experts and the EM algorithm", Neural Computation, Vol. 6, pp. 181-214.
- (1994) Neural Computation , vol.6 , pp. 181-214
- Jordan, M.I.¹ Jacobs, R.A.²

62
- 0022270364
- Mixture autoregressive hidden Markov models for speech signals
- B.H. Juang and L.R. Rabiner (1985), "Mixture autoregressive hidden Markov models for speech signals", IEEE Trans. Acoust. Speech Signal Process., Vol. 33, No. 6, pp. 1404-14013.
- (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.33 , Issue.6 , pp. 1404-14013
- Juang, B.H.¹ Rabiner, L.R.²

63
- 0003919964
- Vocal tract normalization in speech recognition: Compensation for systematic speaker variability
- T. Kamm, A.G. Andreou and J. Cohen (1995), "Vocal tract normalization in speech recognition: Compensation for systematic speaker variability", Proc. 15th Annual Speech Research Symposium, Johns Hopkins University, Baltimore, MI, pp. 175-179.
- (1995) Proc. 15th Annual Speech Research Symposium, Johns Hopkins University, Baltimore, MI , pp. 175-179
- Kamm, T.¹ Andreou, A.G.² Cohen, J.³

64
- 0026271562
- New discriminative training algorithms based on the generalized probabilistic descent method
- edited by B.H. Juang, S.Y. Kung and C.A. Kamm (Morgan Kauffman, Los Altos, CA)
- S. Katagiri, C.H. Lee and B.H. Juang (1991), "New discriminative training algorithms based on the generalized probabilistic descent method", in Proc. IEEE Workshop on Neural Networks for Signal Process., edited by B.H. Juang, S.Y. Kung and C.A. Kamm (Morgan Kauffman, Los Altos, CA), pp. 299-308.
- (1991) Proc. IEEE Workshop on Neural Networks for Signal Process. , pp. 299-308
- Katagiri, S.¹ Lee, C.H.² Juang, B.H.³

65
- 0001490199
- Speech processing strategies based on auditory models
- ed. by R. Carlson and B. Granstrom (Elsevier - Biomedical Press, New York)
- D.H. Klatt (1982), "Speech processing strategies based on auditory models", in The Representation of Speech in the Peripheral Auditory System, ed. by R. Carlson and B. Granstrom (Elsevier - Biomedical Press, New York), pp. 181-202.
- (1982) The Representation of Speech in the Peripheral Auditory System , pp. 181-202
- Klatt, D.H.¹

66
- 0026142334
- A study on speaker adaptation of the parameters of continuous density hidden Markov models
- C-H. Lee, C-H. Lin and B-H. Juang (1991), "A study on speaker adaptation of the parameters of continuous density hidden Markov models", IEEE Trans. Signal Process., Vol. 39, No. 4, pp. 806-814.
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 806-814
- Lee, C.-H.¹ Lin, C.-H.² Juang, B.-H.³

67
- 0027269171
- Hidden control neural architecture modeling of nonlinear time varying systems and its applications
- E. Levin (1993), "Hidden control neural architecture modeling of nonlinear time varying systems and its applications", IEEE Trans. Neural Networks, Vol. 4, No. 1, pp. 109-116.
- (1993) IEEE Trans. Neural Networks , vol.4 , Issue.1 , pp. 109-116
- Levin, E.¹

68
- 0018478297
- Spectral root homomorphic deconvolution system
- J.S. Lim (1979), "Spectral root homomorphic deconvolution system", IEEE Trans. Acoust. Speech Signal Process., Vol. 27, No. 3, pp. 223-233.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.3 , pp. 223-233
- Lim, J.S.¹

69
- 0020180460
- Maximum likelihood estimation for multivariate observations of Markov sources
- L.A. Liporace (1982), "Maximum likelihood estimation for multivariate observations of Markov sources", IEEE Trans. Information Theory, Vol. IT-28, No. 5, pp. 729-734.
- (1982) IEEE Trans. Information Theory , vol.IT-28 , Issue.5 , pp. 729-734
- Liporace, L.A.¹

70
- 85134601175
- Connected digit recognition using connectionist probability estimators and mixture-Gaussian densities
- D.M. Lubensky, A.O. Asadi and J.M. Naik (1994), "Connected digit recognition using connectionist probability estimators and mixture-Gaussian densities", Proc. Internat. Conf. on Spoken Language Processing, Yokohama, Japan.
- (1994) Proc. Internat. Conf. on Spoken Language Processing, Yokohama, Japan
- Lubensky, D.M.¹ Asadi, A.O.² Naik, J.M.³

71
- 0020544161
- Recognition of consonant based on the perceptron model
- S. Makino, T. Kawabata and K. Kido (1983), "Recognition of consonant based on the Perceptron model", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Boston, MA, pp. 738-741.
- (1983) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Boston, MA , pp. 738-741
- Makino, S.¹ Kawabata, T.² Kido, K.³

72
- 0038133939
- Distance measures for speech recognition, psychological and instrumental
- ed. by R.C.H. Chen (Academic Press, New York)
- P. Mermelstein (1976), "Distance measures for speech recognition, psychological and instrumental", in Pattern Recognition and Artificial Intelligence, ed. by R.C.H. Chen (Academic Press, New York), pp. 374-388.
- (1976) Pattern Recognition and Artificial Intelligence , pp. 374-388
- Mermelstein, P.¹

73
- 0025659256
- Continuous speech recognition using multilayer perceptrons with hidden Markov models
- N. Morgan and H. Bourlard (1990), "Continuous speech recognition using multilayer perceptrons with hidden Markov models", IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM, pp. 413-416.
- (1990) IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM , pp. 413-416
- Morgan, N.¹ Bourlard, H.²

74
- 0029308753
- Neural networks for statistical recognition of continuous speech
- N. Morgan and H. Bourlard (1995), "Neural networks for statistical recognition of continuous speech", Proc. IEEE, Vol. 83, No. 5, pp. 741-770.
- (1995) Proc. IEEE , vol.83 , Issue.5 , pp. 741-770
- Morgan, N.¹ Bourlard, H.²

75
- 0028996926
- Stochastic perceptual models of speech
- N. Morgan, H. Bourlard, S. Greenberg and H. Hermansky (1995), "Stochastic perceptual models of speech", IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., pp. 397-400.
- (1995) IEEE Proc. Internat. Conf. Acoust. Speech Signal Process. , pp. 397-400
- Morgan, N.¹ Bourlard, H.² Greenberg, S.³ Hermansky, H.⁴

76
- 30244554686
- Digit recognition with stochastic perceptual models
- N. Morgan, S.-L. Wu and H. Bourlard (1995), "Digit recognition with stochastic perceptual models", Proc. Eurospeech'95, Madrid, Spain, pp. 771-774.
- (1995) Proc. Eurospeech'95, Madrid, Spain , pp. 771-774
- Morgan, N.¹ Wu, S.-L.² Bourlard, H.³

77
- 0002127129
- Probabilistic optimum filtering for robust speech recognition
- L. Neumayer and M. Weintraub (1994), "Probabilistic optimum filtering for robust speech recognition". IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Adelaide, Australia, pp. I-417-420.
- (1994) IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Adelaide, Australia
- Neumayer, L.¹ Weintraub, M.²

78
- 30244483078
- Context modeling with the stochastic segment model
- M. Ostendorf, I. Bechwati and O. Kimball (1992), "Context modeling with the stochastic segment model", IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., San Francisco, CA, pp. 389-392.
- (1992) IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., San Francisco, CA , pp. 389-392
- Ostendorf, M.¹ Bechwati, I.² Kimball, O.³

79
- 0026400228
- On the interaction between true source, training, and testing language models
- D.B. Paul, J.K. Baker and J.M. Baker (1991), "On the interaction between true source, training, and testing language models", IEEE Proc. internat. Conf. Acoust. Speech Signal Process., Toronto, Canada, pp. 569-572.
- (1991) IEEE Proc. Internat. Conf. Acoust. Speech Signal Process., Toronto, Canada , pp. 569-572
- Paul, D.B.¹ Baker, J.K.² Baker, J.M.³

80
- 0007636578
- Temporal masking in automatic speech recognition
- M. Pavel and H. Hermansky (1994), "Temporal masking in automatic speech recognition", J. Acoust. Soc. Amer., Vol. 95, No. 5, pp. 2876.
- (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.5 , pp. 2876
- Pavel, M.¹ Hermansky, H.²

81
- 0014557625
- Perceptual and physical space of vowel sounds
- L.C.W. Pols, L.J.T. v.d. Kamp and R. Plomp (1969), "Perceptual and physical space of vowel sounds", J. Acoust. Soc. Amer., Vol. 46, pp. 458-467.
- (1969) J. Acoust. Soc. Amer. , vol.46 , pp. 458-467
- Pols, L.C.W.¹ Kamp, L.J.T.V.D.² Plomp, R.³

82
- 0015129120
- Real-time recognition of spoken words
- L.C.W. Pols (1971), "Real-time recognition of spoken words", IEEE Trans. Computers, Vol. 20(C), pp. 972-978.
- (1971) IEEE Trans. Computers , vol.20 , Issue.C , pp. 972-978
- Pols, L.C.W.¹

83
- 84985742249
- Linear predictive hidden Markov models and the speech signal
- A.B. Poritz (1982), "Linear predictive hidden Markov models and the speech signal", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Paris, France, pp. 1291-1294.
- (1982) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Paris, France , pp. 1291-1294
- Poritz, A.B.¹

84
- 0022879621
- On hidden Markov models in isolated word recognition
- A.B. Poritz and A.G. Richter (1986), "On hidden Markov models in isolated word recognition", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Tokyo, Japan, pp. 705-708.
- (1986) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Tokyo, Japan , pp. 705-708
- Poritz, A.B.¹ Richter, A.G.²

85
- 0021158675
- Optimal estimators for spectral restoration of noisy speech
- J.E. Porter and S.F. Boll (1984), "Optimal estimators for spectral restoration of noisy speech", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., San Diego, CA, pp. 18.A.2.1.-18.A.2.4.
- (1984) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., San Diego, CA
- Porter, J.E.¹ Boll, S.F.²

86
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L.R. Rabiner (1989), "A tutorial on hidden Markov models and selected applications in speech recognition", Proc. IEEE, Vol. 77, No. 2, pp. 257-285.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-285
- Rabiner, L.R.¹

87
- 0028996970
- Efficient search using posterior phone probability estimates
- S. Renals and M. Hochberg (1995), "Efficient search using posterior phone probability estimates", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI, pp. 596-599.
- (1995) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Detroit, MI , pp. 596-599
- Renals, S.¹ Hochberg, M.²

88
- 0001595997
- Neural network classifiers estimate Bayesian a posteriori probabilities
- M.D. Richard and R.P. Lippmann (1991), "Neural network classifiers estimate Bayesian a posteriori probabilities", Neural Computation, Vol. 3, pp. 461-483.
- (1991) Neural Computation , vol.3 , pp. 461-483
- Richard, M.D.¹ Lippmann, R.P.²

89
- 84956613804
- A neural network based, speaker independent, large vocabulary, continuous speech recognition system: The Wernicke project
- T. Robinson, L. Almeida, J.M. Boite, H. Bourlard, F. Fallside, M. Hochberg, D. Kershaw, P. Kohn, Y. Konig, N. Morgan, J.P. Neto, S. Renals, M. Saerens and C. Wooters (1993), "A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the Wernicke project", Proc. Eurospeech'93, Berlin, Germany, pp. 1941-1944.
- (1993) Proc. Eurospeech'93, Berlin, Germany , pp. 1941-1944
- Robinson, T.¹ Almeida, L.² Boite, J.M.³ Bourlard, H.⁴ Fallside, F.⁵ Hochberg, M.⁶ Kershaw, D.⁷ Kohn, P.⁸ Konig, Y.⁹ Morgan, N.¹⁰ Neto, J.P.¹¹ Renals, S.¹² Saerens, M.¹³ Wooters, C.¹⁴

90
- 0000329355
- A recurrent error propagation network speech recognition system
- T. Robinson and F. Fallside (1991), "A recurrent error propagation network speech recognition system", Computer Speech and Language, Vol. 5, pp. 259-274.
- (1991) Computer Speech and Language , vol.5 , pp. 259-274
- Robinson, T.¹ Fallside, F.²

91
- 84881675408
- Cepstral channel normalization techniques for HMM-based speaker verification
- A.E. Rosenberg, C. Lee and F.K. Soong (1994), "Cepstral channel normalization techniques for HMM-based speaker verification", Proc. Internat. Conf. on Spoken Language Processing, Yokohama, Japan, pp. 1835-1838.
- (1994) Proc. Internat. Conf. on Spoken Language Processing, Yokohama, Japan , pp. 1835-1838
- Rosenberg, A.E.¹ Lee, C.² Soong, F.K.³

92
- 0025629492
- The ARM continuous speech recognition system
- M.J. Russell, K.M. Ponting, S.M. Peeling, S.R. Browning, J.S. Bridle, R.K. Moore, I. Galiano and P. Howell (1990), "The ARM continuous speech recognition system", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM, pp. 69-72.
- (1990) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Albuquerque, NM , pp. 69-72
- Russell, M.J.¹ Ponting, K.M.² Peeling, S.M.³ Browning, S.R.⁴ Bridle, J.S.⁵ Moore, R.K.⁶ Galiano, I.⁷ Howell, P.⁸

93
- 85017310294
- New uses of the N-best sentence hypotheses within the BYBLOS speech recognition system
- R. Schwartz, S. Austin, F. Kubala, J. Makhoul, L. Nguyen, P. Placeway and G. Zavaliagkos (1992), "New uses of the N-best sentence hypotheses within the BYBLOS speech recognition system", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., San Francisco, CA, pp. I-1.4-1.7.
- (1992) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., San Francisco, CA
- Schwartz, R.¹ Austin, S.² Kubala, F.³ Makhoul, J.⁴ Nguyen, L.⁵ Placeway, P.⁶ Zavaliagkos, G.⁷

94
- 84928837806
- A joint synchrony/mean-rate model of auditory speech processing
- S. Seneff (1985), "A joint synchrony/mean-rate model of auditory speech processing", J. Phonetics, Vol. 16, No. 1, pp.55-76.
- (1985) J. Phonetics , vol.16 , Issue.1 , pp. 55-76
- Seneff, S.¹

95
- 84973504649
- A speech recognizer using radial basis function neural networks in an HMM framework
- E. Singer and R. Lippmann (1992), "A speech recognizer using radial basis function neural networks in an HMM framework", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., San Francisco, CA, pp. 629-632.
- (1992) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., San Francisco, CA , pp. 629-632
- Singer, E.¹ Lippmann, R.²

96
- 0039670390
- Multilingual assessment of speaker independent large vocabulary speech-recognition systems: The SQALE project (speech recognition quality assessment for language engineering)
- J.M. Steeneken and D.A. Van Leeuwen (1995), "Multilingual assessment of speaker independent large vocabulary speech-recognition systems: The SQALE project (speech recognition quality assessment for language engineering)", Proc. Eurospeech'95, Madrid, Spain, pp. 1271-1274.
- (1995) Proc. Eurospeech'95, Madrid, Spain , pp. 1271-1274
- Steeneken, J.M.¹ Van Leeuwen, D.A.²

97
- 34447546202
- On the psychophysical law
- S.S. Stevens (1957), "On the psychophysical law", Psychol. Rev., Vol. 64, No. 1, pp. 153-181.
- (1957) Psychol. Rev. , vol.64 , Issue.1 , pp. 153-181
- Stevens, S.S.¹

98
- 0011405405
- Brightness and loudness as functions of stimulus duration
- J.C. Stevens and J.W. Hall (1966), "Brightness and loudness as functions of stimulus duration", Perception and Psychophysics, pp. 319-327.
- (1966) Perception and Psychophysics , pp. 319-327
- Stevens, J.C.¹ Hall, J.W.²

99
- 0016495712
- Blind deconvolution through digital signal processing
- T. Stockham, T. Cannon and R. Ingerbretsen (1975), "Blind deconvolution through digital signal processing", Proc. IEEE, Vol. 63, pp. 678-692.
- (1975) Proc. IEEE , vol.63 , pp. 678-692
- Stockham, T.¹ Cannon, T.² Ingerbretsen, R.³

100
- 0018195604
- Memory and time improvements in a dynamic programming algorithm for matching speech patterns
- C.C. Tappert and S.K. Das (1978), "Memory and time improvements in a dynamic programming algorithm for matching speech patterns", IEEE Trans. Acoust. Speech Signal Process., Vol. 26, pp. 583-586.
- (1978) IEEE Trans. Acoust. Speech Signal Process. , vol.26 , pp. 583-586
- Tappert, C.C.¹ Das, S.K.²

101
- 0026368806
- Continuous speech recognition using linked predictive neural networks
- J. Tebelskis, A. Waibel, B. Petek and O. Schmidbauer (1991), "Continuous speech recognition using linked predictive neural networks", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Toronto, Canada, pp. 61-64.
- (1991) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Toronto, Canada , pp. 61-64
- Tebelskis, J.¹ Waibel, A.² Petek, B.³ Schmidbauer, O.⁴

102
- 0023833469
- Phoneme recognition using time-delay neural networks
- A. Waibel, T. Hanazawa, G. Hinton, K. Shikano and K. Lang (1988), "Phoneme recognition using time-delay neural networks", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., New York, NY, pp. 107-110.
- (1988) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., New York, NY , pp. 107-110
- Waibel, A.¹ Hanazawa, T.² Hinton, G.³ Shikano, K.⁴ Lang, K.⁵

103
- 0023211846
- Explicit time correlation in hidden Markov models for speech recognition
- C.J. Wellekens (1987), "Explicit time correlation in hidden Markov models for speech recognition", Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Dallas, TX, pp. 384-386.
- (1987) Proc. IEEE Internat. Conf. Acoust. Speech Signal Process., Dallas, TX , pp. 384-386
- Wellekens, C.J.¹

104
- 0039777029
- Scaling
- ed. by Keidel and Neff (Springer, Berlin)
- E. Zwicker (1975), "Scaling", in Handbook of Sensory Physiology, ed. by Keidel and Neff (Springer, Berlin, Vol. 3), pp. 401-448.
- (1975) Handbook of Sensory Physiology , vol.3 , pp. 401-448
- Zwicker, E.¹

105
- 0022151324
- The use of speech knowledge in automatic speech recognition
- V. Zue (1985), "The use of speech knowledge in automatic speech recognition", Proc. IEEE, Vol. 73, No. 11, pp. 1602-1615,
- (1985) Proc. IEEE , vol.73 , Issue.11 , pp. 1602-1615
- Zue, V.¹

106
- 0004988601
- Copernicus and the ASR challenge - Waiting for Kepler
- H. Bourlard, H. Hermansky and N. Morgan (1996), "Copernicus and the ASR challenge - Waiting for Kepler", Proc. ARPA Speech Recognition Workshop, Arden House, NY, 18-21 February 1996.
- (1996) Proc. ARPA Speech Recognition Workshop, Arden House, Ny, 18-21 February 1996
- Bourlard, H.¹ Hermansky, H.² Morgan, N.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.