SCOPUS 정보 검색 플랫폼

Computer Speech and Language

Volumn 15, Issue 2, 2001, Pages 175-194

A statistical model for robust integration of narrowband cues in speech

(3) Saul, Lawrence K a Rahim, Mazin G a Allen, Jont B a

a AT AND T LABS RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; BANDWIDTH; MATHEMATICAL MODELS; PROBABILITY; ROBUSTNESS (CONTROL SYSTEMS); SPECTRUM ANALYSIS; SPEECH ANALYSIS; SPURIOUS SIGNAL NOISE;

FREQUENCY BANDS;

SPEECH RECOGNITION;

EID: 0035323922 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1006/csla.2001.0164 Document Type: Article

Times cited : (7)

References (47)

1
- 0032716023
- An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech
- Hong Kong, May 1999, IEEE
- Ali, A.M.A., Van der Spiegel, J., Muller, P., Haentjens, G. & Berman, J. (1999). An acoustic-phonetic feature-based system for automatic phoneme recognition in continuous speech. In Proceedings of the International Symposium on Circuits and Systems, Hong Kong, May 1999, pp. 118-121. IEEE.
- (1999) Proceedings of the International Symposium on Circuits and Systems , pp. 118-121
- Ali, A.M.A.¹ Van der Spiegel, J.² Muller, P.³ Haentjens, G.⁴ Berman, J.⁵

2
- 0028516073
- How do humans process and recognize speech?
- Allen, J.B. (1994). How do humans process and recognize speech? IEEE Transactions on Speech and Audio Processing, 2, 567-577.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 567-577
- Allen, J.B.¹

3
- 0025591290
- Neural networks for voiced/unvoiced speech classification
- Albuquerque
- Bendiksen, A. & Steiglitz, K. (1990). Neural networks for voiced/unvoiced speech classification. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90, Albuquerque, pp. 521-524.
- (1990) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90 , pp. 521-524
- Bendiksen, A.¹ Steiglitz, K.²

4
- 0003487601
- Oxford University Press, Oxford, U.K.
- Bishop, C. (1995). Neural Networks for Pattern Recognition. Oxford University Press, Oxford, U.K.
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.¹

5
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Philadelphia
- Bourlard, H. & Dupont, S. (1996). A new ASR approach based on independent processing and recombination of partial frequency bands. In Proceedings of the International Conference on Spoken Language Processing '96, Philadelphia, pp. 422-425.
- (1996) Proceedings of the International Conference on Spoken Language Processing '96 , pp. 422-425
- Bourlard, H.¹ Dupont, S.²

6
- 0030643240
- Sub-band-based speech recognition
- Munich
- Bourlard, H. & Dupont, S. (1997). Sub-band-based speech recognition. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97, Munich, pp. 1251-1254.
- (1997) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97 , pp. 1251-1254
- Bourlard, H.¹ Dupont, S.²

7
- 0003684441
- MIT Press, Cambridge, MA, U.S.A.
- Bregman, A.S. (1994). Auditory Scene Analysis: The Perceptual Organization of Sound. MIT Press, Cambridge, MA, U.S.A.
- (1994) Auditory Scene Analysis: The Perceptual Organization of Sound
- Bregman, A.S.¹

8
- 0004119259
- MIT Press, Cambridge, MA, U.S.A.
- Chomsky, N. & Halle, M. (1968). The Sound Pattern of English. MIT Press, Cambridge, MA, U.S.A.
- (1968) The Sound Pattern of English
- Chomsky, N.¹ Halle, M.²

9
- 0004147298
- Blackwell Publishing Ltd, Oxford, U.K.
- Clark, J. & Yallop, C. (1995). An Introduction to Phonetics and Phonology. Blackwell Publishing Ltd, Oxford, U.K.
- (1995) An Introduction to Phonetics and Phonology
- Clark, J.¹ Yallop, C.²

10
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- in press
- Cooke, M., Green, P., Josifovski, L. & Vizinho, A. (2001). Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, in press.
- (2001) Speech Communication
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

11
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- Dempster, A., Laird, N. & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B, 39, 1-38.
- (1977) Journal of the Royal Statistical Society B , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

12
- 0028234947
- A statistical approach to ASR using atomic units constructed from overlapping articulatory features
- Deng, L. & Sun, D. (1994). A statistical approach to ASR using atomic units constructed from overlapping articulatory features. Journal of the Acoustical Society of America, 95, 2702-2719.
- (1994) Journal of the Acoustical Society of America , vol.95 , pp. 2702-2719
- Deng, L.¹ Sun, D.²

13
- 0029816780
- An HMM-based speech recognizer using overlapping articulatory features
- Erler, K. & Freeman, G.H. (1996). An HMM-based speech recognizer using overlapping articulatory features. Journal of the Acoustical Society of America, 96, 2500-2513.
- (1996) Journal of the Acoustical Society of America , vol.96 , pp. 2500-2513
- Erler, K.¹ Freeman, G.H.²

14
- 0028361328
- A feature-based semivowel recognition system
- Espy-Wilson, C. (1994). A feature-based semivowel recognition system. Journal of the Acoustical Society of America, 96, 65-72.
- (1994) Journal of the Acoustical Society of America , vol.96 , pp. 65-72
- Espy-Wilson, C.¹

15
- 0003549684
- Van Nostrand, New York
- Fletcher, H. (1953). Speech and Hearing in Communication. Van Nostrand, New York.
- (1953) Speech and Hearing in Communication
- Fletcher, H.¹

16
- 0003419545
- National Institute of Standards and Technology (NIST), Gaithersburgh, MD
- Garofolo, J.S. (1988). Getting Started With the DARPA TIMIT CD-ROM: An Acoustic Phonetic Continuous Speech Database. National Institute of Standards and Technology (NIST), Gaithersburgh, MD.
- (1988) Getting Started With the DARPA TIMIT CD-ROM: An Acoustic Phonetic Continuous Speech Database
- Garofolo, J.S.¹

17
- 0021461632
- Detection in noise by spectro-temporal pattern analysis
- Hall, J.W., Haggard, M.P. & Fernandes, M.A. (1984). Detection in noise by spectro-temporal pattern analysis. Journal of the Acoustical Society of America, 76, 50-56.
- (1984) Journal of the Acoustical Society of America , vol.76 , pp. 50-56
- Hall, J.W.¹ Haggard, M.P.² Fernandes, M.A.³

18
- 0004215702
- Springer-Verlag, New York
- Hartmann, W.A. (1997). Signals, Sound, and Sensation. Springer-Verlag, New York.
- (1997) Signals, Sound, and Sensation
- Hartmann, W.A.¹

19
- 0030365517
- Towards ASR on partially corrupted speech
- Philadelphia
- Hermansky, H., Pavel, M. & Tibrewala, S. (1996). Towards ASR on partially corrupted speech. In Proceedings of the International Conference on Spoken Language Processing '96, Philadelphia, pp. 462-465.
- (1996) Proceedings of the International Conference on Spoken Language Processing '96 , pp. 462-465
- Hermansky, H.¹ Pavel, M.² Tibrewala, S.³

20
- 0003391579
- Springer-Verlag, New York, NY, U.S.A.
- Hess, W. (1983). Pitch Determination of Speech Signals: Algorithms and Devices. Springer-Verlag, New York, NY, U.S.A.
- (1983) Pitch Determination of Speech Signals: Algorithms and Devices
- Hess, W.¹

21
- 0012103823
- Robust measurement of fundamental frequency and degree of voicing
- Seattle
- Holmes, J.N. (1998). Robust measurement of fundamental frequency and degree of voicing. In Proceedings of the International Conference on Speech and Language Processing '98, Seattle, pp. 1007-1010.
- (1998) Proceedings of the International Conference on Speech and Language Processing '98 , pp. 1007-1010
- Holmes, J.N.¹

22
- 0001136927
- Computing with action potentials
- M. Jordan, M. Kearns and S. Solla, eds), volume. MIT Press, Cambridge, MA, U.S.A.
- Hopfield, J., Brody, C. & Rowels, S. (1998). Computing with action potentials. In Advances in Neural Information Processing Systems. (M. Jordan, M. Kearns and S. Solla, eds), volume 10, pp. 166-172. MIT Press, Cambridge, MA, U.S.A.
- (1998) Advances in Neural Information Processing Systems , vol.10 , pp. 166-172
- Hopfield, J.¹ Brody, C.² Rowels, S.³

23
- 0025680225
- NTIMIT: A phonetically balanced, continuous speech, telephone bandwidth speech database
- Albuquerque
- Jankowsld, C., Kalyanswamy, A., Basson, S. & Spitz, J. (1990). NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90, Albuquerque, pp. 109-112.
- (1990) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90 , pp. 109-112
- Jankowsld, C.¹ Kalyanswamy, A.² Basson, S.³ Spitz, J.⁴

24
- 0004283231
- MIT Press, Cambridge, MA, U.S.A.
- Jordan, M., ed. (1999). Learning in Graphical Models. MIT Press, Cambridge, MA, U.S.A.
- (1999) Learning in Graphical Models
- Jordan, M.¹

25
- 0034297586
- Detection of phonological features in continuous speech using neural networks
- King, S. & Taylor, P. (2000). Detection of phonological features in continuous speech using neural networks. Computer Speech and Language, 14, 333-353.
- (2000) Computer Speech and Language , vol.14 , pp. 333-353
- King, S.¹ Taylor, P.²

26
- 0003424928
- PhD Thesis, University of Bielefeld, Germany
- Kirchhoff, K. (1999). Robust speech recognition using articulatory information. PhD Thesis, University of Bielefeld, Germany.
- (1999) Robust speech recognition using articulatory information
- Kirchhoff, K.¹

27
- 0031187171
- Speech recognition by machines and humans
- Lippmann, R.P. (1997). Speech recognition by machines and humans. Speech Communication, 22, 1-15.
- (1997) Speech Communication , vol.22 , pp. 1-15
- Lippmann, R.P.¹

28
- 0029907249
- Landmark detection for distinctive feature-based speech recognition
- Liu, S. (1996). Landmark detection for distinctive feature-based speech recognition. Journal of the Acoustical Society of America, 100, 3417-3430.
- (1996) Journal of the Acoustical Society of America , vol.100 , pp. 3417-3430
- Liu, S.¹

29
- 84898935332
- A framework for multiple-instance learning
- M. Jordan, M. Kearns and S. Solla, eds. MIT Press, Cambridge, MA, U.S.A.
- Moron, O. & Lozano-Perez, T. (1998). A framework for multiple-instance learning. In Advances in Neural Information Processing Systems. (M. Jordan, M. Kearns and S. Solla, eds), volume 10, pp. 570-576. MIT Press, Cambridge, MA, U.S.A.
- (1998) Advances in Neural Information Processing Systems , vol.10 , pp. 570-576
- Moron, O.¹ Lozano-Perez, T.²

30
- 84955023511
- An analysis of perceptual confusions among some English consonants
- Miller, G.A. & Nicely, P.E. (1955). An analysis of perceptual confusions among some English consonants. Journal of the Acoustical Society of America, 27, 338-352.
- (1955) Journal of the Acoustical Society of America , vol.27 , pp. 338-352
- Miller, G.A.¹ Nicely, P.E.²

31
- 0001279385
- Union: A new approach for combining sub-band observations for noisy speech recognition
- Tampere, Finland
- Ming, J. & Smith, F.J. (1999). Union: a new approach for combining sub-band observations for noisy speech recognition, in Proceedings of the Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, pp. 175-178.
- (1999) Proceedings of the Workshop on Robust Methods for Speech Recognition in Adverse Conditions , pp. 175-178
- Ming, J.¹ Smith, F.J.²

32
- 0004119130
- PhD Dissertation, University of California Berkeley, U.S.A.
- Mirghafori, N. (1998). A multi-band approach to automatic speech recognition. PhD Dissertation, University of California Berkeley, U.S.A.
- (1998) A multi-band approach to automatic speech recognition
- Mirghafori, N.¹

33
- 0003249790
- Sooner or later: Exploiting asynchrony in multi-band speech recognition
- Budapest
- Mirghafori, N. & Morgan, N. (1999). Sooner or later: exploiting asynchrony in multi-band speech recognition, in Proceedings of Eurospeech-99, Budapest, pp. 595-598.
- (1999) Proceedings of Eurospeech-99 , pp. 595-598
- Mirghafori, N.¹ Morgan, N.²

34
- 0003789815
- Academic Press, San Diego, CA
- Moore, B.C.J. (1997). An Introduction to the Psychology of Hearing. Academic Press, San Diego, CA.
- (1997) An Introduction to the Psychology of Hearing
- Moore, B.C.J.¹

35
- 0020816083
- Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
- Moore, B.C.J. & Glasberg, B.R. (1983). Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. Journal of the Acoustical Society of America, 74, 750-753.
- (1983) Journal of the Acoustical Society of America , vol.74 , pp. 750-753
- Moore, B.C.J.¹ Glasberg, B.R.²

36
- 0032657797
- Distinctive feature detection using support vector machines
- Phoenix
- Niyogi, P., Bnrges, C. & Ramesh, P. (1999). Distinctive feature detection using support vector machines. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '99, Phoenix, pp. 425-428.
- (1999) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '99 , pp. 425-428
- Niyogi, P.¹ Bnrges, C.² Ramesh, P.³

37
- 0031636902
- Incorporating voice onset time to improve letter recognition accuracies
- Seattle
- Niyogi, P. & Ramesh, P. (1998). Incorporating voice onset time to improve letter recognition accuracies. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '98, Seattle, pp. 721-724.
- (1998) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '98 , pp. 721-724
- Niyogi, P.¹ Ramesh, P.²

38
- 0003391330
- Morgan Kaufmann, San Mateo, CA, U.S.A.
- Pearl, J. (1988). Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Mateo, CA, U.S.A.
- (1988) Probabilistic Reasoning in Intelligent Systems
- Pearl, J.¹

39
- 0004244302
- Prentice Hall, Englewood Cliffs, NJ, U.S.A.
- Rabiner, L.R. & Juang, B.H. (1993). Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs, NJ, U.S.A.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

40
- 0003913694
- An efficient implementation of the Patterson-Holdsworth auditory filterbank
- Cupertino, CA
- Slaney, M. (1993). An efficient implementation of the Patterson-Holdsworth auditory filterbank. Apple Computer Technical Report 35, Cupertino, CA.
- (1993) Apple Computer Technical Report , vol.35
- Slaney, M.¹

41
- 0025623060
- A perceptual pitch detector
- Albuquerque
- Slaney, M. & Lyon, R.F. (1990). A perceptual pitch detector. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90, Albuquerque, pp. 357-360.
- (1990) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '90 , pp. 357-360
- Slaney, M.¹ Lyon, R.F.²

42
- 85031258263
- A neurally motivated technique for voicing detection and FO estimation for speech
- University of Sterling, Sterling, Scotland
- Smith, L. (1996). A neurally motivated technique for voicing detection and FO estimation for speech. CCCN Technical Report 22, University of Sterling, Sterling, Scotland.
- (1996) CCCN Technical Report 22
- Smith, L.¹

43
- 84956679481
- A noise-robust auditory modelling front end for voiced speech
- Lecture Notes in Computer Science, volume. (W. Gerstner, A. Germond, M. Hasler and J.-D. Nicoud, eds). Springer-Verlag, Heidelberg, Germany
- Smith, L. (1997). A noise-robust auditory modelling front end for voiced speech. In Artificial Neural Networks ICANN-97, Lecture Notes in Computer Science, volume 1327. (W. Gerstner, A. Germond, M. Hasler and J.-D. Nicoud, eds), pp. 97-102. Springer-Verlag, Heidelberg, Germany.
- (1997) Artificial Neural Networks ICANN-97 , vol.1327 , pp. 97-102
- Smith, L.¹

44
- 0004129646
- MIT Press, Cambridge, MA, U.S.A.
- Stevens, K.N. (1999). Acoustic Phonetics. MIT Press, Cambridge, MA, U.S.A.
- (1999) Acoustic Phonetics
- Stevens, K.N.¹

45
- 0030682291
- Sub-band based recognition of noisy speech
- Munich
- Tibrewala, S. & Hermansky, H. (1997). Sub-band based recognition of noisy speech. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97, Munich, pp. 1255-1258.
- (1997) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing '97 , pp. 1255-1258
- Tibrewala, S.¹ Hermansky, H.²

46
- 0004319968
- Technical Report, DRA Speech Research Unit
- Varga, A., Steeneken, H.J.M., Tomlinson, M. & Jones, D. (1992). The noisex-92 study on the effect of additive noise on automatic speech recognition. Technical Report, DRA Speech Research Unit.
- (1992) The noisex-92 study on the effect of additive noise on automatic speech recognition
- Varga, A.¹ Steeneken, H.J.M.² Tomlinson, M.³ Jones, D.⁴

47
- 0015749654
- Consonant confusions in noise: A study of perceptual features
- Wang, M.D. & Bilger, R.C. (1973). Consonant confusions in noise: a study of perceptual features. Journal of the Acoustical Society of America, 54, 1248-1266.
- (1973) Journal of the Acoustical Society of America , vol.54 , pp. 1248-1266
- Wang, M.D.¹ Bilger, R.C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.