SCOPUS Information Retrieval Platform

Speech Communication

Volume 34, Issue 1-2, 2001, Pages 25-40

Multi-stream adaptive evidence combination for noise robust ASR

(4) Morris, Andrew a; Hagen, Astrid a; Glotin, Hervé a; Bourlard, Hervé a,b

a IDIAP RESEARCH INSTITUTE (Switzerland)

b EPFL (Switzerland)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC VARIABLES CONTROL; ADAPTIVE FILTERING; DATA STRUCTURES; MARKOV PROCESSES; MATHEMATICAL MODELS; NEURAL NETWORKS; PROBABILITY DISTRIBUTIONS; ROBUSTNESS (CONTROL SYSTEMS); SPEECH ANALYSIS; SPEECH SYNTHESIS;

HIDDEN MARKOV MODELS; MULTI-STREAM ADAPTIVE EVIDENCE COMBINATION METHOD;

CONTINUOUS SPEECH RECOGNITION;

EID: 0034825241 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(00)00044-3 Document Type: Article

Times cited: (69)

References (47)

1
- 0028516073
- How do humans process and recognise speech?
- Allen, J.B., 1994. How do humans process and recognise speech? IEEE Trans. Speech Audio Process. 2 (4), 567-576.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 567-576
- Allen, J.B.¹

2
- 0001437767
- A new SNR-feature mapping for robust multi-stream speech recognition
- Berthommier, F., Glotin, H., 1999. A new SNR-feature mapping for robust multi-stream speech recognition. In: Proc. ICPhS'99, pp. 711-715.
- (1999) Proc. ICPhS'99 , pp. 711-715
- Berthommier, F.¹ Glotin, H.²

3
- 0003487601
- Clarendon Press, Oxford
- Bishop, C. (Ed.), 1995. Neural Networks for Pattern Recognition. Clarendon Press, Oxford, pp. 365-368.
- (1995) Neural Networks for Pattern Recognition , pp. 365-368
- Bishop, C.¹

4
- 0003872847
- Non-stationary multi-channel (multi-stream) processing towards robust and adaptive ASR
- Bourlard, 1999. Non-stationary multi-channel (multi-stream) processing towards robust and adaptive ASR. In: Proceedings of Tampere Workshop on Robust Methods for Speech Recognition in Adverse Conditions, pp. 1-10.
- (1999) Proceedings of Tampere Workshop on Robust Methods for Speech Recognition in Adverse Conditions , pp. 1-10
- Bourlard¹

5
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Philadelphia
- Bourlard, H., Dupont, S., 1996. A new ASR approach based on independent processing and recombination of partial frequency bands. In: Proc. ICSLP'96, Philadelphia, pp. 422-425.
- (1996) Proc. ICSLP'96 , pp. 422-425
- Bourlard, H.¹ Dupont, S.²

6
- 0003573244
- Kluwer Academic Publishers, Dordrecht
- Bourlard, H., Morgan, N., 1994. Connectionist speech recognition - a hybrid approach. Kluwer Academic Publishers, Dordrecht.
- (1994) Connectionist Speech Recognition - A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

7
- 3142695111
- Hybrid HMM/ANN systems for speech recognition: Overview and new research directions
- Bourlard, H., Morgan, N., 1997. Hybrid HMM/ANN systems for speech recognition: overview and new research directions. In: Proceedings of International School on Neural Nets: Adaptive Processing of Temporal Information.
- (1997) Proceedings of International School on Neural Nets: Adaptive Processing of Temporal Information
- Bourlard, H.¹ Morgan, N.²

8
- 85135196323
- New telephone speech corpora at CSLU
- Cole, R.A., Noel, T., Lander, L., Durham, T., 1995. New telephone speech corpora at CSLU. In: Proceedings of European Conference on Speech Communication and Technology, Vol. 1, pp. 821-824.
- (1995) Proceedings of European Conference on Speech Communication and Technology , vol.1 , pp. 821-824
- Cole, R.A.¹ Noel, T.² Lander, L.³ Durham, T.⁴

9
- 0002212788
- Missing feature theory in ASR: Make sure you miss the right type of features
- de Veth, J., de Wet, F., Cranen, B., Boves, L., 1999. Missing feature theory in ASR: make sure you miss the right type of features. In: Proceedings of Workshop on Robust Methods for Speech Recognition in Adverse Conditions, pp. 231-234.
- (1999) Proceedings of Workshop on Robust Methods for Speech Recognition in Adverse Conditions , pp. 231-234
- De Veth, J.¹ De Wet, F.² Cranen, B.³ Boves, L.⁴

10
- 0003472470
- Wiley, New York
- Duda, R.O., Hart, P.E., 1993. Pattern Classification and Scene Analysis. Wiley, New York.
- (1993) Pattern Classification and Scene Analysis
- Duda, R.O.¹ Hart, P.E.²

11
- 84949458153
- Using the multi-stream approach for continuous audio-visual speech recognition: Experiments on the M2VTS database
- Dupont, S., Luettin, J., 1998. Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS database. In: Proc. ICSLP'98, pp. 1283-1286.
- (1998) Proc. ICSLP'98 , pp. 1283-1286
- Dupont, S.¹ Luettin, J.²

12
- 0001347970
- The nature of speech and its interpretation
- Fletcher, H., 1922. The nature of speech and its interpretation. J. Franklin Inst. 193 (6), 729-747.
- (1922) J. Franklin Inst. , vol.193 , Issue.6 , pp. 729-747
- Fletcher, H.¹

13
- 85135375893
- HMM recognition in noise using parallel model combination
- Gales, M.J.F., Young, S.J., 1993. HMM recognition in noise using parallel model combination. In: Proc. Eurospeech'93, pp. 837-840.
- (1993) Proc. Eurospeech'93 , pp. 837-840
- Gales, M.J.F.¹ Young, S.J.²

14
- 0000344953
- Fusion of auditory and visual information for noisy speech enhancement: A preliminary study of vowel transitions
- Girin, L., Feng, G., Schwartz, J.-L., 1998. Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transitions. In: Proc. ICASSP'98, pp. 1005-1008.
- (1998) Proc. ICASSP'98 , pp. 1005-1008
- Girin, L.¹ Feng, G.² Schwartz, J.-L.³

15
- 85024441206
- A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition
- Glotin, H., Berthommier, F., Tessier, E., 1999. A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition. In: Proc. Eurospeech'99, pp. 2351-2354.
- (1999) Proc. Eurospeech'99 , pp. 2351-2354
- Glotin, H.¹ Berthommier, F.² Tessier, E.³

16
- 0039881085
- On the origins of speech intelligibility in the real world
- Greenberg, S., 1997. On the origins of speech intelligibility in the real world. In: Proceedings ESCA Workshop on Robust Speech Recognition for Unknown Communication Channels, pp. 23-32.
- (1997) Proceedings ESCA Workshop on Robust Speech Recognition for Unknown Communication Channels , pp. 23-32
- Greenberg, S.¹

17
- 84949513730
- Different weighting schemes in the full combination sub-bands approach for noise robust ASR
- Hagen, A., Morris, A.C., Bourlard, H., 1999. Different weighting schemes in the full combination sub-bands approach for noise robust ASR. In: Proceedings Tampere Workshop on Robust Methods for Speech Recognition in Adverse Conditions, pp. 199-202.
- (1999) Proceedings Tampere Workshop on Robust Methods for Speech Recognition in Adverse Conditions , pp. 199-202
- Hagen, A.¹ Morris, A.C.² Bourlard, H.³

18
- 85135149324
- Estimation of global posteriors and forward-backward training of hybrid systems
- Hennebert, J., Ris, C., Bourlard, H., Renals, S., Morgan, N., 1997. Estimation of global posteriors and forward-backward training of hybrid systems. In: Proc. Eurospeech'97, pp. 1951-1954.
- (1997) Proc. Eurospeech'97 , pp. 1951-1954
- Hennebert, J.¹ Ris, C.² Bourlard, H.³ Renals, S.⁴ Morgan, N.⁵

19
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hermansky, H., 1990. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am. 87 (4), 1738-1752.
- (1990) J. Acoust. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

20
- 0028517164
- RASTA processing of speech
- Hermansky, H., Morgan, N., 1994. RASTA processing of speech. IEEE Trans. Speech Audio Process. 2 (4), 578-589.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

21
- 0010604779
- Temporal patterns (TRAPS) in ASR of noisy speech
- Hermansky, H., Sharma, S., 1999. Temporal patterns (TRAPS) in ASR of noisy speech. In: Proc. ICASSP'99, pp. 289-292.
- (1999) Proc. ICASSP'99 , pp. 289-292
- Hermansky, H.¹ Sharma, S.²

22
- 0030365517
- Towards ASR on partially corrupted speech
- Hermansky, H., Tibrewala, S., Pavel, M., 1996. Towards ASR on partially corrupted speech. In: Proc. ICSLP'96, pp. 462-465.
- (1996) Proc. ICSLP'96 , pp. 462-465
- Hermansky, H.¹ Tibrewala, S.² Pavel, M.³

23
- 0028996871
- Noise estimation techniques for robust speech recognition
- Hirsch, H.G., Ehrlicher, C., 1995. Noise estimation techniques for robust speech recognition. In: ICASSP95, pp. 153-156.
- (1995) ICASSP95 , pp. 153-156
- Hirsch, H.G.¹ Ehrlicher, C.²

24
- 0000262562
- Hierarchical mixtures of experts and the EM algorithm
- Jordan, M.I., Jacobs, R.A., 1994. Hierarchical mixtures of experts and the EM algorithm. Neural Comput. 6, 181-214.
- (1994) Neural Comput. , vol.6 , pp. 181-214
- Jordan, M.I.¹ Jacobs, R.A.²

25
- 0032136330
- Robust speech recognition using the modulation spectrogram
- Kingsbury, B., Morgan, N., Greenberg, S., 1998. Robust speech recognition using the modulation spectrogram. Speech Communication 25 (1-3), 117-132.
- (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 117-132
- Kingsbury, B.¹ Morgan, N.² Greenberg, S.³

26
- 16344396527
- Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise
- Lippmann, R.P., Carlson, B.A., 1997. Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise. In: Proc. Eurospeech'97, pp. 37-40.
- (1997) Proc. Eurospeech'97 , pp. 37-40
- Lippmann, R.P.¹ Carlson, B.A.²

27
- 0017199877
- Hearing lips and seeing voices
- McGurk, H., MacDonald, J., 1976. Hearing lips and seeing voices. Nature 264, 746-748.
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

28
- 0001279385
- Union: A new approach for combining sub-band observations for noisy speech recognition
- Ming, J., Smith, F.J., 1999. Union: a new approach for combining sub-band observations for noisy speech recognition. In: Proceedings of Workshop on Robust Methods for Speech Recognition in Adverse Conditions, pp. 175-178.
- (1999) Proceedings of Workshop on Robust Methods for Speech Recognition in Adverse Conditions , pp. 175-178
- Ming, J.¹ Smith, F.J.²

29
- 0004119130
- PhD Dissertation, University of California at Berkeley, December 1998. Reprinted as ICSI Technical Report, ICSI TR-99-04
- Mirghafori, N., 1999. A multi-band approach to automatic speech recognition. PhD Dissertation, University of California at Berkeley, December 1998. Reprinted as ICSI Technical Report, ICSI TR-99-04.
- (1999) A Multi-band Approach to Automatic Speech Recognition
- Mirghafori, N.¹

30
- 0003789815
- Academic Press, New York
- Moore, B.C.J., 1997. An Introduction to the Psychology of Hearing, 4th edition. Academic Press, New York.
- (1997) An Introduction to the Psychology of Hearing, 4th Edition
- Moore, B.C.J.¹

31
- 85031486933
- Research Report IDIAP-RR 98-17
- Morgan, N., Bourlard, H., Hermansky, H., 1998. Automatic speech recognition: an auditory perspective. Research Report IDIAP-RR 98-17.
- (1998) Automatic Speech Recognition: An Auditory Perspective
- Morgan, N.¹ Bourlard, H.² Hermansky, H.³

32
- 0342815070
- Research Report IDIAP-Com 99-04
- Morris, A.C., 1999. Latent variable decomposition for posteriors or likelihood based sub-band ASR. Research Report IDIAP-Com 99-04.
- (1999) Latent Variable Decomposition for Posteriors or Likelihood Based Sub-band ASR
- Morris, A.C.¹

33
- 84892151303
- Some solutions to the missing feature problem in data classification, with application to noise robust ASR
- Morris, A.C., Cooke, M., Green, P., 1998. Some solutions to the missing feature problem in data classification, with application to noise robust ASR. In: Proc. ICASSP'98, pp. 737-740.
- (1998) Proc. ICASSP'98 , pp. 737-740
- Morris, A.C.¹ Cooke, M.² Green, P.³

34
- 85135272651
- The full-combination sub-bands approach to noise robust HMM/ANN based ASR
- Morris, A.C., Hagen, A., Bourlard, H., 1999. The full-combination sub-bands approach to noise robust HMM/ANN based ASR. In: Proc. Eurospeech'99, pp. 599-602.
- (1999) Proc. Eurospeech'99 , pp. 599-602
- Morris, A.C.¹ Hagen, A.² Bourlard, H.³

35
- 85135144525
- On the decorrelation of filterbank energies in speech recognition
- Nadeu, C., Hernando, J., Gorricho, M., 1995. On the decorrelation of filterbank energies in speech recognition. In: Proc. Eurospeech'95, pp. 1381-1384.
- (1995) Proc. Eurospeech'95 , pp. 1381-1384
- Nadeu, C.¹ Hernando, J.² Gorricho, M.³

36
- 84892189317
- Multi-band speech recognition in noisy environment
- Okawa, S., Bocchieri, E., Potamianos, A., 1998. Multi-band speech recognition in noisy environment. In: Proc. ICASSP'98, pp. 641-644.
- (1998) Proc. ICASSP'98 , pp. 641-644
- Okawa, S.¹ Bocchieri, E.² Potamianos, A.³

37
- 0004106903
- Academic Press, New York
- Pickles, J.O., 1988. An Introduction to the Physiology of Hearing. Academic Press, New York.
- (1988) An Introduction to the Physiology of Hearing
- Pickles, J.O.¹

38
- 0030196712
- Analysis of linear prediction, coding and spectral estimation from sub-bands
- Rao, S., Pearlman, W.A., 1996. Analysis of linear prediction, coding and spectral estimation from sub-bands. IEEE Trans. Inf. Theory 42, 1160-1178.
- (1996) IEEE Trans. Inf. Theory , vol.42 , pp. 1160-1178
- Rao, S.¹ Pearlman, W.A.²

39
- 0030374103
- Bootstrapping with noise: An effective regularisation technique
- Raviv, Y., Intrator, N., 1996. Bootstrapping with noise: an effective regularisation technique. Connection Sci., Special Issue on Combining Estimators, 8, 356-372.
- (1996) Connection Sci., Special Issue on Combining Estimators , vol.8 , pp. 356-372
- Raviv, Y.¹ Intrator, N.²

40
- 0001595997
- Neural network classifiers estimate Bayesian a-posteriori probabilities
- Richard, M.D., Lippmann, R.P., 1991. Neural network classifiers estimate Bayesian a-posteriori probabilities. J. Neural Comput. 3 (4), 461-483.
- (1991) J. Neural Comput. , vol.3 , Issue.4 , pp. 461-483
- Richard, M.D.¹ Lippmann, R.P.²

41
- 0032623519
- Mutual dependence of the octave-band weights in predicting speech intelligibility
- Steeneken, H.J.M., Houtgast, T., 1999. Mutual dependence of the octave-band weights in predicting speech intelligibility. Speech Communication 28 (2), 109-123.
- (1999) Speech Communication , vol.28 , Issue.2 , pp. 109-123
- Steeneken, H.J.M.¹ Houtgast, T.²

42
- 0029747053
- Integrating audio and visual information to provide highly robust speech recognition
- Tomlinson, J., Russell, M.J., Brooke, N.M., 1996. Integrating audio and visual information to provide highly robust speech recognition. In: Proc. ICASSP'96, pp. 821-824.
- (1996) Proc. ICASSP'96 , pp. 821-824
- Tomlinson, J.¹ Russell, M.J.² Brooke, N.M.³

43
- 0030643684
- Modelling asynchrony in speech using elementary single-signal decomposition
- Tomlinson, J., Russell, M.J., Moore, R.K., Buckland, A.P., Fawley, M.A., 1997. Modelling asynchrony in speech using elementary single-signal decomposition. In: Proc. ICASSP'97, pp. 1247-1250.
- (1997) Proc. ICASSP'97 , pp. 1247-1250
- Tomlinson, J.¹ Russell, M.J.² Moore, R.K.³ Buckland, A.P.⁴ Fawley, M.A.⁵

44
- 0025681008
- Hidden Markov model decomposition of speech and noise
- Varga, A., Moore, R., 1990. Hidden Markov model decomposition of speech and noise. In: Proc. ICASSP'90, pp. 845-848.
- (1990) Proc. ICASSP'90 , pp. 845-848
- Varga, A.¹ Moore, R.²

45
- 0004319968
- Technical Report DRA Speech Research Unit
- Varga, A., Steeneken, H.J.M., Tomlinson, M., Jones, D., 1992. The Noisex-92 study on the effect of additive noise on automatic speech recognition. Technical Report DRA Speech Research Unit.
- (1992) The Noisex-92 Study on the Effect of Additive Noise on Automatic Speech Recognition
- Varga, A.¹ Steeneken, H.J.M.² Tomlinson, M.³ Jones, D.⁴

46
- 85099467303
- Towards spontaneous speech recognition for on-board car navigation and information systems
- Westphal, M., Waibel, A., 1999. Towards spontaneous speech recognition for on-board car navigation and information systems. In: Proc. Eurospeech'99, pp. 1955-1958.
- (1999) Proc. Eurospeech'99 , pp. 1955-1958
- Westphal, M.¹ Waibel, A.²

47
- 0343249600
- Performance improvements through combining phone and syllable scale information in automatic speech recognition
- Wu, S.-L., Kingsbury, B.E., Morgan, N., Greenberg, S., 1998. Performance improvements through combining phone and syllable scale information in automatic speech recognition. In: Proc. ICASSP'98, pp. 459-462.
- (1998) Proc. ICASSP'98 , pp. 459-462
- Wu, S.-L.¹ Kingsbury, B.E.² Morgan, N.³ Greenberg, S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.