메뉴 건너뛰기




Volumn 34, Issue 1-2, 2001, Pages 25-40

Multi-stream adaptive evidence combination for noise robust ASR

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC VARIABLES CONTROL; ADAPTIVE FILTERING; DATA STRUCTURES; MARKOV PROCESSES; MATHEMATICAL MODELS; NEURAL NETWORKS; PROBABILITY DISTRIBUTIONS; ROBUSTNESS (CONTROL SYSTEMS); SPEECH ANALYSIS; SPEECH SYNTHESIS;

EID: 0034825241     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(00)00044-3     Document Type: Article
Times cited : (69)

References (47)
  • 1
    • 0028516073 scopus 로고
    • How do humans process and recognise speech?
    • Allen, J.B., 1994. How do humans process and recognise speech?. IEEE Trans. Speech Signal Process. 2 (4), 567-576.
    • (1994) IEEE Trans. Speech Signal Process. , vol.2 , Issue.4 , pp. 567-576
    • Allen, J.B.1
  • 2
    • 0001437767 scopus 로고    scopus 로고
    • A new SNR-feature mapping for robust multi-stream speech recognition
    • Berthommier, F., Glotin, H., 1999. A new SNR-feature mapping for robust multi-stream speech recognition. In: Proc. ICPhS'99, pp. 711-715.
    • (1999) Proc. ICPhS'99 , pp. 711-715
    • Berthommier, F.1    Glotin, H.2
  • 5
    • 0030355935 scopus 로고    scopus 로고
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • Philadelphia
    • Bourlard, H., Dupont, S., 1996. A new ASR approach based on independent processing and recombination of partial frequency bands. In: Proc. ICSLP'96, Philadelphia, pp. 422-425.
    • (1996) Proc. ICSLP'96 , pp. 422-425
    • Bourlard, H.1    Dupont, S.2
  • 11
    • 84949458153 scopus 로고    scopus 로고
    • Using the multi-stream approach for continuous audio-visual speech recognition: Experiments on the M2VTS database
    • Dupont, S., Luettin, J., 1998. Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS database. In: Proc. ICSLP'98, pp. 1283-1286.
    • (1998) Proc. ICSLP'98 , pp. 1283-1286
    • Dupont, S.1    Luettin, J.2
  • 12
    • 0001347970 scopus 로고
    • The nature of speech and its interpretation
    • Fletcher, H., 1922. The nature of speech and its interpretation. J. Franklin Inst. 193 (6), 729-747.
    • (1922) J. Franklin Inst. , vol.193 , Issue.6 , pp. 729-747
    • Fletcher, H.1
  • 13
    • 85135375893 scopus 로고
    • HMM recognition in noise using parallel model combination
    • Gales, M.J.F., Young, S.J., 1993. HMM recognition in noise using parallel model combination. In: Proc. Eurospeecl'93, pp. 837-840.
    • (1993) Proc. Eurospeecl'93 , pp. 837-840
    • Gales, M.J.F.1    Young, S.J.2
  • 14
    • 0000344953 scopus 로고    scopus 로고
    • Fusion of auditory and visual information for noisy speech enhancement: A preliminary study of vowel transitions
    • Girin, L., Feng, G., Schwartz, J.-L., 1998. Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transitions. In: Proc. ICASSP'98, pp. 1005-1008.
    • (1998) Proc. ICASSP'98 , pp. 1005-1008
    • Girin, L.1    Feng, G.2    Schwartz, J.-L.3
  • 15
    • 85024441206 scopus 로고    scopus 로고
    • A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition
    • Glotin, H., Berthommier, F., Tessier, E., 1999. A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition. In: Proc. Eurospeech'99, pp. 2351-2354.
    • (1999) Proc. Eurospeech'99 , pp. 2351-2354
    • Glotin, H.1    Berthommier, F.2    Tessier, E.3
  • 18
    • 85135149324 scopus 로고    scopus 로고
    • Estimation of global posteriors and forward-backward training of hybrid systems
    • Hennebert, J., Ris, C., Bourlard, H., Renals, S., Morgan, N., 1997. Estimation of global posteriors and forward-backward training of hybrid systems. In: Proc. Eurospeech'97, pp. 1951-1954.
    • (1997) Proc. Eurospeech'97 , pp. 1951-1954
    • Hennebert, J.1    Ris, C.2    Bourlard, H.3    Renals, S.4    Morgan, N.5
  • 19
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky, H., 1990. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am. 87 (4), 1738-1752.
    • (1990) J. Acoust. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 21
    • 0010604779 scopus 로고    scopus 로고
    • Temporal patterns (TRAPS) in ASR noisy speech
    • Hermansky, H., Sharma, S., 1999. Temporal patterns (TRAPS) in ASR noisy speech. In: Proc. ICASSP'99, pp. 298-292.
    • (1999) Proc. ICASSP'99 , pp. 298-1292
    • Hermansky, H.1    Sharma, S.2
  • 22
    • 0030365517 scopus 로고    scopus 로고
    • Towards ASR on partially corrupted speech
    • Hermansky, H., Tibrewela, S., Pavel, M., 1996. Towards ASR on partially corrupted speech. In: Proc. ICSLP'96, pp. 462-465.
    • (1996) Proc. ICSLP'96 , pp. 462-465
    • Hermansky, H.1    Tibrewela, S.2    Pavel, M.3
  • 23
    • 0028996871 scopus 로고
    • Noise estimation techniques for robust speech recognition
    • Hirsch, H.G., Ehrlicher, C., 1995. Noise estimation techniques for robust speech recognition. In: ICASSP95, pp. 153-156.
    • (1995) ICASSP95 , pp. 153-156
    • Hirsch, H.G.1    Ehrlicher, C.2
  • 24
    • 0000262562 scopus 로고
    • Hierarchical mixtures of experts and the EM algorithm
    • Jordan, M.I., Jacobs, R.A., 1994. Hierarchical mixtures of experts and the EM algorithm. Neural Comput. 6, 181-214.
    • (1994) Neural Comput. , vol.6 , pp. 181-214
    • Jordan, M.I.1    Jacobs, R.A.2
  • 25
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • Kingsbury, B., Morgan, N., Greenberg, S., 1998. Robust speech recognition using the modulation spectrogram. Speech Communication 25 (1-3), 117-132.
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 117-132
    • Kingsbury, B.1    Morgan, N.2    Greenberg, S.3
  • 26
    • 16344396527 scopus 로고    scopus 로고
    • Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise
    • Lippmann, R.P., Carlson, B.A., 1997. Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise. In: Proc. Eurospeech'97, pp. 37-40.
    • (1997) Proc. Eurospeech'97 , pp. 37-40
    • Lippmann, R.P.1    Carlson, B.A.2
  • 27
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk, H., McDonald, J., 1976. Hearing lips and seeing voices. Nature 264, 746-748.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    McDonald, J.2
  • 29
    • 0004119130 scopus 로고    scopus 로고
    • PhD Dissertation, University of California at Berkeley, December 1998. Reprinted as ICSI Technical Report, ICSI TR-99-04
    • Mirghafori, N., 1999. A multi-band approach to automatic speech recognition. PhD Dissertation, University of California at Berkeley, December 1998. Reprinted as ICSI Technical Report, ICSI TR-99-04.
    • (1999) A Multi-band Approach to Automatic Speech Recognition
    • Mirghafori, N.1
  • 33
    • 84892151303 scopus 로고    scopus 로고
    • Some solutions to the missing feature problem in data classification, with application to noise robust ASR
    • Morris, A.C., Cooke, M., Green, P., 1998. Some solutions to the missing feature problem in data classification, with application to noise robust ASR. In: Proc. ICASSP'98, pp. 737-740.
    • (1998) Proc. ICASSP'98 , pp. 737-740
    • Morris, A.C.1    Cooke, M.2    Green, P.3
  • 34
    • 85135272651 scopus 로고    scopus 로고
    • The full-combination sub-bands approach to noise robust HMM/ANN based ASR
    • Morris, A.C., Hagen, A., Bourlard, H., 1999. The full-combination sub-bands approach to noise robust HMM/ANN based ASR. In: Proc. Eurospeech'99, pp. 599-602.
    • (1999) Proc. Eurospeech'99 , pp. 599-602
    • Morris, A.C.1    Hagen, A.2    Bourlard, H.3
  • 35
    • 85135144525 scopus 로고
    • On the decorrelation of filterbank energies in speech recognition
    • Nadeu, C., Hernando, J., Gorricho, M., 1995. On the decorrelation of filterbank energies in speech recognition. In: Proc. Eurospeech'95, pp. 1381-1384.
    • (1995) Proc. Eurospeech'95 , pp. 1381-1384
    • Nadeu, C.1    Hernando, J.2    Gorricho, M.3
  • 36
    • 84892189317 scopus 로고    scopus 로고
    • Multi-band speech recognition in noisy environment
    • Okawa, S., Boccieri, E., Potamianos, A., 1998. Multi-band speech recognition in noisy environment. In: Proc. ICASSP'98, pp. 641-644.
    • (1998) Proc. ICASSP'98 , pp. 641-644
    • Okawa, S.1    Boccieri, E.2    Potamianos, A.3
  • 38
    • 0030196712 scopus 로고    scopus 로고
    • Analysis of linear prediction, coding and spectral estimation from sub-bands
    • Rao, S., Pearlman, W.A., 1996. Analysis of linear prediction, coding and spectral estimation from sub-bands. IEEE Trans. Inf. Theory 42, 1160-1178.
    • (1996) IEEE Trans. Inf. Theory , vol.42 , pp. 1160-1178
    • Rao, S.1    Pearlman, W.A.2
  • 40
    • 0001595997 scopus 로고
    • Neural network classifiers estimate Bayesian a-posteriori probabilities
    • Richard, M.D., Lippmann, R.P., 1991. Neural network classifiers estimate Bayesian a-posteriori probabilities. J. Neural Comput. 3 (4), 461-483.
    • (1991) J. Neural Comput. , vol.3 , Issue.4 , pp. 461-483
    • Richard, M.D.1    Lippmann, R.P.2
  • 41
    • 0032623519 scopus 로고    scopus 로고
    • Mutual dependence of the octave-band weights in predicting speech intelligibility
    • Steeneken, H.J.M., Houtgast, T., 1999. Mutual dependence of the octave-band weights in predicting speech intelligibility. Speech Communication 28 (2), 109-123.
    • (1999) Speech Communication , vol.28 , Issue.2 , pp. 109-123
    • Steeneken, H.J.M.1    Houtgast, T.2
  • 42
    • 0029747053 scopus 로고    scopus 로고
    • Integrating audio and visual information to provide highly robust speech recognition
    • Tomlinson, J., Russel, M.J., Brooke, N.M., 1996. Integrating audio and visual information to provide highly robust speech recognition. In: Proc. ICASSP'96, pp. 821-824.
    • (1996) Proc. ICASSP'96 , pp. 821-824
    • Tomlinson, J.1    Russel, M.J.2    Brooke, N.M.3
  • 43
  • 44
    • 0025681008 scopus 로고
    • Hidden Markov model decomposition of speech and noise
    • Varga, A., Moore, R., 1990. Hidden Markov model decomposition of speech and noise. In: Proc. ICASSP'90, pp. 845-848.
    • (1990) Proc. ICASSP'90 , pp. 845-848
    • Varga, A.1    Moore, R.2
  • 46
    • 85099467303 scopus 로고    scopus 로고
    • Towards spontaneous speech recognition for on-board car navigation and information systems
    • Westphal, M., Waibel, A., 1999. Towards spontaneous speech recognition for on-board car navigation and information systems. In: Proc. Eurospeech'99, pp. 1955-1958.
    • (1999) Proc. Eurospeech'99 , pp. 1955-1958
    • Westphal, M.1    Waibel, A.2
  • 47
    • 0343249600 scopus 로고    scopus 로고
    • Performance improvements through combining phone and syllable scale information in automatic speech recognition
    • Wu, S.-L., Kingsbury, B.E., Morgan, N., Greenberg, S., 1998. Performance improvements through combining phone and syllable scale information in automatic speech recognition. In: Proc. ICASSP'98, pp. 459-462.
    • (1998) Proc. ICASSP'98 , pp. 459-462
    • Wu, S.-L.1    Kingsbury, B.E.2    Morgan, N.3    Greenberg, S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.