메뉴 건너뛰기




Volumn 5, Issue 2, 2004, Pages 103-117

The fusion of visual lip movements and mixed speech signals for robust speech separation

Author keywords

Audiovisual information fusion; Audiovisual signal separation; Blind speech separation; Independent component analysis; Robust speech recognition

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; ALGORITHMS; CLASSIFICATION (OF INFORMATION); COMPUTER SIMULATION; CORRELATION METHODS; DATABASE SYSTEMS; ESTIMATION; FEATURE EXTRACTION; GAUSSIAN NOISE (ELECTRONIC); INDEPENDENT COMPONENT ANALYSIS; SIGNAL TO NOISE RATIO; VECTORS; VIDEO SIGNAL PROCESSING;

EID: 1842854565     PISSN: 15662535     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.inffus.2003.10.006     Document Type: Article
Times cited : (8)

References (32)
  • 1
    • 0035458007 scopus 로고    scopus 로고
    • Multi-modal sound localization using audiovisual information Fusion
    • Aarabi P., Zaky S. Multi-modal sound localization using audiovisual information Fusion. Information Fusion. 3(2):2001;209-223.
    • (2001) Information Fusion , vol.3 , Issue.2 , pp. 209-223
    • Aarabi, P.1    Zaky, S.2
  • 2
    • 0003456875 scopus 로고    scopus 로고
    • M.A.Sc. Thesis, Department of Electrical and Computer Engineering, University of Toronto, June
    • P. Aarabi, Multi-sense artificial awareness, M.A.Sc. Thesis, Department of Electrical and Computer Engineering, University of Toronto, June 1999.
    • (1999) Multi-sense Artificial Awareness
    • Aarabi, P.1
  • 4
    • 0022021789 scopus 로고
    • The contribution of fundamental frequency, amplitude envelope, and voicing duration cues to speechreading in normal-hearing subjects
    • Grant K.W., Ardell L.H., Kuhl P.K., Sparks D.W. The contribution of fundamental frequency, amplitude envelope, and voicing duration cues to speechreading in normal-hearing subjects. Journal of the Acoustical Society of America. 77(2):1985;671-677.
    • (1985) Journal of the Acoustical Society of America , vol.77 , Issue.2 , pp. 671-677
    • Grant, K.W.1    Ardell, L.H.2    Kuhl, P.K.3    Sparks, D.W.4
  • 9
    • 0002358797 scopus 로고    scopus 로고
    • Discriminative learning of visual data for audiovisual speech recognition
    • Rogozan A. Discriminative learning of visual data for audiovisual speech recognition. International Journal on Artificial Intelligence Tools. 8(1):1999;43-52.
    • (1999) International Journal on Artificial Intelligence Tools , vol.8 , Issue.1 , pp. 43-52
    • Rogozan, A.1
  • 10
    • 0032180188 scopus 로고    scopus 로고
    • Adaptive fusion of acoustic and visual sources for automatic speech recognition
    • Rogozan A., Deléglise P. Adaptive fusion of acoustic and visual sources for automatic speech recognition. Speech Communication. 26(1-2):1998;149-161.
    • (1998) Speech Communication , vol.26 , Issue.1-2 , pp. 149-161
    • Rogozan, A.1    Deléglise, P.2
  • 11
    • 0000886386 scopus 로고
    • Visual speech recognition with stochastic networks
    • G. Tesauro, D. Toruetzky, & T. Leen. Cambridge: MIT Press
    • Movellan J.R. Visual speech recognition with stochastic networks. Tesauro G., Toruetzky D., Leen T. Advances in Neural Information Processing Systems. vol. 7:1995;MIT Press, Cambridge.
    • (1995) Advances in Neural Information Processing Systems , vol.7
    • Movellan, J.R.1
  • 12
    • 0006464281 scopus 로고    scopus 로고
    • Automatic computer lip-reading using fuzzy set theory
    • Santa Cruz, CA
    • J. Baldwin, T. Martin, M. Saeed, Automatic computer lip-reading using fuzzy set theory, in: Proceedings of AVSP 99, Santa Cruz, CA, 1999.
    • (1999) Proceedings of AVSP 99
    • Baldwin, J.1    Martin, T.2    Saeed, M.3
  • 14
    • 0000134331 scopus 로고    scopus 로고
    • 2D deformable models for visual speech analysis
    • D.G. Stork, M.E. Hennecke (Eds.). Speechreading by Humans and Machines, Springer Verlag, Berlin
    • Coianiz T., Torresani L., Capril B. 2D deformable models for visual speech analysis. Stork D.G., Hennecke M.E. Speechreading by Humans and Machines. NATO ASI Series, Series F: Computer and Systems Sciences. vol. 150:1996;391-398 Springer Verlag, Berlin.
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 391-398
    • Coianiz, T.1    Torresani, L.2    Capril, B.3
  • 17
    • 0022411853 scopus 로고
    • On the role of visual rate information in phonetic perception
    • Green K.P., Miller J.L. On the role of visual rate information in phonetic perception. Perception and Psychophysics. 38(3):1985;269-276.
    • (1985) Perception and Psychophysics , vol.38 , Issue.3 , pp. 269-276
    • Green, K.P.1    Miller, J.L.2
  • 19
    • 0030247984 scopus 로고    scopus 로고
    • Computer lipreading for improved accuracy in automatic speech recognition
    • Silsbee P.L., Bovik A.C. Computer lipreading for improved accuracy in automatic speech recognition. IEEE Transactions on Speech and Audio Processing. 4(5):1996;337-351.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 337-351
    • Silsbee, P.L.1    Bovik, A.C.2
  • 22
    • 0025503485 scopus 로고
    • Neural network models of sensory integration for improved vowel recognition
    • Yuhas B.P., Goldstein M.H., Sejnowski T.J., Jenkins R.E. Neural network models of sensory integration for improved vowel recognition. Proceedings of the IEEE. 78(10):1990;1658-1668.
    • (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1658-1668
    • Yuhas, B.P.1    Goldstein, M.H.2    Sejnowski, T.J.3    Jenkins, R.E.4
  • 25
    • 0029411030 scopus 로고
    • An information-maximization approach to blind separation and blind deconvolution
    • Bell A., Sejnowski T. An information-maximization approach to blind separation and blind deconvolution. Neural Computation. 7(7):1995;1129-1159.
    • (1995) Neural Computation , vol.7 , Issue.7 , pp. 1129-1159
    • Bell, A.1    Sejnowski, T.2
  • 26
    • 0029725825 scopus 로고    scopus 로고
    • Blind separation of delayed sources based on information maximization
    • May
    • K. Torkkola, Blind separation of delayed sources based on information maximization, ICASSP, May 1996.
    • (1996) ICASSP
    • Torkkola, K.1
  • 31
    • 1842613015 scopus 로고    scopus 로고
    • Genetic sensor selection enhanced independent component analysis and its applications to robust speech recognition
    • Baltimore, MD, June
    • P. Aarabi, Genetic sensor selection enhanced independent component analysis and its applications to robust speech recognition, in: Proceedings of the 5th IEEE Workshop on Nonlinear Signal and Image Processing (NSIP '01), Baltimore, MD, June 2001.
    • (2001) Proceedings of the 5th IEEE Workshop on Nonlinear Signal and Image Processing (NSIP '01)
    • Aarabi, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.