메뉴 건너뛰기




Volumn 49, Issue 7-8, 2007, Pages 667-677

Visual voice activity detection as a help for speech source separation from convolutive mixtures

Author keywords

Convolutive mixtures; Highly non stationary environments; Speech enhancement; Speech source separation; Visual speech processing; Voice activity detector

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; COMPUTATION THEORY; RADIO INTERFERENCE; SPEECH ENHANCEMENT;

EID: 34447095008     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.04.008     Document Type: Article
Times cited : (41)

References (20)
  • 1
    • 19944405106 scopus 로고    scopus 로고
    • A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources
    • Abrard F., and Deville Y. A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources. Signal Processing 85 7 (2005) 1389-1403
    • (2005) Signal Processing , vol.85 , Issue.7 , pp. 1389-1403
    • Abrard, F.1    Deville, Y.2
  • 2
    • 26044447327 scopus 로고    scopus 로고
    • Babaie-Zadeh, M., Mansour, A., Jutten, C., Marvasti, F., 2004. A geometric approach for separating several speech signals. In: Proc. Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), Granada, Spain, pp. 798-806.
  • 3
    • 0030362791 scopus 로고    scopus 로고
    • Bernstein, L.E., Benoît, C., 1996. For speech perception by humans or machines, three senses are better than one. In: Proc. Int. Conf. Spoken Language Processing (ICSLP), Philadelphia, USA, pp. 1477-1480.
  • 4
    • 0028996448 scopus 로고    scopus 로고
    • Capdevielle, V., Servière, C., Lacoume, J.-L., 1995. Blind separation of wide-band sources in the frequency domain. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Detroit, USA, 1995, pp. 2080-2083.
  • 5
    • 0032187518 scopus 로고    scopus 로고
    • Blind signal separation: statistical principles
    • Cardoso J.-F. Blind signal separation: statistical principles. Proceedings of the IEEE 86 10 (1998) 2009-2025
    • (1998) Proceedings of the IEEE , vol.86 , Issue.10 , pp. 2009-2025
    • Cardoso, J.-F.1
  • 6
    • 4544382447 scopus 로고    scopus 로고
    • Dansereau, R., 2004. Co-channel audiovisual speech separation using spectral matching constraints. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Montréal, Canada, 2004.
  • 7
    • 34447103915 scopus 로고    scopus 로고
    • Dapena, A., Bugallo, M.F., Castedo, L., 2001. Separation of convolutive mixtures of temporally-white signals: a novel frequency-domain approach. In: Proc. Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), San Diego, USA, pp. 315-320.
  • 8
    • 34447095971 scopus 로고    scopus 로고
    • Elisei, F., Odisio, M., Bailly, G., Badin, P., 2001. Creating and controlling video-realistic talking heads. In: Proc. Audio-Visual Speech Processing Workshop (AVSP), Aalborg, Denmark, pp. 90-97.
  • 9
    • 0034974093 scopus 로고    scopus 로고
    • Audio-visual enhancement of speech in noise
    • Girin L., Schwartz J.-L., and Feng G. Audio-visual enhancement of speech in noise. J. Acoust. Soc. Am. 109 6 (2001) 3007-3020
    • (2001) J. Acoust. Soc. Am. , vol.109 , Issue.6 , pp. 3007-3020
    • Girin, L.1    Schwartz, J.-L.2    Feng, G.3
  • 10
    • 34447107643 scopus 로고    scopus 로고
    • Lallouache, T., 1990. Un poste visage-parole. Acquisition et traitement des contours labiaux, in: Proc. Journées d'Etude sur la Parole (JEP) (French), Montréal.
  • 11
    • 34447101331 scopus 로고    scopus 로고
    • Le Goff, B., Guiard-Marigny, T., Benoît, C., 1995. Read my lips... and my jaw! How intelligible are the components of a speaker's face? In: Proc. Euro. Conf. on Speech Com. and Tech, Madrid, Spain, pp. 291-294.
  • 12
    • 4544351504 scopus 로고    scopus 로고
    • Liu, P., Wang, Z., 2004. Voice activity detection using visual information. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Montreal, 2004, pp. 609-612.
  • 13
    • 0000914334 scopus 로고    scopus 로고
    • Convolutive blind separation of non stationary sources
    • Parra L., and Spence C. Convolutive blind separation of non stationary sources. IEEE Trans. Speech Audio Process. 8 3 (2000) 320-327
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 320-327
    • Parra, L.1    Spence, C.2
  • 14
    • 34447102938 scopus 로고    scopus 로고
    • Pham, D.-T., Servière, C., Boumaraf, H., 2003. Blind separation of convolutive audio mixtures using nonstationarity. In: Proc. Int. Conf. Independent Component Analysis and Blind Source Separation (ICA), Nara, Japan.
  • 15
    • 4544247264 scopus 로고    scopus 로고
    • Rajaram, S., Nefian, A.V., Huang, T.S., 2004. Bayesian separation of audio-visual speech sources. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Montréal, Canada.
  • 16
    • 34447100075 scopus 로고    scopus 로고
    • Mixing audiovisual speech processing and blind source separation for the extraction of speech signals from convolutive mixtures
    • Rivet B., Girin L., and Jutten C. Mixing audiovisual speech processing and blind source separation for the extraction of speech signals from convolutive mixtures. IEEE Trans. Audio Speech Lang. Process. 15 1 (2007) 96-108
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.1 , pp. 96-108
    • Rivet, B.1    Girin, L.2    Jutten, C.3
  • 17
    • 10444247388 scopus 로고    scopus 로고
    • Developing an audio-visual speech source separation algorithm
    • Sodoyer D., Girin L., Jutten C., and Schwartz J.-L. Developing an audio-visual speech source separation algorithm. Speech Comm. 44 1-4 (2004) 113-125
    • (2004) Speech Comm. , vol.44 , Issue.1-4 , pp. 113-125
    • Sodoyer, D.1    Girin, L.2    Jutten, C.3    Schwartz, J.-L.4
  • 18
    • 33947625135 scopus 로고    scopus 로고
    • Sodoyer, D., Rivet, B., Girin, L., Schwartz, J.-L., Jutten, C., 2006. An analysis of visual speech information applied to voice activity detection. In: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Toulouse, France, pp. 601-604.
  • 19
    • 0001048664 scopus 로고
    • Visual contribution to speech intelligibility in noise
    • Sumby W., and Pollack I. Visual contribution to speech intelligibility in noise. J. Acoust. Soc. Am. 26 (1954) 212-215
    • (1954) J. Acoust. Soc. Am. , vol.26 , pp. 212-215
    • Sumby, W.1    Pollack, I.2
  • 20
    • 33646231347 scopus 로고    scopus 로고
    • Wang, W., Cosker, D., Hicks, Y., Sanei, S., Chambers, J.A., 2005. Video assisted speech source separation, in: Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Philadelphia, USA, 2005.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.