Volume , Issue , 2005, Pages 2131-2134

A multimodal approach to extract optimized audio features for speaker detection

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO AND VIDEO; AUDIO FEATURES; COMMON SOURCE; MULTI-MODAL APPROACH; MUTUAL INFORMATIONS; SPEAKER DETECTION; SPEECH INFORMATION; VIDEO FEATURES

EID: 84863714265     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (8)

References (12)
  • 1
    • 14844344462
    • From error probability to information theoretic (multi-modal) signal processing
    • T. Butz and J. P. Thiran, "From error probability to information theoretic (multi-modal) signal processing," Signal Processing, vol. 85, pp. 875-902, 2005.
    • (2005) Signal Processing , vol.85 , pp. 875-902
    • Butz, T.1    Thiran, J.P.2
  • 2
    • 2642562769
    • Speaker association with signal-level audiovisual fusion
    • J. W. Fisher III and T. Darrell, "Speaker association with signal-level audiovisual fusion," IEEE Transactions on Multimedia, pp. 406-413, 2004.
    • (2004) IEEE Transactions on Multimedia , pp. 406-413
    • Fisher III, J.W.1    Darrell, T.2
  • 3
    • 35248827017
    • Speaker localisation using audio-visual synchrony: An empirical study
    • H. J. Nock, G. Iyengar, and C. Neti, "Speaker localisation using audio-visual synchrony: An empirical study," in CIVR, 2003, pp. 488-499.
    • (2003) CIVR , pp. 488-499
    • Nock, H.J.1    Iyengar, G.2    Neti, C.3
  • 4
    • 84899028297
    • Audio-vision: Using audio-visual synchrony to locate sounds
    • J. Hershey and J. Movellan, "Audio-vision: Using audio-visual synchrony to locate sounds," in NIPS, vol. 12, 2000.
    • (2000) NIPS , vol.12
    • Hershey, J.1    Movellan, J.2
  • 5
    • 84898931254
    • Facesync: A linear operator for measuring synchronisation of video facial images and audio tracks
    • M. Slaney and M. Covell, "Facesync: A linear operator for measuring synchronisation of video facial images and audio tracks," in NIPS, vol. 13, 2001.
    • (2001) NIPS , vol.13
    • Slaney, M.1    Covell, M.2
  • 6
    • 13444275916
    • Audio/visual independent components
    • Nara, Japan, April
    • P. Smaragdis and M. Casey, "Audio/visual independent components," in ICA, Nara, Japan, April 2003.
    • (2003) ICA
    • Smaragdis, P.1    Casey, M.2
  • 7
    • 84863697795
    • Feature space mutual information in speech-video sequences
    • Lausanne, Switzerland
    • T. Butz and J. P. Thiran, "Feature space mutual information in speech-video sequences," in ICME, vol. II, Lausanne, Switzerland, 2002.
    • (2002) ICME , vol.2
    • Butz, T.1    Thiran, J.P.2
  • 10
    • 0027659197
    • Signal modeling techniques in speech recognition
    • Sept.
    • J. W. Picone, "Signal modeling techniques in speech recognition," in Proceedings of the IEEE, vol. 81, no. 9, Sept. 1993.
    • (1993) Proceedings of the IEEE , vol.81 , Issue.9
    • Picone, J.W.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.