SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2005, Pages 2131-2134

A multimodal approach to extract optimized audio features for speaker detection

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO AND VIDEO; AUDIO FEATURES; COMMON SOURCE; MULTI-MODAL APPROACH; MUTUAL INFORMATIONS; SPEAKER DETECTION; SPEECH INFORMATION; VIDEO FEATURES;

INFORMATION THEORY; SIGNAL PROCESSING;

OPTIMIZATION;

EID: 84863714265 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (8)

References (12)

1
- 14844344462
- From error probability to information theoretic (multi-modal) signal processing
- T. Butz and J. P. Thiran, "From error probability to information theoretic (multi-modal) signal processing," Signal Processing, vol. 85, pp. 875-902, 2005.
- (2005) Signal Processing , vol.85 , pp. 875-902
- Butz, T.¹ Thiran, J.P.²

2
- 2642562769
- Speaker association with signal-level audiovisual fusion
- J. W. Fisher III and T. Darrell, "Speaker association with signal-level audiovisual fusion," IEEE Transaction on multimedia, pp. 406-413, 2004.
- (2004) IEEE Transaction on Multimedia , pp. 406-413
- Fisher III, J.W.¹ Darrell, T.²

3
- 35248827017
- Speaker localisation using audio-visual synchrony: An empirical study
- H. J. Nock, G. Iyengar, and C. Neti, "Speaker localisation using audio-visual synchrony: An empirical study," in CIVR, 2003, pp. 488-499.
- (2003) CIVR , pp. 488-499
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

4
- 84899028297
- Audio-vision: Using audio-visual synchrony to locate sounds
- J. Hershey and J. Movellan, "Audio-vision: Using audio-visual synchrony to locate sounds," in NIPS, vol. 12, 2000.
- (2000) NIPS , vol.12
- Hershey, J.¹ Movellan, J.²

5
- 84898931254
- Facesync: A linear operator for measuring synchronisation of video facial images and audio tracks
- M. Slaney and M. Covell, "Facesync: A linear operator for measuring synchronisation of video facial images and audio tracks," in NIPS, vol. 13, 2001.
- (2001) NIPS , vol.13
- Slaney, M.¹ Covell, M.²

8
- 0019698606
- Determining optical flow
- B. K. P. Horn and B. G. Schunck, "Determining optical flow," Artificial Intelligence, pp. 185-203, 1981.
- (1981) Artificial Intelligence , pp. 185-203
- Horn, B.K.P.¹ Schunck, B.G.²

9
- 0003901864
- John Wiley & sons, Inc
- B. Gold and N. Morgan, Speech and audio signal processing. John Wiley & sons, Inc, 2000.
- (2000) Speech and Audio Signal Processing
- Gold, B.¹ Morgan, N.²

11
- 0003813740
- Oxford science publications
- A. W. Bowman and A. Azzalini, Applied smoothing techniques for data analysis. Oxford science publications, 1997.
- (1997) Applied Smoothing Techniques for Data Analysis
- Bowman, A.W.¹ Azzalini, A.²

12
- 0004161838
- 2nd ed. Cambridge University Press
- W. H. Press, S. A. Teukolsky,W. T. Vetterling, and B. P. Flannery, Numerical Recipes in C, 2nd ed. Cambridge University Press, 1992.
- (1992) Numerical Recipes C
- Press, W.H.¹ Teukolsky, S.A.² Vetterling, W.T.³ Flannery, B.P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.