SCOPUS Information Search Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2009, Pages 1147-1150

Improving detection of acoustic events using audiovisual data and feature level fusion

(7) Butko, T.ᵃ; Canton-Ferrer, C.ᵃ; Segura, C.ᵃ; Giró, X.ᵃ; Nadeu, C.ᵃ; Hernando, J.ᵃ; Casas, J.R.ᵃ

ᵃ Universitat Politècnica de Catalunya (Spain)

Author keywords

Acoustic event detection; Acoustic localization; Hidden Markov models; Multimodal fusion; Multimodality

Indexed keywords

ACOUSTIC EVENTS; ACOUSTIC LOCALIZATION; AUDIO AND VIDEO; AUDIO INFORMATION; AUDIO-VISUAL DATA; DATA SETS; FEATURE LEVEL; FEATURE LEVEL FUSION; MULTI-MODAL FUSION; MULTI-MODALITY; SOCIAL ACTIVITIES;

HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH RECOGNITION;

AUDIO ACOUSTICS;

EID: 70450202244
PISSN: None
EISSN: 19909772
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: (4)

References (14)

1
- 0036816475
- Content analysis for audio classification and segmentation
- L. Lu, H. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation", IEEE Trans. on Speech and Audio Processing, vol. 10, pp. 504-516, 2002.
- (2002) IEEE Trans. on Speech and Audio Processing , vol.10 , pp. 504-516
- Lu, L.¹ Zhang, H.² Jiang, H.³

2
- 85009212099
- Environmental sound source identification based on hidden Markov models for robust speech recognition
- T. Nishiura, S. Nakamura, K. Miki, and K. Shikano, "Environmental sound source identification based on hidden Markov models for robust speech recognition", in Proc. Eurospeech, pp. 2157-2160, 2003.
- (2003) Proc. Eurospeech , pp. 2157-2160
- Nishiura, T.¹ Nakamura, S.² Miki, K.³ Shikano, K.⁴

3
- 70449590704
- Acoustic event detection and classification
- PhD thesis, Technical University of Catalonia
- A. Temko. "Acoustic event detection and classification", PhD thesis, Technical University of Catalonia, 2007.
- (2007)
- Temko, A.¹

4
- 84867206190
- CHIL, Computers in the Human Interaction Loop, EU project
- CHIL - Computers in the Human Interaction Loop - EU project. http://chil.server.de, 2004-2007.
- (2004)

5
- 38049176869
- CLEAR Evaluation of Acoustic Event Detection and Classification systems
- Multimodal Technologies for Perception of Humans, Springer
- A. Temko, R. Malkin, C. Zieger, D. Macho, C. Nadeu, M. Omologo, "CLEAR Evaluation of Acoustic Event Detection and Classification systems", in Multimodal Technologies for Perception of Humans, LNCS, vol. 4122, Springer, 2007.
- (2007) LNCS , vol.4122
- Temko, A.¹ Malkin, R.² Zieger, C.³ Macho, D.⁴ Nadeu, C.⁵ Omologo, M.⁶

6
- 84867199582
- Fusion of Audio and Video Modalities for Detection of Acoustic Events
- T. Butko, A. Temko, C. Nadeu and C. Canton, "Fusion of Audio and Video Modalities for Detection of Acoustic Events", in Proc. Interspeech, pp. 123-126, 2008.
- (2008) Proc. Interspeech , pp. 123-126
- Butko, T.¹ Temko, A.² Nadeu, C.³ Canton, C.⁴

7
- 70449558459
- Audiovisual Event Detection Towards Scene Understanding
- C. Canton-Ferrer, T. Butko, C. Segura, X. Giró, C. Nadeu, J. Hernando, J.R. Casas, "Audiovisual Event Detection Towards Scene Understanding", in Proc. IEEE Int. Conference on Computer Vision and Pattern Recognition, 2009.
- (2009) Proc. IEEE Int. Conference on Computer Vision and Pattern Recognition
- Canton-Ferrer, C.¹ Butko, T.² Segura, C.³ Giró, X.⁴ Nadeu, C.⁵ Hernando, J.⁶ Casas, J.R.⁷

8
- 85135144525
- On the decorrelation of filter-bank energies in speech recognition
- C. Nadeu, J. Hernando, and M. Gorricho, "On the decorrelation of filter-bank energies in speech recognition", in Proc. European Speech Processing Conference, pp. 1381-1384, 1995.
- (1995) Proc. European Speech Processing Conference , pp. 1381-1384
- Nadeu, C.¹ Hernando, J.² Gorricho, M.³

9
- 56749117943
- In defense of One-Vs-All Classification
- R. Rifkin, A. Klautau, "In defense of One-Vs-All Classification", Journal of Machine Learning Research, vol. 5, pp. 101-141, 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 101-141
- Rifkin, R.¹ Klautau, A.²

10
- 70450188181
- Microphone Arrays: Techniques and Applications
- J. DiBiase, H. Silverman, and M. Brandstein, "Microphone Arrays: Techniques and Applications", M. S. Brandstein and D. B. Ward, Eds, pp. 157-180, Springer-Verlag, 2001.

11
- 69949162388
- Particle filtering and sparse sampling for multi-person 3D tracking
- C. Canton-Ferrer, R. Sblendido, J. R. Casas, and M. Pardas, "Particle filtering and sparse sampling for multi-person 3D tracking", in Proc. IEEE Int. Conf. on Image Processing, pp. 2644-2647, 2008.
- (2008) Proc. IEEE Int. Conf. on Image Processing , pp. 2644-2647
- Canton-Ferrer, C.¹ Sblendido, R.² Casas, J.R.³ Pardas, M.⁴

12
- 0032634283
- Adaptive background mixture models for real-time tracking
- C. Stauffer and W. Grimson, "Adaptive background mixture models for real-time tracking", in Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 252-259, 1999.
- (1999) Proc. IEEE Conf. on Computer Vision and Pattern Recognition , pp. 252-259
- Stauffer, C.¹ Grimson, W.²

13
- 46749125771
- Composite object detection in video sequences: Applications to controlled environments
- X. Giró and F. Marqués, "Composite object detection in video sequences: Applications to controlled environments", in Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services, pp. 1-4, 2007.
- (2007) Proc. Int. Workshop on Image Analysis for Multimedia Interactive Services , pp. 1-4
- Giró, X.¹ Marqués, F.²

14
- 84925639646
- Real-time lip tracking and bi-modal continuous speech recognition
- M. T. Chan, Y. Zhang, and T. S. Huang, "Real-time lip tracking and bi-modal continuous speech recognition", in Proc. IEEE Workshop on Multimedia Signal Processing, pp. 65-70, 1998.
- (1998) Proc. IEEE Workshop on Multimedia Signal Processing , pp. 65-70
- Chan, M.T.¹ Zhang, Y.² Huang, T.S.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.