SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2011, Pages 349-352

Improving acoustic event detection using generalizable visual features and multi-modality modeling

(3) Huang, Po Sen a Zhuang, Xiaodan a Hasegawa Johnson, Mark a

a UNIVERSITY OF ILLINOIS AT URBANA CHAMPAIGN (United States)

Author keywords

acoustic event detection; coupled hidden Markov models; hidden Markov models; multi stream HMM; optical flow

Indexed keywords

ACOUSTIC EVENT CLASSIFICATION; ACOUSTIC EVENTS; ASYNCHRONY; AUDIO-VISUAL; COUPLED HIDDEN MARKOV MODELS; DATA RESOURCES; DATA SETS; JOINT MODELING; LOCALIZATION INFORMATION; MULTI-MODALITY; MULTI-STREAM; MULTI-STREAM HMM; STATE-SPACE; TIME STAMPS; VIDEO DATA; VIDEO STREAMS; VISUAL FEATURE; VISUAL REPRESENTATIONS;

CLASSIFICATION (OF INFORMATION); DETECTORS; HIDDEN MARKOV MODELS; MULTIMEDIA SYSTEMS; OPTICAL FLOWS; SIGNAL DETECTION; SPEECH COMMUNICATION; SPEECH RECOGNITION;

AUDIO ACOUSTICS;

EID: 80051652444 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2011.5946412 Document Type: Conference Paper

Times cited : (7)

References (17)

1
- 70449558459
- Audiovisual event detection towards scene understanding
- C. Canton-Ferrer, T. Butko, C. Segura, X. Giro, C. Nadeu, J. Hernando, and J. Casas, "Audiovisual event detection towards scene understanding," in IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2009, pp. 81-88.
- IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2009 , pp. 81-88
- Canton-Ferrer, C.¹ Butko, T.² Segura, C.³ Giro, X.⁴ Nadeu, C.⁵ Hernando, J.⁶ Casas, J.⁷

2
- 33750548608
- Events detection for an audio-based surveillance system
- C. Clavel, T. Ehrette, and G. Richard, "Events detection for an audio-based surveillance system," in IEEE International Conference on Multimedia and Expo, 2005.
- IEEE International Conference on Multimedia and Expo, 2005
- Clavel, C.¹ Ehrette, T.² Richard, G.³

3
- 0141479982
- Using speech/non-speech detection to bias recognition search on noisy data
- F. Beaufays, D. Boies, M. Weintraub, and Q. Zhu, "Using speech/non-speech detection to bias recognition search on noisy data," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003.
- IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003
- Beaufays, F.¹ Boies, D.² Weintraub, M.³ Zhu, Q.⁴

4
- 51449100311
- A. Temko, "CLEAR 2007 AED evaluation plan," http://isl.ira.uka.de/clear07, 2007.
- (2007) CLEAR 2007 AED Evaluation Plan
- Temko, A.¹

5
- 77955558847
- Real-world acoustic event detection
- X. Zhuang, X. Zhou, M. A. Hasegawa-Johnson, and T. S. Huang, "Real-world acoustic event detection," Pattern Recognition Letters, vol. 31, no. 12, pp. 1543-1551, 2010.
- (2010) Pattern Recognition Letters , vol.31 , Issue.12 , pp. 1543-1551
- Zhuang, X.¹ Zhou, X.² Hasegawa-Johnson, M.A.³ Huang, T.S.⁴

6
- 84908294933
- Duration-dependent input-output Markov models for audio-visual event detection
- M. R. Naphade, A. Garg, and T. Huang, "Duration-dependent input-output Markov models for audio-visual event detection," in International Conference on Multimedia, 2001.
- International Conference on Multimedia, 2001
- Naphade, M.R.¹ Garg, A.² Huang, T.³

7
- 55349112006
- Audio-visual event recognition with application in sports video
- Z. Xiong, R. Radhakrishnan, A. Divakaran, and T. Huang, "Audio-visual event recognition with application in sports video," in Intelligent Multimedia Processing with Soft Computing, ser. Studies in Fuzziness and Soft Computing, 2005, vol. 168, pp. 129-149.
- (2005) Intelligent Multimedia Processing with Soft Computing, Ser. Studies in Fuzziness and Soft Computing , vol.168 , pp. 129-149
- Xiong, Z.¹ Radhakrishnan, R.² Divakaran, A.³ Huang, T.⁴

8
- 0036295989
- Audio-visual speech modeling using coupled hidden Markov models
- S. M. Chu and T. S. Huang, "Audio-visual speech modeling using coupled hidden Markov models," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.
- IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002
- Chu, S.M.¹ Huang, T.S.²

9
- 4544228318
- Identity verification using speech and face information
- C. Sanderson and K. K. Paliwal, "Identity verification using speech and face information," Digital Signal Processing, vol. 14, no. 5, pp. 449-480, 2004.
- (2004) Digital Signal Processing , vol.14 , Issue.5 , pp. 449-480
- Sanderson, C.¹ Paliwal, K.K.²

10
- 38349007037
- A duality based approach for realtime tv-l1 optical flow
- C. Zach, T. Pock, and H. Bischof, "A duality based approach for realtime tv-l1 optical flow," in Pattern Recognition (Proc. DAGM), Heidelberg, Germany, 2007, pp. 214-223.
- Pattern Recognition (Proc. DAGM), Heidelberg, Germany, 2007 , pp. 214-223
- Zach, C.¹ Pock, T.² Bischof, H.³

11
- 77957934489
- Human action recognition with line and flow histograms
- N. Ikizler, R. Cinbis, and P. Duygulu, "Human action recognition with line and flow histograms," in 19th International Conference on Pattern Recognition, 2008.
- 19th International Conference on Pattern Recognition, 2008
- Ikizler, N.¹ Cinbis, R.² Duygulu, P.³

12
- 33845572523
- Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
- S. Lazebnik, C. Schmid, and J. Ponce, "Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories," in CVPR, 2006.
- (2006) CVPR
- Lazebnik, S.¹ Schmid, C.² Ponce, J.³

13
- 0001432664
- On the integration of auditory and visual parameters in an HMM-based ASR
- D. G. Stork and M. E. Hennecke (Eds.), Berlin: Springer-Verlag
- A. Adjoudani and C. Benoit, "On the integration of auditory and visual parameters in an HMM-based ASR," In D. G. Stork and M. E. Hennecke (Eds.), Speechreading by Humans and Machines. Berlin: Springer-Verlag, pp. 461-471, 1996.
- (1996) Speechreading by Humans and Machines , pp. 461-471
- Adjoudani, A.¹ Benoit, C.²

14
- 0003770986
- Comparingmodels for audiovisual fusion in a noisy-vowel recognition task
- P. Teissier, J. Robert-Ribes, and J. L. Schwartz, "Comparingmodels for audiovisual fusion in a noisy-vowel recognition task," IEEE Trans. Speech Audio Processing, vol. vol. 7, pp. 629-642, 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , pp. 629-642
- Teissier, P.¹ Robert-Ribes, J.² Schwartz, J.L.³

15
- 71149099083
- Multi-view clustering via canonical correlation analysis
- K. Chaudhuri, S. M. Kakade, and K. Livescu, "Multi-view clustering via canonical correlation analysis," in In Proc. of ICML09, 2009.
- Proc. of ICML09, 2009
- Chaudhuri, K.¹ Kakade, S.M.² Livescu, K.³

16
- 0036650148
- Statistical multimodal integration for audio-visual speech processing
- S. Nakamura, "Statistical multimodal integration for audio-visual speech processing," IEEE Transactions on Neural Networks, vol. 13, no. 4, 2002.
- (2002) IEEE Transactions on Neural Networks , vol.13 , Issue.4
- Nakamura, S.¹

17
- 51449101221
- Feature analysis and selection for acoustic event detection
- X. Zhuang, X. Zhou, T. S. Huang, and M. Hasegawa-Johnson, "Feature analysis and selection for acoustic event detection," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2008.
- IEEE International Conference on Acoustics, Speech, and Signal Processing, 2008
- Zhuang, X.¹ Zhou, X.² Huang, T.S.³ Hasegawa-Johnson, M.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.