SCOPUS Information Retrieval Platform

MM 2013 - Proceedings of the 2013 ACM Multimedia Conference

Volume , Issue , 2013, Pages 835-838

Recent developments in openSMILE, the Munich open-source multimedia feature extractor

(4) Eyben, Florian ᵃ; Weninger, Felix ᵃ; Gross, Florian ᵃ; Schuller, Björn ᵃ

ᵃ TECHNICAL UNIVERSITY OF MUNICH (Germany)

Author keywords

Audio features; Multimodal fusion; Real time processing; Video features

Indexed keywords

AUDIO FEATURES; COMPONENT-BASED ARCHITECTURE; MULTI-MODAL FUSION; REALTIME PROCESSING; STATISTICAL CLASSIFIER; SUPPORT VECTOR MACHINE MODELS; VIDEO FEATURES; VOICE ACTIVITY DETECTION;

AUDITION; BATCH DATA PROCESSING; CLASSIFICATION (OF INFORMATION); FEATURE EXTRACTION; SPEECH RECOGNITION;

VIDEO SIGNAL PROCESSING;

EID: 84887494391 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2502081.2502224 Document Type: Conference Paper

Times cited : (1219)

References (12)

1
- 0003710380
- C.-C. Chang and C.-J. Lin. LibSVM: A library for support vector machines, 2001. http://www.csie.ntu.edu.tw/~cjlin/libsvm.
- (2001) LibSVM: A Library for Support Vector Machines
- Chang, C.-C.¹ Lin, C.-J.²

2
- 84878420060
- Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments
- Portland, OR, USA
- Z. Duan, G. J. Mysore, and P. Smaragdis. Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments. In Proc. of Interspeech, Portland, OR, USA, 2012.
- (2012) Proc. of Interspeech
- Duan, Z.¹ Mysore, G.J.² Smaragdis, P.³

3
- 84877673002
- Violent scenes detection with large, brute-forced acoustic and visual feature sets
- Workshop, Pisa, Italy, October
- F. Eyben, F. Weninger, N. Lehment, G. Rigoll, and B. Schuller. Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets. In Proceedings MediaEval 2012 Workshop, Pisa, Italy, October 2012. 2 pages.
- (2012) Proceedings MediaEval 2012 , pp. 2
- Eyben, F.¹ Weninger, F.² Lehment, N.³ Rigoll, G.⁴ Schuller, B.⁵

4
- 78650977476
- OpenSMILE-The Munich versatile and fast open-source audio feature extractor
- Florence, Italy, October, ACM
- F. Eyben, M. Wöllmer, and B. Schuller. openSMILE-The Munich Versatile and Fast Open-Source Audio Feature Extractor. In Proc. of ACM MM, pages 1459-1462, Florence, Italy, October 2010. ACM.
- (2010) Proc. of ACM MM , pp. 1459-1462
- Eyben, F.¹ Wöllmer, M.² Schuller, B.³

5
- 27744588611
- Framewise phoneme classification with bidirectional LSTM and other neural network architectures
- A. Graves and J. Schmidhuber. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, 18(5-6):602-610, 2005.
- (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

6
- 84869432703
- A two-channel acoustic front-end for robust automatic speech recognition in noisy and reverberant environments
- R. Maas, A. Schwarz, Y. Zheng, K. Reindl, S. Meier, A. Sehr, and W. Kellermann. A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments. In Proc. of CHiME, pages 41-46, 2011.
- (2011) Proc. of CHiME , pp. 41-46
- Maas, R.¹ Schwarz, A.² Zheng, Y.³ Reindl, K.⁴ Meier, S.⁵ Sehr, A.⁶ Kellermann, W.⁷

7
- 85032750851
- The computational paralinguistics challenge
- July
- B. Schuller. The Computational Paralinguistics Challenge. IEEE Signal Processing Magazine, 29(4):97-101, July 2012.
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.4 , pp. 97-101
- Schuller, B.¹

8
- 84906269266
- The INTERSPEECH 2013 computational paralinguistics challenge: Social signals, conflict, emotion, autism
- Lyon, France, August, ISCA. in press
- B. Schuller, S. Steidl, A. Batliner, A. Vinciarelli, K. Scherer, F. Ringeval, M. Chetouani, et al. The INTERSPEECH 2013 Computational Paralinguistics Challenge: Social Signals, Conflict, Emotion, Autism. In Proc. of INTERSPEECH, Lyon, France, August 2013. ISCA. in press.
- (2013) Proc. of INTERSPEECH
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Vinciarelli, A.⁴ Scherer, K.⁵ Ringeval, F.⁶ Chetouani, M.⁷

9
- 84878925980
- On the acoustics of emotion in audio: What speech, music and sound have in common
- DOI: 10.3389/fpsyg.2013.00292, in press
- F. Weninger, F. Eyben, B. W. Schuller, M. Mortillaro, and K. R. Scherer. On the Acoustics of Emotion in Audio: What Speech, Music and Sound have in Common. Frontiers in Emotion Science, 2013. DOI: 10.3389/fpsyg.2013.00292, in press.
- (2013) Frontiers in Emotion Science
- Weninger, F.¹ Eyben, F.² Schuller, B.W.³ Mortillaro, M.⁴ Scherer, K.R.⁵

10
- 84890532851
- Speaker trait characterization in web videos: Uniting speech, language, and facial features
- Vancouver, Canada, May, IEEE. in press
- F. Weninger, C. Wagner, M. Wöllmer, B. Schuller, and L.-P. Morency. Speaker Trait Characterization in Web Videos: Uniting Speech, Language, and Facial Features. In Proc. of ICASSP, Vancouver, Canada, May 2013. IEEE. in press.
- (2013) Proc. of ICASSP
- Weninger, F.¹ Wagner, C.² Wöllmer, M.³ Schuller, B.⁴ Morency, L.-P.⁵

11
- 0003957032
- 2nd Edition. Morgan Kaufmann, San Francisco, 2nd edition
- I. H. Witten and E. Frank. Data mining: Practical machine learning tools and techniques, 2nd Edition. Morgan Kaufmann, San Francisco, 2nd edition, 2005.
- (2005) Data Mining: Practical Machine Learning Tools and Techniques
- Witten, I.H.¹ Frank, E.²

12
- 64849090257
- Cambridge University Press
- S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw, X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland. The HTK book (v3.4). Cambridge University Press, 2006.
- (2006) The HTK Book (V3.4)
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.¹²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.