메뉴 건너뛰기




Volumn , Issue , 2013, Pages 835-838

Recent developments in openSMILE, the munich open-source multimedia feature extractor

Author keywords

Audio features; Multimodal fusion; Real time processing; Video features

Indexed keywords

AUDIO FEATURES; COMPONENT-BASED ARCHITECTURE; MULTI-MODAL FUSION; REALTIME PROCESSING; STATISTICAL CLASSIFIER; SUPPORT VECTOR MACHINE MODELS; VIDEO FEATURES; VOICE ACTIVITY DETECTION;

EID: 84887494391     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2502081.2502224     Document Type: Conference Paper
Times cited : (1219)

References (12)
  • 2
    • 84878420060 scopus 로고    scopus 로고
    • Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments
    • Portland, OR, USA
    • Z. Duan, G. J. Mysore, and P. Smaragdis. Speech enhancement by online non-negative spectrogram decomposition in non-stationary noise environments. In Proc. of Interspeech, Portland, OR, USA, 2012.
    • (2012) Proc. of Interspeech
    • Duan, Z.1    Mysore, G.J.2    Smaragdis, P.3
  • 3
    • 84877673002 scopus 로고    scopus 로고
    • Violent scenes detection with large, brute-forced acoustic and visual feature sets
    • Workshop, Pisa, Italy, October
    • F. Eyben, F. Weninger, N. Lehment, G. Rigoll, and B. Schuller. Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets. In Proceedings MediaEval 2012 Workshop, Pisa, Italy, October 2012. 2 pages.
    • (2012) Proceedings MediaEval 2012 , pp. 2
    • Eyben, F.1    Weninger, F.2    Lehment, N.3    Rigoll, G.4    Schuller, B.5
  • 4
    • 78650977476 scopus 로고    scopus 로고
    • OpenSMILE-The munich versatile and fast open-source audio feature extractor
    • Florence, Italy, October, ACM
    • F. Eyben, M. Ẅollmer, and B. Schuller. openSMILE-The Munich Versatile and Fast Open-Source Audio Feature Extractor. In Proc. of ACM MM, pages 1459{1462, Florence, Italy, October 2010. ACM.
    • (2010) Proc. of ACM MM , pp. 1459-1462
    • Eyben, F.1    Ẅollmer, M.2    Schulle, B.3
  • 5
    • 27744588611 scopus 로고    scopus 로고
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • A. Graves and J. Schmidhuber. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, 18(5-6):602{610, 2005.
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 6
    • 84869432703 scopus 로고    scopus 로고
    • A two-channel acoustic front-end for robust automatic speech recognition in noisy and reverberant environments
    • R. Maas, A. Schwarz, Y. Zheng, K. Reindl, S. Meier, A. Sehr, and W. Kellermann. A Two-Channel Acoustic Front-End for Robust Automatic Speech Recognition in Noisy and Reverberant Environments. In Proc. of CHiME, pages 41{46, 2011.
    • (2011) Proc. of CHiME , pp. 41-46
    • Maas, R.1    Schwarz, A.2    Zheng, Y.3    Reindl, K.4    Meier, S.5    Sehr, A.6    Kellermann, W.7
  • 7
    • 85032750851 scopus 로고    scopus 로고
    • The computational paralinguistics challenge
    • July
    • B. Schuller. The Computational Paralinguistics Challenge. IEEE Signal Processing Magazine, 29(4):97{101, July 2012.
    • (2012) IEEE Signal Processing Magazine , vol.29 , Issue.4 , pp. 97-101
    • Schuller, B.1
  • 8
    • 84906269266 scopus 로고    scopus 로고
    • The INTERSPEECH 2013 computational paralinguistics challenge: Social signals, conict, emotion, autism
    • Lyon, France, August, ISCA. in press
    • B. Schuller, S. Steidl, A. Batliner, A. Vinciarelli, K. Scherer, F. Ringeval, M. Chetouani, et al. The INTERSPEECH 2013 Computational Paralinguistics Challenge: Social Signals, Conict, Emotion, Autism. In Proc. of INTERSPEECH, Lyon, France, August 2013. ISCA. in press.
    • (2013) Proc. of INTERSPEECH
    • Schuller, B.1    Steidl, S.2    Batliner, A.3    Vinciarelli, A.4    Scherer, K.5    Ringeval, F.6    Chetouani, M.7
  • 9
    • 84878925980 scopus 로고    scopus 로고
    • On the acoustics of emotion in audio: What speech, music and sound have in common
    • DOI: 10.3389/fpsyg.2013.00292, in press
    • F. Weninger, F. Eyben, B. W. Schuller, M. Mortillaro, and K. R. Scherer. On the Acoustics of Emotion in Audio: What Speech, Music and Sound have in Common. Frontiers in Emotion Science, 2013. DOI: 10.3389/fpsyg.2013.00292, in press.
    • (2013) Frontiers in Emotion Science
    • Weninger, F.1    Eyben, F.2    Schuller, B.W.3    Mortillaro, M.4    Scherer, K.R.5
  • 10
    • 84890532851 scopus 로고    scopus 로고
    • Speaker trait characterization in web videos: Uniting speech, language, and facial features
    • Vancouver, Canada, May, IEEE. in press
    • F. Weninger, C. Wagner, M. Ẅollmer, B. Schuller, and L.-P. Morency. Speaker Trait Characterization in Web Videos: Uniting Speech, Language, and Facial Features. In Proc. of ICASSP, Vancouver, Canada, May 2013. IEEE. in press.
    • (2013) Proc. of ICASSP
    • Weninger, F.1    Wagner, C.2    Ẅollmer, M.3    Schuller, B.4    Morency, L.-P.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.