메뉴 건너뛰기




Volumn 2014-September, Issue Septmber, 2014, Pages

Improved audio features for large-scale multimedia event detection

Author keywords

acoustic event detection; computational acoustic scene analysis; multimedia retrieval

Indexed keywords

IMAGE RETRIEVAL; NEURAL NETWORKS; SEMANTICS;

EID: 84937509499     PISSN: 19457871     EISSN: 1945788X     Source Type: Conference Proceeding    
DOI: 10.1109/ICME.2014.6890234     Document Type: Conference Paper
Times cited : (18)

References (28)
  • 1
    • 85085788280 scopus 로고    scopus 로고
    • Trecvid 2013-An introduction to the goals, tasks, data, evaluation mechanisms, and metrics
    • Gaithersburg, MD; U.S.A., Nov., National Institute of Standards and Technology
    • Paul Over, Jon Fiscus, and Greg Sanders, "TRECVID 2013-An introduction to the goals, tasks, data, evaluation mechanisms, and metrics, " in Proc. TRECVID, Gaithersburg, MD; U.S.A., Nov. 2013, National Institute of Standards and Technology, http://wwwnlpir. nist.gov/projects/tv2013/.
    • (2013) Proc. TRECVID
    • Over, P.1    Fiscus, J.2    Sanders, G.3
  • 8
    • 84905270442 scopus 로고    scopus 로고
    • IBM research and columbia university trecvid-2011 multimedia event detection (med) system
    • Gaithersburg, MD; U.S.A. , Nov., National Institute of Standards and Technology
    • Liangliang Cao, Shih-Fu Chang, Noel Codella, Courtenay Cotton, Dan Ellis, Leiguang Gong, Matthew Hill, Gang Hua, John Kender, Michele Merler, Yadong Mu, Apostol Natseve, and John R. Smith, " IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System," in Proc. TRECVID, Gaithersburg, MD; U.S.A. , Nov. 2011, National Institute of Standards and Technology, http://wwwnlpir. nist.gov/projects/tv2011/.
    • (2011) Proc. TRECVID
    • Cao, L.1    Chang, S.-F.2    Codella, N.3    Cotton, C.4    Ellis, D.5    Gong, L.6    Hill, M.7    Hua, G.8    Kender, J.9    Merler, M.10    Mu, Y.11    Natseve, A.12    Smith, J.R.13
  • 9
    • 84906214187 scopus 로고    scopus 로고
    • Robust audio codebooks for large scale event detection in consumer videos
    • Lyon; France, Aug., ISCA
    • Shourabh Rawat, Peter Schulam, Susanne Burger, Duo Ding, Yipei Wang, and Florian Metze, " Robust audio codebooks for large scale event detection in consumer videos," in Proc. INTERSPEECH, Lyon; France, Aug. 2013, ISCA.
    • (2013) Proc. INTERSPEECH
    • Rawat, S.1    Schulam, P.2    Burger, S.3    Ding, D.4    Wang, Y.5    Metze, F.6
  • 10
    • 84878606595 scopus 로고    scopus 로고
    • Bag-of-Audiowords approach for multimedia event classification
    • Stephanie Pancoast and Murat Akbacak, "Bag-of-Audiowords approach for multimedia event classification," In Proc. INTERSPEECH [27].
    • Proc. INTERSPEECH [ , vol.27
    • Pancoast, S.1    Akbacak, M.2
  • 14
    • 84937415065 scopus 로고    scopus 로고
    • National Institute of Standards of Technology Aug. 2013, Last acccessed: April 15
    • National Institute of Standards of Technology, " 2013 TRECVID Multimedia Event Detection Track," http://www.nist.gov/itl/iad/mig/med13.cfm, Aug. 2013, Last acccessed: April 15, 2014.
    • (2014) 2013 TRECVID Multimedia Event Detection Track
  • 17
    • 84953744816 scopus 로고
    • A statistical interpretation of term specificity and its application in retrieval
    • Karen Sparck Jones, " A statistical interpretation of term specificity and its application in retrieval," Journal of Documentation, 1972.
    • (1972) Journal of Documentation
    • Jones, K.S.1
  • 18
    • 84937454189 scopus 로고    scopus 로고
    • Extracting deep bottleneck features using stacked auto-encoders
    • Jonas Gehring, Yajie Miao, Florian Metze, and Alex Waibel, " Extracting deep bottleneck features using stacked auto-encoders," In Proc. ICASSP [28].
    • Proc. ICASSP [ , vol.28
    • Gehring, J.1    Miao, Y.2    Metze, F.3    Waibel, A.4
  • 19
    • 84890499569 scopus 로고    scopus 로고
    • Unsupervised hierarchical structure induction for deeper semantic analysis of audio
    • Sourish Chaudhuri and Bhiksha Raj, "Unsupervised hierarchical structure induction for deeper semantic analysis of audio, " In Proc. ICASSP [28], pp. 833-837.
    • Proc. ICASSP , vol.28 , pp. 833-837
    • Chaudhuri, S.1    Raj, B.2
  • 20
    • 51449103447 scopus 로고    scopus 로고
    • Optimizing bottleneck features for lvcsr
    • Las Vegas, NV; U.S.A. Apr. IEEE
    • Frantisek Grézl and Petr Fousek, "Optimizing bottleneck features for LVCSR, " in Proc. ICASSP, Las Vegas, NV; U.S.A., Apr. 2008, IEEE.
    • (2008) Proc. ICASSP
    • Grézl, F.1    Fousek, P.2
  • 22
    • 84937454189 scopus 로고    scopus 로고
    • Extracting deep bottleneck features using stacked auto-encoders
    • 22] Jonas Gehring, Yajie Miao, Florian Metze, and Alex Waibel, " Extracting Deep Bottleneck Features Using Stacked Auto-Encoders," In Proc. ICASSP [28].
    • Proc. ICASSP [ , vol.28
    • Gehring, J.1    Miao, Y.2    Metze, F.3    Waibel, A.4
  • 25
    • 78650977476 scopus 로고    scopus 로고
    • Opensmile: The munich versatile and fast open-source audio feature extractor
    • New York, NY; USA, MM '10 ACM
    • Florian Eyben, Martin Wöllmer, and Björn Schuller, " Opensmile: the Munich versatile and fast open-source audio feature extractor," in Proceedings of the International Conference on Multimedia, New York, NY; USA, 2010, MM '10, pp. 1459-1462, ACM.
    • (2010) Proceedings of the International Conference on Multimedia , pp. 1459-1462
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3
  • 26
    • 84890530296 scopus 로고    scopus 로고
    • Subband autocorrelation features for video soundtrack classification
    • Courtenay V. Cotton and Dan P.W. Ellis, " Subband autocorrelation features for video soundtrack classification," In Proc. ICASSP [28], pp. 8663-8666.
    • Proc. ICASSP [ , vol.28 , pp. 8663-8666
    • Cotton, C.V.1    Ellis, D.P.W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.