메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages

Audio-based affect detection in web videos

Author keywords

Affect detection; Audio concept detection; Audio segmentation

Indexed keywords

SEMANTICS;

EID: 84946037102     PISSN: 19457871     EISSN: 1945788X     Source Type: Conference Proceeding    
DOI: 10.1109/ICME.2015.7177525     Document Type: Conference Paper
Times cited : (8)

References (20)
  • 1
    • 84866707906 scopus 로고    scopus 로고
    • Evaluation of low-level features and their combinations for complex event detection in open source videos
    • A. Tamrakar et al., "Evaluation of low-level features and their combinations for complex event detection in open source videos, " CVPR, 2012.
    • (2012) CVPR
    • Tamrakar, A.1
  • 2
    • 84866712341 scopus 로고    scopus 로고
    • Multimodal feature fusion for robust event detection in web videos
    • P. Natarajan et al., "Multimodal feature fusion for robust event detection in web videos, " CVPR, 2012.
    • (2012) CVPR
    • Natarajan, P.1
  • 3
    • 84946012706 scopus 로고    scopus 로고
    • Recognizing emotions from student speech in tutoring dialogues
    • D. Littman and K. Forbes, "Recognizing emotions from student speech in tutoring dialogues, " ASRU, 2003.
    • (2003) ASRU
    • Littman, D.1    Forbes, K.2
  • 4
    • 80054836058 scopus 로고    scopus 로고
    • Avec 2011 - The first international audio visual emotion challenge
    • B. Schuller et al., "Avec 2011-the first international audio visual emotion challenge, " ACII, 2011.
    • (2011) ACII
    • Schuller, B.1
  • 5
    • 84946071078 scopus 로고    scopus 로고
    • Emotion detection in speech using deep networks
    • M. Amer et al., "Emotion detection in speech using deep networks, " ICASSP, 2013.
    • (2013) ICASSP
    • Amer, M.1
  • 6
    • 79959766559 scopus 로고    scopus 로고
    • Consumer video understanding: A benchmark database and an evaluation of human and machine performance
    • Y. Jiang et al., "Consumer video understanding: A benchmark database and an evaluation of human and machine performance, " ICMR, 2011.
    • (2011) ICMR
    • Jiang, Y.1
  • 7
    • 72549095204 scopus 로고    scopus 로고
    • Large-scale multimodal semantic concept detection for consumer video
    • S. Chang et al., "Large-scale multimodal semantic concept detection for consumer video, " ACM MIR, 2007.
    • (2007) ACM MIR
    • Chang, S.1
  • 8
    • 84946070923 scopus 로고    scopus 로고
    • Audio-based semantic concept classification for consumer video
    • K. Lee and D. Ellis, "Audio-based semantic concept classification for consumer video, " IEEE TASLP, 2010.
    • (2010) IEEE TASLP
    • Lee, K.1    Ellis, D.2
  • 9
    • 84865744986 scopus 로고    scopus 로고
    • Unsupervised learning of acoustic unit descriptors for audio content representation and classification
    • S. Chaudhuri et al., "Unsupervised learning of acoustic unit descriptors for audio content representation and classification, " in INTERSPEECH, 2011.
    • (2011) Interspeech
    • Chaudhuri, S.1
  • 10
    • 84906248945 scopus 로고    scopus 로고
    • All for one: Feature combination for highly channel-degraded speech activity detection
    • M. Graciarena et al., "All for one: Feature combination for highly channel-degraded speech activity detection, " INTERSPEECH, 2013.
    • (2013) Interspeech
    • Graciarena, M.1
  • 11
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • S. Chen et al., "Speaker, environment and channel change detection and clustering via the bayesian information criterion, " in DARPA Broadcast News Transcription and Understanding Workshop, 1998, pp. 127-132.
    • (1998) DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
    • Chen, S.1
  • 12
    • 14944367313 scopus 로고    scopus 로고
    • Minimal-impact audio-based personal archives
    • D. Ellis and K. Lee, "Minimal-impact audio-based personal archives, " in ACM Workshop on CARPE, 2004.
    • (2004) ACM Workshop on CARPE
    • Ellis, D.1    Lee, K.2
  • 13
    • 0035340677 scopus 로고    scopus 로고
    • Audio content analysis for online audiovisual data segmentation and classification
    • T. Zhang et al., "Audio content analysis for online audiovisual data segmentation and classification, " Speech and Audio Processing, IEEE Transactions on, 2001.
    • (2001) Speech and Audio Processing, IEEE Transactions on
    • Zhang, T.1
  • 14
    • 84864116485 scopus 로고    scopus 로고
    • Super: Towards real-time event recognition in internet videos
    • Y. Jiang, "Super: Towards real-time event recognition in internet videos, " ICMR, 2012.
    • (2012) ICMR
    • Jiang, Y.1
  • 15
    • 85009145332 scopus 로고    scopus 로고
    • Prosody-based automatic detection of annoyance and frustration in human-computer dialog
    • J. Ang et al., "Prosody-based automatic detection of annoyance and frustration in human-computer dialog, " in ICSLP, 2002.
    • (2002) ICSLP
    • Ang, J.1
  • 16
    • 84893945649 scopus 로고    scopus 로고
    • OpenSMILE: The Munich versatile and fast open-source audio feature extractor
    • F. Eyben et al., "openSMILE: The Munich versatile and fast open-source audio feature extractor, " in ACM MM, 2010.
    • (2010) ACM MM
    • Eyben, F.1
  • 17
    • 14344252374 scopus 로고    scopus 로고
    • Multiple kernel learning, conic duality, and the smo algorithm
    • F. Bach et al., "Multiple kernel learning, conic duality, and the smo algorithm, " ICML, 2004.
    • (2004) ICML
    • Bach, F.1
  • 20
    • 84875900272 scopus 로고    scopus 로고
    • Slic superpixels compared to stateof-the-art superpixel methods
    • R. Achanta et al., "Slic superpixels compared to stateof-the-art superpixel methods, " in IEEE PAMI, 2012.
    • (2012) IEEE PAMI
    • Achanta, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.