SCOPUS Information Retrieval Platform

Volume , Issue , 2013, Pages

A database and challenge for acoustic scene classification and event detection

(6) Giannoulis, Dimitrios (a); Stowell, Dan (a); Benetos, Emmanouil (b); Rossignol, Mathias (c); Lagrange, Mathieu (c); Plumbley, Mark D. (a)

a QUEEN MARY UNIVERSITY OF LONDON (United Kingdom)

b CITY UNIVERSITY (United Kingdom)

c IRCAM (France)

Author keywords

acoustic event detection; acoustic scene classification; Computational auditory scene analysis

Indexed keywords

SIGNAL PROCESSING;

ACOUSTIC EVENT DETECTIONS; BASELINE METHODS; COMPUTATIONAL AUDITORY SCENE ANALYSIS; EVALUATION FRAMEWORK; EVALUATION METRICS; EVENT DETECTION; OPEN-SOURCE CODE; SCENE CLASSIFICATION;

OPEN SOURCE SOFTWARE;

EID: 84893585319
PISSN: 22195491
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (41)

References (24)

1
- 71049180205
- Computational auditory scene analysis
- IEEE Press
- D. L. Wang and G. J. Brown, Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, IEEE Press, 2006.
- (2006) Principles, Algorithms, and Applications
- Wang, D.L.¹ Brown, G.J.²

2
- 35348847678
- The CLEAR 2006 evaluation
- R. Stiefelhagen, K. Bernardin, R. Bowers, J. Garofolo, D. Mostefa, and P. Soundararajan, "The CLEAR 2006 evaluation," Multimodal Technologies for Perception of Humans, pp. 1-44, 2007.
- (2007) Multimodal Technologies for Perception of Humans , pp. 1-44
- Stiefelhagen, R.¹ Bernardin, K.² Bowers, R.³ Garofolo, J.⁴ Mostefa, D.⁵ Soundararajan, P.⁶

3
- 84905274625
- Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics
- P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, W. Kraaij, A. F. Smeaton, and G. Quénot, "Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics," in Proc TRECVID, 2012.
- (2012) Proc TRECVID
- Over, P.¹ Awad, G.² Michel, M.³ Fiscus, J.⁴ Sanders, G.⁵ Shaw, B.⁶ Kraaij, W.⁷ Smeaton, A.F.⁸ Quénot, G.⁹

4
- 84901299901
- The Albayzin 2010 language recognition evaluation
- L. J. Rodriguez-Fuentes, M. Penagarikano, A. Varona, M. Diez, and G. Bordel, "The Albayzin 2010 language recognition evaluation," in Proc InterSpeech, 2011, pp. 28-31.
- (2011) Proc InterSpeech , pp. 28-31
- Rodriguez-Fuentes, L.J.¹ Penagarikano, M.² Varona, A.³ Diez, M.⁴ Bordel, G.⁵

5
- 84893548504
- Detection and classification of acoustic scenes and events, an IEEE AASP challenge
- Queen Mary University of London
- D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange, and M. P. Plumbley, "Detection and classification of acoustic scenes and events, an IEEE AASP challenge," Tech. Rep. EECSRR-13-01, Queen Mary University of London, 2013.
- (2013) Tech. Rep. EECSRR-13-01
- Giannoulis, D.¹ Benetos, E.² Stowell, D.³ Rossignol, M.⁴ Lagrange, M.⁵ Plumbley, M.P.⁶

6
- 34547645414
- The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
- J.-J. Aucouturier, B. Defreville, and F. Pachet, "The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music," Journal of the Acoustical Society of America, vol. 122, pp. 881, 2007.
- (2007) Journal of the Acoustical Society of America , vol.122 , pp. 881
- Aucouturier, J.-J.¹ Defreville, B.² Pachet, F.³

7
- 84872728921
- MS thesis
- B. Cauchi, "Non-negative matrix factorisation applied to auditory scenes classification," MS thesis, 2011.
- (2011) Non-negative Matrix Factorisation Applied to Auditory Scenes Classification
- Cauchi, B.¹

8
- 84901321713
- Characterization of acoustic scenes using a temporally-constrained shift-invariant model
- York, UK
- E. Benetos, M. Lagrange, and S. Dixon, "Characterization of acoustic scenes using a temporally-constrained shift-invariant model," in Proc DAFX, York, UK, 2012.
- (2012) Proc DAFX
- Benetos, E.¹ Lagrange, M.² Dixon, S.³

9
- 68149163531
- Environmental sound recognition with time-frequency audio features
- S. Chu, S. Narayanan, and C.-C. Jay Kuo, "Environmental sound recognition with time-frequency audio features," IEEE Trans Audio, Speech and Language Processing, vol. 17, no. 6, pp. 1142-1158, 2009.
- (2009) IEEE Trans Audio, Speech and Language Processing , vol.17 , Issue.6 , pp. 1142-1158
- Chu, S.¹ Narayanan, S.² Jay Kuo, C.-C.³

10
- 84890493220
- Acoustic event detection in real life recordings
- A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen, "Acoustic event detection in real life recordings," in Proc EUSIPCO, 2010.
- (2010) Proc EUSIPCO
- Mesaros, A.¹ Heittola, T.² Eronen, A.³ Virtanen, T.⁴

11
- 84876152720
- Sound event detection in multisource environments using source separation
- T. Heittola, A. Mesaros, T. Virtanen, and A. Eronen, "Sound event detection in multisource environments using source separation," in Proc CHiME, 2011, pp. 36-40.
- (2011) Proc CHiME , pp. 36-40
- Heittola, T.¹ Mesaros, A.² Virtanen, T.³ Eronen, A.⁴

12
- 84887056523
- Context-dependent sound event detection
- T. Heittola, A. Mesaros, A. Eronen, and T. Virtanen, "Context-dependent sound event detection," EURASIP Journal on Audio, Speech, and Music Processing, vol. 2013, no. 1, 2013.
- (2013) EURASIP Journal on Audio, Speech, and Music Processing , vol.2013 , Issue.1
- Heittola, T.¹ Mesaros, A.² Eronen, A.³ Virtanen, T.⁴

13
- 84863737592
- Latent semantic analysis in sound event detection
- A. Mesaros, T. Heittola, and A. Klapuri, "Latent semantic analysis in sound event detection," in Proc EUSIPCO, 2011, pp. 1307-1311.
- (2011) Proc EUSIPCO , pp. 1307-1311
- Mesaros, A.¹ Heittola, T.² Klapuri, A.³

14
- 83455255740
- Spectral vs spectro-temporal features for acoustic event detection
- C. V. Cotton and D. P. W. Ellis, "Spectral vs. spectro-temporal features for acoustic event detection," in Proc WASPAA, 2011, pp. 69-72.
- (2011) Proc WASPAA , pp. 69-72
- Cotton, C.V.¹ Ellis, D.P.W.²

15
- 11144316019
- Decoding speech in the presence of other sources
- J. P. Barker, M. P. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Communication, vol. 45, no. 1, pp. 5-25, 2005.
- (2005) Speech Communication , vol.45 , Issue.1 , pp. 5-25
- Barker, J.P.¹ Cooke, M.P.² Ellis, D.P.W.³

16
- 84918783217
- Acoustic classification of multiple simultaneous bird species: A multi-instance multilabel approach
- F. Briggs, B. Lakshminarayanan, et al., "Acoustic classification of multiple simultaneous bird species: A multi-instance multilabel approach," Journal of the Acoustical Society of America, vol. 131, pp. 4640-4650, 2012.
- (2012) Journal of the Acoustical Society of America , vol.131 , pp. 4640-4650
- Briggs, F.¹ Lakshminarayanan, B.²

17
- 84890468360
- Recognition of harmonic sounds in polyphonic audio using a missing feature approach
- D. Giannoulis, A. Klapuri, and M. D. Plumbley, "Recognition of harmonic sounds in polyphonic audio using a missing feature approach," in Proc ICASSP (to appear), 2013.
- (2013) Proc ICASSP (To Appear)
- Giannoulis, D.¹ Klapuri, A.² Plumbley, M.D.³

18
- 80052281439
- Automatic extraction of pornographic contents using radon transform based audio features
- M. J. Kim and H. Kim, "Automatic extraction of pornographic contents using radon transform based audio features," in CBMI, 2011, pp. 205-210.
- (2011) CBMI , pp. 205-210
- Kim, M.J.¹ Kim, H.²

19
- 38049176869
- CLEAR evaluation of acoustic event detection and classification systems
- Southampton, UK
- A. Temko, R. Malkin, C. Zieger, D. Macho, C. Nadeu, and M. Omologo, "CLEAR evaluation of acoustic event detection and classification systems," in Proc CLEAR, Southampton, UK, 2007, pp. 311-322.
- (2007) Proc CLEAR , pp. 311-322
- Temko, A.¹ Malkin, R.² Zieger, C.³ Macho, D.⁴ Nadeu, C.⁵ Omologo, M.⁶

20
- 57649180845
- Content-based retrieval of music and audio
- J. Foote, "Content-based retrieval of music and audio," in Proc SPIE, 1997, vol. 3229, pp. 138-147.
- (1997) Proc SPIE , vol.3229 , pp. 138-147
- Foote, J.¹

21
- 70349203078
- On the robustness of audio features for musical instrument classification
- S. Wegener, M. Haller, J. J. Burred, T. Sikora, S. Essid, and G. Richard, "On the robustness of audio features for musical instrument classification," in Proc EUSIPCO, 2008.
- (2008) Proc EUSIPCO
- Wegener, S.¹ Haller, M.² Burred, J.J.³ Sikora, T.⁴ Essid, S.⁵ Richard, G.⁶

22
- 33745000971
- Improving timbre similarity: How high's the sky?
- J.-J. Aucouturier and F. Pachet, "Improving timbre similarity: how high's the sky?," Journal of Negative Results in Speech and Audio Sciences, vol. 1, no. 1, pp. 1-13, 2004.
- (2004) Journal of Negative Results in Speech and Audio Sciences , vol.1 , Issue.1 , pp. 1-13
- Aucouturier, J.-J.¹ Pachet, F.²

23
- 33847655586
- A generalized divergence measure for nonnegative matrix factorization
- R. Kompass, "A generalized divergence measure for nonnegative matrix factorization," Neural Computation, vol. 19, no. 3, pp. 780-791, 2007.
- (2007) Neural Computation , vol.19 , Issue.3 , pp. 780-791
- Kompass, R.¹

24
- 80053103566
- Constant-Q transform toolbox for music processing
- Barcelona, Spain, July
- C. Schörkhuber and A. Klapuri, "Constant-Q transform toolbox for music processing," in Proc SMC, Barcelona, Spain, July 2010, pp. 3-64.
- (2010) Proc SMC , pp. 3-64
- Schörkhuber, C.¹ Klapuri, A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.