-
2
-
-
35348847678
-
The CLEAR 2006 evaluation
-
R. Stiefelhagen, K. Bernardin, R. Bowers, J. Garofolo, D. Mostefa, and P. Soundararajan, "The CLEAR 2006 evaluation," Multimodal Technologies for Perception of Humans, pp. 1-44, 2007.
-
(2007)
Multimodal Technologies for Perception of Humans
, pp. 1-44
-
-
Stiefelhagen, R.1
Bernardin, K.2
Bowers, R.3
Garofolo, J.4
Mostefa, D.5
Soundararajan, P.6
-
3
-
-
84905274625
-
Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics
-
P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, W. Kraaij, A. F. Smeaton, and G. Quéenot, "Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics," in Proc TRECVID, 2012.
-
(2012)
Proc TRECVID
-
-
Over, P.1
Awad, G.2
Michel, M.3
Fiscus, J.4
Sanders, G.5
Shaw, B.6
Kraaij, W.7
Smeaton, A.F.8
Quéenot, G.9
-
4
-
-
84901299901
-
The Albayzin 2010 language recognition evaluation
-
L. J. Rodriguez-Fuentes, M. Penagarikano, A. Varona, M. Diez, and G. Bordel, "The Albayzin 2010 language recognition evaluation," in Proc InterSpeech, 2011, pp. 28-31.
-
(2011)
Proc InterSpeech
, pp. 28-31
-
-
Rodriguez-Fuentes, L.J.1
Penagarikano, M.2
Varona, A.3
Diez, M.4
Bordel, G.5
-
5
-
-
84893548504
-
Detection and classification of acoustic scenes and events, an IEEE AASP challenge
-
Queen Mary University of London
-
D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M. Lagrange, and M. P. Plumbley, "Detection and classification of acoustic scenes and events, an IEEE AASP challenge," Tech. Rep. EECSRR-13-01, Queen Mary University of London, 2013.
-
(2013)
Tech. Rep. EECSRR-13-01
-
-
Giannoulis, D.1
Benetos, E.2
Stowell, D.3
Rossignol, M.4
Lagrange, M.5
Plumbley, M.P.6
-
6
-
-
34547645414
-
The bagof-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
-
J.-J. Aucouturier, B. Defreville, and F. Pachet, "The bagof-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music," Journal of the Acoustical Society of America, vol. 122, pp. 881, 2007.
-
(2007)
Journal of the Acoustical Society of America
, vol.122
, pp. 881
-
-
Aucouturier, J.-J.1
Defreville, B.2
Pachet, F.3
-
8
-
-
84901321713
-
Characterization of acoustic scenes using a temporally-constrained shift-invariant model
-
York, UK
-
E. Benetos, M. Lagrange, and S. Dixon, "Characterization of acoustic scenes using a temporally-constrained shift-invariant model," in Proc DAFX, York, UK, 2012.
-
(2012)
Proc DAFX
-
-
Benetos, E.1
Lagrange, M.2
Dixon, S.3
-
9
-
-
68149163531
-
Environmental sound recognition with time-frequency audio features
-
S. Chu, S. Narayanan, and C.-C. Jay Kuo, "Environmental sound recognition with time-frequency audio features," IEEE Trans Audio, Speech and Language Processing, vol. 17, no. 6, pp. 1142-1158, 2009.
-
(2009)
IEEE Trans Audio, Speech and Language Processing
, vol.17
, Issue.6
, pp. 1142-1158
-
-
Chu, S.1
Narayanan, S.2
Jay Kuo, C.-C.3
-
10
-
-
84890493220
-
Acoustic event detection in real life recordings
-
A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen, "Acoustic event detection in real life recordings," in Proc EUSIPCO, 2010.
-
(2010)
Proc EUSIPCO
-
-
Mesaros, A.1
Heittola, T.2
Eronen, A.3
Virtanen, T.4
-
11
-
-
84876152720
-
Sound event detection in multisource environments using source separation
-
2011
-
T. Heittola, A. Mesaros, T. Virtanen, and A. Eronen, "Sound event detection in multisource environments using source separation," in Proc CHiME, 2011, pp. 36-40.
-
Proc CHiME
, pp. 36-40
-
-
Heittola, T.1
Mesaros, A.2
Virtanen, T.3
Eronen, A.4
-
12
-
-
84887056523
-
Contextdependent sound event detection
-
T. Heittola, A. Mesaros, A. Eronen, and T. Virtanen, " Contextdependent sound event detection," EURASIP Journal on Audio, Speech, and Music Processing, vol. 2013, no. 1, 2013.
-
(2013)
EURASIP Journal on Audio, Speech, and Music Processing
, vol.2013
, Issue.1
-
-
Heittola, T.1
Mesaros, A.2
Eronen, A.3
Virtanen, T.4
-
13
-
-
84863737592
-
Latent semantic analysis in sound event detection
-
2011
-
A. Mesaros, T. Heittola, and A. Klapuri, "Latent semantic analysis in sound event detection," in Proc EUSIPCO, 2011, pp. 1307-1311.
-
Proc EUSIPCO
, pp. 1307-1311
-
-
Mesaros, A.1
Heittola, T.2
Klapuri, A.3
-
14
-
-
83455255740
-
Spectral vs spectro-temporal features for acoustic event detection
-
C. V. Cotton and D. P. W. Ellis, "Spectral vs. spectro-temporal features for acoustic event detection," in Proc WASPAA, 2011, pp. 69-72.
-
(2011)
Proc WASPAA
, pp. 69-72
-
-
Cotton, C.V.1
Ellis, D.P.W.2
-
15
-
-
11144316019
-
Decoding speech in the presence of other sources
-
J. P. Barker, M. P. Cooke, and D. P. W. Ellis, "Decoding speech in the presence of other sources," Speech Communication, vol. 45, no. 1, pp. 5-25, 2005.
-
(2005)
Speech Communication
, vol.45
, Issue.1
, pp. 5-25
-
-
Barker, J.P.1
Cooke, M.P.2
Ellis, D.P.W.3
-
16
-
-
84918783217
-
Acoustic classification of multiple simultaneous bird species: A multi-instance multilabel approach
-
F. Briggs, B. Lakshminarayanan, et al., "Acoustic classification of multiple simultaneous bird species: A multi-instance multilabel approach," Journal of the Acoustical Society of America, vol. 131, pp. 4640-4650, 2012.
-
(2012)
Journal of the Acoustical Society of America
, vol.131
, pp. 4640-4650
-
-
Briggs, F.1
Lakshminarayanan, B.2
-
17
-
-
84890468360
-
Recognition of harmonic sounds in polyphonic audio using a missing feature approach
-
D. Giannoulis, A. Klapuri, and M. D. Plumbley, "Recognition of harmonic sounds in polyphonic audio using a missing feature approach," in Proc ICASSP (to appear), 2013.
-
(2013)
Proc ICASSP (To Appear)
-
-
Giannoulis, D.1
Klapuri, A.2
Plumbley, M.D.3
-
18
-
-
80052281439
-
Automatic extraction of pornographic contents using radon transform based audio features
-
M. J. Kim and H. Kim, "Automatic extraction of pornographic contents using radon transform based audio features," in CBMI, 2011, pp. 205-210.
-
(2011)
CBMI
, pp. 205-210
-
-
Kim, M.J.1
Kim, H.2
-
19
-
-
38049176869
-
CLEAR evaluation of acoustic event detection and classification systems
-
Southampton, UK
-
A. Temko, R. Malkin, C. Zieger, D. Macho, C. Nadeu, and M. Omologo, "CLEAR evaluation of acoustic event detection and classification systems," in Proc CLEAR, Southampton, UK, 2007, pp. 311-322.
-
(2007)
Proc CLEAR
, pp. 311-322
-
-
Temko, A.1
Malkin, R.2
Zieger, C.3
Macho, D.4
Nadeu, C.5
Omologo, M.6
-
20
-
-
57649180845
-
Content-based retrieval of music and audio
-
J. Foote, "Content-based retrieval of music and audio," in Proc SPIE, 1997, vol. 3229, pp. 138-147.
-
(1997)
Proc SPIE
, vol.3229
, pp. 138-147
-
-
Foote, J.1
-
21
-
-
70349203078
-
On the robustness of audio features for musical instrument classification
-
S. Wegener, M. Haller, J. J. Burred, T. Sikora, S. Essid, and G. Richard, "On the robustness of audio features for musical instrument classification," in Proc EUSIPCO, 2008.
-
(2008)
Proc EUSIPCO
-
-
Wegener, S.1
Haller, M.2
Burred, J.J.3
Sikora, T.4
Essid, S.5
Richard, G.6
-
22
-
-
33745000971
-
Improving timbre similarity: How high's the sky?
-
J.-J. Aucouturier and F. Pachet, "Improving timbre similarity: how high's the sky?," Journal of Negative Results in Speech and Audio Sciences, vol. 1, no. 1, pp. 1-13, 2004.
-
(2004)
Journal of Negative Results in Speech and Audio Sciences
, vol.1
, Issue.1
, pp. 1-13
-
-
Aucouturier, J.-J.1
Pachet, F.2
-
23
-
-
33847655586
-
A generalized divergence measure for nonnegative matrix factorization
-
R. Kompass, "A generalized divergence measure for nonnegative matrix factorization," Neural Computation, vol. 19, no. 3, pp. 780-791, 2007.
-
(2007)
Neural Computation
, vol.19
, Issue.3
, pp. 780-791
-
-
Kompass, R.1
-
24
-
-
80053103566
-
Constant-Q transform toolbox for music processing
-
Barcelona, Spain, July
-
C. Schörkhuber and A. Klapuri, "Constant-Q transform toolbox for music processing," in Proc SMC, Barcelona, Spain, July 2010, pp. 3-64.
-
(2010)
Proc SMC
, pp. 3-64
-
-
Schörkhuber, C.1
Klapuri, A.2
|