-
1
-
-
15044354466
-
Automatic analysis of multimodal group actions in meetings
-
Mar.
-
Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Guillaume Lathoud, Mark Barnard, and Dong Zhang, "Automatic analysis of multimodal group actions in meetings.," IEEE transactions on pattern analysis and machine intelligence, vol. 27, no. 3, pp. 305-17, Mar. 2005.
-
(2005)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.27
, Issue.3
, pp. 305-317
-
-
McCowan, I.1
Gatica-Perez, D.2
Bengio, S.3
Lathoud, G.4
Barnard, M.5
Zhang, D.6
-
2
-
-
80051642116
-
Multiple speaker tracking using a microphone array by combining auditory processing and a gaussian mixture cardinalized probability hypothesis density filter
-
Prague, Czech Republic
-
Axel Plinge, Daniel Hauschildt, Marius H Hennecke, and Gernot A Fink, "Multiple Speaker Tracking using a Microphone Array by Combining Auditory Processing and a Gaussian Mixture Cardinalized Probability Hypothesis Density Filter," in IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, 2011, pp. 2476-2479.
-
(2011)
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing
, pp. 2476-2479
-
-
Plinge, A.1
Hauschildt, D.2
Hennecke, M.H.3
Fink, G.A.4
-
3
-
-
38049176869
-
CLEAR evaluation of acoustic event detection and classification systems
-
Rainer Stiefelhagen and John Garofolo, Eds. vol. 4122 of Lecture Notes in Computer Science. Springer Berlin Heidelberg
-
Andrey Temko, Robert Malkin, Christian Zieger, Dúsan Macho, Climent Nadeu, and Maurizio Omologo, "CLEAR Evaluation of Acoustic Event Detection and Classification Systems," in Multimodal Technologies for Perception of Humans, Rainer Stiefelhagen and John Garofolo, Eds., vol. 4122 of Lecture Notes in Computer Science, pp. 311-322. Springer Berlin Heidelberg, 2007.
-
(2007)
Multimodal Technologies for Perception of Humans
, pp. 311-322
-
-
Temko, A.1
Malkin, R.2
Zieger, C.3
Macho, D.4
Nadeu, C.5
Omologo, M.6
-
4
-
-
79959754926
-
Acoustic event detection in reallife recordings
-
Aalborg, Denmark
-
Annamaria Mesaros, Toni Heittola, Antti Eronen, and Tuomas Virtanen, "Acoustic Event Detection in Reallife Recordings," in European Signal Processing Conference, Aalborg, Denmark, 2010, pp. 1267-1271.
-
(2010)
European Signal Processing Conference
, pp. 1267-1271
-
-
Mesaros, A.1
Heittola, T.2
Eronen, A.3
Virtanen, T.4
-
5
-
-
84893585319
-
A database and challenge for acoustic scene classification and event detection
-
Marrakech, Morocco
-
Dimitrios Giannoulis, Dan Stowell, Emmanouil Benetos, Mathias Rossignol, and Mathieu Lagrange, "A Database and Challenge for Acoustic Scene Classification and Event Detection," in European Signal Processing Conference, Marrakech, Morocco, 2013.
-
(2013)
European Signal Processing Conference
-
-
Giannoulis, D.1
Stowell, D.2
Benetos, E.3
Rossignol, M.4
Lagrange, M.5
-
6
-
-
84859204374
-
Partially supervised speaker clustering
-
Hao Tang, Stephen M Chu, Mark Hasegawa-Johnson, and Thomas S Huang, "Partially supervised speaker clustering," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 34, no. 5, pp. 959-971, 2012.
-
(2012)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.34
, Issue.5
, pp. 959-971
-
-
Tang, H.1
Chu, S.M.2
Hasegawa-Johnson, M.3
Huang, T.S.4
-
7
-
-
34547645414
-
The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music
-
Jean-Julien Aucouturier, Boris Defreville, and Francois Pachet, "The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music," The Journal of the Acoustical Society of America, vol. 122, no. 2, pp. 881-891, 2007.
-
(2007)
The Journal of the Acoustical Society of America
, vol.122
, Issue.2
, pp. 881-891
-
-
Aucouturier, J.1
Defreville, B.2
Pachet, F.3
-
10
-
-
84889563061
-
Bag-of-Features HMMs for segmentation-free word spotting in handwritten documents
-
Washington DC, USA
-
Leonard Rothacker, Marcal Rusinol, and Gernot A. Fink, "Bag-of-Features HMMs for Segmentation-Free Word Spotting in Handwritten Documents," in Proc. Int. Conf. on Document Analysis and Recognition,Washington DC, USA, 2013.
-
(2013)
Proc. Int. Conf. on Document Analysis and Recognition
-
-
Rothacker, L.1
Rusinol, M.2
Fink, G.A.3
-
11
-
-
84898420173
-
The devil is in the details: An evaluation of recent feature encoding methods
-
Ken Chatfield, Victor Lempitsky, Andrea Vedaldi, and Andrew Zisserman, "The devil is in the details: An evaluation of recent feature encoding methods," in British Machine Vision Conference, 2011.
-
(2011)
British Machine Vision Conference
-
-
Chatfield, K.1
Lempitsky, V.2
Vedaldi, A.3
Zisserman, A.4
-
12
-
-
84897817926
-
Bag-of-features representations using spatial visual vocabularies for object classification
-
Melbourne, Australia
-
René Grzeszick, Leonard Rothacker, and Gernot A. Fink, "Bag-of-features representations using spatial visual vocabularies for object classification," in IEEE Intl. Conf. on Image Processing, Melbourne, Australia, 2013.
-
(2013)
IEEE Intl. Conf. on Image Processing
-
-
Grzeszick, R.1
Rothacker, L.2
Fink, G.A.3
-
13
-
-
84878606595
-
Bag-of-audio-words approach for multimedia event classification
-
Portland, OR, USA
-
Stephanie Pancoast and Murat Akbacak, "Bag-of-Audio-Words Approach for Multimedia Event Classification," in Interspeech, Portland, OR, USA, 2012.
-
(2012)
Interspeech
-
-
Pancoast, S.1
Akbacak, M.2
-
14
-
-
82255178542
-
-
IEEE Press
-
DeLiangWang and Guy J. Brown, Eds., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, IEEE Press, 2006.
-
(2006)
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
-
-
Wang, D.1
Brown, G.J.2
-
15
-
-
34547499683
-
Incorporating auditory feature uncertainties in robust speaker identification
-
Yang Shao, Soundararajan Srinivasan, and DeLiang Wang, "Incorporating auditory feature uncertainties in robust speaker identification," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2007, pp. 277-280.
-
(2007)
IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 277-280
-
-
Shao, Y.1
Srinivasan, S.2
Wang, D.3
-
16
-
-
0032633659
-
A method of signal extraction from noisy signal based on auditory scene analysis
-
Masashi Unoki and Masato Akagi, "A Method of Signal Extraction from Noisy Signal based on Auditory Scene Analysis," Speech Communication, vol. 27, no. 3, pp. 261-279, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 261-279
-
-
Unoki, M.1
Akagi, M.2
-
18
-
-
67349266616
-
Supervised learning of quantizer codebooks by information loss minimization
-
Svetlana Lazebnik and Maxim Raginsky, "Supervised learning of quantizer codebooks by information loss minimization," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 31, no. 7, pp. 1294-1309, 2009.
-
(2009)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.31
, Issue.7
, pp. 1294-1309
-
-
Lazebnik, S.1
Raginsky, M.2
-
19
-
-
33845572523
-
Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
-
Svetlana Lazebnik, Cordelia Schmid, and Jean Ponce, "Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories," in Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on. IEEE, 2006, vol. 2, pp. 2169-2178.
-
(2006)
Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference On. IEEE
, vol.2
, pp. 2169-2178
-
-
Lazebnik, S.1
Schmid, C.2
Ponce, J.3
-
20
-
-
0033592606
-
Learning the parts of objects by non-negative matrix factorization
-
Oct.
-
Daniel D. Lee and H. Sebastian Seung, "Learning the Parts of Objects by Non-negative Matrix Factorization.," Nature, vol. 401, no. 6755, pp. 788-91, Oct. 1999.
-
(1999)
Nature
, vol.401
, Issue.6755
, pp. 788-791
-
-
Lee, D.D.1
Sebastian Seung, H.2
|