-
1
-
-
0036816475
-
Content analysis for audio classification and segmentation
-
L. Lu, H. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation", IEEE Trans. on Speech and Audio Processing, vol. 10, pp. 504-516, 2002.
-
(2002)
IEEE Trans. on Speech and Audio Processing
, vol.10
, pp. 504-516
-
-
Lu, L.1
Zhang, H.2
Jiang, H.3
-
2
-
-
85009212099
-
Environmental sound source identification based on hidden Markov models for robust speech recognition
-
T. Nishiura, S. Nakamura, K. Miki, and K. Shikano, "Environmental sound source identification based on hidden Markov models for robust speech recognition", in Proc. Eurospeech, pp. 2157-2160, 2003.
-
(2003)
Proc. Eurospeech
, pp. 2157-2160
-
-
Nishiura, T.1
Nakamura, S.2
Miki, K.3
Shikano, K.4
-
3
-
-
70449590704
-
Acoustic event detection and classification
-
PhD thesis, Technical University of Catalonia
-
A. Temko. "Acoustic event detection and classification", PhD thesis, Technical University of Catalonia, 2007.
-
(2007)
-
-
Temko, A.1
-
4
-
-
84867206190
-
-
CHIL, Computers in the Human Interaction Loop, EU project
-
CHIL - Computers in the Human Interaction Loop - EU project. http://chil.server.de, 2004-2007.
-
(2004)
-
-
-
5
-
-
38049176869
-
CLEAR Evaluation of Acoustic Event Detection and Classification systems
-
Multimodal Technologies for Perception of Humans, Springer
-
A. Temko, R. Malkin, C. Zieger, D. Macho, C. Nadeu, M. Omologo, "CLEAR Evaluation of Acoustic Event Detection and Classification systems", in Multimodal Technologies for Perception of Humans, LNCS, vol. 4122, Springer, 2007.
-
(2007)
LNCS
, vol.4122
-
-
Temko, A.1
Malkin, R.2
Zieger, C.3
Macho, D.4
Nadeu, C.5
Omologo, M.6
-
6
-
-
84867199582
-
Fusion of Audio and Video Modalities for Detection of Acoustic Events
-
T. Butko, A. Temko, C. Nadeu and C. Canton, "Fusion of Audio and Video Modalities for Detection of Acoustic Events", in Proc. Interspeech, pp. 123-126, 2008.
-
(2008)
Proc. Interspeech
, pp. 123-126
-
-
Butko, T.1
Temko, A.2
Nadeu, C.3
Canton, C.4
-
7
-
-
70449558459
-
Audiovisual Event Detection Towards Scene Understanding
-
C. Canton-Ferrer, T. Butko, C. Segura, X. Giró, C. Nadeu, J. Hernando, J.R. Casas, "Audiovisual Event Detection Towards Scene Understanding", in Proc. IEEE Int. Conference on Computer Vision and Pattern Recognition, 2009.
-
(2009)
Proc. IEEE Int. Conference on Computer Vision and Pattern Recognition
-
-
Canton-Ferrer, C.1
Butko, T.2
Segura, C.3
Giró, X.4
Nadeu, C.5
Hernando, J.6
Casas, J.R.7
-
8
-
-
85135144525
-
On the decorrelation of filter-bank energies in speech recognition
-
C. Nadeu, J. Hernando, and M. Gorricho, "On the decorrelation of filter-bank energies in speech recognition", in Proc. European Speech Processing Conference, pp. 1381-1384, 1995.
-
(1995)
Proc. European Speech Processing Conference
, pp. 1381-1384
-
-
Nadeu, C.1
Hernando, J.2
Gorricho, M.3
-
9
-
-
56749117943
-
In defense of One-Vs-All Classification
-
R. Rifkin, A. Klautau, "In defense of One-Vs-All Classification", Journal of Machine learning Research, vol. 5, pp.101-141, 2004.
-
(2004)
Journal of Machine learning Research
, vol.5
, pp. 101-141
-
-
Rifkin, R.1
Klautau, A.2
-
10
-
-
70450188181
-
-
J. DiBiase, H. Silverman, and M. Brandstein, Microphone Arrays: Techniques and Applications, M. S. Brandstein and D. B. Ward, Eds, pp. 157-180, Springer-Verlag, 2001.
-
J. DiBiase, H. Silverman, and M. Brandstein, " Microphone Arrays: Techniques and Applications", M. S. Brandstein and D. B. Ward, Eds, pp. 157-180, Springer-Verlag, 2001.
-
-
-
-
11
-
-
69949162388
-
Particle filtering and sparse sampling for multi-person 3D tracking
-
C. Canton-Ferrer, R. Sblendido, J. R. Casas, and M. Pardas, "Particle filtering and sparse sampling for multi-person 3D tracking", in Proc. IEEE Int. Conf. on Image Processing, pp. 2644-2647, 2008.
-
(2008)
Proc. IEEE Int. Conf. on Image Processing
, pp. 2644-2647
-
-
Canton-Ferrer, C.1
Sblendido, R.2
Casas, J.R.3
Pardas, M.4
-
14
-
-
84925639646
-
Real-time lip tracking and bi-modal continuous speech recognition
-
M. T. Chan, Y. Zhang, and T. S. Huang, "Real-time lip tracking and bi-modal continuous speech recognition", in Proc. IEEE Workshop on Multimedia Signal Processing, pp. 65-70, 1998.
-
(1998)
Proc. IEEE Workshop on Multimedia Signal Processing
, pp. 65-70
-
-
Chan, M.T.1
Zhang, Y.2
Huang, T.S.3
|