-
1
-
-
85071069033
-
Segmentation and classification of broadcast news audio
-
T. Hain and P. C. Woodland, "Segmentation and classification of broadcast news audio," in Proc. of ICSLP, 1998, pp. 2727-2730.
-
(1998)
Proc. of ICSLP
, pp. 2727-2730
-
-
Hain, T.1
Woodland, P.C.2
-
2
-
-
84865783453
-
Hierarchical audio segmentation with hmm and factor analysis in broadcast news domain
-
D. Castán, C. Vaquero, A. Ortega, D. Martínez, J. Villalba, and E. Lleida, "Hierarchical audio segmentation with hmm and factor analysis in broadcast news domain," in Proc. of Interspeech, 2011, pp. 421-424.
-
(2011)
Proc. of Interspeech
, pp. 421-424
-
-
Castán, D.1
Vaquero, C.2
Ortega, A.3
Martínez, D.4
Villalba, J.5
Lleida, E.6
-
3
-
-
84865772202
-
Online speech activity detection in broadcast news
-
C. Gao, G. Saikumar, S. Khanwalkar, A. Herscovici, A. Kumar, A. Srivastava, and P. Natarajan, "Online speech activity detection in broadcast news," in Proc. of Interspeech, 2011, pp. 2637-2640.
-
(2011)
Proc. of Interspeech
, pp. 2637-2640
-
-
Gao, C.1
Saikumar, G.2
Khanwalkar, S.3
Herscovici, A.4
Kumar, A.5
Srivastava, A.6
Natarajan, P.7
-
4
-
-
84867208777
-
Study of integration of statistical model-based voice activity detection and noise suppression
-
M. Fujimoto, K. Ishizuka, and T. Nakatani, "Study of integration of statistical model-based voice activity detection and noise suppression," in Proc. of Interspeech, 2008, pp. 2008-2011.
-
(2008)
Proc. of Interspeech
, pp. 2008-2011
-
-
Fujimoto, M.1
Ishizuka, K.2
Nakatani, T.3
-
5
-
-
84867225559
-
DySANA: Dynamic speech and noise adaptation for voice activity detection
-
R. J. Weiss and T. Kristjansson, "DySANA: Dynamic speech and noise adaptation for voice activity detection," in Proc. of Interspeech, 2008, pp. 127-130.
-
(2008)
Proc. of Interspeech
, pp. 127-130
-
-
Weiss, R.J.1
Kristjansson, T.2
-
6
-
-
33745218538
-
Voicing features for robust speech detection
-
T. Kristjansson, S. Deligne, and P. Olsen, "Voicing features for robust speech detection," in Proc. of Interspeech, 2005, pp. 369-372.
-
(2005)
Proc. of Interspeech
, pp. 369-372
-
-
Kristjansson, T.1
Deligne, S.2
Olsen, P.3
-
7
-
-
84865706680
-
On noise robust voice activity detection
-
T. Dekens and W. Verhelst, "On noise robust voice activity detection," in Proc. of Interspeech, 2011, pp. 2649-2652.
-
(2011)
Proc. of Interspeech
, pp. 2649-2652
-
-
Dekens, T.1
Verhelst, W.2
-
8
-
-
79959838316
-
Voice activity detection based on conditional random fields using multiple features
-
A. Saito, Y. Nankaku, A. Lee, and K. Tokuda, "Voice activity detection based on conditional random fields using multiple features," in Proc. of Interspeech, 2010, pp. 2086-2089.
-
(2010)
Proc. of Interspeech
, pp. 2086-2089
-
-
Saito, A.1
Nankaku, Y.2
Lee, A.3
Tokuda, K.4
-
9
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. of ICASSP, 1997, pp. 1331-1334.
-
(1997)
Proc. of ICASSP
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
10
-
-
0032638667
-
A comparison of features for speech, music discrimination
-
M. J. Carey, E. S. Parris, and H. Lloyd-Thomas, "A comparison of features for speech, music discrimination," in Proc. of ICASSP, 1999, pp. 149-152.
-
(1999)
Proc. of ICASSP
, pp. 149-152
-
-
Carey, M.J.1
Parris, E.S.2
Lloyd-Thomas, H.3
-
11
-
-
0036648502
-
Musical genre classification of audio signals
-
July
-
G. Tzanetakis and P. Cook, "Musical genre classification of audio signals," IEEE Trans. on Speech and Audio Proc., vol. 10, no. 5, pp. 293-302, July 2002.
-
(2002)
IEEE Trans. on Speech and Audio Proc.
, vol.10
, Issue.5
, pp. 293-302
-
-
Tzanetakis, G.1
Cook, P.2
-
12
-
-
0036816475
-
Content analysis for audio classification and segmentation
-
October
-
L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. on Speech and Audio Proc., vol. 10, no. 7, pp. 504-516, October 2002.
-
(2002)
IEEE Trans. on Speech and Audio Proc.
, vol.10
, Issue.7
, pp. 504-516
-
-
Lu, L.1
Zhang, H.-J.2
Jiang, H.3
-
13
-
-
80051623447
-
Speaker diarization of heterogeneous web video files: A preliminary study
-
P. Clement, T. Bazillon, and C. Fredouille, "Speaker diarization of heterogeneous web video files: A preliminary study," in Proc. of ICASSP, 2011, pp. 4432-4435.
-
(2011)
Proc. of ICASSP
, pp. 4432-4435
-
-
Clement, P.1
Bazillon, T.2
Fredouille, C.3
-
14
-
-
80052652249
-
Efficient large-scale distributed training of conditional maximum entropy models
-
G. Mann, R. McDonald, M. Mohri, N. Silberman, and D. Walker, "Efficient large-scale distributed training of conditional maximum entropy models," In Advances in Neural Information Processing Systems, vol. 22, pp. 1231-1239, 2009.
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1231-1239
-
-
Mann, G.1
McDonald, R.2
Mohri, M.3
Silberman, N.4
Walker, D.5
-
15
-
-
33645690579
-
Fast binary feature selection with conditional mutual information
-
December
-
F. Fleuret, "Fast binary feature selection with conditional mutual information," Journal of Machine Learning Research, vol. 5, pp. 1531-1535, December 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 1531-1535
-
-
Fleuret, F.1
-
16
-
-
0032021555
-
On combining classifiers
-
March
-
J. Kittler, M. Hatef, R. P. Duin, and J. Matas, "On combining classifiers," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 20, no. 3, pp. 226-239, March 1998.
-
(1998)
IEEE Trans. on Pattern Analysis and Machine Intelligence
, vol.20
, Issue.3
, pp. 226-239
-
-
Kittler, J.1
Hatef, M.2
Duin, R.P.3
Matas, J.4
|