-
1
-
-
33646380923
-
-
Philadelphia, PA, March
-
D. Reynolds and P. Torres-Carrasquillo, "Approaches and applications of audio diarization," Philadelphia, PA, March 2005, pp. 953-956.
-
(2005)
Approaches and applications of audio diarization
, pp. 953-956
-
-
Reynolds, D.1
Torres-Carrasquillo, P.2
-
2
-
-
47749152568
-
-
J. G. Fiscus, J. Ajot, and J. S. Garofolo, The rich transcription 2007 meeting recognition evaluation, in Multimodal Technologies for Perception of Humans, ser. Lecture Notes in Computer Science. Berlin: Springer Verlag, 2008.
-
J. G. Fiscus, J. Ajot, and J. S. Garofolo, "The rich transcription 2007 meeting recognition evaluation," in Multimodal Technologies for Perception of Humans, ser. Lecture Notes in Computer Science. Berlin: Springer Verlag, 2008.
-
-
-
-
3
-
-
0030638031
-
A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (rover)
-
Santa Barbara, CA
-
J. G. Fiscus, "A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (rover)," in proceedings 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, 1997, pp. 347-352.
-
(1997)
proceedings 1997 IEEE Workshop on Automatic Speech Recognition and Understanding
, pp. 347-352
-
-
Fiscus, J.G.1
-
4
-
-
85009289298
-
Unknown- multiple speaker clustering using HMM
-
Denver, Colorado, USA
-
J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan, "Unknown- multiple speaker clustering using HMM," in proceedings of the International Conference on Spoken Language Processing (ICSLP), Denver, Colorado, USA, 2002.
-
(2002)
proceedings of the International Conference on Spoken Language Processing (ICSLP)
-
-
Ajmera, J.1
Bourlard, H.2
Lapidot, I.3
McCowan, I.4
-
5
-
-
47749147613
-
Filtering the unknown: Speech activity detection in heterogeneous video collections
-
Antwerp, Belgium, August
-
M. Huijbregts, C. Wooters, and R. Ordelman, "Filtering the unknown: Speech activity detection in heterogeneous video collections," in proceedings of Interspeech, Antwerp, Belgium, August 2007.
-
(2007)
proceedings of Interspeech
-
-
Huijbregts, M.1
Wooters, C.2
Ordelman, R.3
-
6
-
-
77249176190
-
-
D. van Leeuwen and M. Huijbregts, The AMI speaker diarization system for NIST RT06s meeting data, in Machine Learning for Multimodal Interaction (MLMI), ser. Lecture Notes in Computer Science, 4299. Berlin: Springer Verlag, October 2007, pp. 371-384.
-
D. van Leeuwen and M. Huijbregts, "The AMI speaker diarization system for NIST RT06s meeting data," in Machine Learning for Multimodal Interaction (MLMI), ser. Lecture Notes in Computer Science, vol. 4299. Berlin: Springer Verlag, October 2007, pp. 371-384.
-
-
-
-
7
-
-
85009231870
-
Qualcomm-icsi-ogi features for asr
-
A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan, and S. Sivadas, "Qualcomm-icsi-ogi features for asr," in proceedings of ICSLP, 2002.
-
(2002)
proceedings of ICSLP
-
-
Adami, A.1
Burget, L.2
Dupont, S.3
Garudadri, H.4
Grezl, F.5
Hermansky, H.6
Jain, P.7
Kajarekar, S.8
Morgan, N.9
Sivadas, S.10
-
8
-
-
44849123928
-
Robust speaker diarization for meetings,
-
Ph.D. dissertation, Universitat Politecnica De Catalunya
-
X. Anguera, "Robust speaker diarization for meetings," Ph.D. dissertation, Universitat Politecnica De Catalunya, 2006.
-
(2006)
-
-
Anguera, X.1
-
9
-
-
34548351229
-
-
J. M. Pardo1, X. Anguera, and C. Wooters, Speaker diarization for multiple distant microphone meetings: Mixing acoustic features and inter-channel time differences, in proceedings of Interspeech, 2006.
-
J. M. Pardo1, X. Anguera, and C. Wooters, "Speaker diarization for multiple distant microphone meetings: Mixing acoustic features and inter-channel time differences," in proceedings of Interspeech, 2006.
-
-
-
-
10
-
-
47749119617
-
The ICSI RT07s speaker diarization system
-
Multimodal Technologies for Perception of Humans, Berlin: Springer Verlag
-
C. Wooters and M. Huijbregts, "The ICSI RT07s speaker diarization system," in Multimodal Technologies for Perception of Humans, ser. Lecture Notes in Computer Science. Berlin: Springer Verlag, 2008.
-
(2008)
ser. Lecture Notes in Computer Science
-
-
Wooters, C.1
Huijbregts, M.2
-
11
-
-
4544361649
-
The elisa consortium approaches in broadcast news speaker segmentation during the nist 2003 rich transcription evaluation
-
D. Moraru, S. Meignier, C. Fredouille, L. Besacier, and L. Bonastre, "The elisa consortium approaches in broadcast news speaker segmentation during the nist 2003 rich transcription evaluation," in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2004.
-
(2004)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Moraru, D.1
Meignier, S.2
Fredouille, C.3
Besacier, L.4
Bonastre, L.5
|