-
1
-
-
0002595416
-
Speaker, environment and channel change detection and clustering via the bayesian information criterion
-
S. Chen and P. Gopalakrishnan. Speaker, environment and channel change detection and clustering via the bayesian information criterion. In DARPA speech recognition workshop, 1998.
-
(1998)
DARPA speech recognition workshop
-
-
Chen, S.1
Gopalakrishnan, P.2
-
2
-
-
85009212151
-
A sequential metric-based audio segmentation method via the bayesian information criterion
-
S.-S. Cheng and H.-M. Wang. A sequential metric-based audio segmentation method via the bayesian information criterion. In Proceedings of Eurospeech, pages 945-948, 2003.
-
(2003)
Proceedings of Eurospeech
, pp. 945-948
-
-
Cheng, S.-S.1
Wang, H.-M.2
-
3
-
-
1842830672
-
Audio-visual segmentation and "the cocktail party effect
-
T. Darrell, J. Fisher, P. A. Viola, and W. T. Freeman. Audio-visual segmentation and "the cocktail party effect". In ICMI, pages 320, 2000.
-
(2000)
ICMI
, pp. 320
-
-
Darrell, T.1
Fisher, J.2
Viola, P.A.3
Freeman, W.T.4
-
4
-
-
34047200341
-
-
J. Fiscus, N. Radde, J. Garofolo, A. Le, J. Ajot, and C. Laprun. Rich transcription 2005 spring meeting recognition evaluation. In www.nist.gov/speech/publications/papersrc/rt05sresults.pdf.
-
Rich transcription 2005 spring meeting recognition evaluation
-
-
Fiscus, J.1
Radde, N.2
Garofolo, J.3
Le, A.4
Ajot, J.5
Laprun, C.6
-
5
-
-
84949961905
-
Probabalistic models and informative subspaces for audiovisual correspondence
-
J. Fisher and T. Darrell. Probabalistic models and informative subspaces for audiovisual correspondence. In ECCV(3), pages 592-603, 2002.
-
(2002)
ECCV
, vol.3
, pp. 592-603
-
-
Fisher, J.1
Darrell, T.2
-
6
-
-
0009622482
-
Audio vision: Using audio-visual synchrony to locate sounds
-
J. Hershey and J. R. Movellan. Audio vision: Using audio-visual synchrony to locate sounds. In NIPS, 1999.
-
(1999)
NIPS
-
-
Hershey, J.1
Movellan, J.R.2
-
8
-
-
20444478554
-
Speaker localisation using audio-visual synchrony: An empirical study
-
H. J. Nock, G. Iyengar, and C. Neti. Speaker localisation using audio-visual synchrony: An empirical study. In CIVR, 2003.
-
(2003)
CIVR
-
-
Nock, H.J.1
Iyengar, G.2
Neti, C.3
-
9
-
-
85037085294
-
Gesture cues for conversational interaction in monocular video
-
F. Quek, D. McNeill, R. Ansari, X.-F. Ma, R. Bryll, S. Duncan, and K. E. McCullough. Gesture cues for conversational interaction in monocular video. In RATFG-RTS, 1999.
-
(1999)
RATFG-RTS
-
-
Quek, F.1
McNeill, D.2
Ansari, R.3
Ma, X.-F.4
Bryll, R.5
Duncan, S.6
McCullough, K.E.7
-
11
-
-
78650540904
-
Improved speaker segmentation and segments clustering using the bayesian information criterion
-
A. Tritschler and R. Gopinath. Improved speaker segmentation and segments clustering using the bayesian information criterion. In Proceedings of Eurospeech, 1999.
-
(1999)
Proceedings of Eurospeech
-
-
Tritschler, A.1
Gopinath, R.2
|