-
1
-
-
34548339624
-
Speaker diarization for multi-microphone meetings using only between-channel differences
-
Sept, Springer
-
J. M. Pardo, X. Anguera, and C. Wooters, "Speaker diarization for multi-microphone meetings using only between-channel differences," in Proc. of MLMI'06 (LNCS 4299). Sept. 2006, pp. 257-264, Springer.
-
(2006)
Proc. of MLMI'06 (LNCS 4299)
, pp. 257-264
-
-
Pardo, J.M.1
Anguera, X.2
Wooters, C.3
-
2
-
-
50449086237
-
Acoustic beamforming for speaker diarization of meetings
-
Sept
-
X. Anguera, C. Wooters, and J. Hernando, "Acoustic beamforming for speaker diarization of meetings," IEEE Trans. Audio, Speech and Language Processing, vol. 15, pp. 2011-2022, Sept. 2007.
-
(2007)
IEEE Trans. Audio, Speech and Language Processing
, vol.15
, pp. 2011-2022
-
-
Anguera, X.1
Wooters, C.2
Hernando, J.3
-
3
-
-
51449106462
-
-
http://www.nist.gov/speech/test beds/mr proj/
-
-
-
-
4
-
-
33746619064
-
Speaker turn segmentation based on between-channel differences
-
D. Ellis and J. Liu, "Speaker turn segmentation based on between-channel differences," in Proc. of NIST Meeting Recognition Workshop, 2004, pp. 112-117.
-
(2004)
Proc. of NIST Meeting Recognition Workshop
, pp. 112-117
-
-
Ellis, D.1
Liu, J.2
-
5
-
-
34547535369
-
Real-time monitoring of participants' interaction in a meeting using audio-visual sensors
-
Apr
-
C. Busso, P. Panayiotis, G. Georgiou, and S. Narayanan, "Real-time monitoring of participants' interaction in a meeting using audio-visual sensors," in Proc. of ICASSP'07, Apr. 2007, vol. II, pp. 685-688.
-
(2007)
Proc. of ICASSP'07
, vol.2
, pp. 685-688
-
-
Busso, C.1
Panayiotis, P.2
Georgiou, G.3
Narayanan, S.4
-
6
-
-
34547498831
-
Blind speech separation in a meeting situation with maximum SNR beamformers
-
Apr
-
S. Araki, H. Sawada, and S. Makino, "Blind speech separation in a meeting situation with maximum SNR beamformers," in Proc. of ICASSP'07, Apr. 2007, vol. I, pp. 41-45.
-
(2007)
Proc. of ICASSP'07
, vol.1
, pp. 41-45
-
-
Araki, S.1
Sawada, H.2
Makino, S.3
-
7
-
-
85164649882
-
Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio
-
K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, "Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio," in Proc. of Interspeech '07, 2007, pp. 230-233.
-
(2007)
Proc. of Interspeech '07
, pp. 230-233
-
-
Ishizuka, K.1
Nakatani, T.2
Fujimoto, M.3
Miyazaki, N.4
-
8
-
-
50449097931
-
Noise robust voice activity detection based on switching Kalman filter
-
Aug
-
M. Fujimoto and K. Ishizuka, "Noise robust voice activity detection based on switching Kalman filter," in Proc. of Interspeech '07, Aug. 2007, pp. 2933-2936.
-
(2007)
Proc. of Interspeech '07
, pp. 2933-2936
-
-
Fujimoto, M.1
Ishizuka, K.2
-
9
-
-
50449109527
-
A voice activity detection based on adaptive integration of multiple speech feature and signal decision scheme
-
Mar, in submitting
-
M. Fujimoto, K. Ishizuka, and T. Nakatani, "A voice activity detection based on adaptive integration of multiple speech feature and signal decision scheme," in Proc. of ICASSP '08, Mar. 2008, (in submitting).
-
(2008)
Proc. of ICASSP '08
-
-
Fujimoto, M.1
Ishizuka, K.2
Nakatani, T.3
-
10
-
-
0016990291
-
The generalized correlation method for estimation of time delay
-
C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust. Speech and Signal Processing, vol. 24, no. 4, pp. 320-327, 1976.
-
(1976)
IEEE Trans. Acoust. Speech and Signal Processing
, vol.24
, Issue.4
, pp. 320-327
-
-
Knapp, C.H.1
Carter, G.C.2
-
11
-
-
33947664111
-
DOA estimation for multiple sparse sources with normalized observation vector clustering
-
May
-
S. Araki, H. Sawada, R. Mukai, and S. Makino, "DOA estimation for multiple sparse sources with normalized observation vector clustering," in Proc. of ICASSP'06, May 2006, vol. 5, pp. 33-36.
-
(2006)
Proc. of ICASSP'06
, vol.5
, pp. 33-36
-
-
Araki, S.1
Sawada, H.2
Mukai, R.3
Makino, S.4
-
12
-
-
0003922190
-
-
Wiley Interscience, 2nd edition
-
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, Wiley Interscience, 2nd edition, 2000.
-
(2000)
Pattern Classification
-
-
Duda, R.O.1
Hart, P.E.2
Stork, D.G.3
-
13
-
-
0032762471
-
A statistical model-based voice activity detection
-
J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, 1999.
-
(1999)
IEEE Signal Processing Letters
, vol.6
, Issue.1
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
|