SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2008, Pages 93-96

Speaker indexing and speech enhancement in real meetings/conversations

(5) Araki, Shoko (a); Fujimoto, Masakiyo (a); Ishizuka, Kentaro (a); Sawada, Hiroshi (a); Makino, Shoji (a)

(a) NTT Communication Science Laboratories (Japan)

Author keywords

Diarization; Maximum SNR beamformer; Speaker indexing; Voice activity detector

Indexed keywords

ACOUSTICS; ARCHITECTURAL ACOUSTICS; COMPUTER NETWORKS; ERROR ANALYSIS; RADIO DIRECTION FINDING SYSTEMS; REVERBERATION; SIGNAL PROCESSING; SPEECH; SPEECH ENHANCEMENT;

DIARIZATION; MAXIMUM SNR BEAMFORMER; SPEAKER INDEXING; VOICE ACTIVITY DETECTOR;

INDEXING (OF INFORMATION);

EID: 51449113843 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2008.4517554 Document Type: Conference Paper

Times cited : (23)

References (13)

1
- 34548339624
- Speaker diarization for multi-microphone meetings using only between-channel differences
- Sept, Springer
- J. M. Pardo, X. Anguera, and C. Wooters, "Speaker diarization for multi-microphone meetings using only between-channel differences," in Proc. of MLMI'06 (LNCS 4299). Sept. 2006, pp. 257-264, Springer.
- (2006) Proc. of MLMI'06 (LNCS 4299) , pp. 257-264
- Pardo, J.M.¹ Anguera, X.² Wooters, C.³

2
- 50449086237
- Acoustic beamforming for speaker diarization of meetings
- Sept
- X. Anguera, C. Wooters, and J. Hernando, "Acoustic beamforming for speaker diarization of meetings," IEEE Trans. Audio, Speech and Language Processing, vol. 15, pp. 2011-2022, Sept. 2007.
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , pp. 2011-2022
- Anguera, X.¹ Wooters, C.² Hernando, J.³

3
- 51449106462
- http://www.nist.gov/speech/test_beds/mr_proj/

4
- 33746619064
- Speaker turn segmentation based on between-channel differences
- D. Ellis and J. Liu, "Speaker turn segmentation based on between-channel differences," in Proc. of NIST Meeting Recognition Workshop, 2004, pp. 112-117.
- (2004) Proc. of NIST Meeting Recognition Workshop , pp. 112-117
- Ellis, D.¹ Liu, J.²

5
- 34547535369
- Real-time monitoring of participants' interaction in a meeting using audio-visual sensors
- Apr
- C. Busso, P. Panayiotis, G. Georgiou, and S. Narayanan, "Real-time monitoring of participants' interaction in a meeting using audio-visual sensors," in Proc. of ICASSP'07, Apr. 2007, vol. II, pp. 685-688.
- (2007) Proc. of ICASSP'07 , vol.2 , pp. 685-688
- Busso, C.¹ Panayiotis, P.² Georgiou, G.³ Narayanan, S.⁴

6
- 34547498831
- Blind speech separation in a meeting situation with maximum SNR beamformers
- Apr
- S. Araki, H. Sawada, and S. Makino, "Blind speech separation in a meeting situation with maximum SNR beamformers," in Proc. of ICASSP'07, Apr. 2007, vol. I, pp. 41-45.
- (2007) Proc. of ICASSP'07 , vol.1 , pp. 41-45
- Araki, S.¹ Sawada, H.² Makino, S.³

7
- 85164649882
- Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio
- K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, "Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio," in Proc. of Interspeech '07, 2007, pp. 230-233.
- (2007) Proc. of Interspeech '07 , pp. 230-233
- Ishizuka, K.¹ Nakatani, T.² Fujimoto, M.³ Miyazaki, N.⁴

8
- 50449097931
- Noise robust voice activity detection based on switching Kalman filter
- Aug
- M. Fujimoto and K. Ishizuka, "Noise robust voice activity detection based on switching Kalman filter," in Proc. of Interspeech '07, Aug. 2007, pp. 2933-2936.
- (2007) Proc. of Interspeech '07 , pp. 2933-2936
- Fujimoto, M.¹ Ishizuka, K.²

9
- 50449109527
- A voice activity detection based on adaptive integration of multiple speech feature and signal decision scheme
- Mar, in submitting
- M. Fujimoto, K. Ishizuka, and T. Nakatani, "A voice activity detection based on adaptive integration of multiple speech feature and signal decision scheme," in Proc. of ICASSP '08, Mar. 2008, (in submitting).
- (2008) Proc. of ICASSP '08
- Fujimoto, M.¹ Ishizuka, K.² Nakatani, T.³

10
- 0016990291
- The generalized correlation method for estimation of time delay
- C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust. Speech and Signal Processing, vol. 24, no. 4, pp. 320-327, 1976.
- (1976) IEEE Trans. Acoust. Speech and Signal Processing , vol.24 , Issue.4 , pp. 320-327
- Knapp, C.H.¹ Carter, G.C.²

11
- 33947664111
- DOA estimation for multiple sparse sources with normalized observation vector clustering
- May
- S. Araki, H. Sawada, R. Mukai, and S. Makino, "DOA estimation for multiple sparse sources with normalized observation vector clustering," in Proc. of ICASSP'06, May 2006, vol. 5, pp. 33-36.
- (2006) Proc. of ICASSP'06 , vol.5 , pp. 33-36
- Araki, S.¹ Sawada, H.² Mukai, R.³ Makino, S.⁴

12
- 0003922190
- Wiley Interscience, 2nd edition
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, Wiley Interscience, 2nd edition, 2000.
- (2000) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

13
- 0032762471
- A statistical model-based voice activity detection
- J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, 1999.
- (1999) IEEE Signal Processing Letters , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.