메뉴 건너뛰기




Volumn , Issue , 2008, Pages 93-96

Speaker indexing and speech enhancement in real meetings/conversations

Author keywords

Diarization; Maximum SNR beamformer; Speaker indexing; Voice activity detector

Indexed keywords

ACOUSTICS; ARCHITECTURAL ACOUSTICS; COMPUTER NETWORKS; ERROR ANALYSIS; RADIO DIRECTION FINDING SYSTEMS; REVERBERATION; SIGNAL PROCESSING; SPEECH; SPEECH ENHANCEMENT;

EID: 51449113843     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2008.4517554     Document Type: Conference Paper
Times cited : (23)

References (13)
  • 1
    • 34548339624 scopus 로고    scopus 로고
    • Speaker diarization for multi-microphone meetings using only between-channel differences
    • Sept, Springer
    • J. M. Pardo, X. Anguera, and C. Wooters, "Speaker diarization for multi-microphone meetings using only between-channel differences," in Proc. of MLMI'06 (LNCS 4299). Sept. 2006, pp. 257-264, Springer.
    • (2006) Proc. of MLMI'06 (LNCS 4299) , pp. 257-264
    • Pardo, J.M.1    Anguera, X.2    Wooters, C.3
  • 3
    • 51449106462 scopus 로고    scopus 로고
    • http://www.nist.gov/speech/test beds/mr proj/
  • 4
    • 33746619064 scopus 로고    scopus 로고
    • Speaker turn segmentation based on between-channel differences
    • D. Ellis and J. Liu, "Speaker turn segmentation based on between-channel differences," in Proc. of NIST Meeting Recognition Workshop, 2004, pp. 112-117.
    • (2004) Proc. of NIST Meeting Recognition Workshop , pp. 112-117
    • Ellis, D.1    Liu, J.2
  • 5
    • 34547535369 scopus 로고    scopus 로고
    • Real-time monitoring of participants' interaction in a meeting using audio-visual sensors
    • Apr
    • C. Busso, P. Panayiotis, G. Georgiou, and S. Narayanan, "Real-time monitoring of participants' interaction in a meeting using audio-visual sensors," in Proc. of ICASSP'07, Apr. 2007, vol. II, pp. 685-688.
    • (2007) Proc. of ICASSP'07 , vol.2 , pp. 685-688
    • Busso, C.1    Panayiotis, P.2    Georgiou, G.3    Narayanan, S.4
  • 6
    • 34547498831 scopus 로고    scopus 로고
    • Blind speech separation in a meeting situation with maximum SNR beamformers
    • Apr
    • S. Araki, H. Sawada, and S. Makino, "Blind speech separation in a meeting situation with maximum SNR beamformers," in Proc. of ICASSP'07, Apr. 2007, vol. I, pp. 41-45.
    • (2007) Proc. of ICASSP'07 , vol.1 , pp. 41-45
    • Araki, S.1    Sawada, H.2    Makino, S.3
  • 7
    • 85164649882 scopus 로고    scopus 로고
    • Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio
    • K. Ishizuka, T. Nakatani, M. Fujimoto, and N. Miyazaki, "Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio," in Proc. of Interspeech '07, 2007, pp. 230-233.
    • (2007) Proc. of Interspeech '07 , pp. 230-233
    • Ishizuka, K.1    Nakatani, T.2    Fujimoto, M.3    Miyazaki, N.4
  • 8
    • 50449097931 scopus 로고    scopus 로고
    • Noise robust voice activity detection based on switching Kalman filter
    • Aug
    • M. Fujimoto and K. Ishizuka, "Noise robust voice activity detection based on switching Kalman filter," in Proc. of Interspeech '07, Aug. 2007, pp. 2933-2936.
    • (2007) Proc. of Interspeech '07 , pp. 2933-2936
    • Fujimoto, M.1    Ishizuka, K.2
  • 9
    • 50449109527 scopus 로고    scopus 로고
    • A voice activity detection based on adaptive integration of multiple speech feature and signal decision scheme
    • Mar, in submitting
    • M. Fujimoto, K. Ishizuka, and T. Nakatani, "A voice activity detection based on adaptive integration of multiple speech feature and signal decision scheme," in Proc. of ICASSP '08, Mar. 2008, (in submitting).
    • (2008) Proc. of ICASSP '08
    • Fujimoto, M.1    Ishizuka, K.2    Nakatani, T.3
  • 10
    • 0016990291 scopus 로고
    • The generalized correlation method for estimation of time delay
    • C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust. Speech and Signal Processing, vol. 24, no. 4, pp. 320-327, 1976.
    • (1976) IEEE Trans. Acoust. Speech and Signal Processing , vol.24 , Issue.4 , pp. 320-327
    • Knapp, C.H.1    Carter, G.C.2
  • 11
    • 33947664111 scopus 로고    scopus 로고
    • DOA estimation for multiple sparse sources with normalized observation vector clustering
    • May
    • S. Araki, H. Sawada, R. Mukai, and S. Makino, "DOA estimation for multiple sparse sources with normalized observation vector clustering," in Proc. of ICASSP'06, May 2006, vol. 5, pp. 33-36.
    • (2006) Proc. of ICASSP'06 , vol.5 , pp. 33-36
    • Araki, S.1    Sawada, H.2    Mukai, R.3    Makino, S.4
  • 13
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) IEEE Signal Processing Letters , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.