Volume , Issue , 2010, Pages 2302-2305

Overlap detection for speaker diarization by fusing spectral and spatial features

Author keywords

Cross correlation; Speaker diarization; Speaker overlap detection

Indexed keywords

FEATURE EXTRACTION; SPEECH COMMUNICATION; SPEECH RECOGNITION;

EID: 79959829540     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 12

References (12)
  • 1
    • 33745224103
    • Spontaneous speech: How people really talk and why engineers should care
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • Shriberg, E., "Spontaneous Speech: How People Really Talk and Why Engineers Should Care," in Proc. Interspeech '05, Lisbon, Portugal, 2005, pp. 1781-1784. (Pubitemid 43908428)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 1781-1784
    • Shriberg, E.1
  • 2
    • 0141469852
    • Multispeaker speech activity detector for the ICSI meeting recorder
    • Madonna di Campiglio, Italy
    • Pfau, E., Ellis, D.P.W. and Stolcke, A., "Multispeaker Speech Activity Detector for the ICSI Meeting Recorder," in Proc. ASRU '01, Madonna di Campiglio, Italy, 2001, pp. 107-110.
    • (2001) Proc. ASRU '01 , pp. 107-110
    • Pfau, E.1    Ellis, D.P.W.2    Stolcke, A.3
  • 4
    • 85009097062
    • Crosscorrelation-based multispeaker speech activity detection
    • Jeju Island, Korea
    • Laskowski, K., Jin, Q. and Schultz, T., "Crosscorrelation-based Multispeaker Speech Activity Detection," in Interspeech '04, Jeju Island, Korea, 2004, pp. 973-976.
    • (2004) Interspeech '04 , pp. 973-976
    • Laskowski, K.1    Jin, Q.2    Schultz, T.3
  • 5
    • 0344425668
    • Location based speaker segmentation
    • Baltimore, USA
    • Lathoud, G. and McCowan, L., "Location Based Speaker Segmentation," in Proc. ICME '03, Baltimore, USA, 2003, pp. III-621-4 vol.3.
    • (2003) Proc. ICME '03 , vol.3
    • Lathoud, G.1    McCowan, L.2
  • 6
    • 77249176190
    • The AMI speaker diarization system for NIST RT06s meeting data
    • LNCS Springer Berlin/Heidelberg
    • van Leeuwen, D.A. and Huijbregts, M., "The AMI Speaker Diarization System for NIST RT06s Meeting Data," in Machine Learning for Multimodal Interaction, LNCS, vol. 4299/2006, Springer Berlin/Heidelberg, 2006, pp. 371-384.
    • (2006) Machine Learning for Multimodal Interaction , vol.4299 , Issue.2006 , pp. 371-384
    • Van Leeuwen, D.A.1    Huijbregts, M.2
  • 7
    • 84867228708
    • Two's a crowd: Improving speaker diarization by automatically identifying and excluding overlapped speech
    • Brisbane, Australia
    • Boakye, K., Vinyals, O., and Friedland, G., "Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech," in Proc. Interspeech '08, Brisbane, Australia, 2008, pp. 32-35.
    • (2008) Proc. Interspeech '08 , pp. 32-35
    • Boakye, K.1    Vinyals, O.2    Friedland, G.3
  • 8
    • 70349225212
    • Improved location features for meeting speaker diarization
    • Antwerp, Belgium
    • Otterson, S., "Improved Location Features for Meeting Speaker Diarization," in Proc. Interspeech '07, Antwerp, Belgium, 2007, pp. 1849-1852.
    • (2007) Proc. Interspeech '07 , pp. 1849-1852
    • Otterson, S.1
  • 9
    • 11144286121
    • The spectral autocorrelation peak valley ratio (SAPVR) - A usable speech measure employed as a co-channel detection system
    • Yantorno, R., "The Spectral Autocorrelation Peak Valley Ratio (SAPVR) - A Usable Speech Measure Employed as a Co-Channel Detection System," in Proc. of IEEE Workshop on Intelligent Signal Processing, 2001.
    • (2001) Proc. of IEEE Workshop on Intelligent Signal Processing
    • Yantorno, R.1
  • 10
    • 0030676520
    • Acoustic source location in a three-dimensional space using crosspower spectrum phase
    • Munich, Germany
    • Svaizer, P. et al., "Acoustic source location in a three-dimensional space using crosspower spectrum phase," in Proc. ICASSP '97, Munich, Germany, 1997, pp. 231-234.
    • (1997) Proc. ICASSP '97 , pp. 231-234
    • Svaizer, P.1
  • 11
    • 0030701369
    • A robust method for speech signal time-delay estimation in reverberant rooms
    • Munich, Germany
    • Brandstein, M. S. and Silverman, H. F., "A robust method for speech signal time-delay estimation in reverberant rooms," in Proc. ICASSP '97, Munich, Germany, 1997, pp. 375-378.
    • (1997) Proc. ICASSP '97 , pp. 375-378
    • Brandstein, M.S.1    Silverman, H.F.2
  • 12
    • 47749127366
    • Speaker diarization for conference room: The UPC RT07s evaluation system
    • LNCS Springer Berlin/Heidelberg
    • Luque, J. et al., "Speaker Diarization for Conference Room: The UPC RT07s Evaluation System," in Multimodal Technologies for Perception of Humans, LNCS, vol. 4625/2008, Springer Berlin/Heidelberg, 2008, pp. 543-553.
    • (2008) Multimodal Technologies for Perception of Humans , vol.4625 , Issue.2008 , pp. 543-553
    • Luque, J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.