메뉴 건너뛰기




Volumn , Issue , 2007, Pages 683-686

Efficient use of overlap information in speaker diarization

Author keywords

Diarization; Localization; Overlap; Speaker identification

Indexed keywords

CROSS CORRELATIONS; DIARIZATION; LOCALIZATION; OVERLAP; OVERLAP DETECTIONS; SPEAKER CLUSTERING; SPEAKER DIARIZATION; SPEAKER IDENTIFICATION;

EID: 44849101173     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/asru.2007.4430194     Document Type: Conference Paper
Times cited : (45)

References (23)
  • 2
    • 85009145345 scopus 로고    scopus 로고
    • Observations on overlap: Findings and implications for automatic processing of multi-party conversation
    • E. Shriberg, A. Stoicke, and D. Baron, "Observations on overlap: Findings and implications for automatic processing of multi-party conversation," in Eurospeech, 2001.
    • (2001) Eurospeech
    • Shriberg, E.1    Stoicke, A.2    Baron, D.3
  • 3
    • 44849093045 scopus 로고    scopus 로고
    • Progress in the AMIDA speaker diarization system for meeting data
    • D. A. van Leeuwen and M. Konecny, "Progress in the AMIDA speaker diarization system for meeting data," in NIST RT07 Workshop, 2007.
    • (2007) NIST RT07 Workshop
    • van Leeuwen, D.A.1    Konecny, M.2
  • 6
    • 0031352137 scopus 로고    scopus 로고
    • Blind source separation and deconvolution by dynamic component analysis
    • H. Attias and C. E. Schreiner, "Blind source separation and deconvolution by dynamic component analysis," Neural Networks for Signal Processing, vol. VII, pp. 456-465, 1997.
    • (1997) Neural Networks for Signal Processing , vol.7 , pp. 456-465
    • Attias, H.1    Schreiner, C.E.2
  • 7
    • 0031145298 scopus 로고    scopus 로고
    • Multichannel speech separation by eigendecomposition and its application to co-talker interference removal
    • Y. Cao, S. Sridharan, and M. Moody, "Multichannel speech separation by eigendecomposition and its application to co-talker interference removal," IEEE Trans. on Speech and Audio Proc., vol. 5, no. 3, pp. 209-219, 1997.
    • (1997) IEEE Trans. on Speech and Audio Proc , vol.5 , Issue.3 , pp. 209-219
    • Cao, Y.1    Sridharan, S.2    Moody, M.3
  • 9
    • 84892176648 scopus 로고    scopus 로고
    • Combining time-delayed decorrelation and ICA: Towards solving the cocktail party problem
    • T.-W. Lee, A. Ziehe, R. Orglmeister, and T. Sejnowski, "Combining time-delayed decorrelation and ICA: Towards solving the cocktail party problem," in Proc. ICASSP, vol. 2, pp. 1249-1253, 1998.
    • (1998) Proc. ICASSP , vol.2 , pp. 1249-1253
    • Lee, T.-W.1    Ziehe, A.2    Orglmeister, R.3    Sejnowski, T.4
  • 11
    • 0035280043 scopus 로고    scopus 로고
    • A comparison of auditory and blind separation techniques for speech segregaton
    • A. W. van der Kouwe, D. Wang, and G. J. Brown, "A comparison of auditory and blind separation techniques for speech segregaton," IEEE Trans. on Speech and Audio Proc., vol. 9, pp. 189-195, 2000.
    • (2000) IEEE Trans. on Speech and Audio Proc , vol.9 , pp. 189-195
    • van der Kouwe, A.W.1    Wang, D.2    Brown, G.J.3
  • 12
    • 0034843120 scopus 로고    scopus 로고
    • Use of local kurtosis measure for spotting usable speech segments in cochannel speech
    • K. R. Krishnamachari, R. E. Yantorno, J. M. Lovekin, D. S. Benincasa, and S. J. Wendt, "Use of local kurtosis measure for spotting usable speech segments in cochannel speech," in Proc. ICASSP, vol. 1, pp. 649-652, 2001.
    • (2001) Proc. ICASSP , vol.1 , pp. 649-652
    • Krishnamachari, K.R.1    Yantorno, R.E.2    Lovekin, J.M.3    Benincasa, D.S.4    Wendt, S.J.5
  • 13
    • 44849126533 scopus 로고    scopus 로고
    • Adjacent pitch period comparison (APPC) as a usability measure of speech segments under co-channel conditions
    • J. Lovekin, K. R. Krishnanmachari, and R. E. Yantorno, "Adjacent pitch period comparison (APPC) as a usability measure of speech segments under co-channel conditions," in ISPACS, 2001.
    • (2001) ISPACS
    • Lovekin, J.1    Krishnanmachari, K.R.2    Yantorno, R.E.3
  • 14
    • 0037856369 scopus 로고    scopus 로고
    • Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual
    • N. Chandra and R. E. Yantorno, "Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual," in Intl. Conf, Signal and Image Proc., 2002.
    • (2002) Intl. Conf, Signal and Image Proc
    • Chandra, N.1    Yantorno, R.E.2
  • 17
    • 0034818605 scopus 로고    scopus 로고
    • Cochannel speaker count labelling based on the use of cepstral and pitch predicton derived features
    • M. A. Lewis and R. P. Ramachandran, "Cochannel speaker count labelling based on the use of cepstral and pitch predicton derived features," Pattern Recongnition, vol. 34, pp. 449-507, 2001.
    • (2001) Pattern Recongnition , vol.34 , pp. 449-507
    • Lewis, M.A.1    Ramachandran, R.P.2
  • 18
    • 44849113330 scopus 로고    scopus 로고
    • Speaker overlap detection with Hough transform pitch features,
    • Tech. Rep. 2004-0012, Univesity of Washington
    • S. Otterson, S. Furui, and M. Ostendorf, "Speaker overlap detection with Hough transform pitch features," Tech. Rep. 2004-0012, Univesity of Washington, 2004.
    • (2004)
    • Otterson, S.1    Furui, S.2    Ostendorf, M.3
  • 19
    • 0003548585 scopus 로고
    • DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM.,
    • Tech. Rep. NT-STIR 4930, National Institute of Standards and Technology
    • J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett, and N. Dahlgren, "DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM.," Tech. Rep. NT-STIR 4930, National Institute of Standards and Technology, 1993.
    • (1993)
    • Garofolo, J.1    Lamel, L.2    Fisher, W.3    Fiscus, J.4    Pallett, D.5    Dahlgren, N.6
  • 20
    • 0033693217 scopus 로고    scopus 로고
    • Multi-source localization in reverberant environments by ROOT-MUSIC and clustering
    • E. D. D. Claudio, R. Parisi, and G. Orlandi, "Multi-source localization in reverberant environments by ROOT-MUSIC and clustering," in Proc. ICASSP, vol. 2, pp. 921-924, 2000.
    • (2000) Proc. ICASSP , vol.2 , pp. 921-924
    • Claudio, E.D.D.1    Parisi, R.2    Orlandi, G.3
  • 21
    • 44849118348 scopus 로고    scopus 로고
    • The AMI speaker diarization system for NIST RT06s meeting data
    • D. A. van Leeuwen and M. Huijbregts, "The AMI speaker diarization system for NIST RT06s meeting data," in NIST RT06 workshop, 2006.
    • (2006) NIST RT06 workshop
    • van Leeuwen, D.A.1    Huijbregts, M.2
  • 22
    • 44949197897 scopus 로고    scopus 로고
    • Robust speaker diarization for meetings: ICSI RT06s evaluation system
    • X. Anguera, C. Wooters, and J. M. Pardo, "Robust speaker diarization for meetings: ICSI RT06s evaluation system," in Proc. ICSLP, 2006.
    • (2006) Proc. ICSLP
    • Anguera, X.1    Wooters, C.2    Pardo, J.M.3
  • 23
    • 56149094619 scopus 로고    scopus 로고
    • Improved location features for meeting speaker diarization
    • S. Otterson, "Improved location features for meeting speaker diarization," in Interspeech, 2007.
    • (2007) Interspeech
    • Otterson, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.