메뉴 건너뛰기




Volumn 54, Issue 1, 2012, Pages 55-67

Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features

Author keywords

Information bottleneck diarization; Meeting recordings; Multi stream modeling; NIST rich transcription; Speaker diarization

Indexed keywords

INFORMATION BOTTLENECK; MEETING RECORDINGS; MULTI-STREAM; NIST RICH TRANSCRIPTION; SPEAKER DIARIZATION;

EID: 80052714549     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2011.07.001     Document Type: Conference Paper
Times cited : (16)

References (29)
  • 1
    • 80052782258 scopus 로고    scopus 로고
    • Ph.D. thesis, Ecole Polytechnique Federale de Lausanne (EPFL)
    • Ajmera, Jitendra.; 2004. Robust Audio Segmentation, Ph.D. thesis, Ecole Polytechnique Federale de Lausanne (EPFL).
    • (2004) Robust Audio Segmentation
    • Jitendra, A.1
  • 6
    • 33745560829 scopus 로고    scopus 로고
    • Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system
    • X. Anguera, C. Wooters, B. Peskin, and M. Aguiló Robust speaker segmentation for meetings: the ICSI-SRI spring 2005 diarization system Lect. Notes Comput. Sci. 3869 2006 402
    • (2006) Lect. Notes Comput. Sci. , vol.3869 , pp. 402
    • Anguera, X.1    Wooters, C.2    Peskin, B.3    Aguiló, M.4
  • 7
    • 84863773378 scopus 로고    scopus 로고
    • Frequency-domain linear prediction for temporal features
    • Understanding, ASRU '03
    • Athineos, M. et al.; 2003. Frequency-domain linear prediction for temporal features. In: Proc. IEEE Workshop Autom. Speech Recognit. Understanding, ASRU '03.
    • (2003) Proc. IEEE Workshop Autom. Speech Recognit
    • Athineos, M.1
  • 8
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the bayesian information criterion
    • Chen, S.S.; Gopalakrishnan, P.S.; 1998. Speaker, environment and channel change detection and clustering via the bayesian information criterion. In: Proc. DARPA Speech Recognition Workshop, pp. 127-138.
    • (1998) Proc. DARPA Speech Recognition Workshop , pp. 127-138
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 9
    • 67651165389 scopus 로고    scopus 로고
    • Prosodic and other long-term features for speaker diarization
    • G. Friedland Prosodic and other long-term features for speaker diarization IEEE Trans. Audio Speech Language Process. 17 5 2009 985 993
    • (2009) IEEE Trans. Audio Speech Language Process. , vol.17 , Issue.5 , pp. 985-993
    • Friedland, G.1
  • 10
    • 84867195580 scopus 로고    scopus 로고
    • Front-end for far-field speech recognition based on frequency domain linear prediction
    • Brisbane, Australia
    • Ganapathy, S.; et al.; 2008. Front-end for far-field speech recognition based on frequency domain linear prediction. In: Proc. INTERSPEECH, Brisbane, Australia
    • (2008) Proc. INTERSPEECH
    • Ganapathy, S.1
  • 12
    • 51649124977 scopus 로고    scopus 로고
    • The Information bottleneck revisited or how to choose a good distortion measure
    • ISIT, 2007
    • Harremoës, P.; Tishby, N.; 2007. The Information bottleneck revisited or how to choose a good distortion measure. In: IEEE Internat. Symp. Inform. Theor.; ISIT 2007, pp. 566-570.
    • (2007) IEEE Internat. Symp. Inform. Theor. , pp. 566-570
    • Harremoës, P.1    Tishby, N.2
  • 13
    • 0032136330 scopus 로고    scopus 로고
    • Robust speech recognition using the modulation spectrogram
    • B. Kingsbury, N. Morgan, and S. Greenberg Robust speech recognition using the modulation spectrogram Speech Commun. 25 1998 117132
    • (1998) Speech Commun. , vol.25 , pp. 117132
    • Kingsbury, B.1    Morgan, N.2    Greenberg, S.3
  • 14
    • 85084015242 scopus 로고    scopus 로고
    • Dimension reduction of the modulation spectrogram for speaker verification
    • Workshop, Stellenbosch, South Africa
    • Kinnunen, T.; et al.; 2008. Dimension reduction of the modulation spectrogram for speaker verification. In: Proc. Odyssey: The Speaker and Language Recognit. Workshop, Stellenbosch, South Africa.
    • (2008) Proc. Odyssey: The Speaker and Language Recognit
    • Kinnunen, T.1
  • 15
    • 47749103773 scopus 로고    scopus 로고
    • Progress in the AMIDA speaker diarization system for meeting data
    • Evaluation Workshops Clear 2007 and Rt 2007, Baltimore, MD, USA, May 8-11 Revised Selected Papers
    • van Leeuwen, D.A.; Konecny, M.; 2008. Progress in the AMIDA speaker diarization system for meeting data. In: Multimodal Technologies for Perception of Humans: Internat. Evaluation Workshops Clear 2007 and Rt 2007, Baltimore, MD, USA, May 8-11, 2007, Revised Selected Papers, p. 475.
    • (2007) Multimodal Technologies for Perception of Humans: Internat , pp. 475
    • Van Leeuwen, D.A.1    Konecny, M.2
  • 16
    • 80052700063 scopus 로고    scopus 로고
    • .
  • 17
    • 57649176425 scopus 로고    scopus 로고
    • On-line multi-modal speaker diarization
    • Noulas, A.K.; Krose, B.; 2007. On-line multi-modal speaker diarization. In: Proceedings of ICMI, pp. 350-357.
    • (2007) Proceedings of ICMI , pp. 350-357
    • Noulas, A.K.1    Krose, B.2
  • 18
    • 34548351229 scopus 로고    scopus 로고
    • Speaker diarization for multi-microphone meetings: Mixing acoustic features and inter-channel time differences
    • Pardo, J.M.; Anguera, X.; Wooters, C.; 2006. Speaker diarization for multi-microphone meetings: mixing acoustic features and inter-channel time differences. In: Internat. Conf. Speech Language Process.
    • (2006) Internat. Conf. Speech Language Process
    • Pardo, J.M.1    Anguera, X.2    Wooters, C.3
  • 19
    • 34548310397 scopus 로고    scopus 로고
    • Speaker diarization for multiple-distant-microphone meetings using several sources of information
    • DOI 10.1109/TC.2007.70746, Emergent Systems, Algorithms, and Architectures for Speech-Based Human Machine Interaction
    • J.M. Pardo, X. Anguera, and C. Wooters Speaker diarization for multiple-distant-microphone meetings using several sources of information IEEE Trans. Comput. 56 9 2007 1212 1224 (Pubitemid 47333559)
    • (2007) IEEE Transactions on Computers , vol.56 , Issue.9 , pp. 1212-1224
    • Pardo, J.M.1    Anguera, X.2    Wooters, C.3
  • 22
    • 67650107416 scopus 로고    scopus 로고
    • Recognition of reverberant speech using frequency domain linear prediction
    • S. Thomas Recognition of reverberant speech using frequency domain linear prediction IEEE Signal Process. Lett. 15 2008 681 684
    • (2008) IEEE Signal Process. Lett. , vol.15 , pp. 681-684
    • Thomas, S.1
  • 25
    • 84867206818 scopus 로고    scopus 로고
    • Integration of TDOA features in information bottleneck framework for fast speaker diarization
    • Vijayasenan, D. et al.; 2008. Integration of TDOA features in information bottleneck framework for fast speaker diarization. In: Interspeech 2008.
    • (2008) Interspeech 2008
    • Vijayasenan, D.1
  • 26
    • 68649087212 scopus 로고    scopus 로고
    • An information theoretic approach to speaker diarization of meeting data
    • D. Vijayasenan An information theoretic approach to speaker diarization of meeting data IEEE Trans. Audio Speech Language Process. 17 7 2009 1382 1393
    • (2009) IEEE Trans. Audio Speech Language Process. , vol.17 , Issue.7 , pp. 1382-1393
    • Vijayasenan, D.1
  • 28
    • 70450205418 scopus 로고    scopus 로고
    • Modulation spectrogram features for speaker diarization
    • Vinyals, O. et al.; 2008. Modulation spectrogram features for speaker diarization. In: Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech, 2008
    • Vinyals, O.1
  • 29
    • 47749119617 scopus 로고    scopus 로고
    • The ICSI RT07s speaker diarization system
    • C. Wooters, and M. Huijbregts The ICSI RT07s speaker diarization system Lect. Notes Comput. Sci. 4625 2008 509 519
    • (2008) Lect. Notes Comput. Sci. , vol.4625 , pp. 509-519
    • Wooters, C.1    Huijbregts, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.