SCOPUS 정보 검색 플랫폼

Volumn 54, Issue 1, 2012, Pages 55-67

Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features

(3) Vijayasenan, Deepu a Valente, Fabio a Bourlard, Hervé a

a IDIAP RESEARCH INSTITUTE (Switzerland)

Author keywords

Information bottleneck diarization; Meeting recordings; Multi stream modeling; NIST rich transcription; Speaker diarization

Indexed keywords

INFORMATION BOTTLENECK; MEETING RECORDINGS; MULTI-STREAM; NIST RICH TRANSCRIPTION; SPEAKER DIARIZATION;

EXPERIMENTS; SPECTRUM ANALYSIS; TRANSCRIPTION;

COMPUTATIONAL COMPLEXITY;

EID: 80052714549 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2011.07.001 Document Type: Conference Paper

Times cited : (16)

References (29)

1
- 80052782258
- Ph.D. thesis, Ecole Polytechnique Federale de Lausanne (EPFL)
- Ajmera, Jitendra.; 2004. Robust Audio Segmentation, Ph.D. thesis, Ecole Polytechnique Federale de Lausanne (EPFL).
- (2004) Robust Audio Segmentation
- Jitendra, A.¹

2
- 3543144948
- Robust speaker change detection
- J. Ajmera, I. McCowan, and H. Bourlard Robust speaker change detection Signal Process. Lett. IEEE 11 8 2004 649 651
- (2004) Signal Process. Lett. IEEE , vol.11 , Issue.8 , pp. 649-651
- Ajmera, J.¹ McCowan, I.² Bourlard, H.³

3
- 44849143841
- Anguera, X.; 2006. Beamformit, the fast and robust acoustic beamformer. Available from .
- (2006) Beamformit, the Fast and Robust Acoustic Beamformer
- Anguera, X.¹

4
- 77956193852
- Ph.D. thesis, Universitat Politecnica de Catalunya
- Anguera, Xavier.; 2006. Robust Speaker Diarization for Meetings, Ph.D. thesis, Universitat Politecnica de Catalunya,.
- (2006) Robust Speaker Diarization for Meetings
- Xavier, A.¹

5
- 34547553154
- Speaker diarization for multi-party meetings using acoustic fusion
- Anguera, X.; Wooters, C.; Hernando, J.H.; 2006. Speaker diarization for multi-party meetings using acoustic fusion. In: Proc. Autom. Speech Recognit. Understanding, pp. 426-431.
- (2006) Proc. Autom. Speech Recognit. Understanding , pp. 426-431
- Anguera, X.¹ Wooters, C.² Hernando, J.H.³

6
- 33745560829
- Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system
- X. Anguera, C. Wooters, B. Peskin, and M. Aguiló Robust speaker segmentation for meetings: the ICSI-SRI spring 2005 diarization system Lect. Notes Comput. Sci. 3869 2006 402
- (2006) Lect. Notes Comput. Sci. , vol.3869 , pp. 402
- Anguera, X.¹ Wooters, C.² Peskin, B.³ Aguiló, M.⁴

7
- 84863773378
- Frequency-domain linear prediction for temporal features
- Understanding, ASRU '03
- Athineos, M. et al.; 2003. Frequency-domain linear prediction for temporal features. In: Proc. IEEE Workshop Autom. Speech Recognit. Understanding, ASRU '03.
- (2003) Proc. IEEE Workshop Autom. Speech Recognit
- Athineos, M.¹

8
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- Chen, S.S.; Gopalakrishnan, P.S.; 1998. Speaker, environment and channel change detection and clustering via the bayesian information criterion. In: Proc. DARPA Speech Recognition Workshop, pp. 127-138.
- (1998) Proc. DARPA Speech Recognition Workshop , pp. 127-138
- Chen, S.S.¹ Gopalakrishnan, P.S.²

9
- 67651165389
- Prosodic and other long-term features for speaker diarization
- G. Friedland Prosodic and other long-term features for speaker diarization IEEE Trans. Audio Speech Language Process. 17 5 2009 985 993
- (2009) IEEE Trans. Audio Speech Language Process. , vol.17 , Issue.5 , pp. 985-993
- Friedland, G.¹

10
- 84867195580
- Front-end for far-field speech recognition based on frequency domain linear prediction
- Brisbane, Australia
- Ganapathy, S.; et al.; 2008. Front-end for far-field speech recognition based on frequency domain linear prediction. In: Proc. INTERSPEECH, Brisbane, Australia
- (2008) Proc. INTERSPEECH
- Ganapathy, S.¹

11
- 70450151824
- Ph.D. thesis, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Guillermo Aradilla.; 2008. Acoustic Models for Posterior Features in Speech Recognition. Ph.D. thesis, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
- (2008) Acoustic Models for Posterior Features in Speech Recognition
- Aradilla, G.¹

12
- 51649124977
- The Information bottleneck revisited or how to choose a good distortion measure
- ISIT, 2007
- Harremoës, P.; Tishby, N.; 2007. The Information bottleneck revisited or how to choose a good distortion measure. In: IEEE Internat. Symp. Inform. Theor.; ISIT 2007, pp. 566-570.
- (2007) IEEE Internat. Symp. Inform. Theor. , pp. 566-570
- Harremoës, P.¹ Tishby, N.²

13
- 0032136330
- Robust speech recognition using the modulation spectrogram
- B. Kingsbury, N. Morgan, and S. Greenberg Robust speech recognition using the modulation spectrogram Speech Commun. 25 1998 117132
- (1998) Speech Commun. , vol.25 , pp. 117132
- Kingsbury, B.¹ Morgan, N.² Greenberg, S.³

14
- 85084015242
- Dimension reduction of the modulation spectrogram for speaker verification
- Workshop, Stellenbosch, South Africa
- Kinnunen, T.; et al.; 2008. Dimension reduction of the modulation spectrogram for speaker verification. In: Proc. Odyssey: The Speaker and Language Recognit. Workshop, Stellenbosch, South Africa.
- (2008) Proc. Odyssey: The Speaker and Language Recognit
- Kinnunen, T.¹

15
- 47749103773
- Progress in the AMIDA speaker diarization system for meeting data
- Evaluation Workshops Clear 2007 and Rt 2007, Baltimore, MD, USA, May 8-11 Revised Selected Papers
- van Leeuwen, D.A.; Konecny, M.; 2008. Progress in the AMIDA speaker diarization system for meeting data. In: Multimodal Technologies for Perception of Humans: Internat. Evaluation Workshops Clear 2007 and Rt 2007, Baltimore, MD, USA, May 8-11, 2007, Revised Selected Papers, p. 475.
- (2007) Multimodal Technologies for Perception of Humans: Internat , pp. 475
- Van Leeuwen, D.A.¹ Konecny, M.²

16
- 80052700063
- .

17
- 57649176425
- On-line multi-modal speaker diarization
- Noulas, A.K.; Krose, B.; 2007. On-line multi-modal speaker diarization. In: Proceedings of ICMI, pp. 350-357.
- (2007) Proceedings of ICMI , pp. 350-357
- Noulas, A.K.¹ Krose, B.²

18
- 34548351229
- Speaker diarization for multi-microphone meetings: Mixing acoustic features and inter-channel time differences
- Pardo, J.M.; Anguera, X.; Wooters, C.; 2006. Speaker diarization for multi-microphone meetings: mixing acoustic features and inter-channel time differences. In: Internat. Conf. Speech Language Process.
- (2006) Internat. Conf. Speech Language Process
- Pardo, J.M.¹ Anguera, X.² Wooters, C.³

19
- 34548310397
- Speaker diarization for multiple-distant-microphone meetings using several sources of information
- DOI 10.1109/TC.2007.70746, Emergent Systems, Algorithms, and Architectures for Speech-Based Human Machine Interaction
- J.M. Pardo, X. Anguera, and C. Wooters Speaker diarization for multiple-distant-microphone meetings using several sources of information IEEE Trans. Comput. 56 9 2007 1212 1224 (Pubitemid 47333559)
- (2007) IEEE Transactions on Computers , vol.56 , Issue.9 , pp. 1212-1224
- Pardo, J.M.¹ Anguera, X.² Wooters, C.³

20
- 80052705170
- Ph.D. thesis, The Hebrew University of Jerusalem
- Slonim, Noam.; 2002. The Information Bottleneck: Theory and Applications. Ph.D. thesis, The Hebrew University of Jerusalem.
- (2002) The Information Bottleneck: Theory and Applications
- Noam, S.¹

21
- 84899004693
- Agglomerative information bottleneck
- MIT Press
- Noam Slonim, N. Friedman, and Naftali Tishby Agglomerative information bottleneck Proc. Adv. Neural Inform. Process. Syst. 1999 MIT Press 617 623
- (1999) Proc. Adv. Neural Inform. Process. Syst. , pp. 617-623
- Slonim, N.¹ Friedman, N.² Tishby, N.³

22
- 67650107416
- Recognition of reverberant speech using frequency domain linear prediction
- S. Thomas Recognition of reverberant speech using frequency domain linear prediction IEEE Signal Process. Lett. 15 2008 681 684
- (2008) IEEE Signal Process. Lett. , vol.15 , pp. 681-684
- Thomas, S.¹

23
- 44849094004
- The information bottleneck method
- Tishby, Naftali.; Pereira, F.C.; Bialek, W.; 1998. The information bottleneck method. In: NEC Res. Inst. TR.
- (1998) NEC Res. Inst. TR
- Tishby, N.¹ Pereira, F.C.² Bialek, W.³

24
- 34047261805
- An overview of automatic speaker diarization systems
- DOI 10.1109/TASL.2006.878256
- S. Tranter, and D. Reynolds An overview of automatic speaker diarisation systems IEEE Trans. Audio Speech Language Process. 14 2006 1557 1565 (Pubitemid 46547580)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

25
- 84867206818
- Integration of TDOA features in information bottleneck framework for fast speaker diarization
- Vijayasenan, D. et al.; 2008. Integration of TDOA features in information bottleneck framework for fast speaker diarization. In: Interspeech 2008.
- (2008) Interspeech 2008
- Vijayasenan, D.¹

26
- 68649087212
- An information theoretic approach to speaker diarization of meeting data
- D. Vijayasenan An information theoretic approach to speaker diarization of meeting data IEEE Trans. Audio Speech Language Process. 17 7 2009 1382 1393
- (2009) IEEE Trans. Audio Speech Language Process. , vol.17 , Issue.7 , pp. 1382-1393
- Vijayasenan, D.¹

27
- 70450175269
- KL realignment for speaker diarization with multiple feature streams
- Vijayasenan, D.; et al.; 2009. KL realignment for speaker diarization with multiple feature streams. In 10th Ann. Conf. Internat. Speech Commun. Assoc.
- (2009) 10th Ann. Conf. Internat. Speech Commun. Assoc
- Vijayasenan, D.¹

28
- 70450205418
- Modulation spectrogram features for speaker diarization
- Vinyals, O. et al.; 2008. Modulation spectrogram features for speaker diarization. In: Proc. Interspeech, 2008.
- (2008) Proc. Interspeech, 2008
- Vinyals, O.¹

29
- 47749119617
- The ICSI RT07s speaker diarization system
- C. Wooters, and M. Huijbregts The ICSI RT07s speaker diarization system Lect. Notes Comput. Sci. 4625 2008 509 519
- (2008) Lect. Notes Comput. Sci. , vol.4625 , pp. 509-519
- Wooters, C.¹ Huijbregts, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.