SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 19, Issue 2, 2011, Pages 431-438

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization

(3) Vijayasenan, Deepu a Valente, Fabio a Bourlard, Hervé a

a IDIAP RESEARCH INSTITUTE (Switzerland)

Author keywords

Feature combination; information bottleneck; meeting data; speaker diarization

Indexed keywords

EID: 85008566469 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2010.2048603 Document Type: Article

Times cited : (27)

References (23)

1
- 33646360883
- Robust audio segmentation
- Ph.D. dissertation
- J. Ajmera, “Robust audio segmentation,” Ph.D. dissertation, Ecole Polytechnique Federale de Lausanne (EPFL), Lausanne, Switzerland, 2004.
- (2004)
- Ajmera, J.¹

2
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- S. Chen and P. Gopalakrishnan, “Speaker, environment and channel change detection and clustering via the bayesian information criterion,” in Proc. DARPA Speech Recognition Workshop, 1998, pp. 127–138.
- (1998) Proc. DARPA Speech Recognition Workshop , pp. 127-138
- Chen, S.¹ Gopalakrishnan, P.²

3
- 3543144948
- Robust speaker change detection
- Nov.
- J. Ajmera, I. McCowan, and H. Bourlard “Robust speaker change detection,” IEEE Signal Process. Lett., vol. 11, no. 8, pp. 649–651, Nov. 2004.
- (2004) IEEE Signal Process. Lett. , vol.11 , Issue.8 , pp. 649-651
- Ajmera, J.¹ McCowan, I.² Bourlard, H.³

4
- 44849123928
- Robust speaker diarization for meetings
- Ph.D. dissertation, Univ. Politecnica de Catalunya, Barcelona, Spain
- X. Anguera, “Robust speaker diarization for meetings,” Ph.D. dissertation, Univ. Politecnica de Catalunya, Barcelona, Spain, 2006.
- (2006)
- Anguera, X.¹

5
- 77249176190
- The AMI speaker diarization system for NIST RT06s meeting data
- D. van Leeuwen and M. Huijbregts, “The AMI speaker diarization system for NIST RT06s meeting data,” in Lecture Notes in Computer Science, 2006, vol. 4299, p. 371.
- (2006) Lecture Notes in Computer Science , vol.4299 , pp. 371
- van Leeuwen, D.¹ Huijbregts, M.²

6
- 0141591540
- Location based speaker segmentation
- G. Lathoud and I. McCowan, “Location based speaker segmentation,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2003, pp. 621–624.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 621-624
- Lathoud, G.¹ McCowan, I.²

7
- 77950110298
- Speaker diarization for multi-microphone meetings using only between-channel differences
- J. Pardo, X. Anguera, and C. Wooters, “Speaker diarization for multi-microphone meetings using only between-channel differences,” in Proc. MLMI, 2006.
- (2006) Proc. MLMI
- Pardo, J.¹ Anguera, X.² Wooters, C.³

8
- 34548351229
- Speaker diarization for multimicrophone meetings: Mixing acoustic features and inter-channel time differences
- J. Pardo, X. Anguera, and C. Wooters, “Speaker diarization for multimicrophone meetings: Mixing acoustic features and inter-channel time differences,” in Proc. Int. Conf. Speech Lang. Process., 2006.
- (2006) Proc. Int. Conf. Speech Lang. Process.
- Pardo, J.¹ Anguera, X.² Wooters, C.³

9
- 34548310397
- Speaker diarization for multiple-distant-microphone meetings using several sources of information
- Sep.
- J. M. Pardo, X. Anguera, and C. Wooters “Speaker diarization for multiple-distant-microphone meetings using several sources of information,” IEEE Trans. Comput., vol. 56, no. 9, pp. 1189–1198, Sep. 2007.
- (2007) IEEE Trans. Comput. , vol.56 , Issue.9 , pp. 1189-1198
- Pardo, J.M.¹ Anguera, X.² Wooters, C.³

10
- 33745560829
- Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system
- X. Anguera, C. Wooters, B. Peskin, and M. Aguilo, “Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system,” in Lecture Notes in Computer Science, 2006, vol. 3869, p. 402.
- (2006) Lecture Notes in Computer Science , vol.3869 , pp. 402
- Anguera, X.¹ Wooters, C.² Peskin, B.³ Aguilo, M.⁴

11
- 47749119617
- The ICSI RT07s speaker diarization system
- C. Wooters and M. Huijbregts “The ICSI RT07s speaker diarization system,” Lecture Notes in Computer Science, vol. 4625, pp. 509–519, 2008.
- (2008) Lecture Notes in Computer Science , vol.4625 , pp. 509-519
- Wooters, C.¹ Huijbregts, M.²

12
- 47749103773
- Progress in the AMIDA speaker diarization system for meeting data
- Baltimore, MD, May 8–11, Revised Selected Papers
- D. van Leeuwen and M. Konecny, “Progress in the AMIDA speaker diarization system for meeting data,” in Proc. Multimodal Technologies for Perception of Humans: Int. Evaluation Workshops Clear 2007 and Rt 2007, Baltimore, MD, May 8–11, 2007, Revised Selected Papers, p. 475.
- (2007) Proc. Multimodal Technologies for Perception of Humans: Int. Evaluation Workshops Clear 2007 and Rt 2007 , pp. 475
- van Leeuwen, D.¹ Konecny, M.²

13
- 77249114287
- The rich transcription 2006 spring meeting recognition evaluation
- J. Fiscus, J. Ajot, M. Michel, and J. Garofolo, “The rich transcription 2006 spring meeting recognition evaluation,” in Lecture Notes in Computer Science, 2006, vol. 4299, p. 309.
- (2006) Lecture Notes in Computer Science , vol.4299 , pp. 309
- Fiscus, J.¹ Ajot, J.² Michel, M.³ Garofolo, J.⁴

14
- 47749152568
- Berlin, Germany
- J. Fiscus, J. Ajot, and J. Garofolo, “The rich transcription 2007 meeting recognition evaluation,” in Proc. Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, Berlin, Germany, 2008.
- (2008) Proc. Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science
- Fiscus, J.¹ Ajot, J.² Garofolo, J.³

15
- 68649087212
- An information theoretic approach to speaker diarization of meeting data
- Jul.
- D. Vijayasenan, F. Valente, and H. Bourlard “An information theoretic approach to speaker diarization of meeting data,” IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 7, pp. 1382–1393, Jul. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.7 , pp. 1382-1393
- Vijayasenan, D.¹ Valente, F.² Bourlard, H.³

16
- 33846242627
- Speaker diarization for multi-party meetings using acoustic fusion
- X. Anguera, C. Wooters, and J. H. Hernando, “Speaker diarization for multi-party meetings using acoustic fusion,” in Proc. Autom. Speech Recognition and Understanding, 2006, pp. 426–431.
- (2006) Proc. Autom. Speech Recognition and Understanding , pp. 426-431
- Anguera, X.¹ Wooters, C.² Hernando, J.H.³

17
- 44849143841
- Beamformit, the Fast and Robust Acoustic Beamformer
- [Online]. Available: http://www.icsi.berkeley.edu/xanguera/Beamformlt
- X. Anguera, “Beamformit, the Fast and Robust Acoustic Beamformer,” 2006 [Online]. Available: http://www.icsi.berkeley.edu/xanguera/Beamformlt
- (2006)
- Anguera, X.¹

18
- 9444222292
- The Information Bottleneck: Theory and Applications
- Ph.D. dissertation, Hebrew Univ. of Jerusalem, Jerusalem, Israel
- N. Slonim, “The Information Bottleneck: Theory and Applications,” Ph.D. dissertation, Hebrew Univ. of Jerusalem, Jerusalem, Israel, 2002.
- (2002)
- Slonim, N.¹

19
- 0001808038
- Princeton, NJ: NEC Research Institute TR
- N. Tishby, F. Pereira, and W. Bialek, The Information Bottleneck Method. Princeton, NJ: NEC Research Institute TR, 1998.
- (1998) The Information Bottleneck Method
- Tishby, N.¹ Pereira, F.² Bialek, W.³

20
- 84899004693
- Agglomerative information bottleneck
- Cambridge, MA: MIT Press
- N. Slonim, N. Friedman, and N. Tishby, “Agglomerative information bottleneck,” in Proc. Adv. in Neural Information Process. Syst. Cambridge, MA: MIT Press, 1999, pp. 617–623.
- (1999) Proc. Adv. in Neural Information Process. Syst. , pp. 617-623
- Slonim, N.¹ Friedman, N.² Tishby, N.³

21
- 85008551444
- New entropy based combination rules in HMM/ANN multistream ASR
- H. Misra, H. Bourlard, and V. Tyagi, “New entropy based combination rules in HMM/ANN multistream ASR,” in Proc. ICASSP, 2003, vol. 3, pp. 1–5.
- (2003) Proc. ICASSP , vol.3 , pp. 1-5
- Misra, H.¹ Bourlard, H.² Tyagi, V.³

22
- 51649124977
- The information bottleneck revisited or how to choose a good distortion measure
- P. Harremoes and N. Tishby, “The information bottleneck revisited or how to choose a good distortion measure,” in Proc. IEEE Int. Symp. Inf. Theory, ISIT 2007, 2007, pp. 566–570.
- (2007) Proc. IEEE Int. Symp. Inf. Theory, ISIT 2007 , pp. 566-570
- Harremoes, P.¹ Tishby, N.²

23
- 70450151824
- Acoustic models for posterior features in speech recognition
- Ph.D. dissertation, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland
- G. Aradilla, “Acoustic models for posterior features in speech recognition,” Ph.D. dissertation, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland, 2008.
- (2008)
- Aradilla, G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.