메뉴 건너뛰기




Volumn 19, Issue 2, 2011, Pages 431-438

An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization

Author keywords

Feature combination; information bottleneck; meeting data; speaker diarization

Indexed keywords


EID: 85008566469     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2010.2048603     Document Type: Article
Times cited : (27)

References (23)
  • 1
    • 33646360883 scopus 로고    scopus 로고
    • Robust audio segmentation
    • Ph.D. dissertation
    • J. Ajmera, “Robust audio segmentation,” Ph.D. dissertation, Ecole Polytechnique Federale de Lausanne (EPFL), Lausanne, Switzerland, 2004.
    • (2004)
    • Ajmera, J.1
  • 2
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the bayesian information criterion
    • S. Chen and P. Gopalakrishnan, “Speaker, environment and channel change detection and clustering via the bayesian information criterion,” in Proc. DARPA Speech Recognition Workshop, 1998, pp. 127–138.
    • (1998) Proc. DARPA Speech Recognition Workshop , pp. 127-138
    • Chen, S.1    Gopalakrishnan, P.2
  • 3
  • 4
    • 44849123928 scopus 로고    scopus 로고
    • Robust speaker diarization for meetings
    • Ph.D. dissertation, Univ. Politecnica de Catalunya, Barcelona, Spain
    • X. Anguera, “Robust speaker diarization for meetings,” Ph.D. dissertation, Univ. Politecnica de Catalunya, Barcelona, Spain, 2006.
    • (2006)
    • Anguera, X.1
  • 5
    • 77249176190 scopus 로고    scopus 로고
    • The AMI speaker diarization system for NIST RT06s meeting data
    • D. van Leeuwen and M. Huijbregts, “The AMI speaker diarization system for NIST RT06s meeting data,” in Lecture Notes in Computer Science, 2006, vol. 4299, p. 371.
    • (2006) Lecture Notes in Computer Science , vol.4299 , pp. 371
    • van Leeuwen, D.1    Huijbregts, M.2
  • 7
    • 77950110298 scopus 로고    scopus 로고
    • Speaker diarization for multi-microphone meetings using only between-channel differences
    • J. Pardo, X. Anguera, and C. Wooters, “Speaker diarization for multi-microphone meetings using only between-channel differences,” in Proc. MLMI, 2006.
    • (2006) Proc. MLMI
    • Pardo, J.1    Anguera, X.2    Wooters, C.3
  • 8
    • 34548351229 scopus 로고    scopus 로고
    • Speaker diarization for multimicrophone meetings: Mixing acoustic features and inter-channel time differences
    • J. Pardo, X. Anguera, and C. Wooters, “Speaker diarization for multimicrophone meetings: Mixing acoustic features and inter-channel time differences,” in Proc. Int. Conf. Speech Lang. Process., 2006.
    • (2006) Proc. Int. Conf. Speech Lang. Process.
    • Pardo, J.1    Anguera, X.2    Wooters, C.3
  • 9
    • 34548310397 scopus 로고    scopus 로고
    • Speaker diarization for multiple-distant-microphone meetings using several sources of information
    • Sep.
    • J. M. Pardo, X. Anguera, and C. Wooters “Speaker diarization for multiple-distant-microphone meetings using several sources of information,” IEEE Trans. Comput., vol. 56, no. 9, pp. 1189–1198, Sep. 2007.
    • (2007) IEEE Trans. Comput. , vol.56 , Issue.9 , pp. 1189-1198
    • Pardo, J.M.1    Anguera, X.2    Wooters, C.3
  • 10
    • 33745560829 scopus 로고    scopus 로고
    • Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system
    • X. Anguera, C. Wooters, B. Peskin, and M. Aguilo, “Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system,” in Lecture Notes in Computer Science, 2006, vol. 3869, p. 402.
    • (2006) Lecture Notes in Computer Science , vol.3869 , pp. 402
    • Anguera, X.1    Wooters, C.2    Peskin, B.3    Aguilo, M.4
  • 13
  • 15
    • 68649087212 scopus 로고    scopus 로고
    • An information theoretic approach to speaker diarization of meeting data
    • Jul.
    • D. Vijayasenan, F. Valente, and H. Bourlard “An information theoretic approach to speaker diarization of meeting data,” IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 7, pp. 1382–1393, Jul. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.7 , pp. 1382-1393
    • Vijayasenan, D.1    Valente, F.2    Bourlard, H.3
  • 17
    • 44849143841 scopus 로고    scopus 로고
    • Beamformit, the Fast and Robust Acoustic Beamformer
    • [Online]. Available: http://www.icsi.berkeley.edu/xanguera/Beamformlt
    • X. Anguera, “Beamformit, the Fast and Robust Acoustic Beamformer,” 2006 [Online]. Available: http://www.icsi.berkeley.edu/xanguera/Beamformlt
    • (2006)
    • Anguera, X.1
  • 18
    • 9444222292 scopus 로고    scopus 로고
    • The Information Bottleneck: Theory and Applications
    • Ph.D. dissertation, Hebrew Univ. of Jerusalem, Jerusalem, Israel
    • N. Slonim, “The Information Bottleneck: Theory and Applications,” Ph.D. dissertation, Hebrew Univ. of Jerusalem, Jerusalem, Israel, 2002.
    • (2002)
    • Slonim, N.1
  • 21
    • 85008551444 scopus 로고    scopus 로고
    • New entropy based combination rules in HMM/ANN multistream ASR
    • H. Misra, H. Bourlard, and V. Tyagi, “New entropy based combination rules in HMM/ANN multistream ASR,” in Proc. ICASSP, 2003, vol. 3, pp. 1–5.
    • (2003) Proc. ICASSP , vol.3 , pp. 1-5
    • Misra, H.1    Bourlard, H.2    Tyagi, V.3
  • 22
    • 51649124977 scopus 로고    scopus 로고
    • The information bottleneck revisited or how to choose a good distortion measure
    • P. Harremoes and N. Tishby, “The information bottleneck revisited or how to choose a good distortion measure,” in Proc. IEEE Int. Symp. Inf. Theory, ISIT 2007, 2007, pp. 566–570.
    • (2007) Proc. IEEE Int. Symp. Inf. Theory, ISIT 2007 , pp. 566-570
    • Harremoes, P.1    Tishby, N.2
  • 23
    • 70450151824 scopus 로고    scopus 로고
    • Acoustic models for posterior features in speech recognition
    • Ph.D. dissertation, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland
    • G. Aradilla, “Acoustic models for posterior features in speech recognition,” Ph.D. dissertation, Ecole Polytechnique Federale de Lausanne, Lausanne, Switzerland, 2008.
    • (2008)
    • Aradilla, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.