메뉴 건너뛰기




Volumn 4, Issue 6, 2010, Pages 1059-1070

Diarization of telephone conversations using factor analysis

Author keywords

Channel factors; clustering; diarization; speaker factors; speaker recognition; speaker segmentation; variational Bayes

Indexed keywords

CHANNEL FACTORS; CLUSTERING; DIARIZATION; SPEAKER FACTORS; SPEAKER RECOGNITION; SPEAKER SEGMENTATIONS; VARIATIONAL BAYES;

EID: 78649270455     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2081790     Document Type: Article
Times cited : (133)

References (29)
  • 3
    • 58349102016 scopus 로고    scopus 로고
    • Analysis of feature extraction and channel compensation in GMM speaker recognition system
    • Sep.
    • L. Burget, P. Matejka, O. Glembek, P. Schwarz, and J. Cernocky, "Analysis of feature extraction and channel compensation in GMM speaker recognition system," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1979-1986, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 1979-1986
    • Burget, L.1    Matejka, P.2    Glembek, O.3    Schwarz, P.4    Cernocky, J.5
  • 4
    • 34548248573 scopus 로고    scopus 로고
    • Explicit modeling of session variability for speaker verification
    • R. J. Vogt and S. Sridharan, "Explicit modeling of session variability for speaker verification," Comput. Speech Lang., vol. 22, no. 1, pp. 17-38, 2008.
    • (2008) Comput. Speech Lang. , vol.22 , Issue.1 , pp. 17-38
    • Vogt, R.J.1    Sridharan, S.2
  • 5
    • 34047261805 scopus 로고    scopus 로고
    • An overview of automatic speaker di-arization systems
    • Sep.
    • S. Tranter and D. Reynolds, "An overview of automatic speaker di-arization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.1    Reynolds, D.2
  • 8
    • 78649285331 scopus 로고    scopus 로고
    • Comparison of a joint iterative method for multiple speaker identification with sequential blind source separation and speaker identification
    • South Africa, Jan.
    • Y. E. Kim, J. M. Walsh, and T. M. Doll, "Comparison of a joint iterative method for multiple speaker identification with sequential blind source separation and speaker identification," in Proc. IEEE Odyssey Workshop, Stellenbosch, South Africa, Jan. 2008, pp. 283-286.
    • (2008) Proc. IEEE Odyssey Workshop, Stellenbosch , pp. 283-286
    • Kim, Y.E.1    Walsh, J.M.2    Doll, T.M.3
  • 13
    • 70349218125 scopus 로고    scopus 로고
    • Variational Bayesian joint factor analysis for speaker verification
    • Taipei, Taiwan Apr.
    • X. Zhao, Y. Dong, J. Zhao, L. Lu, J. Liu, and H. Wang, "Variational Bayesian joint factor analysis for speaker verification," in Proc. ICASSP'09, Taipei, Taiwan, Apr. 2009, pp. 4049-4052.
    • (2009) Proc. ICASSP'09 , pp. 4049-4052
    • Zhao, X.1    Dong, Y.2    Zhao, J.3    Lu, L.4    Liu, J.5    Wang, H.6
  • 14
    • 33947637189 scopus 로고    scopus 로고
    • Joint factor analysis of speaker and session variability: Theory and algorithms
    • [Online]. Available
    • P. Kenny, "Joint Factor Analysis of Speaker and Session Variability: Theory and Algorithms,' Tech. Rep. CRIM-06/08-13 2005 [Online]. Available: http://www.crim.ca/perso/patrick.kenny
    • (2005) Tech. Rep. CRIM-06/08-13
    • Kenny, P.1
  • 16
    • 85032751295 scopus 로고    scopus 로고
    • The variational approximation for Bayesian inference
    • Nov.
    • D. G. Tzikas, A. C. Likas, and N. P. Galatsanos, "The variational approximation for Bayesian inference," IEEE Signal Process. Mag., vol. 25, no. 6, pp. 131-146, Nov. 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.6 , pp. 131-146
    • Tzikas, D.G.1    Likas, A.C.2    Galatsanos, N.P.3
  • 17
    • 84898688036 scopus 로고    scopus 로고
    • Speech denoising and dereverberation using probabilistic models
    • Apr.
    • H. Attias, J. C. Platt, A. Acero, and L. Deng, "Speech denoising and dereverberation using probabilistic models," Adv. Neural Inf. Process. Syst., vol. 13, pp. 758-764, Apr. 2001.
    • (2001) Adv. Neural Inf. Process. Syst. , vol.13 , pp. 758-764
    • Attias, H.1    Platt, J.C.2    Acero, A.3    Deng, L.4
  • 23
    • 84867223951 scopus 로고    scopus 로고
    • Speaker recognition in two wire test sessions
    • Australia. Sep.
    • H. Aronowitz and Y. Solewicz, "Speaker recognition in two wire test sessions," in Proc. Interspeech'08, Brisbane, Australia, Sep. 2008, pp. 865-868.
    • (2008) Proc. Interspeech'08, Brisbane , pp. 865-868
    • Aronowitz, H.1    Solewicz, Y.2
  • 25
    • 36749008026 scopus 로고    scopus 로고
    • Combining Gaussianized/non-Gaussianized features to improve speaker diarization of telephone conversations
    • Dec.
    • V. Gupta, P. Kenny, P. Ouellet, G. Boulianne, and P. Dumouchel, "Combining Gaussianized/non-Gaussianized features to improve speaker diarization of telephone conversations," IEEE Signal Process. Lett., vol. 14, no. 12, pp. 1040-1043, Dec. 2007.
    • (2007) IEEE Signal Process. Lett. , vol.14 , Issue.12 , pp. 1040-1043
    • Gupta, V.1    Kenny, P.2    Ouellet, P.3    Boulianne, G.4    Dumouchel, P.5
  • 26
    • 84867228708 scopus 로고    scopus 로고
    • Two's a crowd: Improving speaker diarization by automatically identifying and excluding overlapped speech
    • Brisbane, Australia Sep.
    • K. Boakye, O. Vinyals, and G. Friedland, "Two's a crowd: Improving speaker diarization by automatically identifying and excluding overlapped speech," in Proc. Interspeech, Brisbane, Australia, Sep. 2008, pp. 32-35.
    • (2008) Proc. Interspeech , pp. 32-35
    • Boakye, K.1    Vinyals, O.2    Friedland, G.3
  • 27
    • 84867210921 scopus 로고    scopus 로고
    • Factor analysis subspace estimation for speaker verification with short utterances
    • Brisbane, Australia Sep.
    • R. Vogt, B. Baker, and S. Sridharan, "Factor analysis subspace estimation for speaker verification with short utterances," in Proc. Inter-speech'08, Brisbane, Australia, Sep. 2008, pp. 853-856.
    • (2008) Proc. Inter-speech'08 , pp. 853-856
    • Vogt, R.1    Baker, B.2    Sridharan, S.3
  • 28
    • 64249126167 scopus 로고    scopus 로고
    • Trainable speaker diarization
    • Belgium Aug.
    • H.Aronowitz, "Trainable speaker diarization," inProc. Interspeech'07, Antwerp, Belgium, Aug. 2007, pp. 1861-1864.
    • (2007) Proc. Interspeech'07, Antwerp , pp. 1861-1864
    • Aronowitz, H.1
  • 29
    • 63749085692 scopus 로고    scopus 로고
    • Lo-quendo-Politecnico di Torino's 2006 NIST speaker recognition evaluation system
    • Belgium Aug.
    • C. Vair, D. Colibro, F. Castaldo, E. Dalmasso, and P. Laface, "Lo-quendo-Politecnico di Torino's 2006 NIST speaker recognition evaluation system," in Proc. Interspeech, Antwerp, Belgium, Aug. 2007, pp. 1238-1241.
    • (2007) Proc. Interspeech, Antwerp , pp. 1238-1241
    • Vair, C.1    Colibro, D.2    Castaldo, F.3    Dalmasso, E.4    Laface, P.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.