메뉴 건너뛰기




Volumn 21, Issue 10, 2013, Pages 2015-2028

Unsupervised methods for speaker diarization: An integrated and iterative approach

Author keywords

Bayesian nonparametric inference; factor analysis; HDP HMM; i vectors; principal component analysis; speaker clustering; speaker diarization; spectral clustering; variational Bayes

Indexed keywords

HDP-HMM; I-VECTORS; NONPARAMETRIC INFERENCE; SPEAKER CLUSTERING; SPEAKER DIARIZATION; SPECTRAL CLUSTERING; VARIATIONAL BAYES;

EID: 84881068970     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2264673     Document Type: Article
Times cited : (210)

References (42)
  • 2
    • 84864282162 scopus 로고    scopus 로고
    • A review on speaker diarization systems and approaches
    • M. H. Moattar and M. M. Homayounpour, "A review on speaker diarization systems and approaches," Speech Commun., vol. 54, no. 10, pp. 1065-1103, 2012.
    • (2012) Speech Commun. , vol.54 , Issue.10 , pp. 1065-1103
    • Moattar, M.H.1    Homayounpour, M.M.2
  • 4
    • 34047264090 scopus 로고    scopus 로고
    • The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast audio and telephone conversations
    • D. Reynolds and P. Torres-Carrasquillo, "The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast audio and telephone conversations," in Proc. NIST Rich Transcript. Workshop, 2004.
    • (2004) Proc. NIST Rich Transcript. Workshop
    • Reynolds, D.1    Torres-Carrasquillo, P.2
  • 5
    • 70450171620 scopus 로고    scopus 로고
    • Ph.D. dissertation Univ. De Nice-Sophia Antipolis- UFR Sciences, Nice, France Sep.
    • F. Valente, "Variational Bayesian methods for audio indexing," Ph.D. dissertation, Univ. De Nice-Sophia Antipolis - UFR Sciences, Nice, France, Sep. 2005.
    • (2005) Variational Bayesian Methods for Audio Indexing
    • Valente, F.1
  • 6
    • 44949197897 scopus 로고    scopus 로고
    • Robust speaker diarization for meetings: ICSI RT06's evaluation system
    • X. Anguera, C. Wooters, and J. M. Pardo, "Robust speaker diarization for meetings: ICSI RT06's evaluation system," in Proc. ICSLP, 2006.
    • (2006) Proc. ICSLP
    • Anguera, X.1    Wooters, C.2    Pardo, J.M.3
  • 8
    • 29044442235 scopus 로고    scopus 로고
    • Step-by-step and integrated approaches in broadcast news speaker diarization
    • DOI 10.1016/j.csl.2005.08.002, PII S0885230805000471, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • S. Meignier, D. Moraru, C. Fredouille, J.-F. Bonastre, and L. Besacier, "Step-by-step and integrated approaches in broadcast news speaker diarization," Comput. Speech Lang., vol. 20, no. 2, pp. 303-330, Jul. 2006. (Pubitemid 41787540)
    • (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISS. , pp. 303-330
    • Meignier, S.1    Moraru, D.2    Fredouille, C.3    Bonastre, J.-F.4    Besacier, L.5
  • 9
    • 78049378635 scopus 로고    scopus 로고
    • The LIA-EURECOM RT'09 speaker diarization system: Enhancements in speaker modeling and cluster purification
    • S. Bozonnet,N.W.D. Evans, and C. Fredouille, "The LIA-EURECOM RT'09 speaker diarization system: Enhancements in speaker modeling and cluster purification," in Proc. ICASSP, 2010, pp. 4958-4961.
    • (2010) Proc. ICASSP , pp. 4958-4961
    • Bozonnet, S.1    Evans, N.W.D.2    Fredouille, C.3
  • 10
    • 79952660404 scopus 로고    scopus 로고
    • A sticky HDP-HMM with application to speaker diarization
    • Jun
    • E. B. Fox, E. B. Sudderth, M. I. Jordan, and A. S. Willsky, "A sticky HDP-HMM with application to speaker diarization," Ann. Appl. Statist., vol. 5, no. 2A, pp. 1020-1056, Jun. 2011.
    • (2011) Ann. Appl. Statist. , vol.5 , Issue.2 A , pp. 1020-1056
    • Fox, E.B.1    Sudderth, E.B.2    Jordan, M.I.3    Willsky, A.S.4
  • 12
    • 84867186048 scopus 로고    scopus 로고
    • Variational inference for dirichlet processmixtures
    • D. Blei andM. Jordan, "Variational inference for dirichlet processmixtures," Bayesian Anal., vol. 1, no. 1, pp. 121-144, 2006.
    • (2006) Bayesian Anal. , vol.1 , Issue.1 , pp. 121-144
    • Blei, D.1    Jordan, M.2
  • 13
    • 78649270455 scopus 로고    scopus 로고
    • Diarization of telephone conversations using factor analysis
    • Dec
    • P. Kenny, D. Reynolds, and F. Castaldo, "Diarization of telephone conversations using factor analysis," IEEE J. Sel. Topics Signal Process., vol. 4, no. 6, pp. 1059-1070, Dec. 2010.
    • (2010) IEEE J. Sel. Topics Signal Process. , vol.4 , Issue.6 , pp. 1059-1070
    • Kenny, P.1    Reynolds, D.2    Castaldo, F.3
  • 14
    • 51449110881 scopus 로고    scopus 로고
    • Streambased speaker segmentation using speaker factors and eigenvoices
    • F. Castaldo, D. Colibro, E. Dalmasso, P. Laface, and C. Vair, "Streambased speaker segmentation using speaker factors and eigenvoices," in Proc. ICASSP, 2008, pp. 4133-4136.
    • (2008) Proc. ICASSP , pp. 4133-4136
    • Castaldo, F.1    Colibro, D.2    Dalmasso, E.3    Laface, P.4    Vair, C.5
  • 17
    • 80052190450 scopus 로고    scopus 로고
    • Unsupervised speaker adaptation based on the cosine similarity for text-independent speaker verification
    • S. Shum, N. Dehak, R. Dehak, and J. Glass, "Unsupervised speaker adaptation based on the cosine similarity for text-independent speaker verification," in Proc. IEEE Odyssey, 2010.
    • (2010) Proc. IEEE Odyssey
    • Shum, S.1    Dehak, N.2    Dehak, R.3    Glass, J.4
  • 18
    • 84878381961 scopus 로고    scopus 로고
    • On the use of spectral and iterative methods for speaker diarization
    • S. Shum, N. Dehak, and J. Glass, "On the use of spectral and iterative methods for speaker diarization," in Proc. Interspeech, 2012.
    • (2012) Proc. Interspeech
    • Shum, S.1    Dehak, N.2    Glass, J.3
  • 19
    • 44949264065 scopus 로고    scopus 로고
    • A spectral clustering approach to speaker diarization
    • H. Ning, M. Liu, H. Tang, and T. Huang, "A spectral clustering approach to speaker diarization," in Proc. ICSLP, 2006.
    • (2006) Proc. ICSLP
    • Ning, H.1    Liu, M.2    Tang, H.3    Huang, T.4
  • 21
    • 80051631569 scopus 로고    scopus 로고
    • Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation
    • C. Vaquero, A. Ortega, and E. Lleida, "Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation," in Proc. ICASSP, 2011, pp. 4532-4535.
    • (2011) Proc. ICASSP , pp. 4532-4535
    • Vaquero, C.1    Ortega, A.2    Lleida, E.3
  • 22
    • 84865711272 scopus 로고    scopus 로고
    • Cross likelihood ratio based speaker clustering using eigenvoice models
    • D. Wang, R. Vogt, S. Sridharan, and D. Dean, "Cross likelihood ratio based speaker clustering using eigenvoice models," in Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Wang, D.1    Vogt, R.2    Sridharan, S.3    Dean, D.4
  • 23
    • 82955196715 scopus 로고    scopus 로고
    • Speaker diarization using PLDA-based speaker clustering
    • J. Prazak and J. Silovsky, "Speaker diarization using PLDA-based speaker clustering," in Proc. IDAACS, 2011.
    • (2011) Proc. IDAACS
    • Prazak, J.1    Silovsky, J.2
  • 24
    • 84905255747 scopus 로고    scopus 로고
    • A global optimization framework for speaker diarization
    • M. Rouvier and S. Meignier, "A global optimization framework for speaker diarization," in Proc. IEEE Odyssey, 2012.
    • Proc. IEEE Odyssey , vol.2012
    • Rouvier, M.1    Meignier, S.2
  • 25
    • 84881083009 scopus 로고    scopus 로고
    • Mean shift algorithm for exponential families with applications to speaker clustering
    • T. Stafylakis, V. Katsouros, P. Kenny, and P. Dumouchel, "Mean shift algorithm for exponential families with applications to speaker clustering," in Proc. IEEE Odyssey, 2012.
    • Proc. IEEE Odyssey , vol.2012
    • Stafylakis, T.1    Katsouros, V.2    Kenny, P.3    Dumouchel, P.4
  • 33
    • 0041875229 scopus 로고    scopus 로고
    • On spectral clustering: Analysis and an algorithm
    • A. Ng, M. Jordan, and Y.Weiss, "On spectral clustering: Analysis and an algorithm," in Proc. NIPS, 2001.
    • (2001) Proc. NIPS
    • Ng, A.1    Jordan, M.2    Weiss, Y.3
  • 34
    • 84946742526 scopus 로고    scopus 로고
    • A robust speaker clustering algorithm
    • J. Ajmera and C. Wooters, "A robust speaker clustering algorithm," in Proc. ASRU, 2003.
    • (2003) Proc. ASRU
    • Ajmera, J.1    Wooters, C.2
  • 35
    • 85009110150 scopus 로고    scopus 로고
    • Speaker recognition in a multi-speaker environment
    • A. Martin and M. Przybocki, "Speaker recognition in a multi-speaker environment," in Proc. Eurospeech, 2001.
    • (2001) Proc. Eurospeech
    • Martin, A.1    Przybocki, M.2
  • 38
    • 85162044398 scopus 로고    scopus 로고
    • Construction of dependent Dirichlet processes based on Poisson processes
    • D. Lin, E. Grimson, and J. Fisher, "Construction of dependent Dirichlet processes based on Poisson processes," in Proc. NIPS, 2010.
    • (2010) Proc. NIPS
    • Lin, D.1    Grimson, E.2    Fisher, J.3
  • 39
    • 0001654702 scopus 로고
    • Extensions of Lipschitz mappings into a Hilbert space
    • W. Johnson and J. Lindenstrauss, "Extensions of Lipschitz mappings into a Hilbert space," Contemp. Math., vol. 26, pp. 189-206, 1984.
    • (1984) Contemp. Math. , vol.26 , pp. 189-206
    • Johnson, W.1    Lindenstrauss, J.2
  • 40
    • 84860606154 scopus 로고    scopus 로고
    • Principal component analysis with contaminated data: The high dimensional case
    • H. Xu, C. Caramanis, and S. Mannor, "Principal component analysis with contaminated data: The high dimensional case," in Proc. COLT, 2010.
    • (2010) Proc. COLT
    • Xu, H.1    Caramanis, C.2    Mannor, S.3
  • 41
    • 57249084011 scopus 로고    scopus 로고
    • Visualizing data using t-sne
    • Nov.
    • L. van der Maaten and G. Hinton, "Visualizing data using t-sne," J. Mach. Learn. Res., vol. 9, pp. 2579-2605, Nov. 2008.
    • (2008) J. Mach. Learn. Res , vol.9 , pp. 2579-2605
    • Van Der Maaten, L.1    Hinton, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.