SCOPUS 정보 검색 플랫폼

IEEE Journal on Selected Topics in Signal Processing

Volumn 4, Issue 6, 2010, Pages 1059-1070

Diarization of telephone conversations using factor analysis

(3) Kenny, Patrick a Reynolds, Douglas b Castaldo, Fabio c

a Cent de Recherche Informatique de Montreal (Canada)

b MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

c Loquendo (Italy)

Author keywords

Channel factors; clustering; diarization; speaker factors; speaker recognition; speaker segmentation; variational Bayes

Indexed keywords

CHANNEL FACTORS; CLUSTERING; DIARIZATION; SPEAKER FACTORS; SPEAKER RECOGNITION; SPEAKER SEGMENTATIONS; VARIATIONAL BAYES;

CLUSTER ANALYSIS; ERRORS; TELEPHONE; TELEPHONE SETS; TELEPHONE SYSTEMS;

SPEECH RECOGNITION;

EID: 78649270455 PISSN: 19324553 EISSN: None Source Type: Journal
DOI: 10.1109/JSTSP.2010.2081790 Document Type: Article

Times cited : (133)

References (29)

1
- 58349106697
- A study of inter-speaker variability in speaker verification
- Jul.
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 5, pp. 980-988, Jul. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

2
- 43249126081
- Compensation of nuisance factors for speaker and language recognition
- F. Castaldo, D. Colibro, E.Dalmasso, P. Laface, and C. Vair, "Compensation of nuisance factors for speaker and language recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1969-1978, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 1969-1978
- Castaldo, F.¹ Colibro, D.² Dalmasso, E.³ Laface, P.⁴ Vair, C.⁵

3
- 58349102016
- Analysis of feature extraction and channel compensation in GMM speaker recognition system
- Sep.
- L. Burget, P. Matejka, O. Glembek, P. Schwarz, and J. Cernocky, "Analysis of feature extraction and channel compensation in GMM speaker recognition system," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1979-1986, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 1979-1986
- Burget, L.¹ Matejka, P.² Glembek, O.³ Schwarz, P.⁴ Cernocky, J.⁵

4
- 34548248573
- Explicit modeling of session variability for speaker verification
- R. J. Vogt and S. Sridharan, "Explicit modeling of session variability for speaker verification," Comput. Speech Lang., vol. 22, no. 1, pp. 17-38, 2008.
- (2008) Comput. Speech Lang. , vol.22 , Issue.1 , pp. 17-38
- Vogt, R.J.¹ Sridharan, S.²

5
- 34047261805
- An overview of automatic speaker di-arization systems
- Sep.
- S. Tranter and D. Reynolds, "An overview of automatic speaker di-arization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.¹ Reynolds, D.²

6
- 51449110881
- Stream-based speaker segmentation using speaker factors and eigenvoices
- Mar.
- F. Castaldo, D. Colibro, E. Dalmasso, P. Laface, and C. Vair, "Stream-based speaker segmentation using speaker factors and eigenvoices," in Proc. ICASSP, Las Vegas, NV, Mar. 2008, pp. 4133-4136.
- (2008) Proc. ICASSP, Las Vegas, NV , pp. 4133-4136
- Castaldo, F.¹ Colibro, D.² Dalmasso, E.³ Laface, P.⁴ Vair, C.⁵

7
- 70450171620
- Ph.D. dissertation Eurecom, Sophia-Antipolis, France
- F. Valente, "Variational Bayesian methods for audio indexing," Ph.D. dissertation, Eurecom, Sophia-Antipolis, France, 2005.
- (2005) Variational Bayesian Methods for Audio Indexing
- Valente, F.¹

8
- 78649285331
- Comparison of a joint iterative method for multiple speaker identification with sequential blind source separation and speaker identification
- South Africa, Jan.
- Y. E. Kim, J. M. Walsh, and T. M. Doll, "Comparison of a joint iterative method for multiple speaker identification with sequential blind source separation and speaker identification," in Proc. IEEE Odyssey Workshop, Stellenbosch, South Africa, Jan. 2008, pp. 283-286.
- (2008) Proc. IEEE Odyssey Workshop, Stellenbosch , pp. 283-286
- Kim, Y.E.¹ Walsh, J.M.² Doll, T.M.³

9
- 33846516584
- New York: Springer Science+Business Media, LLC
- C. Bishop, Pattern Recognition and Machine Learning. New York: Springer Science+Business Media, LLC, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.¹

10
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- DOI 10.1109/89.876308
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 695-707, Nov. 2000. (Pubitemid 32025317)
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

11
- 70450151829
- [Online] Available
- P. Kenny, "Bayesian Analysis of Speaker Diarization with Eigenvoice Priors," 2008 [Online]. Available: http://www.crim.ca/perso/patrick. kenny
- (2008) Bayesian Analysis of Speaker Diarization with Eigenvoice Priors
- Kenny, P.¹

12
- 33947612420
- Berlin, Germany: Springer-Verlag
- V. Smidl and A. Quinn, The Variational Bayes Method in Signal Processing. Berlin, Germany: Springer-Verlag, 2006.
- (2006) The Variational Bayes Method in Signal Processing
- Smidl, V.¹ Quinn, A.²

13
- 70349218125
- Variational Bayesian joint factor analysis for speaker verification
- Taipei, Taiwan Apr.
- X. Zhao, Y. Dong, J. Zhao, L. Lu, J. Liu, and H. Wang, "Variational Bayesian joint factor analysis for speaker verification," in Proc. ICASSP'09, Taipei, Taiwan, Apr. 2009, pp. 4049-4052.
- (2009) Proc. ICASSP'09 , pp. 4049-4052
- Zhao, X.¹ Dong, Y.² Zhao, J.³ Lu, L.⁴ Liu, J.⁵ Wang, H.⁶

14
- 33947637189
- Joint factor analysis of speaker and session variability: Theory and algorithms
- [Online]. Available
- P. Kenny, "Joint Factor Analysis of Speaker and Session Variability: Theory and Algorithms,' Tech. Rep. CRIM-06/08-13 2005 [Online]. Available: http://www.crim.ca/perso/patrick.kenny
- (2005) Tech. Rep. CRIM-06/08-13
- Kenny, P.¹

15
- 4544238074
- [Online]. Available
- The NIST Year 2001 Speaker Recognition Evaluation Plan, 2001 [Online]. Available: http://www.nist.gov/speech/tests/spk/2001/doc/2001-spkrec-evalplan- v05.9.pdf
- (2001) The NIST Year 2001 Speaker Recognition Evaluation Plan

16
- 85032751295
- The variational approximation for Bayesian inference
- Nov.
- D. G. Tzikas, A. C. Likas, and N. P. Galatsanos, "The variational approximation for Bayesian inference," IEEE Signal Process. Mag., vol. 25, no. 6, pp. 131-146, Nov. 2008.
- (2008) IEEE Signal Process. Mag. , vol.25 , Issue.6 , pp. 131-146
- Tzikas, D.G.¹ Likas, A.C.² Galatsanos, N.P.³

17
- 84898688036
- Speech denoising and dereverberation using probabilistic models
- Apr.
- H. Attias, J. C. Platt, A. Acero, and L. Deng, "Speech denoising and dereverberation using probabilistic models," Adv. Neural Inf. Process. Syst., vol. 13, pp. 758-764, Apr. 2001.
- (2001) Adv. Neural Inf. Process. Syst. , vol.13 , pp. 758-764
- Attias, H.¹ Platt, J.C.² Acero, A.³ Deng, L.⁴

18
- 78649267471
- An HDP-HMM for systems with state persistence
- Cambridge, MA: MIT Press
- E. B. Fox, E. B. Sudderth, M. I. Jordan, and A. S. Willsky, "An HDP-HMM for systems with state persistence," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2009, vol. 21.
- (2009) Advances in Neural Information Processing Systems , vol.21
- Fox, E.B.¹ Sudderth, E.B.² Jordan, M.I.³ Willsky, A.S.⁴

19
- 18744386134
- Eigenvoice modeling with sparse training data
- May
- P. Kenny, G. Boulianne, and P. Dumouchel, "Eigenvoice modeling with sparse training data," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 345-359, May 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.3 , pp. 345-359
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

20
- 0004272772
- New York: Cambridge Univ. Press
- D. MacKay, Information Theory, Inference and Learning Algorithms. New York: Cambridge Univ. Press, 2003.
- (2003) Information Theory Inference and Learning Algorithms
- MacKay, D.¹

21
- 85073258179
- Feature warping for robust speaker verification
- Jun.
- J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in Proc. Speaker Odyssey, Crete, Greece, Jun. 2001, pp. 213-218.
- (2001) Proc. Speaker Odyssey, Crete, Greece , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

22
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Speaker and session variability in GMM-based speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1448-1460, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

23
- 84867223951
- Speaker recognition in two wire test sessions
- Australia. Sep.
- H. Aronowitz and Y. Solewicz, "Speaker recognition in two wire test sessions," in Proc. Interspeech'08, Brisbane, Australia, Sep. 2008, pp. 865-868.
- (2008) Proc. Interspeech'08, Brisbane , pp. 865-868
- Aronowitz, H.¹ Solewicz, Y.²

24
- 34047266609
- Multistage speaker diarization of broadcast news
- Sep.
- C. Barras, X. Zhu, S. Meignier, and J.-L. Gauvain, "Multistage speaker diarization of broadcast news," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1505-1512, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1505-1512
- Barras, C.¹ Zhu, X.² Meignier, S.³ Gauvain, J.-L.⁴

25
- 36749008026
- Combining Gaussianized/non-Gaussianized features to improve speaker diarization of telephone conversations
- Dec.
- V. Gupta, P. Kenny, P. Ouellet, G. Boulianne, and P. Dumouchel, "Combining Gaussianized/non-Gaussianized features to improve speaker diarization of telephone conversations," IEEE Signal Process. Lett., vol. 14, no. 12, pp. 1040-1043, Dec. 2007.
- (2007) IEEE Signal Process. Lett. , vol.14 , Issue.12 , pp. 1040-1043
- Gupta, V.¹ Kenny, P.² Ouellet, P.³ Boulianne, G.⁴ Dumouchel, P.⁵

26
- 84867228708
- Two's a crowd: Improving speaker diarization by automatically identifying and excluding overlapped speech
- Brisbane, Australia Sep.
- K. Boakye, O. Vinyals, and G. Friedland, "Two's a crowd: Improving speaker diarization by automatically identifying and excluding overlapped speech," in Proc. Interspeech, Brisbane, Australia, Sep. 2008, pp. 32-35.
- (2008) Proc. Interspeech , pp. 32-35
- Boakye, K.¹ Vinyals, O.² Friedland, G.³

27
- 84867210921
- Factor analysis subspace estimation for speaker verification with short utterances
- Brisbane, Australia Sep.
- R. Vogt, B. Baker, and S. Sridharan, "Factor analysis subspace estimation for speaker verification with short utterances," in Proc. Inter-speech'08, Brisbane, Australia, Sep. 2008, pp. 853-856.
- (2008) Proc. Inter-speech'08 , pp. 853-856
- Vogt, R.¹ Baker, B.² Sridharan, S.³

28
- 64249126167
- Trainable speaker diarization
- Belgium Aug.
- H.Aronowitz, "Trainable speaker diarization," inProc. Interspeech'07, Antwerp, Belgium, Aug. 2007, pp. 1861-1864.
- (2007) Proc. Interspeech'07, Antwerp , pp. 1861-1864
- Aronowitz, H.¹

29
- 63749085692
- Lo-quendo-Politecnico di Torino's 2006 NIST speaker recognition evaluation system
- Belgium Aug.
- C. Vair, D. Colibro, F. Castaldo, E. Dalmasso, and P. Laface, "Lo-quendo-Politecnico di Torino's 2006 NIST speaker recognition evaluation system," in Proc. Interspeech, Antwerp, Belgium, Aug. 2007, pp. 1238-1241.
- (2007) Proc. Interspeech, Antwerp , pp. 1238-1241
- Vair, C.¹ Colibro, D.² Castaldo, F.³ Dalmasso, E.⁴ Laface, P.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.