SCOPUS Information Search Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2013, Pages 1986-1990

I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification

(34) Saeidi, R. (a); Lee, K. A. (b); Kinnunen, T. (c); Hasan, T. (d); Fauve, B. (e); Bousquet, P. M. (f); Khoury, E. (g); Sordo Martinez, P. L. (h); Kua, J. M. K. (i); You, C. H. (b); Sun, H. (b); Larcher, A. (b); Rajan, P. (c); Hautamaki, V. (c); Hanilci, C. (c); Braithwaite, B. (c); Gonzales Hautamaki, R. (c); Sadjadi, S. O. (d); Liu, G. (d); Boril, H. (d); more...

a RADBOUD UNIVERSITY NIJMEGEN (Netherlands)

b INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

c UNIVERSITY OF EASTERN FINLAND (Finland)

d UNIVERSITY OF TEXAS AT DALLAS (United States)

e VALIDSOFT LTD (United Kingdom)

f UNIVERSITY OF AVIGNON (France)

g IDIAP RESEARCH INSTITUTE (Switzerland)

h SWANSEA UNIVERSITY (United Kingdom)

i UNIVERSITY OF NEW SOUTH WALES (Australia)

Author keywords

I4U; Ivector; NIST SRE 2012; Speaker verification

Indexed keywords

COMPUTER APPLICATIONS; COMPUTER SIMULATION; I VECTORS; I4U; NIST SRE 2012; ONLINE DISCUSSIONS; RESEARCH INSTITUTES; SINGAPORE; SPEAKER VERIFICATION; SPEECH RECOGNITION

EID: 84898068800
PISSN: 2308-457X
EISSN: 1990-9772
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (74)

References (35)

1
- 84906274779
- NIST speaker recognition evaluation 2012. http://www.nist.gov/itl/iad/mig/sre12.cfm.
- NIST Speaker Recognition Evaluation 2012

2
- 44949251671
- Data-driven design of front-end filter bank for Lombard speech recognition
- H. Boril, P. Fousek, and P. Pollak. Data-driven design of front-end filter bank for Lombard speech recognition. In Proc. Interspeech 2006 (ICSLP), pages 381-384, 2006.
- (2006) Proc. Interspeech 2006 (ICSLP) , pp. 381-384
- Boril, H.¹ Fousek, P.² Pollak, P.³

3
- 80051641505
- Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
- S. O. Sadjadi and J. H. L. Hansen. Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2011), pages 5448-5451, 2011.
- (2011) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2011) , pp. 5448-5451
- Sadjadi, S.O.¹ Hansen, J.H.L.²

4
- 84886695822
- A simple and effective speech activity detection algorithm for telephone and microphone speech
- M. McLaren and D. A. van Leeuwen. A simple and effective speech activity detection algorithm for telephone and microphone speech. In Proc. NIST SRE 2011 workshop, Atlanta, US, 2011.
- (2011) Proc. NIST SRE 2011 Workshop
- McLaren, M.¹ Van Leeuwen, D.A.²

5
- 0041360463
- Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
- I. Cohen. Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. on Speech and Audio Processing, 11(5):466-475, 2003.
- (2003) IEEE Trans. on Speech and Audio Processing , vol.11 , Issue.5 , pp. 466-475
- Cohen, I.¹

6
- 85073258179
- Feature warping for robust speaker verification
- J. Pelecanos and S. Sridharan. Feature warping for robust speaker verification. In Odyssey, 2001.
- (2001) Odyssey
- Pelecanos, J.¹ Sridharan, S.²

7
- 84873315510
- Unsupervised speech activity detection using voicing measures and perceptual spectral flux
- S. O. Sadjadi and J. H. L. Hansen. Unsupervised speech activity detection using voicing measures and perceptual spectral flux. IEEE Signal Processing Letters, pages 197-200, Mar. 2013.
- (2013) IEEE Signal Processing Letters , pp. 197-200
- Sadjadi, S.O.¹ Hansen, J.H.L.²

8
- 84865772156
- Front-end compensation methods for LVCSR under Lombard effect
- H. Bořil, F. Grézl, and J. H. L. Hansen. Front-end compensation methods for LVCSR under Lombard effect. In INTERSPEECH 2011, pages 1257-1260, Florence, Italy, 2011.
- (2011) Interspeech 2011 , pp. 1257-1260
- Bořil, H.¹ Grézl, F.² Hansen, J.H.L.³

9
- 0031636164
- A voice activity detector employing soft decision based noise spectrum adaptation
- J. Sohn and W. Sung. A voice activity detector employing soft decision based noise spectrum adaptation. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1998), volume 1, pages 365-368, 1998.
- (1998) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1998) , vol.1 , pp. 365-368
- Sohn, J.¹ Sung, W.²

10
- 84890449972
- A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data
- T. Kinnunen and P. Rajan. A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013), 2013.
- (2013) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013)
- Kinnunen, T.¹ Rajan, P.²

11
- 79951609039
- Front-end factor analysis for speaker verification
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet. Front-end factor analysis for speaker verification. IEEE Trans. Audio, Speech and Language Processing, 19(4):788-798, 2011.
- (2011) IEEE Trans. Audio, Speech and Language Processing , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

12
- 50649094277
- Probabilistic linear discriminant analysis for inferences about identity
- S. J. D. Prince and J. H. Elder. Probabilistic linear discriminant analysis for inferences about identity. In 11th International Conference on Computer Vision, pages 1-8, 2007.
- (2007) 11th International Conference on Computer Vision , pp. 1-8
- Prince, S.J.D.¹ Elder, J.H.²

13
- 58349106697
- A study of inter-speaker variability in speaker verification
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel. A study of inter-speaker variability in speaker verification. IEEE Trans. Audio, Speech and Language Processing, 16(5):980-988, July 2008.
- (2008) IEEE Trans. Audio, Speech and Language Processing , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

14
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel. Speaker and session variability in GMM-based speaker verification. IEEE Trans. Audio, Speech and Language Processing, 15(4):1448-1460, May 2007.
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.4 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

15
- 85073229756
- Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis
- P. M. Bousquet, A. Larcher, D. Matrouf, J. F. Bonastre, and O. Plchot. Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis. In Odyssey, 2012, 2012.
- (2012) Odyssey, 2012
- Bousquet, P.M.¹ Larcher, A.² Matrouf, D.³ Bonastre, J.F.⁴ Plchot, O.⁵

16
- 84856092767
- Inter-session variability modelling and joint factor analysis for face authentication
- R. Wallace, M. McLaren, C. McCool, and S. Marcel. Inter-session variability modelling and joint factor analysis for face authentication. In International Joint Conference on Biometrics, 2011.
- (2011) International Joint Conference on Biometrics
- Wallace, R.¹ McLaren, M.² McCool, C.³ Marcel, S.⁴

17
- 84865733857
- Analysis of i-vector length normalization in speaker recognition systems
- D. Garcia-Romero and C. Y. Espy-Wilson. Analysis of i-vector length normalization in speaker recognition systems. In Proc. Interspeech 2011, pages 249-252, 2011.
- (2011) Proc. Interspeech 2011 , pp. 249-252
- Garcia-Romero, D.¹ Espy-Wilson, C.Y.²

18
- 50949133669
- LIBLINEAR: A library for large linear classification
- R. E. Fan, K. W. Chang, C. J. Hsieh, X. R. Wang, and C. J. Lin. LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res., 9:1871-1874, 2008.
- (2008) J. Mach. Learn. Res , vol.9 , pp. 1871-1874
- Fan, R.E.¹ Chang, K.W.² Hsieh, C.J.³ Wang, X.R.⁴ Lin, C.J.⁵

19
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- R. Martin. Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans. on Speech and Audio Processing, 9(5):504-512, 2001.
- (2001) IEEE Trans. on Speech and Audio Processing , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

20
- 84890535969
- An investigation on back-end for speaker recognition in multi-session enrollment
- G. Liu, T. Hasan, H. Bořil, and J.H.L. Hansen. An investigation on back-end for speaker recognition in multi-session enrollment. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013), 2013.
- (2013) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013)
- Liu, G.¹ Hasan, T.² Bořil, H.³ Hansen, J.H.L.⁴

21
- 84890540706
- CRSS systems for 2012 NIST speaker recognition evaluation
- T. Hasan, S. O. Sadjadi, G. Liu, N. Shokouhi, H. Bořil, and J.H.L. Hansen. CRSS Systems for 2012 NIST Speaker Recognition Evaluation. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013), 2013.
- (2013) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013)
- Hasan, T.¹ Sadjadi, S.O.² Liu, G.³ Shokouhi, N.⁴ Bořil, H.⁵ Hansen, J.H.L.⁶

22
- 85073247582
- Variational Bayes logistic regression as regularized fusion for NIST SRE 2010
- V. Hautamäki, K. A. Lee, A. Larcher, T. Kinnunen, B. Ma, and H. Li. Variational Bayes logistic regression as regularized fusion for NIST SRE 2010. In Odyssey, 2012, 2012.
- (2012) Odyssey, 2012
- Hautamäki, V.¹ Lee, K.A.² Larcher, A.³ Kinnunen, T.⁴ Ma, B.⁵ Li, H.⁶

23
- 84877743396
- Sparse classifier fusion for speaker verification
- V. Hautamaki, T. Kinnunen, F. Sedlak, K.-A. Lee, B. Ma, and H. Li. Sparse classifier fusion for speaker verification. IEEE Trans. Audio, Speech and Language Processing, 2013, Accepted for publication.
- (2013) IEEE Trans. Audio, Speech and Language Processing
- Hautamaki, V.¹ Kinnunen, T.² Sedlak, F.³ Lee, K.-A.⁴ Ma, B.⁵ Li, H.⁶

24
- 85073232294
- A small foot-print i-vector extractor
- P. Kenny. A small foot-print i-vector extractor. In Odyssey, 2012, pages 1-6, 2012.
- (2012) Odyssey, 2012 , pp. 1-6
- Kenny, P.¹

25
- 85073109470
- An i-vector extractor suitable for speaker recognition with both microphone and telephone speech
- M. Senoussaoui, P. Kenny, N. Dehak, and P. Dumouchel. An i-vector extractor suitable for speaker recognition with both microphone and telephone speech. In Odyssey, 2010, pages 28-33, 2010.
- (2010) Odyssey, 2010 , pp. 28-33
- Senoussaoui, M.¹ Kenny, P.² Dehak, N.³ Dumouchel, P.⁴

26
- 84878413073
- PLDA modeling in i-vector and supervector space for speaker verification
- Y. Jiang, K. A. Lee, Z. Tang, B. Ma, A. Larcher, and H. Li. PLDA modeling in i-vector and supervector space for speaker verification. In Proc. Interspeech 2012, 2012.
- (2012) Proc. Interspeech 2012
- Jiang, Y.¹ Lee, K.A.² Tang, Z.³ Ma, B.⁴ Larcher, A.⁵ Li, H.⁶

27
- 33645887246
- Support vector machines using GMM supervectors for speaker verification
- W. M. Campbell, D. E. Sturim, and D. A. Reynolds. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Processing Letters, 13(5):308-311, 2006.
- (2006) IEEE Signal Processing Letters , vol.13 , Issue.5 , pp. 308-311
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³

28
- 33947696754
- SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
- W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006), volume 1, pages 97-100, Toulouse, France, 2006.
- (2006) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006) , vol.1 , pp. 97-100
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³ Solomonoff, A.⁴

29
- 33947630848
- Use of antimodels to further improve state-of-the-art PRLM language recognition system
- P. Matejka, P. Schwarz, L. Burget, and J. Cernocky. Use of antimodels to further improve state-of-the-art PRLM language recognition system. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006), volume 1, pages 197-200, 2006.
- (2006) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006) , vol.1 , pp. 197-200
- Matejka, P.¹ Schwarz, P.² Burget, L.³ Cernocky, J.⁴

30
- 77955790894
- GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition
- C. H. You, K. A. Lee, and H. Li. GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition. IEEE Trans. Audio, Speech and Language Processing, 18(6):1300-1312, 2010.
- (2010) IEEE Trans. Audio, Speech and Language Processing , vol.18 , Issue.6 , pp. 1300-1312
- You, C.H.¹ Lee, K.A.² Li, H.³

31
- 84863799477
- A GMM-supervector approach to language recognition with adaptive relevance factor
- C. H. You, H. Li, and K. A. Lee. A GMM-supervector approach to language recognition with adaptive relevance factor. In Proc. 18th European Conf. on Signal Processing (EUSIPCO 2010), pages 1993-1997, 2010.
- (2010) Proc. 18th European Conf. on Signal Processing (EUSIPCO 2010) , pp. 1993-1997
- You, C.H.¹ Li, H.² Lee, K.A.³

32
- 84906283218
- IIR system description for the 2010 NIST speaker recognition evaluation submission
- B. Ma, H. Sun, K. A. Lee, C. H. You, D. Zhu, E. Wang, R. Tong, C. L. Huang, C. C. Leung, V. Hautamäki, and H. Li. IIR system description for the 2010 NIST speaker recognition evaluation submission. In Proc. NIST SRE 2010 workshop, 2011.
- (2011) Proc. NIST SRE 2010 Workshop
- Ma, B.¹ Sun, H.² Lee, K.A.³ You, C.H.⁴ Zhu, D.⁵ Wang, E.⁶ Tong, R.⁷ Huang, C.L.⁸ Leung, C.C.⁹ Hautamäki, V.¹⁰ Li, H.¹¹

33
- 70349199116
- Comparison of scoring methods used in speaker recognition with joint factor analysis
- O. Glembek, L. Burget, N. Dehak, N. Brummer, and P. Kenny. Comparison of scoring methods used in speaker recognition with joint factor analysis. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009), 2009.
- (2009) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009)
- Glembek, O.¹ Burget, L.² Dehak, N.³ Brummer, N.⁴ Kenny, P.⁵

34
- 84890523079
- Knowing the non-target speakers: The effect of the i-vector population for PLDA training in speaker recognition
- D. A. van Leeuwen and R. Saeidi. Knowing the non-target speakers: The effect of the i-vector population for PLDA training in speaker recognition. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013), 2013.
- (2013) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013)
- Van Leeuwen, D.A.¹ Saeidi, R.²

35
- 71249110320
- A majorization minimization algorithm for (multiple) hyperparameter learning
- C. S. Foo, C. B. Do, and A. Y. Ng. A majorization minimization algorithm for (multiple) hyperparameter learning. In Int. Conf. Mach. Learning, pages 321-328, 2009.
- (2009) Int. Conf. Mach. Learning , pp. 321-328
- Foo, C.S.¹ Do, C.B.² Ng, A.Y.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.