SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 7, 2012, Pages 1990-2001

Low-variance multitaper MFCC features: A case study in robust speaker verification

(7) Kinnunen, Tomi a Saeidi, Rahim a,b Sedlák, Filip a Lee, Kong Aik c Sandberg, Johan d,e Hansson Sandsten, Maria e Li, Haizhou c

a UNIVERSITY OF EASTERN FINLAND (Finland)

b RADBOUD UNIVERSITY NIJMEGEN (Netherlands)

c INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

d Nordea Bank (Denmark)

e LUND UNIVERSITY (Sweden)

Author keywords

Mel frequency cepstral coefficient (MFCC); multitaper; small variance estimation; speaker verification

Indexed keywords

AUDIO APPLICATIONS; AUTO REGRESSIVE PROCESS; BIAS AND VARIANCE; FREQUENCY DOMAINS; GAUSSIAN MIXTURE MODEL; JOINT FACTOR ANALYSIS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MULTITAPER; MULTITAPER METHODS; MULTITAPERS; PARAMETER SELECTION; ROBUST SPEAKER VERIFICATION; SIGNAL SPECTRUM; SPEAKER VERIFICATION; SPECTRAL LEAKAGE; TIME DOMAIN; UNIVERSAL BACKGROUND MODEL;

DISCRETE FOURIER TRANSFORMS; GESTURE RECOGNITION; SOFTWARE AGENTS; SPEECH PROCESSING; SUPPORT VECTOR MACHINES; TELEPHONE SYSTEMS; TIME DOMAIN ANALYSIS;

SPEECH RECOGNITION;

EID: 84860850285 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2191960 Document Type: Article

Times cited : (131)

References (51)

1
- 0017851927
- On the use of windows for harmonic analysis with the discrete Fourier transform
- Jan.
- F. J. Harris, "On the use of windows for harmonic analysis with the discrete Fourier transform," Proc. IEEE, vol. 66, no. 1, pp. 51-84, Jan. 1978.
- (1978) Proc. IEEE , vol.66 , Issue.1 , pp. 51-84
- Harris, F.J.¹

2
- 0016495091
- Linear prediction: A tutorial review
- Apr.
- J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 64, no. 4, pp. 561-580, Apr. 1975.
- (1975) Proc. IEEE , vol.64 , Issue.4 , pp. 561-580
- Makhoul, J.¹

3
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. 28, no. 4, pp. 357-366, Aug. 1980. (Pubitemid 11464930)
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis Steven, B.¹ Mermelstein Paul²

4
- 0004056285
- Upper Saddle River, New Jersey: Prentice-Hall
- X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Upper Saddle River, New Jersey: Prentice-Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

5
- 0028517164
- RASTA processing of speech
- Oct.
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

6
- 34347376319
- Temporal structure normalization of speech feature for robust speech recognition
- DOI 10.1109/LSP.2006.891341
- X. Xiao, E.-S. Chng, and H. Li, "Temporal structure normalization of speech feature for robust speech recognition," IEEE Signal Process. Lett., vol. 14, no. 7, pp. 500-503, Jul. 2007. (Pubitemid 47018924)
- (2007) IEEE Signal Processing Letters , vol.14 , Issue.7 , pp. 500-503
- Xiao, X.¹ Chng, E.S.² Li, H.³

7
- 85073258179
- Feature warping for robust speaker verification
- Crete, Greece, Jun.
- J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in Proc. Speaker Odyssey: Speaker Recognition Workshop (Odyssey 2001), Crete, Greece, Jun. 2001, pp. 213-218.
- (2001) Proc. Speaker Odyssey: Speaker Recognition Workshop (Odyssey 2001) , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

8
- 42549139762
- MVA processing of speech features
- Jan.
- C.-P. Chen and J. A. Bilmes, "MVA processing of speech features," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 257-270, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 257-270
- Chen, C.-P.¹ Bilmes, J.A.²

9
- 70349223791
- Optimal cepstrum estimation using multiple windows
- Taipei, Taiwan, Apr.
- M. Hansson-Sandsten and J. Sandberg, "Optimal cepstrum estimation using multiple windows," in Proc. ICASSP '09, Taipei, Taiwan, Apr. 2009, pp. 3077-3080.
- (2009) Proc. ICASSP ' , vol.9 , pp. 3077-3080
- Hansson-Sandsten, M.¹ Sandberg, J.²

10
- 0003590536
- Cambridge, MA: Cambridge Univ. Press
- D. B. Percival and A. T.Walden, Spectral Analysis for Physical Applications. Cambridge, MA: Cambridge Univ. Press, 1993.
- (1993) Spectral Analysis for Physical Applications
- Percival, D.B.¹ Walden, A.T.²

11
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- Jan.
- T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: From features to supervectors," Speech Comm., vol. 52, no. 1, pp. 12-40, Jan. 2010.
- (2010) Speech Comm. , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

12
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, no. 1, pp. 19-41, Jan. 2000. (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

13
- 33645887246
- Support vector machines using GMM supervectors for speaker verification
- May
- W. M. Campbell, D. E. Sturim, and D. A. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, May 2006.
- (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³

14
- 51449086024
- Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006
- Sep.
- N. Brümmer, L. Burget, J. H. Cernocký, O. Glembek, F. Grézl, M. Karafiát, D. A. v. Leeuwen, P. Matejka, P. Schwartz, and A. Strasheim, "Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2072-2084, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2072-2084
- Brümmer, N.¹ Burget, L.² Cernocký, J.H.³ Glembek, O.⁴ Grézl, F.⁵ Karafiát, M.⁶ Leeuwen, D.A.V.⁷ Matejka, P.⁸ Schwartz, P.⁹ Strasheim, A.¹⁰

15
- 58349106697
- A study of inter-speaker variability in speaker verification
- Jul.
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 5, pp. 980-988, Jul. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

16
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1435-1447, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

17
- 51449111842
- Speaker recognition with session variability normalization based on MLLR adaptation transforms
- Sep.
- A. Stolcke, S. S. Kajarekar, L. Ferrer, and E. Shriberg, "Speaker recognition with session variability normalization based on MLLR adaptation transforms," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1987-1998, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 1987-1998
- Stolcke, A.¹ Kajarekar, S.S.² Ferrer, L.³ Shriberg, E.⁴

18
- 77955790894
- GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition
- Aug.
- C. H. You, K. A. Lee, and H. Li, "GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1300-1312, Aug. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1300-1312
- You, C.H.¹ Lee, K.A.² Li, H.³

19
- 79951609039
- Front-end factor analysis for speaker verification
- May
- N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 788-798, May 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.J.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

20
- 33846259282
- Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
- Mar.
- A. Davis, S. Nordholm, and R. Togneri, "Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 2, pp. 412-424, Mar. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 412-424
- Davis, A.¹ Nordholm, S.² Togneri, R.³

21
- 78649989192
- Robust voice activity detection using long-term signal variability
- Mar.
- P. K. Ghosh, A. Tsiartas, and S. Narayanan, "Robust voice activity detection using long-term signal variability," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 3, pp. 600-613, Mar. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.3 , pp. 600-613
- Ghosh, P.K.¹ Tsiartas, A.² Narayanan, S.³

22
- 0742307391
- Jan.
- Y. Hu and P. C. Loizou, "Speech enhancement based on wavelet thresholding the multitaper spectrum," , vol. 12, no. 1, pp. 59-67, Jan. 2004.
- (2004) Speech Enhancement Based on Wavelet Thresholding the Multitaper Spectrum , vol.12 , Issue.1 , pp. 59-67
- Hu, Y.¹ Loizou, P.C.²

23
- 17244374373
- Multitapering and a wavelet variant of MFCC in speech recognition
- DOI 10.1049/ip-vis:20051004
- L. P. Ricotti, "Multitapering and a wavelet variant of MFCC in speech recognition," IEE Proc. Vis., Image Signal Process., vol. 152, no. 1, pp. 29-35, Feb 2005. (Pubitemid 40527360)
- (2005) IEE Proceedings: Vision, Image and Signal Processing , vol.152 , Issue.1 , pp. 29-35
- Ricotti, L.P.¹

24
- 0020189541
- Spectrum estimation and harmonic analysis
- Sep.
- D. J. Thomson, "Spectrum estimation and harmonic analysis," Proc. IEEE, vol. 70, no. 9, pp. 1055-1096, Sep. 1982.
- (1982) Proc. IEEE , vol.70 , Issue.9 , pp. 1055-1096
- Thomson, D.J.¹

25
- 0029184722
- Minimum bias multiple taper spectral estimation
- Jan
- K. S. Riedel and A. Sidorenko, "Minimum bias multiple taper spectral estimation," IEEE Trans. Signal Process., vol. 43, no. 1, pp. 188-195, Jan 1995.
- (1995) IEEE Trans. Signal Process. , vol.43 , Issue.1 , pp. 188-195
- Riedel, K.S.¹ Sidorenko, A.²

26
- 0031095319
- A multiple window method for estimation of peaked spectra
- PII S1053587X97018710
- M. Hansson and G. Salomonsson, "A multiple window method for estimation of peaked spectra," IEEE Trans. Signal Process., vol. 45, no. 3, pp. 778-781, Mar. 1997. (Pubitemid 127765966)
- (1997) IEEE Transactions on Signal Processing , vol.45 , Issue.3 , pp. 778-781
- Hansson, M.¹ Salomonsson, G.²

27
- 77249096360
- Multitaper estimation of frequency-warped cepstra with application to speaker verification
- Apr.
- J. Sandberg, M. Hansson-Sandsten, T. Kinnunen, R. Saeidi, P. Flandrin, and P. Borgnat, "Multitaper estimation of frequency-warped cepstra with application to speaker verification," IEEE Signal Process. Lett., vol. 17, no. 4, pp. 343-346, Apr. 2010.
- (2010) IEEE Signal Process. Lett. , vol.17 , Issue.4 , pp. 343-346
- Sandberg, J.¹ Hansson-Sandsten, M.² Kinnunen, T.³ Saeidi, R.⁴ Flandrin, P.⁵ Borgnat, P.⁶

28
- 0028378535
- The variance of multitaper spectrum estimates for real gaussian processes
- Feb.
- A. T. Walden, E. McCoy, and D. B. Percival, "The variance of multitaper spectrum estimates for real gaussian processes," IEEE Trans. Signal Process., vol. 42, no. 2, pp. 479-482, Feb. 1994.
- (1994) IEEE Trans. Signal Process. , vol.42 , Issue.2 , pp. 479-482
- Walden, A.T.¹ McCoy, E.² Percival, D.B.³

29
- 0026965779
- On the performance advantage of multitaper spectral analysis
- Dec.
- T. P. Bronez, "On the performance advantage of multitaper spectral analysis," IEEE Trans. on Sign. Proc., vol. 40, no. 12, pp. 2941-2946, Dec. 1992.
- (1992) IEEE Trans. on Sign. Proc. , vol.40 , Issue.12 , pp. 2941-2946
- Bronez, T.P.¹

30
- 84908144695
- The use of Fast Fourier Transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms
- Jun.
- P. D. Welch, "The use of Fast Fourier Transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms," IEEE Trans. Audio Electroacoust., vol. AU-15, no. 2, pp. 70-73, Jun. 1967.
- (1967) IEEE Trans. Audio Electroacoust. , vol.AU-15 , Issue.2 , pp. 70-73
- Welch, P.D.¹

31
- 84860843356
- Multitaper analysis of fundamental frequency variations during voiced fricatives
- Dec.
- C. H. Shadle and G. Ramsay, "Multitaper analysis of fundamental frequency variations during voiced fricatives," in Proc. 6th Int. Seminar Speech Product., Dec. 2003, p. CD-6.
- (2003) Proc. 6th Int. Seminar Speech Product.
- Shadle, C.H.¹ Ramsay, G.²

32
- 33847668886
- Multitaper covariance estimation and spectral denoising
- 1599939, Conference Record of The Thirty-Ninth Asilomar Conference on Signals, Systems and Computers
- N. Erdol and T. Gunes, "Multitaper covariance estimation and spectral denoising," in Proc. Conf. Rec. 39th Asilomar Conf. Signals, Syst., Comput., Nov. 2005, pp. 1144-1147. (Pubitemid 46350492)
- (2005) Conference Record - Asilomar Conference on Signals, Systems and Computers , vol.2005 , pp. 1144-1147
- Erdol, N.¹ Gunes, T.²

33
- 79959826333
- What else is new than the Hamming window? Robust MFCCs for speaker recognition via multitapering
- Japan, Sep.
- T. Kinnunen, R. Saeidi, J. Sandberg, and M. Hansson-Sandsten, "What else is new than the Hamming window? Robust MFCCs for speaker recognition via multitapering," in Proc. Interspeech, Makuhari, Japan, Sep. 2010, pp. 2734-2737.
- (2010) Proc. Interspeech, Makuhari , pp. 2734-2737
- Kinnunen, T.¹ Saeidi, R.² Sandberg, J.³ Hansson-Sandsten, M.⁴

34
- 33645895387
- Advances in channel compensation for SVM speaker recognition
- Philadelphia, PA, Mar.
- A. Solomonoff, W. M. Campbell, and I. Boardman, "Advances in channel compensation for SVM speaker recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP 2005), Philadelphia, PA, Mar. 2005, pp. 629-632.
- (2005) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP 2005) , pp. 629-632
- Solomonoff, A.¹ Campbell, W.M.² Boardman, I.³

35
- 84860848915
- P. Kenny, Joint factor analysis of speaker and session variability: Theory and algorithms Tech. Rep. CRIM-06/08-14, 2006.
- (2006) Joint Factor Analysis of Speaker and Session Variability: Theory and Algorithms Tech. Rep. CRIM-06/08-14
- Kenny, P.¹

36
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Speaker and session variability in GMM-based speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1448-1460, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

37
- 85032751338
- Jackknifing multitaper spectrum estimates
- DOI 10.1109/MSP.2007.4286561
- D. J. Thomson, "Jackknifing multitaper spectrum estimates," IEEE Signal Process. Mag., vol. 24, no. 4, pp. 20-30, Jul. 2007. (Pubitemid 47316164)
- (2007) IEEE Signal Processing Magazine , vol.24 , Issue.4 , pp. 20-30
- Thomson, D.J.¹

38
- 70350488536
- On the statistics of spectral amplitudes after variance reduction by temporal cepstrum smoothing and cepstral nulling
- Nov
- T. Gerkmann and R. Martin, "On the statistics of spectral amplitudes after variance reduction by temporal cepstrum smoothing and cepstral nulling," IEEE Trans. Signal Process., vol. 57, no. 11, pp. 4165-4174, Nov 2009.
- (2009) IEEE Trans. Signal Process. , vol.57 , Issue.11 , pp. 4165-4174
- Gerkmann, T.¹ Martin, R.²

39
- 0000120766
- Estimating the dimension of a model
- Mar.
- G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, pp. 461-464, Mar. 1978.
- (1978) Ann. Statist. , vol.6 , pp. 461-464
- Schwarz, G.¹

40
- 0002537922
- Algorithm 808: ARFIT - A Matlab package for the estimation of parameters and eigenmodes of multivariate autoregressive models
- DOI 10.1145/382043.382316
- T. Schneider and A. Neumaier, "Algorithm 808: ARfit-a Matlab package for the estimation of parameters and eigenmodes of multivariate autoregressive models," ACM Trans. Math. Softw., vol. 27, pp. 58-65, 2001. (Pubitemid 33609115)
- (2001) ACM Transactions on Mathematical Software , vol.27 , Issue.1 , pp. 58-65
- Schneider, T.¹ Neumaier, A.²

41
- 0033884857
- Score normalization for text-independent speaker verification systems
- DOI 10.1006/dspr.1999.0360
- R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, no. 1-3, pp. 42-54, Jan. 2000. (Pubitemid 30592165)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Thomas, H.³

42
- 33745210768
- Modelling session variability in text-independent speaker verification
- Lisbon, Portugal, Sep.
- R. Vogt, B. Baker, and S. Sridharan, "Modelling session variability in text-independent speaker verification," in Proc. Interspeech '05, Lisbon, Portugal, Sep. 2005, pp. 3117-3120.
- (2005) Proc. Interspeech ' , vol.5 , pp. 3117-3120
- Vogt, R.¹ Baker, B.² Sridharan, S.³

43
- 70350101560
- Particle swarm optimization for sorted adapted Gaussian mixture models
- Feb.
- R. Saeidi, H. R. S. Mohammadi, T. Ganchev, and R. D. Rodman, "Particle swarm optimization for sorted adapted Gaussian mixture models," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 344-353, Feb. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 344-353
- Saeidi, R.¹ Mohammadi, H.R.S.² Ganchev, T.³ Rodman, R.D.⁴

44
- 77952192470
- Temporally weighted linear prediction features for tackling additive noise in speaker verification
- Jun.
- R. Saeidi, J. Pohjalainen, T. Kinnunen, and P. Alku, "Temporally weighted linear prediction features for tackling additive noise in speaker verification," IEEE Signal Process. Lett., vol. 17, no. 6, pp. 599-602, Jun. 2010.
- (2010) IEEE Signal Process. Lett. , vol.17 , Issue.6 , pp. 599-602
- Saeidi, R.¹ Pohjalainen, J.² Kinnunen, T.³ Alku, P.⁴

45
- 79959832654
- Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions
- Makuhari, Japan, Sep.
- J. Pohjalainen, R. Saeidi, T. Kinnunen, and P. Alku, "Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions," in Proc. Interspeech '10, Makuhari, Japan, Sep. 2010, pp. 1477-1480.
- (2010) Proc. Interspeech ' , vol.10 , pp. 1477-1480
- Pohjalainen, J.¹ Saeidi, R.² Kinnunen, T.³ Alku, P.⁴

46
- 70349203858
- The I4U system in NIST 2008 speaker recognition evaluation
- Taipei, Taiwan, Apr.
- H. Li, B. Ma, K.-A. Lee, H. Sun, D. Zhu, K. C. Sim, C. You, R. Tong, I. Kärkkäinen, C.-L. Huang, V. Pervouchine, W. Guo, Y. Li, L. Dai, M. Nosratighods, T. Tharmarajah, J. Epps, E. Ambikairajah, E.-S. Chng, T. Schultz, and Q. Jin, "The I4U system in NIST 2008 speaker recognition evaluation," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP '09), Taipei, Taiwan, Apr. 2009, pp. 4201-4204.
- (2009) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP '09) , pp. 4201-4204
- Li, H.¹ Ma, B.² Lee, K.-A.³ Sun, H.⁴ Zhu, D.⁵ Sim, K.C.⁶ You, C.⁷ Tong, R.⁸ Kärkkäinen, I.⁹ Huang, C.-L.¹⁰ Pervouchine, V.¹¹ Guo, W.¹² Li, Y.¹³ Dai, L.¹⁴ Nosratighods, M.¹⁵ Tharmarajah, T.¹⁶ Epps, J.¹⁷ Ambikairajah, E.¹⁸ Chng, E.-S.¹⁹ Schultz, T.²⁰ Jin, Q.²¹ more..

47
- 84906232348
- Temporally weighted linear prediction features for speaker verification in additive noise
- Brno, Czech Republic, Jun.
- R. Saeidi, J. Pohjalainen, T. Kinnunen, and P. Alku, "Temporally weighted linear prediction features for speaker verification in additive noise," in Proc. Odyssey 2010: Speaker Lang. Recogni. Workshop, Brno, Czech Republic, Jun. 2010.
- (2010) Proc. Odyssey 2010: Speaker Lang. Recogni. Workshop
- Saeidi, R.¹ Pohjalainen, J.² Kinnunen, T.³ Alku, P.⁴

48
- 34447100796
- Boca Raton, FL: CRC
- P. C. Loizou, Speech Enhancement: Theory and Practice. Boca Raton, FL: CRC, 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.C.¹

49
- 85046873967
- The DET curve in assessment of detection task performance
- Rhodos, Greece, Sep.
- A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. 5th Eur. Conf. Speech Commun. Technol. (Eurospeech '97), Rhodos, Greece, Sep. 1997, pp. 1895-1898.
- (1997) Proc. 5th Eur. Conf. Speech Commun. Technol. (Eurospeech '97) , pp. 1895-1898
- Martin, A.¹ Doddington, G.² Kamm, T.³ Ordowski, M.⁴ Przybocki, M.⁵

50
- 29044433161
- NIST and NFI-TNO evaluations of automatic speaker recognition
- DOI 10.1016/j.csl.2005.07.001, PII S088523080500032X, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
- D. A. van Leeuwen, A. F. Martin, M. A. Przybocki, and J. S. Bouten, "NIST and NFI-TNO evaluations of automatic speaker recognition," Comput. Speech Lang., vol. 20, pp. 128-158, Apr.-Jul. 2006. (Pubitemid 41787534)
- (2006) Computer Speech and Language , vol.20 , Issue.2-3 SPEC. ISS. , pp. 128-158
- Van Leeuwen, D.A.¹ Martin, A.F.² Przybocki, M.A.³ Bouten, J.S.⁴

51
- 84858967474
- Multitaper MFCC features for speaker verification using i-vectors
- Dec.
- M. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, and D. O'Shaughnessy, "Multitaper MFCC features for speaker verification using i-vectors," in Proc. IEEE Autom. Speech Recognit. Understanding (ASRU 2011), Dec. 2011, pp. 547-552.
- (2011) Proc. IEEE Autom. Speech Recognit. Understanding (ASRU 2011) , pp. 547-552
- Alam, M.J.¹ Kinnunen, T.² Kenny, P.³ Ouellet, P.⁴ O'Shaughnessy, D.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.