SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2013, Pages 7214-7218

Robust front-end processing for speaker identification over extremely degraded communication channels

(2) Sadjadi, Seyed Omid (a); Hansen, John H. L. (a)

(a) UNIVERSITY OF TEXAS AT DALLAS (United States)

Author keywords

Mean Hilbert Envelope Coefficients (MHEC); speaker identification (SID); spectral flux; speech activity detection (SAD); voicing measures

Indexed keywords

HILBERT ENVELOPE; SPEAKER IDENTIFICATION; SPECTRAL FLUX; SPEECH ACTIVITY DETECTIONS; VOICING MEASURES;

FEATURE EXTRACTION; MICROPHONES; SIGNAL PROCESSING;

LOUDSPEAKERS;

EID: 84890490765 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6639063 Document Type: Conference Paper

Times cited : (9)

References (32)

1
- 0031238211
- ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
- Sept
- A. Benyassine, E. Shlomot, H.-Y. Su, D. Massaloux, C. Lamblin, and J.-P. Petit, "ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Commun. Mag., vol. 35, pp. 64-73, Sept. 1997.
- (1997) IEEE Commun. Mag. , vol.35 , pp. 64-73
- Benyassine, A.¹ Shlomot, E.² Su, H.-Y.³ Massaloux, D.⁴ Lamblin, C.⁵ Petit, J.-P.⁶

2
- 0016962193
- A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition
- Jun
- B. S. Atal and L. R. Rabiner, "A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition," IEEE Trans. Audio Speech Lang. Process., vol. 24, no. 3, pp. 201-212, Jun. 1976.
- (1976) IEEE Trans. Audio Speech Lang. Process. , vol.24 , Issue.3 , pp. 201-212
- Atal, B.S.¹ Rabiner, L.R.²

3
- 14544287662
- Robust detection of speech activity in the presence of noise
- Dec
- R. Sarikaya and J. H. L. Hansen, "Robust detection of speech activity in the presence of noise," in Proc. ICSLP, Dec. 1998, pp. 1455-1458.
- (1998) Proc. ICSLP , pp. 1455-1458
- Sarikaya, R.¹ Hansen, J.H.L.²

4
- 17344389852
- Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
- May
- B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system," in Proc. IEEE ICASSP, May 2002, pp. 53-56.
- (2002) Proc. IEEE ICASSP , pp. 53-56
- Kingsbury, B.¹ Saon, G.² Mangu, L.³ Padmanabhan, M.⁴ Sarikaya, R.⁵

5
- 33947620115
- Hierarchical structures of neural networks for phoneme recognition
- May
- P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," in Proc. IEEE ICASSP, May 2006, p. I.
- (2006) Proc. IEEE ICASSP , pp. 1
- Schwarz, P.¹ Matejka, P.² Cernocky, J.³

6
- 79960665866
- The delta-phase spectrum with application to voice activity detection and speaker recognition
- Sept
- I. McCowan, D. Dean, M. McLaren, R. Vogt, and S. Sridharan, "The delta-phase spectrum with application to voice activity detection and speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 7, pp. 2026-2038, Sept. 2011.
- (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.7 , pp. 2026-2038
- McCowan, I.¹ Dean, D.² McLaren, M.³ Vogt, R.⁴ Sridharan, S.⁵

7
- 84878535284
- Developing a speech activity detection system for the DARPA RATS program
- Sept
- T. Ng, B. Zhang, L. Nguyen, S. Matsoukas, X. Zhou, N. Mesgarani, K. Vesely, and P. Matejka, "Developing a speech activity detection system for the DARPA RATS program," in Proc. INTERSPEECH, Sept. 2012.
- (2012) Proc. INTERSPEECH
- Ng, T.¹ Zhang, B.² Nguyen, L.³ Matsoukas, S.⁴ Zhou, X.⁵ Mesgarani, N.⁶ Vesely, K.⁷ Matejka, P.⁸

8
- 84878413949
- Patrol team language identification system for DARPA RATS P1 evaluation
- Sept
- P. Matejka, O. Plchot, M. Soufifar, O. Glembek, L. D'Haro, K. Vesely, F. Grezl, J. Ma, S. Matsoukas, and N. Dehak, "Patrol team language identification system for DARPA RATS P1 evaluation," in Proc. INTERSPEECH, Sept. 2012.
- (2012) Proc. INTERSPEECH
- Matejka, P.¹ Plchot, O.² Soufifar, M.³ Glembek, O.⁴ D'Haro, L.⁵ Vesely, K.⁶ Grezl, F.⁷ Ma, J.⁸ Matsoukas, S.⁹ Dehak, N.¹⁰

9
- 74549182880
- A multidecision sub-band voice activity detector
- Sept
- A. Davis, S. Nordholm, S. Y. Low, and R. Togneri, "A multidecision sub-band voice activity detector," in Proc. EUSIPCO, Sept. 2006.
- (2006) Proc. EUSIPCO
- Davis, A.¹ Nordholm, S.² Low, S.Y.³ Togneri, R.⁴

10
- 84873310339
- The RATS radio traffic collection system
- Jun
- K. Walker and S. Strassel, "The RATS radio traffic collection system," in Proc. ISCA Odyssey, Jun. 2012.
- (2012) Proc. ISCA Odyssey
- Walker, K.¹ Strassel, S.²

11
- 84878548167
- Speech activity detection for noisy data using adaptation techniques
- Sept
- M. K. Omar, "Speech activity detection for noisy data using adaptation techniques," in Proc. INTERSPEECH, Sept. 2012.
- (2012) Proc. INTERSPEECH
- Omar, M.K.¹

12
- 1842476689
- Efficient voice activity detection algorithms using long-term speech information
- Apr
- J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol. 42, pp. 271-287, Apr. 2004.
- (2004) Speech Commun. , vol.42 , pp. 271-287
- Ramirez, J.¹ Segura, J.C.² Benitez, C.³ De La Torre, A.⁴ Rubio, A.⁵

13
- 79959844439
- Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments
- Sept
- M. C. Huggins, B. Y. Smolenski, and A. D. Lawson, "Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments," in Proc. INTERSPEECH, Sept. 2010, pp. 3094-3097.
- (2010) Proc. INTERSPEECH , pp. 3094-3097
- Huggins, M.C.¹ Smolenski, B.Y.² Lawson, A.D.³

14
- 0032762471
- A statistical model-based voice activity detection
- Jan
- J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
- (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

15
- 23344452899
- Statistical voice activity detection using a multiple observation likelihood ratio test
- Oct
- J. Ramirez, J. Segura, C. Benitez, L. Garcia, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, Oct. 2005.
- (2005) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 689-692
- Ramirez, J.¹ Segura, J.² Benitez, C.³ Garcia, L.⁴ Rubio, A.⁵

16
- 33846259282
- Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
- Mar
- A. Davis, S. Nordholm, and R. Togneri, "Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold," IEEE Trans. Audio Speech Lang. Process., vol. 14, pp. 412-424, Mar. 2006.
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , pp. 412-424
- Davis, A.¹ Nordholm, S.² Togneri, R.³

17
- 78049406668
- Voice activity detection using harmonic frequency components in likelihood ratio test
- Mar
- L. N. Tan, B. J. Borgstrom, and A. Alwan, "Voice activity detection using harmonic frequency components in likelihood ratio test," in Proc. IEEE ICASSP, Mar. 2010, pp. 4466-4469.
- (2010) Proc. IEEE ICASSP , pp. 4466-4469
- Tan, L.N.¹ Borgstrom, B.J.² Alwan, A.³

18
- 84890539402
- DARPA Robust Automatic Transcription of Speech (RATS)
- DARPA Robust Automatic Transcription of Speech (RATS). [Online]. Available: http://projects.ldc.upenn.edu/RATS

19
- 79951609039
- Front-end factor analysis for speaker verification
- May
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 4, pp. 788-798, May 2011.
- (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

20
- 80051641505
- Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
- May
- S. O. Sadjadi and J. H. L. Hansen, "Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions," in Proc. IEEE ICASSP, May 2011, pp. 5448-5451.
- (2011) Proc. IEEE ICASSP , pp. 5448-5451
- Sadjadi, S.O.¹ Hansen, J.H.L.²

21
- 84878408467
- Mean Hilbert envelope coefficients (MHEC) for robust speaker recognition
- Sept
- S. O. Sadjadi, T. Hasan, and J. H. L. Hansen, "Mean Hilbert envelope coefficients (MHEC) for robust speaker recognition," in Proc. INTERSPEECH, Sept. 2012.
- (2012) Proc. INTERSPEECH
- Sadjadi, S.O.¹ Hasan, T.² Hansen, J.H.L.³

22
- 84858995082
- Linear versus mel frequency cepstral coefficients for speaker recognition
- Dec
- X. Zhou, D. Garcia-Romero, R. Duraiswami, C. Espy-Wilson, and S. Shamma, "Linear versus mel frequency cepstral coefficients for speaker recognition," in Proc. IEEE ASRU, Dec. 2011, pp. 559-564.
- (2011) Proc. IEEE ASRU , pp. 559-564
- Zhou, X.¹ Garcia-Romero, D.² Duraiswami, R.³ Espy-Wilson, C.⁴ Shamma, S.⁵

23
- 84878378089
- Regularized all-pole models for speaker verification under noisy environments
- Mar
- C. Hanilci, T. Kinnunen, F. Ertas, R. Saeidi, J. Pohjalainen, and P. Alku, "Regularized all-pole models for speaker verification under noisy environments," IEEE Signal Process. Lett., vol. 19, pp. 163-166, Mar. 2012.
- (2012) IEEE Signal Process. Lett. , vol.19 , pp. 163-166
- Hanilci, C.¹ Kinnunen, T.² Ertas, F.³ Saeidi, R.⁴ Pohjalainen, J.⁵ Alku, P.⁶

24
- 84860850285
- Low-variance multitaper MFCC features: A case study in robust speaker verification
- Sept
- T. Kinnunen, R. Saeidi, F. Sedlak, K. A. Lee, J. Sandberg, M. Hansson-Sandsten, and H. Li, "Low-variance multitaper MFCC features: A case study in robust speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 7, pp. 1990-2001, Sept. 2012.
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.7 , pp. 1990-2001
- Kinnunen, T.¹ Saeidi, R.² Sedlak, F.³ Lee, K.A.⁴ Sandberg, J.⁵ Hansson-Sandsten, M.⁶ Li, H.⁷

25
- 84870238795
- Multitaper MFCC and PLP features for speaker verification using i-vectors
- Feb
- M. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, and D. O'Shaughnessy, "Multitaper MFCC and PLP features for speaker verification using i-vectors," Speech Commun., vol. 55, no. 2, pp. 237-251, Feb. 2013.
- (2013) Speech Commun. , vol.55 , Issue.2 , pp. 237-251
- Alam, M.J.¹ Kinnunen, T.² Kenny, P.³ Ouellet, P.⁴ O'Shaughnessy, D.⁵

26
- 0001835850
- Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of the sampled sound
- P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of the sampled sound," in Proc. Institute of Phonetic Sciences, vol. 17, 1993, pp. 97-110.
- (1993) Proc. Institute of Phonetic Sciences , vol.17 , pp. 97-110
- Boersma, P.¹

27
- 84873313607
- Theory and Applications of Digital Speech Processing
- L. R. Rabiner and R. W. Schafer, Theory and Applications of Digital Speech Processing, 1st ed. Upper Saddle River, NJ: Prentice Hall Press, 2010.
- (2010) Theory and Applications of Digital Speech Processing
- Rabiner, L.R.¹ Schafer, R.W.²

28
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. IEEE ICASSP, Apr. 1997, pp. 1331-1334.
- Proc. IEEE ICASSP, Apr. 1997 , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

29
- 84873315510
- Unsupervised speech activity detection using voicing measures and perceptual spectral flux
- Mar
- S. O. Sadjadi and J. H. L. Hansen, "Unsupervised speech activity detection using voicing measures and perceptual spectral flux," IEEE Signal Process. Lett., vol. 20, pp. 197-200, Mar. 2013.
- (2013) IEEE Signal Process. Lett. , vol.20 , pp. 197-200
- Sadjadi, S.O.¹ Hansen, J.H.L.²

30
- 84890455722
- HTK-Hidden Markov Model Toolkit v3.4.1
- HTK-Hidden Markov Model Toolkit v3.4.1. [Online]. Available: http://htk.eng.cam.ac.uk

31
- 50649094277
- Probabilistic linear discriminant analysis for inferences about identity
- S. Prince and J. Elder, "Probabilistic linear discriminant analysis for inferences about identity," in Proc. IEEE Int. Conf. Computer Vision, ICCV 2007, Oct. 2007, pp. 1-8.
- Proc. IEEE Int. Conf. Computer Vision, ICCV 2007, Oct. 2007 , pp. 1-8
- Prince, S.¹ Elder, J.²

32
- 84865733857
- Analysis of i-vector length normalization in speaker recognition systems
- D. Garcia-Romero and C. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems," in Proc. INTERSPEECH, Sept. 2011, pp. 249-252.
- Proc. INTERSPEECH, Sept. 2011 , pp. 249-252
- Garcia-Romero, D.¹ Espy-Wilson, C.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.