2013, Pages 7214-7218

Robust front-end processing for speaker identification over extremely degraded communication channels

Author keywords

Mean Hilbert Envelope Coefficients (MHEC); speaker identification (SID); spectral flux; speech activity detection (SAD); voicing measures

Indexed keywords

HILBERT ENVELOPE; SPEAKER IDENTIFICATION; SPECTRAL FLUX; SPEECH ACTIVITY DETECTIONS; VOICING MEASURES

EID: 84890490765     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639063     Document Type: Conference Paper
Times cited: 9

References (32)
  • 1
    • 0031238211
    • ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Shlomot, H.-Y. Su, D. Massaloux, C. Lamblin, and J.-P. Petit, "ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Commun. Mag., vol. 35, pp. 64-73, Sept. 1997.
    • (1997) IEEE Commun. Mag. , vol.35 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.-Y.3    Massaloux, D.4    Lamblin, C.5    Petit, J.-P.6
  • 2
    • 0016962193
    • A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition
    • B. S. Atal and L. R. Rabiner, "A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 24, no. 3, pp. 201-212, Jun. 1976.
    • (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.24 , Issue.3 , pp. 201-212
    • Atal, B.S.1    Rabiner, L.R.2
  • 3
    • 14544287662
    • Robust detection of speech activity in the presence of noise
    • R. Sarikaya and J. H. L. Hansen, "Robust detection of speech activity in the presence of noise," in Proc. ICSLP, Dec. 1998, pp. 1455-1458.
    • (1998) Proc. ICSLP , pp. 1455-1458
    • Sarikaya, R.1    Hansen, J.H.L.2
  • 4
    • 17344389852
    • Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
    • B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system," in Proc. IEEE ICASSP, May 2002, pp. 53-56.
    • (2002) Proc. IEEE ICASSP , pp. 53-56
    • Kingsbury, B.1    Saon, G.2    Mangu, L.3    Padmanabhan, M.4    Sarikaya, R.5
  • 5
    • 33947620115
    • Hierarchical structures of neural networks for phoneme recognition
    • P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," in Proc. IEEE ICASSP, May 2006, p. I.
    • (2006) Proc. IEEE ICASSP , pp. 1
    • Schwarz, P.1    Matejka, P.2    Cernocky, J.3
  • 6
    • 79960665866
    • The delta-phase spectrum with application to voice activity detection and speaker recognition
    • I. McCowan, D. Dean, M. McLaren, R. Vogt, and S. Sridharan, "The delta-phase spectrum with application to voice activity detection and speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 7, pp. 2026-2038, Sept. 2011.
    • (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.7 , pp. 2026-2038
    • McCowan, I.1    Dean, D.2    McLaren, M.3    Vogt, R.4    Sridharan, S.5
  • 10
    • 84873310339
    • The RATS radio traffic collection system
    • K. Walker and S. Strassel, "The RATS radio traffic collection system," in Proc. ISCA Odyssey, Jun. 2012.
    • (2012) Proc. ISCA Odyssey
    • Walker, K.1    Strassel, S.2
  • 11
    • 84878548167
    • Speech activity detection for noisy data using adaptation techniques
    • M. K. Omar, "Speech activity detection for noisy data using adaptation techniques," in Proc. INTERSPEECH, Sept. 2012.
    • (2012) Proc. INTERSPEECH
    • Omar, M.K.1
  • 12
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol. 42, pp. 271-287, Apr. 2004.
    • (2004) Speech Commun. , vol.42 , pp. 271-287
    • Ramirez, J.1    Segura, J.C.2    Benitez, C.3    De La Torre, A.4    Rubio, A.5
  • 13
    • 79959844439
    • Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments
    • M. C. Huggins, B. Y. Smolenski, and A. D. Lawson, "Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments," in Proc. INTERSPEECH, Sept. 2010, pp. 3094-3097.
    • (2010) Proc. INTERSPEECH , pp. 3094-3097
    • Huggins, M.C.1    Smolenski, B.Y.2    Lawson, A.D.3
  • 14
    • 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 15
    • 23344452899
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • J. Ramirez, J. Segura, C. Benitez, L. Garcia, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, Oct. 2005.
    • (2005) IEEE Signal Process. Lett. , vol.12 , Issue.10 , pp. 689-692
    • Ramirez, J.1    Segura, J.2    Benitez, C.3    Garcia, L.4    Rubio, A.5
  • 16
    • 33846259282
    • Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
    • A. Davis, S. Nordholm, and R. Togneri, "Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold," IEEE Trans. Audio Speech Lang. Process., vol. 14, pp. 412-424, Mar. 2006.
    • (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , pp. 412-424
    • Davis, A.1    Nordholm, S.2    Togneri, R.3
  • 17
    • 78049406668
    • Voice activity detection using harmonic frequency components in likelihood ratio test
    • L. N. Tan, B. J. Borgstrom, and A. Alwan, "Voice activity detection using harmonic frequency components in likelihood ratio test," in Proc. IEEE ICASSP, Mar. 2010, pp. 4466-4469.
    • (2010) Proc. IEEE ICASSP , pp. 4466-4469
    • Tan, L.N.1    Borgstrom, B.J.2    Alwan, A.3
  • 18
    • 84890539402
    • DARPA Robust Automatic Transcription of Speech (RATS)
    • DARPA Robust Automatic Transcription of Speech (RATS). [Online]. Available: http://projects.ldc.upenn.edu/RATS
  • 20
    • 80051641505
    • Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
    • S. O. Sadjadi and J. H. L. Hansen, "Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions," in Proc. IEEE ICASSP, May 2011, pp. 5448-5451.
    • (2011) Proc. IEEE ICASSP , pp. 5448-5451
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 21
    • 84878408467
    • Mean Hilbert envelope coefficients (MHEC) for robust speaker recognition
    • S. O. Sadjadi, T. Hasan, and J. H. L. Hansen, "Mean Hilbert envelope coefficients (MHEC) for robust speaker recognition," in Proc. INTERSPEECH, Sept. 2012.
    • (2012) Proc. INTERSPEECH
    • Sadjadi, S.O.1    Hasan, T.2    Hansen, J.H.L.3
  • 25
    • 84870238795
    • Multitaper MFCC and PLP features for speaker verification using i-vectors
    • M. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, and D. O'Shaughnessy, "Multitaper MFCC and PLP features for speaker verification using i-vectors," Speech Commun., vol. 55, no. 2, pp. 237-251, Feb. 2013.
    • (2013) Speech Commun. , vol.55 , Issue.2 , pp. 237-251
    • Alam, M.J.1    Kinnunen, T.2    Kenny, P.3    Ouellet, P.4    O'Shaughnessy, D.5
  • 26
    • 0001835850
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of the sampled sound
    • P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of the sampled sound," in Proc. Institute of Phonetic Sciences, vol. 17, 1993, pp. 97-110.
    • (1993) Proc. Institute of Phonetic Sciences , vol.17 , pp. 97-110
    • Boersma, P.1
  • 28
    • 0030648077
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. IEEE ICASSP, Apr. 1997, pp. 1331-1334.
    • (1997) Proc. IEEE ICASSP , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 29
    • 84873315510
    • Unsupervised speech activity detection using voicing measures and perceptual spectral flux
    • S. O. Sadjadi and J. H. L. Hansen, "Unsupervised speech activity detection using voicing measures and perceptual spectral flux," IEEE Signal Process. Lett., vol. 20, pp. 197-200, Mar. 2013.
    • (2013) IEEE Signal Process. Lett. , vol.20 , pp. 197-200
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 30
    • 84890455722
    • HTK-Hidden Markov Model Toolkit v3.4.1
    • HTK-Hidden Markov Model Toolkit v3.4.1. [Online]. Available: http://htk.eng.cam.ac.uk
  • 32
    • 84865733857
    • Analysis of i-vector length normalization in speaker recognition systems
    • D. Garcia-Romero and C. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems," in Proc. INTERSPEECH, Sept. 2011, pp. 249-252.
    • (2011) Proc. INTERSPEECH , pp. 249-252
    • Garcia-Romero, D.1    Espy-Wilson, C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.