-
1
-
-
0031238211
-
ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
-
Sept
-
A. Benyassine, E. Shlomot, H.-Y. Su, D. Massaloux, C. Lamblin, and J.-P. Petit, "ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Commun. Mag., vol. 35, pp. 64-73, Sept. 1997.
-
(1997)
IEEE Commun. Mag.
, vol.35
, pp. 64-73
-
-
Benyassine, A.1
Shlomot, E.2
Su, H.-Y.3
Massaloux, D.4
Lamblin, C.5
Petit, J.-P.6
-
2
-
-
0016962193
-
A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition
-
Jun
-
B. S. Atal and L. R. Rabiner, "A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition," IEEE Trans. Audio Speech Lang. Process., vol. 24, no. 3, pp. 201-2012, Jun. 1976.
-
(1976)
IEEE Trans. Audio Speech Lang. Process.
, vol.24
, Issue.3
, pp. 201-2012
-
-
Atal, B.S.1
Rabiner, L.R.2
-
3
-
-
14544287662
-
Robust detection of speech activity in the presence of noise
-
Dec
-
R. Sarikaya and J. H. L. Hansen, "Robust detection of speech activity in the presence of noise," in Proc. ICSLP, Dec. 1998, pp. 1455-1458.
-
(1998)
Proc. ICSLP
, pp. 1455-1458
-
-
Sarikaya, R.1
Hansen, J.H.L.2
-
4
-
-
17344389852
-
Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
-
May
-
B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system," in Proc. IEEE ICASSP, May 2002, pp. 53-56.
-
(2002)
Proc. IEEE ICASSP
, pp. 53-56
-
-
Kingsbury, B.1
Saon, G.2
Mangu, L.3
Padmanabhan, M.4
Sarikaya, R.5
-
5
-
-
33947620115
-
Hierarchical structures of neural networks for phoneme recognition
-
May
-
P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," in Proc. IEEE ICASSP, May 2006, p. I.
-
(2006)
Proc. IEEE ICASSP
, pp. 1
-
-
Schwarz, P.1
Matejka, P.2
Cernocky, J.3
-
6
-
-
79960665866
-
The delta-phase spectrum with application to voice activity detection and speaker recognition
-
Sept
-
I. McCowan, D. Dean, M. McLaren, R. Vogt, and S. Sridharan, "The delta-phase spectrum with application to voice activity detection and speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 7, pp. 2026-2038, Sept. 2011.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.7
, pp. 2026-2038
-
-
McCowan, I.1
Dean, D.2
McLaren, M.3
Vogt, R.4
Sridharan, S.5
-
7
-
-
84878535284
-
Developing a speech activity detection system for the darpa rats program
-
Sept
-
T. Ng, B. Zhang, L. Nguyen, S. Matsoukas, X. Zhou, N. Mesgarani, K. Vesely, and P. Matejka, "Developing a speech activity detection system for the DARPA RATS program," in Proc. INTERSPEECH, Sept. 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Ng, T.1
Zhang, B.2
Nguyen, L.3
Matsoukas, S.4
Zhou, X.5
Mesgarani, N.6
Vesely, K.7
Matejka, P.8
-
8
-
-
84878413949
-
Patrol team language identication system for DARPA RATS P1 evaluation
-
Sept
-
P. Matejka, O. Plchot, M. Soufifar, O. Glembek, L. D'Haro, K. Vesely, F. Grezl, J. Ma, S. Matsoukas, and N. Dehak, "Patrol team language identication system for DARPA RATS P1 evaluation," in Proc. INTERSPEECH, Sept. 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Matejka, P.1
Plchot, O.2
Soufifar, M.3
Glembek, O.4
D'haro, L.5
Vesely, K.6
Grezl, F.7
Ma, J.8
Matsoukas, S.9
Dehak, N.10
-
9
-
-
74549182880
-
A multidecision sub-band voice activity detector
-
Sept
-
A. Davis, S. Nordholm, S. Y. Low, and R. Togneri, "A multidecision sub-band voice activity detector," in Proc. EUSIPCO, Sept. 2006.
-
(2006)
Proc. EUSIPCO
-
-
Davis, A.1
Nordholm, S.2
Low, S.Y.3
Togneri, R.4
-
10
-
-
84873310339
-
The RATS radio traffic collection system
-
Jun
-
K. Walker and S. Strassel, "The RATS radio traffic collection system," in Proc. ISCA Odyssey, Jun. 2012.
-
(2012)
Proc. ISCA Odyssey
-
-
Walker, K.1
Strassel, S.2
-
11
-
-
84878548167
-
Speech activity detection for noisy data using adaptation techniques
-
Sept
-
M. K. Omar, "Speech activity detection for noisy data using adaptation techniques," in Proc. INTERSPEECH, Sept. 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Omar, M.K.1
-
12
-
-
1842476689
-
Efficient voice activity detection algorithms using longterm speech information
-
Apr
-
J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "Efficient voice activity detection algorithms using longterm speech information," Speech Commun., vol. 42, pp. 271-287, Apr. 2004.
-
(2004)
Speech Commun.
, vol.42
, pp. 271-287
-
-
Ramirez, J.1
Segura, J.C.2
Benitez, C.3
De La Torre, A.4
Rubio, A.5
-
13
-
-
79959844439
-
Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments
-
Sept
-
M. C. Huggins, B. Y. Smolenski, and A. D. Lawson, "Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments," in Proc. INTERSPEECH, Sept. 2010, pp. 3094-3097.
-
(2010)
Proc. INTERSPEECH
, pp. 3094-3097
-
-
Huggins, M.C.1
Smolenski, B.Y.2
Lawson, A.D.3
-
14
-
-
0032762471
-
A statistical model-based voice activity detection
-
Jan
-
J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
-
(1999)
IEEE Signal Process. Lett.
, vol.6
, Issue.1
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
-
15
-
-
23344452899
-
Statistical voice activity detection using a multiple observation likelihood ratio test
-
Oct
-
J. Ramirez, J. Segura, C. Benitez, L. Garcia, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, Oct. 2005.
-
(2005)
IEEE Signal Process. Lett.
, vol.12
, Issue.10
, pp. 689-692
-
-
Ramirez, J.1
Segura, J.2
Benitez, C.3
Garcia, L.4
Rubio, A.5
-
16
-
-
33846259282
-
Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
-
Mar
-
A. Davis, S. Nordholm, and R. Togneri, "Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold," IEEE Trans. Audio Speech Lang. Process., vol. 14, pp. 412-424, Mar. 2006.
-
(2006)
IEEE Trans. Audio Speech Lang. Process.
, vol.14
, pp. 412-424
-
-
Davis, A.1
Nordholm, S.2
Togneri, R.3
-
17
-
-
78049406668
-
Voice activity detection using harmonic frequency components in likelihood ratio test
-
Mar
-
L. N. Tan, B. J. Borgstrom, and A. Alwan, "Voice activity detection using harmonic frequency components in likelihood ratio test," in Proc. IEEE ICASSP, Mar. 2010, pp. 4466-4469.
-
(2010)
Proc. IEEE ICASSP
, pp. 4466-4469
-
-
Tan, L.N.1
Borgstrom, B.J.2
Alwan, A.3
-
18
-
-
84890539402
-
-
DARPA Robust Automatic Transcription of Speech (RATS)
-
DARPA Robust Automatic Transcription of Speech (RATS).[Online]. Available: http://projects.ldc.upenn.edu/RATS
-
-
-
-
19
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
May
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 4, pp. 788-798, May 2011.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
20
-
-
80051641505
-
Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
-
May
-
S. O. Sadjadi and J. H. L. Hansen, "Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions," in Proc. IEEE ICASSP, May 2011, pp. 5448-5451.
-
(2011)
Proc. IEEE ICASSP
, pp. 5448-5451
-
-
Sadjadi, S.O.1
Hansen, J.H.L.2
-
21
-
-
84878408467
-
Mean Hilbert envelope coefficients (MHEC) for robust speaker recognition
-
Sept
-
S. O. Sadjadi, T. Hasan, and J. H. L. Hansen, "Mean Hilbert envelope coefficients (MHEC) for robust speaker recognition," in Proc. INTERSPEECH, Sept. 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Sadjadi, S.O.1
Hasan, T.2
Hansen, J.H.L.3
-
22
-
-
84858995082
-
Linear versus mel frequency cepstral coefficients for speaker recognition
-
Dec
-
X. Zhou, D. Garcia-Romero, R. Duraiswami, C. Espy-Wilson, and S. Shamma, "Linear versus mel frequency cepstral coefficients for speaker recognition," in Proc. IEEE ASRU, Dec. 2011, pp. 559-564.
-
(2011)
Proc. IEEE ASRU
, pp. 559-564
-
-
Zhou, X.1
Garcia-Romero, D.2
Duraiswami, R.3
Espy-Wilson, C.4
Shamma, S.5
-
23
-
-
84878378089
-
Regularized all-pole models for speaker verification under noisy environments
-
Mar
-
C. Hanilci, T. Kinnunen, F. Ertas, R. Saeidi, J. Pohjalainen, and P. Alku, "Regularized all-pole models for speaker verification under noisy environments," IEEE Signal Process. Lett., vol. 19, pp. 163-166, Mar. 2012.
-
(2012)
IEEE Signal Process. Lett.
, vol.19
, pp. 163-166
-
-
Hanilci, C.1
Kinnunen, T.2
Ertas, F.3
Saeidi, R.4
Pohjalainen, J.5
Alku, P.6
-
24
-
-
84860850285
-
Low-variance multitaper MFCC features: A case study in robust speaker verification
-
Sept
-
T. Kinnunen, R. Saeidi, F. Sedlak, K. A. Lee, J. Sandberg, M. Hansson-Sandsten, and H. Li, "Low-variance multitaper MFCC features: A case study in robust speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 7, pp. 1990-2001, Sept. 2012.
-
(2012)
IEEE Trans. Audio Speech Lang. Process.
, vol.20
, Issue.7
, pp. 1990-2001
-
-
Kinnunen, T.1
Saeidi, R.2
Sedlak, F.3
Lee, K.A.4
Sandberg, J.5
Hansson-Sandsten, M.6
Li, H.7
-
25
-
-
84870238795
-
Multitaper MFCC and PLP features for speaker verification using i-vectors
-
Feb
-
M. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, and D. O'Shaughnessy, "Multitaper MFCC and PLP features for speaker verification using i-vectors," Speech Commun., vol. 55, no. 2, pp. 237-251, Feb. 2013.
-
(2013)
Speech Commun.
, vol.55
, Issue.2
, pp. 237-251
-
-
Alam, M.J.1
Kinnunen, T.2
Kenny, P.3
Ouellet, P.4
O'shaughnessy, D.5
-
26
-
-
0001835850
-
Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of the sampled sound
-
P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of the sampled sound," in Proc. Institute of Phonetic Sciences, vol. 17, 1993, pp. 97-110.
-
(1993)
Proc. Institute of Phonetic Sciences
, vol.17
, pp. 97-110
-
-
Boersma, P.1
-
28
-
-
0030648077
-
Construction and evaluation of a robust multifeature speech/music discriminator
-
E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. IEEE ICASSP, Apr. 1997, pp. 1331-1334.
-
Proc. IEEE ICASSP, Apr. 1997
, pp. 1331-1334
-
-
Scheirer, E.1
Slaney, M.2
-
29
-
-
84873315510
-
Unsupervised speech activity detection using voicing measures and perceptual spectral flux
-
Mar
-
S. O. Sadjadi and J. H. L. Hansen, "Unsupervised speech activity detection using voicing measures and perceptual spectral flux," IEEE Signal Process. Lett., vol. 20, pp. 197-200, Mar. 2013.
-
(2013)
IEEE Signal Process. Lett
, vol.20
, pp. 197-200
-
-
Sadjadi, S.O.1
Hansen, J.H.L.2
-
30
-
-
84890455722
-
-
HTK-Hidden Markov Model Toolkit v3. 4. 1
-
HTK-Hidden Markov Model Toolkit v3.4.1.[Online]. Available: http://htk.eng.cam.ac.uk
-
-
-
-
31
-
-
50649094277
-
Probabilistic linear discriminant analysis for inferences about identity
-
S. Prince and J. Elder, "Probabilistic linear discriminant analysis for inferences about identity," in Proc. IEEE Int. Conf. Computer Vision, ICCV 2007, Oct. 2007, pp. 1-8.
-
Proc. IEEE Int. Conf. Computer Vision, ICCV 2007, Oct. 2007
, pp. 1-8
-
-
Prince, S.1
Elder, J.2
-
32
-
-
84865733857
-
Analysis of i-vector length normalization in speaker recognition systems
-
D. Garcia-Romero and C. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems," in Proc. INTERSPEECH, Sept. 2011, pp. 249-252.
-
Proc. INTERSPEECH, Sept. 2011
, pp. 249-252
-
-
Garcia-Romero, D.1
Espy-Wilson, C.2
|