-
1
-
-
63249107289
-
Robust speaker recognition in noisy conditions
-
Ming, J., Hazen, T.J., Glass, J.R. and Reynolds, D.A., “Robust Speaker Recognition in Noisy Conditions”, IEEE Tran. on Audio Speech Lang. Proc., Vol 15 (5), 2007, pp. 1711 - 1723.
-
(2007)
IEEE Tran. On Audio Speech Lang. Proc.
, vol.15
, Issue.5
, pp. 1711-1723
-
-
Ming, J.1
Hazen, T.J.2
Glass, J.R.3
Reynolds, D.A.4
-
2
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
Apr
-
Boll, S.F.,“Suppression of acoustic noise in speech using spectral subtraction”, IEEE Trans. Acoust. Speech Signal Process., Vol. 27 (2), Apr. 1979, pp. 113-120.
-
(1979)
IEEE Trans. Acoust. Speech Signal Process.
, vol.27
, Issue.2
, pp. 113-120
-
-
Boll, S.F.1
-
4
-
-
0030671924
-
Missing data techniques for robust speech recognition
-
Cooke, M., Morris, A., Green, P., “Missing data techniques for robust speech recognition”, Proc. ICASSP, 1997, pp. 863-866.
-
(1997)
Proc. ICASSP
, pp. 863-866
-
-
Cooke, M.1
Morris, A.2
Green, P.3
-
5
-
-
85073258179
-
Feature warping for robust speaker verification
-
Greece
-
Pelecanos, J. and Sridharan, S., “Feature warping for robust speaker verification”, Proc. Speaker Odyssey 2001 Speaker Recognition Workshop, Greece, pp. 213-218, 2001.
-
(2001)
Proc. Speaker Odyssey 2001 Speaker Recognition Workshop
, pp. 213-218
-
-
Pelecanos, J.1
Sridharan, S.2
-
6
-
-
0028517164
-
RASTA processing of speech
-
Hermansky, H. and Morgan, N., “RASTA processing of speech,” IEEE Trans. on Speech and Audio Process., Vol. 2, pp. 578-589, 1994.
-
(1994)
IEEE Trans. On Speech and Audio Process.
, vol.2
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
7
-
-
84881675408
-
Cepstral channel normalization techniques for HMM-based speaker verification
-
Rosenberg, A.E., Lee, C. and Soong, F.K., “Cepstral Channel Normalization Techniques for HMM-Based Speaker Verification,” in Proc. ICSLP, pp. 1835-1838, 1994.
-
(1994)
Proc. ICSLP
, pp. 1835-1838
-
-
Rosenberg, A.E.1
Lee, C.2
Soong, F.K.3
-
8
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug
-
Davis, S. and Mermelstein, R., “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences”, IEEE Trans. Acoust. Speech Signal Process., Vol. 28 (4), Aug. 1980, pp. 357-366.
-
(1980)
IEEE Trans. Acoust. Speech Signal Process.
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, R.2
-
9
-
-
0015112070
-
Speech analysis and synthesis by linear prediction of the speech wave
-
Atal, B.S., Hanauer, L.S, “Speech Analysis and Synthesis by Linear Prediction of the Speech Wave”, J. Acoust. America, Vol 50 (28), 1971, pp. 637-655.
-
(1971)
J. Acoust. America
, vol.50
, Issue.28
, pp. 637-655
-
-
Atal, B.S.1
Hanauer, L.S.2
-
10
-
-
0016495091
-
Linear prediction: A tutorial review
-
Makhoul, J., “Linear Prediction: A Tutorial Review”,in Proc. of the IEEE, Vol 63(4), pp. 561-580, 1975.
-
(1975)
Proc. Of the IEEE
, vol.63
, Issue.4
, pp. 561-580
-
-
Makhoul, J.1
-
11
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
Hermansky, H., “Perceptual Linear Predictive (PLP) Analysis of Speech,” J. Acoust. Soc. Am., vol. 87, pp. 1738-1752, 1990.
-
(1990)
J. Acoust. Soc. Am.
, vol.87
, pp. 1738-1752
-
-
Hermansky, H.1
-
12
-
-
80051618525
-
Feature normalization for speaker verification in room reverberation
-
Ganapathy, S., Pelecanos, J. and Omar, M.K., “Feature Normalization for Speaker Verification in Room Reverberation”, Proc. ICASSP, 2011, pp. 4836-4839.
-
(2011)
Proc. ICASSP
, pp. 4836-4839
-
-
Ganapathy, S.1
Pelecanos, J.2
Omar, M.K.3
-
13
-
-
27144453376
-
PLP2 Autoregressive modeling of auditory-like 2-D spectro-temporal patterns
-
Athineos, M. and Hermansky, H. and Ellis, D., “PLP2 Autoregressive modeling of auditory-like 2-D spectro-temporal patterns”, Proc. ISCA Tutorial Research Workshop Statistical and Perceptual Audio Processing SAPA04, pp. 3742, 2004.
-
(2004)
Proc. ISCA Tutorial Research Workshop Statistical and Perceptual Audio Processing SAPA04
, pp. 3742
-
-
Athineos, M.1
Hermansky, H.2
Ellis, D.3
-
14
-
-
0033004349
-
Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications
-
Mar
-
Kumerasan, R. and Rao, A., “Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications,” Journal of Acoustical Society of America, Vol. 105, no 3, pp. 1912-1924, Mar. 1999.
-
(1999)
Journal of Acoustical Society of America
, vol.105
, Issue.3
, pp. 1912-1924
-
-
Kumerasan, R.1
Rao, A.2
-
15
-
-
36248966385
-
Autoregressive modelling of temporal envelopes
-
Athineos, M. and Ellis, D., “Autoregressive modelling of temporal envelopes,” IEEE Tran. Signal Proc., Vol. 55, pp. 5237-5245, 2007.
-
(2007)
IEEE Tran. Signal Proc.
, vol.55
, pp. 5237-5245
-
-
Athineos, M.1
Ellis, D.2
-
16
-
-
0016067897
-
Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
-
Atal, B.S., “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification”, J. Acoust. America, Vol 55 (6), 1974, pp. 1304-1312.
-
(1974)
J. Acoust. America
, vol.55
, Issue.6
, pp. 1304-1312
-
-
Atal, B.S.1
-
17
-
-
84906248150
-
-
website
-
“National Institute of Standards and Technology (NIST),” speech group website, http://www.nist.gov/speech, 2010.
-
(2010)
Speech Group
-
-
-
18
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P and Ouellet, P., “Front-End Factor Analysis for Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, Vol. 19(4), pp. 788-798, 2011.
-
(2011)
IEEE Transactions on Audio, Speech and Language Processing
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
19
-
-
84865733857
-
Analysis of i-vector Length Normalization in Speaker Recognition Systems
-
Romero, D. and Espy-Wilson, C.Y., “Analysis of i-vector Length Normalization in Speaker Recognition Systems”, Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Romero, D.1
Espy-Wilson, C.Y.2
-
21
-
-
0032634932
-
Computing the discrete-time analytic signal via FFT
-
Marple, L.S., “Computing the Discrete-Time Analytic Signal via FFT”, IEEE Trans. on Acoust., Speech and Sig. Proc., Vol. 47, pp. 2600-2603, 1999.
-
(1999)
IEEE Trans. On Acoust., Speech and Sig. Proc.
, vol.47
, pp. 2600-2603
-
-
Marple, L.S.1
-
22
-
-
67650107416
-
Recognition of reverberant speech using frequency domain linear prediction
-
Dec
-
Thomas, S., Ganapathy, S. and Hermansky, H. “Recognition of Reverberant Speech Using Frequency Domain Linear Prediction,” IEEE Signal Proc. Letters, Vol. 15, Dec. 2008, pp. 681-684.
-
(2008)
IEEE Signal Proc. Letters
, vol.15
, pp. 681-684
-
-
Thomas, S.1
Ganapathy, S.2
Hermansky, H.3
-
23
-
-
0028429724
-
Symmetric convolution and the discrete sine and cosine transforms
-
Martucci, S.A., “Symmetric convolution and the discrete sine and cosine transforms”, IEEE Tran. Signal Proc., Vol. 42(5), 1994 pp. 1038-1051.
-
(1994)
IEEE Tran. Signal Proc.
, vol.42
, Issue.5
, pp. 1038-1051
-
-
Martucci, S.A.1
-
24
-
-
0029355999
-
Speaker Identification and Verification using Gaussian Mixture Speaker Models
-
Aug
-
Reynolds, D., “Speaker Identification and Verification using Gaussian Mixture Speaker Models,” Speech Comm. Vol. 17, Aug. 1995, pp. 91-108.
-
(1995)
Speech Comm
, vol.17
, pp. 91-108
-
-
Reynolds, D.1
-
27
-
-
83455246037
-
Multi-layer Perceptron Based Speech Activity Detection for Speaker Verification
-
Ganapathy, S., Rajan, P. and Hermansky, H., “Multi-layer Perceptron Based Speech Activity Detection for Speaker Verification”, IEEE Workshop on Application of Signal Proc. to Audio and Acoustics, 2011, pp. 321-324.
-
(2011)
IEEE Workshop on Application of Signal Proc. To Audio and Acoustics
, pp. 321-324
-
-
Ganapathy, S.1
Rajan, P.2
Hermansky, H.3
|