SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 2484-2488

Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data

(4) Do, Cong Thanh a Barras, Claude a Le, Viet Bac b Sarkar, Achintya K a

a CNRS (France)

b Vocapia Research (France)

Author keywords

GMM UBM; Multi layer perceptron (MLP); NIST SRE 2008; Principal component analysis (PCA); Speaker verification

Indexed keywords

PRINCIPAL COMPONENT ANALYSIS; TELEPHONE SETS;

AUTOMATIC SPEECH RECOGNITION; GMM-UBM; MLP (MULTILAYER PERCEPTRON); MULTI LAYER PERCEPTRON; NIST SRE 2008; SPEAKER RECOGNITION EVALUATIONS; SPEAKER VERIFICATION; TEXT-INDEPENDENT SPEAKER VERIFICATION;

SPEECH RECOGNITION;

EID: 84906241163 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (15)

References (24)

1
- 70350125882
- An overview of textindependent speaker recognition: From features to supervectors
- Jan
- Kinnunen, T. and Li, H., "An overview of textindependent speaker recognition: from features to supervectors", Speech Communication, 52(1):12-40, Jan. 2010.
- (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

2
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- Feb
- Furui, S., "Speaker-independent isolated word recognition using dynamic features of speech spectrum", IEEE Trans. on Acoustics, Speech, and Signal Processing, 34(1):52- 59, Feb. 1986.
- (1986) IEEE Trans. on Acoustics, Speech, and Signal Processing , vol.34 , Issue.1 , pp. 52-59
- Furui, S.¹

3
- 85032751546
- Pushing the envelope - Aside
- Sep
- Morgan, N., et al., "Pushing the envelope - Aside", IEEE Signal Processing Magazine, 22(5):81-88, Sep. 2005.
- (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 81-88
- Morgan, N.¹

4
- 80051613059
- Improved models for Mandarin speech-to-text transcription
- May 22-25, Prague, Czech Republic
- Lamel, L., Gauvain, J.-L., Le, V.B., Oparin, I. and Meng, S., "Improved models for Mandarin speech-to-text transcription", IEEE ICASSP, pp. 4660-4663, May 22-25, Prague, Czech Republic, 2011.
- (2011) IEEE ICASSP , pp. 4660-4663
- Lamel, L.¹ Gauvain, J.-L.² Le, V.B.³ Oparin, I.⁴ Meng, S.⁵

5
- 79959848126
- A comparative large scale study of MLP features for Mandarin ASR
- September 26-30, Makuhari, Japan
- Valente, F., Magimai-Doss, M., Plahl, C., Ravuri, S. and Wang, W., "A comparative large scale study of MLP features for Mandarin ASR", INTERSPEECH, pp. 2630- 2633, September 26-30, Makuhari, Japan, 2010.
- (2010) Interspeech , pp. 2630-2633
- Valente, F.¹ Magimai-Doss, M.² Plahl, C.³ Ravuri, S.⁴ Wang, W.⁵

6
- 51449103447
- Optimizing bottle-neck features for LVCSR
- March 30 - April 04, Las Vegas, USA
- Grezl, F. and Fousek, P., "Optimizing bottle-neck features for LVCSR", IEEE ICASSP, pp. 4729-4732, March 30 - April 04, Las Vegas, USA, 2008.
- (2008) IEEE ICASSP , pp. 4729-4732
- Grezl, F.¹ Fousek, P.²

7
- 2942594475
- A tutorial on text-independent speaker verification
- Bimbot, F., et al., "A tutorial on text-independent speaker verification", EURASIP Journal on Applied Signal Processing, 24(4):430-451, 2004.
- (2004) EURASIP Journal on Applied Signal Processing , vol.24 , Issue.4 , pp. 430-451
- Bimbot, F.¹

8
- 0033746018
- Robustness to telephone handset distortion in speaker recognition by discriminative feature design
- Jun
- Heck, L. P., Konig, Y., Sonmez, M. K. and Weintraub, M., "Robustness to telephone handset distortion in speaker recognition by discriminative feature design", Speech Communication, 31(2-3):181-192, Jun. 2000.
- (2000) Speech Communication , vol.31 , Issue.2-3 , pp. 181-192
- Heck, L.P.¹ Konig, Y.² Sonmez, M.K.³ Weintraub, M.⁴

9
- 33745477958
- MLP internal representation as discriminative features for improved speaker recognition
- April 19- 22, Barcelona, Spain
- Wu, D., Morris, A. and Koreman, J., "MLP internal representation as discriminative features for improved speaker recognition", NOLISP'05, pp. 72-80, April 19- 22, Barcelona, Spain, 2005.
- (2005) NOLISP'05 , pp. 72-80
- Wu, D.¹ Morris, A.² Koreman, J.³

10
- 85073199671
- Bottleneck features for speaker recognition
- June 25-28, Singapore
- Yaman, S., Pelecanos, J. and Sarikaya, R., "Bottleneck features for speaker recognition", Odyssey'12, pp. 105- 108, June 25-28, Singapore, 2012.
- (2012) Odyssey'12 , pp. 105-108
- Yaman, S.¹ Pelecanos, J.² Sarikaya, R.³

11
- 38549166347
- Speaker recognition via nonlinear discriminant features
- May 22-25, Paris, France
- Stoll, L., Frankel, J. and Mirghafori, N., "Speaker recognition via nonlinear discriminant features", NOLISP'07, pp. 114-123, May 22-25, Paris, France, 2007.
- (2007) NOLISP'07 , pp. 114-123
- Stoll, L.¹ Frankel, J.² Mirghafori, N.³

12
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hermansky, H., "Perceptual linear predictive (PLP) analysis of speech", J. Acoust. Soc. Am., 87(4):1738-1752, 1990.
- (1990) J. Acoust. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

13
- 36248960119
- High-level features in speaker recognition
- (C. Mueller Eds.), Springer, Heidelberg, Germany
- Shriberg, E., "High-level features in speaker recognition", Lecture Notes in Artificial Intelligence, Speaker Classification (C. Mueller Eds.), Springer, Heidelberg, Germany, vol. 4343, 2007.
- (2007) Lecture Notes in Artificial Intelligence, Speaker Classification , vol.4343
- Shriberg, E.¹

14
- 84867209138
- Transcribing broadcast data using MLP features
- September 22-26, Brisbane, Australia
- Fousek, P., Lamel, L. and Gauvain, J.-L., "Transcribing broadcast data using MLP features", INTERSPEECH, pp. 1433-1436, September 22-26, Brisbane, Australia, 2008.
- (2008) Interspeech , pp. 1433-1436
- Fousek, P.¹ Lamel, L.² Gauvain, J.-L.³

15
- 33745208455
- The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system
- September 04-08, Lisbon, Portugal
- Prasad, R., et al., "The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system", INTERSPEECH, pp. 1645-1648, September 04-08, Lisbon, Portugal, 2005.
- (2005) Interspeech , pp. 1645-1648
- Prasad, R.¹

16
- 33745185321
- Using MLP features in SRI's conversational speech recognition system
- September 04- 08, Lisbon, Portugal
- Zhu, Q., Stolcke, A., Chen, B.Y. and Morgan, N., "Using MLP features in SRI's conversational speech recognition system", INTERSPEECH, pp. 2141-2144, September 04- 08, Lisbon, Portugal, 2005.
- (2005) Interspeech , pp. 2141-2144
- Zhu, Q.¹ Stolcke, A.² Chen, B.Y.³ Morgan, N.⁴

17
- 34548463136
- Springer series in statistics, Springer-Verlag, 2nd Eds
- Jolliffe, I.T., "Principal component analysis", Springer series in statistics, Springer-Verlag, 2nd Eds., pp. 487, 2002.
- (2002) Principal Component Analysis , pp. 487
- Jolliffe, I.T.¹

18
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- Reynolds, D., Quatieri, T. and Dunn, R., "Speaker verification using adapted Gaussian mixture models", Digital Signal Processing, 87:19-41, 2000.
- (2000) Digital Signal Processing , vol.87 , pp. 19-41
- Reynolds, D.¹ Quatieri, T.² Dunn, R.³

19
- 51549102132
- "The NIST year 2004 speaker recognition evaluation plan", http://www.itl.nist.gov/iad/mig/tests/spk/2004, 2004.
- (2004) The NIST Year 2004 Speaker Recognition Evaluation Plan

20
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- Gauvain, J.-L. and Lee, C.-H., "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains", IEEE Trans. on Speech and Audio Processing, 2(2):291-298, Apr. 1994.
- (1994) IEEE Trans. on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

21
- 64549135569
- "The NIST year 2008 speaker recognition evaluation plan", http://www.itl.nist.gov/iad/mig/tests/sre/2008/2008.
- (2008) The NIST Year 2008 Speaker Recognition Evaluation Plan

22
- 33646768994
- Speaker adaptive cohort selection for T-norm in text-independent speaker verification
- March 18-23, Philadelphia, USA
- Sturim, D.E. and Reynolds, D.A., "Speaker adaptive cohort selection for T-norm in text-independent speaker verification", IEEE ICASSP, pp. 741-744, March 18-23, Philadelphia, USA, 2005.
- (2005) IEEE ICASSP , pp. 741-744
- Sturim, D.E.¹ Reynolds, D.A.²

23
- 84906279562
- Cochlear implant-like processing of speech signal for speaker verification
- September 07- 08, Portland, OR, USA
- Do, C.-T. and Barras, C., "Cochlear implant-like processing of speech signal for speaker verification", SAPA (Statistical and Perceptual Audition) Conference, satellite workshop of INTERSPEECH, pp. 17-21, September 07- 08, Portland, OR, USA, 2012.
- (2012) SAPA (Statistical and Perceptual Audition) Conference, Satellite Workshop of INTERSPEECH , pp. 17-21
- Do, C.-T.¹ Barras, C.²

24
- 0010534620
- Application of LDA to speaker recognition
- October 16-20, Beijing, China
- Jin, Q. and Waibel, A., "Application of LDA to speaker recognition", ISCA ICSLP, pp. 250-253, October 16-20, Beijing, China, 2000.
- (2000) ISCA ICSLP , pp. 250-253
- Jin, Q.¹ Waibel, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.