2013, Pages 2484-2488

Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data

Author keywords

GMM UBM; Multi layer perceptron (MLP); NIST SRE 2008; Principal component analysis (PCA); Speaker verification

Indexed keywords

PRINCIPAL COMPONENT ANALYSIS; TELEPHONE SETS

EID: 84906241163
PISSN: 2308457X
EISSN: 19909772
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper
Times cited: 15

References (24)
  • 1
    • EID: 70350125882
    • Kinnunen, T. and Li, H., "An overview of text-independent speaker recognition: from features to supervectors", Speech Communication, 52(1):12-40, Jan. 2010.
  • 2
    • EID: 0022667694
    • Furui, S., "Speaker-independent isolated word recognition using dynamic features of speech spectrum", IEEE Trans. on Acoustics, Speech, and Signal Processing, 34(1):52-59, Feb. 1986.
  • 3
    • EID: 85032751546
    • Morgan, N., et al., "Pushing the envelope - Aside", IEEE Signal Processing Magazine, 22(5):81-88, Sep. 2005.
  • 4
    • EID: 80051613059
    • Lamel, L., Gauvain, J.-L., Le, V.B., Oparin, I. and Meng, S., "Improved models for Mandarin speech-to-text transcription", IEEE ICASSP, pp. 4660-4663, May 22-25, Prague, Czech Republic, 2011.
  • 5
    • EID: 79959848126
    • Valente, F., Magimai-Doss, M., Plahl, C., Ravuri, S. and Wang, W., "A comparative large scale study of MLP features for Mandarin ASR", INTERSPEECH, pp. 2630-2633, September 26-30, Makuhari, Japan, 2010.
  • 6
    • EID: 51449103447
    • Grezl, F. and Fousek, P., "Optimizing bottle-neck features for LVCSR", IEEE ICASSP, pp. 4729-4732, March 30 - April 04, Las Vegas, USA, 2008.
  • 7
    • EID: 2942594475
    • Bimbot, F., et al., "A tutorial on text-independent speaker verification", EURASIP Journal on Applied Signal Processing, 24(4):430-451, 2004.
  • 8
    • EID: 0033746018
    • Heck, L.P., Konig, Y., Sonmez, M.K. and Weintraub, M., "Robustness to telephone handset distortion in speaker recognition by discriminative feature design", Speech Communication, 31(2-3):181-192, Jun. 2000.
  • 9
    • EID: 33745477958
    • Wu, D., Morris, A. and Koreman, J., "MLP internal representation as discriminative features for improved speaker recognition", NOLISP'05, pp. 72-80, April 19-22, Barcelona, Spain, 2005.
  • 10
    • EID: 85073199671
    • Yaman, S., Pelecanos, J. and Sarikaya, R., "Bottleneck features for speaker recognition", Odyssey'12, pp. 105-108, June 25-28, Singapore, 2012.
  • 11
    • EID: 38549166347
    • Stoll, L., Frankel, J. and Mirghafori, N., "Speaker recognition via nonlinear discriminant features", NOLISP'07, pp. 114-123, May 22-25, Paris, France, 2007.
  • 12
    • EID: 0025041264
    • Hermansky, H., "Perceptual linear predictive (PLP) analysis of speech", J. Acoust. Soc. Am., 87(4):1738-1752, 1990.
  • 13
    • EID: 36248960119
    • Shriberg, E., "High-level features in speaker recognition", in Speaker Classification (C. Mueller, Ed.), Lecture Notes in Artificial Intelligence, vol. 4343, Springer, Heidelberg, Germany, 2007.
  • 14
    • EID: 84867209138
    • Fousek, P., Lamel, L. and Gauvain, J.-L., "Transcribing broadcast data using MLP features", INTERSPEECH, pp. 1433-1436, September 22-26, Brisbane, Australia, 2008.
  • 15
    • EID: 33745208455
    • Prasad, R., et al., "The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system", INTERSPEECH, pp. 1645-1648, September 04-08, Lisbon, Portugal, 2005.
  • 16
    • EID: 33745185321
    • Zhu, Q., Stolcke, A., Chen, B.Y. and Morgan, N., "Using MLP features in SRI's conversational speech recognition system", INTERSPEECH, pp. 2141-2144, September 04-08, Lisbon, Portugal, 2005.
  • 17
    • EID: 34548463136
    • Jolliffe, I.T., "Principal component analysis", Springer series in statistics, Springer-Verlag, 2nd ed., pp. 487, 2002.
  • 18
    • EID: 0033884858
    • Reynolds, D., Quatieri, T. and Dunn, R., "Speaker verification using adapted Gaussian mixture models", Digital Signal Processing, 10(1-3):19-41, 2000.
  • 20
    • EID: 0028419019
    • Gauvain, J.-L. and Lee, C.-H., "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains", IEEE Trans. on Speech and Audio Processing, 2(2):291-298, Apr. 1994.
  • 22
    • EID: 33646768994
    • Sturim, D.E. and Reynolds, D.A., "Speaker adaptive cohort selection for T-norm in text-independent speaker verification", IEEE ICASSP, pp. 741-744, March 18-23, Philadelphia, USA, 2005.
  • 24
    • EID: 0010534620
    • Jin, Q. and Waibel, A., "Application of LDA to speaker recognition", ISCA ICSLP, pp. 250-253, October 16-20, Beijing, China, 2000.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.