메뉴 건너뛰기




Volumn 44, Issue 10-11, 2011, Pages 2749-2759

Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram

Author keywords

Discrete cosine transform; Feature extraction; Radon transform; Speaker recognition; Spectrogram

Indexed keywords

ACOUSTIC CHARACTERISTIC; ACOUSTIC FEATURES; COMPUTATIONALLY EFFICIENT; FEATURE EXTRACTION TECHNIQUES; FEATURE VECTORS; IMAGE PROCESSING TECHNIQUE; LOW DIMENSIONAL; MASSACHUSETTS INSTITUTE OF TECHNOLOGY; PIXEL VALUES; RADON TRANSFORM; RECOGNITION RATES; SPEAKER RECOGNITION; SPECTROGRAM; SPECTROGRAMS; SPEECH SPECTROGRAM; STRAIGHT LINES; TEXAS INSTRUMENTS; TEXT-INDEPENDENT SPEAKER IDENTIFICATION; TIMIT DATABASE;

EID: 79958815892     PISSN: 00313203     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patcog.2011.04.009     Document Type: Article
Times cited : (64)

References (39)
  • 4
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen, and H. Li An overview of text-independent speaker recognition: from features to supervectors Speech Communication 52 1 2010 12 40
    • (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 8
    • 34548248573 scopus 로고    scopus 로고
    • Explicit modelling of session variability for speaker verification
    • DOI 10.1016/j.csl.2007.05.003, PII S0885230807000277
    • R. Vogt, and S. Sridharan Explicit modeling of session variability for speaker verification Computer Speech and Language 22 1 2008 17 38 (Pubitemid 47333032)
    • (2008) Computer Speech and Language , vol.22 , Issue.1 , pp. 17-38
    • Vogt, R.1    Sridharan, S.2
  • 11
    • 0035935951 scopus 로고    scopus 로고
    • GMM based on local PCA for speaker identification
    • DOI 10.1049/el:20010976
    • C.W. Seo, K.Y. Lee, and J. Lee GMM based on local PCA for speaker identification Electronics Letter 37 24 2001 1486 1488 (Pubitemid 33128315)
    • (2001) Electronics Letters , vol.37 , Issue.24 , pp. 1486-1488
    • Seo, C.1    Lee, K.Y.2    Lee, J.3
  • 13
    • 0019053271 scopus 로고
    • Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences
    • S.B. Davis, and P. Mermelstain Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences IEEE Transactions on Acoustic, Speech and Signal Processing 28 4 1980 357 366
    • (1980) IEEE Transactions on Acoustic, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstain, P.2
  • 14
    • 0028515984 scopus 로고
    • Experimental evaluation of features for robust speaker identification
    • D.A. Reynolds Experimental evaluation of features for robust speaker identification IEEE Transactions on Speech and Audio Processing 2 4 1994 639 643
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.4 , pp. 639-643
    • Reynolds, D.A.1
  • 15
    • 0028466072 scopus 로고
    • The importance of cepstral parameter correlations in speech recognition
    • A. Ljolje The importance of cepstral parameter correlations in speech recognition Computer Speech and Language 8 1994 223 232
    • (1994) Computer Speech and Language , vol.8 , pp. 223-232
    • Ljolje, A.1
  • 16
    • 0033154048 scopus 로고    scopus 로고
    • Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification
    • K. You, and H. Wang Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification Speech Communication 28 3 1999 227 241
    • (1999) Speech Communication , vol.28 , Issue.3 , pp. 227-241
    • You, K.1    Wang, H.2
  • 19
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D.A. Reynolds, and R.C. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Transactions on Speech and Audio Processing 3 1 1995 72 82
    • (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , Issue.1 , pp. 72-82
    • Reynolds, D.A.1    Rose, R.C.2
  • 27
    • 0037211087 scopus 로고    scopus 로고
    • Sub-band SNR estimation using auditory feature processing
    • DOI 10.1016/S0167-6393(02)00058-4, PII S0167639302000584
    • M. Kleinschmidt, and V. Hohmann Sub-band SNR estimation using auditory feature processing Speech Communication 39 12 2003 47 63 (Pubitemid 35412361)
    • (2003) Speech Communication , vol.39 , Issue.1-2 , pp. 47-63
    • Kleinschmidt, M.1    Hohmann, V.2
  • 28
    • 77949346152 scopus 로고    scopus 로고
    • Methods for capturing spectro-temporal modulations in automatic speech recognition
    • M. Kleinschmidt Methods for capturing spectro-temporal modulations in automatic speech recognition Acta Acustica 8 2001 1 6
    • (2001) Acta Acustica , vol.8 , pp. 1-6
    • Kleinschmidt, M.1
  • 29
    • 23744508888 scopus 로고    scopus 로고
    • Multiresolution spectrotemporal analysis of complex sounds
    • DOI 10.1121/1.1945807
    • T. Chih, P. Ru, and S. Shamma Multi resolution spectro-temporal analysis of complex sounds Journal of Acoustic Society of America 118 2005 887 906 (Pubitemid 41129224)
    • (2005) Journal of the Acoustical Society of America , vol.118 , Issue.2 , pp. 887-906
    • Chi, T.1    Ru, P.2    Shamma, S.A.3
  • 33
    • 34247137524 scopus 로고    scopus 로고
    • A speech-and-speaker identification system: Feature extraction, description, and classification of speech-signal image
    • DOI 10.1109/TIE.2007.891647
    • K. Saeed, and M.K. Nammous. A speech-and-speaker identification system: feature extraction, description, and classification of speech-signal image IEEE Transactions on Industrial Electronics 54 2 2007 887 897 (Pubitemid 46591011)
    • (2007) IEEE Transactions on Industrial Electronics , vol.54 , Issue.2 , pp. 887-897
    • Saeed, K.1    Nammous, M.K.2
  • 36
    • 20444411826 scopus 로고    scopus 로고
    • Rotation invariant multiresolution texture analysis using radon and wavelet transform
    • Kourosh Jafari-Khouzani, and Humid Soltanian-Zadeh Rotation invariant multiresolution texture analysis using radon and wavelet transform IEEE Transactions on Image Processing 14 6 2005 783 794
    • (2005) IEEE Transactions on Image Processing , vol.14 , Issue.6 , pp. 783-794
    • Jafari-Khouzani, K.1    Soltanian-Zadeh, H.2
  • 37
    • 34547697579 scopus 로고    scopus 로고
    • Scaling and rotation invariant approach to object recognition based on Radon and FourierMellin transforms
    • W. Xuan, X. Bin, M. JianFeng, and B. Xiu-Li Scaling and rotation invariant approach to object recognition based on Radon and FourierMellin transforms Pattern Recognition 40 12 2007 3503 3508
    • (2007) Pattern Recognition , vol.40 , Issue.12 , pp. 3503-3508
    • Xuan, W.1    Bin, X.2    Jianfeng, M.3    Xiu-Li, B.4
  • 38
    • 25844493811 scopus 로고    scopus 로고
    • PCA and LDA in DCT domain
    • DOI 10.1016/j.patrec.2005.05.004, PII S0167865505001522
    • W. Chen, M.J. Er, and S. Wu PCA and LDA in DCT domain Pattern Recognition Letter 26 15 2005 2474 2482 (Pubitemid 41394303)
    • (2005) Pattern Recognition Letters , vol.26 , Issue.15 , pp. 2474-2482
    • Chen, W.1    Er, M.J.2    Wu, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.