메뉴 건너뛰기




Volumn 27, Issue 1, 2013, Pages 151-167

Automatic speaker age and gender recognition using acoustic and prosodic level information fusion

Author keywords

Age recognition; Formant; Gender recognition; GMM; Harmonic structure; Maximum likelihood linear regression; Pitch; Polynomial expansion; Prosodic features; Score level fusion; Sparse representation; SVM; UBM weight posterior probability supervectors

Indexed keywords

AGE RECOGNITION; FORMANT; GENDER RECOGNITION; GMM; HARMONIC STRUCTURES; MAXIMUM LIKELIHOOD LINEAR REGRESSION; PITCH; POLYNOMIAL EXPANSION; POSTERIOR PROBABILITY; PROSODIC FEATURES; SCORE-LEVEL FUSION; SPARSE REPRESENTATION; SVM;

EID: 84867336595     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2012.01.008     Document Type: Article
Times cited : (158)

References (47)
  • 1
    • 84867328971 scopus 로고    scopus 로고
    • Age and gender classification using modulation cepstrum
    • Ajmera J.; and Burkhardt F. Age and gender classification using modulation cepstrum Proc. Odyssey 2008 025
    • (2008) Proc. Odyssey , pp. 025
    • Ajmera, J.1    Burkhardt, F.2
  • 3
    • 51449091542 scopus 로고    scopus 로고
    • Age and gender recognition for telephone applications based on GMM supervectors and support vector machines
    • Bocklet T.; Maier A.; Bauer J.; Burkhardt F.; and Nöth E. Age and gender recognition for telephone applications based on GMM supervectors and support vector machines Proc. ICASSP 2008 1605 1608
    • (2008) Proc. ICASSP , pp. 1605-1608
    • Bocklet, T.1    Maier, A.2    Bauer, J.3    Burkhardt, F.4    Nöth, E.5
  • 4
    • 79959826869 scopus 로고    scopus 로고
    • Age and gender recognition based on multiple systems - Early vs. late fusion
    • Bocklet T.; Stemmer G.; Zeissler V.; and Nöth E. Age and gender recognition based on multiple systems - early vs. late fusion Proc. INTERSPEECH 2010 2830 2833
    • (2010) Proc. INTERSPEECH , pp. 2830-2833
    • Bocklet, T.1    Stemmer, G.2    Zeissler, V.3    Nöth, E.4
  • 8
    • 33947696754 scopus 로고    scopus 로고
    • SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
    • Campbell W.; Sturim D.; Reynolds D.; and Solomonoff A. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation Proc. ICASSP 2006 97 100
    • (2006) Proc. ICASSP , pp. 97-100
    • Campbell, W.1    Sturim, D.2    Reynolds, D.3    Solomonoff, A.4
  • 11
    • 0000913324 scopus 로고    scopus 로고
    • SVMTorch: Support vector machines for large-scale regression problems
    • Collobert R.; and Bengio S. SVMTorch: support vector machines for large-scale regression problems The Journal of Machine Learning Research 1 2001 143 160
    • (2001) The Journal of Machine Learning Research , vol.1 , pp. 143-160
    • Collobert, R.1    Bengio, S.2
  • 14
    • 51449092448 scopus 로고    scopus 로고
    • Continuous prosodic features and formant modeling with joint factor analysis for speaker verification
    • Dehak N.; Kenny P.; and Dumouchel P. Continuous prosodic features and formant modeling with joint factor analysis for speaker verification Proc. INTERSPEECH 2007 1234 1237
    • (2007) Proc. INTERSPEECH , pp. 1234-1237
    • Dehak, N.1    Kenny, P.2    Dumouchel, P.3
  • 15
    • 70450161521 scopus 로고    scopus 로고
    • Dimension reduction approaches for SVM based speaker age estimation
    • Dobry G.; Hecht R.; Avigal M.; and Zigel Y. Dimension reduction approaches for SVM based speaker age estimation Proc. INTERSPEECH 2009 2031 2034
    • (2009) Proc. INTERSPEECH , pp. 2031-2034
    • Dobry, G.1    Hecht, R.2    Avigal, M.3    Zigel, Y.4
  • 17
    • 79959823933 scopus 로고    scopus 로고
    • Gender and affect recognition based on GMM and GMM-UBM modeling with relevance MAP estimation
    • Gajšek R.; Žibert J.; Justin T.; Štruc V.; Vesnicer B.; and Mihelič F. Gender and affect recognition based on GMM and GMM-UBM modeling with relevance MAP estimation Proc. INTERSPEECH 2010 2810 2813
    • (2010) Proc. INTERSPEECH , pp. 2810-2813
    • Gajšek, R.1    Žibert, J.2    Justin, T.3    Štruc, V.4    Vesnicer, B.5    Mihelič, F.6
  • 22
    • 79959829347 scopus 로고    scopus 로고
    • Brno university of technology system for interspeech 2010 paralinguistic challenge
    • Kockmann M.; Burget L.; and Černocký J. Brno university of technology system for interspeech 2010 paralinguistic challenge Proc. INTERSPEECH 2010 2822 2825
    • (2010) Proc. INTERSPEECH , pp. 2822-2825
    • Kockmann, M.1    Burget, L.2    Černocký, J.3
  • 24
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter C.; and Woodland P. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Computer Speech and Language 9 1995 171
    • (1995) Computer Speech and Language , vol.9 , pp. 171
    • Leggetter, C.1    Woodland, P.2
  • 25
    • 84867205560 scopus 로고    scopus 로고
    • Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping
    • Li M.; Cao C.; Di Wang P.; Fu Q.; and Yan Y. Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping Proc. INTERSPEECH 2008 151 154
    • (2008) Proc. INTERSPEECH , pp. 151-154
    • Li, M.1    Cao, C.2    Di Wang, P.3    Fu, Q.4    Yan, Y.5
  • 26
    • 79959838775 scopus 로고    scopus 로고
    • Combining five acoustic level methods for automatic speaker age and gender recognition
    • Li M.; Jung C.S.; and Han K.J. Combining five acoustic level methods for automatic speaker age and gender recognition Proc. INTERSPEECH 2010 2826 2829
    • (2010) Proc. INTERSPEECH , pp. 2826-2829
    • Li, M.1    Jung, C.S.2    Han, K.J.3
  • 27
    • 80051633581 scopus 로고    scopus 로고
    • Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors
    • Li M.; and Narayanan S. Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors Proc. ICASSP 2011 1481 1484
    • (2011) Proc. ICASSP , pp. 1481-1484
    • Li, M.1    Narayanan, S.2
  • 28
    • 79959831145 scopus 로고    scopus 로고
    • Spoken language identification using score vector modeling and support vector machine
    • Li M.; Suo H.; Wu X.; Lu P.; and Yan Y. Spoken language identification using score vector modeling and support vector machine Proc. INTERSPEECH 2007 350 353
    • (2007) Proc. INTERSPEECH , pp. 350-353
    • Li, M.1    Suo, H.2    Wu, X.3    Lu, P.4    Yan, Y.5
  • 29
    • 84865799827 scopus 로고    scopus 로고
    • Speaker verification using sparse representations on total variability i-vectors
    • Li M.; Zhang X.; Yan Y.; and Narayanan S. Speaker verification using sparse representations on total variability i-vectors Proc. INTERSPEECH 2011
    • (2011) Proc. INTERSPEECH
    • Li, M.1    Zhang, X.2    Yan, Y.3    Narayanan, S.4
  • 30
    • 33646817778 scopus 로고    scopus 로고
    • Language identification using pitch contour information
    • Lin C.; and Wang H. Language identification using pitch contour information Proc. ICASSP 2005 601 604
    • (2005) Proc. ICASSP , pp. 601-604
    • Lin, C.1    Wang, H.2
  • 31
    • 79959842932 scopus 로고    scopus 로고
    • Age and gender classification from speech using decision level fusion and ensemble based techniques
    • Lingenfelser F.; Wagner J.; Vogt T.; Kim J.; and André E. Age and gender classification from speech using decision level fusion and ensemble based techniques Proc. INTERSPEECH 2010 2798 2801
    • (2010) Proc. INTERSPEECH , pp. 2798-2801
    • Lingenfelser, F.1    Wagner, J.2    Vogt, T.3    Kim, J.4    André, E.5
  • 32
    • 79959823110 scopus 로고    scopus 로고
    • Age and gender classification using fusion of acoustic and prosodic features
    • Meinedo H.; and Trancoso I. Age and gender classification using fusion of acoustic and prosodic features Proc. INTERSPEECH 2010 2818 2821
    • (2010) Proc. INTERSPEECH , pp. 2818-2821
    • Meinedo, H.1    Trancoso, I.2
  • 34
    • 56149095112 scopus 로고    scopus 로고
    • Combining short-term cepstral and long-term pitch features for automatic recognition of speaker age
    • Müller C.; and Burkhardt F. Combining short-term cepstral and long-term pitch features for automatic recognition of speaker age Proc. INTERSPEECH 2007 2277 2280
    • (2007) Proc. INTERSPEECH , pp. 2277-2280
    • Müller, C.1    Burkhardt, F.2
  • 35
    • 79959858399 scopus 로고    scopus 로고
    • Fuzzy support vector machines for age and gender classification
    • Nguyen P.; Le T.; Tran D.; Huang X.; and Sharma D. Fuzzy support vector machines for age and gender classification Proc. INTERSPEECH 2010 2806 2809
    • (2010) Proc. INTERSPEECH , pp. 2806-2809
    • Nguyen, P.1    Le, T.2    Tran, D.3    Huang, X.4    Sharma, D.5
  • 36
    • 79959846299 scopus 로고    scopus 로고
    • Age recognition based on speech signals using weights supervector
    • Porat R.; Lange D.; and Zigel Y. Age recognition based on speech signals using weights supervector Proc. INTERSPEECH 2010 2814 2817
    • (2010) Proc. INTERSPEECH , pp. 2814-2817
    • Porat, R.1    Lange, D.2    Zigel, Y.3
  • 37
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • Reynolds D.; Quatieri T.; and Dunn R. Speaker verification using adapted Gaussian mixture models Digital Signal Processing 10 2000 19 41
    • (2000) Digital Signal Processing , vol.10 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 38
    • 36248944637 scopus 로고    scopus 로고
    • Acoustic analysis of adult speaker age. Speaker classification i
    • Schötz S. Acoustic analysis of adult speaker age. Speaker classification I Lecture Notes in Computer Science 2007 88 107
    • (2007) Lecture Notes in Computer Science , pp. 88-107
    • Schötz, S.1
  • 40
    • 33947620115 scopus 로고    scopus 로고
    • Hierarchical structures of neural networks for phoneme
    • Software
    • Schwarz, P.; Matejka, P.; Cernocky, J.; 2006. Hierarchical structures of neural networks for phoneme. In: Proc. ICASSP, pp. 325-328. Software available at http://speech.fit.vutbr.cz/software/phoneme-recognizer-based-long-temporal- context.
    • (2006) Proc. ICASSP , pp. 325-328
    • Schwarz, P.1    Matejka, P.2    Cernocky, J.3
  • 41
    • 85009141765 scopus 로고    scopus 로고
    • Wavesurfer-an open source speech tool
    • Sjölander K.; and Beskow J. Wavesurfer-an open source speech tool Proc. ICSLP 2000 464 467
    • (2000) Proc. ICSLP , pp. 464-467
    • Sjölander, K.1    Beskow, J.2
  • 45
    • 70450180871 scopus 로고    scopus 로고
    • Age recognition for spoken dialogue systems: Do we need it?
    • Wolters M.; Vipperla R.; and Renals S. Age recognition for spoken dialogue systems: Do we need it? Proc. INTERSPEECH 2009 1435 1438
    • (2009) Proc. INTERSPEECH , pp. 1435-1438
    • Wolters, M.1    Vipperla, R.2    Renals, S.3
  • 47


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.