Volume 12, Issue 4, 2009, Pages 149-160

Estimation of unknown speaker's height from speech

Author keywords

Human height estimation from speech; Regression algorithms; Speech processing

Indexed keywords

FEATURE VECTORS; HEIGHT ESTIMATION; HUMAN HEIGHT; HUMAN HEIGHT ESTIMATION FROM SPEECH; MEAN ABSOLUTE ERROR; NON-LINEAR REGRESSION; REGRESSION ALGORITHMS; REGRESSION MODEL; RELATIVE ERRORS; SPEECH INPUT; TIMIT DATABASE;

EID: 77949917470     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-010-9064-2     Document Type: Article
Times cited : (41)

References (58)
  • 1
    • 0016355478
    • A new look at the statistical model identification
    • 0314.62039 10.1109/TAC.1974.1100705 423716
    • H. Akaike 1974 A new look at the statistical model identification IEEE Transactions on Automatic Control 19 6 716 723 0314.62039 10.1109/TAC.1974.1100705 423716
    • (1974) IEEE Transactions on Automatic Control , vol.19 , Issue.6 , pp. 716-723
    • Akaike, H.1
  • 2
    • 36248969261
    • Speaker characteristics and emotion classification
    • C. Muller (eds). Springer Berlin. 10.1007/978-3-540-74200-5-7
    • Batliner, A., & Huber, R. (2007). Speaker characteristics and emotion classification. In C. Muller (Ed.), LNAI : Vol. 4343. Speaker classification I (pp. 138-151). Berlin: Springer. 10.1007/978-3-540-74200-5-7
    • (2007) Speaker Classification I, LNAI , vol.4343 , pp. 138-151
    • Batliner, A.1    Huber, R.2
  • 5
    • 0030211964
    • Bagging predictors
    • 0858.68080 1425957
    • L. Breiman 1996 Bagging predictors Machine Learning 24 2 123 140 0858.68080 1425957
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 6
    • 0031233424
    • Speaker recognition: A tutorial
    • Campbell, J. P. (1997). Speaker recognition: a tutorial. Proceedings of the IEEE, 85(9).
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9
    • Campbell, J.P.1
  • 7
    • 0038895405
    • Training v-support vector regression: Theory and algorithms
    • 10.1162/089976602760128081
    • C. C. Chang C. J. Lin 2002 Training v-support vector regression: theory and algorithms Neural Computation 14 8 1959-1977 10.1162/089976602760128081
    • (2002) Neural Computation , vol.14 , Issue.8 , pp. 1959-1977
    • Chang, C.C.1    Lin, C.J.2
  • 9
    • 0003603515
    • Cambridge University Press Cambridge ISBN-978-0521592772 R. Cole, J. Mariani, H. Uszkoreit, G. Battista Varile, A. Zaenen, & A. Zampolli (Eds.)
    • Cole et al. (1998). Survey of the state of the art in human language technology (studies in natural language processing). Cambridge: Cambridge University Press. R. Cole, J. Mariani, H. Uszkoreit, G. Battista Varile, A. Zaenen, & A. Zampolli (Eds.). ISBN-13:978-0521592772.
    • (1998) Survey of the State of the Art in Human Language Technology (Studies in Natural Language Processing)
    • Cole1
  • 10
    • 0034490567
    • Men's voices and women's choices
    • 10.1006/anbe.2000.1523
    • S. A. Collins 2000 Men's voices and women's choices Animal Behaviour 60 773 780 10.1006/anbe.2000.1523
    • (2000) Animal Behaviour , vol.60 , pp. 773-780
    • Collins, S.A.1
  • 11
    • 0005504614
    • Speakers and hearers are people: Reflections on speech deterioration as a consequence of acquired deafness
    • K.-E. Spens G. Plant (eds). Whurr London
    • Cowie, R., & Douglas-Cowie, E. (1995). Speakers and hearers are people: reflections on speech deterioration as a consequence of acquired deafness. In K.-E. Spens & G. Plant (Eds.), Profound deafness and speech communication (pp. 510-527). London: Whurr.
    • (1995) Profound Deafness and Speech Communication , pp. 510-527
    • Cowie, R.1    Douglas-Cowie, E.2
  • 13
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • 10.1109/TASSP.1980.1163420
    • S. B. Davis P. Mermelstein 1980 Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Transactions Acoustics, Speech and Signal Processing 28 4 357 366 10.1109/TASSP.1980.1163420
    • (1980) IEEE Transactions Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 18
    • 0030858647
    • Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaque
    • 10.1121/1.421048
    • W. T. Fitch 1997 Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaque Journal of Acoustical Society of America (JASA) 102 2 1213 1222 10.1121/1.421048
    • (1997) Journal of Acoustical Society of America (JASA) , vol.102 , Issue.2 , pp. 1213-1222
    • Fitch, W.T.1
  • 19
    • 0032878792
    • Morphology and development of human vocal tract: A study using magnetic resonance imaging
    • 10.1121/1.427148
    • W. T. Fitch J. Giedd 1999 Morphology and development of human vocal tract: a study using magnetic resonance imaging Journal of Acoustical Society of America (JASA) 106 3 1511 1522 10.1121/1.427148
    • (1999) Journal of Acoustical Society of America (JASA) , vol.106 , Issue.3 , pp. 1511-1522
    • Fitch, W.T.1    Giedd, J.2
  • 20
    • 0037186544
    • Stochastic gradient boosting
    • 1072.65502 10.1016/S0167-9473(01)00065-2 1884869
    • J. H. Friedman 2002 Stochastic gradient boosting Computational Statistics and Data Analysis 38 4 367 378 1072.65502 10.1016/S0167-9473(01)00065-2 1884869
    • (2002) Computational Statistics and Data Analysis , vol.38 , Issue.4 , pp. 367-378
    • Friedman, J.H.1
  • 22
    • 0042344809
    • Estimation of speaker's weight and height from speech: A re-analysis of data from multiple studies by Lass and colleagues
    • 10.2466/PMS.96.1.297-304
    • J. Gonzalez 2003 Estimation of speaker's weight and height from speech: a re-analysis of data from multiple studies by Lass and colleagues Perceptual and Motor Skills 96 297 304 10.2466/PMS.96.1.297-304
    • (2003) Perceptual and Motor Skills , vol.96 , pp. 297-304
    • Gonzalez, J.1
  • 23
    • 67349233475
    • Research in acoustics of human speech sounds: Correlates and perception of speaker body size
    • S. G. Pandalai (eds). Transworld Research Network Kerala ISBN-81-7895-213-0
    • González, J. (2006). Research in acoustics of human speech sounds: correlates and perception of speaker body size. In S. G. Pandalai (Ed.), Recent research developments in applied physics, Vol. 9. Kerala: Transworld Research Network. ISBN:81-7895-213-0.
    • (2006) Recent Research Developments in Applied Physics 9
    • González, J.1
  • 24
    • 84925980153
    • Listener estimations of speaker height and weight in unfiltered and filtered conditions
    • C. D. Gunter W. H. Manning 1982 Listener estimations of speaker height and weight in unfiltered and filtered conditions Journal of Phonetics 10 251 257
    • (1982) Journal of Phonetics , vol.10 , pp. 251-257
    • Gunter, C.D.1    Manning, W.H.2
  • 27
    • 8844259623
    • Can soft biometric traits assist user recognition?
    • A. K. Jain & N. K. Ratha (Eds.), Biometric technology for human identification
    • Jain, A. K., Dass, S. C., & Nandakumar, K. (2004). Can soft biometric traits assist user recognition? In A. K. Jain & N. K. Ratha (Eds.), Biometric technology for human identification. Proceedings of the SPIE 2004 (Vol. 5404, pp. 561-572).
    • (2004) Proceedings of the SPIE 2004 , vol.5404 , pp. 561-572
    • Jain, A.K.1    Dass, S.C.2    Nandakumar, K.3
  • 30
    • 0024842758
    • How well does average fundamental frequency correlate with speaker height and weight?
    • 10.1159/000261832
    • H. J. Kunzel 1989 How well does average fundamental frequency correlate with speaker height and weight? Phonetica 46 117 125 10.1159/000261832
    • (1989) Phonetica , vol.46 , pp. 117-125
    • Kunzel, H.J.1
  • 31
    • 0033097628
    • Robust speech detection method for telephone speech recognition system
    • 10.1016/S0167-6393(98)00072-7
    • S. Kuroiwa M. Naito S. Yamamoto N. Higuchi 1999 Robust speech detection method for telephone speech recognition system Speech Communication 27 135 148 10.1016/S0167-6393(98)00072-7
    • (1999) Speech Communication , vol.27 , pp. 135-148
    • Kuroiwa, S.1    Naito, M.2    Yamamoto, S.3    Higuchi, N.4
  • 32
    • 0017961479
    • Correlation study of speaker's heights, weights, body surface areas, and speaking fundamental frequencies
    • N. J. Lass W. S. Brown 1978 Correlation study of speaker's heights, weights, body surface areas, and speaking fundamental frequencies Journal of Acoustical Society of America (JASA) 63 4 700 703
    • (1978) Journal of Acoustical Society of America (JASA) , vol.63 , Issue.4 , pp. 700-703
    • Lass, N.J.1    Brown, W.S.2
  • 33
    • 0017031660
    • An investigation of speaker height and weight identification
    • 10.1121/1.381142
    • N. J. Lass M. Davis 1976 An investigation of speaker height and weight identification Journal of Acoustical Society of America (JASA) 60 3 700 703 10.1121/1.381142
    • (1976) Journal of Acoustical Society of America (JASA) , vol.60 , Issue.3 , pp. 700-703
    • Lass, N.J.1    Davis, M.2
  • 34
    • 0343097639
    • The effect of filtered speech on speaker height and weight identification
    • N. J. Lass J. K. Phillips C. A. Bruchey 1980 The effect of filtered speech on speaker height and weight identification Journal of Phonetics 8 91 100
    • (1980) Journal of Phonetics , vol.8 , pp. 91-100
    • Lass, N.J.1    Phillips, J.K.2    Bruchey, C.A.3
  • 36
    • 0016495091
    • Linear prediction: A tutorial review
    • 10.1109/PROC.1975.9792
    • J. Makhoul 1975 Linear prediction: a tutorial review Proceedings of the IEEE 63 5 561 580 10.1109/PROC.1975.9792
    • (1975) Proceedings of the IEEE , vol.63 , Issue.5 , pp. 561-580
    • Makhoul, J.1
  • 37
    • 34547542381
    • Comparison of four approaches to age and gender recognition for telephone applications
    • Metze, F., Ajmera, J., Englert, R., Bub, U., Burkhardt, F., Stegmann, J., Müller, C., Huber, R., Andrassy, B., Bauer, J., & Littel, B. (2007). Comparison of four approaches to age and gender recognition for telephone applications. In Proc. of the 2007 IEEE international conference on acoustics, speech, and signal processing (ICASSP 2007) (Vol. 4, pp. 1089-1092).
    • (2007) Proc. of the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007) , vol.4 , pp. 1089-1092
    • Metze, F.1
  • 38
    • 70349263477
    • Speech segmentation using regression fusion of boundary predictions
    • 10.1016/j.csl.2009.04.004
    • I. Mporas T. Ganchev N. Fakotakis 2010 Speech segmentation using regression fusion of boundary predictions Computer Speech and Language 24 2 273 288 10.1016/j.csl.2009.04.004
    • (2010) Computer Speech and Language , vol.24 , Issue.2 , pp. 273-288
    • Mporas, I.1    Ganchev, T.2    Fakotakis, N.3
  • 41
    • 77949914930
    • American Academy of Ophthalmology and Otolaryngology Rochester (Rev. by J. A. Krichner)
    • Pressman, J. J., & Keleman, G. (1970). Physiology of the Larynx (Rev. by J. A. Krichner). Rochester: American Academy of Ophthalmology and Otolaryngology.
    • (1970) Physiology of the Larynx
    • Pressman, J.J.1    Keleman, G.2
  • 43
    • 13544268439
    • Pitch (F0) and formant profiles of human vowels and vowel-like baboon grunts: The role of vocalizer body size and voice-acoustic allometry
    • D. Rendall S. Kollias C. Ney 2005 Pitch (F0) and formant profiles of human vowels and vowel-like baboon grunts: the role of vocalizer body size and voice-acoustic allometry Journal of Acoustical Society of America (JASA) 117 2 1 12
    • (2005) Journal of Acoustical Society of America (JASA) , vol.117 , Issue.2 , pp. 1-12
    • Rendall, D.1    Kollias, S.2    Ney, C.3
  • 49
    • 85055298118
    • Speaker height and weight identification: Re-evaluation of some old data
    • W. A. van Dommelen 1993 Speaker height and weight identification: re-evaluation of some old data Journal of Phonetics 21 337 341
    • (1993) Journal of Phonetics , vol.21 , pp. 337-341
    • Van Dommelen, W.A.1
  • 50
    • 84970406832
    • Acoustic parameters in speaker height and weight identification: Sex-specific behaviour
    • W. A. van Dommelen B. H. Moxness 1995 Acoustic parameters in speaker height and weight identification: sex-specific behaviour Language and Speech 38 267 287
    • (1995) Language and Speech , vol.38 , pp. 267-287
    • Van Dommelen, W.A.1    Moxness, B.H.2
  • 53
    • 0029431587
    • Generalized additive models versus linear regression in generating probabilistic MOS forecasts of aviation weather parameters
    • 10.1175/1520-0434(1995)010<0669:GAMVLR>2.0.CO;2
    • R. L. Vislocky J. M. Fritsch 1995 Generalized additive models versus linear regression in generating probabilistic MOS forecasts of aviation weather parameters Weather and Forecasting 10 4 669 680 10.1175/1520-0434(1995)010<0669:GAMVLR>2.0.CO;2
    • (1995) Weather and Forecasting , vol.10 , Issue.4 , pp. 669-680
    • Vislocky, R.L.1    Fritsch, J.M.2
  • 56
    • 41049090228
    • Phone duration modeling using gradient tree boosting
    • 10.1016/j.specom.2007.12.003
    • J. Yamagishi H. Kawai T. Kobayashi 2008 Phone duration modeling using gradient tree boosting Speech Communication 50 5 405 415 10.1016/j.specom.2007.12.003
    • (2008) Speech Communication , vol.50 , Issue.5 , pp. 405-415
    • Yamagishi, J.1    Kawai, H.2    Kobayashi, T.3


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.