-
1
-
-
0016355478
-
A new look at the statistical model identification
-
0314.62039 10.1109/TAC.1974.1100705 423716
-
H. Akaike 1974 A new look at the statistical model identification IEEE Transactions on Automatic Control 19 6 716 723 0314.62039 10.1109/TAC.1974. 1100705 423716
-
(1974)
IEEE Transactions on Automatic Control
, vol.19
, Issue.6
, pp. 716-723
-
-
Akaike, H.1
-
2
-
-
36248969261
-
Speaker characteristics and emotion classification
-
C. Muller (eds). Springer Berlin. 10.1007/978-3-540-74200-5-7
-
Batliner, A., & Huber, R. (2007). Speaker characteristics and emotion classification. In C. Muller (Ed.), LNAI : Vol. 4343. Speaker classification I (pp. 138-151). Berlin: Springer. 10.1007/978-3-540-74200-5-7
-
(2007)
Speaker Classification i LNAi
, vol.4343
, pp. 138-151
-
-
Batliner, A.1
Huber, R.2
-
5
-
-
0030211964
-
Bagging predictors
-
0858.68080 1425957
-
L. Breiman 1996 Bagging predictors Machine Learning 24 2 123 140 0858.68080 1425957
-
(1996)
Machine Learning
, vol.24
, Issue.2
, pp. 123-140
-
-
Breiman, L.1
-
6
-
-
0031233424
-
Speaker recognition: A tutorial
-
Campbell, J. P. (1997). Speaker recognition: a tutorial. Proceedings of the IEEE, 85(9).
-
(1997)
Proceedings of the IEEE
, vol.85
, Issue.9
-
-
Campbell, J.P.1
-
7
-
-
0038895405
-
Training v-support vector regression: Theory and algorithms
-
10.1162/089976602760128081
-
C. C. Chang C. J. Lin 2002 Training v-support vector regression: theory and algorithms Neural Computation 14 8 1959-1977 10.1162/089976602760128081
-
(2002)
Neural Computation
, vol.14
, Issue.8
, pp. 1959-1977
-
-
Chang, C.C.1
Lin, C.J.2
-
9
-
-
0003603515
-
-
Cambridge University Press Cambridge ISBN-978-0521592772 R. Cole, J. Mariani, H. Uszkoreit, G. Battista Varile, A. Zaenen, & A. Zampolli (Eds.)
-
Cole et al. (1998). Survey of the state of the art in human language technology (studies in natural language processing). Cambridge: Cambridge University Press. R. Cole, J. Mariani, H. Uszkoreit, G. Battista Varile, A. Zaenen, & A. Zampolli (Eds.). ISBN-13:978-0521592772.
-
(1998)
Survey of the State of the Art in Human Language Technology (Studies in Natural Language Processing)
-
-
Cole1
-
10
-
-
0034490567
-
Men's voices and women's choices
-
10.1006/anbe.2000.1523
-
S. A. Collins 2000 Men's voices and women's choices Animal Behaviour 60 773 780 10.1006/anbe.2000.1523
-
(2000)
Animal Behaviour
, vol.60
, pp. 773-780
-
-
Collins, S.A.1
-
11
-
-
0005504614
-
Speakers and hearers are people: Reflections on speech deterioration as a consequence of acquired deafness
-
K.-E. Spens G. Plant (eds). Whurr London
-
Cowie, R., & Douglas-Cowie, E. (1995). Speakers and hearers are people: reflections on speech deterioration as a consequence of acquired deafness. In K.-E. Spens & G. Plant (Eds.), Profound deafness and speech communication (pp. 510-527). London: Whurr.
-
(1995)
Profound Deafness and Speech Communication
, pp. 510-527
-
-
Cowie, R.1
Douglas-Cowie, E.2
-
12
-
-
85032751766
-
Emotion recognition in human-computer interaction
-
10.1109/79.911197
-
R. Cowie E. Douglas-Cowie N. Tsapatsoulis G. Votsis S. Kollias W. Fellenz J. G. Taylor 2001 Emotion recognition in human-computer interaction IEEE Signal Processing Magazine 18 1 32 80 10.1109/79.911197
-
(2001)
IEEE Signal Processing Magazine
, vol.18
, Issue.1
, pp. 32-80
-
-
Cowie, R.1
Douglas-Cowie, E.2
Tsapatsoulis, N.3
Votsis, G.4
Kollias, S.5
Fellenz, W.6
Taylor, J.G.7
-
13
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
10.1109/TASSP.1980.1163420
-
S. B. Davis P. Mermelstein 1980 Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Transactions Acoustics, Speech and Signal Processing 28 4 357 366 10.1109/TASSP.1980.1163420
-
(1980)
IEEE Transactions Acoustics, Speech and Signal Processing
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
15
-
-
84874111109
-
-
IOS Press Utrecht
-
Esposito, A., Bratanic, M., Keller, E., & Marinaro, M. (2007). NATO security through science series E: Human and societal dynamics : Vol. 18. Fundamentals of verbal and nonverbal communication and the biometric issue. Utrecht: IOS Press.
-
(2007)
Fundamentals of Verbal and Nonverbal Communication and the Biometric Issue NATO Security Through Science Series E: Human and Societal Dynamics
, vol.18
-
-
Esposito, A.1
Bratanic, M.2
Keller, E.3
Marinaro, M.4
-
18
-
-
0030858647
-
Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaque
-
10.1121/1.421048
-
W. T. Fitch 1997 Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaque Journal of Acoustical Society of America (JASA) 102 2 1213 1222 10.1121/1.421048
-
(1997)
Journal of Acoustical Society of America (JASA)
, vol.102
, Issue.2
, pp. 1213-1222
-
-
Fitch, W.T.1
-
19
-
-
0032878792
-
Morphology and development of human vocal tract: A study using magnetic resonance imaging
-
10.1121/1.427148
-
W. T. Fitch J. Giedd 1999 Morphology and development of human vocal tract: a study using magnetic resonance imaging Journal of Acoustical Society of America (JASA) 106 3 1511 1522 10.1121/1.427148
-
(1999)
Journal of Acoustical Society of America (JASA)
, vol.106
, Issue.3
, pp. 1511-1522
-
-
Fitch, W.T.1
Giedd, J.2
-
20
-
-
0037186544
-
Stochastic gradient boosting
-
1072.65502 10.1016/S0167-9473(01)00065-2 1884869
-
J. H. Friedman 2002 Stochastic gradient boosting Computational Statistics and Data Analysis 38 4 367 378 1072.65502 10.1016/S0167-9473(01)00065-2 1884869
-
(2002)
Computational Statistics and Data Analysis
, vol.38
, Issue.4
, pp. 367-378
-
-
Friedman, J.H.1
-
22
-
-
0042344809
-
Estimation of speaker's weight and height from speech: A re-analysis of data from multiple studies by Lass and colleagues
-
10.2466/PMS.96.1.297-304
-
J. Gonzalez 2003 Estimation of speaker's weight and height from speech: a re-analysis of data from multiple studies by Lass and colleagues Perceptual and Motor Skills 96 297 304 10.2466/PMS.96.1.297-304
-
(2003)
Perceptual and Motor Skills
, vol.96
, pp. 297-304
-
-
Gonzalez, J.1
-
23
-
-
67349233475
-
Research in acoustics of human speech sounds: Correlates and perception of speaker body size
-
S. G. Pandalai (eds). Transworld Research Network Kerala ISBN-81-7895-213-0
-
González, J. (2006). Research in acoustics of human speech sounds: correlates and perception of speaker body size. In S. G. Pandalai (Ed.), Recent research developments in applied physics, Vol. 9. Kerala: Transworld Research Network. ISBN:81-7895-213-0.
-
(2006)
Recent Research Developments in Applied Physics 9
-
-
González, J.1
-
24
-
-
84925980153
-
Listener estimations of speaker height and weight in unfiltered and filtered conditions
-
C. D. Gunter W. H. Manning 1982 Listener estimations of speaker height and weight in unfiltered and filtered conditions Journal of Phonetics 10 251 257
-
(1982)
Journal of Phonetics
, vol.10
, pp. 251-257
-
-
Gunter, C.D.1
Manning, W.H.2
-
26
-
-
64149085238
-
Dialect/accent classification using unrestricted audio
-
10.1109/TASL.2006.881695
-
R. Huang J. H. L. Hansen P. Angkititrakul 2007 Dialect/accent classification using unrestricted audio IEEE Transactions on Audio, Speech, and Language Processing 15 2 453 464 10.1109/TASL.2006.881695
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.2
, pp. 453-464
-
-
Huang, R.1
Hansen, J.H.L.2
Angkititrakul, P.3
-
27
-
-
8844259623
-
Can soft biometric traits assist user recognition?
-
A. K. Jain & N. K. Ratha (Eds.), Biometric technology for human identification
-
Jain, A. K., Dass, S. C., & Nandakumar, K. (2004). Can soft biometric traits assist user recognition? In A. K. Jain & N. K. Ratha (Eds.), Biometric technology for human identification. Proceedings of the SPIE 2004 (Vol. 5404, pp. 561-572).
-
(2004)
Proceedings of the SPIE 2004
, vol.5404
, pp. 561-572
-
-
Jain, A.K.1
Dass, S.C.2
Nandakumar, K.3
-
30
-
-
0024842758
-
How well does average fundamental frequency correlate with speaker height and weight?
-
10.1159/000261832
-
H. J. Kunzel 1989 How well does average fundamental frequency correlate with speaker height and weight? Phonetica 46 117 125 10.1159/000261832
-
(1989)
Phonetica
, vol.46
, pp. 117-125
-
-
Kunzel, H.J.1
-
31
-
-
0033097628
-
Robust speech detection method for telephone speech recognition system
-
10.1016/S0167-6393(98)00072-7
-
S. Kuroiwa M. Naito S. Yamamoto N. Higuchi 1999 Robust speech detection method for telephone speech recognition system Speech Communication 27 135 148 10.1016/S0167-6393(98)00072-7
-
(1999)
Speech Communication
, vol.27
, pp. 135-148
-
-
Kuroiwa, S.1
Naito, M.2
Yamamoto, S.3
Higuchi, N.4
-
32
-
-
0017961479
-
Correlation study of speaker's heights, weights, body surface areas, and speaking fundamental frequencies
-
N. J. Lass W. S. Brown 1978 Correlation study of speaker's heights, weights, body surface areas, and speaking fundamental frequencies Journal of Acoustical Society of America (JASA) 63 4 700 703
-
(1978)
Journal of Acoustical Society of America (JASA)
, vol.63
, Issue.4
, pp. 700-703
-
-
Lass, N.J.1
Brown, W.S.2
-
33
-
-
0017031660
-
An investigation of speaker height and weight identification
-
10.1121/1.381142
-
N. J. Lass M. Davis 1976 An investigation of speaker height and weight identification Journal of Acoustical Society of America (JASA) 60 3 700 703 10.1121/1.381142
-
(1976)
Journal of Acoustical Society of America (JASA)
, vol.60
, Issue.3
, pp. 700-703
-
-
Lass, N.J.1
Davis, M.2
-
34
-
-
0343097639
-
The effect of filtered speech on speaker height and weight identification
-
N. J. Lass J. K. Phillips C. A. Bruchey 1980 The effect of filtered speech on speaker height and weight identification Journal of Phonetics 8 91 100
-
(1980)
Journal of Phonetics
, vol.8
, pp. 91-100
-
-
Lass, N.J.1
Phillips, J.K.2
Bruchey, C.A.3
-
36
-
-
0016495091
-
Linear prediction: A tutorial review
-
10.1109/PROC.1975.9792
-
J. Makhoul 1975 Linear prediction: a tutorial review Proceedings of the IEEE 63 5 561 580 10.1109/PROC.1975.9792
-
(1975)
Proceedings of the IEEE
, vol.63
, Issue.5
, pp. 561-580
-
-
Makhoul, J.1
-
37
-
-
34547542381
-
Comparison of four approaches to age and gender recognition for telephone applications
-
Metze, F., Ajmera, J., Englert, R., Bub, U., Burkhardt, F., Stegmann, J., Müller, C., Huber, R., Andrassy, B., Bauer, J., & Littel, B. (2007). Comparison of four approaches to age and gender recognition for telephone applications. In Proc. of the 2007 IEEE international conference on acoustics, speech, and signal processing (ICASSP 2007) (Vol. 4, pp. 1089-1092).
-
(2007)
Proc. of the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007)
, vol.4
, pp. 1089-1092
-
-
Metze, F.1
-
38
-
-
70349263477
-
Speech segmentation using regression fusion of boundary predictions
-
10.1016/j.csl.2009.04.004
-
I. Mporas T. Ganchev N. Fakotakis 2010 Speech segmentation using regression fusion of boundary predictions Computer Speech and Language 24 2 273 288 10.1016/j.csl.2009.04.004
-
(2010)
Computer Speech and Language
, vol.24
, Issue.2
, pp. 273-288
-
-
Mporas, I.1
Ganchev, T.2
Fakotakis, N.3
-
39
-
-
0033693369
-
Unsupervised estimation of the human vocal tract length over sentence level utterances
-
Necioglu, B. F., Clements, M. A., & Barnwell III, T. P. (2000). Unsupervised estimation of the human vocal tract length over sentence level utterances. In Proc. of the 2000 IEEE international conference on acoustics, speech, and signal processing (ICASSP 2000) (Vol. 3, pp. 1319-1322).
-
(2000)
Proc. of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000)
, vol.3
, pp. 1319-1322
-
-
Necioglu, B.F.1
Clements, M.A.2
Barnwell Iii, T.P.3
-
41
-
-
77949914930
-
-
American Academy of Ophthalmology and Otolaryngology Rochester (Rev. by J. A. Krichner)
-
Pressman, J. J., & Keleman, G. (1970). Physiology of the Larynx (Rev. by J. A. Krichner). Rochester: American Academy of Ophthalmology and Otolaryngology.
-
(1970)
Physiology of the Larynx
-
-
Pressman, J.J.1
Keleman, G.2
-
43
-
-
13544268439
-
Pitch (F0) and formant profiles of human vowels and vowel-like baboon grunts: The role of vocalizer body size and voice-acoustic allometry
-
D. Rendall S. Kollias C. Ney 2005 Pitch (F0) and formant profiles of human vowels and vowel-like baboon grunts: the role of vocalizer body size and voice-acoustic allometry Journal of Acoustical Society of America (JASA) 117 2 1 12
-
(2005)
Journal of Acoustical Society of America (JASA)
, vol.117
, Issue.2
, pp. 1-12
-
-
Rendall, D.1
Kollias, S.2
Ney, C.3
-
48
-
-
4544283894
-
An estimate of physical scale from speech
-
Smith, L. H., & Nelson, D. J. (2004). An estimate of physical scale from speech. In Proc. of the 2004 IEEE international conference on acoustics, speech, and signal processing (ICASSP 2004) (Vol. 1, pp. 561-564).
-
(2004)
Proc. of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004)
, vol.1
, pp. 561-564
-
-
Smith, L.H.1
Nelson, D.J.2
-
49
-
-
85055298118
-
Speaker height and weight identification: Re-evaluation of some old data
-
W. A. van Dommelen 1993 Speaker height and weight identification: re-evaluation of some old data Journal of Phonetics 21 337 341
-
(1993)
Journal of Phonetics
, vol.21
, pp. 337-341
-
-
Van Dommelen, W.A.1
-
50
-
-
84970406832
-
Acoustic parameters in speaker height and weight identification: Sex-specific behaviour
-
W. A. van Dommelen B. H. Moxness 1995 Acoustic parameters in speaker height and weight identification: sex-specific behaviour Language and Speech 38 267 287
-
(1995)
Language and Speech
, vol.38
, pp. 267-287
-
-
Van Dommelen, W.A.1
Moxness, B.H.2
-
53
-
-
0029431587
-
Generalized additive models versus linear regression in generating probabilistic MOS forecasts of aviation weather parameters
-
10.1175/1520-0434(1995)010<0669:GAMVLR>2.0.CO;2
-
R. L. Vislocky J. M. Fritsch 1995 Generalized additive models versus linear regression in generating probabilistic MOS forecasts of aviation weather parameters Weather and Forecasting 10 4 669 680 10.1175/1520-0434(1995) 010<0669:GAMVLR>2.0.CO;2
-
(1995)
Weather and Forecasting
, vol.10
, Issue.4
, pp. 669-680
-
-
Vislocky, R.L.1
Fritsch, J.M.2
-
56
-
-
41049090228
-
Phone duration modeling using gradient tree boosting
-
10.1016/j.specom.2007.12.003
-
J. Yamagishia H. Kawaia T. Kobayashib 2008 Phone duration modeling using gradient tree boosting Speech Communication 50 5 405 415 10.1016/j.specom.2007. 12.003
-
(2008)
Speech Communication
, vol.50
, Issue.5
, pp. 405-415
-
-
Yamagishia, J.1
Kawaia, H.2
Kobayashib, T.3
-
57
-
-
0003571976
-
-
Cambridge University Engineering Department Cambridge
-
Young, S., Evermann, G., Gales, M., Hain, T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., & Woodland, P. (2006). The HTK book (for HTK version 3.4). Cambridge: Cambridge University Engineering Department.
-
(2006)
The HTK Book (For HTK Version 3.4)
-
-
Young, S.1
Evermann, G.2
Gales, M.3
Hain, T.4
Kershaw, D.5
Liu, X.6
Moore, G.7
Odell, J.8
Ollason, D.9
Povey, D.10
Valtchev, V.11
Woodland, P.12
-
58
-
-
33947247091
-
Robust GMM-based gender classification using pitch and RASTA-PLP parameters of speech
-
2006
-
Zeng, Y., Wu, Z., Falk, T. H., & Chan, W.-Y. (2006). Robust GMM-based gender classification using pitch and RASTA-PLP parameters of speech. In Proc of intl. conf. on machine learning and cybernetics 2006.
-
(2006)
Proc of Intl. Conf. on Machine Learning and Cybernetics
-
-
Zeng, Y.1
Wu, Z.2
Falk, T.H.3
Chan, W.-Y.4
|