메뉴 건너뛰기




Volumn , Issue , 2014, Pages 93-101

The SRI AVEC-2014 evaluation system

Author keywords

Acoustic features; Articulatory features; Decision trees; Depression; Prosody; Robust signal analysis; Support vector regression; Time series prediction

Indexed keywords

DECISION TREES; FORESTRY; MEAN SQUARE ERROR; TIME SERIES ANALYSIS;

EID: 84919341100     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2661806.2661818     Document Type: Conference Paper
Times cited : (61)

References (54)
  • 1
    • 84870469320 scopus 로고    scopus 로고
    • American Psychiatric Association, Fourth Edition, Text Revision, Washington, DC, American Psychiatric Association
    • American Psychiatric Association, Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision, Washington, DC, American Psychiatric Association, 2000.
    • (2000) Diagnostic and Statistical Manual of Mental Disorders
  • 3
    • 4043156247 scopus 로고    scopus 로고
    • Treatment for Adolescents with Depression Study (TADS) team, fluoxetine, cognitive-behavioral therapy, and their combination for adolescents with depression: Treatment for Adolescents with Depression Study (TADS) randomized controlled trial
    • J. March, S. Silva, S. Petrycki, J. Curry, K. Wells, J. Fairbank, B. Burns, M. Domino, S. McNulty, B. Vitiello, J. Severe, "Treatment for Adolescents with Depression Study (TADS) team. Fluoxetine, cognitive-behavioral therapy, and their combination for adolescents with depression: Treatment for Adolescents with Depression Study (TADS) randomized controlled trial, " Journal of the American Medical Association, 2004; 292(7):807-820.
    • (2004) Journal of the American Medical Association , vol.292 , Issue.7 , pp. 807-820
    • March, J.1    Silva, S.2    Petrycki, S.3    Curry, J.4    Wells, K.5    Fairbank, J.6    Burns, B.7    Domino, M.8    McNulty, S.9    Vitiello, B.10    Severe, J.11
  • 4
    • 34247365352 scopus 로고    scopus 로고
    • Clinical response and risk for reported suicidal ideation and suicide attempts in pediatric antidepressant treatment, a meta-analysis of randomized controlled trials
    • J.A. Bridge, S. Iyengar, C.B. Salary, R.P. Barbe, B. Birmaher, H.A. Pincus, L. Ren, D.A. Brent, "Clinical response and risk for reported suicidal ideation and suicide attempts in pediatric antidepressant treatment, a meta-analysis of randomized controlled trials, " Journal of the American Medical Association, 2007; 297(15):1683-1696.
    • (2007) Journal of the American Medical Association , vol.297 , Issue.15 , pp. 1683-1696
    • Bridge, J.A.1    Iyengar, S.2    Salary, C.B.3    Barbe, R.P.4    Birmaher, B.5    Pincus, H.A.6    Ren, L.7    Brent, D.A.8
  • 5
    • 0017596911 scopus 로고
    • Vocal and speech patterns of depressive patients
    • J. Darby and H. Hollien, "Vocal and speech patterns of depressive patients, " Folia phoniat, vol. 29, pp. 279-291, 1977.
    • (1977) Folia Phoniat , vol.29 , pp. 279-291
    • Darby, J.1    Hollien, H.2
  • 6
    • 0021418575 scopus 로고
    • Speech and voice parameters of depression: A pilot study
    • J. Darby, N. Simons, and P. Berger, "Speech and voice parameters of depression: A pilot study, " J. Commun. Disorders, vol. 17, pp. 75-85, 1984.
    • (1984) J. Commun. Disorders , vol.17 , pp. 75-85
    • Darby, J.1    Simons, N.2    Berger, P.3
  • 8
    • 4143060162 scopus 로고    scopus 로고
    • Investigation of vocal jitter and glottal flow spectrum as possible cues for depression and near-term suicidal risk
    • September
    • A. Ozdas, R. G. Shiavi, S. E. Silverman, M. K. Silverman, and D. M. Wilkes, "Investigation of vocal jitter and glottal flow spectrum as possible cues for depression and near-term suicidal risk, " IEEE Transactions on Biomedical Engineering, vol. 51, no. 9, pp. 1530-1540, September 2004.
    • (2004) IEEE Transactions on Biomedical Engineering , vol.51 , Issue.9 , pp. 1530-1540
    • Ozdas, A.1    Shiavi, R.G.2    Silverman, S.E.3    Silverman, M.K.4    Wilkes, D.M.5
  • 11
    • 84863746063 scopus 로고    scopus 로고
    • Screening for high risk suicidal states using mel-cepstral coefficients and energy in frequency bands
    • Poznan, Poland
    • H. K. Keskinpala, T. Yingtha wornsuk, D. M. Wilkes, R. G. Shiavi, and R. M. Salomon, "Screening for high risk suicidal states using mel-cepstral coefficients and energy in frequency bands, " in European Signal Processing Conference, Poznan, Poland, 2007, pp. 2229-2233.
    • (2007) European Signal Processing Conference , pp. 2229-2233
    • Keskinpala, H.K.1    Wornsuk, T.Y.2    Wilkes, D.M.3    Shiavi, R.G.4    Salomon, R.M.5
  • 13
    • 37349079113 scopus 로고    scopus 로고
    • Criticalanalysis of the impact of glottal features in the classification of clinical depression in speech
    • January
    • E. M. II, M. A. Clements, J. W. Peifer, and L. Weisser, "Criticalanalysis of the impact of glottal features in the classification of clinical depression in speech, " IEEE Transactions on Biomedical Engineering, vol. 55, no. 1, pp. 96-107, January 2008.
    • (2008) IEEE Transactions on Biomedical Engineering , vol.55 , Issue.1 , pp. 96-107
    • Mu, E.1    Clements, M.A.2    Peifer, J.W.3    Weisser, L.4
  • 15
    • 58149091821 scopus 로고    scopus 로고
    • Distinguishing depression and suicidal risk in men using GMM based frequency contents of affective vocal tract response
    • Seoul, Korea
    • T. Yingthawornsuk and R. G. Shiavi, "Distinguishing depression and suicidal risk in men using GMM based frequency contents of affective vocal tract response, " in International Conference on Control, Automation and Systems, Seoul, Korea, 2008, pp. 901-904.
    • (2008) International Conference on Control, Automation and Systems , pp. 901-904
    • Yingthawornsuk, T.1    Shiavi, R.G.2
  • 18
    • 84919365008 scopus 로고    scopus 로고
    • Depression recognition based on dynamic facial and vocal expression features using partial least square regression
    • H. Meng, H. Wang, H. Yang, M. Al-Shuraifi, Y. Wang, "Depression Recognition based on Dynamic Facial and Vocal Expression Features using Partial Least Square Regression, " Proc. of AVEC 2013.
    • (2013) Proc. of AVEC
    • Meng, H.1    Wang, H.2    Yang, H.3    Al-Shuraifi, M.4    Wang, Y.5
  • 19
    • 84885679134 scopus 로고    scopus 로고
    • Affect analysis in natural human interaction using joint hidden conditional random fields
    • B. Siddiquie, S. Khan, A. Divakaran, H. Sawhney "Affect Analysis in natural human interaction using joint hidden conditional random fields, " Proc of ICME 2013.
    • (2013) Proc of ICME
    • Siddiquie, B.1    Khan, S.2    Divakaran, A.3    Sawhney, H.4
  • 21
    • 84861140327 scopus 로고    scopus 로고
    • Chapter 13 - Psychiatric rating scales
    • F. B. Michael J. Aminoff and F. S. Dick, Eds. Elsevier
    • D. Maust, M. Cristancho, L. Gray, S. Rushing, C. Tjoa, and M. E. Thase, "Chapter 13 - Psychiatric rating scales, " in Handbook of Clinical Neurology, vol. Volume 106, F. B. Michael J. Aminoff and F. S. Dick, Eds. Elsevier, 2012, pp. 227-237.
    • (2012) Handbook of Clinical Neurology , vol.106 , pp. 227-237
    • Maust, D.1    Cristancho, M.2    Gray, L.3    Rushing, S.4    Tjoa, C.5    Thase, M.E.6
  • 24
    • 0029803149 scopus 로고    scopus 로고
    • Comparison of beck depression inventories -ia and -ii in psychiatric outpatients
    • December
    • A. Beck, R. Steer, R. Ball, and W. Ranieri, Comparison of beck depression inventories -ia and -ii in psychiatric outpatients. Journal of Personality Assessment, 67(3):588{97, December 1996.
    • (1996) Journal of Personality Assessment , vol.67 , Issue.3 , pp. 588-597
    • Beck, A.1    Steer, R.2    Ball, R.3    Ranieri, W.4
  • 25
    • 84906260861 scopus 로고    scopus 로고
    • Damped oscillator cepstral coefficients for robust speech recognition
    • V. Mitra, H. Franco, M. Graciarena, "Damped Oscillator Cepstral Coefficients for Robust Speech Recognition, " Proc. of Interspeech, pp. 886-890, 2013.
    • (2013) Proc. of Interspeech , pp. 886-890
    • Mitra, V.1    Franco, H.2    Graciarena, M.3
  • 26
    • 84867589420 scopus 로고    scopus 로고
    • Normalized amplitude modulation features for large vocabulary noise- robust speech recognition
    • V. Mitra, H. Franco, M. Graciarena, A. Mandal, "Normalized Amplitude Modulation Features for Large Vocabulary Noise- Robust Speech Recognition, " Proc. of ICASSP, pp. 4117- 4120, 2012.
    • (2012) Proc. of ICASSP , pp. 4117-4120
    • Mitra, V.1    Franco, H.2    Graciarena, M.3    Mandal, A.4
  • 27
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulations on speech reception
    • R. Drullman, J.M. Festen, R. Plomp, "Effect of Reducing Slow Temporal Modulations on Speech Reception, " J. Acoust. Soc. of Am., Vol. 95, No. 5, pp. 2670-2680, 1994.
    • (1994) J. Acoust. Soc. of Am. , vol.95 , Issue.5 , pp. 2670-2680
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 28
    • 0034844903 scopus 로고    scopus 로고
    • On the upper cutoff frequency of auditory critical-band envelope detectors in the context of speech perception
    • V. Ghitza, "On the Upper Cutoff Frequency of Auditory Critical-Band Envelope Detectors in the Context of Speech Perception, " J. Acoust. Soc. of America, vol. 110, no. 3, pp. 1628-1640, 2001.
    • (2001) J. Acoust. Soc. of America , vol.110 , Issue.3 , pp. 1628-1640
    • Ghitza, V.1
  • 29
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • P. Maragos, J. Kaiser, T. Quatieri, "Energy Separation in Signal Modulations with Application to Speech Analysis, " IEEE Trans. Signal Processing, Vol. 41, pp. 3024-3051, 1993.
    • (1993) IEEE Trans. Signal Processing , vol.41 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.2    Quatieri, T.3
  • 30
    • 84890510678 scopus 로고    scopus 로고
    • Improving speaker identification robustness to highly channel-degraded speech through multiple system fusion
    • M. McLaren, N. Scheffer, M. Graciarena, L. Ferrer and Y. Lei, "Improving speaker identification robustness to highly channel-degraded speech through multiple system fusion", in proc. of ICASSP 2013.
    • (2013) Proc. of ICASSP
    • McLaren, M.1    Scheffer, N.2    Graciarena, M.3    Ferrer, L.4    Lei, Y.5
  • 31
    • 84906217020 scopus 로고    scopus 로고
    • Improving language identification robustness to highly channel-degraded speech through multiple system fusion
    • Lyon
    • A. Lawson, M. McLaren, Y. Lei, V. Mitra, N. Scheffer, L. Ferrer, M. Graciarena, "Improving Language Identification Robustness to Highly Channel-Degraded Speech Through Multiple System Fusion, " in Proc. of Interspeech, pp. 1507- 1510, Lyon, 2013.
    • (2013) Proc. of Interspeech , pp. 1507-1510
    • Lawson, A.1    McLaren, M.2    Lei, Y.3    Mitra, V.4    Scheffer, N.5    Ferrer, L.6    Graciarena, M.7
  • 33
    • 84905269267 scopus 로고    scopus 로고
    • Medium duration modulation cepstral feature for robust speech recognition
    • Florence
    • V. Mitra, H. Franco, M. Graciarena, D. Vergyri, "Medium duration modulation cepstral feature for robust speech recognition, " Proc. of ICASSP, pp. 1768-1772, Florence, 2014.
    • (2014) Proc. of ICASSP , pp. 1768-1772
    • Mitra, V.1    Franco, H.2    Graciarena, M.3    Vergyri, D.4
  • 34
    • 0019075685 scopus 로고
    • Some observations on oral air flow during phonation
    • H. Teager, "Some Observations on Oral Air Flow during Phonation, " IEEE Trans. ASSP, pp. 599-601, 1980.
    • (1980) IEEE Trans. ASSP , pp. 599-601
    • Teager, H.1
  • 35
    • 84905234271 scopus 로고    scopus 로고
    • Articulatory features from deep neural networks and their role in speech recognition
    • Florence
    • V. Mitra, G. Sivaraman, H. Nam, C. Espy-Wilson, E. Saltzman, "Articulatory features from deep neural networks and their role in speech recognition, " Proc. of ICASSP, pp.3041-3045, Florence, 2014.
    • (2014) Proc. of ICASSP , pp. 3041-3045
    • Mitra, V.1    Sivaraman, G.2    Nam, H.3    Espy-Wilson, C.4    Saltzman, E.5
  • 36
    • 79960545035 scopus 로고    scopus 로고
    • Articulatory information for noise robust speech recognition
    • V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, L. Goldstein, "Articulatory Information for Noise Robust Speech Recognition, " IEEE Trans. on ASLP, Vol. 19, Iss. 7, pp. 1913- 1924, 2010.
    • (2010) IEEE Trans. on ASLP , vol.19 , Issue.7 , pp. 1913-1924
    • Mitra, V.1    Nam, H.2    Espy-Wilson, C.3    Saltzman, E.4    Goldstein, L.5
  • 37
    • 70349207706 scopus 로고    scopus 로고
    • TADA: An enhanced, portable task dynamics model in matlab
    • H. Nam, L. Goldstein, E. Saltzman, D. Byrd, "TADA: An enhanced, Portable Task Dynamics Model in Matlab, " J. of Acoust. Soc. Am., 115(5), p. 2430, 2004.
    • (2004) J. of Acoust. Soc. Am. , vol.115 , Issue.5
    • Nam, H.1    Goldstein, L.2    Saltzman, E.3    Byrd, D.4
  • 38
    • 84906248474 scopus 로고    scopus 로고
    • Addressee detection for dialog systems using temporal and spectral dimensions of speaking style
    • E. Shriberg, A. Stolcke, S. Ravuri, "Addressee Detection for Dialog Systems Using Temporal and Spectral Dimensions of Speaking Style, " Proc. of Interspeech, 2013.
    • (2013) Proc. of Interspeech
    • Shriberg, E.1    Stolcke, A.2    Ravuri, S.3
  • 40
    • 84919394117 scopus 로고    scopus 로고
    • url
    • N.C. Yoder, "Peak Finder, " Matlab program, url: http://www.mathworks.com/matlabcentral/fileexchange/25500-peakfinder, 2011.
    • (2011) Peak Finder
    • Yoder, N.C.1
  • 45
    • 70450204125 scopus 로고    scopus 로고
    • Acoustic parameters for the automatic detection of vowel nasalization
    • T. Pruthi, C. Espy-Wilson, "Acoustic parameters for the automatic detection of vowel nasalization, " Proceedings of INTERSPEECH, pp. 1925-1928, 2007.
    • (2007) Proceedings of INTERSPEECH , pp. 1925-1928
    • Pruthi, T.1    Espy-Wilson, C.2
  • 48
    • 84905259009 scopus 로고    scopus 로고
    • Effective use of DCTs for contextualizing features for speaker recognition
    • McLaren M.; Scheffer N.; Ferrer L. & Lei, Y. "Effective use of DCTs for Contextualizing Features for Speaker Recognition, " Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • McLaren, M.1    Scheffer, N.2    Ferrer, L.3    Lei, Y.4
  • 51
    • 80555140075 scopus 로고    scopus 로고
    • Scikit-learn: Machine Learning in Python
    • url
    • Pedregosa et al. "Scikit-learn: Machine Learning in Python, " JMLR 12, pp. 2825-2830, 2011. url: http://scikit-learn.org.
    • (2011) JMLR , vol.12 , pp. 2825-2830
    • Pedregosa1
  • 52
    • 84906252107 scopus 로고    scopus 로고
    • A unified approach for audio characterization and its application to speaker recognition
    • Odyssey 2010, Brno, Czech Republic, Jun
    • L. Ferrer, L. Burget, O. Plchot, and N. Scheffer, "A unified approach for audio characterization and its application to speaker recognition, " in Proc. of the Speaker and Language Recognition Workshop, Odyssey 2010, Brno, Czech Republic, Jun. 2010.
    • (2010) Proc. of the Speaker and Language Recognition Workshop
    • Ferrer, L.1    Burget, L.2    Plchot, O.3    Scheffer, N.4
  • 53
    • 78650977476 scopus 로고    scopus 로고
    • OpenSMILE - The munich versatile and fast open-source audio feature extractor
    • ACM, Florence, Italy, 25.- 29.10.2010
    • F. Eyben, M. Wöllmer, B. Schuller: "openSMILE - The Munich Versatile and Fast Open-Source Audio Feature Extractor", Proc. ACM Multimedia (MM), ACM, Florence, Italy, ISBN 978-1-60558-933-6, pp. 1459-1462, 25.- 29.10.2010.
    • Proc. ACM Multimedia (MM) , pp. 1459-1462
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.