메뉴 건너뛰기




Volumn , Issue , 2013, Pages 215-219

Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition

Author keywords

Autism; Emotion

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS; SPEECH RECOGNITION; SUPPORT VECTOR MACHINES;

EID: 84906234329     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (30)

References (42)
  • 2
    • 85047302788 scopus 로고    scopus 로고
    • Features and classifiers for emotion recognition from speech: A survey from 2000 to 2011
    • C.-N. Anagnostopoulos, T. Iliou, and I. Giannoukos, "Features and classifiers for emotion recognition from speech: A survey from 2000 to 2011, " Artificial Intelligence Review, pp. 1-23, 2012.
    • (2012) Artificial Intelligence Review , pp. 1-23
    • Anagnostopoulos, C.-N.1    Iliou, T.2    Giannoukos, I.3
  • 3
    • 80051631315 scopus 로고    scopus 로고
    • Deep neural networks for acoustic emotion recognition: Raising the benchmarks
    • A. Stuhlsatz, C. Meyer, F. Eyben, T. ZieIke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: Raising the benchmarks, " in ICASSP, 2011.
    • (2011) ICASSP
    • Stuhlsatz, A.1    Meyer, C.2    Eyben, F.3    Zieike, T.4    Meier, G.5    Schuller, B.6
  • 4
    • 33750564952 scopus 로고    scopus 로고
    • Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
    • T. Vogt and E. Andre, "Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition, " in ICME, 2005.
    • (2005) ICME
    • Vogt, T.1    Andre, E.2
  • 7
    • 84878390748 scopus 로고    scopus 로고
    • A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation
    • D. Bone, C.-C. Lee, and S. S. Narayanan, "A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation, " in Interspeech, 2012.
    • (2012) Interspeech
    • Bone, D.1    Lee, C.-C.2    Narayanan, S.S.3
  • 10
  • 11
    • 84878393217 scopus 로고    scopus 로고
    • Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist
    • D. Bone, M. P. Black, C.-C. Lee, M. E.Williams, P. Levitt, S. Lee, and S. S. Narayanan, "Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist, " in Interspeech, 2012.
    • (2012) Interspeech
    • Bone, D.1    Black, M.P.2    Lee, C.-C.3    Williams, M.E.4    Levitt, P.5    Lee, S.6    Narayanan, S.S.7
  • 12
    • 84878383416 scopus 로고    scopus 로고
    • Contrastive intonation in autism: The effect of speaker- And listener-perspective
    • C. Kaland, E. Krahmer, and M. Swerts, "Contrastive intonation in autism: The effect of speaker- And listener-perspective, " in Interspeech, 2012.
    • (2012) Interspeech
    • Kaland, C.1    Krahmer, E.2    Swerts, M.3
  • 13
    • 84878411630 scopus 로고    scopus 로고
    • Interactions between turn-taking gaps, disfluencies and social obligation
    • R. Lunsford, P. A. Heeman, and J. P. H. van Santen, "Interactions between turn-taking gaps, disfluencies and social obligation, " in Interspeech, 2012.
    • (2012) Interspeech
    • Lunsford, R.1    Heeman, P.A.2    Van Santen, J.P.H.3
  • 14
    • 84878379006 scopus 로고    scopus 로고
    • On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and atypical development (AD)
    • M. Swerts and C. de Bie, "On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and atypical development (AD), " in Interspeech, 2012.
    • (2012) Interspeech
    • Swerts, M.1    De Bie, C.2
  • 15
    • 84878421621 scopus 로고    scopus 로고
    • Quantitative analysis of pitch in speech of children with neurodevelopmental disorders
    • G. Kiss, J. P. van Santen, E. Prudhommeaux, and L. M. Black, "Quantitative analysis of pitch in speech of children with neurodevelopmental disorders, " in Interspeech, 2012.
    • (2012) Interspeech
    • Kiss, G.1    Santen, J.P.V.2    Prudhommeaux, E.3    Black, L.M.4
  • 17
    • 0010442827 scopus 로고    scopus 로고
    • On the algorithmic implementation of multiclass kernel-based vector machines
    • K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines, " J. Mach. Learn. Res., vol. 2, pp. 265-292, 2002.
    • (2002) J. Mach. Learn. Res. , vol.2 , pp. 265-292
    • Crammer, K.1    Singer, Y.2
  • 18
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets, " Neural Comput., vol. 18, pp. 1527- 1554, 2006.
    • (2006) Neural Comput , vol.18 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 20
    • 77649319843 scopus 로고    scopus 로고
    • Performance evaluation of different weighting schemes on knn-based emotion recognition in mandarin speech
    • T. L. Pao, Y. M. Cheng, Y. T. Chen, and J. H. Yeh, "Performance evaluation of different weighting schemes on knn-based emotion recognition in mandarin speech, " International Journal of Information Acquisition, vol. 4, pp. 339 - 346, 2007.
    • (2007) International Journal of Information Acquisition , vol.4 , pp. 339-346
    • Pao, T.L.1    Cheng, Y.M.2    Chen, Y.T.3    Yeh, J.H.4
  • 21
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition, " in ICASSP, 1988.
    • (1988) ICASSP
    • Lee, C.-H.1    Soong, F.2    Juang, B.-H.3
  • 23
    • 84873444148 scopus 로고    scopus 로고
    • A study on music genre classification based on universal acoustic models
    • J. Reed, "A study on music genre classification based on universal acoustic models, " in ISMIR, 2006.
    • (2006) ISMIR
    • Reed, J.1
  • 24
    • 78049411640 scopus 로고    scopus 로고
    • An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
    • Y. Tsao, H. Sun, H. Li, and C.-H. Lee, "An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition, " in ICASSP, 2010.
    • (2010) ICASSP
    • Tsao, Y.1    Sun, H.2    Li, H.3    Lee, C.-H.4
  • 25
    • 70449646765 scopus 로고    scopus 로고
    • Acoustic segment modeling for speaker recognition
    • B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition, " in ICME, 2009.
    • (2009) ICME
    • Ma, B.1    Zhu, D.2    Li, H.3
  • 26
    • 79959819374 scopus 로고    scopus 로고
    • Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
    • M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision, " in Interspeech, 2010.
    • (2010) Interspeech
    • Siu, M.-H.1    Gish, H.2    Chan, A.3    Belfield, W.4
  • 27
    • 84858975943 scopus 로고    scopus 로고
    • Topic modeling for spoken documents using only phonetic information
    • T. J. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only phonetic information, " in ASRU, 2011.
    • (2011) ASRU
    • Hazen, T.J.1    Siu, M.-H.2    Gish, H.3    Lowe, S.4    Chan, A.5
  • 28
    • 70450158585 scopus 로고    scopus 로고
    • Unsupervised training of an hmm-based speech recognizer for topic classification
    • H. Gish, M. hung Siu, and A. C. amd William Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification, " in Interspeech, 2009.
    • (2009) Interspeech
    • Gish, H.1    Siu, M.H.2    Belfield, A.C.A.W.3
  • 29
    • 84865744986 scopus 로고    scopus 로고
    • Unsupervised learning of acoustic unit descriptors for audio content representation and classification
    • S. Chaudhuri, M. Harvilla, and B. Raj, "Unsupervised learning of acoustic unit descriptors for audio content representation and classification, " in Interspeech, 2011.
    • (2011) Interspeech
    • Chaudhuri, S.1    Harvilla, M.2    Raj, B.3
  • 30
    • 84890511750 scopus 로고    scopus 로고
    • Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns
    • H.-Y. Lee, Y.-C. Li, C.-T. Chung, and L. shan Lee, "Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns, " in ICASSP, 2013.
    • (2013) ICASSP
    • Lee, H.-Y.1    Li, Y.-C.2    Chung, C.-T.3    Lee, L.S.4
  • 31
    • 84867809023 scopus 로고    scopus 로고
    • A nonparametric Bayesian approach to acoustic model discovery
    • C.-Y. Lee and J. Glass, "A nonparametric bayesian approach to acoustic model discovery, " in ACL, 2012.
    • (2012) ACL
    • Lee, C.-Y.1    Glass, J.2
  • 32
    • 84867600320 scopus 로고    scopus 로고
    • An acoustic segment modeling approach to query-by-example spoken term detection
    • H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection, " in ICASSP, 2012.
    • (2012) ICASSP
    • Wang, H.1    Leung, C.-C.2    Lee, T.3    Ma, B.4    Li, H.5
  • 33
    • 77949578539 scopus 로고    scopus 로고
    • A text retrieval approach to content-based audio retrieval
    • M. Riley, E. Heinen, and J. Ghosh, "A text retrieval approach to content-based audio retrieval, " in ISMIR, 2008.
    • (2008) ISMIR
    • Riley, M.1    Heinen, E.2    Ghosh, J.3
  • 34
    • 0023211850 scopus 로고
    • On the automatic segmentation of speech signals
    • T. Svendsen and F. Soong, "On the automatic segmentation of speech signals, " in ICASSP, 1987.
    • (1987) ICASSP
    • Svendsen, T.1    Soong, F.2
  • 35
    • 84890479779 scopus 로고    scopus 로고
    • Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization
    • C.-T. Chung, C.-A. Chan, and L.-S. Lee, "Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization, " in ICASSP, 2013.
    • (2013) ICASSP
    • Chung, C.-T.1    Chan, C.-A.2    Lee, L.-S.3
  • 36
    • 78650043038 scopus 로고    scopus 로고
    • UBM based speaker selection and model re-estimation for speaker adaptation
    • J.Wang, J. Guo, G. Liu, and J. Lei, "UBM based speaker selection and model re-estimation for speaker adaptation, " in ICCI, vol. 2, 2006, pp. 856-860.
    • (2006) ICCI , vol.2 , pp. 856-860
    • Wang, J.1    Guo, J.2    Liu, G.3    Lei, J.4
  • 38
    • 84906270598 scopus 로고    scopus 로고
    • http://svmlight.joachims.org/.
  • 42
    • 67651177785 scopus 로고    scopus 로고
    • An ensemble speaker and speaking environment modeling approach to robust speech recognition
    • Y. Tsao and C.-H. Lee, "An ensemble speaker and speaking environment modeling approach to robust speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, pp. 1025-1037, 2009.
    • (2009) Audio, Speech, and Language Processing, IEEE Transactions on , vol.17 , pp. 1025-1037
    • Tsao, Y.1    Lee, C.-H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.