SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 215-219

Ensemble of machine learning and acoustic segment model techniques for speech emotion and autism spectrum disorders recognition

(7) Lee, Hung Yi a Hu, Ting Yao b Jing, How a Chang, Yun Fan a Tsao, Yu a Kao, Yu Cheng c Pao, Tsang Long c

a RESEARCH CENTER FOR INFORMATION TECHNOLOGY INNOVATION (Taiwan)

b NATIONAL TAIWAN UNIVERSITY (Taiwan)

c TATUNG UNIVERSITY (Taiwan)

Author keywords

Autism; Emotion

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS; SPEECH RECOGNITION; SUPPORT VECTOR MACHINES;

ACOUSTIC SEGMENT MODELS; AUTISM; AUTISM SPECTRUM DISORDERS; CLASSIFICATION PERFORMANCE; EMOTION; ENSEMBLE CLASSIFICATION; K NEAREST NEIGHBOURS (K-NN); MACHINE LEARNING TECHNIQUES;

DISEASES;

EID: 84906234329 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (30)

References (42)

1
- 84867332081
- Paralinguistics in speech and languagexstate-of-the-art and the challenge
- B. Schuller, S. Steidl, A. Batliner, F. Burkhardt, L. Devillers, C. Muller, and S. Narayanan, "Paralinguistics in speech and languagexstate-of-the-art and the challenge, " Computer Speech & Language, vol. 27, pp. 4 - 39, 2013.
- (2013) Computer Speech & Language , vol.27 , pp. 4-39
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Burkhardt, F.⁴ Devillers, L.⁵ Muller, C.⁶ Narayanan, S.⁷

2
- 85047302788
- Features and classifiers for emotion recognition from speech: A survey from 2000 to 2011
- C.-N. Anagnostopoulos, T. Iliou, and I. Giannoukos, "Features and classifiers for emotion recognition from speech: A survey from 2000 to 2011, " Artificial Intelligence Review, pp. 1-23, 2012.
- (2012) Artificial Intelligence Review , pp. 1-23
- Anagnostopoulos, C.-N.¹ Iliou, T.² Giannoukos, I.³

3
- 80051631315
- Deep neural networks for acoustic emotion recognition: Raising the benchmarks
- A. Stuhlsatz, C. Meyer, F. Eyben, T. ZieIke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: Raising the benchmarks, " in ICASSP, 2011.
- (2011) ICASSP
- Stuhlsatz, A.¹ Meyer, C.² Eyben, F.³ Zieike, T.⁴ Meier, G.⁵ Schuller, B.⁶

4
- 33750564952
- Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
- T. Vogt and E. Andre, "Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition, " in ICME, 2005.
- (2005) ICME
- Vogt, T.¹ Andre, E.²

5
- 0012745713
- Desperately seeking emotions: Actors, wizards, and human beings
- A. Batliner, K. Fischer, R. Huber, J. Spilker, and E. Noth, "Desperately seeking emotions: Actors, wizards, and human beings, " in Proc. ISCA Workshop on Speech and Emotion, 2000.
- (2000) Proc. ISCA Workshop on Speech and Emotion
- Batliner, A.¹ Fischer, K.² Huber, R.³ Spilker, J.⁴ Noth, E.⁵

6
- 84878403287
- A sequential bayesian dialog agent for computational ethnography
- A. Kazemzadeh, J. Gibson, J. Li, S. Lee, P. Georgiou, and S. Narayanan, "A sequential bayesian dialog agent for computational ethnography, " in Interspeech, 2012.
- (2012) Interspeech
- Kazemzadeh, A.¹ Gibson, J.² Li, J.³ Lee, S.⁴ Georgiou, P.⁵ Narayanan, S.⁶

7
- 84878390748
- A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation
- D. Bone, C.-C. Lee, and S. S. Narayanan, "A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation, " in Interspeech, 2012.
- (2012) Interspeech
- Bone, D.¹ Lee, C.-C.² Narayanan, S.S.³

8
- 85008006613
- A framework for automatic human emotion classification using emotion profiles
- E. Mower, M. Mataricc, and S. Narayanan, "A framework for automatic human emotion classification using emotion profiles, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, pp. 1057-1070, 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , pp. 1057-1070
- Mower, E.¹ Mataricc, M.² Narayanan, S.³

9
- 0142125311
- Prosody in autism spectrum disorders: A critical review
- J. McCann and S. Peppe, "Prosody in autism spectrum disorders: A critical review, " International Journal of Language & Communication Disorders, vol. 38(4), pp. 325-350, 2003.
- (2003) International Journal of Language & Communication Disorders , vol.38 , Issue.4 , pp. 325-350
- McCann, J.¹ Peppe, S.²

10
- 77954366803
- Computational prosodic markers for autism
- J. van Santen, E. Prudhommeaux, L. Black, and M. Mitchell, "Computational prosodic markers for autism, " Autism, vol. 14, pp. 215-236, 2010.
- (2010) Autism , vol.14 , pp. 215-236
- Van Santen, J.¹ Prudhommeaux, E.² Black, L.³ Mitchell, M.⁴

11
- 84878393217
- Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist
- D. Bone, M. P. Black, C.-C. Lee, M. E.Williams, P. Levitt, S. Lee, and S. S. Narayanan, "Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist, " in Interspeech, 2012.
- (2012) Interspeech
- Bone, D.¹ Black, M.P.² Lee, C.-C.³ Williams, M.E.⁴ Levitt, P.⁵ Lee, S.⁶ Narayanan, S.S.⁷

12
- 84878383416
- Contrastive intonation in autism: The effect of speaker- And listener-perspective
- C. Kaland, E. Krahmer, and M. Swerts, "Contrastive intonation in autism: The effect of speaker- And listener-perspective, " in Interspeech, 2012.
- (2012) Interspeech
- Kaland, C.¹ Krahmer, E.² Swerts, M.³

13
- 84878411630
- Interactions between turn-taking gaps, disfluencies and social obligation
- R. Lunsford, P. A. Heeman, and J. P. H. van Santen, "Interactions between turn-taking gaps, disfluencies and social obligation, " in Interspeech, 2012.
- (2012) Interspeech
- Lunsford, R.¹ Heeman, P.A.² Van Santen, J.P.H.³

14
- 84878379006
- On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and atypical development (AD)
- M. Swerts and C. de Bie, "On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and atypical development (AD), " in Interspeech, 2012.
- (2012) Interspeech
- Swerts, M.¹ De Bie, C.²

15
- 84878421621
- Quantitative analysis of pitch in speech of children with neurodevelopmental disorders
- G. Kiss, J. P. van Santen, E. Prudhommeaux, and L. M. Black, "Quantitative analysis of pitch in speech of children with neurodevelopmental disorders, " in Interspeech, 2012.
- (2012) Interspeech
- Kiss, G.¹ Santen, J.P.V.² Prudhommeaux, E.³ Black, L.M.⁴

16
- 84906269266
- The interspeech 2013 computational paralinguistics challenge: Social signals, conflict, emotion, autism
- B. Schuller, S. Steidl, A. Batliner, A. Vinciarelli, K. Scherer, F. Ringeval, M. Chetouani, F. Weninger, F. Eyben, E. Marchi, M. Mortillaro, H. Salamin, A. Polychroniou, F. Valente, and S. Kim, "The interspeech 2013 computational paralinguistics challenge: Social signals, conflict, emotion, autism, " in Interspeech, 2013.
- (2013) Interspeech
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Vinciarelli, A.⁴ Scherer, K.⁵ Ringeval, F.⁶ Chetouani, M.⁷ Weninger, F.⁸ Eyben, F.⁹ Marchi, E.¹⁰ Mortillaro, M.¹¹ Salamin, H.¹² Polychroniou, A.¹³ Valente, F.¹⁴ Kim, S.¹⁵

17
- 0010442827
- On the algorithmic implementation of multiclass kernel-based vector machines
- K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines, " J. Mach. Learn. Res., vol. 2, pp. 265-292, 2002.
- (2002) J. Mach. Learn. Res. , vol.2 , pp. 265-292
- Crammer, K.¹ Singer, Y.²

18
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets, " Neural Comput., vol. 18, pp. 1527- 1554, 2006.
- (2006) Neural Comput , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.-W.³

19
- 84867720412
- G. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Improving neural networks by preventing coadaptation of feature detectors.
- Improving Neural Networks by Preventing Coadaptation of Feature Detectors
- Hinton, G.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

20
- 77649319843
- Performance evaluation of different weighting schemes on knn-based emotion recognition in mandarin speech
- T. L. Pao, Y. M. Cheng, Y. T. Chen, and J. H. Yeh, "Performance evaluation of different weighting schemes on knn-based emotion recognition in mandarin speech, " International Journal of Information Acquisition, vol. 4, pp. 339 - 346, 2007.
- (2007) International Journal of Information Acquisition , vol.4 , pp. 339-346
- Pao, T.L.¹ Cheng, Y.M.² Chen, Y.T.³ Yeh, J.H.⁴

21
- 0023800699
- A segment model based approach to speech recognition
- C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition, " in ICASSP, 1988.
- (1988) ICASSP
- Lee, C.-H.¹ Soong, F.² Juang, B.-H.³

22
- 34547502608
- A vector space modeling approach to spoken language identification
- H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, pp. 271-284, 2007.
- (2007) Audio, Speech, and Language Processing, IEEE Transactions on , vol.15 , pp. 271-284
- Li, H.¹ Ma, B.² Lee, C.-H.³

23
- 84873444148
- A study on music genre classification based on universal acoustic models
- J. Reed, "A study on music genre classification based on universal acoustic models, " in ISMIR, 2006.
- (2006) ISMIR
- Reed, J.¹

24
- 78049411640
- An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
- Y. Tsao, H. Sun, H. Li, and C.-H. Lee, "An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition, " in ICASSP, 2010.
- (2010) ICASSP
- Tsao, Y.¹ Sun, H.² Li, H.³ Lee, C.-H.⁴

25
- 70449646765
- Acoustic segment modeling for speaker recognition
- B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition, " in ICME, 2009.
- (2009) ICME
- Ma, B.¹ Zhu, D.² Li, H.³

26
- 79959819374
- Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
- M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision, " in Interspeech, 2010.
- (2010) Interspeech
- Siu, M.-H.¹ Gish, H.² Chan, A.³ Belfield, W.⁴

27
- 84858975943
- Topic modeling for spoken documents using only phonetic information
- T. J. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only phonetic information, " in ASRU, 2011.
- (2011) ASRU
- Hazen, T.J.¹ Siu, M.-H.² Gish, H.³ Lowe, S.⁴ Chan, A.⁵

28
- 70450158585
- Unsupervised training of an hmm-based speech recognizer for topic classification
- H. Gish, M. hung Siu, and A. C. amd William Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification, " in Interspeech, 2009.
- (2009) Interspeech
- Gish, H.¹ Siu, M.H.² Belfield, A.C.A.W.³

29
- 84865744986
- Unsupervised learning of acoustic unit descriptors for audio content representation and classification
- S. Chaudhuri, M. Harvilla, and B. Raj, "Unsupervised learning of acoustic unit descriptors for audio content representation and classification, " in Interspeech, 2011.
- (2011) Interspeech
- Chaudhuri, S.¹ Harvilla, M.² Raj, B.³

30
- 84890511750
- Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns
- H.-Y. Lee, Y.-C. Li, C.-T. Chung, and L. shan Lee, "Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns, " in ICASSP, 2013.
- (2013) ICASSP
- Lee, H.-Y.¹ Li, Y.-C.² Chung, C.-T.³ Lee, L.S.⁴

31
- 84867809023
- A nonparametric Bayesian approach to acoustic model discovery
- C.-Y. Lee and J. Glass, "A nonparametric bayesian approach to acoustic model discovery, " in ACL, 2012.
- (2012) ACL
- Lee, C.-Y.¹ Glass, J.²

32
- 84867600320
- An acoustic segment modeling approach to query-by-example spoken term detection
- H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection, " in ICASSP, 2012.
- (2012) ICASSP
- Wang, H.¹ Leung, C.-C.² Lee, T.³ Ma, B.⁴ Li, H.⁵

33
- 77949578539
- A text retrieval approach to content-based audio retrieval
- M. Riley, E. Heinen, and J. Ghosh, "A text retrieval approach to content-based audio retrieval, " in ISMIR, 2008.
- (2008) ISMIR
- Riley, M.¹ Heinen, E.² Ghosh, J.³

34
- 0023211850
- On the automatic segmentation of speech signals
- T. Svendsen and F. Soong, "On the automatic segmentation of speech signals, " in ICASSP, 1987.
- (1987) ICASSP
- Svendsen, T.¹ Soong, F.²

35
- 84890479779
- Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization
- C.-T. Chung, C.-A. Chan, and L.-S. Lee, "Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization, " in ICASSP, 2013.
- (2013) ICASSP
- Chung, C.-T.¹ Chan, C.-A.² Lee, L.-S.³

36
- 78650043038
- UBM based speaker selection and model re-estimation for speaker adaptation
- J.Wang, J. Guo, G. Liu, and J. Lei, "UBM based speaker selection and model re-estimation for speaker adaptation, " in ICCI, vol. 2, 2006, pp. 856-860.
- (2006) ICCI , vol.2 , pp. 856-860
- Wang, J.¹ Guo, J.² Liu, G.³ Lei, J.⁴

37
- 4944228528
- A practical guide to support vector classification
- C.-W. Hsu, C.-C. Chang, and C.-J. Lin, "A practical guide to support vector classification, " National Taiwan University, Tech. Rep., 2003.
- (2003) National Taiwan University, Tech. Rep.
- Hsu, C.-W.¹ Chang, C.-C.² Lin, C.-J.³

38
- 84906270598
- http://svmlight.joachims.org/.

39
- 14344250451
- Support vector machine learning for interdependent and structured output spaces
- I. Tsochantaridis, T. Hofmann, T. Joachims, and Y. Altun, "Support vector machine learning for interdependent and structured output spaces, " in Proceedings of the twenty-first international conference on Machine learning, 2004.
- (2004) Proceedings of the Twenty-first International Conference on Machine Learning
- Tsochantaridis, I.¹ Hofmann, T.² Joachims, T.³ Altun, Y.⁴

40
- 84893584920
- Master's thesis, Technical University of Denmark
- R. B. Palm, "Prediction as a candidate for learning deep hierarchical models of data, " Master's thesis, Technical University of Denmark, 2012.
- (2012) Prediction as a Candidate for Learning Deep Hierarchical Models of Data
- Palm, R.B.¹

41
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space, " Speech and Audio Processing, IEEE Transactions on, vol. 8, pp. 695-707, 2000.
- (2000) Speech and Audio Processing, IEEE Transactions on , vol.8 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

42
- 67651177785
- An ensemble speaker and speaking environment modeling approach to robust speech recognition
- Y. Tsao and C.-H. Lee, "An ensemble speaker and speaking environment modeling approach to robust speech recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, pp. 1025-1037, 2009.
- (2009) Audio, Speech, and Language Processing, IEEE Transactions on , vol.17 , pp. 1025-1037
- Tsao, Y.¹ Lee, C.-H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.