-
1
-
-
84867332081
-
Paralinguistics in speech and language: State-of-the-art and the challenge
-
B. Schuller, S. Steidl, A. Batliner, F. Burkhardt, L. Devillers, C. Muller, and S. Narayanan, "Paralinguistics in speech and language: State-of-the-art and the challenge," Computer Speech & Language, vol. 27, pp. 4-39, 2013.
-
(2013)
Computer Speech & Language
, vol.27
, pp. 4-39
-
-
Schuller, B.1
Steidl, S.2
Batliner, A.3
Burkhardt, F.4
Devillers, L.5
Muller, C.6
Narayanan, S.7
-
2
-
-
85047302788
-
Features and classifiers for emotion recognition from speech: A survey from 2000 to 2011
-
C.-N. Anagnostopoulos, T. Iliou, and I. Giannoukos, "Features and classifiers for emotion recognition from speech: A survey from 2000 to 2011," Artificial Intelligence Review, pp. 1-23, 2012.
-
(2012)
Artificial Intelligence Review
, pp. 1-23
-
-
Anagnostopoulos, C.-N.1
Iliou, T.2
Giannoukos, I.3
-
3
-
-
80051631315
-
Deep neural networks for acoustic emotion recognition: Raising the benchmarks
-
A. Stuhlsatz, C. Meyer, F. Eyben, T. Zielke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: Raising the benchmarks," in ICASSP, 2011.
-
(2011)
ICASSP
-
-
Stuhlsatz, A.1
Meyer, C.2
Eyben, F.3
Zielke, T.4
Meier, G.5
Schuller, B.6
-
4
-
-
33750564952
-
Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
-
T. Vogt and E. Andre, "Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition," in ICME, 2005.
-
(2005)
ICME
-
-
Vogt, T.1
Andre, E.2
-
5
-
-
0012745713
-
Desperately seeking emotions: Actors, wizards, and human beings
-
A. Batliner, K. Fischer, R. Huber, J. Spilker, and E. Noth, "Desperately seeking emotions: Actors, wizards, and human beings," in Proc. ISCA Workshop on Speech and Emotion, 2000.
-
(2000)
Proc. ISCA Workshop on Speech and Emotion
-
-
Batliner, A.1
Fischer, K.2
Huber, R.3
Spilker, J.4
Noth, E.5
-
6
-
-
84878403287
-
A sequential Bayesian dialog agent for computational ethnography
-
A. Kazemzadeh, J. Gibson, J. Li, S. Lee, P. Georgiou, and S. Narayanan, "A sequential Bayesian dialog agent for computational ethnography," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Kazemzadeh, A.1
Gibson, J.2
Li, J.3
Lee, S.4
Georgiou, P.5
Narayanan, S.6
-
7
-
-
84878390748
-
A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation
-
D. Bone, C.-C. Lee, and S. S. Narayanan, "A robust unsupervised arousal rating framework using prosody with cross-corpora evaluation," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Bone, D.1
Lee, C.-C.2
Narayanan, S.S.3
-
8
-
-
85008006613
-
A framework for automatic human emotion classification using emotion profiles
-
E. Mower, M. Matarić, and S. Narayanan, "A framework for automatic human emotion classification using emotion profiles," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, pp. 1057-1070, 2011.
-
(2011)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.19
, pp. 1057-1070
-
-
Mower, E.1
Matarić, M.2
Narayanan, S.3
-
9
-
-
0142125311
-
Prosody in autism spectrum disorders: A critical review
-
J. McCann and S. Peppe, "Prosody in autism spectrum disorders: A critical review," International Journal of Language & Communication Disorders, vol. 38(4), pp. 325-350, 2003.
-
(2003)
International Journal of Language & Communication Disorders
, vol.38
, Issue.4
, pp. 325-350
-
-
McCann, J.1
Peppe, S.2
-
10
-
-
77954366803
-
Computational prosodic markers for autism
-
J. van Santen, E. Prudhommeaux, L. Black, and M. Mitchell, "Computational prosodic markers for autism," Autism, vol. 14, pp. 215-236, 2010.
-
(2010)
Autism
, vol.14
, pp. 215-236
-
-
Van Santen, J.1
Prudhommeaux, E.2
Black, L.3
Mitchell, M.4
-
11
-
-
84878393217
-
Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist
-
D. Bone, M. P. Black, C.-C. Lee, M. E. Williams, P. Levitt, S. Lee, and S. S. Narayanan, "Spontaneous-speech acoustic-prosodic features of children with autism and the interacting psychologist," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Bone, D.1
Black, M.P.2
Lee, C.-C.3
Williams, M.E.4
Levitt, P.5
Lee, S.6
Narayanan, S.S.7
-
12
-
-
84878383416
-
Contrastive intonation in autism: The effect of speaker- and listener-perspective
-
C. Kaland, E. Krahmer, and M. Swerts, "Contrastive intonation in autism: The effect of speaker- and listener-perspective," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Kaland, C.1
Krahmer, E.2
Swerts, M.3
-
13
-
-
84878411630
-
Interactions between turn-taking gaps, disfluencies and social obligation
-
R. Lunsford, P. A. Heeman, and J. P. H. van Santen, "Interactions between turn-taking gaps, disfluencies and social obligation," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Lunsford, R.1
Heeman, P.A.2
Van Santen, J.P.H.3
-
14
-
-
84878379006
-
On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and atypical development (AD)
-
M. Swerts and C. de Bie, "On the assessment of audiovisual cues to speaker confidence by preteens with typical development (TD) and atypical development (AD)," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Swerts, M.1
De Bie, C.2
-
15
-
-
84878421621
-
Quantitative analysis of pitch in speech of children with neurodevelopmental disorders
-
G. Kiss, J. P. van Santen, E. Prudhommeaux, and L. M. Black, "Quantitative analysis of pitch in speech of children with neurodevelopmental disorders," in Interspeech, 2012.
-
(2012)
Interspeech
-
-
Kiss, G.1
Van Santen, J.P.2
Prudhommeaux, E.3
Black, L.M.4
-
16
-
-
84906269266
-
The Interspeech 2013 computational paralinguistics challenge: Social signals, conflict, emotion, autism
-
B. Schuller, S. Steidl, A. Batliner, A. Vinciarelli, K. Scherer, F. Ringeval, M. Chetouani, F. Weninger, F. Eyben, E. Marchi, M. Mortillaro, H. Salamin, A. Polychroniou, F. Valente, and S. Kim, "The Interspeech 2013 computational paralinguistics challenge: Social signals, conflict, emotion, autism," in Interspeech, 2013.
-
(2013)
Interspeech
-
-
Schuller, B.1
Steidl, S.2
Batliner, A.3
Vinciarelli, A.4
Scherer, K.5
Ringeval, F.6
Chetouani, M.7
Weninger, F.8
Eyben, F.9
Marchi, E.10
Mortillaro, M.11
Salamin, H.12
Polychroniou, A.13
Valente, F.14
Kim, S.15
-
17
-
-
0010442827
-
On the algorithmic implementation of multiclass kernel-based vector machines
-
K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines," J. Mach. Learn. Res., vol. 2, pp. 265-292, 2002.
-
(2002)
J. Mach. Learn. Res.
, vol.2
, pp. 265-292
-
-
Crammer, K.1
Singer, Y.2
-
18
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, pp. 1527-1554, 2006.
-
(2006)
Neural Comput.
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.-W.3
-
19
-
-
84867720412
-
-
G. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors."
-
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
-
-
Hinton, G.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
20
-
-
77649319843
-
Performance evaluation of different weighting schemes on KNN-based emotion recognition in Mandarin speech
-
T. L. Pao, Y. M. Cheng, Y. T. Chen, and J. H. Yeh, "Performance evaluation of different weighting schemes on KNN-based emotion recognition in Mandarin speech," International Journal of Information Acquisition, vol. 4, pp. 339-346, 2007.
-
(2007)
International Journal of Information Acquisition
, vol.4
, pp. 339-346
-
-
Pao, T.L.1
Cheng, Y.M.2
Chen, Y.T.3
Yeh, J.H.4
-
21
-
-
0023800699
-
A segment model based approach to speech recognition
-
C.-H. Lee, F. Soong, and B.-H. Juang, "A segment model based approach to speech recognition," in ICASSP, 1988.
-
(1988)
ICASSP
-
-
Lee, C.-H.1
Soong, F.2
Juang, B.-H.3
-
22
-
-
34547502608
-
A vector space modeling approach to spoken language identification
-
H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, pp. 271-284, 2007.
-
(2007)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.15
, pp. 271-284
-
-
Li, H.1
Ma, B.2
Lee, C.-H.3
-
23
-
-
84873444148
-
A study on music genre classification based on universal acoustic models
-
J. Reed, "A study on music genre classification based on universal acoustic models," in ISMIR, 2006.
-
(2006)
ISMIR
-
-
Reed, J.1
-
24
-
-
78049411640
-
An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
-
Y. Tsao, H. Sun, H. Li, and C.-H. Lee, "An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition," in ICASSP, 2010.
-
(2010)
ICASSP
-
-
Tsao, Y.1
Sun, H.2
Li, H.3
Lee, C.-H.4
-
25
-
-
70449646765
-
Acoustic segment modeling for speaker recognition
-
B. Ma, D. Zhu, and H. Li, "Acoustic segment modeling for speaker recognition," in ICME, 2009.
-
(2009)
ICME
-
-
Ma, B.1
Zhu, D.2
Li, H.3
-
26
-
-
79959819374
-
Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision
-
M.-H. Siu, H. Gish, A. Chan, and W. Belfield, "Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision," in Interspeech, 2010.
-
(2010)
Interspeech
-
-
Siu, M.-H.1
Gish, H.2
Chan, A.3
Belfield, W.4
-
27
-
-
84858975943
-
Topic modeling for spoken documents using only phonetic information
-
T. J. Hazen, M.-H. Siu, H. Gish, S. Lowe, and A. Chan, "Topic modeling for spoken documents using only phonetic information," in ASRU, 2011.
-
(2011)
ASRU
-
-
Hazen, T.J.1
Siu, M.-H.2
Gish, H.3
Lowe, S.4
Chan, A.5
-
28
-
-
70450158585
-
Unsupervised training of an HMM-based speech recognizer for topic classification
-
H. Gish, M.-H. Siu, A. Chan, and W. Belfield, "Unsupervised training of an HMM-based speech recognizer for topic classification," in Interspeech, 2009.
-
(2009)
Interspeech
-
-
Gish, H.1
Siu, M.-H.2
Chan, A.3
Belfield, W.4
-
29
-
-
84865744986
-
Unsupervised learning of acoustic unit descriptors for audio content representation and classification
-
S. Chaudhuri, M. Harvilla, and B. Raj, "Unsupervised learning of acoustic unit descriptors for audio content representation and classification," in Interspeech, 2011.
-
(2011)
Interspeech
-
-
Chaudhuri, S.1
Harvilla, M.2
Raj, B.3
-
30
-
-
84890511750
-
Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns
-
H.-Y. Lee, Y.-C. Li, C.-T. Chung, and L.-S. Lee, "Enhancing query expansion for semantic retrieval of spoken content with automatically discovered acoustic patterns," in ICASSP, 2013.
-
(2013)
ICASSP
-
-
Lee, H.-Y.1
Li, Y.-C.2
Chung, C.-T.3
Lee, L.-S.4
-
31
-
-
84867809023
-
A nonparametric Bayesian approach to acoustic model discovery
-
C.-Y. Lee and J. Glass, "A nonparametric Bayesian approach to acoustic model discovery," in ACL, 2012.
-
(2012)
ACL
-
-
Lee, C.-Y.1
Glass, J.2
-
32
-
-
84867600320
-
An acoustic segment modeling approach to query-by-example spoken term detection
-
H. Wang, C.-C. Leung, T. Lee, B. Ma, and H. Li, "An acoustic segment modeling approach to query-by-example spoken term detection," in ICASSP, 2012.
-
(2012)
ICASSP
-
-
Wang, H.1
Leung, C.-C.2
Lee, T.3
Ma, B.4
Li, H.5
-
33
-
-
77949578539
-
A text retrieval approach to content-based audio retrieval
-
M. Riley, E. Heinen, and J. Ghosh, "A text retrieval approach to content-based audio retrieval," in ISMIR, 2008.
-
(2008)
ISMIR
-
-
Riley, M.1
Heinen, E.2
Ghosh, J.3
-
34
-
-
0023211850
-
On the automatic segmentation of speech signals
-
T. Svendsen and F. Soong, "On the automatic segmentation of speech signals," in ICASSP, 1987.
-
(1987)
ICASSP
-
-
Svendsen, T.1
Soong, F.2
-
35
-
-
84890479779
-
Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization
-
C.-T. Chung, C.-A. Chan, and L.-S. Lee, "Unsupervised discovery of linguistic structure including two-level acoustic patterns using three cascaded stages of iterative optimization," in ICASSP, 2013.
-
(2013)
ICASSP
-
-
Chung, C.-T.1
Chan, C.-A.2
Lee, L.-S.3
-
36
-
-
78650043038
-
UBM based speaker selection and model re-estimation for speaker adaptation
-
J. Wang, J. Guo, G. Liu, and J. Lei, "UBM based speaker selection and model re-estimation for speaker adaptation," in ICCI, vol. 2, 2006, pp. 856-860.
-
(2006)
ICCI
, vol.2
, pp. 856-860
-
-
Wang, J.1
Guo, J.2
Liu, G.3
Lei, J.4
-
37
-
-
4944228528
-
A practical guide to support vector classification
-
C.-W. Hsu, C.-C. Chang, and C.-J. Lin, "A practical guide to support vector classification," National Taiwan University, Tech. Rep., 2003.
-
(2003)
National Taiwan University, Tech. Rep.
-
-
Hsu, C.-W.1
Chang, C.-C.2
Lin, C.-J.3
-
38
-
-
84906270598
-
-
http://svmlight.joachims.org/.
-
-
-
-
39
-
-
14344250451
-
Support vector machine learning for interdependent and structured output spaces
-
I. Tsochantaridis, T. Hofmann, T. Joachims, and Y. Altun, "Support vector machine learning for interdependent and structured output spaces," in Proceedings of the Twenty-first International Conference on Machine Learning, 2004.
-
(2004)
Proceedings of the Twenty-first International Conference on Machine Learning
-
-
Tsochantaridis, I.1
Hofmann, T.2
Joachims, T.3
Altun, Y.4
-
41
-
-
0034320005
-
Rapid speaker adaptation in eigenvoice space
-
R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space," Speech and Audio Processing, IEEE Transactions on, vol. 8, pp. 695-707, 2000.
-
(2000)
Speech and Audio Processing, IEEE Transactions on
, vol.8
, pp. 695-707
-
-
Kuhn, R.1
Junqua, J.-C.2
Nguyen, P.3
Niedzielski, N.4
-
42
-
-
67651177785
-
An ensemble speaker and speaking environment modeling approach to robust speech recognition
-
Y. Tsao and C.-H. Lee, "An ensemble speaker and speaking environment modeling approach to robust speech recognition," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, pp. 1025-1037, 2009.
-
(2009)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.17
, pp. 1025-1037
-
-
Tsao, Y.1
Lee, C.-H.2