-
3
-
-
34547505647
-
Combining efforts for improving automatic classification of emotional user states
-
Ljubljana (Slovenia)
-
Anton Batliner, Stefan Steidl, Bjrn Schuller, Dino Seppi, Kornel Laskowski, Thurid Vogt, Laurence Devillers, Laurence Vidrascu, Noam Amir, Loic Kessous, and Vered Aharonson. 2006. Combining efforts for improving automatic classification of emotional user states. In Information Society - Language TechnologiesConference (IS-LTC), pages 240-245, Ljubljana (Slovenia).
-
(2006)
Information Society - Language TechnologiesConference (IS-LTC)
, pp. 240-245
-
-
Batliner, A.1
Steidl, S.2
Schuller, B.3
Seppi, D.4
Laskowski, K.5
Vogt, T.6
Devillers, L.7
Vidrascu, L.8
Amir, N.9
Kessous, L.10
Aharonson, V.11
-
5
-
-
34547958553
-
Multistyle classification of speech under stress using feature subset selection based on genetic algorithms
-
Salvatore Casale, Alessandra Russo, and Salvatore Serano. 2007. Multistyle classification of speech under stress using feature subset selection based on genetic algorithms. Speech Communication, 49(10):801-810.
-
(2007)
Speech Communication
, vol.49
, Issue.10
, pp. 801-810
-
-
Casale, S.1
Russo, A.2
Serano, S.3
-
8
-
-
84910032186
-
SPEECON - Speech databases for consumer devices: Database specification and validation
-
Las Palmas, Spain
-
Dorota Iskra, Beate Grosskopf, Krzysztof Marasek, Henk van del Heuvel, Frank Diehl, and Andreas Kiessling. 2002. SPEECON - speech databases for consumer devices: database specification and validation. In Language Resources and Evaluation Conference (LREC), pages 329-333, Las Palmas, Spain.
-
(2002)
Language Resources and Evaluation Conference (LREC)
, pp. 329-333
-
-
Iskra, D.1
Grosskopf, B.2
Marasek, K.3
Van Del Heuvel, H.4
Diehl, F.5
Kiessling, A.6
-
10
-
-
48149087416
-
Real-time emotion detection system using speech: Multi-modal fusion of different timescale features
-
Crete
-
Samuel Kim, Panayiotis G. Georgiou, Sungbok Lee, and Shrikanth Narayanan. 2007. Real-time emotion detection system using speech: Multi-modal fusion of different timescale features. In IEEE Workshop on Multimedia Signal Processing, pages 48-51, Crete.
-
(2007)
IEEE Workshop on Multimedia Signal Processing
, pp. 48-51
-
-
Kim, S.1
Georgiou, P.G.2
Lee, S.3
Narayanan, S.4
-
11
-
-
0141478766
-
Pitch maxima for robust speaker recognition
-
Hong Kong
-
S. Krishnakumar, K.R. Prasanna Kumar, and N. Balakrishnan. 2003. Pitch maxima for robust speaker recognition. In ICASSP, Volume 2, pages 201-204, Hong Kong.
-
(2003)
ICASSP
, vol.2
, pp. 201-204
-
-
Krishnakumar, S.1
Prasanna Kumar, K.R.2
Balakrishnan, N.3
-
12
-
-
85046873967
-
The DET curve in assessment of detection task performance
-
Rhodes, Greece
-
Alvin F. Martin, George R. Doddington, Terri Kamm, Mark Ordowski, and Mark A. Przybocki. 1997. The DET curve in assessment of detection task performance. In Eurospeech, pages 1895-1898, Rhodes, Greece.
-
(1997)
Eurospeech
, pp. 1895-1898
-
-
Martin, A.F.1
Doddington, G.R.2
Kamm, T.3
Ordowski, M.4
Przybocki, M.A.5
-
13
-
-
0242721417
-
Speech emotion recognition using hidden Markov models
-
Tin Lay Nwe, Say Wei Foo, and Liyanage C. de Silva. 2003. Speech emotion recognition using hidden Markov models. Speech Communication, 41(4):603-623.
-
(2003)
Speech Communication
, vol.41
, Issue.4
, pp. 603-623
-
-
Nwe, T.L.1
Foo, S.W.2
De Silva, L.C.3
-
14
-
-
33646093001
-
Feature representation and discrimination based on Gaussian mixture model probability densities - Practices and algorithms
-
Pekka Paalanen, Joni-Kristian Kamarainen, Jarmo Ilonen, and Heikki Klviinen. 2006. Feature representation and discrimination based on gaussian mixture model probability densities - practices and algorithms. Pattern Recognition, 39(7):1346-1358.
-
(2006)
Pattern Recognition
, vol.39
, Issue.7
, pp. 1346-1358
-
-
Paalanen, P.1
Kamarainen, J.-K.2
Ilonen, J.3
Klviinen, H.4
-
15
-
-
23144440245
-
Global trend of fundamental frequency in emotional speech
-
Nara, Japan
-
A. Paeschke. 2004. Global trend of fundamental frequency in emotional speech. In Speech Prosody, pages 671-674, Nara, Japan.
-
(2004)
Speech Prosody
, pp. 671-674
-
-
Paeschke, A.1
-
16
-
-
1842476689
-
Efficient voice activity detection algorithms using long term speech information
-
Javier Ramirez, Jose C. Segura, Carmen Benitez, Angel de la Torre, and Antonio Rubio. 2004. Efficient voice activity detection algorithms using long term speech information. Speech Communication, 42:271-287.
-
(2004)
Speech Communication
, vol.42
, pp. 271-287
-
-
Ramirez, J.1
Segura, J.C.2
Benitez, C.3
De La Torre, A.4
Rubio, A.5
-
17
-
-
38049048651
-
Frame vs. Turn-level: Emotion recognition from speech considering static and dynamic processing
-
Bogdan Vlasenko, Bjrn Schuller, Andreas Wendemuth, and Gerhard Rigoll. 2007. Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing. Lecture Notes on Computer Science, 4738:139-147.
-
(2007)
Lecture Notes on Computer Science
, vol.4738
, pp. 139-147
-
-
Vlasenko, B.1
Schuller, B.2
Wendemuth, A.3
Rigoll, G.4
|