-
1
-
-
78649328053
-
Survey on speech emotion recognition: Features, classification schemes, and databases
-
M. El Ayadi, M. S. Kamel, and F. Karray, "Survey on speech emotion recognition: Features, classification schemes, and databases," Pattern Recognition, vol. 44, no. 3, pp. 572-587, 2011.
-
(2011)
Pattern Recognition
, vol.44
, Issue.3
, pp. 572-587
-
-
El Ayadi, M.1
Kamel, M.S.2
Karray, F.3
-
2
-
-
0242721417
-
Speech emotion recognition using hidden Markov models
-
T. L. Nwe, S. W. Foo, and L. C. De Silva, "Speech emotion recognition using hidden Markov models," Speech Communication, vol. 41, no. 4, pp. 603-623, 2003.
-
(2003)
Speech Communication
, vol.41
, Issue.4
, pp. 603-623
-
-
Nwe, T.L.1
Foo, S.W.2
De Silva, L.C.3
-
4
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
5
-
-
84055222005
-
Context-dependent pretrained deep neural networks for large-vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large-vocabulary speech recognition," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
6
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
IEEE
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. IEEE, 2011, pp. 24-29.
-
(2011)
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
7
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.-W.3
-
8
-
-
80051631315
-
Deep neural networks for acoustic emotion recognition: Raising the benchmarks
-
IEEE
-
A. Stuhlsatz, C. Meyer, F. Eyben, T. Zielke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: raising the benchmarks," in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. IEEE, 2011, pp. 5688-5691.
-
(2011)
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
, pp. 5688-5691
-
-
Stuhlsatz, A.1
Meyer, C.2
Eyben, F.3
Zielke, T.4
Meier, G.5
Schuller, B.6
-
9
-
-
84922798491
-
The eNTERFACE'05 audio-visual emotion database
-
IEEE
-
O. Martin, I. Kotsia, B. Macq, and I. Pitas, "The eNTERFACE'05 audio-visual emotion database," in Data Engineering Workshops, 2006. Proceedings. 22nd International Conference on. IEEE, 2006, pp. 8-8.
-
(2006)
Data Engineering Workshops, 2006. Proceedings. 22nd International Conference On.
, pp. 8-8
-
-
Martin, O.1
Kotsia, I.2
Macq, B.3
Pitas, I.4
-
10
-
-
34247610490
-
A database of German emotional speech
-
F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, and B. Weiss, "A database of German emotional speech," in Proc. Interspeech, vol. 2005, 2005.
-
(2005)
Proc. Interspeech
, vol.2005
-
-
Burkhardt, F.1
Paeschke, A.2
Rolfes, M.3
Sendlmeier, W.4
Weiss, B.5
-
11
-
-
84864026688
-
Modeling human motion using binary latent variables
-
MIT Press
-
G. W. Taylor, G. E. Hinton, and S. T. Roweis, "Modeling human motion using binary latent variables," in Advances in Neural Information Processing Systems, vol. 19. MIT Press, 2007, p. 1345.
-
(2007)
Advances in Neural Information Processing Systems
, vol.19
, pp. 1345
-
-
Taylor, G.W.1
Hinton, G.E.2
Roweis, S.T.3
-
12
-
-
0034346176
-
Emotion recognition in speech using neural networks
-
J. Nicholson, K. Takahashi, and R. Nakatsu, "Emotion recognition in speech using neural networks," Neural Computing & Applications, vol. 9, no. 4, pp. 290-296, 2000.
-
(2000)
Neural Computing & Applications
, vol.9
, Issue.4
, pp. 290-296
-
-
Nicholson, J.1
Takahashi, K.2
Nakatsu, R.3
-
13
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
14
-
-
33947620115
-
Hierarchical structures of neural networks for phoneme recognition
-
IEEE
-
P. Schwarz, P. Matejka, and J. Cernocky, "Hierarchical structures of neural networks for phoneme recognition," in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, vol. 1. IEEE, 2006, pp. I-I.
-
(2006)
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
, vol.1
-
-
Schwarz, P.1
Matejka, P.2
Cernocky, J.3
-
15
-
-
79955989999
-
The HTK book
-
S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw, X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey et al., "The HTK book," Cambridge University Engineering Department, vol. 3, 2002.
-
(2002)
Cambridge University Engineering Department
, vol.3
-
-
Young, S.1
Evermann, G.2
Gales, M.3
Hain, T.4
Kershaw, D.5
Liu, X.6
Moore, G.7
Odell, J.8
Ollason, D.9
Povey, D.10