-
1
-
-
0038370976
-
Facial and vocal expressions of emotion
-
J. A. Russell, J. A. Bachorowski and J. M. Fernández-Dols, (2003). Facial and vocal expressions of emotion. Annual review of psychology, 54(1), 329-349.
-
(2003)
Annual Review of Psychology
, vol.54
, Issue.1
, pp. 329-349
-
-
Russell, J.A.1
Bachorowski, J.A.2
Fernández-Dols, J.M.3
-
4
-
-
0037384712
-
Vocal communication of emotion: A review of research paradigms
-
K. R. Scherer, (2003). Vocal communication of emotion: A review of research paradigms. Speech communication, 40(1), 227-256.
-
(2003)
Speech Communication
, vol.40
, Issue.1
, pp. 227-256
-
-
Scherer, K.R.1
-
6
-
-
34047263010
-
Prosody conversion from neutral speech to emotional speech
-
J. Tao, Y. Kang and A. Li, (2006). Prosody conversion from neutral speech to emotional speech. IEEE Transactions on Audio, Speech, and Language Processing, 14(4), 1145-1154.
-
(2006)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.14
, Issue.4
, pp. 1145-1154
-
-
Tao, J.1
Kang, Y.2
Li, A.3
-
7
-
-
84938935270
-
A system for transform- ing the emotion in speech: Combining data-driven conversion tech- niques for prosody and voice quality
-
August
-
Z. Inanoglu and S. Young, (2007, August). A system for transform- ing the emotion in speech: combining data-driven conversion tech- niques for prosody and voice quality. In INTERSPEECH (pp. 490- 493).
-
(2007)
INTERSPEECH
, pp. 490-493
-
-
Inanoglu, Z.1
Young, S.2
-
8
-
-
84890451203
-
GMM- based emotional voice conversion using spectrum and prosody fea- tures
-
R. Aihara, R. Takashima, T. Takiguchi and Y. Ariki, (2012). GMM- based emotional voice conversion using spectrum and prosody fea- tures. In American Journal of Signal Processing, 2(5), 134-138.
-
(2012)
American Journal of Signal Processing
, vol.2
, Issue.5
, pp. 134-138
-
-
Aihara, R.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
9
-
-
84949924136
-
Exemplar-based emotional voice conversion using non-negative matrix factorization
-
December IEEE
-
R. Aihara, R. Ueda, T. Takiguchi and Y. Ariki, (2014, December). Exemplar-based emotional voice conversion using non-negative matrix factorization. In Asia-Pacific Signal and Information Pro- cessing Association, 2014 Annual Summit and Conference (APSI- PA) (pp. 1-7). IEEE.
-
(2014)
Asia-Pacific Signal and Information Pro- Cessing Association, 2014 Annual Summit and Conference (APSI- PA)
, pp. 1-7
-
-
Aihara, R.1
Ueda, R.2
Takiguchi, T.3
Ariki, Y.4
-
10
-
-
84964010208
-
Fundamental frequency modeling using wavelets for emo- tional voice conversion
-
H. Ming, D. Huang, L. Xie, S. Zhang, M. Dong and H. Li, (2015). Fundamental frequency modeling using wavelets for emo- tional voice conversion. In 6th Affective Computing and Intelligent Interaction (ACII)Workshop on Affective Social Multimedia Com- puting.
-
(2015)
6th Affective Computing and Intelligent Interaction (ACII)Workshop on Affective Social Multimedia Com- Puting
-
-
Ming, H.1
Huang, D.2
Xie, L.3
Zhang, S.4
Dong, M.5
Li, H.6
-
11
-
-
84876502441
-
Review of F0 modelling and generation in HMM based speech synthesis
-
October
-
K. Yu, (2012, October). Review of F0 modelling and generation in HMM based speech synthesis. In IEEE 11th International Con- ference on Signal Processing (ICSP), (Vol. 1, pp. 599-604).
-
(2012)
IEEE 11th International Con- Ference on Signal Processing (ICSP)
, vol.1
, pp. 599-604
-
-
Yu, K.1
-
12
-
-
77955722263
-
Hier- archical prosody conversion using regression-based clustering for emotional speech synthesis
-
C. H. Wu, C. C. Hsia, C. H. Lee and M. C. Lin, (2010). Hier- archical prosody conversion using regression-based clustering for emotional speech synthesis. IEEE Transactions on Audio, Speech, and Language Processing, 18(6), 1394-1405.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, Issue.6
, pp. 1394-1405
-
-
Wu, C.H.1
Hsia, C.C.2
Lee, C.H.3
Lin, M.C.4
-
14
-
-
84864409462
-
Speech prosody: A methodological review
-
Y. Xu, (2011). Speech prosody: A methodological review. Journal of Speech Sciences, 1(1), 85-115.
-
(2011)
Journal of Speech Sciences
, vol.1
, Issue.1
, pp. 85-115
-
-
Xu, Y.1
-
17
-
-
84867194192
-
Multilevel parametric-base F0 model for speech synthesis
-
September
-
J. Latorre and M. Akamine, (2008, September). Multilevel parametric-base F0 model for speech synthesis. In INTERSPEECH (pp. 2274-2277).
-
(2008)
INTERSPEECH
, pp. 2274-2277
-
-
Latorre, J.1
Akamine, M.2
-
18
-
-
85008039410
-
Improved prosody generation by maximizing joint probability of state and longer units
-
Y. Qian, Z. Wu, B. Gao and F. K. Soong, F. K. (2011). Improved prosody generation by maximizing joint probability of state and longer units. IEEE Transactions on Audio, Speech, and Language Processing, 19(6), 1702-1710.
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, Issue.6
, pp. 1702-1710
-
-
Qian, Y.1
Wu, Z.2
Gao, B.3
Soong, F.K.4
-
19
-
-
84865714286
-
Stylization and trajectory modelling of short and long term speech prosody variations
-
August
-
N. Obin, A. Lacheret and X. Rodet, (2011, August). Stylization and trajectory modelling of short and long term speech prosody variations. In INTERSPEECH.
-
(2011)
INTERSPEECH
-
-
Obin, N.1
Lacheret, A.2
Rodet, X.3
-
21
-
-
84946045633
-
Wavelets for intonation modeling in HMM speech synthesis
-
A. S. Suni, D. Aalto, T. Raitio, P. Alku and M. Vainio, (2013). Wavelets for intonation modeling in HMM speech synthesis. In 8th ISCA Workshop on Speech Synthesis, Proceedings, Barcelona.
-
(2013)
8th ISCA Workshop on Speech Synthesis, Proceedings, Barcelona
-
-
Suni, A.S.1
Aalto, D.2
Raitio, T.3
Alku, P.4
Vainio, M.5
-
22
-
-
84973338474
-
Exemplar-based sparse representation of timbre and prosody for voice conversion
-
H. Ming, D. Huang, L. Xie, S. Zhang, M. Dong and H. Li, (2016). Exemplar-based Sparse Representation of Timbre and Prosody for Voice Conversion. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
-
(2016)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
-
Ming, H.1
Huang, D.2
Xie, L.3
Zhang, S.4
Dong, M.5
Li, H.6
-
23
-
-
0035505385
-
LSTM recurrent network- s learn simple context-free and context-sensitive languages
-
F. A. Gers, and J. Schmidhuber, (2001). LSTM recurrent network- s learn simple context-free and context-sensitive languages. IEEE Transactions on Neural Networks, 12(6), 1333-1340.
-
(2001)
IEEE Transactions on Neural Networks
, vol.12
, Issue.6
, pp. 1333-1340
-
-
Gers, F.A.1
Schmidhuber, J.2
-
24
-
-
84910046405
-
Long short-term memory recurrent neural network architectures for large vocabulary speech recognition
-
September
-
H. Sak, A. W. Senior, and F. Beaufays, (2014, September). Long short-term memory recurrent neural network architectures for large vocabulary speech recognition. In INTERSPEECH (pp. 338-342).
-
(2014)
INTERSPEECH
, pp. 338-342
-
-
Sak, H.1
Senior, A.W.2
Beaufays, F.3
-
25
-
-
84910047819
-
TTS synthesis with bidirectional LSTM based recurrent neural net- works
-
September
-
Y. Fan, Y. Qian, F. L., Xie and F. K. Soong, (2014, September). TTS synthesis with bidirectional LSTM based recurrent neural net- works. In INTERSPEECH (pp. 1964-1968).
-
(2014)
INTERSPEECH
, pp. 1964-1968
-
-
Fan, Y.1
Qian, Y.2
Xie, F.L.3
Soong, F.K.4
-
26
-
-
84946027999
-
Voice con- version using deep bidirectional long short-term memory based recurrent neural networks
-
April
-
L. Sun, S. Kang, K. Li and H. Meng, (2015, April). Voice con- version using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), (pp. 4869- 4873).
-
(2015)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 4869-4873
-
-
Sun, L.1
Kang, S.2
Li, K.3
Meng, H.4
-
27
-
-
51449108867
-
TANDEM-STRAIGHT: A temporally stable power spectral representation for periodic signals and appli- cations to interference-free spectrum, F0, and aperiodicity estima- tion
-
March
-
H. Kawahara, M. Morise, T. Takahashi, R. Nisimura, T. Irino and H. Banno, (2008, March). TANDEM-STRAIGHT: A temporally stable power spectral representation for periodic signals and appli- cations to interference-free spectrum, F0, and aperiodicity estima- tion. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), (pp. 3933-3936).
-
(2008)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 3933-3936
-
-
Kawahara, H.1
Morise, M.2
Takahashi, T.3
Nisimura, R.4
Irino, T.5
Banno, H.6
-
29
-
-
84949924808
-
Emotional facial expression transfer based on tempo- ral restricted Boltzmann machines
-
December
-
S. Liu, D. Y. Huang,W. Lin, M. Dong, H. Li and E. P. Ong, (2014, December). Emotional facial expression transfer based on tempo- ral restricted Boltzmann machines. In Asia-Pacific Signal and In- formation Processing Association (APSIPA).
-
(2014)
Asia-Pacific Signal and In- Formation Processing Association (APSIPA)
-
-
Liu, S.1
Huang, W.2
Lin, D.Y.3
Dong, M.4
Li, H.5
Ong, E.P.6
|