-
1
-
-
84902986478
-
Emotion identification for evaluation of synthesized emotional speech
-
S. Steidl, T. Polzehl, H. T. Bunnell, Y. Dou, P. K. Muthukumar, D. Perry, K. Prahallad, C. Vaughn, A. W. Black, and F. Metze, "Emotion identification for evaluation of synthesized emotional speech," in Proc. of Speech Prosody, 2012.
-
(2012)
Proc. of Speech Prosody
-
-
Steidl, S.1
Polzehl, T.2
Bunnell, H.T.3
Dou, Y.4
Muthukumar, P.K.5
Perry, D.6
Prahallad, K.7
Vaughn, C.8
Black, A.W.9
Metze, F.10
-
2
-
-
77957744515
-
HMM-based speech synthesis utilizing glottal inverse filtering
-
T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku, "HMM-based speech synthesis utilizing glottal inverse filtering," IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 1, pp. 153-165, 2011.
-
(2011)
IEEE Transactions on Audio, Speech and Language Processing
, vol.19
, Issue.1
, pp. 153-165
-
-
Raitio, T.1
Suni, A.2
Yamagishi, J.3
Pulakka, H.4
Nurminen, J.5
Vainio, M.6
Alku, P.7
-
3
-
-
0001810975
-
Line spectrum representation of linear predictor coefficients of speech signals
-
F. Itakura, "Line spectrum representation of linear predictor coefficients of speech signals," The Journal of the Acoustical Society of America, vol. 57, no. S1, pp. S35-S35, 1975.
-
(1975)
The Journal of the Acoustical Society of America
, vol.57
, Issue.S1
-
-
Itakura, F.1
-
4
-
-
0026881761
-
On the relation between voice source parameters and prosodic features in connected speech
-
H. Strik and L. Boves, "On the relation between voice source parameters and prosodic features in connected speech," Speech Communication, vol. 11, no. 2-3, pp. 167-174, 1992.
-
(1992)
Speech Communication
, vol.11
, Issue.2-3
, pp. 167-174
-
-
Strik, H.1
Boves, L.2
-
6
-
-
38049065378
-
Time- and amplitude-based voice source correlates of emotional portrayals
-
ser. Lecture Notes in Computer Science, A. Paiva, R. Prada, and R. Picard, Eds. Springer Berlin Heidelberg
-
I. Yanushevskaya, M. Tooher, C. Gobl, and A. Ni Chasaide, "Time- and amplitude-based voice source correlates of emotional portrayals," in Affective Computing and Intelligent Interaction, ser. Lecture Notes in Computer Science, A. Paiva, R. Prada, and R. Picard, Eds. Springer Berlin Heidelberg, 2007, vol. 4738, pp. 159-170.
-
(2007)
Affective Computing and Intelligent Interaction
, vol.4738
, pp. 159-170
-
-
Yanushevskaya, I.1
Tooher, M.2
Gobl, C.3
Chasaide, A.N.4
-
7
-
-
0027228739
-
Glottal source estimation: Methods of applying the LF-model to inverse filtering
-
1993
-
E. Riegelsberger and A. Krishnamurthy, "Glottal source estimation: Methods of applying the LF-model to inverse filtering," in ICASSP-93, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, 1993, pp. 542-545.
-
(1993)
ICASSP-93, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing
, vol.2
, pp. 542-545
-
-
Riegelsberger, E.1
Krishnamurthy, A.2
-
8
-
-
84908330581
-
Automatic parameterisation of the glottal waveform combining time and frequency domain measures
-
J. Kane and C. Gobl, "Automatic parameterisation of the glottal waveform combining time and frequency domain measures," in Proceedings of the 6th MAVEBA International Workshop, 2009.
-
(2009)
6th MAVEBA International Workshop
-
-
Kane, J.1
Gobl, C.2
-
9
-
-
79959831472
-
A spectral LF model based approach to voice source parameterisation
-
J. Kane, M. Kane, and C. Gobl, "A spectral LF model based approach to voice source parameterisation," in Interspeech 2010, 2010.
-
(2010)
Interspeech 2010
-
-
Kane, J.1
Kane, M.2
Gobl, C.3
-
10
-
-
80051650578
-
Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis
-
T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, "Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis," in ICASSP 2011, 2011.
-
(2011)
ICASSP 2011
-
-
Raitio, T.1
Suni, A.2
Pulakka, H.3
Vainio, M.4
Alku, P.5
-
11
-
-
82155160991
-
Towards an improved modeling of the glottal source in statistical parametric speech synthesis
-
Bonn, Germany
-
J. Cabral, S. Renals, K. Richmond, and J. Yamagishi, "Towards an improved modeling of the glottal source in statistical parametric speech synthesis," in Proc. of the 6th ISCA Workshop on Speech Synthesis, Bonn, Germany, 2007.
-
(2007)
6th ISCA Workshop on Speech Synthesis
-
-
Cabral, J.1
Renals, S.2
Richmond, K.3
Yamagishi, J.4
-
12
-
-
84867224654
-
Glottal spectral separation for parametric speech synthesis
-
Brisbane, Australia, Sep
-
J. Cabral, S. Renals, K. Richmond, and J. Yamagishi, "Glottal spectral separation for parametric speech synthesis," in Proc. Interspeech, Brisbane, Australia, Sep. 2008, pp. 1829-1832.
-
(2008)
Proc. Interspeech
, pp. 1829-1832
-
-
Cabral, J.1
Renals, S.2
Richmond, K.3
Yamagishi, J.4
-
13
-
-
33646813326
-
A novel source analysis method by matching spectral characters of LF model with STRAIGHT spectrum
-
J. Tao, T. Tan, and R.W. Picard, Eds. Springer-Verlag
-
Z.-H. Ling, Y. Hu, and R.-H. Wang, "A novel source analysis method by matching spectral characters of LF model with STRAIGHT spectrum," in ACII'05 Proceedings of the First International Conference on Affective Computing and Intelligent Interaction, J. Tao, T. Tan, and R.W. Picard, Eds. Springer-Verlag, 2005, pp. 441-448.
-
(2005)
ACII'05 Proceedings of the First International Conference on Affective Computing and Intelligent Interaction
, pp. 441-448
-
-
Ling, Z.-H.1
Hu, Y.2
Wang, R.-H.3
-
14
-
-
84902669930
-
Transformation of LF parameters for speech synthesis of emotion: Regression trees
-
M. Tooher, I. Yanushevskaya, and C. Gobl, "Transformation of LF parameters for speech synthesis of emotion: Regression trees," in Speech Prosody 2008, 2008, pp. 705-708.
-
(2008)
Speech Prosody 2008
, pp. 705-708
-
-
Tooher, M.1
Yanushevskaya, I.2
Gobl, C.3
-
15
-
-
77957744515
-
HMM-based speech synthesis utilizing glottal inverse filtering
-
Jan
-
T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku, "HMM-based speech synthesis utilizing glottal inverse filtering," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 1, pp. 153-165, Jan. 2011.
-
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.19
, Issue.1
, pp. 153-165
-
-
Raitio, T.1
Suni, A.2
Yamagishi, J.3
Pulakka, H.4
Nurminen, J.5
Vainio, M.6
Alku, P.7
-
16
-
-
84856248602
-
The deterministic plus stochastic model of the residual signal and its applications
-
T. Drugman and T. Dutoit, "The deterministic plus stochastic model of the residual signal and its applications," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 3, pp. 968-981, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.3
, pp. 968-981
-
-
Drugman, T.1
Dutoit, T.2
-
17
-
-
34547541173
-
A new method for speech synthesis and transformation based on an ARX-lf source-filter decomposition and HNM modeling
-
IEEE
-
D. Vincent, O. Rosec, and T. Chonavel, "A new method for speech synthesis and transformation based on an ARX-LF source-filter decomposition and HNM modeling," in Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, vol. 4. IEEE, 2007, pp. IV-525.
-
(2007)
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
, vol.4
, pp. IV-525
-
-
Vincent, D.1
Rosec, O.2
Chonavel, T.3
-
19
-
-
0003447548
-
-
Ph.D. dissertation, Ecole Nationale Superieure des Telecommunications
-
I. Stylianou, "Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification," Ph.D. dissertation, Ecole Nationale Superieure des Telecommunications, 1996.
-
(1996)
Harmonic Plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification
-
-
Stylianou, I.1
-
20
-
-
33947684811
-
A four-parameter model of glottal flow
-
G. Fant, J. Liljencrants, and Q. Lin, "A four-parameter model of glottal flow," STL-QPSR, vol. 26, no. 4, pp. 1-13, 1985.
-
(1985)
STL-QPSR
, vol.26
, Issue.4
, pp. 1-13
-
-
Fant, G.1
Liljencrants, J.2
Lin, Q.3
-
21
-
-
84906227057
-
Glottal wave analysis with pitch synchronous iterative adaptive filtering
-
P. Alku, "Glottal wave analysis with pitch synchronous iterative adaptive filtering," Speech Communication, vol. 19, pp. 459-476.
-
Speech Communication
, vol.19
, pp. 459-476
-
-
Alku, P.1
-
22
-
-
44949232373
-
Clustergen: A statistical parametric synthesizer using trajectory modeling
-
A. Black, "Clustergen: A statistical parametric synthesizer using trajectory modeling," in Proceedings of INTERSPEECH, 2006, pp. 1762-1765.
-
(2006)
Proceedings of INTERSPEECH
, pp. 1762-1765
-
-
Black, A.1
-
24
-
-
84890536802
-
TestVox: Web-based framework for subjective evaluation of speech synthesis
-
A. Parlikar. (2012) TestVox: Web-based Framework for Subjective Evaluation of Speech Synthesis. Open Source Software.
-
(2012)
Open Source Software
-
-
Parlikar, A.1
-
25
-
-
85009097254
-
Mixed excitation for HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for HMM-based speech synthesis," in Proc. Eurospeech, vol. 1, 2001.
-
(2001)
Proc. Eurospeech
, vol.1
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
26
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.3
-
28
-
-
84928842013
-
Frequency domain interpretation and derivation of glottal flow parameters
-
G. Fant and Q. Lin, "Frequency domain interpretation and derivation of glottal flow parameters," STL-QPSR, vol. 29, no. 2-3, pp. 1-21, 1988.
-
(1988)
STL-QPSR
, vol.29
, Issue.2-3
, pp. 1-21
-
-
Fant, G.1
Lin, Q.2
-
29
-
-
84966348891
-
An HMM-based speech synthesis system applied to English
-
IEEE
-
K. Tokuda, H. Zen, and A. Black, "An HMM-based speech synthesis system applied to English," in Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on. IEEE, 2002, pp. 227-230.
-
(2002)
Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
, pp. 227-230
-
-
Tokuda, K.1
Zen, H.2
Black, A.3
-
30
-
-
33947674781
-
Sub-phonetic modeling for capturing pronunciation variations for conversational speech synthesis
-
IEEE
-
K. Prahallad, A.W. Black, and R. Mosur, "Sub-phonetic modeling for capturing pronunciation variations for conversational speech synthesis," in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, vol. 1. IEEE, 2006, pp. 1.
-
(2006)
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
, vol.1
, pp. 1
-
-
Prahallad, K.1
Black, A.W.2
Mosur, R.3
-
31
-
-
70349208664
-
Optimizing segment label boundaries for statistical speech synthesis
-
IEEE
-
A. W. Black and J. Kominek, "Optimizing segment label boundaries for statistical speech synthesis," in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. IEEE, 2009, pp. 3785-3788.
-
(2009)
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
, pp. 3785-3788
-
-
Black, A.W.1
Kominek, J.2
-
33
-
-
84867602871
-
Articulatory features for expressive speech synthesis
-
IEEE
-
A. W. Black, H. T. Bunnell, Y. Dou, P. Kumar Muthukumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaughn, "Articulatory features for expressive speech synthesis," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. IEEE, 2012, pp. 4005-4008.
-
(2012)
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
, pp. 4005-4008
-
-
Black, A.W.1
Bunnell, H.T.2
Dou, Y.3
Muthukumar, P.K.4
Metze, F.5
Perry, D.6
Polzehl, T.7
Prahallad, K.8
Steidl, S.9
Vaughn, C.10