SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 397-401

Optimizations and fitting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis

(3) Muthukumar, Prasanna Kumar a Black, Alan W a Bunnell, H Timothy b

a Carnegie Mellon University (United States)

b NEMOURS CHILDREN S CLINIC (United States)

Author keywords

Liljencrants Fant model; Speech synthesis; Statistical parametric synthesis

Indexed keywords

ALGORITHMS; ITERATIVE METHODS; OPTIMIZATION;

EXCITATION MODELING; EXCITATION PARAMETERS; FITTING PROCEDURE; GRADIENT DESCENT OPTIMIZATION; OBJECTIVE METRICS; PARAMETRIC SYNTHESIS; SPEECH SYNTHESIZER; STATISTICAL PARAMETRIC SPEECH SYNTHESIS;

SPEECH SYNTHESIS;

EID: 84906279165 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (8)

References (33)

1
- 84902986478
- Emotion identification for evaluation of synthesized emotional speech
- S. Steidl, T. Polzehl, H. T. Bunnell, Y. Dou, P. K. Muthukumar, D. Perry, K. Prahallad, C. Vaughn, A. W. Black, and F. Metze, "Emotion identification for evaluation of synthesized emotional speech, " in Proc. of speech prosody, 2012.
- (2012) Proc. of Speech Prosody
- Steidl, S.¹ Polzehl, T.² Bunnell, H.T.³ Dou, Y.⁴ Muthukumar, P.K.⁵ Perry, D.⁶ Prahallad, K.⁷ Vaughn, C.⁸ Black, A.W.⁹ Metze, F.¹⁰

2
- 77957744515
- Hmm-based speech synthesis utilizing glottal inverse filtering
- T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku, "Hmm-based speech synthesis utilizing glottal inverse filtering, " IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 153-165, pp. 459-476, 2011.
- (2011) IEEE Transactions on Audio, Speech and Language Processing , vol.19 , Issue.153-165 , pp. 459-476
- Raitio, T.¹ Suni, A.² Yamagishi, J.³ Pulakka, H.⁴ Nurminen, J.⁵ Vainio, M.⁶ Alku, P.⁷

3
- 0001810975
- Line spectrum representation of linear predictor coefficients of speech signals
- F. Itakura, "Line spectrum representation of linear predictor coefficients of speech signals, " The Journal of the Acoustical Society of America, vol. 57, no. S1, pp. S35-S35, 1975.
- (1975) The Journal of the Acoustical Society of America , vol.57 , Issue.S1
- Itakura, F.¹

4
- 0026881761
- On the relation between voice source parameters and prosodic features in connected speech
- H. Strik and L. Boves, "On the relation between voice source parameters and prosodic features in connected speech, " Speech Communication, vol. 11, no. 23, pp. 167 - 174, 1992.
- (1992) Speech Communication , vol.11 , Issue.23 , pp. 167-174
- Strik, H.¹ Boves, L.²

5
- 33745184089
- Amplitude-based source parameters for measuring voice quality
- C. Gobl and A. N. Chasaide, "Amplitude-based source parameters for measuring voice quality, " in ISCA Tutorial and Research Workshop on Voice Quality: Functions, Analysis and Synthesis, 2003.
- (2003) ISCA Tutorial and Research Workshop on Voice Quality: Functions, Analysis and Synthesis
- Gobl, C.¹ Chasaide, A.N.²

6
- 38049065378
- Time- And amplitude-based voice source correlates of emotional portrayals
- ser. Lecture Notes in Computer Science, A. Paiva, R. Prada, and R. Picard, Eds. Springer Berlin Heidelberg
- I. Yanushevskaya, M. Tooher, C. Gobl, and A. Ni Chasaide, "Time- And amplitude-based voice source correlates of emotional portrayals, " in Affective Computing and Intelligent Interaction, ser. Lecture Notes in Computer Science, A. Paiva, R. Prada, and R. Picard, Eds. Springer Berlin Heidelberg, 2007, vol. 4738, pp. 159-170.
- (2007) Affective Computing and Intelligent Interaction , vol.4738 , pp. 159-170
- Yanushevskaya, I.¹ Tooher, M.² Gobl, C.³ Chasaide, A.N.⁴

7
- 0027228739
- Glottal source estimation: Methods of applying the lf-model to inverse filtering
- 1993
- E. Riegelsberger and A. Krishnamurthy, "Glottal source estimation: Methods of applying the lf-model to inverse filtering, " in ICASSP-93., 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1993., vol. 2, 1993, pp. 542-545.
- (1993) ICASSP-93, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.2 , pp. 542-545
- Riegelsberger, E.¹ Krishnamurthy, A.²

8
- 84908330581
- Automatic parameterisation of the glottal waveform combining time and frequency domain measures
- J. Kane and C. Gobl, "Automatic parameterisation of the glottal waveform combining time and frequency domain measures, " Proceedings of 6th Maveba International Workshop, 2009.
- (2009) th Maveba International Workshop
- Kane, J.¹ Gobl, C.²

9
- 79959831472
- A spectral lf model based approach to voice source paramet- erisation
- J. Kane, M. Kane, and C. Gobl, "A spectral lf model based approach to voice source paramet- erisation, " Interspeech 2010, 2010.
- (2010) Interspeech 2010
- Kane, J.¹ Kane, M.² Gobl, C.³

10
- 80051650578
- Utilizing glottal source pulse library for generating improved excitation signal for hmm-based speech synthesis
- T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, "Utilizing glottal source pulse library for generating improved excitation signal for hmm-based speech synthesis, " in ICASSP 2011, 2011.
- (2011) ICASSP 2011
- Raitio, T.¹ Suni, A.² Pulakka, H.³ Vainio, M.⁴ Alku, P.⁵

11
- 82155160991
- Towards an improved modeling of the glottal source in statistical parametric speech synthesis
- Bonn, Germany
- J. Cabral, S. Renals, K. Richmond, and J. Yamagishi, "Towards an improved modeling of the glottal source in statistical parametric speech synthesis, " in Proc.of the 6th ISCA Workshop on Speech Synthesis, Bonn, Germany, 2007.
- (2007) th ISCA Workshop on Speech Synthesis
- Cabral, J.¹ Renals, S.² Richmond, K.³ Yamagishi, J.⁴

12
- 84867224654
- Glottal spectral separation for parametric speech synthesis
- Brisbane, Australia, Sep
- J. Cabral, S. Renals, K. Richmond, and J. Yamagishi, "Glottal spectral separation for parametric speech synthesis, " in Proc. Interspeech, Brisbane, Australia, Sep. 2008, pp. 1829-1832.
- (2008) Proc. Interspeech , pp. 1829-1832
- Cabral, J.¹ Renals, S.² Richmond, K.³ Yamagishi, J.⁴

13
- 33646813326
- A novel source analysis method by matching spectral characters of lf model with straight spectrum
- J. Tao, T. Tan, and R.W. Picard, Eds. Spring-Verlag
- Z.-H. Ling, Y. Hu, and R.-H. Wang, "A novel source analysis method by matching spectral characters of lf model with straight spectrum, " in ACII'05 Proceedings of the First international conference on Affective Computing and Intelligent Interaction, J. Tao, T. Tan, and R.W. Picard, Eds. Spring-Verlag, 2005, pp. 441-448.
- (2005) ACII'05 Proceedings of the First International Conference on Affective Computing and Intelligent Interaction , pp. 441-448
- Ling, Z.-H.¹ Hu, Y.² Wang, R.-H.³

14
- 84902669930
- Transformation of lf parameters for speech synthesis of emotion: Regression trees
- M. Tooher, I. Yanushevskaya, and C. Gobl, "Transformation of lf parameters for speech synthesis of emotion: Regression trees, " in Speech Prosody 2008, 2008, pp. 705-708.
- (2008) Speech Prosody 2008 , pp. 705-708
- Tooher, M.¹ Yanushevskaya, I.² Gobl, C.³

15
- 77957744515
- Hmm-based speech synthesis utilizing glottal inverse filtering
- Jan
- T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku, "Hmm-based speech synthesis utilizing glottal inverse filtering, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 1, pp. 153-165, Jan.
- Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.1 , pp. 153-165
- Raitio, T.¹ Suni, A.² Yamagishi, J.³ Pulakka, H.⁴ Nurminen, J.⁵ Vainio, M.⁶ Alku, P.⁷

16
- 84856248602
- The deterministic plus stochastic model of the residual signal and its applications
- T. Drugman and T. Dutoit, "The deterministic plus stochastic model of the residual signal and its applications, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 3, pp. 968-981, 2012.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.3 , pp. 968-981
- Drugman, T.¹ Dutoit, T.²

17
- 34547541173
- A new method for speech synthesis and transformation based on an ARX-lf source-filter decomposition and HNM modeling
- IEEE
- D. Vincent, O. Rosec, and T. Chonavel, "A new method for speech synthesis and transformation based on an arx-lf source-filter decomposition and hnm modeling, " in Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, vol. 4. IEEE, 2007, pp. 4-525.
- (2007) Acoustics, Speech and Signal Processing, 2007 ICASSP 2007 IEEE International Conference on , vol.4 , pp. 4-525
- Vincent, D.¹ Rosec, O.² Chonavel, T.³

18
- 33745214458
- Estimation of lf glottal source parameters based on an ARX model
- D. Vincent, O. Rosec, and T. Chonavel "Estimation of lf glottal source parameters based on an arx model, " in Ninth European Conference on Speech Communication and Technology, 2005.
- (2005) Ninth European Conference on Speech Communication and Technology
- Vincent, D.¹ Rosec, O.² Chonavel, T.³

19
- 0003447548
- Ph.D. dissertation, Ecole Nationale Superieure des Telecommunications
- I. Stylianou, "Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification, " Ph.D. dissertation, Ecole Nationale Superieure des Telecommunications, 1996.
- (1996) Harmonic Plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification
- Stylianou, I.¹

20
- 33947684811
- A four-parameter model of glottal flow
- G. Fant, J. Liljencrants, and Q. Lin, "A four-parameter model of glottal flow, " STL-QPSR, vol. 4, no. 1985, pp. 1-13, 1985.
- (1985) STL-QPSR , vol.4 , Issue.1985 , pp. 1-13
- Fant, G.¹ Liljencrants, J.² Lin, Q.³

21
- 84906227057
- Glottal wave analysis with pitch synchronous iterative adaptive filtering
- P. Alku, "Glottal wave analysis with pitch synchronous iterative adaptive filtering, " Speech Communication, vol. 19, pp. 459-476.
- Speech Communication , vol.19 , pp. 459-476
- Alku, P.¹

22
- 44949232373
- Cluster Gen: A statistical parametric synthesizer using trajectory modeling
- A. Black, "Cluster Gen: A statistical parametric synthesizer using trajectory modeling, " in Proceedings of INTERSPEECH, 2006, pp. 1762-1765.
- (2006) Proceedings of INTERSPEECH , pp. 1762-1765
- Black, A.¹

23
- 85090475413
- The CMU arctic speech databases
- J. Kominek and A. W. Black, "The cmu arctic speech databases, " in Fifth ISCA Workshop on Speech Synthesis, 2004.
- (2004) Fifth ISCA Workshop on Speech Synthesis
- Kominek, J.¹ Black, A.W.²

24
- 84890536802
- Test Vox: Web-based framework for subjective evaluation of speech synthesis
- A. Parlikar. (2012) Test Vox: Web-based Framework for Subjective Evaluation of Speech Synthesis. Open Source Software.
- (2012) Open Source Software
- Parlikar, A.¹

25
- 85009097254
- Mixed excitation for HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for hmm-based speech synthesis, " in Proc. Eurospeech, vol. 1, 2001.
- (2001) Proc. Eurospeech , vol.1
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

26
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039- 1064, 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.³

27
- 0003802343
- Chapman & Hall/CRC
- L. Breiman, J. Friedman, C. Stone, and R. Olshen, Classification and regression trees. Chapman & Hall/CRC, 1984.
- (1984) Classification and Regression Trees
- Breiman, L.¹ Friedman, J.² Stone, C.³ Olshen, R.⁴

28
- 84928842013
- Frequency domain interpretation and derivation of glottal flow parameters
- G. Fant and Q. Lin, "Frequency domain interpretation and derivation of glottal flow parameters, " STL-QPSR, vol. 29, no. 2-3, pp. 1-21, 1988.
- (1988) STL-QPSR , vol.29 , Issue.2-3 , pp. 1-21
- Fant, G.¹ Lin, Q.²

29
- 84966348891
- An hmm-based speech synthesis system applied to
- IEEE
- K. Tokuda, H. Zen, and A. Black, "An hmm-based speech synthesis system applied to english, " in Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on. IEEE, 2002, pp. 227-230.
- (2002) English, in Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on , pp. 227-230
- Tokuda, K.¹ Zen, H.² Black, A.³

30
- 33947674781
- Sub-phonetic modeling for capturing pronunciation variations for conversational speech synthesis
- IEEE
- K. Prahallad, A.W. Black, and R. Mosur, "Sub-phonetic modeling for capturing pronunciation variations for conversational speech synthesis, " in Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, vol. 1. IEEE, 2006, pp. 1.
- (2006) Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on , vol.1 , pp. 1
- Prahallad, K.¹ Black, A.W.² Mosur, R.³

31
- 70349208664
- Optimizing segment label boundaries for statistical speech synthesis
- IEEE
- A. W. Black and J. Kominek, "Optimizing segment label boundaries for statistical speech synthesis, " in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. IEEE, 2009, pp. 3785-3788.
- (2009) Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on , pp. 3785-3788
- Black, A.W.¹ Kominek, J.²

32
- 84865748446
- A statistical phrase/accent model for intonation modeling
- G. K. Anumanchipalli, L. C. Oliveira, and A. W. Black, "A statistical phrase/accent model for intonation modeling, " in Twelfth Annual Conference of the International Speech Communication Association, 2011.
- (2011) Twelfth Annual Conference of the International Speech Communication Association
- Anumanchipalli, G.K.¹ Oliveira, L.C.² Black, A.W.³

33
- 84867602871
- Articulatory features for expressive speech synthesis
- IEEE
- A. W. Black, H. T. Bunnell, Y. Dou, P. Kumar Muthukumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. Vaughn, "Articulatory features for expressive speech synthesis, " in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. IEEE, 2012, pp. 4005-4008.
- (2012) Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference On. , pp. 4005-4008
- Black, A.W.¹ Bunnell, H.T.² Dou, Y.³ Muthukumar, P.K.⁴ Metze, F.⁵ Perry, D.⁶ Polzehl, T.⁷ Prahallad, K.⁸ Steidl, S.⁹ Vaughn, C.¹⁰

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.