SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 290-294

A postfilter to modify the modulation spectrum in HMM-based speech synthesis

(5) Takamichi, Shinnosuke a Toda, Tomoki a Neubig, Graham a Sakti, Sakriani a Nakamura, Satoshi a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

global variance; HMM based speech synthesis; modulation spectrum; over smoothing; postfilter

Indexed keywords

MODULATION; SPEECH SYNTHESIS;

GLOBAL VARIANCE; HMM-BASED SPEECH SYNTHESIS; MODULATION SPECTRUM; OVER-SMOOTHING; POSTFILTERS;

PLASMA DIAGNOSTICS;

EID: 84905234422 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6853604 Document Type: Conference Paper

Times cited : (66)

References (19)

1
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. Black. Statistical parametric speech synthesis. Speech Commun., Vol. 51, No. 11, pp. 1039-1064, 2009.
- (2009) Speech Commun. , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.³

2
- 0034230270
- Speaker interpolation for HMM-based speech synthesis system
- T. Yoshimura, T. Masuko, K. Tokuda, T. Kobayashi, and T. Kitamura. Speaker interpolation for HMM-based speech synthesis system. J. Acoust. Soc. Jpn. (E), Vol. 21, No. 4, pp. 199-206, 2000.
- (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , Issue.4 , pp. 199-206
- Yoshimura, T.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 33847129573
- Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
- J. Yamagishi and T. Kobayashi. Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training. IEICE Trans., Inf. and Syst., Vol. E90-D, No. 2, pp. 533-543, 2007.
- (2007) IEICE Trans., Inf. and Syst. , vol.E90-D , Issue.2 , pp. 533-543
- Yamagishi, J.¹ Kobayashi, T.²

4
- 51449114529
- A style control technique for HMM-based expressive speech synthesis
- T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi. A style control technique for HMM-based expressive speech synthesis. IEICE Trans., Inf. and Syst., Vol. E90-D, No. 9, pp. 1406-1413, 2007.
- (2007) IEICE Trans., Inf. and Syst. , vol.E90-D , Issue.9 , pp. 1406-1413
- Nose, T.¹ Yamagishi, J.² Masuko, T.³ Kobayashi, T.⁴

5
- 84878419996
- The blizzard challenge 2011
- Turin, Italy, Sept.
- S. King and V. Karaiskos. The blizzard challenge 2011. In Proc. Blizzard Challenge workshop, Turin, Italy, Sept. 2011.
- (2011) Proc. Blizzard Challenge Workshop
- King, S.¹ Karaiskos, V.²

6
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda and K. Tokuda. A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Trans., Vol. E90-D, No. 5, pp. 816-824, 2007.
- (2007) IEICE Trans. , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

7
- 0028287770
- Effect of reducing slow temporalmodulations on speech reception
- R. Drullman, J.M. Festen, and R. Plomp. Effect of reducing slow temporalmodulations on speech reception. J. Acoust. Soc. of America, Vol. 95, pp. 2670-2680, 1994.
- (1994) J. Acoust. Soc. of America , vol.95 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

8
- 70349212558
- Phoneme recgnition usng spectral envelop and modulation frequency features
- Taipei, Taiwan, April
- S. Thomas, S. Ganapathy, and H. Hermansky. Phoneme recgnition usng spectral envelop and modulation frequency features. In Proc. ICASSP, pp. 4453-4456, Taipei, Taiwan, April 2009.
- (2009) Proc. ICASSP , pp. 4453-4456
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

9
- 0033708106
- Speech parameter generation algorithms for HMMbased speech synthesis
- Istanbul, Turkey, June
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura. Speech parameter generation algorithms for HMMbased speech synthesis. In Proc. ICASSP, pp. 1315-1318, Istanbul, Turkey, June 2000.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

10
- 0038443474
- Joint acoustic and modulation frequency
- L. Atlas and S. A.Shamma. Joint acoustic and modulation frequency. EURASIP Journal on Applied Signal Processing, Vol. 7, pp. 668-675, 2003.
- (2003) EURASIP Journal on Applied Signal Processing , vol.7 , pp. 668-675
- Atlas, L.¹ Shamma, S.A.²

11
- 85008023596
- Continuous F0 modeling for HMM based statistical parametric speech synthesis
- K. Yu and S. Young. Continuous F0 modeling for HMM based statistical parametric speech synthesis. IEEE Trans. Audio, Speech and Language, Vol. 19, No. 5, pp. 1071-1079, 2011.
- (2011) IEEE Trans. Audio, Speech and Language , vol.19 , Issue.5 , pp. 1071-1079
- Yu, K.¹ Young, S.²

12
- 84905244240
- A hybrid approach to electrolaryngeal speech enhansement based on spectral subtraction and statistical voice conversion
- Lyon, France, Sep.
- K. Tanaka, T. Toda, G. Neubig, S. Sakti, and S. Nakamura. A hybrid approach to electrolaryngeal speech enhansement based on spectral subtraction and statistical voice conversion. In Proc. INTERSPEECH, pp. 3067-3071, Lyon, France, Sep. 2013.
- (2013) Proc. INTERSPEECH , pp. 3067-3071
- Tanaka, K.¹ Toda, T.² Neubig, G.³ Sakti, S.⁴ Nakamura, S.⁵

13
- 84925160976
- Cambridge Univ. Press
- P. Taylor. Text-To-Speech synthesis. Cambridge Univ. Press, 2009.
- (2009) Text-To-Speech Synthesis
- Taylor, P.¹

14
- 84878390910
- Implementation of conputationally efficient real-time voice conversion
- Portland, Oregon, U.S., Sept.
- T. Toda, T. Muramatsu, and H. Banno. Implementation of conputationally efficient real-time voice conversion. In Proc. INTERSPEECH, Portland, Oregon, U.S., Sept. 2012.
- (2012) Proc. INTERSPEECH
- Toda, T.¹ Muramatsu, T.² Banno, H.³

15
- 44449177634
- Hidden semi-markov model based speech synthesis system
- H. Zen, K. Tokuda, T. Kobayashi T. Masuko, and T. Kitamura. Hidden semi-markov model based speech synthesis system. IEICE Trans., Inf. and Syst., E90-D, No. 5, pp. 825-834, 2007.
- (2007) IEICE Trans., Inf. and Syst. , vol.E90-D , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Kobayashi, T.³ Masuko, T.⁴ Kitamura, T.⁵

16
- 6644226630
- A large-scale Japanese speech database
- Kobe, Japan, Nov.
- Y. Sagisaka, K. Takeda, M. Abe, S. Katagiri, T. Umeda, and H. Kuawhara. A large-scale Japanese speech database. In ICSLP90, pp. 1089-1092, Kobe, Japan, Nov. 1990.
- (1990) ICSLP90 , pp. 1089-1092
- Sagisaka, Y.¹ Takeda, K.² Abe, M.³ Katagiri, S.⁴ Umeda, T.⁵ Kuawhara, H.⁶

17
- 84874199000
- Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
- Firentze, Italy, Sept.
- H. Kawahara, Jo Estill, and O. Fujimura. Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT. In MAVEBA 2001, pp. 1-6, Firentze, Italy, Sept. 2001.
- (2001) MAVEBA 2001 , pp. 1-6
- Kawahara, H.¹ Estill, J.² Fujimura, O.³

18
- 44949143155
- Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
- Pittsburgh, U.S.A., Sep.
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano. Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation. In Proc. INTERSPEECH, pp. 2266-2269, Pittsburgh, U.S.A., Sep. 2006.
- (2006) Proc. INTERSPEECH , pp. 2266-2269
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

19
- 0032673049
- Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. D. Cheveigne. Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Commun., Vol. 27, No. 3-4, pp. 187-207, 1999.
- (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigne, A.D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.