SCOPUS 정보 검색 플랫폼

Volumn 17, Issue 6, 2009, Pages 1231-1239

A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS

Author keywords

Bilingual hidden Markov model (HMM) based speech synthesis; Kullback Leibler divergence (KLD); new language synthesis

Indexed keywords

EID: 85008020260 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2009.2015708 Document Type: Article

Times cited : (67)

References (18)

1
- 84856249636
- From multilingual to polyglot speech synthesis
- C. Traber et al., “From multilingual to polyglot speech synthesis,” in Proc. Eurospeech, 1999, pp. 835–838.
- (1999) Proc. Eurospeech , pp. 835-838
- Traber, C.¹

2
- 60849134487
- Foreign-language speech synthesis
- N. Campbell, “Foreign-language speech synthesis,” in Proc. ESCA/COCOSDA Workshop Speech Synth., 1998, pp. 177–180.
- (1998) Proc. ESCA/COCOSDA Workshop Speech Synth. , pp. 177-180
- Campbell, N.¹

3
- 85073193558
- Language independent phoneme mapping for foreign TTS
- L. Badino, C. Barolo, and S. Quazza, “Language independent phoneme mapping for foreign TTS,” in Proc. 5th ISCA Speech Synth. Workshop, 2004, pp. 217–218.
- (2004) Proc. 5th ISCA Speech Synth. Workshop , pp. 217-218
- Badino, L.¹ Barolo, C.² Quazza, S.³

4
- 33646769932
- Polyglot synthesis using a mixture of monolingual corpora
- J. Latorre, K. Iwano, and S. Furui, “Polyglot synthesis using a mixture of monolingual corpora,” in Proc. ICASSP, 2005, vol. 1, pp. 1–4.
- (2005) Proc. ICASSP , vol.1 , pp. 1-4
- Latorre, J.¹ Iwano, K.² Furui, S.³

5
- 0141480034
- Microsoft Mulan—A bilingual TTS system
- M. Chu, H. Peng, Y. Zhao, Z. Y. Niu, and E. Chang, “Microsoft Mulan—A bilingual TTS system,” in Proc. ICASSP, 2003, vol. 1, pp. 264–267.
- (2003) Proc. ICASSP , vol.1 , pp. 264-267
- Chu, M.¹ Peng, H.² Zhao, Y.³ Niu, Z.Y.⁴ Chang, E.⁵

6
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- K. Tokuda, T. Kobayashi, T. Masuko, T. Kobayashi, and T. Kitamura, “Speech parameter generation algorithms for HMM-based speech synthesis,” in Proc. ICASSP, 2000, vol. 3, pp. 1315–1318.
- (2000) Proc. ICASSP , vol.3 , pp. 1315-1318
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

7
- 0141479954
- Optimal clustering of multivariate normal distributions using divergence and its application to HMM adaptation
- T. A. Myrvoll and F. K. Soong, “Optimal clustering of multivariate normal distributions using divergence and its application to HMM adaptation,” in Proc. ICASSP, 2003, vol. 1, pp. 552–555.
- (2003) Proc. ICASSP , vol.1 , pp. 552-555
- Myrvoll, T.A.¹ Soong, F.K.²

8
- 70349222530
- Measuring attribute dissimilarity with HMM KL-divergence for speech synthesis
- Y. Zhao, C. Zhang, F. K. Soong, M. Chu, and X. Xiao, “Measuring attribute dissimilarity with HMM KL-divergence for speech synthesis,” in Proc. 6th ISCA Speech Synth. Workshop, 2007, pp. 206–210.
- (2007) Proc. 6th ISCA Speech Synth. Workshop , pp. 206-210
- Zhao, Y.¹ Zhang, C.² Soong, F.K.³ Chu, M.⁴ Xiao, X.⁵

9
- 85063093863
- An HMM-based bilingual (Mandarin-English) TTS
- H. Liang, Y. Qian, and F. K. Soong, “An HMM-based bilingual (Mandarin-English) TTS,” in Proc. 6th ISCA Speech Synth. Workshop, 2007, pp. 137–142.
- (2007) Proc. 6th ISCA Speech Synth. Workshop , pp. 137-142
- Liang, H.¹ Qian, Y.² Soong, F.K.³

10
- 51449111086
- A cross-language state mapping approach to bilingual (Mandarin-English) TTS
- H. Liang, Y. Qian, F. K. Soong, and G. Liu, “A cross-language state mapping approach to bilingual (Mandarin-English) TTS,” in Proc. ICASSP, 2008, pp. 4641–4644.
- (2008) Proc. ICASSP , pp. 4641-4644
- Liang, H.¹ Qian, Y.² Soong, F.K.³ Liu, G.⁴

11
- 4544354696
- Seg-mental tonal modeling for phone set design in Mandarin LVCSR
- C. Huang, Y. Shi, J.-L. Zhou, M. Chu, T. Wang, and E. Chang, “Seg-mental tonal modeling for phone set design in Mandarin LVCSR,” in Proc. ICASSP, 2004, vol. 1, pp. 901–904.
- (2004) Proc. ICASSP , vol.1 , pp. 901-904
- Huang, C.¹ Shi, Y.² Zhou, J.-L.³ Chu, M.⁴ Wang, T.⁵ Chang, E.⁶

12
- 0021157408
- Line Spectrum Pair (LSP) and speech data compression
- F. K. Soong and B.-H. Juang, “Line Spectrum Pair (LSP) and speech data compression,” in Proc. ICASSP, 1984, pp. 1.10.1–1.10.4.
- (1984) Proc. ICASSP
- Soong, F.K.¹ Juang, B.-H.²

13
- 0007985533
- Speaker adaptation for HMM-based speech synthesis system using MLLR
- M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, “Speaker adaptation for HMM-based speech synthesis system using MLLR,” in Proc. 3rd ESCA/COCOSDA Workshop Speech Synth., 1998, pp. 273–276.
- (1998) Proc. 3rd ESCA/COCOSDA Workshop Speech Synth. , pp. 273-276
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

14
- 85009129569
- Evaluation of cross-language voice conversion based on GMM and STRAIGHT
- M. Mashimo, T. Toda, K. Shikano, and N. Campbell, “Evaluation of cross-language voice conversion based on GMM and STRAIGHT,” in Proc. Eurospeech, 2001, pp. 361–364.
- (2001) Proc. Eurospeech , pp. 361-364
- Mashimo, M.¹ Toda, T.² Shikano, K.³ Campbell, N.⁴

15
- 0036522887
- Multi-space probability distribution HMM
- K. Tokuda, T. Mausko, N. Miyazaki, and T. Kobayashi, “Multi-space probability distribution HMM,” IEICE Trans. Inf. Syst., vol. E85-D, no. 3, pp. 455–464, 2002.
- (2002) IEICE Trans. Inf. Syst. , vol.E85-D , Issue.3 , pp. 455-464
- Tokuda, K.¹ Mausko, T.² Miyazaki, N.³ Kobayashi, T.⁴

16
- 0032673049
- Restructuring speech representations using pitch-adaptive time-frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne “Restructuring speech representations using pitch-adaptive time-frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds,” Speech Commun., vol. 27, pp. 187–207, 1999.
- (1999) Speech Commun. , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigne, A.³

17
- 0033906251
- MDL-based context-dependent subword modeling for speech recognition
- K. Shinoda and T. Watanable, “MDL-based context-dependent subword modeling for speech recognition,” J. Acoust. Soc. Jpn.(E), vol. 21, no. 2, pp. 79–86, 2000.
- (2000) J. Acoust. Soc. Jpn.(E) , vol.21 , Issue.2 , pp. 79-86
- Shinoda, K.¹ Watanable, T.²

18
- 0003450846
- Rec., ITU-T Std.
- Methods for Subjective Determination of Transmission Quality, Rec. p. 800, ITU-T Std., 1996.
- (1996) Methods for Subjective Determination of Transmission Quality , pp. 800

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.