SCOPUS 정보 검색 플랫폼

Volumn E89-D, Issue 11, 2006, Pages 2775-2782

Hybrid voice conversion of unit selection and generation using prosody dependent HMM

Author keywords

HMM; MLLR; Speech synthesis; Unit selection; Voice conversion

Indexed keywords

OPTIMIZATION; PROBABILITY DISTRIBUTIONS; WAVEFORM ANALYSIS;

TARGET WAVEFORMS; VOICE CONVERSION;

SPEECH SYNTHESIS;

EID: 33845586220 PISSN: 09168532 EISSN: 17451361 Source Type: Journal
DOI: 10.1093/ietisy/e89-d.11.2775 Document Type: Article

Times cited : (8)

References (14)

1
- 0003571407
- A.W. Black, P. Taylor, and R. Caley, "The festival speech synthesis system," http://festvox.org/festival/
- The Festival Speech Synthesis System
- Black, A.W.¹ Taylor, P.² Caley, R.³

2
- 0011946055
- CHATR: A high-definition speech re-sequencing system
- N. Campbell, "CHATR: A high-definition speech re-sequencing system," Proc. 3rd ASA/ASJ Joint Meeting, pp.1223-1228, 1996.
- (1996) Proc. 3rd ASA/ASJ Joint Meeting , pp. 1223-1228
- Campbell, N.¹

3
- 0029725605
- Speech synthesis from HMMs using dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis from HMMs using dynamic features," Proc. ICASSP96, vol.1, pp.389-392, 1996.
- (1996) Proc. ICASSP96 , vol.1 , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

4
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M.J.F. Gales and P.C. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech Lang., vol.10, no.4, pp.249-264, 1996.
- (1996) Comput. Speech Lang. , vol.10 , Issue.4 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

5
- 0034842740
- Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
- M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," Proc. ICASSP2001, vol.2, pp.805-808, 2001.
- (2001) Proc. ICASSP2001 , vol.2 , pp. 805-808
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

6
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," Proc. ICASSP88, pp.655-658, 1988.
- (1988) Proc. ICASSP88 , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

8
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain and M. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP98, pp.285-299, 1998.
- (1998) Proc. ICASSP98 , pp. 285-299
- Kain, A.¹ Macon, M.²

9
- 85031628788
- An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
- K. Tokuda, T. Masuko, T. Yamada, T. Kobayashi, and S. Imai, "An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features," Proc. EUROSPEECH95, pp.757-760, 1995.
- (1995) Proc. EUROSPEECH95 , pp. 757-760
- Tokuda, K.¹ Masuko, T.² Yamada, T.³ Kobayashi, T.⁴ Imai, S.⁵

10
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- A.J. Hunt and A.W. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," Proc. ICASSP96, pp.373-376, 1996.
- (1996) Proc. ICASSP96 , pp. 373-376
- Hunt, A.J.¹ Black, A.W.²

11
- 0028996983
- Automatic speech synthesizer parameter estimation using HMMs
- R.E. Donovan and P.C. Woodland, "Automatic speech synthesizer parameter estimation using HMMs," Proc. ICASSP95, pp.640-643, 1995.
- (1995) Proc. ICASSP95 , pp. 640-643
- Donovan, R.E.¹ Woodland, P.C.²

12
- 0030683369
- Recent improvements on Microsoft's trainable text-to-speech system -Whistler
- X. Huang, A. Acero, H. Hon, Y. Ju, J. Liu, S. Meredith, and M. Plumpe, "Recent improvements on Microsoft's trainable text-to-speech system -Whistler," Proc. ICASSP97, vol.2, pp.959-962, 1997.
- (1997) Proc. ICASSP97 , vol.2 , pp. 959-962
- Huang, X.¹ Acero, A.² Hon, H.³ Ju, Y.⁴ Liu, J.⁵ Meredith, S.⁶ Plumpe, M.⁷

13
- 0030364809
- An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken words
- Y. Arai, R. Mochizuki, H. Nishimura, and T. Honda, "An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken words," Proc. ICSLP96, vol.3, pp.1437-1440, 1996.
- (1996) Proc. ICSLP96 , vol.3 , pp. 1437-1440
- Arai, Y.¹ Mochizuki, R.² Nishimura, H.³ Honda, T.⁴

14
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol.9, pp.453-467, 1990.
- (1990) Speech Commun. , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.