[1] Moulines, E. and Charpentier, F., “Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones”, Speech Communication, Vol.9, 1990, 453-467.
[2] Klatt, D.H., “The Klattalk text-to-speech conversion system”, Proc. ICASSP'82, 1982, 1589-1592.
[3] Fujisaki, H., Hirose, K., Takahashi, N. and Morikawa, H., “Acoustic characteristics and the underlying rules of intonation of the common Japanese used by radio and TV announcers”, Proc. ICASSP'86, 1986, 2039-2042.
[4] Van Santen, J., “Assignment of segmental duration in text-to-speech synthesis”, Computer Speech and Language, Vol.8, 1994, 95-129.
[5] Fukada, T., Komori, Y., Aso, T. and Ohora, Y., “A study of pitch pattern generation using HMM-based statistical information”, Proc. ICSLP'94, 1994, 723-726.
[6] Ross, K.N. and Ostendorf, M., “A dynamical system model for generating fundamental frequency for speech synthesis”, IEEE Transactions on Speech and Audio Processing, Vol.7, No.3, 1999, 295-309.
[7] Goubanova, O. and Taylor, P., “Using Bayesian belief networks for model duration in text-to-speech systems”, Proc. ICSLP'2000, 2000.
[8] Bellegarda, J.R., Silverman, K.E.A., Lenzo, K. and Anderson, V., “Statistical prosodic modeling: from corpus design to parameter estimation”, IEEE Transactions on Speech and Audio Processing, Vol.9, No.1, 2001, 52-66.
[9] Chen, S., Hwang, S. and Wang, Y., “An RNN-based prosodic information synthesizer for Mandarin text-to-speech”, IEEE Transactions on Speech and Audio Processing, Vol.6, No.3, 1998, 226-239.
[10] Wang, W. J., Campbell, W. N., Iwahashi, N. and Sagisaka, Y., “Tree-based unit selection for English speech synthesis”, ICASSP'93, Vol.2, 191-194.
[11] Hon, H., Acero, A., Huang, S., Liu, J. and Plumpe, M., “Automatic generation of synthesis units for trainable text-to-speech systems”, ICASSP'98, Vol.1, 293-296.
[12] Black, A. and Campbell, N., “Optimizing selection of units from speech database for concatenative synthesis”, ICASSP'96, 1996, 373-376.
[13] Chu, M., Tang, D., Si, H., Tian, X. and Lu, S., “Research on perception of juncture between syllables in Chinese”, Chinese Journal of Acoustics, Vol.17, No.2, 143-152.
[14] Qian, Y., Chu, M. and Peng, H., “Segmenting unrestricted Chinese text into prosodic words instead of lexical words”, Proc. ICASSP2001, 2001.
[16] Huang, X.D., Acero, A., Hon, H. and Meredith, S., Spoken Language Processing (draft), chapter 4.