-
1
-
-
0013634269
-
A study of the building blocks in speech
-
C.M. Harris: A study of the building blocks in speech, J. Acoust. Soc. Am. 25, 962–969 (1953)
-
(1953)
J. Acoust. Soc. Am.
, vol.25
, pp. 962-969
-
-
Harris, C.M.1
-
2
-
-
0028823541
-
Speech recognition with primarily temporal cues
-
R.V. Shannon, F.G. Zeng, V. Kamath, J. Wygon-ski, M. Ekelid: Speech recognition with primarily temporal cues, Science 13(5234), 270 (1995)
-
(1995)
Science
, vol.13
, Issue.5234
-
-
Shannon, R.V.1
Zeng, F.G.2
Kamath, V.3
Wygon-Ski, J.4
Ekelid, M.5
-
3
-
-
0345093720
-
Terminal analog synthesis of continuous speech using the diphone method of segment assembly
-
N.R. Dixon, H.D. Maxey: Terminal analog synthesis of continuous speech using the diphone method of segment assembly, IEEE Trans. ASSP AU-16(1), 40–50 (1968)
-
(1968)
IEEE Trans. ASSP AU-
, vol.16
, Issue.1
, pp. 40-50
-
-
Dixon, N.R.1
Maxey, H.D.2
-
5
-
-
85075932515
-
Improving the quality of MBROLA synthesis for non-uniform units synthesis
-
ed. by S. Narayanan, A. Alwan (Prentice-Hall, Upper Saddle River
-
B. Bozkurt, T. Dutoit, R. Prudon, C. d’Alessandro, V. Pagel: Improving the quality of MBROLA synthesis for non-uniform units synthesis. In: Text to Speech Synthesis: New Paradigms and Advances, ed. by S. Narayanan, A. Alwan (Prentice-Hall, Upper Saddle River 2004)
-
(2004)
Text to Speech Synthesis: New Paradigms and Advances
-
-
Bozkurt, B.1
Dutoit, T.2
Prudon, R.3
D’Alessandro, C.4
Pagel, V.5
-
6
-
-
0025543906
-
Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
-
E. Moulines, F. Charpentier: Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun. 9, 5–6 (1990)
-
(1990)
Speech Commun
, vol.9
, pp. 5-6
-
-
Moulines, E.1
Charpentier, F.2
-
7
-
-
0035127703
-
Applying the harmonic plus noise model in concatenative synthesis
-
Y. Stylianou: Applying the harmonic plus noise model in concatenative synthesis, IEEE Trans. Speech Audio Process. 9(1), 21–29 (2001)
-
(2001)
IEEE Trans. Speech Audio Process.
, vol.9
, Issue.1
, pp. 21-29
-
-
Stylianou, Y.1
-
8
-
-
0027252181
-
An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech
-
Vol., pp
-
W. Verhelst, M. Roelands: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech, Proc. ICASSP 93, Vol. 2 (1993) pp. 554–557
-
(1993)
Proc. ICASSP 93
, vol.2
, pp. 554-557
-
-
Verhelst, W.1
Roelands, M.2
-
9
-
-
0008006444
-
Time-scale modification algorithm for speech by use of pointer interval control overlap and add (PICOLA) and its evaluation
-
Vol.,) pp
-
N. Morita, F. Itakura: Time-scale modification algorithm for speech by use of pointer interval control overlap and add (PICOLA) and its evaluation, Proc. Annu. Meeting of Acoust. Soc. Jpn., Vol. 86 (1986) pp. 9–16
-
(1986)
Proc. Annu. Meeting of Acoust. Soc. Jpn
, vol.86
, pp. 9-16
-
-
Morita, N.1
Itakura, F.2
-
13
-
-
0027839344
-
MBR-PSOLA: Text-to-speech synthesis based on an MBE resynthesis of the segments database
-
T. Dutoit, H. Leich: MBR-PSOLA: Text-to-speech synthesis based on an MBE resynthesis of the segments database, Speech Commun. 13, 435–440 (1993)
-
(1993)
Speech Commun
, vol.13
, pp. 435-440
-
-
Dutoit, T.1
Leich, H.2
-
15
-
-
0035124445
-
Control of spectral dynamics in concatenative speech synthesis
-
J. Wouters, M.W. Macon: Control of spectral dynamics in concatenative speech synthesis, IEEE Trans. Speech Audio Process. 9(1), 30–38 (2001)
-
(2001)
IEEE Trans. Speech Audio Process.
, vol.9
, Issue.1
, pp. 30-38
-
-
Wouters, J.1
Macon, M.W.2
-
17
-
-
0027228898
-
Multilingual PSOLA text-to-speech system
-
Vol.,) pp
-
D. Bigorne, O. Boeffard, B. Cherbonnel, F. Emerard, D. Larreur, J.L. Le Saint-Milon, I. Metayer, C. Sorin, S. White: Multilingual PSOLA text-to-speech system, Proc. Int. Conf. Acoust. Speech Signal Process., Vol. 2 (1993) pp. 187–190
-
(1993)
Proc. Int. Conf. Acoust. Speech Signal Process
, vol.2
, pp. 187-190
-
-
Bigorne, D.1
Boeffard, O.2
Cherbonnel, B.3
Emerard, F.4
Larreur, D.5
Le Saint-Milon, J.L.I.6
Metayer, C.7
Sorin, S.W.8
-
18
-
-
0012356658
-
HADIFIX, a system for German speech synthesis based on demisyllables, diphones, and suffixes
-
T. Portele, W. Sendlemeier, W. Hess: HADIFIX, a system for German speech synthesis based on demisyllables, diphones, and suffixes, Proc. First ESCA Workshop on Speech Synthesis (1990) pp. 161– 164
-
(1990)
Proc. First ESCA Workshop on Speech Synthesis
, pp. 161-164
-
-
Portele, T.1
Sendlemeier, W.2
Hess, W.3
-
19
-
-
0028499480
-
Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering
-
S. Nakajima: Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering, Speech Commun. 14, 313–324 (1994)
-
(1994)
Speech Commun
, vol.14
, pp. 313-324
-
-
Nakajima, S.1
-
20
-
-
85135181226
-
Improvements in an HMM-based speech synthesizer
-
Vol.,) pp
-
R. Donovan, P. Woodland: Improvements in an HMM-based speech synthesizer, Proc. Eurospeech 95, Vol. 1 (1995) pp. 573–576
-
(1995)
Proc. Eurospeech 95
, vol.1
, pp. 573-576
-
-
Donovan, R.1
Woodland, P.2
-
21
-
-
85068112784
-
Rule synthesis of speech from diadic units
-
Vol.,) pp
-
J.P. Olive: Rule synthesis of speech from diadic units, Proc. ICASSP, Vol. 77 (1977) pp. 568–570
-
(1977)
Proc. ICASSP
, vol.77
, pp. 568-570
-
-
Olive, J.P.1
-
22
-
-
85135109865
-
ATR ν-TALK speech synthesis system
-
pp
-
Y. Sagisaka, N. Kaiki, N. Iwahashi, K. Mimura: ATR ν-TALK speech synthesis system, Proc. ICSLP 92, Vol. 1 (1992) pp. 483–486
-
(1992)
Proc. ICSLP 92
, vol.1
, pp. 483-486
-
-
Sagisaka, Y.1
Kaiki, N.2
Iwahashi, N.3
Mimura, K.4
-
23
-
-
0029765811
-
Unit selection in a concatenative speech synthesis system using a large speech database
-
Vol.,) pp
-
A.J. Hunt, A.W. Black: Unit selection in a concatenative speech synthesis system using a large speech database, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP ’96), Vol. 1 (1996) pp. 373–376
-
(1996)
Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP ’96)
, vol.1
, pp. 373-376
-
-
Hunt, A.J.1
Black, A.W.2
-
24
-
-
0000237685
-
Prosody and the selection of source units for concatenative synthesis
-
ed. by J. van Santen, R. Sproat, J. Olive, J. Hirshberg (Springer, Berlin, Heidelberg
-
N. Campbell, A. Black: Prosody and the selection of source units for concatenative synthesis. In: Progress in Speech Synthesis, ed. by J. van Santen, R. Sproat, J. Olive, J. Hirshberg (Springer, Berlin, Heidelberg 1995)
-
(1995)
Progress in Speech Synthesis
-
-
Campbell, N.1
Black, A.2
-
25
-
-
0002425861
-
The AT&T next-gen TTS system
-
M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, A. Syrdal: The AT&T next-gen TTS system, Proc. Joint Meeting of ASA (1999)
-
(1999)
Proc. Joint Meeting of ASA
-
-
Beutnagel, M.1
Conkie, A.2
Schroeter, J.3
Stylianou, Y.4
Syrdal, A.5
-
26
-
-
85135259940
-
Choose the best to modify the least: A new generation concatenative synthesis system
-
Vol.,) pp
-
M. Balestri, A. Paechiotti, S. Quazza, P.L. Salza, S. Sandri: Choose the best to modify the least: a new generation concatenative synthesis system, Proc. Eurospeech, Vol. 99 (1999) pp. 2291–2294
-
(1999)
Proc. Eurospeech
, vol.99
, pp. 2291-2294
-
-
Balestri, M.1
Paechiotti, A.2
Quazza, S.3
Salza, P.L.4
Sandri, S.5
-
27
-
-
85135272129
-
Speech synthesis by phonological structure matching
-
Vol.,) pp
-
P. Taylor, A.W. Black: Speech synthesis by phonological structure matching, Proc. Eurospeech, Vol. 99 (1999) pp. 623–626
-
(1999)
Proc. Eurospeech
, vol.99
, pp. 623-626
-
-
Taylor, P.1
Black, A.W.2
-
28
-
-
70349848071
-
Join cost for unit selection speech synthesis
-
ed. by A. Alwan, S. Narayanan (Prentice-Hall, Upper Saddle River
-
J. Vepa, S. King: Join cost for unit selection speech synthesis. In: Speech Synthesis, ed. by A. Alwan, S. Narayanan (Prentice-Hall, Upper Saddle River 2004)
-
(2004)
Speech Synthesis
-
-
Vepa, J.1
King, S.2
-
29
-
-
0037236894
-
Rare events and closed domains: Two delicate concepts in speech synthesis
-
B. Möbius: Rare events and closed domains: Two delicate concepts in speech synthesis, Int. J. Speech Technol. 6(1), 57–71 (2003)
-
(2003)
Int. J. Speech Technol.
, vol.6
, Issue.1
, pp. 57-71
-
-
Möbius, B.1
-
30
-
-
85135154775
-
Combinatorial issues in text-to-speech synthesis
-
Vol.,) pp
-
J.P.H. van Santen: Combinatorial issues in text-to-speech synthesis, Proc. Euro. Conf. Speech Commun. Technol., Vol. 5 (1997) pp. 2511–2514
-
(1997)
Proc. Euro. Conf. Speech Commun. Technol
, vol.5
, pp. 2511-2514
-
-
van Santen, J.P.H.1
-
31
-
-
84966301419
-
Limited domain synthesis
-
pp
-
A. Black, K. Lenzo: Limited domain synthesis, Proc. ICSLP (2000) pp. 411–414
-
(2000)
Proc. ICSLP
, pp. 411-414
-
-
Black, A.1
Lenzo, K.2
-
33
-
-
85075951335
-
Whistler: A trainable text-to-speech system
-
Vol.,) pp
-
X. Huang, A. Acero, J. Adcock, H. Hon, J. Goldsmith, J. Liu, M. Plumpe: Whistler: A trainable text-to-speech system, Proc. ICSLP, Vol. 96 (1996) pp. 659– 662
-
(1996)
Proc. ICSLP
, vol.96
, pp. 659-662
-
-
Huang, X.1
Acero, A.2
Adcock, J.3
Hon, H.4
Goldsmith, J.5
Liu, J.6
Plumpe, M.7
-
34
-
-
33645758767
-
An HMM-based approach to multilingual speech synthesis
-
ed. by S. Narayanan, A. AlwanPrentice Hall, Upper Saddle River
-
K. Tokuda, H. Zen, A. Black: An HMM-based approach to multilingual speech synthesis. In: Text to Speech Synthesis: New Paradigms and Advances, ed. by S. Narayanan, A. Alwan (Prentice Hall, Upper Saddle River 2004) pp. 135–153
-
(2004)
Text to Speech Synthesis: New Paradigms and Advances
, pp. 135-153
-
-
Tokuda, K.1
Zen, H.2
Black, A.3
-
35
-
-
85009231020
-
Custom-tailoring TTS voice font – keeping the naturalness when reducing database size
-
pp
-
Y. Zhao, M. Chu, H. Peng, E. Chang: Custom-tailoring TTS voice font – keeping the naturalness when reducing database size, Proc. Eurospeech, Vol. 2003 (2003) pp. 2957–2960
-
(2003)
Proc. Eurospeech
, vol.2003
, pp. 2957-2960
-
-
Zhao, Y.1
Chu, M.2
Peng, H.3
Chang, E.4
-
36
-
-
33745216013
-
Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling
-
Vol.,) pp
-
D. Chazan, R. Hoory, Z. Kons, A. Sagi, S. Shechtman, A. Sorin: Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling, Proc. Interspeech, Vol. 2005 (2005) pp. 2569–2572
-
(2005)
Proc. Interspeech
, vol.2005
, pp. 2569-2572
-
-
Chazan, D.1
Hoory, R.2
Kons, Z.3
Sagi, A.4
Shechtman, S.5
Sorin, A.6
-
37
-
-
85133526552
-
Automatically clustering similar units for unit selection in speech synthesis
-
Vol
-
A.W. Black, P. Taylor: Automatically clustering similar units for unit selection in speech synthesis, Proc. Eurospeech, Vol. 2 (1997)
-
(1997)
Proc. Eurospeech
, vol.2
-
-
Black, A.W.1
Taylor, P.2
-
39
-
-
84966366503
-
Rapid unit selection from a large speech corpus for concatenative speech synthesis
-
Vol.,) pp
-
M. Beutnagel, M. Mohri, M. Riley: Rapid unit selection from a large speech corpus for concatenative speech synthesis, Proc. Eurospeech ’99, Vol. 2 (1999) pp. 607–610
-
(1999)
Proc. Eurospeech ’99
, vol.2
, pp. 607-610
-
-
Beutnagel, M.1
Mohri, M.2
Riley, M.3
-
41
-
-
85134885765
-
A hidden Markov model approach to speech synthesis
-
Vol.,) pp
-
A. Falaschi, M. Giustiniani, M. Verola: A hidden Markov model approach to speech synthesis, Proc. Eurospeech, Vol. 1989 (1989) pp. 2187– 2190
-
(1989)
Proc. Eurospeech
, vol.1989
, pp. 2187-2190
-
-
Falaschi, A.1
Giustiniani, M.2
Verola, M.3
-
42
-
-
42649146508
-
On the use of phonetic information for mapping from articulatory movements to vocal tract spectrum
-
Vol.,) pp
-
K. Nakamura, T. Toda, Y. Nankaku, K. Tokuda: On the use of phonetic information for mapping from articulatory movements to vocal tract spectrum, Proc. ICASSP, Vol. 06 (2006) pp. 93–96
-
(2006)
Proc. ICASSP
, vol.6
, pp. 93-96
-
-
Nakamura, K.1
Toda, T.2
Nankaku, Y.3
Tokuda, K.4
-
43
-
-
85123861026
-
XIMERA: A new TTS from ATR based on corpus-based technologies
-
pp
-
H. Kawai, T. Toda, J. Ni, M. Tsuzaki, K. Tokuda: XIMERA: A new TTS from ATR based on corpus-based technologies, Proc. 5th ISCA Speech Synthesis Workshop (2004) pp. 179–184
-
(2004)
Proc. 5Th ISCA Speech Synthesis Workshop
, pp. 179-184
-
-
Kawai, H.1
Toda, T.2
Ni, J.3
Tsuzaki, M.4
Tokuda, K.5
|