-
1
-
-
0012925721
-
Perceptual bandwidth
-
Mar
-
B. Reeves and C. Nass, "Perceptual bandwidth," Commun. ACM, vol. 43, no. 3, pp. 65-70, Mar. 2000.
-
(2000)
Commun. ACM
, vol.43
, Issue.3
, pp. 65-70
-
-
Reeves, B.1
Nass, C.2
-
2
-
-
34047248476
-
Multi-layered extensions to the speech synthesis markup language for describing expressiveness
-
Geneva, Switzerland
-
E. Eide, R. Bakis, W. Hamza, and J. Pitrelli, "Multi-layered extensions to the speech synthesis markup language for describing expressiveness," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 1645-1648.
-
(2003)
Proc. Eurospeech
, pp. 1645-1648
-
-
Eide, E.1
Bakis, R.2
Hamza, W.3
Pitrelli, J.4
-
4
-
-
85001632375
-
Corpus-based techniques in the AT&T NextGen synthesis system
-
Beijing, China
-
A. K. Syrdal, C. W. Wightman, A. Conkie, Y. Stylianou, M. Beutnagel, J. Schroeter, V. Strom, K. Lee, and M. J. Makashay, "Corpus-based techniques in the AT&T NextGen synthesis system," in Proc. ICSLP, Beijing, China, 2000, pp. 431-434.
-
(2000)
Proc. ICSLP
, pp. 431-434
-
-
Syrdal, A.K.1
Wightman, C.W.2
Conkie, A.3
Stylianou, Y.4
Beutnagel, M.5
Schroeter, J.6
Strom, V.7
Lee, K.8
Makashay, M.J.9
-
5
-
-
84985926077
-
Segment Sektion in the L&H realspeak laboratory TTS system
-
Beijing, China
-
G. Coorman, J. Fackrell, P. Rutten, and B. Van Coile, "Segment Sektion in the L&H realspeak laboratory TTS system," in Proc. ICSLP, Beijing, China, 2000, pp. 395-398.
-
(2000)
Proc. ICSLP
, pp. 395-398
-
-
Coorman, G.1
Fackrell, J.2
Rutten, P.3
Van Coile, B.4
-
6
-
-
0004131347
-
Trainable speech synthesis,
-
Ph.D. dissertation, Cambridge Univ. Eng. Dept, Cambridge, U.K
-
R. E. Donovan, "Trainable speech synthesis," Ph.D. dissertation, Cambridge Univ. Eng. Dept., Cambridge, U.K., 1996.
-
(1996)
-
-
Donovan, R.E.1
-
7
-
-
0003802343
-
-
Monterey, CA: Wadsworth and Brooks/Cole
-
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone, Classification and Regression Trees. Monterey, CA: Wadsworth and Brooks/Cole, 1984.
-
(1984)
Classification and Regression Trees
-
-
Breiman, L.1
Friedman, J.H.2
Olshen, R.A.3
Stone, C.J.4
-
8
-
-
0039885315
-
Context dependent vector quantization for continuous speech recognition
-
Minneapolis, MN
-
L. R. Bahl, P. V. deSouza, P. S. Gopalakrishnan, and M. A. Picheny, "Context dependent vector quantization for continuous speech recognition," in Proc. ICASSP, Minneapolis, MN, 1993, pp. 632-635.
-
(1993)
Proc. ICASSP
, pp. 632-635
-
-
Bahl, L.R.1
deSouza, P.V.2
Gopalakrishnan, P.S.3
Picheny, M.A.4
-
9
-
-
85135181226
-
Improvements in an HMM-based speech synthesiser
-
R. Donovan and P. Woodland, "Improvements in an HMM-based speech synthesiser," in Proc. Eurospeech, 1995, pp. 573-576.
-
(1995)
Proc. Eurospeech
, pp. 573-576
-
-
Donovan, R.1
Woodland, P.2
-
10
-
-
85133526552
-
Automatically Clustering Similar Units for Unit Selection in Speech Synthesis
-
A. W. Black and P. Taylor, "Automatically Clustering Similar Units for Unit Selection in Speech Synthesis," in Proc. Eurospeech, 1997, pp. 601-604.
-
(1997)
Proc. Eurospeech
, pp. 601-604
-
-
Black, A.W.1
Taylor, P.2
-
11
-
-
80051612889
-
A new distance measure for costing spectral discontinuities in concatenate speech synthesisers
-
Perthshire, U.K
-
R. E. Donovan, "A new distance measure for costing spectral discontinuities in concatenate speech synthesisers," in Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis, Perthshire, U.K., 2001, pp. 59-62.
-
(2001)
Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis
, pp. 59-62
-
-
Donovan, R.E.1
-
12
-
-
34047249700
-
Intrinsic phone durations are speaker-specific
-
Denver, CO
-
H. R. Pfitzinger, "Intrinsic phone durations are speaker-specific," in Proc. ICSLP, vol. 2, Denver, CO, 2002, pp. 1113-1116.
-
(2002)
Proc. ICSLP
, vol.2
, pp. 1113-1116
-
-
Pfitzinger, H.R.1
-
13
-
-
33745197584
-
Reconciling pronunciation differences between the front-end and back-end in the IBM speech synthesis system
-
Jeju, South Korea, Oct
-
W. Hamza, R. Bakis, and E. Eide, "Reconciling pronunciation differences between the front-end and back-end in the IBM speech synthesis system," in Proc. ICSLP, Jeju, South Korea, Oct. 2004, pp. 2561-2564.
-
(2004)
Proc. ICSLP
, pp. 2561-2564
-
-
Hamza, W.1
Bakis, R.2
Eide, E.3
-
14
-
-
85009274956
-
Data-driven segment preselection in the IBM trainable speech synthesis system
-
Denver, CO
-
W. Hamza and R. Donovan, "Data-driven segment preselection in the IBM trainable speech synthesis system," in Proc. ICSLP, Denver, CO, 2002, pp. 2609-1612.
-
(2002)
Proc. ICSLP
, pp. 2609-1612
-
-
Hamza, W.1
Donovan, R.2
-
15
-
-
0029765811
-
Unit selection in a concatenative speech synthesis system using a large speech database
-
Atlanta, GA
-
A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, Atlanta, GA, 1996, pp. 373-376.
-
(1996)
Proc. ICASSP
, pp. 373-376
-
-
Hunt, A.1
Black, A.2
-
16
-
-
34047274359
-
Re-defining intonation from selected units for nonuniform units based speech synthesis
-
Leuven, Belgium
-
B. Bozkurt, T. Dutoit, and V. Pagel, "Re-defining intonation from selected units for nonuniform units based speech synthesis," in Proc. SPS-IEEE Benelux Signal Process. Symp., Leuven, Belgium, 2002, pp. 141-144.
-
(2002)
Proc. SPS-IEEE Benelux Signal Process. Symp
, pp. 141-144
-
-
Bozkurt, B.1
Dutoit, T.2
Pagel, V.3
-
18
-
-
85009247888
-
Expressive speech synthesis : Using a concatenative synthesizer
-
Denver, CO
-
M. Bulut, S. Narayanan, and A. Syrdal, "Expressive speech synthesis : using a concatenative synthesizer," in Proc. ICSLP, Denver, CO, 2002, pp. 1265-1268.
-
(2002)
Proc. ICSLP
, pp. 1265-1268
-
-
Bulut, M.1
Narayanan, S.2
Syrdal, A.3
-
19
-
-
84966356293
-
Preservation, identification, and use of emotion in a text-to-speech system
-
Santa Monica, CA, Sep
-
E. Eide, "Preservation, identification, and use of emotion in a text-to-speech system," in Proc. IEEE Workshop on Speech Synthesis, Santa Monica, CA, Sep. 2002.
-
(2002)
Proc. IEEE Workshop on Speech Synthesis
-
-
Eide, E.1
-
21
-
-
0030355540
-
0 contours from ToBI labels using linear regression
-
Philadelphia, PA, pp
-
0 contours from ToBI labels using linear regression," in Proc. ICSLP, Philadelphia, PA, pp. 1385-1388.
-
Proc. ICSLP
, pp. 1385-1388
-
-
Black, A.W.1
Hunt, A.J.2
-
22
-
-
85030872484
-
Evaluation of prosodic transcription labeling reliability in the ToBI framework
-
Yokohama, Japan, Sep
-
J. F. Pitrelli, M. E. Beckman, and J. Hirschberg, "Evaluation of prosodic transcription labeling reliability in the ToBI framework," in Proc. ICSLP, vol. I, Yokohama, Japan, Sep. 1994, pp. 123-126.
-
(1994)
Proc. ICSLP
, vol.1
, pp. 123-126
-
-
Pitrelli, J.F.1
Beckman, M.E.2
Hirschberg, J.3
-
23
-
-
85009080611
-
Inter-transcriber reliability of ToBI prosodie labeling
-
Beijing, China
-
A. K. Syrdal and J. McGory, "Inter-transcriber reliability of ToBI prosodie labeling," in Proc. ICSLP, Beijing, China., 2000, pp. 235-238.
-
(2000)
Proc. ICSLP
, pp. 235-238
-
-
Syrdal, A.K.1
McGory, J.2
-
24
-
-
85119213703
-
TOBI: A standard for labeling english prosody
-
Banff, AB, Canada, Oct
-
K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert, and J. Hirschberg, "TOBI: A standard for labeling english prosody," in Proc. ICSLP, vol. 2, Banff, AB, Canada, Oct. 1992, pp. 867-870.
-
(1992)
Proc. ICSLP
, vol.2
, pp. 867-870
-
-
Silverman, K.1
Beckman, M.2
Pitrelli, J.3
Ostendorf, M.4
Wightman, C.5
Price, P.6
Pierrehumbert, J.7
Hirschberg, J.8
-
25
-
-
0343353984
-
Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events
-
A. Conkie, G. Riccardi, and R. C. Rose, "Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events," in Proc. Eurospeech, 1999, pp. 523-526.
-
(1999)
Proc. Eurospeech
, pp. 523-526
-
-
Conkie, A.1
Riccardi, G.2
Rose, R.C.3
-
26
-
-
21844471192
-
ToBI or Not ToBI
-
Aix-en-Provence, France, pp
-
C. W. Wightman, "ToBI or Not ToBI," in Proc. Speech Prosody, Aix-en-Provence, France, pp. 25-29.
-
Proc. Speech Prosody
, pp. 25-29
-
-
Wightman, C.W.1
-
27
-
-
0035156005
-
Automatic ToBI prediction and alignment to speed manual labeling of prosody
-
A. K. Syrdal, J. Hirschberg, J. McGory, and M. Beckman, "Automatic ToBI prediction and alignment to speed manual labeling of prosody," Speech Commun., vol. 33, no. 1-2, pp. 135-151, 2001.
-
(2001)
Speech Commun
, vol.33
, Issue.1-2
, pp. 135-151
-
-
Syrdal, A.K.1
Hirschberg, J.2
McGory, J.3
Beckman, M.4
|