-
1
-
-
0004056285
-
-
Prentice-Hall, New York, NJ, USA
-
Huang, X., Acero, A., Hon, H.W.: In: Spoken Language Processing. Prentice-Hall, New York, NJ, USA (2001)
-
(2001)
Spoken Language Processing
-
-
Huang, X.1
Acero, A.2
Hon, H.W.3
-
2
-
-
0016952322
-
Linguistic uses of segmental duration in English: Acoustic and perceptual evidence
-
Klatt, D.H.: Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. Journal of the Acoustical Society of America 59 (1976) 1209-1221
-
(1976)
Journal of the Acoustical Society of America
, vol.59
, pp. 1209-1221
-
-
Klatt, D.H.1
-
3
-
-
4544325422
-
Development of text-to-speech system for Indian languages
-
Pune, India
-
Yegnanarayana, B., Murthy, H.A., Sundar, R., Ramachandran, V.R., Kumar, A.S.M., Alwar, N., Rajendran, S.: Development of text-to-speech system for Indian languages. In: Proceedings of the International Conference on Knowledge Based Computer Systems, Pune, India (1990) 467-476
-
(1990)
Proceedings of the International Conference on Knowledge Based Computer Systems
, pp. 467-476
-
-
Yegnanarayana, B.1
Murthy, H.A.2
Sundar, R.3
Ramachandran, V.R.4
Kumar, A.S.M.5
Alwar, N.6
Rajendran, S.7
-
4
-
-
0043095316
-
A new duration modeling approach for Mandarin speech
-
Chen, S.H., Lai, W.H., Wang, Y.R.: A new duration modeling approach for Mandarin speech. IEEE Transactions on Speech and Audio Processing 11 (2003) 308-320
-
(2003)
IEEE Transactions on Speech and Audio Processing
, vol.11
, pp. 308-320
-
-
Chen, S.H.1
Lai, W.H.2
Wang, Y.R.3
-
5
-
-
85009154226
-
Building an integrated prosodic model of German
-
Aalborg, Denmark
-
Mixdorff, H., Jokisch, O.: Building an integrated prosodic model of German. In: Proceedings of the European Conference on Speech Communication and Technology. Volume 2, Aalborg, Denmark (2001) 947-950
-
(2001)
Proceedings of the European Conference on Speech Communication and Technology
, vol.2
, pp. 947-950
-
-
Mixdorff, H.1
Jokisch, O.2
-
7
-
-
0028405296
-
Assignment of segment duration in text-to-speech synthesis
-
Santen, J.P.H.V.: Assignment of segment duration in text-to-speech synthesis. Computer Speech and Language 8 (1994) 95-128
-
(1994)
Computer Speech and Language
, vol.8
, pp. 95-128
-
-
Santen, J.P.H.V.1
-
8
-
-
85009107944
-
Using Bayesian belief networks for modeling duration in text-to-speech systems
-
Beijing, China
-
Goubanova, O., Taylor, P.: Using Bayesian belief networks for modeling duration in text-to-speech systems. In: Proceedings of the International Conference on Spoken Language Processing. Volume 2, Beijing, China (2000) 427-431
-
(2000)
Proceedings of the International Conference on Spoken Language Processing
, vol.2
, pp. 427-431
-
-
Goubanova, O.1
Taylor, P.2
-
12
-
-
37549066440
-
-
Yegnanarayana, B. In: Artificial Neural Networks. Prentice-Hall, New Delhi, India (1999)
-
Yegnanarayana, B. In: Artificial Neural Networks. Prentice-Hall, New Delhi, India (1999)
-
-
-
-
13
-
-
0025387541
-
Analog i/o nets for syllable timing
-
Campbell, W.N.: Analog i/o nets for syllable timing. Speech Communication 9 (1990) 57-61
-
(1990)
Speech Communication
, vol.9
, pp. 57-61
-
-
Campbell, W.N.1
-
14
-
-
0001717383
-
Syllable based segment duration
-
Bailly, G., Benoit, C., Sawallis, T.R., eds., Elsevier
-
Campbell, W.N.: Syllable based segment duration. In: Bailly, G., Benoit, C., Sawallis, T.R., eds.: Talking Machines: Theories, Models and Designs. Elsevier (1992) 211-224
-
(1992)
Talking Machines: Theories, Models and Designs
, pp. 211-224
-
-
Campbell, W.N.1
-
15
-
-
85027104127
-
Predicting segmental durations for accommodation within a syllable-level timing framework
-
Berlin, Germany
-
Campbell, W.N.: Predicting segmental durations for accommodation within a syllable-level timing framework. In: Proceedings of the European Conference on Speech Communication and Technology. Volume 2, Berlin, Germany (1993) 1081-1084
-
(1993)
Proceedings of the European Conference on Speech Communication and Technology
, vol.2
, pp. 1081-1084
-
-
Campbell, W.N.1
-
16
-
-
0028531866
-
Characterization of rhythmic patterns for text-to-speech synthesis
-
Barbosa, P.A., Bailly, G.: Characterization of rhythmic patterns for text-to-speech synthesis. Speech Communication 15 (1994) 127-137
-
(1994)
Speech Communication
, vol.15
, pp. 127-137
-
-
Barbosa, P.A.1
Bailly, G.2
-
18
-
-
0003830071
-
Automatic modeling of duration in a Spanish text-to-speech system using neural networks
-
Budapest, Hungary
-
Cordoba, R., Vallejo, J.A., Montero, J.M., Gutierrezarriola, J., Lopez, M.A., Pardo, J.M.: Automatic modeling of duration in a Spanish text-to-speech system using neural networks. In: Proceedings of the European Conference on Speech Communication and Technology, Budapest, Hungary (1999)
-
(1999)
Proceedings of the European Conference on Speech Communication and Technology
-
-
Cordoba, R.1
Vallejo, J.A.2
Montero, J.M.3
Gutierrezarriola, J.4
Lopez, M.A.5
Pardo, J.M.6
-
19
-
-
77952314450
-
Duration modeling of Arabic text-to-speech synthesis
-
Denver, Colorado, USA
-
Hifny, Y., Rashwan, M.: Duration modeling of Arabic text-to-speech synthesis. In: Proceedings of the International Conference on Spoken Language Processing, Denver, Colorado, USA (2002) 1773-1776
-
(2002)
Proceedings of the International Conference on Spoken Language Processing
, pp. 1773-1776
-
-
Hifny, Y.1
Rashwan, M.2
-
20
-
-
0030710662
-
Prosody generation with a neural network: Weighing the importance of input parameters
-
Munich, Germany
-
Sonntag, G.P., Portele, T., Heuft, B.: Prosody generation with a neural network: Weighing the importance of input parameters. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, Munich, Germany (1997) 931-934
-
(1997)
Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 931-934
-
-
Sonntag, G.P.1
Portele, T.2
Heuft, B.3
-
22
-
-
0023407575
-
Review of text-to-speech conversion for English
-
Klatt, D.H.: Review of text-to-speech conversion for English. Journal of the Acoustical Society of America 82(3) (1987) 737-793
-
(1987)
Journal of the Acoustical Society of America
, vol.82
, Issue.3
, pp. 737-793
-
-
Klatt, D.H.1
-
23
-
-
0016471605
-
Fundamental frequency rules for the synthesis of simple declarative English sentences
-
Olive, J.P.: Fundamental frequency rules for the synthesis of simple declarative English sentences. Journal of the Acoustical Society of America (1975) 476-482
-
(1975)
Journal of the Acoustical Society of America
, pp. 476-482
-
-
Olive, J.P.1
-
24
-
-
0022896756
-
Acoustic characteristics and the underlying rules of the intonation of the common Japanese used by radio and TV announcers
-
Fujisaki, H., Hirose, K., Takahashi, N.: Acoustic characteristics and the underlying rules of the intonation of the common Japanese used by radio and TV announcers. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing (1986) 2039-2042
-
(1986)
Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 2039-2042
-
-
Fujisaki, H.1
Hirose, K.2
Takahashi, N.3
-
25
-
-
0034008810
-
Analysis and synthesis of intonation using the Tilt model
-
Taylor, P.A.: Analysis and synthesis of intonation using the Tilt model. Journal of the Acoustical Society of America 107 (2000) 1697-1714
-
(2000)
Journal of the Acoustical Society of America
, vol.107
, pp. 1697-1714
-
-
Taylor, P.A.1
-
26
-
-
37549008444
-
Synthesizing intonation for speech in Hindi
-
Genova, Italy
-
Madhukumar, A.S., Rajendran, S., Sekhar, C.C., Yegnanarayana, B.: Synthesizing intonation for speech in Hindi. In: Proceedings of the Second European Conference on Speech Communication and Technology. Volume 3, Genova, Italy (1991) 1153-1156
-
(1991)
Proceedings of the Second European Conference on Speech Communication and Technology
, vol.3
, pp. 1153-1156
-
-
Madhukumar, A.S.1
Rajendran, S.2
Sekhar, C.C.3
Yegnanarayana, B.4
-
28
-
-
0002944642
-
Dynamic characteristics of voice fundamental frequency in speech and singing
-
MacNeilage, P.F., ed., Springer-Verlag, New York, USA
-
Fujisaki, H.: Dynamic characteristics of voice fundamental frequency in speech and singing. In: MacNeilage, P.F., ed.: The Production of Speech. Springer-Verlag, New York, USA (1983) 39-55
-
(1983)
The Production of Speech
, pp. 39-55
-
-
Fujisaki, H.1
-
29
-
-
0001810979
-
A note on the physiological and physical basis for the phrase and accent components in the voice fundamental frequency contour
-
Fujimura, O., ed., Raven Press, New York, USA
-
Fujisaki, H.: A note on the physiological and physical basis for the phrase and accent components in the voice fundamental frequency contour. In: Fujimura, O., ed.: Vocal Physiology: Voice Production, Mechanisms and Functions. Raven Press, New York, USA (1988) 347-355
-
(1988)
Vocal Physiology: Voice Production, Mechanisms and Functions
, pp. 347-355
-
-
Fujisaki, H.1
-
30
-
-
0003788784
-
-
Cambridge University Press, Cambridge
-
't Hart, J., Collier, R., Cohen, A.: A Perceptual Study of Intonation. Cambridge University Press, Cambridge
-
A Perceptual Study of Intonation
-
-
't Hart, J.1
Collier, R.2
Cohen, A.3
-
31
-
-
84869597582
-
Festival speaks Italian
-
Aalborg, Denmark
-
Cosi, P., Tesser, F., Gretter, R.: Festival speaks Italian. In: Proceedings of EUROSPEECH 2001, Aalborg, Denmark (2001) 509-512
-
(2001)
Proceedings of EUROSPEECH
, pp. 509-512
-
-
Cosi, P.1
Tesser, F.2
Gretter, R.3
-
32
-
-
84945895231
-
Prosodic data driven modeling of a narrative style in Festival TTS
-
Pittsburgh, USA
-
Tesser, F., Cosi, P., Drioli, C., Tisato, G.: Prosodic data driven modeling of a narrative style in Festival TTS. In: Fifth ESCA Speech Synthesis Workshop, Pittsburgh, USA (2004) 185-190
-
(2004)
Fifth ESCA Speech Synthesis Workshop
, pp. 185-190
-
-
Tesser, F.1
Cosi, P.2
Drioli, C.3
Tisato, G.4
-
35
-
-
0024876896
-
Neural network based generation of fundamental frequency contours
-
Glasgow, Scotland
-
Scordilis, M.S., Gowdy, J.N.: Neural network based generation of fundamental frequency contours. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing. Volume 1, Glasgow, Scotland (1989) 219-222
-
(1989)
Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing
, vol.1
, pp. 219-222
-
-
Scordilis, M.S.1
Gowdy, J.N.2
-
37
-
-
85009062747
-
Data driven intonation modeling of 6 languages
-
Beijing, China
-
Buhmann, J., Vereecken, H., Fackrell, J., Martens, J.P., Coile, B.V.: Data driven intonation modeling of 6 languages. In: Proceedings of the International Conference on Spoken Language Processing. Volume 3, Beijing, China (2000) 179-183
-
(2000)
Proceedings of the International Conference on Spoken Language Processing
, vol.3
, pp. 179-183
-
-
Buhmann, J.1
Vereecken, H.2
Fackrell, J.3
Martens, J.P.4
Coile, B.V.5
-
39
-
-
37549034753
-
Syllabic properties of three Indian languages: Implications for speech recognition and language identification
-
Mysore, India
-
Khan, A.N., Gangashetty, S.V., Yegnanarayana, B.: Syllabic properties of three Indian languages: Implications for speech recognition and language identification. In: International Conference on Natural Language Processing, Mysore, India (2003) 125-134
-
(2003)
International Conference on Natural Language Processing
, pp. 125-134
-
-
Khan, A.N.1
Gangashetty, S.V.2
Yegnanarayana, B.3
-
40
-
-
37549017812
-
-
Chopde, A, Itrans Indian language transliteration package version 5.2 source
-
Chopde, A.: (Itrans Indian language transliteration package version 5.2 source) http://www.aczone.com/itrans/.
-
-
-
-
41
-
-
4544369752
-
Extraction of pitch in adverse conditions
-
Montreal, Canada
-
Prasanna, S.R.M., Yegnanarayana, B.: Extraction of pitch in adverse conditions. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, Montreal, Canada (2004)
-
(2004)
Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing
-
-
Prasanna, S.R.M.1
Yegnanarayana, B.2
-
42
-
-
0035121063
-
Statistical prosodic modeling: From corpus design to parameter estimation
-
Bellegarda, J.R., Silverman, K.E.A., Lenzo, K., Anderson, V.: Statistical prosodic modeling: From corpus design to parameter estimation. IEEE Transactions on Speech and Audio Processing 9 (2001) 52-66
-
(2001)
IEEE Transactions on Speech and Audio Processing
, vol.9
, pp. 52-66
-
-
Bellegarda, J.R.1
Silverman, K.E.A.2
Lenzo, K.3
Anderson, V.4
-
44
-
-
0032672117
-
Using a sigmoid transformation for improved modeling of phoneme duration
-
Phoenix, AZ, USA
-
Silverman, K.E.A., Bellegarda, J.R.: Using a sigmoid transformation for improved modeling of phoneme duration. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, Phoenix, AZ, USA (1999) 385-388
-
(1999)
Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing
, pp. 385-388
-
-
Silverman, K.E.A.1
Bellegarda, J.R.2
-
45
-
-
37549047309
-
Phonetic and timing considerations in a Swiss High German TTS system
-
Keller, E., Bailly, G., Monaghan, A., Terken, J., Huckvale, M., eds., Wiley, Chichester
-
Siebenhaar, B., Zellner-Keller, B., Keller, E.: Phonetic and timing considerations in a Swiss High German TTS system. In: Keller, E., Bailly, G., Monaghan, A., Terken, J., Huckvale, M., eds.: Improvements in Speech Synthesis. Wiley, Chichester (2001)
-
(2001)
Improvements in Speech Synthesis
-
-
Siebenhaar, B.1
Zellner-Keller, B.2
Keller, E.3
-
46
-
-
27144489164
-
A tutorial on support vector machines for pattern recognition
-
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2 (1998) 121-167
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, pp. 121-167
-
-
Burges, C.J.C.1
-
47
-
-
0141741010
-
Prosody modeling for automatic speech understanding: An overview of recent research at SRI
-
Red Bank, NJ, USA
-
Shriberg, E., Stolcke, A.: Prosody modeling for automatic speech understanding: An overview of recent research at SRI. In: Prosody in Speech Recognition and Understanding, ISCA Tutorial and Research Workshop (ITRW), Molly Pitcher Inn, Red Bank, NJ, USA (2001)
-
(2001)
Prosody in Speech Recognition and Understanding, ISCA Tutorial and Research Workshop (ITRW), Molly Pitcher Inn
-
-
Shriberg, E.1
Stolcke, A.2
-
48
-
-
37549013057
-
A text-to-speech conversion system for Indian languages based on waveform concatenation model
-
Technical report no. 11, Project VOIS, Department of Computer Science and Engineering, Indian Institute of Technology Madras
-
Srikanth, S., Kumar, S.R.R., Sundar, R., Yegnanarayana, B. In: A text-to-speech conversion system for Indian languages based on waveform concatenation model. Technical report no. 11, Project VOIS, Department of Computer Science and Engineering, Indian Institute of Technology Madras (1989)
-
(1989)
-
-
Srikanth, S.1
Kumar, S.R.R.2
Sundar, R.3
Yegnanarayana, B.4
-
49
-
-
4544252352
-
Prosodic manipulation using instants of significant excitation
-
Baltimore, Maryland, USA
-
Rao, K.S., Yegnanarayana, B.: Prosodic manipulation using instants of significant excitation. In: Proceedings of the IEEE International Conference on Multimedia and Expo, Baltimore, Maryland, USA (2003) 389-392
-
(2003)
Proceedings of the IEEE International Conference on Multimedia and Expo
, pp. 389-392
-
-
Rao, K.S.1
Yegnanarayana, B.2
-
50
-
-
0029375490
-
Determination of instants of significant excitation in speech using group delay function
-
Smits, R., Yegnanarayana, B.: Determination of instants of significant excitation in speech using group delay function. IEEE Transactions on Speech and Audio Processing 3 (1995) 325-333
-
(1995)
IEEE Transactions on Speech and Audio Processing
, vol.3
, pp. 325-333
-
-
Smits, R.1
Yegnanarayana, B.2
-
51
-
-
0000668614
-
Robustness of group-delay-based method for extraction of significant excitation from speech signals
-
Murthy, P.S., Yegnanarayana, B.: Robustness of group-delay-based method for extraction of significant excitation from speech signals. IEEE Transactions on Speech and Audio Processing 7 (1999) 609-619
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, pp. 609-619
-
-
Murthy, P.S.1
Yegnanarayana, B.2
-
52
-
-
0003424145
-
-
Macmillan, New York, USA
-
Deller, J.R., Proakis, J.G., Hansen, J.H.L. In: Discrete-Time Processing of Speech Signals. Macmillan, New York, USA (1993)
-
(1993)
Discrete-Time Processing of Speech Signals
-
-
Deller, J.R.1
Proakis, J.G.2
Hansen, J.H.L.3