SCOPUS 정보 검색 플랫폼

Studies in Computational Intelligence

Volumn 83, Issue , 2008, Pages 71-95

Modeling supra-segmental features of syllables using neural networks

(1) Rao, K Sreenivasa a

a INDIAN INSTITUTE OF TECHNOLOGY GUWAHATI (India)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 37549007588 PISSN: 1860949X EISSN: None Source Type: Book Series
DOI: 10.1007/978-3-540-75398-8_4 Document Type: Article

Times cited : (3)

References (52)

1
- 0004056285
- Prentice-Hall, New York, NJ, USA
- Huang, X., Acero, A., Hon, H.W.: In: Spoken Language Proceesing. Prentice-Hall, New York, NJ, USA (2001)
- (2001) Spoken Language Proceesing
- Huang, X.¹ Acero, A.² Hon, H.W.³

2
- 0016952322
- Linguistic uses of segmental duration in English: Acoustic and perceptual evidence
- Klatt, D.H.: Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. Journal of Acoustic Society of America 59 (1976) 1209-1221
- (1976) Journal of Acoustic Society of America , vol.59 , pp. 1209-1221
- Klatt, D.H.¹

3
- 4544325422
- Development of text-to-speech system for Indian languages
- Pune, India
- Yegnanarayana, B., Murthy, H.A., Sundar, R., Ramachandran, V.R., Kumar, A.S.M., Alwar, N., Rajendran, S.: Development of text-to-speech system for Indian languages. In: Proceedings of the International Conference on Knowledge Based Computer Systems, Pune, India (1990) 467-476
- (1990) Proceedings of the International Conference on Knowledge Based Computer Systems , pp. 467-476
- Yegnanarayana, B.¹ Murthy, H.A.² Sundar, R.³ Ramachandran, V.R.⁴ Kumar, A.S.M.⁵ Alwar, N.⁶ Rajendran, S.⁷

4
- 0043095316
- A new duration modeling approach for Mandarin speech
- Chen, S.H., Lai, W.H., Wang, Y.R.: A new duration modeling approach for Mandarin speech. IEEE Transactions on Speech and Audio Processing 11 (2003) 308-320
- (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , pp. 308-320
- Chen, S.H.¹ Lai, W.H.² Wang, Y.R.³

5
- 85009154226
- Building an integrated prosodic model of German
- Aalborg, Denmark
- Mixdorff., H., Jokisch, O.: Building an integrated prosodic model of German. In: Proceeding of the European Conference on Speech Communication and Technology. Volume 2, Aalborg, Denmark (2001) 947-950
- (2001) Proceeding of the European Conference on Speech Communication and Technology , vol.2 , pp. 947-950
- Mixdorff, H.¹ Jokisch, O.²

6
- 33745210960
- PhD Thesis, Technical University, Dresden, Germany
- Mixdorff., H.: An integrated approach to modeling German prosody. PhD Thesis, Technical University, Dresden, Germany (2002)
- (2002) An integrated approach to modeling German prosody
- Mixdorff, H.¹

7
- 0028405296
- Assignment of segment duration in text-to-speech synthesis
- Santen, J.P.H.V.: Assignment of segment duration in text-to-speech synthesis. Computer Speech and Language 8 (1994) 95-128
- (1994) Computer Speech and Language , vol.8 , pp. 95-128
- Santen, J.P.H.V.¹

8
- 85009107944
- Using Bayesian belief networks for modeling duration in text-to-speech systems
- Beijing, China
- Goubanova, O., Taylor, P.: Using Bayesian belief networks for modeling duration in text-to-speech systems. In: Proceedings of the International Conference on Spoken Language Processing. Volume 2, Beijing, China (2000) 427-431
- (2000) Proceedings of the International Conference on Spoken Language Processing , vol.2 , pp. 427-431
- Goubanova, O.¹ Taylor, P.²

9
- 37549025522
- Master's Thesis, Department of Electrical and Electronics Engineering, Bogaziei University
- Sayli, O.: Duration analysis and modeling for Turkish text-to-speech synthesis. Master's Thesis, Department of Electrical and Electronics Engineering, Bogaziei University (2002)
- (2002) Duration analysis and modeling for Turkish text-to-speech synthesis
- Sayli, O.¹

10
- 0002069313
- Tree-based modeling of segmental durations
- Riley, M.: Tree-based modeling of segmental durations. Talking Machines: Theories, Models and Designs (1992) 265-273
- (1992) Talking Machines: Theories, Models and Designs , pp. 265-273
- Riley, M.¹

11
- 0003413187
- Pearson Education Asia, Inc, New Delhi, India
- Haykin, S. In: Neural Networks: A Comprehensive Foundation. Pearson Education Asia, Inc., New Delhi, India (1999)
- (1999) Neural Networks: A Comprehensive Foundation
- Haykin, S.¹

12
- 37549066440
- Yegnanarayana, B. In: Arti.cial Neural Networks. Prentice-Hall, New Delhi, India (1999)
- Yegnanarayana, B. In: Arti.cial Neural Networks. Prentice-Hall, New Delhi, India (1999)

13
- 0025387541
- Analog i/o nets for syllable timing
- Campbell, W.N.: Analog i/o nets for syllable timing. Speech Communication 9 (1990) 57-61
- (1990) Speech Communication , vol.9 , pp. 57-61
- Campbell, W.N.¹

14
- 0001717383
- Syllable based segment duration
- Bailly, G, Benoit, C, Sawallis, T.R, eds, Elsevier
- Campbell, W.N.: Syllable based segment duration. In: Bailly, G., Benoit, C., Sawallis, T.R., eds.: Talking Machines: Theories, Models and Designs. Elsevier (1992) 211-224
- (1992) Talking Machines: Theories, Models and Designs , pp. 211-224
- Campbell, W.N.¹

15
- 85027104127
- Predicting segmental durations for accommodation within a syllable-level timing framework
- Berlin, Germany
- Campbell, W.N.: Predicting segmental durations for accommodation within a syllable-level timing framework. In: Proceedings of the European Conference on Speech Communication and Technology. Volume 2, Berlin, Germany (1993) 1081-1084
- (1993) Proceedings of the European Conference on Speech Communication and Technology , vol.2 , pp. 1081-1084
- Campbell, W.N.¹

16
- 0028531866
- Characterization of rhythmic patterns for text-to-speech synthesis
- Barbosa, P.A., Bailly, G.: Characterization of rhythmic patterns for text-to-speech synthesis. Speech Communication 15 (1994) 127-137
- (1994) Speech Communication , vol.15 , pp. 127-137
- Barbosa, P.A.¹ Bailly, G.²

17
- 37549028551
- Generating segmental duration by P-centers
- Bourges, France
- Barbosa, P.A., Bailly, G.: Generating segmental duration by P-centers. In: Proceedings of the Fourth Workshop on Rhythm Perception and Production, Bourges, France (1992) 163-168
- (1992) Proceedings of the Fourth Workshop on Rhythm Perception and Production , pp. 163-168
- Barbosa, P.A.¹ Bailly, G.²

18
- 0003830071
- Automatic modeling of duration in a Spanish text-to-speech system using neural networks
- Budapest, Hungary
- Cordoba, R., Vallejo, J.A., Montero, J.M., Gutierrezarriola, J., Lopez, M.A., Pardo, J.M.: Automatic modeling of duration in a Spanish text-to-speech system using neural networks. In: Proceedings of the European Conference on Speech Communication and Technology, Budapest, Hungary (1999)
- (1999) Proceedings of the European Conference on Speech Communication and Technology
- Cordoba, R.¹ Vallejo, J.A.² Montero, J.M.³ Gutierrezarriola, J.⁴ Lopez, M.A.⁵ Pardo, J.M.⁶

19
- 77952314450
- Duration modeling of Arabic text-to-speech synthesis
- Denver, Colorado, USA
- Hifny, Y., Rashwan, M.: Duration modeling of Arabic text-to-speech synthesis. In: Proceedings of the International Conference on Spoken Language Processing, Denver, Colorado, USA (2002) 1773-1776
- (2002) Proceedings of the International Conference on Spoken Language Processing , pp. 1773-1776
- Hifny, Y.¹ Rashwan, M.²

20
- 0030710662
- Prosody generation with a neural network: Weighing the importance of input parameters
- Munich, Germany
- Sonntag, G.P., Portele, T., Heuft, B.: Prosody generation with a neural network: Weighing the importance of input parameters. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, Munich, Germany (1997) 931-934
- (1997) Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 931-934
- Sonntag, G.P.¹ Portele, T.² Heuft, B.³

21
- 85009231337
- Segmental durations predicted with a neural network
- Geneva, Switzerland
- Teixeira, J.P., Freitas, D.: Segmental durations predicted with a neural network. In: Proceedings of the European Conference on Speech Communication and Technology, Geneva, Switzerland (2003) 169-172
- (2003) Proceedings of the European Conference on Speech Communication and Technology , pp. 169-172
- Teixeira, J.P.¹ Freitas, D.²

22
- 0023407575
- Review of text-to-speech conversion for English
- Klatt, D.H.: Review of text-to-speech conversion for English. Journal of Acoustic Society of America 82(3) (1987) 737-793
- (1987) Journal of Acoustic Society of America , vol.82 , Issue.3 , pp. 737-793
- Klatt, D.H.¹

23
- 0016471605
- Fundamental frequency rules for the synthesis of simple declarative English sentences
- Olive, J.P.: Fundamental frequency rules for the synthesis of simple declarative English sentences. Journal of Acoustic Society of America (1975) 476-482
- (1975) Journal of Acoustic Society of America , pp. 476-482
- Olive, J.P.¹

24
- 0022896756
- Acoustic characteristics and the underlying rules of the intonation of the common Japanese used by radio and TV anouncers
- Fujisaki, H., Hirose, K., Takahashi, N.: Acoustic characteristics and the underlying rules of the intonation of the common Japanese used by radio and TV anouncers. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing (1986) 2039-2042
- (1986) Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 2039-2042
- Fujisaki, H.¹ Hirose, K.² Takahashi, N.³

25
- 0034008810
- Analysis and synthesis of intonation using the Tilt model
- Taylor, P.A.: Analysis and synthesis of intonation using the Tilt model. Journal of Acoustic Society of America 107 (2000) 1697-1714
- (2000) Journal of Acoustic Society of America , vol.107 , pp. 1697-1714
- Taylor, P.A.¹

26
- 37549008444
- Synthesizing intonation for speech in Hindi
- Geneva, Italy
- Madhukumar, A.S., Rajendran, S., Sekhar, C.C., Yegnanarayana, B.: Synthesizing intonation for speech in Hindi. In: Proceedings of the Second European Conference on Speech Communication and Technology. Volume 3, Geneva, Italy (1991) 1153-1156
- (1991) Proceedings of the Second European Conference on Speech Communication and Technology , vol.3 , pp. 1153-1156
- Madhukumar, A.S.¹ Rajendran, S.² Sekhar, C.C.³ Yegnanarayana, B.⁴

27
- 0003754220
- PhD Thesis, MIT, MA, USA
- Pierrehumbert, J.B.: The Phonology and Phonetics of English Intonation. PhD Thesis, MIT, MA, USA (1980)
- (1980) The Phonology and Phonetics of English Intonation
- Pierrehumbert, J.B.¹

28
- 0002944642
- Dynamic characteristics of voice fundamental frequency in speech and singing
- MacNeilage, P.F, ed, Springer-Verlag, New York, USA
- Fujisaki, H.: Dynamic characteristics of voice fundamental frequency in speech and singing. In: MacNeilage, P.F., ed.: The Production of Speech. Springer-Verlag, New York, USA (1983) 39-55
- (1983) The Production of Speech , pp. 39-55
- Fujisaki, H.¹

29
- 0001810979
- A note on the physiological and physical basis for the phrase and accent components in the voice fundamental frequency contour
- Fujimura, O, ed, Raven Press, New York, USA
- Fujisaki, H.: A note on the physiological and physical basis for the phrase and accent components in the voice fundamental frequency contour. In: Fujimura, O., ed.: Vocal Physiology: Voice Production, Mechanisms and Functions. Raven Press, New York, USA (1988) 347-355
- (1988) Vocal Physiology: Voice Production, Mechanisms and Functions , pp. 347-355
- Fujisaki, H.¹

30
- 0003788784
- Cambridge University Press, Cambridge
- t'Hart, J., Collier, R., Cohen, A.: A Perceptual Study of Intonation. Cambridge University Press, Cambridge
- A Perceptual Study of Intonation
- t'Hart, J.¹ Collier, R.² Cohen, A.³

31
- 84869597582
- Festival speaks Italian
- Aalborg, Denmark
- Cosi, P., Tesser, F., Gretter, R.: Festival speaks Italian. In: Proceedings of EUROSPEECH 2001, Aalborg, Denmark (2001) 509-512
- (2001) Proceedings of EUROSPEECH , pp. 509-512
- Cosi, P.¹ Tesser, F.² Gretter, R.³

32
- 84945895231
- Prosodic data driven modeling of a narrative style in Festival TTS
- Pittsburgh, USA
- Tesser, F., Cosi, P., Drioli, C., Tisato, G.: Prosodic data driven modeling of a narrative style in Festival TTS. In: Fifth ESCA Speech Synthesis Workshop, Pittsburgh, USA (2004) 185-190
- (2004) Fifth ESCA Speech Synthesis Workshop , pp. 185-190
- Tesser, F.¹ Cosi, P.² Drioli, C.³ Tisato, G.⁴

33
- 4544268744
- Modeling the microprosody of pitch and loudness for speech synthesis with neural networks
- Sidney, Australia
- Vainio, M., Altosaar, T.: Modeling the microprosody of pitch and loudness for speech synthesis with neural networks. In: Proceedings of the International Conference on Spoken Language Processing, Sidney, Australia (1998)
- (1998) Proceedings of the International Conference on Spoken Language Processing
- Vainio, M.¹ Altosaar, T.²

34
- 37549053769
- Master's Thesis, Department of Linguistics, University of Edinburgh
- Vegnaduzzo, M.: Modeling intonation for the Italian festival TTS using linear regression. Master's Thesis, Department of Linguistics, University of Edinburgh (2003)
- (2003) Modeling intonation for the Italian festival TTS using linear regression
- Vegnaduzzo, M.¹

35
- 0024876896
- Neural network based generation of fundamental frequency contours
- Glasgow, Scotland
- Scordilis, M.S., Gowdy, J.N.: Neural network based generation of fundamental frequency contours. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing. Volume 1, Glasgow, Scotland (1989) 219-222
- (1989) Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing , vol.1 , pp. 219-222
- Scordilis, M.S.¹ Gowdy, J.N.²

36
- 33646681559
- PhD Thesis, Department of Phonetics, University of Helsinki, Finland
- Vainio, M.: Artificial neural network based prosody models for Finnish text-to-speech synthesis. PhD Thesis, Department of Phonetics, University of Helsinki, Finland (2001)
- (2001) Artificial neural network based prosody models for Finnish text-to-speech synthesis
- Vainio, M.¹

37
- 85009062747
- Data driven intonation modeling of 6 languages
- Beijing, China
- Buhmann, J., Vereecken, H., Fackrell, J., Martens, J.P., Coile, B.V.: Data driven intonation modeling of 6 languages. In: Proceedings of the International Conference on Spoken Language Processing. Volume 3, Beijing, China (2000) 179-183
- (2000) Proceedings of the International Conference on Spoken Language Processing , vol.3 , pp. 179-183
- Buhmann, J.¹ Vereecken, H.² Fackrell, J.³ Martens, J.P.⁴ Coile, B.V.⁵

38
- 0028712434
- Neural-network-based F0 text-to-speech synthesizer for Mandarin
- Hwang, S.H., Chen, S.H.: Neural-network-based F0 text-to-speech synthesizer for Mandarin. IEEE Proceedings on Image Signal Processing 141 (1994) 384-390
- (1994) IEEE Proceedings on Image Signal Processing , vol.141 , pp. 384-390
- Hwang, S.H.¹ Chen, S.H.²

39
- 37549034753
- Syllabic properties of three Indian languages: Implications for speech recognition and language identification
- Mysore, India
- Khan, A.N., Gangashetty, S.V., Yegnanarayana, B.: Syllabic properties of three Indian languages: Implications for speech recognition and language identification. In: International Conference on Natural Language Processing, Mysore, India (2003) 125-134
- (2003) International Conference on Natural Language Processing , pp. 125-134
- Khan, A.N.¹ Gangashetty, S.V.² Yegnanarayana, B.³

40
- 37549017812
- Chopde, A, Itrans Indian language transliteration package version 5.2 source
- Chopde, A.: (Itrans Indian language transliteration package version 5.2 source) http://www.aczone.con/itrans/.

41
- 4544369752
- Extraction of pitch in adverse conditions
- Montreal, Canada
- Prasanna, S.R.M., Yegnanarayana, B.: Extraction of pitch in adverse conditions. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, Montreal, Canada (2004)
- (2004) Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing
- Prasanna, S.R.M.¹ Yegnanarayana, B.²

42
- 0035121063
- Statistical prosodic modeling: From corpus design to parameter estimation
- Bellegarda, J.R., Silverman, K.E.A., Lenzo, K., Anderson, V.: Statistical prosodic modeling: From corpus design to parameter estimation. IEEE Transactions on Speech and Audio Processing 9 (2001) 52-66
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , pp. 52-66
- Bellegarda, J.R.¹ Silverman, K.E.A.² Lenzo, K.³ Anderson, V.⁴

43
- 33744699487
- Improved duration modeling of English phonemes using a root sinusoidal transformation
- Bellegarda, J.R., Silverman, K.E.A.: Improved duration modeling of English phonemes using a root sinusoidal transformation. In: Proceedings of the International Conference on Spoken Language Processing (1998) 21-24
- (1998) Proceedings of the International Conference on Spoken Language Processing , pp. 21-24
- Bellegarda, J.R.¹ Silverman, K.E.A.²

44
- 0032672117
- Using a sigmoid transformation for improved modeling of phoneme duration
- Phoenix, AZ, USA
- Silverman, K.E.A., Bellegarda, J.R.: Using a sigmoid transformation for improved modeling of phoneme duration. In: Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing, Phoenix, AZ, USA (1999) 385-388
- (1999) Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 385-388
- Silverman, K.E.A.¹ Bellegarda, J.R.²

45
- 37549047309
- Phonetic and timing considerations in a Swiss high German TTS system
- Keller, E, Bailly, G, Monaghan, A, Terken, J, Huckvale, M, eds, Wiley, Chichester
- Siebenhaar, B., Zellner-Keller, B., Keller, E.: Phonetic and timing considerations in a Swiss high German TTS system. In: Keller, E., Bailly, G., Monaghan, A., Terken, J., Huckvale, M., eds.: Improvements in Speech Synthesis. Wiley, Chichester (2001)
- (2001) Improvements in Speech Synthesis
- Siebenhaar, B.¹ Zellner-Keller, B.² Keller, E.³

46
- 27144489164
- A tutorial on support vector machines for pattern recognition
- Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2 (1998) 121-167
- (1998) Data Mining and Knowledge Discovery , vol.2 , pp. 121-167
- Burges, C.J.C.¹

47
- 0141741010
- Prosody modeling for automatic speech understanding: An overview of recent research at SRI
- Red Bank, NJ, USA
- Shriberg, Elizabeth, Stolcke, Andreas: Prosody modeling for automatic speech understanding: An overview of recent research at SRI. In: Prosody in Speech Recognition and Understanding, ISCA Tutorial and Research Workshop (ITRW), Molly Pitcher Inn, Red Bank, NJ, USA (2001)
- (2001) Prosody in Speech Recognition and Understanding, ISCA Tutorial and Research Workshop (ITRW), Molly Pitcher Inn
- Shriberg, E.¹ Stolcke, A.²

48
- 37549013057
- A text-to-speech conversion system for Indian languages based on waveform concatenation model
- Technical report no. 11, Project VOIS, Department of Computer Science and Engineering, Indian Institute of Technology Madras
- Srikanth, S., Kumar, S.R.R., Sundar, R., Yegnanarayana, B. In: A text-to-speech conversion system for Indian languages based on waveform concatenation model. Technical report no. 11, Project VOIS, Department of Computer Science and Engineering, Indian Institute of Technology Madras (1989)
- (1989)
- Srikanth, S.¹ Kumar, S.R.R.² Sundar, R.³ Yegnanarayana, B.⁴

49
- 4544252352
- Prosodic manipulation using instants of significant excitation
- Baltimore, Maryland, USA
- Rao, K.S., Yegnanarayana, B.: Prosodic manipulation using instants of significant excitation. In: Proceedings of the IEEE International Conference on Multimedia and Expo, Baltimore, Maryland, USA (2003) 389-392
- (2003) Proceedings of the IEEE International Conference on Multimedia and Expo , pp. 389-392
- Rao, K.S.¹ Yegnanarayana, B.²

50
- 0029375490
- Determination of instants of significant excitation in speech using group delay function
- Smits, R., Yegnanarayana, B.: Determination of instants of significant excitation in speech using group delay function. IEEE Transactions on Speech and Audio Processing 3 (1995) 325-333
- (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , pp. 325-333
- Smits, R.¹ Yegnanarayana, B.²

51
- 0000668614
- Robustness of group-delay-based method for extraction of significant excitation from speech signals
- Murthy, P.S., Yegnanarayana, B.: Robustness of group-delay-based method for extraction of significant excitation from speech signals. IEEE Transactions on Speech and Audio Processing 7 (1999) 609-619
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 609-619
- Murthy, P.S.¹ Yegnanarayana, B.²

52
- 0003424145
- Macmillan, New York, USA
- Deller, J.R., Proakis, J.G., Hansen, J.H.L. In: Discrete-Time Processing of Speech Signals. Macmillan, New York, USA (1993)
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.G.² Hansen, J.H.L.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.