SCOPUS Information Retrieval Platform

Computer Speech and Language

Volume 21, Issue 2, 2007, Pages 282-295

Modeling durations of syllables using neural networks

(2) Rao, K. Sreenivasa (a); Yegnanarayana, B. (b)

a INDIAN INSTITUTE OF TECHNOLOGY GUWAHATI (India)

b INDIAN INSTITUTE OF TECHNOLOGY MADRAS (India)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; CORRELATION METHODS; FEATURE EXTRACTION; KNOWLEDGE ENGINEERING; NEURAL NETWORKS; TEXT PROCESSING;

AVERAGE PREDICTION ERROR; CONTEXTUAL FEATURES; SYLLABLES; TWO STAGE DURATION MODEL;

NATURAL LANGUAGE PROCESSING SYSTEMS;

EID: 33750713338 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2006.06.003 Document Type: Article

Times cited : (69)

References (37)

1
- 33750717907
- Barbosa, P.A., Bailly, G. 1992. Generating segmental duration by p-centers. In: Proceedings of the Fourth Workshop on Rhythm Perception and Production, Bourges, France, June, pp. 163-168.

2
- 0028531866
- Characterization of rhythmic patterns for text-to-speech synthesis
- Barbosa P.A., and Bailly G. Characterization of rhythmic patterns for text-to-speech synthesis. Speech Communication 15 (1994) 127-137
- (1994) Speech Communication , vol.15 , pp. 127-137
- Barbosa, P.A.¹ Bailly, G.²

3
- 0023404428
- A model of segmental duration for speech synthesis in French
- Bartkova K., and Sorin C. A model of segmental duration for speech synthesis in French. Speech Communication 6 (1987) 245-260
- (1987) Speech Communication , vol.6 , pp. 245-260
- Bartkova, K.¹ Sorin, C.²

4
- 0035121063
- Statistical prosodic modeling: From corpus design to parameter estimation
- Bellegarda J.R., Silverman K.E.A., Lenzo K., and Anderson V. Statistical prosodic modeling: From corpus design to parameter estimation. IEEE Transactions on Speech and Audio Processing 9 Jan (2001) 52-66
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.Jan , pp. 52-66
- Bellegarda, J.R.¹ Silverman, K.E.A.² Lenzo, K.³ Anderson, V.⁴

5
- 33750712116
- Black, A.W., Taylor, P., Caley, R., 2000. The festival speech synthesis system: System documentation. The Centre for Speech Technology Research (CSTR), University of Edinburgh, 1.4.0 edition. Available from: http://www.cstr.ed.ac.uk/projects/festival/manual/festival_toc.html.

6
- 27144489164
- A tutorial on support vector machines for pattern recognition
- Burges C.J.C. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2 2 (1998) 121-167
- (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-167
- Burges, C.J.C.¹

7
- 0025387541
- Analog i/o nets for syllable timing
- Campbell W.N. Analog i/o nets for syllable timing. Speech Communication 9 February (1990) 57-61
- (1990) Speech Communication , vol.9 , Issue.February , pp. 57-61
- Campbell, W.N.¹

8
- 0001717383
- Syllable based segment duration
- Bailly G., Benoit C., and Sawallis T.R. (Eds), Elsevier, Amsterdam
- Campbell W.N. Syllable based segment duration. In: Bailly G., Benoit C., and Sawallis T.R. (Eds). Talking Machines: Theories, Models and Designs (1992), Elsevier, Amsterdam 211-224
- (1992) Talking Machines: Theories, Models and Designs , pp. 211-224
- Campbell, W.N.¹

9
- 33750709760
- Campbell, W.N., 1993. Predicting segmental durations for accommodation within a syllable-level timing framework. In: Proceedings of the European Conference on Speech Communication and Technology, vol. 2, Berlin, Germany, September, pp. 1081-1084.

10
- 84930566044
- Segment durations in a syllable frame
- Campbell W.N., and Isard S.D. Segment durations in a syllable frame. Journal of Phonetics: Special issue on speech synthesis 19 (1991) 37-47
- (1991) Journal of Phonetics: Special issue on speech synthesis , vol.19 , pp. 37-47
- Campbell, W.N.¹ Isard, S.D.²

11
- 0043095316
- A new duration modeling approach for Mandarin speech
- Chen S.H., Lai W.H., and Wang Y.R. A new duration modeling approach for Mandarin speech. IEEE Transactions on Speech and Audio Processing 11 July (2003) 308-320
- (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , Issue.July , pp. 308-320
- Chen, S.H.¹ Lai, W.H.² Wang, Y.R.³

12
- 33750721752
- Chopde, A. Itrans Indian language transliteration package version 5.2 source. Available from: http://www.aczone.con/itrans/.

13
- 33750711787
- Chung, H., 2002a. Duration models and the perceptual evaluation of spoken Korean. In: Proceedings of Speech Prosody, Aix-en-Provence, France, pp. 219-222.

14
- 33750687350
- Perceptual evaluation of duration models in spoken Korean
- Chung H. Perceptual evaluation of duration models in spoken Korean. The Korean Journal of Speech Sciences 9 (2002) 207-215
- (2002) The Korean Journal of Speech Sciences , vol.9 , pp. 207-215
- Chung, H.¹

15
- 33750695022
- Cordoba, R., Vallejo, J.A., Montero, J.M., Gutierrezarriola, J., Lopez, M.A., Pardo, J.M. 1999. Automatic modeling of duration in a Spanish text-to-speech system using neural networks. In: Proceedings of the European Conference on Speech Communication and Technology, September, Budapest, Hungary.

16
- 85009107944
- Goubanova, O., Taylor, P. 2000. Using Bayesian belief networks for modeling duration in text-to-speech systems. In: Proceedings of the International Conference on Spoken Language Processing, vol. 2, Beijing, China, October 2000, pp. 427-431.

17
- 0003413187
- Pearson Education Asia, Inc., New Delhi, India
- Haykin S. Neural Networks: A Comprehensive Foundation (1999), Pearson Education Asia, Inc., New Delhi, India
- (1999) Neural Networks: A Comprehensive Foundation
- Haykin, S.¹

18
- 77952314450
- Hifny, Y, Rashwan, M. 2002. Duration modeling of Arabic text-to-speech synthesis. In: Proceedings of the International Conference on Spoken Language Processing, Denver, CO, USA, September, pp. 1773-1776.

19
- 0004056285
- Prentice-Hall, Inc., New York, NJ, USA
- Huang X., Acero A., and Hon H.W. Spoken Language Processing (2001), Prentice-Hall, Inc., New York, NJ, USA
- (2001) Spoken Language Processing
- Huang, X.¹ Acero, A.² Hon, H.W.³

20
- 33750744519
- Khan, A.N., Gangashetty, S.V., Yegnanarayana, B., 2003. Syllabic properties of three Indian languages: Implications for speech recognition and language identification. In: International Conference on Natural Language Processing, Mysore, India, December, pp. 125-134.

21
- 0016952322
- Linguistic uses of segmental duration in English: Acoustic and perceptual evidence
- Klatt D.H. Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. Journal of the Acoustical Society of America 59 (1976) 1209-1221
- (1976) Journal of the Acoustical Society of America , vol.59 , pp. 1209-1221
- Klatt, D.H.¹

22
- 0343410895
- Zeitstrukturierung in der Sprachsynthese
- Kohler K.J. Zeitstrukturierung in der Sprachsynthese. ITG-Tagung Digitale Sprachverarbeitung 6 (1988) 165-170
- (1988) ITG-Tagung Digitale Sprachverarbeitung , Issue.6 , pp. 165-170
- Kohler, K.J.¹

23
- 33750700891
- Krishna, N.S., Murthy, H.A., 2004. Duration modeling of Indian languages Hindi and Telugu. In: 5th ISCA Speech Synthesis Workshop, Pittsburgh, USA, May, pp. 197-202.

24
- 33750698008
- Kumar, K.K., 2002. Duration and intonation knowledge for text-to-speech conversion system for Telugu and Hindi, Master's thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, India, May.

25
- 33750692405
- Mixdorff, H., 2002. An integrated approach to modeling German prosody. PhD thesis, Technical University, Dresden, Germany, July.

26
- 85009154226
- Mixdorff, H., Jokisch, O. 2001. Building an integrated prosodic model of German. In: Proceedings of the European Conference on Speech Communication and Technology, vol. 2, Aalborg, Denmark, September, pp. 947-950.

27
- 0002069313
- Tree-based modeling of segmental durations
- Riley M. Tree-based modeling of segmental durations. Talking Machines: Theories, Models and Designs (1992) 265-273
- (1992) Talking Machines: Theories, Models and Designs , pp. 265-273
- Riley, M.¹

28
- 0028405296
- Assignment of segment duration in text-to-speech synthesis
- Santen J.P.H.V. Assignment of segment duration in text-to-speech synthesis. Computer Speech and Language 8 April (1994) 95-128
- (1994) Computer Speech and Language , vol.8 , Issue.April , pp. 95-128
- Santen, J.P.H.V.¹

29
- 33750740461
- Sayli, O., 2002. Duration analysis and modeling for Turkish text-to-speech synthesis, Master's thesis, Department of Electrical and Electronics Engineering, Bogazici University, 2002.

30
- 0032672117
- Silverman, K.E.A., Bellegarda, J.R. 1999. Using a sigmoid transformation for improved modeling of phoneme duration. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix, AZ, USA, March 1999, pp. 385-388.

31
- 85009288419
- Smith, C.L., 2002. Modeling durational variability in reading aloud a connected text. In: Proceedings of the International Conference on Spoken Language Processing, Denver, CO, USA, September, pp. 1769-1772.

32
- 0030710662
- Sonntag, G.P., Portele, T., Heuft, B. 1997. Prosody generation with a neural network: Weighing the importance of input parameters. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, April, pp. 931-934.

33
- 0026953356
- Feedback stabilization using two hidden layer nets
- Sontag E.D. Feedback stabilization using two hidden layer nets. IEEE Transactions on Neural Networks 3 November (1992) 981-990
- (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.November , pp. 981-990
- Sontag, E.D.¹

34
- 0004161686
- Sproat R. (Ed), Kluwer Academic Publishers, Dordrecht, The Netherlands
- In: Sproat R. (Ed). Multilingual Text-to-Speech Synthesis: The Bell Labs Approach (1998), Kluwer Academic Publishers, Dordrecht, The Netherlands
- (1998) Multilingual Text-to-Speech Synthesis: The Bell Labs Approach

35
- 85009231337
- Teixeira, J.P., Freitas, D. 2003. Segmental durations predicted with a neural network. In: Proceedings of the European Conference on Speech Communication and Technology, Geneva, Switzerland, September, pp. 169-172.

36
- 0004312284
- Prentice-Hall, New Delhi, India
- Yegnanarayana B. Artificial Neural Networks (1999), Prentice-Hall, New Delhi, India
- (1999) Artificial Neural Networks
- Yegnanarayana, B.¹

37
- 33750730686
- Yegnanarayana, B., Murthy, H.A., Sundar, R., Ramachandran, V.R., Kumar, A.S.M., Alwar, N., Rajendran, S., 1990. Development of text-to-speech system for Indian languages. In: Proceedings of the International Conference on Knowledge Based Computer Systems, Pune, India, December, pp. 467-476.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.