메뉴 건너뛰기




Volumn 21, Issue 2, 2007, Pages 282-295

Modeling durations of syllables using neural networks

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; CORRELATION METHODS; FEATURE EXTRACTION; KNOWLEDGE ENGINEERING; NEURAL NETWORKS; TEXT PROCESSING;

EID: 33750713338     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2006.06.003     Document Type: Article
Times cited : (69)

References (37)
  • 1
    • 33750717907 scopus 로고    scopus 로고
    • Barbosa, P.A., Bailly, G. 1992. Generating segmental duration by p-centers. In: Proceedings of the Fourth Workshop on Rhythm Perception and Production, Bourges, France, June, pp. 163-168.
  • 2
    • 0028531866 scopus 로고
    • Characterization of rhythmic patterns for text-to-speech synthesis
    • Barbosa P.A., and Bailly G. Characterization of rhythmic patterns for text-to-speech synthesis. Speech Communication 15 (1994) 127-137
    • (1994) Speech Communication , vol.15 , pp. 127-137
    • Barbosa, P.A.1    Bailly, G.2
  • 3
    • 0023404428 scopus 로고
    • A model of segmental duration for speech synthesis in French
    • Bartkova K., and Sorin C. A model of segmental duration for speech synthesis in French. Speech Communication 6 (1987) 245-260
    • (1987) Speech Communication , Issue.6 , pp. 245-260
    • Bartkova, K.1    Sorin, C.2
  • 5
    • 33750712116 scopus 로고    scopus 로고
    • Black, A.W., Taylor, P., Caley, R., 2000. The festival speech synthesis system: System documentation. The Centre for Speech Technology Research (CSTR), University of Edinburgh, 1.4.0 edition. Available from: http://www.cstr.ed.ac.uk/projects/festival/manual/festival_toc.html.
  • 6
    • 27144489164 scopus 로고    scopus 로고
    • A tutorial on support vector machines for pattern recognition
    • Burges C.J.C. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2 2 (1998) 121-167
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-167
    • Burges, C.J.C.1
  • 7
    • 0025387541 scopus 로고
    • Analog i/o nets for syllable timing
    • Campbell W.N. Analog i/o nets for syllable timing. Speech Communication 9 February (1990) 57-61
    • (1990) Speech Communication , vol.9 , Issue.February , pp. 57-61
    • Campbell, W.N.1
  • 8
    • 0001717383 scopus 로고
    • Syllable based segment duration
    • Bailly G., Benoit C., and Sawallis T.R. (Eds), Elsevier, Amsterdam
    • Campbell W.N. Syllable based segment duration. In: Bailly G., Benoit C., and Sawallis T.R. (Eds). Talking Machines: Theories, Models and Designs (1992), Elsevier, Amsterdam 211-224
    • (1992) Talking Machines: Theories, Models and Designs , pp. 211-224
    • Campbell, W.N.1
  • 9
    • 33750709760 scopus 로고    scopus 로고
    • Campbell, W.N., 1993. Predicting segmental durations for accommodation within a syllable-level timing framework. In: Proceedings of the European Conference Speech Communication and Technology, vol. 2, Berlin, Germany, September, pp. 1081-1084.
  • 12
    • 33750721752 scopus 로고    scopus 로고
    • Chopde, A. Itrans Indian language transliteration package version 5.2 source. Available from: http://www.aczone.con/itrans/.
  • 13
    • 33750711787 scopus 로고    scopus 로고
    • Chung, H., 2002a. Duration models and the perceptual evaluation of spoken Korean. In: Proceedings of Speech Prosody, Aix-en-Provence, France, pp. 219-222.
  • 14
    • 33750687350 scopus 로고    scopus 로고
    • Perceptual evaluation of duration models in spoken Korean
    • Chung H. Perceptual evaluation of duration models in spoken Korean. The Korean Journal of Speech Sciences 9 (2002) 207-215
    • (2002) The Korean Journal of Speech Sciences , vol.9 , pp. 207-215
    • Chung, H.1
  • 15
    • 33750695022 scopus 로고    scopus 로고
    • Cordoba, R., Vallejo, J.A., Montero, J.M., Gutierrezarriola, J., Lopez, M.A., Pardo, J.M. 1999. Automatic modeling of duration in a Spanish text-to-speech system using neural networks. In: Proceedings of the European Conference on Speech Communication and Technology, September, Budapest, Hungary.
  • 16
    • 85009107944 scopus 로고    scopus 로고
    • Goubanova, O, Taylor, P. 2000. Using bayesian belief networks for modeling duration in text-to-speech systems. In: Proceedings of the International Conference on Spoken Language Processing, vol. 2, Beijing, China, October 2000, pp. 427-431.
  • 18
    • 77952314450 scopus 로고    scopus 로고
    • Hifny, Y, Rashwan, M. 2002. Duration modeling of Arabic text-to-speech synthesis. In: Proceedings of the International Conference on Spoken Language Processing, Denver, CO, USA, September, pp. 1773-1776.
  • 20
    • 33750744519 scopus 로고    scopus 로고
    • Khan, A.N., Gangashetty, S.V., Yegnanarayana, B., 2003. Syllabic properties of three Indian languages: Implications for speech recognition and language identification. In: International Conference on Natural Language Processing, Mysore, India, December, pp. 125-134.
  • 21
    • 0016952322 scopus 로고
    • Linguistic uses of segmental duration in English: Acoustic and perceptual evidence
    • Klatt D.H. Linguistic uses of segmental duration in English: Acoustic and perceptual evidence. Journal of Acoustic Society of America 59 (1976) 1209-1221
    • (1976) Journal of Acoustic Society of America , vol.59 , pp. 1209-1221
    • Klatt, D.H.1
  • 23
    • 33750700891 scopus 로고    scopus 로고
    • Krishna, N.S., Murthy, H.A., 2004. Duration modeling of Indian languages Hindi and Telugu. In: 5th ISCA Speech Synthesis Workshop, Pittsburgh, USA, May, pp. 197-202.
  • 24
    • 33750698008 scopus 로고    scopus 로고
    • Kumar, K.K., 2002. Duration and intonation knowledge for text-to-speech conversion system for Telugu and Hindi, Master's thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, India, May.
  • 25
    • 33750692405 scopus 로고    scopus 로고
    • Mixdorff, H., 2002. An integrated approach to modeling German prosody. PhD thesis, Technical University, Dresden, Germany, July.
  • 26
    • 85009154226 scopus 로고    scopus 로고
    • Mixdorff, H., Jokisch, O. 2001. Building an integrated prosodic model of German. In: Proceedings of the European Conference on Speech Communication and Technology, vol. 2, Aalborg, Denmark, September, pp. 947-950.
  • 28
    • 0028405296 scopus 로고
    • Assignment of segment duration in text-to-speech synthesis
    • Santen J.P.H.V. Assignment of segment duration in text-to-speech synthesis. Computer Speech and Language 8 April (1994) 95-128
    • (1994) Computer Speech and Language , vol.8 , Issue.April , pp. 95-128
    • Santen, J.P.H.V.1
  • 29
    • 33750740461 scopus 로고    scopus 로고
    • Sayli, O, 2002. Duration analysis and modeling for Turkish text-to-speech synthesis, Master's thesis, Department of Electrical and Electronics Engineering, Bogaziei University, 2002.
  • 30
    • 0032672117 scopus 로고    scopus 로고
    • Silverman, K.E.A., Bellegarda, J.R. 1999. Using a sigmoid transformation for improved modeling of phoneme duration. In: Proceedings of the IEEE International Conference on Acoustic Speech, Signal Processing, Phoenix, AZ, USA, March 1999, pp. 385-388.
  • 31
    • 85009288419 scopus 로고    scopus 로고
    • Smith, C.L., 2002. Modeling durational variability in reading aloud a connected text. In: Proceedings of the International Conference on Spoken Language Processing, Denver, CO, USA, September, pp. 1769-1772.
  • 32
    • 0030710662 scopus 로고    scopus 로고
    • Sonntag, G.P., Portele, T., Heuft, B. 1997. Prosody generation with a neural network: Weighing the importance of input parameters. In: Proceedings of the IEEE International Conference on Acoustic, Speech, Signal Processing, Munich, Germany, April, pp. 931-934.
  • 33
    • 0026953356 scopus 로고
    • Feedback stabilization using two hidden layer nets
    • Sontag E.D. Feedback stabilization using two hidden layer nets. IEEE Transactions on Neural Networks 3 November (1992) 981-990
    • (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.November , pp. 981-990
    • Sontag, E.D.1
  • 35
    • 85009231337 scopus 로고    scopus 로고
    • Teixeira, J.P., Freitas, D. 2003. Segmental durations predicted with a neural network. In: Proceedings of the European Conference on Speech Communication and Technology, Geneva, Switzerland, September, pp. 169-172.
  • 37
    • 33750730686 scopus 로고    scopus 로고
    • Yegnanarayana, B., Murthy, H.A., Sundar, R., Ramachandran, V.R., Kumar, A.S.M., Alwar, N., Rajendran, S., 1990. Development of text-to-speech system for Indian languages. In: Proceedings of the International Conference on Knowledge Based Computer Systems, Pune, India, December, pp. 467-476.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.