Volume 14, Issue 1, 2011, Pages 19-33

Application of prosody models for developing speech systems in Indian languages

Author keywords

Duration; Feedforward neural network; Intonation; Prosody; Speech systems

Indexed keywords

DURATION; FEED-FORWARD; INTONATION; PROSODY; SPEECH SYSTEMS;

EID: 79953168002     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-010-9086-9     Document Type: Article
Times cited: 23

References (48)
  • 1
    • 84994310262
    • Prosodic models, automatic speech understanding, and speech synthesis: Towards the common ground
    • Batliner, A., Mobius, B., Mohler, G., Schweitzer, A., & Noth, E. (2001). Prosodic models, automatic speech understanding, and speech synthesis: towards the common ground. In Eurospeech, Scandinavia.
    • (2001) Eurospeech Scandinavia
    • Batliner, A.1    Mobius, B.2    Mohler, G.3    Schweitzer, A.4    Noth, E.5
  • 2
    • 67650565075
    • Benesty, J., Sondhi, M. M., & Huang, Y. (Eds.). New York: Springer
    • Benesty, J., Sondhi, M. M., & Huang, Y. (Eds.) (2008). Springer handbook on speech processing. New York: Springer.
    • (2008) Springer Handbook on Speech Processing
  • 12
    • 0028996978
    • A prosodic model for Mandarin speech and its application to pitch level generation for text-to-speech
    • Hwang, S.-H., & Chen, S.-H. (1995). A prosodic model for Mandarin speech and its application to pitch level generation for text-to-speech. In Proc. IEEE int. conf. acoust., speech, signal processing, May 1995 (pp. 616-619).
    • (1995) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing May 1995 , pp. 616-619
    • Hwang, S.-H.1    Chen, S.-H.2
  • 17
    • 0027626742
    • Intonation component of a text-to-speech system for Hindi
    • DOI 10.1006/csla.1993.1015
    • Madhukumar, A. S., Rajendran, S., & Yegnanarayana, B. (1993). Intonation component of a text-to-speech system for Hindi. Computer Speech and Language, 7(3), 283-301. DOI 10.1006/csla.1993.1015 (Pubitemid 23705304)
    • (1993) Computer Speech and Language , vol.7 , Issue.3 , pp. 283-301
    • Madhukumar, A.S.1    Rajendran, S.2    Yegnanarayana, B.3
  • 21
    • 79953181342
    • PhD thesis, Dept. of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India, June
    • Mary, L. (2006). Multi level implicit features for language and speaker recognition. PhD thesis, Dept. of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India, June.
    • (2006) Multi Level Implicit Features for Language and Speaker Recognition
    • Mary, L.1
  • 22
    • 52949094265
    • Extraction and representation of prosodic features for language and speaker recognition
    • 10.1016/j.specom.2008.04.010
    • Mary, L., & Yegnanarayana, B. (2008). Extraction and representation of prosodic features for language and speaker recognition. Speech Communication, 50, 782-796. DOI 10.1016/j.specom.2008.04.010
    • (2008) Speech Communication , vol.50 , pp. 782-796
    • Mary, L.1    Yegnanarayana, B.2
  • 23
    • 0000668614
    • Robustness of group-delay-based method for extraction of significant excitation from speech signals
    • 10.1109/89.799686
    • Murthy, P. S., & Yegnanarayana, B. (1999). Robustness of group-delay-based method for extraction of significant excitation from speech signals. IEEE Transactions on Speech and Audio Processing, 7, 609-619. DOI 10.1109/89.799686
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 609-619
    • Murthy, P.S.1    Yegnanarayana, B.2
  • 25
    • 33745205178
    • PhD thesis, Dept. of computer science and engineering, Indian institute of technology, Madras, Chennai, India, March
    • Prasanna, S. R. M. (2004). Event-based analysis of speech. PhD thesis, Dept. of computer science and engineering, Indian institute of technology, Madras, Chennai, India, March.
    • (2004) Event-based Analysis of Speech
    • Prasanna, S.R.M.1
  • 28
    • 65249112285
    • Vowel onset point detection using source, spectral peaks, and modulation spectrum energies
    • 10.1109/TASL.2008.2010884
    • Prasanna, S. R. M., Reddy, B. V. S., & Murthy, P. K. (2009). Vowel onset point detection using source, spectral peaks, and modulation spectrum energies. IEEE Transactions on Speech and Audio Processing, 17, 556-565. DOI 10.1109/TASL.2008.2010884
    • (2009) IEEE Transactions on Speech and Audio Processing , vol.17 , pp. 556-565
    • Prasanna, S.R.M.1    Reddy, B.V.S.2    Murthy, P.K.3
  • 31
    • 37549007588
    • Modeling supra-segmental features of syllables using neural networks
    • Prasad, P. B., & Prasanna, S. R. M. (Eds.). New York: Springer. DOI 10.1007/978-3-540-75398-8-4
    • Rao, K. S. (2008). Modeling supra-segmental features of syllables using neural networks. In P. B. Prasad & S. R. M. Prasanna (Eds.), Speech, audio, image and biomedical signal processing using neural networks (pp. 71-95). New York: Springer.
    • (2008) Speech, Audio, Image and Biomedical Signal Processing Using Neural Networks , pp. 71-95
    • Rao, K.S.1
  • 34
    • 33750713338
    • Modeling durations of syllables using neural networks
    • DOI 10.1016/j.csl.2006.06.003, PII S0885230806000234
    • Rao, K. S., & Yegnanarayana, B. (2007). Modeling durations of syllables using neural networks. Computer Speech and Language, 21(2), 282-295. DOI 10.1016/j.csl.2006.06.003 (Pubitemid 44709836)
    • (2007) Computer Speech and Language , vol.21 , Issue.2 , pp. 282-295
    • Rao, K.S.1    Yegnanarayana, B.2
  • 35
    • 54049142844
    • Intonation modeling for Indian languages
    • 10.1016/j.csl.2008.06.005
    • Rao, K. S., & Yegnanarayana, B. (2009). Intonation modeling for Indian languages. Computer Speech and Language, 23, 240-256. DOI 10.1016/j.csl.2008.06.005
    • (2009) Computer Speech and Language , vol.23 , pp. 240-256
    • Rao, K.S.1    Yegnanarayana, B.2
  • 38
    • 0029375490
    • Determination of instants of significant excitation in speech using group delay function
    • 10.1109/89.466662
    • Smits, R., & Yegnanarayana, B. (1995). Determination of instants of significant excitation in speech using group delay function. IEEE Transactions on Speech and Audio Processing, 3, 325-333. DOI 10.1109/89.466662
    • (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , pp. 325-333
    • Smits, R.1    Yegnanarayana, B.2
  • 39
    • 37549013057
    • A text-to-speech conversion system for Indian languages based on waveform concatenation model
    • Dept. of computer science and engineering, Indian institute of technology, Madras, March
    • Srikanth, S., Kumar, S. R. R., Sundar, R., & Yegnanarayana, B. (1989). A text-to-speech conversion system for Indian languages based on waveform concatenation model. Technical report No. 11, project VOIS, Dept. of computer science and engineering, Indian institute of technology, Madras, March.
    • (1989) Technical Report No. 11, Project VOIS
    • Srikanth, S.1    Kumar, S.R.R.2    Sundar, R.3    Yegnanarayana, B.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.