메뉴 건너뛰기




Volumn , Issue , 2010, Pages 92-115

Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit

Author keywords

[No Author keywords available]

Indexed keywords


EID: 77953693885     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.4018/978-1-61520-725-1.ch006     Document Type: Chapter
Times cited : (11)

References (52)
  • 1
    • 31044454666 scopus 로고    scopus 로고
    • Designing desirability in an augmentative and alternative communication device
    • doi:10.1007/s10209-005-0117-2
    • Allen, J. (2005). Designing desirability in an augmentative and alternative communication device. Universal Access in the Information Society, 4, 135-145. doi:10.1007/s10209-005-0117-2
    • (2005) Universal Access in the Information Society , vol.4 , pp. 135-145
    • Allen, J.1
  • 2
    • 0013234449 scopus 로고    scopus 로고
    • Family perspective on augmentative and alternative communication: Families of adolescents and young adults
    • doi:10.1080/07434619612331277438
    • Angelo, D. H., Kokosa, S. M., & Jones, S. D. (1996). Family perspective on augmentative and alternative communication: families of adolescents and young adults. Augmentative and Alternative Communication, 12(1), 13-20. doi:10.1080/07434619612331277438
    • (1996) Augmentative and Alternative Communication , vol.12 , Issue.1
    • Angelo, D.H.1    Kokosa, S.M.2    Jones, S.D.3
  • 4
    • 28244501231 scopus 로고    scopus 로고
    • Retrieved February 2, 2007, from
    • Black, A. W., & Lenzo, K. A. (2007). Building synthetic voices. Retrieved February 2, 2007, from http://festvox.org/festvox/festvox_toc.html
    • (2007) Building Synthetic Voices
    • Black, A.W.1    Lenzo, K.A.2
  • 5
    • 11344272923 scopus 로고    scopus 로고
    • The under-standability of aac: A conversation analysis study of acquired dysarthria
    • doi:10.1080/07434610400005614
    • Bloch, S., & Wilkinson, R. (2004). The under-standability of AAC: a conversation analysis study of acquired dysarthria. Augmentative and Alternative Communication, 20(4), 272-282. doi:10.1080/07434610400005614
    • (2004) Augmentative and Alternative Communication , vol.20 , Issue.4 , pp. 272-282
    • Bloch, S.1    Wilkinson, R.2
  • 9
    • 84961448597 scopus 로고
    • Age and gender preferences for synthetic and natural speech
    • doi:10.1080/07434619012331275544
    • Crabtree, M., Mirenda, P., & Beukelman, D. R. (1990). Age and gender preferences for synthetic and natural speech. Augmentative and Alternative Communication, 6(4), 256-261. doi:10.1080/07434619012331275544
    • (1990) Augmentative and Alternative Communication , vol.6 , Issue.4
    • Crabtree, M.1    Mirenda, P.2    Beukelman, D.R.3
  • 10
    • 70450160946 scopus 로고    scopus 로고
    • Personalizing synthetic voices for people with progressive speech disorders: Judging voice similarity
    • (in press), In
    • Creer, S. M., Cunningham, S. P., Green, P. D., & Fatema, K. (in press). Personalizing synthetic voices for people with progressive speech disorders: judging voice similarity. In Proceedings of Interspeech 2009.
    • Proceedings of Interspeech 2009
    • Creer, S.M.1    Cunningham, S.P.2    Green, P.D.3    Fatema, K.4
  • 15
    • 0025977381 scopus 로고
    • The effects of information and augmentative communication technique on attitudes toward non-speaking individuals
    • Gorenflo, D. W., & Gorenflo, C. W. (1991). The effects of information and augmentative communication technique on attitudes toward non-speaking individuals. Journal of Speech and Hearing Research, 34, 19-26.
    • (1991) Journal of Speech and Hearing Research , vol.34 , pp. 19-26
    • Gorenflo, D.W.1    Gorenflo, C.W.2
  • 16
    • 0001835075 scopus 로고    scopus 로고
    • Cultural aspects in the development of aac users
    • doi:10.1080/07434619612331277488
    • Hetzroni, O. E., & Harris, O. L. (1996). Cultural aspects in the development of AAC users. Augmentative and Alternative Communication, 12(1), 52-58. doi:10.1080/07434619612331277488
    • (1996) Augmentative and Alternative Communication , vol.12 , Issue.1 , pp. 52-58
    • Hetzroni, O.E.1    Harris, O.L.2
  • 17
    • 84872282749 scopus 로고    scopus 로고
    • Retrieved January 7, 2009, from
    • Huckvale, M. (2004) SCRIBE manual version 1.0. Retrieved January 7, 2009, from http://www.phon.ucl.ac.uk/resource/scribe/scribe-manual.htm
    • (2004) SCRIBE Manual Version 1.0
    • Huckvale, M.1
  • 18
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • doi:10.1016/S0167-6393(98)00085-5
    • Kawahara, H., Masuda-Katsuse, I., & de Cheveigné, A. (1999). Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds. Speech Communication, 27, 187-207. doi:10.1016/S0167-6393(98)00085-5
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 19
    • 0033512363 scopus 로고    scopus 로고
    • Quality of life while ageing with a disability
    • Kemp, B. (1999). Quality of life while ageing with a disability. Assistive Technology, 11, 158-163.
    • (1999) Assistive Technology , vol.11 , pp. 158-163
    • Kemp, B.1
  • 22
    • 84961415281 scopus 로고
    • Interaction involving individuals using augmentative and alternative communication systems: State of the art and future directions
    • doi:10.1080/07434618812331274657
    • Light, J. (1988). Interaction involving individuals using augmentative and alternative communication systems: state of the art and future directions. Augmentative and Alternative Communication, 4(2), 66-82. doi:10.1080/07434618812331274657
    • (1988) Augmentative and Alternative Communication , vol.4 , Issue.2 , pp. 66-82
    • Light, J.1
  • 23
    • 36248967624 scopus 로고    scopus 로고
    • Children's ideas for the design of aac assistive technologies for young children with complex communication needs
    • doi:10.1080/07434610701390475
    • Light, J., Page, R., Curran, J., & Pitkin, L. (2007). Children's ideas for the design of AAC assistive technologies for young children with complex communication needs. Augmentative and Alternative Communication, 23(4), 274-287. doi:10.1080/07434610701390475
    • (2007) Augmentative and Alternative Communication , vol.23 , Issue.4 , pp. 274-287
    • Light, J.1    Page, R.2    Curran, J.3    Pitkin, L.4
  • 24
    • 0036612363 scopus 로고    scopus 로고
    • Attitudes of children toward an unfamiliar peer using an aac device with and without voice output
    • doi:10.1080/07434610212331281191
    • Lilienfeld, M., & Alant, E. (2002). Attitudes of children toward an unfamiliar peer using an AAC device with and without voice output. Augmentative and Alternative Communication, 18(2), 91-101. doi:10.1080/07434610212331281191
    • (2002) Augmentative and Alternative Communication , vol.18 , Issue.2 , pp. 91-101
    • Lilienfeld, M.1    Alant, E.2
  • 26
    • 33646199423 scopus 로고    scopus 로고
    • Life with communication changes in parkinson's disease
    • doi:10.1093/ageing/afj053
    • Miller, N., Noble, E., Jones, D., & Burn, D. (2006). Life with communication changes in Parkinson's disease. Age and Ageing, 35, 235-239. doi:10.1093/ageing/afj053
    • (2006) Age and Ageing , vol.35 , pp. 235-239
    • Miller, N.1    Noble, E.2    Jones, D.3    Burn, D.4
  • 28
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • doi:10.1016/0167-6393(90)90021-Z
    • Moulines, E., & Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9, 453-467. doi:10.1016/0167-6393(90)90021-Z
    • (1990) Speech Communication , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 29
    • 11344285691 scopus 로고    scopus 로고
    • I prefer contact this close': Perceptions of aac by people with motor neurone disease and their communication partners
    • doi:10.1080/07434610400005663
    • Murphy, J. (2004). 'I prefer contact this close': perceptions of AAC by people with motor neurone disease and their communication partners. Augmentative and Alternative Communication, 20(4), 259-271. doi:10.1080/07434610400005663
    • (2004) Augmentative and Alternative Communication , vol.20 , Issue.4 , pp. 259-271
    • Murphy, J.1
  • 30
    • 0031574796 scopus 로고    scopus 로고
    • Are machines gender neutral? Gender-stereotypic responses to computers with voices
    • doi:10.1111/j.1559-1816.1997.tb00275.x
    • Nass, C., Moon, Y., & Green, N. (1997). Are machines gender neutral? Gender-stereotypic responses to computers with voices. Journal of Applied Social Psychology, 27, 864-876. doi:10.1111/j.1559-1816.1997.tb00275.x
    • (1997) Journal of Applied Social Psychology , vol.27 , pp. 864-876
    • Nass, C.1    Moon, Y.2    Green, N.3
  • 31
    • 0032012873 scopus 로고    scopus 로고
    • Identification and rankings of communication aid features by five groups
    • doi:10.1080/07434619812331278186
    • O'Keefe, B. M., Brown, L., & Schuller, R. (1998). Identification and rankings of communication aid features by five groups. Augmentative and Alternative Communication, 14(1), 37-50. doi:10.1080/07434619812331278186
    • (1998) Augmentative and Alternative Communication , vol.14 , Issue.1
    • O'Keefe, B.M.1    Brown, L.2    Schuller, R.3
  • 32
    • 7044232675 scopus 로고    scopus 로고
    • Working with asian american families whose children have augmentative and alternative communication needs
    • Retrieved January 4, 2009, from
    • Parette, P., & Huer, M. B. (2002). Working with Asian American families whose children have augmentative and alternative communication needs. Journal of Special Education Technology E-Journal, 17(4). Retrieved January 4, 2009, from http://jset.unlv.edu/17.4T/parette/first.html
    • (2002) Journal of Special Education Technology E-Journal , vol.17 , Issue.4
    • Parette, P.1    Huer, M.B.2
  • 33
    • 0024610919 scopus 로고
    • A tutorial on hmm and selected applications in speech recognition
    • doi:10.1109/5.18626
    • Rabiner, L. R. (1989). A tutorial on HMM and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257-286. doi:10.1109/5.18626
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 34
    • 15844380954 scopus 로고    scopus 로고
    • The dual challenges of aided communication and adolescence
    • doi:10.1080/10428190400006625
    • Smith, M. M. (2005). The dual challenges of aided communication and adolescence. Augmentative and Alternative Communication, 21(1), 76-79. doi:10.1080/10428190400006625
    • (2005) Augmentative and Alternative Communication , vol.21 , Issue.1 , pp. 76-79
    • Smith, M.M.1
  • 35
    • 48849108750 scopus 로고    scopus 로고
    • Computer synthesized speech and perceptions of the social influence of disabled users
    • doi:10.1177/0261927X08318035
    • Stern, S. E. (2008). Computer synthesized speech and perceptions of the social influence of disabled users. Journal of Language and Social Psychology, 27(3), 254-265. doi:10.1177/0261927X08318035
    • (2008) Journal of Language and Social Psychology , vol.27 , Issue.3 , pp. 254-265
    • Stern, S.E.1
  • 36
    • 0033505120 scopus 로고    scopus 로고
    • The persuasiveness of synthetic speech versus human speech
    • doi:10.1518/001872099779656680
    • Stern, S. E., Mullennix, J. W., Dyson, C.-L., & Wilson, S. J. (1999). The persuasiveness of synthetic speech versus human speech. Human Factors, 41, 588-595. doi:10.1518/001872099779656680
    • (1999) Human Factors , vol.41 , pp. 588-595
    • Stern, S.E.1    Mullennix, J.W.2    Dyson, C.-L.3    Wilson, S.J.4
  • 37
    • 0036333967 scopus 로고    scopus 로고
    • Effects of perceived disability on persuasiveness of computer synthesized speech
    • doi:10.1037/0021-9010.87.2.411
    • Stern, S. E., Mullennix, J. W., & Wilson, S. J. (2002). Effects of perceived disability on persuasiveness of computer synthesized speech. The Journal of Applied Psychology, 87, 411-417. doi:10.1037/0021-9010.87.2.411
    • (2002) The Journal of Applied Psychology , vol.87 , pp. 411-417
    • Stern, S.E.1    Mullennix, J.W.2    Wilson, S.J.3
  • 38
    • 27944500254 scopus 로고    scopus 로고
    • Persuasion and social perception of human vs. Synthetic voice across person as source and computer as source conditions
    • doi:10.1016/j.ijhcs.2005.07.002
    • Stern, S. E., Mullennix, J. W., & Yaroslavsky, I. (2006). Persuasion and social perception of human vs. synthetic voice across person as source and computer as source conditions. International Journal of Human-Computer Studies, 64, 43-52. doi:10.1016/j.ijhcs.2005.07.002
    • (2006) International Journal of Human-Computer Studies , vol.64 , pp. 43-52
    • Stern, S.E.1    Mullennix, J.W.2    Yaroslavsky, I.3
  • 39
    • 0002507218 scopus 로고
    • Speech accommodation theory: A social cognitive approach to language and speech
    • In M. Roloff, & C. R. Berger, (Eds.), Beverly Hills, CA: Sage
    • Street, R. L., & Giles, H. (1982). Speech accommodation theory: a social cognitive approach to language and speech. In M. Roloff, & C. R. Berger, (Eds.), Social cognition and communication (pp. 193-226). Beverly Hills, CA: Sage.
    • (1982) Social Cognition and Communication , pp. 193-226
    • Street, R.L.1    Giles, H.2
  • 41
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for hmm-based speech synthesis
    • Toda, T., & Tokuda, K. (2007). A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Transactions on Information and Systems. E (Norwalk, Conn.), 90-D(5), 816-824.
    • (2007) IEICE Transactions on Information and Systems. E (Norwalk, Conn.) , vol.90 , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 44
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using hsmm-based speaker adaptation and adaptive training
    • Yamagishi, J., & Kobayashi, T. (2007). Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training. IEICE Transactions on Information and Systems. E (Norwalk, Conn.), 90-D(2), 533-543.
    • (2007) IEICE Transactions on Information and Systems. E (Norwalk, Conn.) , vol.90 , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 45
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for hmm-based speech synthesis and a constrained smaplr adaptation algorithm
    • doi:10.1109/TASL.2008.2006647
    • Yamagishi, J., Kobayashi, T., Nakano, Y., Ogata, K., & Isogai, J. (2009). Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm. IEEE Transactions on Audio. Speech and Language Processing, 17(1), 66-83. doi:10.1109/TASL.2008.2006647
    • (2009) IEEE Transactions on Audio. Speech and Language Processing , vol.17 , Issue.1
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 48
    • 70449126171 scopus 로고    scopus 로고
    • The hts-2008 system: Yet another evaluation of the speaker-adaptive hmmbased speech synthesis system in the 2008 blizzard challenge
    • In, Brisbane, Australia. Retrieved March 2, 2009, from
    • Yamagishi, J., Zen, H., Wu, Y.-J., Toda, T., & Tokuda, K. (2008). The HTS-2008 system: yet another evaluation of the speaker-adaptive HMMbased speech synthesis system in the 2008 Blizzard challenge. In Proceedings of the Blizzard Challenge 2008, Brisbane, Australia. Retrieved March 2, 2009, from http://festvox.org/blizzard/bc2008/hts_Blizzard2008.pdf.
    • (2008) Proceedings of the Blizzard Challenge 2008
    • Yamagishi, J.1    Zen, H.2    Wu, Y.-J.3    Toda, T.4    Tokuda, K.5
  • 49
    • 32344446555 scopus 로고    scopus 로고
    • A system for creating personalized synthetic voices
    • In [baltimore.]
    • Yarrington, D., Pennington, C., Gray, J., & Bunnell, H. T. (2005). A system for creating personalized synthetic voices. In [Baltimore.]. Proceedings of ASSETS, 2005, 196-197.
    • (2005) Proceedings of ASSETS , vol.2005 , pp. 196-197
    • Yarrington, D.1    Pennington, C.2    Gray, J.3    Bunnell, H.T.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.