메뉴 건너뛰기




Volumn 30, Issue 3, 2014, Pages 226-236

Towards personalized speech synthesis for augmentative and alternative communication

Author keywords

Assistive communication; Assistive technology; Speaker identity; Speech synthesis; Voice conversion

Indexed keywords

COMMUNICATION AID; DYSARTHRIA; HUMAN; VOICE;

EID: 84907046206     PISSN: 07434618     EISSN: 14773848     Source Type: Journal    
DOI: 10.3109/07434618.2014.924026     Document Type: Article
Times cited : (29)

References (112)
  • 1
    • 0026881384 scopus 로고
    • Glottal wave analysis with pitch synchronous iterative adaptive inverse fi ltering
    • Alku, P. (1992). Glottal wave analysis with pitch synchronous iterative adaptive inverse fi ltering. Speech Communication, 11, 109-118.
    • (1992) Speech Communication , vol.11 , pp. 109-118
    • Alku, P.1
  • 2
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • Arslan, L. (1999). Speaker transformation algorithm using segmental codebooks (STASC). Speech Communication, 28, 211-226.
    • (1999) Speech Communication , vol.28 , pp. 211-226
    • Arslan, L.1
  • 3
    • 0032797216 scopus 로고    scopus 로고
    • Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech
    • Bachorowski, J., & Owren, M. (1999). Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech. Journal of the Acoustical Society of America, 106, 1054-1063.
    • (1999) Journal of the Acoustical Society of America , vol.106 , pp. 1054-1063
    • Bachorowski, J.1    Owren, M.2
  • 4
    • 0030166343 scopus 로고    scopus 로고
    • The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences
    • Benoît, C., Grice, M., & Hazan, V. (1996). The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences. Speech Communication, 18, 381-392.
    • (1996) Speech Communication , vol.18 , pp. 381-392
    • Benoît, C.1    Grice, M.2    Hazan, V.3
  • 5
    • 85133503504 scopus 로고    scopus 로고
    • Diphone synthesis using unit selection
    • November Paper presented at Blue Mountains, Australia. Retrieved from
    • Beutnagel, M., Conkie, A., & Syrdal, A. K. (1998, November). Diphone synthesis using unit selection. Paper presented at the 3rd ISCA Speech Synthesis Workshop (SSW3), Blue Mountains, Australia. Retrieved from http://www.isca- speech.org/archive-open/archive-papers/ssw3/ssw3-185.pdf
    • (1998) The 3rd ISCA Speech Synthesis Workshop (SSW3)
    • Beutnagel, M.1    Conkie, A.2    Syrdal, A.K.3
  • 6
    • 84961425284 scopus 로고
    • A statewide demographic survey of people with severe communication impairments
    • Bloomberg, K., & Johnson, H. (1990). A statewide demographic survey of people with severe communication impairments. Augmentative and Alternative Communication, 6, 50-60.
    • (1990) Augmentative and Alternative Communication , vol.6 , pp. 50-60
    • Bloomberg, K.1    Johnson, H.2
  • 7
    • 0001856243 scopus 로고
    • Contrastive accent and contrastive stress
    • Bolinger, D. (1961). Contrastive accent and contrastive stress, Language, 37, 83-96.
    • (1961) Language , vol.37 , pp. 83-96
    • Bolinger, D.1
  • 8
    • 0003708078 scopus 로고
    • Palo Alto, CA: Stanford University Press
    • Bolinger, D. (1989). Intonation and its uses. Palo Alto, CA: Stanford University Press.
    • (1989) Intonation and Its Uses
    • Bolinger, D.1
  • 10
    • 84907049577 scopus 로고    scopus 로고
    • Crafting small databases for unit selection TTS: Effects on intelligibility
    • September Paper presented at Kyoto, Japan. Retrieved from
    • Bunnell, H. T. (2010, September). Crafting small databases for unit selection TTS: Effects on intelligibility. Paper presented at the 7th ISCA Speech Synthesis Workshop (SSW7), Kyoto, Japan. Retrieved from http://isw3.naist.jp/∼ tomoki/ssw7/www/doc/ssw7-proceedings-rev.pdf
    • (2010) The 7th ISCA Speech Synthesis Workshop (SSW7)
    • Bunnell, H.T.1
  • 11
    • 85039153976 scopus 로고    scopus 로고
    • A biphone constrained concatenation method for diphone synthesis
    • November Paper presented at Blue Mountains, Australia. Retrieved from
    • Bunnell, H. T., Hoskins, S. R., & Yarrington, D. M. (1998, November). A biphone constrained concatenation method for diphone synthesis. Paper presented at the 3rd ISCA Speech Synthesis Workshop (SSW3), Blue Mountains, Australia. Retrieved from http://www.isca-speech.org/archive-open/archive- papers/ssw3/ssw3-171.pdf
    • (1998) The 3rd ISCA Speech Synthesis Workshop(SSW3)
    • Bunnell, H.T.1    Hoskins, S.R.2    Yarrington, D.M.3
  • 12
    • 85133491738 scopus 로고    scopus 로고
    • Analysis methods for assessing TTS intelligibility
    • August Paper presented at Bonn, Germany. Retrieved from
    • Bunnell, H. T., & Lilley, J. (2007, August). Analysis methods for assessing TTS intelligibility. Paper presented at the 6th ISCA Speech Synthesis Workshop (SSW6), Bonn, Germany. Retrieved from http://www.isca-speech.org/ archive-open/archive-papers/ssw6/ssw6-374.pdf
    • (2007) The 6th ISCA Speech Synthesis Workshop(SSW6)
    • Bunnell, H.T.1    Lilley, J.2
  • 13
    • 84899214271 scopus 로고    scopus 로고
    • Advances in computer speech synthesis and implications for assistive technologies
    • J. Mullenix & S. Stern (Eds.) Hershey, PA: IGI Global
    • Bunnell, H. T., & Pennington, C. (2010). Advances in computer speech synthesis and implications for assistive technologies. In J. Mullenix & S. Stern (Eds.), Computer synthesized speech technologies: Tools for aiding impairment (pp. 71-91). Hershey, PA: IGI Global.
    • (2010) Computer Synthesized Speech Technologies: Tools for Aiding Impairment , pp. 71-91
    • Bunnell, H.T.1    Pennington, C.2
  • 14
    • 33745218768 scopus 로고    scopus 로고
    • Automatic personal synthetic voice construction
    • Paper presented at Retrieved from
    • Bunnell, H. T., Pennington, C., Yarrington, D., & Gray, J. (2005). Automatic personal synthetic voice construction. Paper presented at Eurospeech 2005, 89-92. Retrieved from http://www.iscaspeech. org/archive/interspeech-2005/ i05-0089.html
    • (2005) Eurospeech , vol.2005 , pp. 89-92
    • Bunnell, H.T.1    Pennington, C.2    Yarrington, D.3    Gray, J.4
  • 17
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothed GMM and MAP adaptation
    • September Paper presented at Geneva, Switzerland. Retrieved from
    • Chen, Y., Chu, M., Chang, E., Liu, J., & Liu, R. (2003, September). Voice conversion with smoothed GMM and MAP adaptation. Paper presented at Eurospeech 2003, Geneva, Switzerland. Retrieved from: http://www.isca-speech. org/archive/eurospeech-2003/e03-2413.html
    • (2003) Eurospeech 2003
    • Chen, Y.1    Chu, M.2    Chang, E.3    Liu, J.4    Liu, R.5
  • 19
    • 0034490567 scopus 로고    scopus 로고
    • Men ' s voices and women ' s choices
    • Collins, S. (2000). Men ' s voices and women ' s choices. Animal Behavior, 60, 773-780.
    • (2000) Animal Behavior , vol.60 , pp. 773-780
    • Collins, S.1
  • 21
    • 77953693885 scopus 로고    scopus 로고
    • Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit
    • J. Mullenix & S. Stern (Eds.) Hershey, PA: IGI Global
    • Creer, S., Green, P., Cunningham, S., & Yamagishi, J. (2010). Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit. In J. Mullenix & S. Stern (Eds.), Computer synthesized speech technologies: Tools for aiding impairment (pp. 92-115). Hershey, PA: IGI Global.
    • (2010) Computer Synthesized Speech Technologies: Tools for Aiding Impairment , pp. 92-115
    • Creer, S.1    Green, P.2    Cunningham, S.3    Yamagishi, J.4
  • 22
    • 84976113790 scopus 로고
    • Falls and rises: Meanings and universals
    • Cruttenden, A. (1981). Falls and rises: meanings and universals. Journal of Linguistics 17, 77-91.
    • (1981) Journal of Linguistics , vol.17 , pp. 77-91
    • Cruttenden, A.1
  • 23
    • 0004239281 scopus 로고
    • Cambridge, UK: Cambridge University Press
    • Cruttenden, A. (1986). Intonation. Cambridge, UK: Cambridge University Press.
    • (1986) Intonation
    • Cruttenden, A.1
  • 25
    • 85079090632 scopus 로고
    • A quantitative assessment of the relative speaker discriminating properties of phonemes
    • April Paper presented at Adelaide, Australia. doi:10.1109/ICASSP.1994. 389337
    • Eatock, J., & Mason, J. (1994, April). A quantitative assessment of the relative speaker discriminating properties of phonemes. Paper presented at the 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Adelaide, Australia. doi:10.1109/ICASSP.1994.389337
    • (1994) The 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Eatock, J.1    Mason, J.2
  • 26
    • 0000337137 scopus 로고
    • Articulation testing methods
    • Egan, J.P. (1948). Articulation testing methods. The Laryngoscope 58, 955-991.
    • (1948) The Laryngoscope , vol.58 , pp. 955-991
    • Egan, J.P.1
  • 30
    • 0002633841 scopus 로고
    • A note on vocal tract size factors and non-uniform F-pattern scaling
    • Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from
    • Fant, G. (1966). A note on vocal tract size factors and non-uniform F-pattern scaling. Speech Transmission Laboratories Quarterly Progress Status Report, 7 (4), 22-30. Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from http://www. speech.kth.se/prod/publications/fi les/qpsr/1966/1966-7-4-022-030.pdf
    • (1966) Speech Transmission Laboratories Quarterly Progress Status Report , vol.7 , Issue.4 , pp. 22-30
    • Fant, G.1
  • 31
    • 33947684811 scopus 로고
    • A four-parameter model of glottal fl ow
    • Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from
    • Fant, G., Liljencrants, J., & Lin, Q. (1985). A four-parameter model of glottal fl ow. Speech Transmission Laboratories Quarterly Progress Status Report, 26 (4), 1-13. Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from http://www.speech.kth. se/prod/publications/fi les/qpsr/1985/1985-26-4-001-013.pdf
    • (1985) Speech Transmission Laboratories Quarterly Progress Status Report , vol.26 , Issue.4 , pp. 1-13
    • Fant, G.1    Liljencrants, J.2    Lin, Q.3
  • 32
    • 13844254175 scopus 로고    scopus 로고
    • Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices
    • Feinberg, D. R., Jones, B. C., Little, A. C., Burt, D. M., & Perrett, D. I. (2005). Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices. Animal Behavior, 69, 561-568.
    • (2005) Animal Behavior , vol.69 , pp. 561-568
    • Feinberg, D.R.1    Jones, B.C.2    Little, A.C.3    Burt, D.M.4    Perrett, D.I.5
  • 33
    • 0031203338 scopus 로고    scopus 로고
    • Perceiving the sex and identity of a talker without natural vocal timbre
    • Fellows, J. M., Remez, R. E., & Rubin, P. E. (1997). Perceiving the sex and identity of a talker without natural vocal timbre. Perception and Psychophysics 59, 839-849.
    • (1997) Perception and Psychophysics , vol.59 , pp. 839-849
    • Fellows, J.M.1    Remez, R.E.2    Rubin, P.E.3
  • 34
    • 0032878792 scopus 로고    scopus 로고
    • Morphology and development of the human vocal tract: A study using magnetic resonance imaging
    • Fitch, W. T., & Giedd, J. (1999). Morphology and development of the human vocal tract: A study using magnetic resonance imaging. Journal of the Acoustical Society of America, 106, 1511-1522.
    • (1999) Journal of the Acoustical Society of America , vol.106 , pp. 1511-1522
    • Fitch, W.T.1    Giedd, J.2
  • 35
    • 0028098207 scopus 로고
    • Effects of synthetic voice output on attitudes toward the augmented communicator
    • Gorenflo, C., Gorenflo, D., & Santer, S. A. (1994). Effects of synthetic voice output on attitudes toward the augmented communicator. Journal of Speech and Hearing Research, 37, 64-68.
    • (1994) Journal of Speech and Hearing Research , vol.37 , pp. 64-68
    • Gorenflo, C.1    Gorenflo, D.2    Santer, S.A.3
  • 36
    • 0017238868 scopus 로고
    • Perceptual features of speech for males in four perceived age decades
    • Hartman, D., & Danhauer, J. (1976). Perceptual features of speech for males in four perceived age decades. Journal of the Acoustical Society of America, 59, 713-715.
    • (1976) Journal of the Acoustical Society of America , vol.59 , pp. 713-715
    • Hartman, D.1    Danhauer, J.2
  • 38
    • 79959836789 scopus 로고    scopus 로고
    • Maximum a posteriori voice conversion using sequential Monte Carlo methods
    • September Paper presented at Makuhari, Japan
    • Helander, E., Silén, H., Míguez, J., & Gabbouj, M. (2010, September). Maximum a posteriori voice conversion using sequential Monte Carlo methods. Paper presented at Interspeech 2010, Makuhari, Japan.
    • (2010) Interspeech 2010
    • Helander, E.1    Silén, H.2    Míguez, J.3    Gabbouj, M.4
  • 40
    • 44949164829 scopus 로고    scopus 로고
    • A model of the regularities underlying speaker variation: Evidence from hybrid synthesis
    • September Paper presented at Pittsburgh, PA. Retrieved from
    • Hertz, S.R. (2006, September). A model of the regularities underlying speaker variation: Evidence from hybrid synthesis. Paper presented at the Ninth International Conference on Spoken Language Processing (ICSLP). Pittsburgh, PA. Retrieved from http://www. novaspeech.com/Documents/interspeech2006.pdf
    • (2006) The Ninth International Conference on Spoken Language Processing (ICSLP)
    • Hertz, S.R.1
  • 42
    • 0008499169 scopus 로고
    • Perceptual analysis of speaker identity
    • S. Saito (Ed.) Burke, VA: IOS press
    • Itoh, K., (1992). Perceptual analysis of speaker identity. In: S. Saito (Ed.), Speech science and technology (pp. 133-145). Burke, VA: IOS press.
    • (1992) Speech Science and Technology , pp. 133-145
    • Itoh, K.1
  • 46
    • 0034841948 scopus 로고    scopus 로고
    • Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
    • May Paper presented at Salt Lake City, UT. doi:10.1109/ICASSP.2001.941039
    • Kain, A., & Macon, M. W. (2001, May). Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Salt Lake City, UT. doi:10.1109/ICASSP. 2001.941039
    • (2001) The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Kain, A.1    MacOn, M.W.2
  • 47
    • 85133413596 scopus 로고    scopus 로고
    • Formant re-synthesis of dysarthric speech
    • June Paper presented at Pittsburgh, PA. Retrieved from
    • Kain, A., Niu, X., Hosom, J.-P., Miao, Q., & van Santen, J. P. H. (2004, June). Formant re-synthesis of dysarthric speech. Paper presented at the 5th ISCA Speech Synthesis Workshop (SSW5), Pittsburgh, PA. Retrieved from http://www.isca-speech.org/archive-open/ssw5/ssw5-025.html
    • (2004) The 5th ISCA Speech Synthesis Workshop(SSW5)
    • Kain, A.1    Niu, X.2    Hosom, J.-P.3    Miao, Q.4    Van Santen, J.P.H.5
  • 48
    • 70349210296 scopus 로고    scopus 로고
    • Using speech transformation to increase speech intelligibility for the hearing-and speakingimpaired
    • April Paper presented at Taipei, Taiwan. doi:10.1109/icassp.2009.4960406
    • Kain, A., & van Santen, J. (2009, April). Using speech transformation to increase speech intelligibility for the hearing-and speakingimpaired. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan. doi:10.1109/icassp.2009.4960406
    • (2009) The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Kain, A.1    Van Santen, J.2
  • 49
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased f0 extraction: Possible role of a repetitive structure in sounds
    • Kawahara, H., Masuda-Katsuse, I., & de Cheveigné, A. (1999). Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased f0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, 27, 187-207.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 50
    • 77953705589 scopus 로고    scopus 로고
    • The Blizzard Challenge 2009
    • September Paper presented at Edinburgh, UK
    • King, S. & Karaiskos, V. (2009, September). The Blizzard Challenge 2009. Paper presented at the Blizzard Challenge Workshop, Edinburgh, UK.
    • (2009) The Blizzard Challenge Workshop
    • King, S.1    Karaiskos, V.2
  • 51
    • 0026206653 scopus 로고
    • Comparing discrimination and recognition of unfamiliar voices
    • doi:10.1016/1067-6393(91)90016-M
    • Krieman, J., & Papcun, G. (1991). Comparing discrimination and recognition of unfamiliar voices. Speech Communication, 10, 265-275. doi:10.1016/1067-6393(91)90016-M
    • (1991) Speech Communication , vol.10 , pp. 265-275
    • Krieman, J.1    Papcun, G.2
  • 53
    • 25844437809 scopus 로고
    • The ability of listeners to identify voices
    • Los Angeles, CA: UCLA Phonetics Lab
    • Ladefoged, O., & Ladefoged, J. (1980). The ability of listeners to identify voices. UCLA Working Papers in Phonetics, 49, 43-51. Los Angeles, CA: UCLA Phonetics Lab.
    • (1980) UCLA Working Papers in Phonetics , vol.49 , pp. 43-51
    • Ladefoged, O.1    Ladefoged, J.2
  • 54
    • 0023793337 scopus 로고
    • Listeners ' perceptions of nonspeech characteristics of normal and dysarthric children
    • Lass, N. J., Ruscello, D. M., & Lakawicz, J. A. (1988). Listeners ' perceptions of nonspeech characteristics of normal and dysarthric children. Journal of Communication Disorders, 21, 385-391.
    • (1988) Journal of Communication Disorders , vol.21 , pp. 385-391
    • Lass, N.J.1    Ruscello, D.M.2    Lakawicz, J.A.3
  • 55
    • 0033883193 scopus 로고    scopus 로고
    • The effects of acoustic modifi cations on the identifi cation of familiar voices speaking isolated vowels
    • Lavner, Y., Gath, I., & Rosenhouse, J. (2000). The effects of acoustic modifi cations on the identifi cation of familiar voices speaking isolated vowels. Speech Communication, 30, 9-26.
    • (2000) Speech Communication , vol.30 , pp. 9-26
    • Lavner, Y.1    Gath, I.2    Rosenhouse, J.3
  • 57
    • 0002482529 scopus 로고
    • Suprasegmental features of speech
    • N.J. Lass (Ed.) New York, NY: Academic Press
    • Lehiste, I. (1976). Suprasegmental features of speech. In N.J. Lass (Ed.), Contemporary issues in experimental phonetics (pp. 225-239). New York, NY: Academic Press.
    • (1976) Contemporary Issues in Experimental Phonetics , pp. 225-239
    • Lehiste, I.1
  • 59
    • 77955426622 scopus 로고    scopus 로고
    • An analysis of HMM-based prediction of articulatory movements
    • Ling, Z.-H., Richmond, K., & Yamagishi, J. (2010a). An analysis of HMM-based prediction of articulatory movements. Speech Communication, 52, 834-846.
    • (2010) Speech Communication , vol.52 , pp. 834-846
    • Ling, Z.-H.1    Richmond, K.2    Yamagishi, J.3
  • 60
    • 79959823601 scopus 로고    scopus 로고
    • HMM-based Text-to-Articulation-movement prediction and analysis of critical articulators
    • September Paper presented at Makuhari, Japan. Retrieved from
    • Ling, Z.-H., Richmond, K., & Yamagishi, J. (2010b, September). HMM-based Text-to-Articulation-movement prediction and analysis of critical articulators. Paper presented at Interspeech 2010, Makuhari, Japan. Retrieved from : http://hdl.handle. net/1842.4563.
    • (2010) Interspeech 2010
    • Ling, Z.-H.1    Richmond, K.2    Yamagishi, J.3
  • 61
    • 0031601187 scopus 로고    scopus 로고
    • Acoustic correlates of perceived versus actual sexual orientation in men ' s speech
    • Linville, S. (1998). Acoustic correlates of perceived versus actual sexual orientation in men ' s speech. Folia Phoniatrica et Logopaedica, 50, 35-48.
    • (1998) Folia Phoniatrica et Logopaedica , vol.50 , pp. 35-48
    • Linville, S.1
  • 65
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • Moulines, E., & Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9, 453-467.
    • (1990) Speech Communication , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 66
    • 33744907285 scopus 로고    scopus 로고
    • The acoustic and perceptual bases of judgments of women and men ' s sexual orientation from read speech
    • Munson, B., McDonald, E. C., DeBoe, N. L., & White, A. R. (2006). The acoustic and perceptual bases of judgments of women and men ' s sexual orientation from read speech. Journal of Phonetics, 34, 202-240.
    • (2006) Journal of Phonetics , vol.34 , pp. 202-240
    • Munson, B.1    McDonald, E.C.2    Deboe, N.L.3    White, A.R.4
  • 67
    • 0027447292 scopus 로고
    • Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
    • doi:10.1121/1.405558
    • Murray, I. R., & Arnott, J. L. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America, 93, 1097-1108. doi:10.1121/1.405558
    • (1993) Journal of the Acoustical Society of America , vol.93 , pp. 1097-1108
    • Murray, I.R.1    Arnott, J.L.2
  • 68
    • 84906279165 scopus 로고    scopus 로고
    • Optimizations and fi tting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis
    • August Paper presented at Lyon, France. Retrieved from
    • Muthukumar, P.K., Black, A.W., & Bunnell, H.T. (2013, August). Optimizations and fi tting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis. Paper presented at InterSpeech 2013, Lyon, France. Retrieved from http://www. isca-speech.org/archive/interspeech- 2013/i13-0397.html
    • (2013) InterSpeech 2013
    • Muthukumar, P.K.1    Black, A.W.2    Bunnell, H.T.3
  • 69
    • 0029254176 scopus 로고
    • Transformation of formants for voice conversion using artifi cial neural networks
    • Narendranath, M., Murthy, H. A., Rajendran, S., & Yegnanarayana, B. (1995). Transformation of formants for voice conversion using artifi cial neural networks. Speech Communication 16, 207-216.
    • (1995) Speech Communication , vol.16 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 70
    • 85006544659 scopus 로고    scopus 로고
    • Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency attraction
    • Nass, C., & Lee, K. M. (2001). Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency attraction. Journal of Experimental Psychology: Applied, 7, 171-181.
    • (2001) Journal of Experimental Psychology: Applied , vol.7 , pp. 171-181
    • Nass, C.1    Lee, K.M.2
  • 71
    • 0010592593 scopus 로고
    • Speech Physiology
    • F. Minifi e, T. J. Hixon, & F. Williams (Eds.) Englewood Cliffs, NJ: Prentice-Hall
    • Netsell, R. (1973). Speech Physiology. In F. Minifi e, T. J. Hixon, & F. Williams (Eds.), Normal aspects of speech, hearing, and language (pp. 211-234). Englewood Cliffs, NJ: Prentice-Hall.
    • (1973) Normal Aspects of Speech, Hearing, and Language , pp. 211-234
    • Netsell, R.1
  • 74
    • 34547527563 scopus 로고    scopus 로고
    • A parametric approach for voice conversion
    • June Paper presented at Barecelona, Spain. Retrieved from
    • Nurminen, J, Popa, V., Tian, J., Tang, Y., & Kiss, I. (2006, June). A parametric approach for voice conversion. Paper presented at the TC-STAR Workshop on Speech-to-Speech Translation. Barecelona, Spain. Retrieved from http://www.tcstar. org/pubblicazioni/scientific-publications/Nokia/2006/ S2STranslation06-nokia3.pdf
    • (2006) The TC-STAR Workshop on Speech-to-Speech Translation
    • Nurminen, J.1    Popa, V.2    Tian, J.3    Tang, Y.4    Kiss, I.5
  • 76
    • 0036503253 scopus 로고    scopus 로고
    • Phonatory control in adults with cerebral palsy and severe dysarthria
    • Patel, R. (2002a). Phonatory control in adults with cerebral palsy and severe dysarthria. Augmentative and Alternative Communication, 18, 2-10.
    • (2002) Augmentative and Alternative Communication , vol.18 , pp. 2-10
    • Patel, R.1
  • 77
    • 0036787690 scopus 로고    scopus 로고
    • Prosodic Control in severe dysarthria: Preserved ability to mark the question-statement contrast
    • Patel, R. (2002b). Prosodic Control in severe dysarthria: Preserved ability to mark the question-statement contrast. Journal of Speech, Language, and Hearing Research, 45, 858-870.
    • (2002) Journal of Speech, Language, and Hearing Research , vol.45 , pp. 858-870
    • Patel, R.1
  • 78
    • 0347586935 scopus 로고    scopus 로고
    • Acoustic characteristics of the question-statement contrast in severe dysarthria due to cerebral palsy
    • Patel, R. (2003). Acoustic characteristics of the question-statement contrast in severe dysarthria due to cerebral palsy. Journal of Speech, Language, and Hearing Research, 46, 1401-1415.
    • (2003) Journal of Speech, Language, and Hearing Research , vol.46 , pp. 1401-1415
    • Patel, R.1
  • 79
    • 10944259978 scopus 로고    scopus 로고
    • The acoustics of contrastive prosody in adults with cerebral palsy
    • Patel, R. (2004). The acoustics of contrastive prosody in adults with cerebral palsy. Journal of Medical Speech-Language Pathology, 12, 189-193.
    • (2004) Journal of Medical Speech-Language Pathology , vol.12 , pp. 189-193
    • Patel, R.1
  • 80
    • 84865394622 scopus 로고    scopus 로고
    • Intelligibility and attitudes toward a speech synthesizer using dysarthric vocalizations
    • Patel, R., & Roden, A. (2008). Intelligibility and attitudes toward a speech synthesizer using dysarthric vocalizations. Journal of Medical Speech-Language Pathology, 16, 243-249.
    • (2008) Journal of Medical Speech-Language Pathology , vol.16 , pp. 243-249
    • Patel, R.1    Roden, A.2
  • 81
    • 85044897711 scopus 로고    scopus 로고
    • Using computer games to mediate caregiver-child communication for children with severe dysarthria
    • Patel, R., & Salata, A. (2006). Using computer games to mediate caregiver-child communication for children with severe dysarthria. Journal of Medical Speech-Language Pathology, 14, 279-284.
    • (2006) Journal of Medical Speech-Language Pathology , vol.14 , pp. 279-284
    • Patel, R.1    Salata, A.2
  • 82
    • 34347399580 scopus 로고    scopus 로고
    • Stress identifi cation in speakers with dysarthria due to cerebral palsy: An initial report
    • Patel, R., & Watkins, C. (2007). Stress identifi cation in speakers with dysarthria due to cerebral palsy: An initial report. Journal of Medical Speech-Language Pathology, 15, 149-159.
    • (2007) Journal of Medical Speech-Language Pathology , vol.15 , pp. 149-159
    • Patel, R.1    Watkins, C.2
  • 85
    • 33748443739 scopus 로고    scopus 로고
    • Extraction of speaker-specifi c excitation information from linear prediction residual of speech
    • doi:10.1016/j.specom.2006.06.002
    • Prasanna, S. R. M., Gupta, C. S., & Yegnanarayana, B. (2006). Extraction of speaker-specifi c excitation information from linear prediction residual of speech. Speech Communication 48, 1243-1261. doi:10.1016/j.specom. 2006.06.002
    • (2006) Speech Communication , vol.48 , pp. 1243-1261
    • Prasanna, S.R.M.1    Gupta, C.S.2    Yegnanarayana, B.3
  • 88
    • 0027525457 scopus 로고
    • On the intonation of sinusoidal sentences: Contour and pitch height
    • Remez, R. E., & Rubin, P. E. (1993). On the intonation of sinusoidal sentences: Contour and pitch height. Journal of the Acoustical Society of America, 94, 1983-1988.
    • (1993) Journal of the Acoustical Society of America , vol.94 , pp. 1983-1988
    • Remez, R.E.1    Rubin, P.E.2
  • 90
    • 0023756465 scopus 로고
    • Speech synthesis by rule using an optimal selection of non-uniform synthesis units
    • May Paper presented at New York, NY. doi:10.1109/ICASSP.1988.196677
    • Sagisaka, Y. (1988, May). Speech synthesis by rule using an optimal selection of non-uniform synthesis units. Paper presented at the 1988 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New York, NY. doi:10.1109/ICASSP.1988.196677
    • (1988) The 1988 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Sagisaka, Y.1
  • 91
    • 34547507542 scopus 로고    scopus 로고
    • Frequency warping based on mapping formant parameters
    • September Paper presented at Pittsburgh, PA
    • Shuang, Z-W., Bakis, R., Shectman, S., Chazan, D., & Qin, Y. (2006, September). Frequency warping based on mapping formant parameters. Paper presented at Interspeech 2006, Pittsburgh, PA. http://www.isca-speech.org/ archive/interspeech-2006/i06-1768.html
    • (2006) Interspeech 2006
    • Shuang, Z.-W.1    Bakis, R.2    Shectman, S.3    Chazan, D.4    Qin, Y.5
  • 92
    • 84887611468 scopus 로고    scopus 로고
    • Augmentative and alternative communication
    • J. H. Stone & M. Blouin (Eds.) Available online
    • Sigafoos, J., Schlosser, R. W., & Sutherland, D. 2013. Augmentative and alternative communication. In: J. H. Stone & M. Blouin (Eds.), International encyclopedia of rehabilitation. Available online: http://cirrie.buffalo.edu/encyclopedia/en/article/50
    • (2013) International Encyclopedia of Rehabilitation
    • Sigafoos, J.1    Schlosser, R.W.2    Sutherland, D.3
  • 94
    • 0038042432 scopus 로고    scopus 로고
    • Male voices and perceived sexual orientation: An experiment and theoretical approach
    • Smyth, R., Jacobs, G., & Rogers, H. (2003). Male voices and perceived sexual orientation: An experiment and theoretical approach. Language and Society, 32, 329-350.
    • (2003) Language and Society , vol.32 , pp. 329-350
    • Smyth, R.1    Jacobs, G.2    Rogers, H.3
  • 96
    • 85009086192 scopus 로고    scopus 로고
    • Diphone concatenation using a harmonic plus noise model of speech
    • September Paper presented at Rhodes, Greece. Retrieved from
    • Stylianou, Y., Dutoit, T., & Schroeter, J. (1997, September). Diphone concatenation using a harmonic plus noise model of speech. Paper presented at Eurospeech 1997, Rhodes, Greece. Retrieved from: http://www.isca-speech.org/ archive/eurospeech-1997/e97-0613. html
    • (1997) Eurospeech 1997
    • Stylianou, Y.1    Dutoit, T.2    Schroeter, J.3
  • 98
    • 84878394226 scopus 로고    scopus 로고
    • Text-to-speech intelligibility across speech rates
    • September Paper presented at Portland, OR. Retrieved from
    • Syrdal, A. K., Bunnell, H. T., Hertz, S. R., Mishra, T., Spiegel, M., Bickley, C., Makashay, M. J. (2012, September). Text-to-speech intelligibility across speech rates. Paper presented at InterSpeech 2012, Portland, OR. Retrieved from : http://www.isca-speech.org/archive/interspeech-2012/i12-0623. html
    • (2012) InterSpeech 2012
    • Syrdal, A.K.1    Bunnell, H.T.2    Hertz, S.R.3    Mishra, T.4    Spiegel, M.5    Bickley, C.6    Makashay, M.J.7
  • 99
    • 0003058857 scopus 로고
    • On the basic scheme and algorithms in non-uniform unit speech synthesis
    • G. Bailly, C. Benoît, & T. R. Sawallis (Eds.) Amsterdam, The Netherlands: North-Holland Publishing Co
    • Takeda, K., Abe, K., & Sagisaka, Y. (1992). On the basic scheme and algorithms in non-uniform unit speech synthesis. In G. Bailly, C. Benoît, & T. R. Sawallis (Eds.), Talking machines: Theories, models, and designs (pp. 93-105). Amsterdam, The Netherlands: North-Holland Publishing Co.
    • (1992) Talking Machines: Theories, Models, and Designs , pp. 93-105
    • Takeda, K.1    Abe, K.2    Sagisaka, Y.3
  • 101
    • 85009069262 scopus 로고    scopus 로고
    • Straight-based voice conversion algorithm based on Gaussian mixture model
    • October Paper presented at Beijing, China. Retrieved from
    • Toda, T., Lu, J., Saruwatari, H., & Shikano, K. (2000, October). Straight-based voice conversion algorithm based on Gaussian mixture model. Paper presented at the Sixth International Conference on Spoken Language Processing, Beijing, China. Retrieved from: http://hdl.handle.net/10061/8187
    • (2000) The Sixth International Conference on Spoken Language Processing
    • Toda, T.1    Lu, J.2    Saruwatari, H.3    Shikano, K.4
  • 103
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • doi:10.1109/tasl.2007.907344
    • Toda, T., Black, A. W., & Tokuda, K. (2007b). Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory. IEEE Transactions on Audio, Speech, and Language Processing, 15, 2222-2235. doi:10.1109/tasl.2007.907344
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 105
    • 0027930431 scopus 로고
    • Speaker race identifi cation from acoustic cues in the vocal signal
    • Walton, J., & Orlikoff, R. (1994). Speaker race identifi cation from acoustic cues in the vocal signal. Journal of Speech Language and Hearing Research 37, 4, 738-745.
    • (1994) Journal of Speech Language and Hearing Research , vol.37 , Issue.4 , pp. 738-745
    • Walton, J.1    Orlikoff, R.2
  • 107
    • 70450188371 scopus 로고    scopus 로고
    • HMM adaptation and voice conversion for the synthesis of child speech: A comparison
    • September Paper presented at Brighton, United Kingdom. Retrieved from
    • Watts, O., Yamagishi, J., King, S., & Berkling, K. (2009, September). HMM adaptation and voice conversion for the synthesis of child speech: A comparison. Paper presented at Interspeech 2009, Brighton, United Kingdom. Retrieved from: http://www.iscaspeech. org/archive/interspeech-2009/i09-2627. html
    • (2009) Interspeech 2009
    • Watts, O.1    Yamagishi, J.2    King, S.3    Berkling, K.4
  • 109
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • Yamagishi, J., & Kobayashi, T. (2007). Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training. IEICE Transactions on Information and Systems, E 90-D, 533-543.
    • (2007) IEICe Transactions on Information and Systems, e , vol.90-D , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 111
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Zen, H., Tokuda, K., & Black, A. W. (2009). Statistical parametric speech synthesis. Speech Communication, 51, 1039-1064.
    • (2009) Speech Communication , vol.51 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.