SCOPUS 정보 검색 플랫폼

AAC: Augmentative and Alternative Communication

Volumn 30, Issue 3, 2014, Pages 226-236

Towards personalized speech synthesis for augmentative and alternative communication

(3) Mills, Timothy a,c Bunnell, H Timothy b Patel, Rupal a

a NORTHEASTERN UNIVERSITY (United States)

b NEMOURS CHILDREN S CLINIC (United States)

c UNIVERSITY OF ALBERTA (Canada)

Author keywords

Assistive communication; Assistive technology; Speaker identity; Speech synthesis; Voice conversion

Indexed keywords

COMMUNICATION AID; DYSARTHRIA; HUMAN; VOICE;

COMMUNICATION AIDS FOR DISABLED; DYSARTHRIA; HUMANS; VOICE;

EID: 84907046206 PISSN: 07434618 EISSN: 14773848 Source Type: Journal
DOI: 10.3109/07434618.2014.924026 Document Type: Article

Times cited : (29)

References (112)

1
- 0026881384
- Glottal wave analysis with pitch synchronous iterative adaptive inverse fi ltering
- Alku, P. (1992). Glottal wave analysis with pitch synchronous iterative adaptive inverse fi ltering. Speech Communication, 11, 109-118.
- (1992) Speech Communication , vol.11 , pp. 109-118
- Alku, P.¹

2
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- Arslan, L. (1999). Speaker transformation algorithm using segmental codebooks (STASC). Speech Communication, 28, 211-226.
- (1999) Speech Communication , vol.28 , pp. 211-226
- Arslan, L.¹

3
- 0032797216
- Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech
- Bachorowski, J., & Owren, M. (1999). Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech. Journal of the Acoustical Society of America, 106, 1054-1063.
- (1999) Journal of the Acoustical Society of America , vol.106 , pp. 1054-1063
- Bachorowski, J.¹ Owren, M.²

4
- 0030166343
- The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences
- Benoît, C., Grice, M., & Hazan, V. (1996). The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences. Speech Communication, 18, 381-392.
- (1996) Speech Communication , vol.18 , pp. 381-392
- Benoît, C.¹ Grice, M.² Hazan, V.³

5
- 85133503504
- Diphone synthesis using unit selection
- November Paper presented at Blue Mountains, Australia. Retrieved from
- Beutnagel, M., Conkie, A., & Syrdal, A. K. (1998, November). Diphone synthesis using unit selection. Paper presented at the 3rd ISCA Speech Synthesis Workshop (SSW3), Blue Mountains, Australia. Retrieved from http://www.isca- speech.org/archive-open/archive-papers/ssw3/ssw3-185.pdf
- (1998) The 3rd ISCA Speech Synthesis Workshop (SSW3)
- Beutnagel, M.¹ Conkie, A.² Syrdal, A.K.³

6
- 84961425284
- A statewide demographic survey of people with severe communication impairments
- Bloomberg, K., & Johnson, H. (1990). A statewide demographic survey of people with severe communication impairments. Augmentative and Alternative Communication, 6, 50-60.
- (1990) Augmentative and Alternative Communication , vol.6 , pp. 50-60
- Bloomberg, K.¹ Johnson, H.²

7
- 0001856243
- Contrastive accent and contrastive stress
- Bolinger, D. (1961). Contrastive accent and contrastive stress, Language, 37, 83-96.
- (1961) Language , vol.37 , pp. 83-96
- Bolinger, D.¹

8
- 0003708078
- Palo Alto, CA: Stanford University Press
- Bolinger, D. (1989). Intonation and its uses. Palo Alto, CA: Stanford University Press.
- (1989) Intonation and Its Uses
- Bolinger, D.¹

9
- 0026694625
- A survey of the communication-impaired population of Tayside
- Brophy-Arnott, M. B., Newell, A. F., Arnott, J. L., & Condie, D. (1992). A survey of the communication-impaired population of Tayside. European Journal of Disorders of Communication, 25, 159-173.
- (1992) European Journal of Disorders of Communication , vol.25 , pp. 159-173
- Brophy-Arnott, M.B.¹ Newell, A.F.² Arnott, J.L.³ Condie, D.⁴

10
- 84907049577
- Crafting small databases for unit selection TTS: Effects on intelligibility
- September Paper presented at Kyoto, Japan. Retrieved from
- Bunnell, H. T. (2010, September). Crafting small databases for unit selection TTS: Effects on intelligibility. Paper presented at the 7th ISCA Speech Synthesis Workshop (SSW7), Kyoto, Japan. Retrieved from http://isw3.naist.jp/∼ tomoki/ssw7/www/doc/ssw7-proceedings-rev.pdf
- (2010) The 7th ISCA Speech Synthesis Workshop (SSW7)
- Bunnell, H.T.¹

11
- 85039153976
- A biphone constrained concatenation method for diphone synthesis
- November Paper presented at Blue Mountains, Australia. Retrieved from
- Bunnell, H. T., Hoskins, S. R., & Yarrington, D. M. (1998, November). A biphone constrained concatenation method for diphone synthesis. Paper presented at the 3rd ISCA Speech Synthesis Workshop (SSW3), Blue Mountains, Australia. Retrieved from http://www.isca-speech.org/archive-open/archive- papers/ssw3/ssw3-171.pdf
- (1998) The 3rd ISCA Speech Synthesis Workshop(SSW3)
- Bunnell, H.T.¹ Hoskins, S.R.² Yarrington, D.M.³

12
- 85133491738
- Analysis methods for assessing TTS intelligibility
- August Paper presented at Bonn, Germany. Retrieved from
- Bunnell, H. T., & Lilley, J. (2007, August). Analysis methods for assessing TTS intelligibility. Paper presented at the 6th ISCA Speech Synthesis Workshop (SSW6), Bonn, Germany. Retrieved from http://www.isca-speech.org/ archive-open/archive-papers/ssw6/ssw6-374.pdf
- (2007) The 6th ISCA Speech Synthesis Workshop(SSW6)
- Bunnell, H.T.¹ Lilley, J.²

13
- 84899214271
- Advances in computer speech synthesis and implications for assistive technologies
- J. Mullenix & S. Stern (Eds.) Hershey, PA: IGI Global
- Bunnell, H. T., & Pennington, C. (2010). Advances in computer speech synthesis and implications for assistive technologies. In J. Mullenix & S. Stern (Eds.), Computer synthesized speech technologies: Tools for aiding impairment (pp. 71-91). Hershey, PA: IGI Global.
- (2010) Computer Synthesized Speech Technologies: Tools for Aiding Impairment , pp. 71-91
- Bunnell, H.T.¹ Pennington, C.²

14
- 33745218768
- Automatic personal synthetic voice construction
- Paper presented at Retrieved from
- Bunnell, H. T., Pennington, C., Yarrington, D., & Gray, J. (2005). Automatic personal synthetic voice construction. Paper presented at Eurospeech 2005, 89-92. Retrieved from http://www.iscaspeech. org/archive/interspeech-2005/ i05-0089.html
- (2005) Eurospeech , vol.2005 , pp. 89-92
- Bunnell, H.T.¹ Pennington, C.² Yarrington, D.³ Gray, J.⁴

15
- 0008480934
- (Doctoral dissertation). Indiana University, MI, USA
- Carrell, T. D. (1984). Contributions of fundamental frequency, formant spacing, and glottal waveform to talker identifi cation (Doctoral dissertation). Indiana University, MI, USA.
- (1984) Contributions of Fundamental Frequency, Formant Spacing, and Glottal Waveform to Talker Identifi Cation
- Carrell, T.D.¹

16
- 25944431835
- Effects of glottal waveform on the perception of talker sex
- Carrell, T. D. (1985). Effects of glottal waveform on the perception of talker sex. Journal of the Acoustical Society of America, 70, S97.
- (1985) Journal of the Acoustical Society of America , vol.70
- Carrell, T.D.¹

17
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- September Paper presented at Geneva, Switzerland. Retrieved from
- Chen, Y., Chu, M., Chang, E., Liu, J., & Liu, R. (2003, September). Voice conversion with smoothed GMM and MAP adaptation. Paper presented at Eurospeech 2003, Geneva, Switzerland. Retrieved from: http://www.isca-speech. org/archive/eurospeech-2003/e03-2413.html
- (2003) Eurospeech 2003
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

18
- 0003763278
- Tokyo, Japan: Tokyo-Kaiseikan
- Chiba, T., & Kajiyama, J. (1941). The vowel: Its nature and structure. Tokyo, Japan: Tokyo-Kaiseikan.
- (1941) The Vowel: Its Nature and Structure
- Chiba, T.¹ Kajiyama, J.²

19
- 0034490567
- Men ' s voices and women ' s choices
- Collins, S. (2000). Men ' s voices and women ' s choices. Animal Behavior, 60, 773-780.
- (2000) Animal Behavior , vol.60 , pp. 773-780
- Collins, S.¹

20
- 0742319422
- Ottawa, Canada: Statistics Canada. Retrieved from
- Cossette, L. & Duclos, É. (2002). A profi le of disability in Canada, 2001 (89-577-XIE). Ottawa, Canada: Statistics Canada. Retrieved from http://www.statcan.gc.ca/pub/89-577-x/pdf/4228016-eng.pdf
- (2002) A Profi le of Disability in Canada, 2001 (89-577-XIE)
- Cossette, L.¹ Duclos, E.²

21
- 77953693885
- Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit
- J. Mullenix & S. Stern (Eds.) Hershey, PA: IGI Global
- Creer, S., Green, P., Cunningham, S., & Yamagishi, J. (2010). Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit. In J. Mullenix & S. Stern (Eds.), Computer synthesized speech technologies: Tools for aiding impairment (pp. 92-115). Hershey, PA: IGI Global.
- (2010) Computer Synthesized Speech Technologies: Tools for Aiding Impairment , pp. 92-115
- Creer, S.¹ Green, P.² Cunningham, S.³ Yamagishi, J.⁴

22
- 84976113790
- Falls and rises: Meanings and universals
- Cruttenden, A. (1981). Falls and rises: meanings and universals. Journal of Linguistics 17, 77-91.
- (1981) Journal of Linguistics , vol.17 , pp. 77-91
- Cruttenden, A.¹

23
- 0004239281
- Cambridge, UK: Cambridge University Press
- Cruttenden, A. (1986). Intonation. Cambridge, UK: Cambridge University Press.
- (1986) Intonation
- Cruttenden, A.¹

24
- 77953707533
- Spectral mapping using artifi cial neural networks for voice conversion
- Desai, S., Black, A. W., Yegnanarayana, B., & Prahallad, K. (2010). Spectral mapping using artifi cial neural networks for voice conversion. IEEE Transactions on Audio, Speech, and Language Processing, 18, 954-964.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , pp. 954-964
- Desai, S.¹ Black, A.W.² Yegnanarayana, B.³ Prahallad, K.⁴

25
- 85079090632
- A quantitative assessment of the relative speaker discriminating properties of phonemes
- April Paper presented at Adelaide, Australia. doi:10.1109/ICASSP.1994. 389337
- Eatock, J., & Mason, J. (1994, April). A quantitative assessment of the relative speaker discriminating properties of phonemes. Paper presented at the 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Adelaide, Australia. doi:10.1109/ICASSP.1994.389337
- (1994) The 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Eatock, J.¹ Mason, J.²

26
- 0000337137
- Articulation testing methods
- Egan, J.P. (1948). Articulation testing methods. The Laryngoscope 58, 955-991.
- (1948) The Laryngoscope , vol.58 , pp. 955-991
- Egan, J.P.¹

27
- 77953727123
- Voice conversion based on weighted frequency warping
- Erro, D., Moreno, A., & Bonafonte, A. (2010a). Voice conversion based on weighted frequency warping. IEEE Transactions on Audio, Speech, and Language Processing, 18, 922-931.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

28
- 77953725318
- INCA algorithm for training voice conversion systems from nonparallel corpora
- Erro, D., Moreno, A., & Bonafonte, A. (2010b). INCA algorithm for training voice conversion systems from nonparallel corpora. IEEE Transactions on Audio, Speech, and Language Processing, 18, 944-953.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , pp. 944-953
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

29
- 0003418124
- The Hague, Netherlands: Mouton & Co
- Fant, G. (1960). Acoustic theory of speech production. The Hague, Netherlands: Mouton & Co.
- (1960) Acoustic Theory of Speech Production
- Fant, G.¹

30
- 0002633841
- A note on vocal tract size factors and non-uniform F-pattern scaling
- Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from
- Fant, G. (1966). A note on vocal tract size factors and non-uniform F-pattern scaling. Speech Transmission Laboratories Quarterly Progress Status Report, 7 (4), 22-30. Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from http://www. speech.kth.se/prod/publications/fi les/qpsr/1966/1966-7-4-022-030.pdf
- (1966) Speech Transmission Laboratories Quarterly Progress Status Report , vol.7 , Issue.4 , pp. 22-30
- Fant, G.¹

31
- 33947684811
- A four-parameter model of glottal fl ow
- Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from
- Fant, G., Liljencrants, J., & Lin, Q. (1985). A four-parameter model of glottal fl ow. Speech Transmission Laboratories Quarterly Progress Status Report, 26 (4), 1-13. Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from http://www.speech.kth. se/prod/publications/fi les/qpsr/1985/1985-26-4-001-013.pdf
- (1985) Speech Transmission Laboratories Quarterly Progress Status Report , vol.26 , Issue.4 , pp. 1-13
- Fant, G.¹ Liljencrants, J.² Lin, Q.³

32
- 13844254175
- Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices
- Feinberg, D. R., Jones, B. C., Little, A. C., Burt, D. M., & Perrett, D. I. (2005). Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices. Animal Behavior, 69, 561-568.
- (2005) Animal Behavior , vol.69 , pp. 561-568
- Feinberg, D.R.¹ Jones, B.C.² Little, A.C.³ Burt, D.M.⁴ Perrett, D.I.⁵

33
- 0031203338
- Perceiving the sex and identity of a talker without natural vocal timbre
- Fellows, J. M., Remez, R. E., & Rubin, P. E. (1997). Perceiving the sex and identity of a talker without natural vocal timbre. Perception and Psychophysics 59, 839-849.
- (1997) Perception and Psychophysics , vol.59 , pp. 839-849
- Fellows, J.M.¹ Remez, R.E.² Rubin, P.E.³

34
- 0032878792
- Morphology and development of the human vocal tract: A study using magnetic resonance imaging
- Fitch, W. T., & Giedd, J. (1999). Morphology and development of the human vocal tract: A study using magnetic resonance imaging. Journal of the Acoustical Society of America, 106, 1511-1522.
- (1999) Journal of the Acoustical Society of America , vol.106 , pp. 1511-1522
- Fitch, W.T.¹ Giedd, J.²

35
- 0028098207
- Effects of synthetic voice output on attitudes toward the augmented communicator
- Gorenflo, C., Gorenflo, D., & Santer, S. A. (1994). Effects of synthetic voice output on attitudes toward the augmented communicator. Journal of Speech and Hearing Research, 37, 64-68.
- (1994) Journal of Speech and Hearing Research , vol.37 , pp. 64-68
- Gorenflo, C.¹ Gorenflo, D.² Santer, S.A.³

36
- 0017238868
- Perceptual features of speech for males in four perceived age decades
- Hartman, D., & Danhauer, J. (1976). Perceptual features of speech for males in four perceived age decades. Journal of the Acoustical Society of America, 59, 713-715.
- (1976) Journal of the Acoustical Society of America , vol.59 , pp. 713-715
- Hartman, D.¹ Danhauer, J.²

37
- 51449107658
- LSF mapping for voice conversion with very small training sets
- April Paper presented at Las Vegas, NV
- Helander, E., Nurminen, J., & Gabbouj, M. (2008, April). LSF mapping for voice conversion with very small training sets. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, NV.
- (2008) The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Helander, E.¹ Nurminen, J.² Gabbouj, M.³

38
- 79959836789
- Maximum a posteriori voice conversion using sequential Monte Carlo methods
- September Paper presented at Makuhari, Japan
- Helander, E., Silén, H., Míguez, J., & Gabbouj, M. (2010, September). Maximum a posteriori voice conversion using sequential Monte Carlo methods. Paper presented at Interspeech 2010, Makuhari, Japan.
- (2010) Interspeech 2010
- Helander, E.¹ Silén, H.² Míguez, J.³ Gabbouj, M.⁴

39
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- Helander, E., Silén, H., Virtanen, T., & Gabbouj, M. (2012). Voice conversion using dynamic kernel partial least squares regression. IEEE Transactions on Audio, Speech, and Language Processing, 20, 806-817.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , pp. 806-817
- Helander, E.¹ Silén, H.² Virtanen, T.³ Gabbouj, M.⁴

40
- 44949164829
- A model of the regularities underlying speaker variation: Evidence from hybrid synthesis
- September Paper presented at Pittsburgh, PA. Retrieved from
- Hertz, S.R. (2006, September). A model of the regularities underlying speaker variation: Evidence from hybrid synthesis. Paper presented at the Ninth International Conference on Spoken Language Processing (ICSLP). Pittsburgh, PA. Retrieved from http://www. novaspeech.com/Documents/interspeech2006.pdf
- (2006) The Ninth International Conference on Spoken Language Processing (ICSLP)
- Hertz, S.R.¹

41
- 0346905110
- The speaker identifi cation problem
- Hollien, H., & Klepper, B. (1984). The speaker identifi cation problem. Advances in Forensic Psychology and Psychiatry, 1, 87-111.
- (1984) Advances in Forensic Psychology and Psychiatry , vol.1 , pp. 87-111
- Hollien, H.¹ Klepper, B.²

42
- 0008499169
- Perceptual analysis of speaker identity
- S. Saito (Ed.) Burke, VA: IOS press
- Itoh, K., (1992). Perceptual analysis of speaker identity. In: S. Saito (Ed.), Speech science and technology (pp. 133-145). Burke, VA: IOS press.
- (1992) Speech Science and Technology , pp. 133-145
- Itoh, K.¹

43
- 0347535868
- Pitch and compass of the speaking voice
- Jassem, W. (1971). Pitch and compass of the speaking voice. Journal of the International Phonetic Association, 1, 59-68.
- (1971) Journal of the International Phonetic Association , vol.1 , pp. 59-68
- Jassem, W.¹

44
- 72249121867
- VocaliD: Personalizing text-to-speech synthesis for individuals with severe speech impairment
- New York, NY: ACM. doi:10.1145/1639642.1639704
- Jreige, C., Patel, R., & Bunnell, H. T. (2009). VocaliD: personalizing text-to-speech synthesis for individuals with severe speech impairment. Assets ' 09: Proceedings of the 11th International ACM SIGACCESS Conference on Computers and Accessibility (pp. 259-260). New York, NY: ACM. doi:10.1145/1639642.1639704
- (2009) Assets ' 09: Proceedings of the 11th International ACM SIGACCESS Conference on Computers and Accessibility , pp. 259-260
- Jreige, C.¹ Patel, R.² Bunnell, H.T.³

45
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- May Paper presented at Seattle, WA doi:10.1109/ICASSP.1998.674423
- Kain, A., & Macon, M. W. (1998, May). Spectral voice conversion for text-to-speech synthesis. Paper presented at the IEEE Interational Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seattle, WA. 285-288. doi:10.1109/ICASSP.1998.674423
- (1998) The IEEE Interational Conference on Acoustics, Speech, and Signal Processing (ICASSP) , pp. 285-288
- Kain, A.¹ MacOn, M.W.²

46
- 0034841948
- Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
- May Paper presented at Salt Lake City, UT. doi:10.1109/ICASSP.2001.941039
- Kain, A., & Macon, M. W. (2001, May). Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Salt Lake City, UT. doi:10.1109/ICASSP. 2001.941039
- (2001) The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Kain, A.¹ MacOn, M.W.²

47
- 85133413596
- Formant re-synthesis of dysarthric speech
- June Paper presented at Pittsburgh, PA. Retrieved from
- Kain, A., Niu, X., Hosom, J.-P., Miao, Q., & van Santen, J. P. H. (2004, June). Formant re-synthesis of dysarthric speech. Paper presented at the 5th ISCA Speech Synthesis Workshop (SSW5), Pittsburgh, PA. Retrieved from http://www.isca-speech.org/archive-open/ssw5/ssw5-025.html
- (2004) The 5th ISCA Speech Synthesis Workshop(SSW5)
- Kain, A.¹ Niu, X.² Hosom, J.-P.³ Miao, Q.⁴ Van Santen, J.P.H.⁵

48
- 70349210296
- Using speech transformation to increase speech intelligibility for the hearing-and speakingimpaired
- April Paper presented at Taipei, Taiwan. doi:10.1109/icassp.2009.4960406
- Kain, A., & van Santen, J. (2009, April). Using speech transformation to increase speech intelligibility for the hearing-and speakingimpaired. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan. doi:10.1109/icassp.2009.4960406
- (2009) The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Kain, A.¹ Van Santen, J.²

49
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased f0 extraction: Possible role of a repetitive structure in sounds
- Kawahara, H., Masuda-Katsuse, I., & de Cheveigné, A. (1999). Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased f0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, 27, 187-207.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

50
- 77953705589
- The Blizzard Challenge 2009
- September Paper presented at Edinburgh, UK
- King, S. & Karaiskos, V. (2009, September). The Blizzard Challenge 2009. Paper presented at the Blizzard Challenge Workshop, Edinburgh, UK.
- (2009) The Blizzard Challenge Workshop
- King, S.¹ Karaiskos, V.²

51
- 0026206653
- Comparing discrimination and recognition of unfamiliar voices
- doi:10.1016/1067-6393(91)90016-M
- Krieman, J., & Papcun, G. (1991). Comparing discrimination and recognition of unfamiliar voices. Speech Communication, 10, 265-275. doi:10.1016/1067-6393(91)90016-M
- (1991) Speech Communication , vol.10 , pp. 265-275
- Krieman, J.¹ Papcun, G.²

52
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- Kuhn, R, Junqua, J.-C., Nguyen, P., & Niedzielski, N. (2000). Rapid speaker adaptation in eigenvoice space. IEEE Transactions on Acoustics, Speech, and Signal Processing, 8, 695-707.
- (2000) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.8 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

53
- 25844437809
- The ability of listeners to identify voices
- Los Angeles, CA: UCLA Phonetics Lab
- Ladefoged, O., & Ladefoged, J. (1980). The ability of listeners to identify voices. UCLA Working Papers in Phonetics, 49, 43-51. Los Angeles, CA: UCLA Phonetics Lab.
- (1980) UCLA Working Papers in Phonetics , vol.49 , pp. 43-51
- Ladefoged, O.¹ Ladefoged, J.²

54
- 0023793337
- Listeners ' perceptions of nonspeech characteristics of normal and dysarthric children
- Lass, N. J., Ruscello, D. M., & Lakawicz, J. A. (1988). Listeners ' perceptions of nonspeech characteristics of normal and dysarthric children. Journal of Communication Disorders, 21, 385-391.
- (1988) Journal of Communication Disorders , vol.21 , pp. 385-391
- Lass, N.J.¹ Ruscello, D.M.² Lakawicz, J.A.³

55
- 0033883193
- The effects of acoustic modifi cations on the identifi cation of familiar voices speaking isolated vowels
- Lavner, Y., Gath, I., & Rosenhouse, J. (2000). The effects of acoustic modifi cations on the identifi cation of familiar voices speaking isolated vowels. Speech Communication, 30, 9-26.
- (2000) Speech Communication , vol.30 , pp. 9-26
- Lavner, Y.¹ Gath, I.² Rosenhouse, J.³

56
- 0004266447
- Cambridge, MA: MIT Press
- Lehiste, I. (1970). Suprasegmentals. Cambridge, MA: MIT Press.
- (1970) Suprasegmentals
- Lehiste, I.¹

57
- 0002482529
- Suprasegmental features of speech
- N.J. Lass (Ed.) New York, NY: Academic Press
- Lehiste, I. (1976). Suprasegmental features of speech. In N.J. Lass (Ed.), Contemporary issues in experimental phonetics (pp. 225-239). New York, NY: Academic Press.
- (1976) Contemporary Issues in Experimental Phonetics , pp. 225-239
- Lehiste, I.¹

58
- 68149157315
- Integrating articulatory features into HMM-based parametric speech synthesis
- doi:10.1109/tasl.2009.2014796
- Ling Z.-H., Richmond, K., Yamagishi, J., & Wang, R.-H. (2009). Integrating articulatory features into HMM-based parametric speech synthesis. IEEE Transactions on Audio, Speech, and Language Processing, 17, 1171-1185. doi:10.1109/tasl.2009.2014796
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , pp. 1171-1185
- Ling, Z.-H.¹ Richmond, K.² Yamagishi, J.³ Wang, R.-H.⁴

59
- 77955426622
- An analysis of HMM-based prediction of articulatory movements
- Ling, Z.-H., Richmond, K., & Yamagishi, J. (2010a). An analysis of HMM-based prediction of articulatory movements. Speech Communication, 52, 834-846.
- (2010) Speech Communication , vol.52 , pp. 834-846
- Ling, Z.-H.¹ Richmond, K.² Yamagishi, J.³

60
- 79959823601
- HMM-based Text-to-Articulation-movement prediction and analysis of critical articulators
- September Paper presented at Makuhari, Japan. Retrieved from
- Ling, Z.-H., Richmond, K., & Yamagishi, J. (2010b, September). HMM-based Text-to-Articulation-movement prediction and analysis of critical articulators. Paper presented at Interspeech 2010, Makuhari, Japan. Retrieved from : http://hdl.handle. net/1842.4563.
- (2010) Interspeech 2010
- Ling, Z.-H.¹ Richmond, K.² Yamagishi, J.³

61
- 0031601187
- Acoustic correlates of perceived versus actual sexual orientation in men ' s speech
- Linville, S. (1998). Acoustic correlates of perceived versus actual sexual orientation in men ' s speech. Folia Phoniatrica et Logopaedica, 50, 35-48.
- (1998) Folia Phoniatrica et Logopaedica , vol.50 , pp. 35-48
- Linville, S.¹

62
- 0030696416
- Voice characteristics conversion for HMM-based speech synthesis system
- April Paper presented at Munich, Germany. doi:10.1109/ICASSP.2009.4960406
- Masuko, T., Tokuda, K., Kobayashi, T., & Imai, S. (1997, April). Voice characteristics conversion for HMM-based speech synthesis system. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Munich, Germany. doi:10.1109/ICASSP.2009.4960406
- (1997) The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

63
- 84961456523
- Identifying the nonspeaking population: A demographic study
- Matas, J., Mathy-Laikko, P., Beukelman, D., & Legresley, K. (1985). Identifying the nonspeaking population: A demographic study. Augmentative and Alternative Communication, 1, 17-31.
- (1985) Augmentative and Alternative Communication , vol.1 , pp. 17-31
- Matas, J.¹ Mathy-Laikko, P.² Beukelman, D.³ Legresley, K.⁴

64
- 0017665638
- Study of variations in the male and female glottal wave
- Monsen, R. B., & Engebretson, A. M. (1977). Study of variations in the male and female glottal wave. Journal of the Acoustical Society of America, 62, 981-993.
- (1977) Journal of the Acoustical Society of America , vol.62 , pp. 981-993
- Monsen, R.B.¹ Engebretson, A.M.²

65
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Moulines, E., & Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9, 453-467.
- (1990) Speech Communication , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

66
- 33744907285
- The acoustic and perceptual bases of judgments of women and men ' s sexual orientation from read speech
- Munson, B., McDonald, E. C., DeBoe, N. L., & White, A. R. (2006). The acoustic and perceptual bases of judgments of women and men ' s sexual orientation from read speech. Journal of Phonetics, 34, 202-240.
- (2006) Journal of Phonetics , vol.34 , pp. 202-240
- Munson, B.¹ McDonald, E.C.² Deboe, N.L.³ White, A.R.⁴

67
- 0027447292
- Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
- doi:10.1121/1.405558
- Murray, I. R., & Arnott, J. L. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America, 93, 1097-1108. doi:10.1121/1.405558
- (1993) Journal of the Acoustical Society of America , vol.93 , pp. 1097-1108
- Murray, I.R.¹ Arnott, J.L.²

68
- 84906279165
- Optimizations and fi tting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis
- August Paper presented at Lyon, France. Retrieved from
- Muthukumar, P.K., Black, A.W., & Bunnell, H.T. (2013, August). Optimizations and fi tting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis. Paper presented at InterSpeech 2013, Lyon, France. Retrieved from http://www. isca-speech.org/archive/interspeech- 2013/i13-0397.html
- (2013) InterSpeech 2013
- Muthukumar, P.K.¹ Black, A.W.² Bunnell, H.T.³

69
- 0029254176
- Transformation of formants for voice conversion using artifi cial neural networks
- Narendranath, M., Murthy, H. A., Rajendran, S., & Yegnanarayana, B. (1995). Transformation of formants for voice conversion using artifi cial neural networks. Speech Communication 16, 207-216.
- (1995) Speech Communication , vol.16 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

70
- 85006544659
- Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency attraction
- Nass, C., & Lee, K. M. (2001). Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency attraction. Journal of Experimental Psychology: Applied, 7, 171-181.
- (2001) Journal of Experimental Psychology: Applied , vol.7 , pp. 171-181
- Nass, C.¹ Lee, K.M.²

71
- 0010592593
- Speech Physiology
- F. Minifi e, T. J. Hixon, & F. Williams (Eds.) Englewood Cliffs, NJ: Prentice-Hall
- Netsell, R. (1973). Speech Physiology. In F. Minifi e, T. J. Hixon, & F. Williams (Eds.), Normal aspects of speech, hearing, and language (pp. 211-234). Englewood Cliffs, NJ: Prentice-Hall.
- (1973) Normal Aspects of Speech, Hearing, and Language , pp. 211-234
- Netsell, R.¹

72
- 51549110156
- Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model
- doi:10.1109/CCE.2008.4578962
- Nguyen, B. P., & Akagi, M. (2008). Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model. Proceedings of the Second International Conference on Communications and Electronics, 224-229. doi:10.1109/CCE.2008.4578962
- (2008) Proceedings of the Second International Conference on Communications and Electronics , pp. 224-229
- Nguyen, B.P.¹ Akagi, M.²

73
- 0003542797
- Cambridge, UK: Cambridge University Press
- Nolan, F. (1983). The phonetic bases of speaker recognition. Cambridge, UK: Cambridge University Press.
- (1983) The Phonetic Bases of Speaker Recognition
- Nolan, F.¹

74
- 34547527563
- A parametric approach for voice conversion
- June Paper presented at Barecelona, Spain. Retrieved from
- Nurminen, J, Popa, V., Tian, J., Tang, Y., & Kiss, I. (2006, June). A parametric approach for voice conversion. Paper presented at the TC-STAR Workshop on Speech-to-Speech Translation. Barecelona, Spain. Retrieved from http://www.tcstar. org/pubblicazioni/scientific-publications/Nokia/2006/ S2STranslation06-nokia3.pdf
- (2006) The TC-STAR Workshop on Speech-to-Speech Translation
- Nurminen, J.¹ Popa, V.² Tian, J.³ Tang, Y.⁴ Kiss, I.⁵

75
- 84965395212
- Speech perception as a talker-contingent process
- Nygaard, L. C., Sommers, M. S., & Pisoni, D. B. (1994). Speech perception as a talker-contingent process. Psychological Science, 5, 42-46.
- (1994) Psychological Science , vol.5 , pp. 42-46
- Nygaard, L.C.¹ Sommers, M.S.² Pisoni, D.B.³

76
- 0036503253
- Phonatory control in adults with cerebral palsy and severe dysarthria
- Patel, R. (2002a). Phonatory control in adults with cerebral palsy and severe dysarthria. Augmentative and Alternative Communication, 18, 2-10.
- (2002) Augmentative and Alternative Communication , vol.18 , pp. 2-10
- Patel, R.¹

77
- 0036787690
- Prosodic Control in severe dysarthria: Preserved ability to mark the question-statement contrast
- Patel, R. (2002b). Prosodic Control in severe dysarthria: Preserved ability to mark the question-statement contrast. Journal of Speech, Language, and Hearing Research, 45, 858-870.
- (2002) Journal of Speech, Language, and Hearing Research , vol.45 , pp. 858-870
- Patel, R.¹

78
- 0347586935
- Acoustic characteristics of the question-statement contrast in severe dysarthria due to cerebral palsy
- Patel, R. (2003). Acoustic characteristics of the question-statement contrast in severe dysarthria due to cerebral palsy. Journal of Speech, Language, and Hearing Research, 46, 1401-1415.
- (2003) Journal of Speech, Language, and Hearing Research , vol.46 , pp. 1401-1415
- Patel, R.¹

79
- 10944259978
- The acoustics of contrastive prosody in adults with cerebral palsy
- Patel, R. (2004). The acoustics of contrastive prosody in adults with cerebral palsy. Journal of Medical Speech-Language Pathology, 12, 189-193.
- (2004) Journal of Medical Speech-Language Pathology , vol.12 , pp. 189-193
- Patel, R.¹

80
- 84865394622
- Intelligibility and attitudes toward a speech synthesizer using dysarthric vocalizations
- Patel, R., & Roden, A. (2008). Intelligibility and attitudes toward a speech synthesizer using dysarthric vocalizations. Journal of Medical Speech-Language Pathology, 16, 243-249.
- (2008) Journal of Medical Speech-Language Pathology , vol.16 , pp. 243-249
- Patel, R.¹ Roden, A.²

81
- 85044897711
- Using computer games to mediate caregiver-child communication for children with severe dysarthria
- Patel, R., & Salata, A. (2006). Using computer games to mediate caregiver-child communication for children with severe dysarthria. Journal of Medical Speech-Language Pathology, 14, 279-284.
- (2006) Journal of Medical Speech-Language Pathology , vol.14 , pp. 279-284
- Patel, R.¹ Salata, A.²

82
- 34347399580
- Stress identifi cation in speakers with dysarthria due to cerebral palsy: An initial report
- Patel, R., & Watkins, C. (2007). Stress identifi cation in speakers with dysarthria due to cerebral palsy: An initial report. Journal of Medical Speech-Language Pathology, 15, 149-159.
- (2007) Journal of Medical Speech-Language Pathology , vol.15 , pp. 149-159
- Patel, R.¹ Watkins, C.²

83
- 84867594339
- Local linear transformation for voice conversion
- March Paper presented at Kyoto, Japan. doi:10.1109/ICASSP.2012.6288922
- Popa, V., Silen, H., Nurminen, J., & Gabbouj, M. (2012, March). Local linear transformation for voice conversion. Paper presented at the 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Kyoto, Japan. doi:10.1109/ICASSP.2012.6288922
- (2012) The 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Popa, V.¹ Silen, H.² Nurminen, J.³ Gabbouj, M.⁴

84
- 6344261553
- The influence of sexual orientation on vowel production
- Pierrehumbert, J., Bent, T., Munson, B., Bradlow, A. R., & Bailey, J. M. (2004). The influence of sexual orientation on vowel production. Journal of the Acoustical Society of America, 116, 1905-1908.
- (2004) Journal of the Acoustical Society of America , vol.116 , pp. 1905-1908
- Pierrehumbert, J.¹ Bent, T.² Munson, B.³ Bradlow, A.R.⁴ Bailey, J.M.⁵

85
- 33748443739
- Extraction of speaker-specifi c excitation information from linear prediction residual of speech
- doi:10.1016/j.specom.2006.06.002
- Prasanna, S. R. M., Gupta, C. S., & Yegnanarayana, B. (2006). Extraction of speaker-specifi c excitation information from linear prediction residual of speech. Speech Communication 48, 1243-1261. doi:10.1016/j.specom. 2006.06.002
- (2006) Speech Communication , vol.48 , pp. 1243-1261
- Prasanna, S.R.M.¹ Gupta, C.S.² Yegnanarayana, B.³

86
- 77957744515
- HMM-based speech synthesis utilizing glottal inverse fi ltering
- doi:10.1109/TASL.2010.2045239
- Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M., & Alku, P. (2011). HMM-based speech synthesis utilizing glottal inverse fi ltering. IEEE Transactions on Audio, Speech, and Language Processing, 19, 153-165. doi:10.1109/TASL.2010.2045239
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , pp. 153-165
- Raitio, T.¹ Suni, A.² Yamagishi, J.³ Pulakka, H.⁴ Nurminen, J.⁵ Vainio, M.⁶ Alku, P.⁷

87
- 0031156447
- Talker identifi cation based on phonetic information
- Remez, R. E., Fellowes, J. M., & Rubin, P. E. (1997). Talker identifi cation based on phonetic information. Journal of Experimental Psychology, 23, 651-666.
- (1997) Journal of Experimental Psychology , vol.23 , pp. 651-666
- Remez, R.E.¹ Fellowes, J.M.² Rubin, P.E.³

88
- 0027525457
- On the intonation of sinusoidal sentences: Contour and pitch height
- Remez, R. E., & Rubin, P. E. (1993). On the intonation of sinusoidal sentences: Contour and pitch height. Journal of the Acoustical Society of America, 94, 1983-1988.
- (1993) Journal of the Acoustical Society of America , vol.94 , pp. 1983-1988
- Remez, R.E.¹ Rubin, P.E.²

89
- 4544361661
- Voice conversion through transformation of spectral and intonation features
- May Paper presented at Montreal, Canada. doi:10.1109/ICASSP.2004.1325912
- Rentzos, D., Vaseghi, S., Yan, W., & Ho, C.-H. (2004, May). Voice conversion through transformation of spectral and intonation features. Paper presented at the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Montreal, Canada. doi:10.1109/ICASSP.2004.1325912
- (2004) The 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Rentzos, D.¹ Vaseghi, S.² Yan, W.³ Ho, C.-H.⁴

90
- 0023756465
- Speech synthesis by rule using an optimal selection of non-uniform synthesis units
- May Paper presented at New York, NY. doi:10.1109/ICASSP.1988.196677
- Sagisaka, Y. (1988, May). Speech synthesis by rule using an optimal selection of non-uniform synthesis units. Paper presented at the 1988 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New York, NY. doi:10.1109/ICASSP.1988.196677
- (1988) The 1988 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Sagisaka, Y.¹

91
- 34547507542
- Frequency warping based on mapping formant parameters
- September Paper presented at Pittsburgh, PA
- Shuang, Z-W., Bakis, R., Shectman, S., Chazan, D., & Qin, Y. (2006, September). Frequency warping based on mapping formant parameters. Paper presented at Interspeech 2006, Pittsburgh, PA. http://www.isca-speech.org/ archive/interspeech-2006/i06-1768.html
- (2006) Interspeech 2006
- Shuang, Z.-W.¹ Bakis, R.² Shectman, S.³ Chazan, D.⁴ Qin, Y.⁵

92
- 84887611468
- Augmentative and alternative communication
- J. H. Stone & M. Blouin (Eds.) Available online
- Sigafoos, J., Schlosser, R. W., & Sutherland, D. 2013. Augmentative and alternative communication. In: J. H. Stone & M. Blouin (Eds.), International encyclopedia of rehabilitation. Available online: http://cirrie.buffalo.edu/encyclopedia/en/article/50
- (2013) International Encyclopedia of Rehabilitation
- Sigafoos, J.¹ Schlosser, R.W.² Sutherland, D.³

93
- 78649382869
- A survey of augmentative and alternative communication service provision in Hong Kong
- Siu, E., Tam, E., Sin, D, Ng, C., Lam, E., Chui, M., Lam, C. (2010). A survey of augmentative and alternative communication service provision in Hong Kong. Augmentative and Alternative Communication, 26, 289-298.
- (2010) Augmentative and Alternative Communication , vol.26 , pp. 289-298
- Siu, E.¹ Tam, E.² Sin, D.³ Ng, C.⁴ Lam, E.⁵ Chui, M.⁶ Lam, C.⁷

94
- 0038042432
- Male voices and perceived sexual orientation: An experiment and theoretical approach
- Smyth, R., Jacobs, G., & Rogers, H. (2003). Male voices and perceived sexual orientation: An experiment and theoretical approach. Language and Society, 32, 329-350.
- (2003) Language and Society , vol.32 , pp. 329-350
- Smyth, R.¹ Jacobs, G.² Rogers, H.³

95
- 0004129646
- Cambridge, MA: MIT Press
- Stevens, K. N. (1998). Acoustic phonetics. Cambridge, MA: MIT Press.
- (1998) Acoustic Phonetics
- Stevens, K.N.¹

96
- 85009086192
- Diphone concatenation using a harmonic plus noise model of speech
- September Paper presented at Rhodes, Greece. Retrieved from
- Stylianou, Y., Dutoit, T., & Schroeter, J. (1997, September). Diphone concatenation using a harmonic plus noise model of speech. Paper presented at Eurospeech 1997, Rhodes, Greece. Retrieved from: http://www.isca-speech.org/ archive/eurospeech-1997/e97-0613. html
- (1997) Eurospeech 1997
- Stylianou, Y.¹ Dutoit, T.² Schroeter, J.³

97
- 84946753271
- VTLNbased cross-language voice conversion
- December Paper presented at St. Thomas, Virgin Islands. doi:10.1109/ASRU.2003.1318521
- Sunderman, D., Ney, H, & Hoge, H. (2003, December). VTLNbased cross-language voice conversion. Paper presented at the 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), St. Thomas, Virgin Islands. doi:10.1109/ASRU.2003.1318521
- (2003) The 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
- Sunderman, D.¹ Ney, H.² Hoge, H.³

98
- 84878394226
- Text-to-speech intelligibility across speech rates
- September Paper presented at Portland, OR. Retrieved from
- Syrdal, A. K., Bunnell, H. T., Hertz, S. R., Mishra, T., Spiegel, M., Bickley, C., Makashay, M. J. (2012, September). Text-to-speech intelligibility across speech rates. Paper presented at InterSpeech 2012, Portland, OR. Retrieved from : http://www.isca-speech.org/archive/interspeech-2012/i12-0623. html
- (2012) InterSpeech 2012
- Syrdal, A.K.¹ Bunnell, H.T.² Hertz, S.R.³ Mishra, T.⁴ Spiegel, M.⁵ Bickley, C.⁶ Makashay, M.J.⁷

99
- 0003058857
- On the basic scheme and algorithms in non-uniform unit speech synthesis
- G. Bailly, C. Benoît, & T. R. Sawallis (Eds.) Amsterdam, The Netherlands: North-Holland Publishing Co
- Takeda, K., Abe, K., & Sagisaka, Y. (1992). On the basic scheme and algorithms in non-uniform unit speech synthesis. In G. Bailly, C. Benoît, & T. R. Sawallis (Eds.), Talking machines: Theories, models, and designs (pp. 93-105). Amsterdam, The Netherlands: North-Holland Publishing Co.
- (1992) Talking Machines: Theories, Models, and Designs , pp. 93-105
- Takeda, K.¹ Abe, K.² Sagisaka, Y.³

100
- 0034842740
- Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
- May Paper presented at Salt Lake City, UT. doi:10.1109/ICASSP.2001.941037
- Tamura, M., Masuko, T., Tokuda, K., & Kobayashi, T. (2001, May). Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR. Paper presented at the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Salt Lake City, UT. doi:10.1109/ICASSP.2001. 941037
- (2001) The 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

101
- 85009069262
- Straight-based voice conversion algorithm based on Gaussian mixture model
- October Paper presented at Beijing, China. Retrieved from
- Toda, T., Lu, J., Saruwatari, H., & Shikano, K. (2000, October). Straight-based voice conversion algorithm based on Gaussian mixture model. Paper presented at the Sixth International Conference on Spoken Language Processing, Beijing, China. Retrieved from: http://hdl.handle.net/10061/8187
- (2000) The Sixth International Conference on Spoken Language Processing
- Toda, T.¹ Lu, J.² Saruwatari, H.³ Shikano, K.⁴

102
- 34547496175
- One-to-many and many-to-one voice conversion based on eigenvoices
- April Paper presented at Honolulu, HI doi:10.1109/ICASSP.2007.367303
- Toda, T., Ohtani, Y., & Shikano, K. (2007a, April). One-to-many and many-to-one voice conversion based on eigenvoices. Paper presented at the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Honolulu, HI. doi:10.1109/ICASSP.2007.367303
- (2007) The 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

103
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- doi:10.1109/tasl.2007.907344
- Toda, T., Black, A. W., & Tokuda, K. (2007b). Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory. IEEE Transactions on Audio, Speech, and Language Processing, 15, 2222-2235. doi:10.1109/tasl.2007.907344
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

104
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- June Paper presented at Istanbul, Turkey. doi:10.1109/ICASSP.2000.861820
- Tokuda, K., Yoshimura, T., Masuko, T., Kobayashi, T., & Kitamura, T. (2000, June). Speech parameter generation algorithms for HMM-based speech synthesis. Paper presented at the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Istanbul, Turkey. doi:10.1109/ICASSP.2000.861820
- (2000) The 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

105
- 0027930431
- Speaker race identifi cation from acoustic cues in the vocal signal
- Walton, J., & Orlikoff, R. (1994). Speaker race identifi cation from acoustic cues in the vocal signal. Journal of Speech Language and Hearing Research 37, 4, 738-745.
- (1994) Journal of Speech Language and Hearing Research , vol.37 , Issue.4 , pp. 738-745
- Walton, J.¹ Orlikoff, R.²

106
- 84959174906
- HMM-based synthesis of child speech
- October Paper presented at Chania, Greece. Retrieved from
- Watts, O., Yamagishi, J., Berkling, K., & King, S. (2008, October). HMM-based synthesis of child speech. Paper presented at the First Workshop on Child, Computer and Interaction (ICMI ' 08 post-conference workshop), Chania, Greece. Retrieved from: http://hdl.handle.net/1842/3817
- (2008) The First Workshop on Child, Computer and Interaction (ICMI ' 08 Post-conference Workshop)
- Watts, O.¹ Yamagishi, J.² Berkling, K.³ King, S.⁴

107
- 70450188371
- HMM adaptation and voice conversion for the synthesis of child speech: A comparison
- September Paper presented at Brighton, United Kingdom. Retrieved from
- Watts, O., Yamagishi, J., King, S., & Berkling, K. (2009, September). HMM adaptation and voice conversion for the synthesis of child speech: A comparison. Paper presented at Interspeech 2009, Brighton, United Kingdom. Retrieved from: http://www.iscaspeech. org/archive/interspeech-2009/i09-2627. html
- (2009) Interspeech 2009
- Watts, O.¹ Yamagishi, J.² King, S.³ Berkling, K.⁴

108
- 77953723062
- Synthesis of child speech with HMM adaptation and voice conversion
- doi:10.1109/TASL.2009.2035029
- Watts, O., Yamagishi, J., King, S., & Berkling, K. (2010). Synthesis of child speech with HMM adaptation and voice conversion. IEEE Transactions on Audio, Speech, and Language Processing, 18, 1005-1016. doi:10.1109/TASL.2009. 2035029
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , pp. 1005-1016
- Watts, O.¹ Yamagishi, J.² King, S.³ Berkling, K.⁴

109
- 33847129573
- Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
- Yamagishi, J., & Kobayashi, T. (2007). Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training. IEICE Transactions on Information and Systems, E 90-D, 533-543.
- (2007) IEICe Transactions on Information and Systems, e , vol.90-D , pp. 533-543
- Yamagishi, J.¹ Kobayashi, T.²

110
- 85008006694
- A robust speaker-adaptive HMMbased text-to-speech synthesis
- doi:10.1109/TASL.2009.2016394
- Yamagishi, J., Nose, T., Zen, H., Ling, Z., Toda, T., Tokuda, K., Renals, S. (2009). A robust speaker-adaptive HMMbased text-to-speech synthesis. IEEE Transactions on Audio, Speech, and Language Processing, 17, 1208-1230. doi:10.1109/TASL.2009.2016394
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , pp. 1208-1230
- Yamagishi, J.¹ Nose, T.² Zen, H.³ Ling, Z.⁴ Toda, T.⁵ Tokuda, K.⁶ Renals, S.⁷

111
- 67651002140
- Statistical parametric speech synthesis
- Zen, H., Tokuda, K., & Black, A. W. (2009). Statistical parametric speech synthesis. Speech Communication, 51, 1039-1064.
- (2009) Speech Communication , vol.51 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

112
- 21144470404
- The attractive voice: What makes it so?
- Zuckerman, M., & Miyake, K. (1993). The attractive voice: What makes it so? Journal of Nonverbal Behaviour, 17, 119-135.
- (1993) Journal of Nonverbal Behaviour , vol.17 , pp. 119-135
- Zuckerman, M.¹ Miyake, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.