-
1
-
-
0026881384
-
Glottal wave analysis with pitch synchronous iterative adaptive inverse fi ltering
-
Alku, P. (1992). Glottal wave analysis with pitch synchronous iterative adaptive inverse fi ltering. Speech Communication, 11, 109-118.
-
(1992)
Speech Communication
, vol.11
, pp. 109-118
-
-
Alku, P.1
-
2
-
-
0033154052
-
Speaker transformation algorithm using segmental codebooks (STASC)
-
Arslan, L. (1999). Speaker transformation algorithm using segmental codebooks (STASC). Speech Communication, 28, 211-226.
-
(1999)
Speech Communication
, vol.28
, pp. 211-226
-
-
Arslan, L.1
-
3
-
-
0032797216
-
Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech
-
Bachorowski, J., & Owren, M. (1999). Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech. Journal of the Acoustical Society of America, 106, 1054-1063.
-
(1999)
Journal of the Acoustical Society of America
, vol.106
, pp. 1054-1063
-
-
Bachorowski, J.1
Owren, M.2
-
4
-
-
0030166343
-
The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences
-
Benoît, C., Grice, M., & Hazan, V. (1996). The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using semantically unpredictable sentences. Speech Communication, 18, 381-392.
-
(1996)
Speech Communication
, vol.18
, pp. 381-392
-
-
Benoît, C.1
Grice, M.2
Hazan, V.3
-
5
-
-
85133503504
-
Diphone synthesis using unit selection
-
November Paper presented at Blue Mountains, Australia. Retrieved from
-
Beutnagel, M., Conkie, A., & Syrdal, A. K. (1998, November). Diphone synthesis using unit selection. Paper presented at the 3rd ISCA Speech Synthesis Workshop (SSW3), Blue Mountains, Australia. Retrieved from http://www.isca- speech.org/archive-open/archive-papers/ssw3/ssw3-185.pdf
-
(1998)
The 3rd ISCA Speech Synthesis Workshop (SSW3)
-
-
Beutnagel, M.1
Conkie, A.2
Syrdal, A.K.3
-
6
-
-
84961425284
-
A statewide demographic survey of people with severe communication impairments
-
Bloomberg, K., & Johnson, H. (1990). A statewide demographic survey of people with severe communication impairments. Augmentative and Alternative Communication, 6, 50-60.
-
(1990)
Augmentative and Alternative Communication
, vol.6
, pp. 50-60
-
-
Bloomberg, K.1
Johnson, H.2
-
7
-
-
0001856243
-
Contrastive accent and contrastive stress
-
Bolinger, D. (1961). Contrastive accent and contrastive stress, Language, 37, 83-96.
-
(1961)
Language
, vol.37
, pp. 83-96
-
-
Bolinger, D.1
-
8
-
-
0003708078
-
-
Palo Alto, CA: Stanford University Press
-
Bolinger, D. (1989). Intonation and its uses. Palo Alto, CA: Stanford University Press.
-
(1989)
Intonation and Its Uses
-
-
Bolinger, D.1
-
9
-
-
0026694625
-
A survey of the communication-impaired population of Tayside
-
Brophy-Arnott, M. B., Newell, A. F., Arnott, J. L., & Condie, D. (1992). A survey of the communication-impaired population of Tayside. European Journal of Disorders of Communication, 25, 159-173.
-
(1992)
European Journal of Disorders of Communication
, vol.25
, pp. 159-173
-
-
Brophy-Arnott, M.B.1
Newell, A.F.2
Arnott, J.L.3
Condie, D.4
-
10
-
-
84907049577
-
Crafting small databases for unit selection TTS: Effects on intelligibility
-
September Paper presented at Kyoto, Japan. Retrieved from
-
Bunnell, H. T. (2010, September). Crafting small databases for unit selection TTS: Effects on intelligibility. Paper presented at the 7th ISCA Speech Synthesis Workshop (SSW7), Kyoto, Japan. Retrieved from http://isw3.naist.jp/∼ tomoki/ssw7/www/doc/ssw7-proceedings-rev.pdf
-
(2010)
The 7th ISCA Speech Synthesis Workshop (SSW7)
-
-
Bunnell, H.T.1
-
11
-
-
85039153976
-
A biphone constrained concatenation method for diphone synthesis
-
November Paper presented at Blue Mountains, Australia. Retrieved from
-
Bunnell, H. T., Hoskins, S. R., & Yarrington, D. M. (1998, November). A biphone constrained concatenation method for diphone synthesis. Paper presented at the 3rd ISCA Speech Synthesis Workshop (SSW3), Blue Mountains, Australia. Retrieved from http://www.isca-speech.org/archive-open/archive- papers/ssw3/ssw3-171.pdf
-
(1998)
The 3rd ISCA Speech Synthesis Workshop(SSW3)
-
-
Bunnell, H.T.1
Hoskins, S.R.2
Yarrington, D.M.3
-
12
-
-
85133491738
-
Analysis methods for assessing TTS intelligibility
-
August Paper presented at Bonn, Germany. Retrieved from
-
Bunnell, H. T., & Lilley, J. (2007, August). Analysis methods for assessing TTS intelligibility. Paper presented at the 6th ISCA Speech Synthesis Workshop (SSW6), Bonn, Germany. Retrieved from http://www.isca-speech.org/ archive-open/archive-papers/ssw6/ssw6-374.pdf
-
(2007)
The 6th ISCA Speech Synthesis Workshop(SSW6)
-
-
Bunnell, H.T.1
Lilley, J.2
-
13
-
-
84899214271
-
Advances in computer speech synthesis and implications for assistive technologies
-
J. Mullenix & S. Stern (Eds.) Hershey, PA: IGI Global
-
Bunnell, H. T., & Pennington, C. (2010). Advances in computer speech synthesis and implications for assistive technologies. In J. Mullenix & S. Stern (Eds.), Computer synthesized speech technologies: Tools for aiding impairment (pp. 71-91). Hershey, PA: IGI Global.
-
(2010)
Computer Synthesized Speech Technologies: Tools for Aiding Impairment
, pp. 71-91
-
-
Bunnell, H.T.1
Pennington, C.2
-
14
-
-
33745218768
-
Automatic personal synthetic voice construction
-
Paper presented at Retrieved from
-
Bunnell, H. T., Pennington, C., Yarrington, D., & Gray, J. (2005). Automatic personal synthetic voice construction. Paper presented at Eurospeech 2005, 89-92. Retrieved from http://www.iscaspeech. org/archive/interspeech-2005/ i05-0089.html
-
(2005)
Eurospeech
, vol.2005
, pp. 89-92
-
-
Bunnell, H.T.1
Pennington, C.2
Yarrington, D.3
Gray, J.4
-
15
-
-
0008480934
-
-
(Doctoral dissertation). Indiana University, MI, USA
-
Carrell, T. D. (1984). Contributions of fundamental frequency, formant spacing, and glottal waveform to talker identifi cation (Doctoral dissertation). Indiana University, MI, USA.
-
(1984)
Contributions of Fundamental Frequency, Formant Spacing, and Glottal Waveform to Talker Identifi Cation
-
-
Carrell, T.D.1
-
17
-
-
84905560807
-
Voice conversion with smoothed GMM and MAP adaptation
-
September Paper presented at Geneva, Switzerland. Retrieved from
-
Chen, Y., Chu, M., Chang, E., Liu, J., & Liu, R. (2003, September). Voice conversion with smoothed GMM and MAP adaptation. Paper presented at Eurospeech 2003, Geneva, Switzerland. Retrieved from: http://www.isca-speech. org/archive/eurospeech-2003/e03-2413.html
-
(2003)
Eurospeech 2003
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
19
-
-
0034490567
-
Men ' s voices and women ' s choices
-
Collins, S. (2000). Men ' s voices and women ' s choices. Animal Behavior, 60, 773-780.
-
(2000)
Animal Behavior
, vol.60
, pp. 773-780
-
-
Collins, S.1
-
21
-
-
77953693885
-
Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit
-
J. Mullenix & S. Stern (Eds.) Hershey, PA: IGI Global
-
Creer, S., Green, P., Cunningham, S., & Yamagishi, J. (2010). Building personalized synthetic voices for individuals with dysarthria using the HTS toolkit. In J. Mullenix & S. Stern (Eds.), Computer synthesized speech technologies: Tools for aiding impairment (pp. 92-115). Hershey, PA: IGI Global.
-
(2010)
Computer Synthesized Speech Technologies: Tools for Aiding Impairment
, pp. 92-115
-
-
Creer, S.1
Green, P.2
Cunningham, S.3
Yamagishi, J.4
-
22
-
-
84976113790
-
Falls and rises: Meanings and universals
-
Cruttenden, A. (1981). Falls and rises: meanings and universals. Journal of Linguistics 17, 77-91.
-
(1981)
Journal of Linguistics
, vol.17
, pp. 77-91
-
-
Cruttenden, A.1
-
23
-
-
0004239281
-
-
Cambridge, UK: Cambridge University Press
-
Cruttenden, A. (1986). Intonation. Cambridge, UK: Cambridge University Press.
-
(1986)
Intonation
-
-
Cruttenden, A.1
-
24
-
-
77953707533
-
Spectral mapping using artifi cial neural networks for voice conversion
-
Desai, S., Black, A. W., Yegnanarayana, B., & Prahallad, K. (2010). Spectral mapping using artifi cial neural networks for voice conversion. IEEE Transactions on Audio, Speech, and Language Processing, 18, 954-964.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, pp. 954-964
-
-
Desai, S.1
Black, A.W.2
Yegnanarayana, B.3
Prahallad, K.4
-
25
-
-
85079090632
-
A quantitative assessment of the relative speaker discriminating properties of phonemes
-
April Paper presented at Adelaide, Australia. doi:10.1109/ICASSP.1994. 389337
-
Eatock, J., & Mason, J. (1994, April). A quantitative assessment of the relative speaker discriminating properties of phonemes. Paper presented at the 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Adelaide, Australia. doi:10.1109/ICASSP.1994.389337
-
(1994)
The 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Eatock, J.1
Mason, J.2
-
26
-
-
0000337137
-
Articulation testing methods
-
Egan, J.P. (1948). Articulation testing methods. The Laryngoscope 58, 955-991.
-
(1948)
The Laryngoscope
, vol.58
, pp. 955-991
-
-
Egan, J.P.1
-
27
-
-
77953727123
-
Voice conversion based on weighted frequency warping
-
Erro, D., Moreno, A., & Bonafonte, A. (2010a). Voice conversion based on weighted frequency warping. IEEE Transactions on Audio, Speech, and Language Processing, 18, 922-931.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, pp. 922-931
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
28
-
-
77953725318
-
INCA algorithm for training voice conversion systems from nonparallel corpora
-
Erro, D., Moreno, A., & Bonafonte, A. (2010b). INCA algorithm for training voice conversion systems from nonparallel corpora. IEEE Transactions on Audio, Speech, and Language Processing, 18, 944-953.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, pp. 944-953
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
30
-
-
0002633841
-
A note on vocal tract size factors and non-uniform F-pattern scaling
-
Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from
-
Fant, G. (1966). A note on vocal tract size factors and non-uniform F-pattern scaling. Speech Transmission Laboratories Quarterly Progress Status Report, 7 (4), 22-30. Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from http://www. speech.kth.se/prod/publications/fi les/qpsr/1966/1966-7-4-022-030.pdf
-
(1966)
Speech Transmission Laboratories Quarterly Progress Status Report
, vol.7
, Issue.4
, pp. 22-30
-
-
Fant, G.1
-
31
-
-
33947684811
-
A four-parameter model of glottal fl ow
-
Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from
-
Fant, G., Liljencrants, J., & Lin, Q. (1985). A four-parameter model of glottal fl ow. Speech Transmission Laboratories Quarterly Progress Status Report, 26 (4), 1-13. Stockholm, Sweden: KTH Royal Institute of Technology. Retrieved from http://www.speech.kth. se/prod/publications/fi les/qpsr/1985/1985-26-4-001-013.pdf
-
(1985)
Speech Transmission Laboratories Quarterly Progress Status Report
, vol.26
, Issue.4
, pp. 1-13
-
-
Fant, G.1
Liljencrants, J.2
Lin, Q.3
-
32
-
-
13844254175
-
Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices
-
Feinberg, D. R., Jones, B. C., Little, A. C., Burt, D. M., & Perrett, D. I. (2005). Manipulations of fundamental and formant frequencies influence the attractiveness of human male voices. Animal Behavior, 69, 561-568.
-
(2005)
Animal Behavior
, vol.69
, pp. 561-568
-
-
Feinberg, D.R.1
Jones, B.C.2
Little, A.C.3
Burt, D.M.4
Perrett, D.I.5
-
33
-
-
0031203338
-
Perceiving the sex and identity of a talker without natural vocal timbre
-
Fellows, J. M., Remez, R. E., & Rubin, P. E. (1997). Perceiving the sex and identity of a talker without natural vocal timbre. Perception and Psychophysics 59, 839-849.
-
(1997)
Perception and Psychophysics
, vol.59
, pp. 839-849
-
-
Fellows, J.M.1
Remez, R.E.2
Rubin, P.E.3
-
34
-
-
0032878792
-
Morphology and development of the human vocal tract: A study using magnetic resonance imaging
-
Fitch, W. T., & Giedd, J. (1999). Morphology and development of the human vocal tract: A study using magnetic resonance imaging. Journal of the Acoustical Society of America, 106, 1511-1522.
-
(1999)
Journal of the Acoustical Society of America
, vol.106
, pp. 1511-1522
-
-
Fitch, W.T.1
Giedd, J.2
-
35
-
-
0028098207
-
Effects of synthetic voice output on attitudes toward the augmented communicator
-
Gorenflo, C., Gorenflo, D., & Santer, S. A. (1994). Effects of synthetic voice output on attitudes toward the augmented communicator. Journal of Speech and Hearing Research, 37, 64-68.
-
(1994)
Journal of Speech and Hearing Research
, vol.37
, pp. 64-68
-
-
Gorenflo, C.1
Gorenflo, D.2
Santer, S.A.3
-
36
-
-
0017238868
-
Perceptual features of speech for males in four perceived age decades
-
Hartman, D., & Danhauer, J. (1976). Perceptual features of speech for males in four perceived age decades. Journal of the Acoustical Society of America, 59, 713-715.
-
(1976)
Journal of the Acoustical Society of America
, vol.59
, pp. 713-715
-
-
Hartman, D.1
Danhauer, J.2
-
37
-
-
51449107658
-
LSF mapping for voice conversion with very small training sets
-
April Paper presented at Las Vegas, NV
-
Helander, E., Nurminen, J., & Gabbouj, M. (2008, April). LSF mapping for voice conversion with very small training sets. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, NV.
-
(2008)
The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Helander, E.1
Nurminen, J.2
Gabbouj, M.3
-
38
-
-
79959836789
-
Maximum a posteriori voice conversion using sequential Monte Carlo methods
-
September Paper presented at Makuhari, Japan
-
Helander, E., Silén, H., Míguez, J., & Gabbouj, M. (2010, September). Maximum a posteriori voice conversion using sequential Monte Carlo methods. Paper presented at Interspeech 2010, Makuhari, Japan.
-
(2010)
Interspeech 2010
-
-
Helander, E.1
Silén, H.2
Míguez, J.3
Gabbouj, M.4
-
39
-
-
84856141218
-
Voice conversion using dynamic kernel partial least squares regression
-
Helander, E., Silén, H., Virtanen, T., & Gabbouj, M. (2012). Voice conversion using dynamic kernel partial least squares regression. IEEE Transactions on Audio, Speech, and Language Processing, 20, 806-817.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, pp. 806-817
-
-
Helander, E.1
Silén, H.2
Virtanen, T.3
Gabbouj, M.4
-
40
-
-
44949164829
-
A model of the regularities underlying speaker variation: Evidence from hybrid synthesis
-
September Paper presented at Pittsburgh, PA. Retrieved from
-
Hertz, S.R. (2006, September). A model of the regularities underlying speaker variation: Evidence from hybrid synthesis. Paper presented at the Ninth International Conference on Spoken Language Processing (ICSLP). Pittsburgh, PA. Retrieved from http://www. novaspeech.com/Documents/interspeech2006.pdf
-
(2006)
The Ninth International Conference on Spoken Language Processing (ICSLP)
-
-
Hertz, S.R.1
-
42
-
-
0008499169
-
Perceptual analysis of speaker identity
-
S. Saito (Ed.) Burke, VA: IOS press
-
Itoh, K., (1992). Perceptual analysis of speaker identity. In: S. Saito (Ed.), Speech science and technology (pp. 133-145). Burke, VA: IOS press.
-
(1992)
Speech Science and Technology
, pp. 133-145
-
-
Itoh, K.1
-
44
-
-
72249121867
-
VocaliD: Personalizing text-to-speech synthesis for individuals with severe speech impairment
-
New York, NY: ACM. doi:10.1145/1639642.1639704
-
Jreige, C., Patel, R., & Bunnell, H. T. (2009). VocaliD: personalizing text-to-speech synthesis for individuals with severe speech impairment. Assets ' 09: Proceedings of the 11th International ACM SIGACCESS Conference on Computers and Accessibility (pp. 259-260). New York, NY: ACM. doi:10.1145/1639642.1639704
-
(2009)
Assets ' 09: Proceedings of the 11th International ACM SIGACCESS Conference on Computers and Accessibility
, pp. 259-260
-
-
Jreige, C.1
Patel, R.2
Bunnell, H.T.3
-
45
-
-
0031623661
-
Spectral voice conversion for text-to-speech synthesis
-
May Paper presented at Seattle, WA doi:10.1109/ICASSP.1998.674423
-
Kain, A., & Macon, M. W. (1998, May). Spectral voice conversion for text-to-speech synthesis. Paper presented at the IEEE Interational Conference on Acoustics, Speech, and Signal Processing (ICASSP), Seattle, WA. 285-288. doi:10.1109/ICASSP.1998.674423
-
(1998)
The IEEE Interational Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 285-288
-
-
Kain, A.1
MacOn, M.W.2
-
46
-
-
0034841948
-
Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
-
May Paper presented at Salt Lake City, UT. doi:10.1109/ICASSP.2001.941039
-
Kain, A., & Macon, M. W. (2001, May). Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Salt Lake City, UT. doi:10.1109/ICASSP. 2001.941039
-
(2001)
The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Kain, A.1
MacOn, M.W.2
-
47
-
-
85133413596
-
Formant re-synthesis of dysarthric speech
-
June Paper presented at Pittsburgh, PA. Retrieved from
-
Kain, A., Niu, X., Hosom, J.-P., Miao, Q., & van Santen, J. P. H. (2004, June). Formant re-synthesis of dysarthric speech. Paper presented at the 5th ISCA Speech Synthesis Workshop (SSW5), Pittsburgh, PA. Retrieved from http://www.isca-speech.org/archive-open/ssw5/ssw5-025.html
-
(2004)
The 5th ISCA Speech Synthesis Workshop(SSW5)
-
-
Kain, A.1
Niu, X.2
Hosom, J.-P.3
Miao, Q.4
Van Santen, J.P.H.5
-
48
-
-
70349210296
-
Using speech transformation to increase speech intelligibility for the hearing-and speakingimpaired
-
April Paper presented at Taipei, Taiwan. doi:10.1109/icassp.2009.4960406
-
Kain, A., & van Santen, J. (2009, April). Using speech transformation to increase speech intelligibility for the hearing-and speakingimpaired. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan. doi:10.1109/icassp.2009.4960406
-
(2009)
The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Kain, A.1
Van Santen, J.2
-
49
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased f0 extraction: Possible role of a repetitive structure in sounds
-
Kawahara, H., Masuda-Katsuse, I., & de Cheveigné, A. (1999). Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased f0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, 27, 187-207.
-
(1999)
Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
-
50
-
-
77953705589
-
The Blizzard Challenge 2009
-
September Paper presented at Edinburgh, UK
-
King, S. & Karaiskos, V. (2009, September). The Blizzard Challenge 2009. Paper presented at the Blizzard Challenge Workshop, Edinburgh, UK.
-
(2009)
The Blizzard Challenge Workshop
-
-
King, S.1
Karaiskos, V.2
-
51
-
-
0026206653
-
Comparing discrimination and recognition of unfamiliar voices
-
doi:10.1016/1067-6393(91)90016-M
-
Krieman, J., & Papcun, G. (1991). Comparing discrimination and recognition of unfamiliar voices. Speech Communication, 10, 265-275. doi:10.1016/1067-6393(91)90016-M
-
(1991)
Speech Communication
, vol.10
, pp. 265-275
-
-
Krieman, J.1
Papcun, G.2
-
52
-
-
0034320005
-
Rapid speaker adaptation in eigenvoice space
-
Kuhn, R, Junqua, J.-C., Nguyen, P., & Niedzielski, N. (2000). Rapid speaker adaptation in eigenvoice space. IEEE Transactions on Acoustics, Speech, and Signal Processing, 8, 695-707.
-
(2000)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.8
, pp. 695-707
-
-
Kuhn, R.1
Junqua, J.-C.2
Nguyen, P.3
Niedzielski, N.4
-
53
-
-
25844437809
-
The ability of listeners to identify voices
-
Los Angeles, CA: UCLA Phonetics Lab
-
Ladefoged, O., & Ladefoged, J. (1980). The ability of listeners to identify voices. UCLA Working Papers in Phonetics, 49, 43-51. Los Angeles, CA: UCLA Phonetics Lab.
-
(1980)
UCLA Working Papers in Phonetics
, vol.49
, pp. 43-51
-
-
Ladefoged, O.1
Ladefoged, J.2
-
54
-
-
0023793337
-
Listeners ' perceptions of nonspeech characteristics of normal and dysarthric children
-
Lass, N. J., Ruscello, D. M., & Lakawicz, J. A. (1988). Listeners ' perceptions of nonspeech characteristics of normal and dysarthric children. Journal of Communication Disorders, 21, 385-391.
-
(1988)
Journal of Communication Disorders
, vol.21
, pp. 385-391
-
-
Lass, N.J.1
Ruscello, D.M.2
Lakawicz, J.A.3
-
55
-
-
0033883193
-
The effects of acoustic modifi cations on the identifi cation of familiar voices speaking isolated vowels
-
Lavner, Y., Gath, I., & Rosenhouse, J. (2000). The effects of acoustic modifi cations on the identifi cation of familiar voices speaking isolated vowels. Speech Communication, 30, 9-26.
-
(2000)
Speech Communication
, vol.30
, pp. 9-26
-
-
Lavner, Y.1
Gath, I.2
Rosenhouse, J.3
-
57
-
-
0002482529
-
Suprasegmental features of speech
-
N.J. Lass (Ed.) New York, NY: Academic Press
-
Lehiste, I. (1976). Suprasegmental features of speech. In N.J. Lass (Ed.), Contemporary issues in experimental phonetics (pp. 225-239). New York, NY: Academic Press.
-
(1976)
Contemporary Issues in Experimental Phonetics
, pp. 225-239
-
-
Lehiste, I.1
-
58
-
-
68149157315
-
Integrating articulatory features into HMM-based parametric speech synthesis
-
doi:10.1109/tasl.2009.2014796
-
Ling Z.-H., Richmond, K., Yamagishi, J., & Wang, R.-H. (2009). Integrating articulatory features into HMM-based parametric speech synthesis. IEEE Transactions on Audio, Speech, and Language Processing, 17, 1171-1185. doi:10.1109/tasl.2009.2014796
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, pp. 1171-1185
-
-
Ling, Z.-H.1
Richmond, K.2
Yamagishi, J.3
Wang, R.-H.4
-
59
-
-
77955426622
-
An analysis of HMM-based prediction of articulatory movements
-
Ling, Z.-H., Richmond, K., & Yamagishi, J. (2010a). An analysis of HMM-based prediction of articulatory movements. Speech Communication, 52, 834-846.
-
(2010)
Speech Communication
, vol.52
, pp. 834-846
-
-
Ling, Z.-H.1
Richmond, K.2
Yamagishi, J.3
-
60
-
-
79959823601
-
HMM-based Text-to-Articulation-movement prediction and analysis of critical articulators
-
September Paper presented at Makuhari, Japan. Retrieved from
-
Ling, Z.-H., Richmond, K., & Yamagishi, J. (2010b, September). HMM-based Text-to-Articulation-movement prediction and analysis of critical articulators. Paper presented at Interspeech 2010, Makuhari, Japan. Retrieved from : http://hdl.handle. net/1842.4563.
-
(2010)
Interspeech 2010
-
-
Ling, Z.-H.1
Richmond, K.2
Yamagishi, J.3
-
61
-
-
0031601187
-
Acoustic correlates of perceived versus actual sexual orientation in men ' s speech
-
Linville, S. (1998). Acoustic correlates of perceived versus actual sexual orientation in men ' s speech. Folia Phoniatrica et Logopaedica, 50, 35-48.
-
(1998)
Folia Phoniatrica et Logopaedica
, vol.50
, pp. 35-48
-
-
Linville, S.1
-
62
-
-
0030696416
-
Voice characteristics conversion for HMM-based speech synthesis system
-
April Paper presented at Munich, Germany. doi:10.1109/ICASSP.2009.4960406
-
Masuko, T., Tokuda, K., Kobayashi, T., & Imai, S. (1997, April). Voice characteristics conversion for HMM-based speech synthesis system. Paper presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Munich, Germany. doi:10.1109/ICASSP.2009.4960406
-
(1997)
The IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
63
-
-
84961456523
-
Identifying the nonspeaking population: A demographic study
-
Matas, J., Mathy-Laikko, P., Beukelman, D., & Legresley, K. (1985). Identifying the nonspeaking population: A demographic study. Augmentative and Alternative Communication, 1, 17-31.
-
(1985)
Augmentative and Alternative Communication
, vol.1
, pp. 17-31
-
-
Matas, J.1
Mathy-Laikko, P.2
Beukelman, D.3
Legresley, K.4
-
65
-
-
0025543906
-
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
-
Moulines, E., & Charpentier, F. (1990). Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9, 453-467.
-
(1990)
Speech Communication
, vol.9
, pp. 453-467
-
-
Moulines, E.1
Charpentier, F.2
-
66
-
-
33744907285
-
The acoustic and perceptual bases of judgments of women and men ' s sexual orientation from read speech
-
Munson, B., McDonald, E. C., DeBoe, N. L., & White, A. R. (2006). The acoustic and perceptual bases of judgments of women and men ' s sexual orientation from read speech. Journal of Phonetics, 34, 202-240.
-
(2006)
Journal of Phonetics
, vol.34
, pp. 202-240
-
-
Munson, B.1
McDonald, E.C.2
Deboe, N.L.3
White, A.R.4
-
67
-
-
0027447292
-
Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
-
doi:10.1121/1.405558
-
Murray, I. R., & Arnott, J. L. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America, 93, 1097-1108. doi:10.1121/1.405558
-
(1993)
Journal of the Acoustical Society of America
, vol.93
, pp. 1097-1108
-
-
Murray, I.R.1
Arnott, J.L.2
-
68
-
-
84906279165
-
Optimizations and fi tting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis
-
August Paper presented at Lyon, France. Retrieved from
-
Muthukumar, P.K., Black, A.W., & Bunnell, H.T. (2013, August). Optimizations and fi tting procedures for the Liljencrants-Fant model for statistical parametric speech synthesis. Paper presented at InterSpeech 2013, Lyon, France. Retrieved from http://www. isca-speech.org/archive/interspeech- 2013/i13-0397.html
-
(2013)
InterSpeech 2013
-
-
Muthukumar, P.K.1
Black, A.W.2
Bunnell, H.T.3
-
69
-
-
0029254176
-
Transformation of formants for voice conversion using artifi cial neural networks
-
Narendranath, M., Murthy, H. A., Rajendran, S., & Yegnanarayana, B. (1995). Transformation of formants for voice conversion using artifi cial neural networks. Speech Communication 16, 207-216.
-
(1995)
Speech Communication
, vol.16
, pp. 207-216
-
-
Narendranath, M.1
Murthy, H.A.2
Rajendran, S.3
Yegnanarayana, B.4
-
70
-
-
85006544659
-
Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency attraction
-
Nass, C., & Lee, K. M. (2001). Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency attraction. Journal of Experimental Psychology: Applied, 7, 171-181.
-
(2001)
Journal of Experimental Psychology: Applied
, vol.7
, pp. 171-181
-
-
Nass, C.1
Lee, K.M.2
-
71
-
-
0010592593
-
Speech Physiology
-
F. Minifi e, T. J. Hixon, & F. Williams (Eds.) Englewood Cliffs, NJ: Prentice-Hall
-
Netsell, R. (1973). Speech Physiology. In F. Minifi e, T. J. Hixon, & F. Williams (Eds.), Normal aspects of speech, hearing, and language (pp. 211-234). Englewood Cliffs, NJ: Prentice-Hall.
-
(1973)
Normal Aspects of Speech, Hearing, and Language
, pp. 211-234
-
-
Netsell, R.1
-
74
-
-
34547527563
-
A parametric approach for voice conversion
-
June Paper presented at Barecelona, Spain. Retrieved from
-
Nurminen, J, Popa, V., Tian, J., Tang, Y., & Kiss, I. (2006, June). A parametric approach for voice conversion. Paper presented at the TC-STAR Workshop on Speech-to-Speech Translation. Barecelona, Spain. Retrieved from http://www.tcstar. org/pubblicazioni/scientific-publications/Nokia/2006/ S2STranslation06-nokia3.pdf
-
(2006)
The TC-STAR Workshop on Speech-to-Speech Translation
-
-
Nurminen, J.1
Popa, V.2
Tian, J.3
Tang, Y.4
Kiss, I.5
-
75
-
-
84965395212
-
Speech perception as a talker-contingent process
-
Nygaard, L. C., Sommers, M. S., & Pisoni, D. B. (1994). Speech perception as a talker-contingent process. Psychological Science, 5, 42-46.
-
(1994)
Psychological Science
, vol.5
, pp. 42-46
-
-
Nygaard, L.C.1
Sommers, M.S.2
Pisoni, D.B.3
-
76
-
-
0036503253
-
Phonatory control in adults with cerebral palsy and severe dysarthria
-
Patel, R. (2002a). Phonatory control in adults with cerebral palsy and severe dysarthria. Augmentative and Alternative Communication, 18, 2-10.
-
(2002)
Augmentative and Alternative Communication
, vol.18
, pp. 2-10
-
-
Patel, R.1
-
77
-
-
0036787690
-
Prosodic Control in severe dysarthria: Preserved ability to mark the question-statement contrast
-
Patel, R. (2002b). Prosodic Control in severe dysarthria: Preserved ability to mark the question-statement contrast. Journal of Speech, Language, and Hearing Research, 45, 858-870.
-
(2002)
Journal of Speech, Language, and Hearing Research
, vol.45
, pp. 858-870
-
-
Patel, R.1
-
78
-
-
0347586935
-
Acoustic characteristics of the question-statement contrast in severe dysarthria due to cerebral palsy
-
Patel, R. (2003). Acoustic characteristics of the question-statement contrast in severe dysarthria due to cerebral palsy. Journal of Speech, Language, and Hearing Research, 46, 1401-1415.
-
(2003)
Journal of Speech, Language, and Hearing Research
, vol.46
, pp. 1401-1415
-
-
Patel, R.1
-
79
-
-
10944259978
-
The acoustics of contrastive prosody in adults with cerebral palsy
-
Patel, R. (2004). The acoustics of contrastive prosody in adults with cerebral palsy. Journal of Medical Speech-Language Pathology, 12, 189-193.
-
(2004)
Journal of Medical Speech-Language Pathology
, vol.12
, pp. 189-193
-
-
Patel, R.1
-
80
-
-
84865394622
-
Intelligibility and attitudes toward a speech synthesizer using dysarthric vocalizations
-
Patel, R., & Roden, A. (2008). Intelligibility and attitudes toward a speech synthesizer using dysarthric vocalizations. Journal of Medical Speech-Language Pathology, 16, 243-249.
-
(2008)
Journal of Medical Speech-Language Pathology
, vol.16
, pp. 243-249
-
-
Patel, R.1
Roden, A.2
-
81
-
-
85044897711
-
Using computer games to mediate caregiver-child communication for children with severe dysarthria
-
Patel, R., & Salata, A. (2006). Using computer games to mediate caregiver-child communication for children with severe dysarthria. Journal of Medical Speech-Language Pathology, 14, 279-284.
-
(2006)
Journal of Medical Speech-Language Pathology
, vol.14
, pp. 279-284
-
-
Patel, R.1
Salata, A.2
-
82
-
-
34347399580
-
Stress identifi cation in speakers with dysarthria due to cerebral palsy: An initial report
-
Patel, R., & Watkins, C. (2007). Stress identifi cation in speakers with dysarthria due to cerebral palsy: An initial report. Journal of Medical Speech-Language Pathology, 15, 149-159.
-
(2007)
Journal of Medical Speech-Language Pathology
, vol.15
, pp. 149-159
-
-
Patel, R.1
Watkins, C.2
-
83
-
-
84867594339
-
Local linear transformation for voice conversion
-
March Paper presented at Kyoto, Japan. doi:10.1109/ICASSP.2012.6288922
-
Popa, V., Silen, H., Nurminen, J., & Gabbouj, M. (2012, March). Local linear transformation for voice conversion. Paper presented at the 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Kyoto, Japan. doi:10.1109/ICASSP.2012.6288922
-
(2012)
The 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Popa, V.1
Silen, H.2
Nurminen, J.3
Gabbouj, M.4
-
84
-
-
6344261553
-
The influence of sexual orientation on vowel production
-
Pierrehumbert, J., Bent, T., Munson, B., Bradlow, A. R., & Bailey, J. M. (2004). The influence of sexual orientation on vowel production. Journal of the Acoustical Society of America, 116, 1905-1908.
-
(2004)
Journal of the Acoustical Society of America
, vol.116
, pp. 1905-1908
-
-
Pierrehumbert, J.1
Bent, T.2
Munson, B.3
Bradlow, A.R.4
Bailey, J.M.5
-
85
-
-
33748443739
-
Extraction of speaker-specifi c excitation information from linear prediction residual of speech
-
doi:10.1016/j.specom.2006.06.002
-
Prasanna, S. R. M., Gupta, C. S., & Yegnanarayana, B. (2006). Extraction of speaker-specifi c excitation information from linear prediction residual of speech. Speech Communication 48, 1243-1261. doi:10.1016/j.specom. 2006.06.002
-
(2006)
Speech Communication
, vol.48
, pp. 1243-1261
-
-
Prasanna, S.R.M.1
Gupta, C.S.2
Yegnanarayana, B.3
-
86
-
-
77957744515
-
HMM-based speech synthesis utilizing glottal inverse fi ltering
-
doi:10.1109/TASL.2010.2045239
-
Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M., & Alku, P. (2011). HMM-based speech synthesis utilizing glottal inverse fi ltering. IEEE Transactions on Audio, Speech, and Language Processing, 19, 153-165. doi:10.1109/TASL.2010.2045239
-
(2011)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.19
, pp. 153-165
-
-
Raitio, T.1
Suni, A.2
Yamagishi, J.3
Pulakka, H.4
Nurminen, J.5
Vainio, M.6
Alku, P.7
-
87
-
-
0031156447
-
Talker identifi cation based on phonetic information
-
Remez, R. E., Fellowes, J. M., & Rubin, P. E. (1997). Talker identifi cation based on phonetic information. Journal of Experimental Psychology, 23, 651-666.
-
(1997)
Journal of Experimental Psychology
, vol.23
, pp. 651-666
-
-
Remez, R.E.1
Fellowes, J.M.2
Rubin, P.E.3
-
88
-
-
0027525457
-
On the intonation of sinusoidal sentences: Contour and pitch height
-
Remez, R. E., & Rubin, P. E. (1993). On the intonation of sinusoidal sentences: Contour and pitch height. Journal of the Acoustical Society of America, 94, 1983-1988.
-
(1993)
Journal of the Acoustical Society of America
, vol.94
, pp. 1983-1988
-
-
Remez, R.E.1
Rubin, P.E.2
-
89
-
-
4544361661
-
Voice conversion through transformation of spectral and intonation features
-
May Paper presented at Montreal, Canada. doi:10.1109/ICASSP.2004.1325912
-
Rentzos, D., Vaseghi, S., Yan, W., & Ho, C.-H. (2004, May). Voice conversion through transformation of spectral and intonation features. Paper presented at the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Montreal, Canada. doi:10.1109/ICASSP.2004.1325912
-
(2004)
The 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Rentzos, D.1
Vaseghi, S.2
Yan, W.3
Ho, C.-H.4
-
90
-
-
0023756465
-
Speech synthesis by rule using an optimal selection of non-uniform synthesis units
-
May Paper presented at New York, NY. doi:10.1109/ICASSP.1988.196677
-
Sagisaka, Y. (1988, May). Speech synthesis by rule using an optimal selection of non-uniform synthesis units. Paper presented at the 1988 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), New York, NY. doi:10.1109/ICASSP.1988.196677
-
(1988)
The 1988 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Sagisaka, Y.1
-
91
-
-
34547507542
-
Frequency warping based on mapping formant parameters
-
September Paper presented at Pittsburgh, PA
-
Shuang, Z-W., Bakis, R., Shectman, S., Chazan, D., & Qin, Y. (2006, September). Frequency warping based on mapping formant parameters. Paper presented at Interspeech 2006, Pittsburgh, PA. http://www.isca-speech.org/ archive/interspeech-2006/i06-1768.html
-
(2006)
Interspeech 2006
-
-
Shuang, Z.-W.1
Bakis, R.2
Shectman, S.3
Chazan, D.4
Qin, Y.5
-
92
-
-
84887611468
-
Augmentative and alternative communication
-
J. H. Stone & M. Blouin (Eds.) Available online
-
Sigafoos, J., Schlosser, R. W., & Sutherland, D. 2013. Augmentative and alternative communication. In: J. H. Stone & M. Blouin (Eds.), International encyclopedia of rehabilitation. Available online: http://cirrie.buffalo.edu/encyclopedia/en/article/50
-
(2013)
International Encyclopedia of Rehabilitation
-
-
Sigafoos, J.1
Schlosser, R.W.2
Sutherland, D.3
-
93
-
-
78649382869
-
A survey of augmentative and alternative communication service provision in Hong Kong
-
Siu, E., Tam, E., Sin, D, Ng, C., Lam, E., Chui, M., Lam, C. (2010). A survey of augmentative and alternative communication service provision in Hong Kong. Augmentative and Alternative Communication, 26, 289-298.
-
(2010)
Augmentative and Alternative Communication
, vol.26
, pp. 289-298
-
-
Siu, E.1
Tam, E.2
Sin, D.3
Ng, C.4
Lam, E.5
Chui, M.6
Lam, C.7
-
94
-
-
0038042432
-
Male voices and perceived sexual orientation: An experiment and theoretical approach
-
Smyth, R., Jacobs, G., & Rogers, H. (2003). Male voices and perceived sexual orientation: An experiment and theoretical approach. Language and Society, 32, 329-350.
-
(2003)
Language and Society
, vol.32
, pp. 329-350
-
-
Smyth, R.1
Jacobs, G.2
Rogers, H.3
-
96
-
-
85009086192
-
Diphone concatenation using a harmonic plus noise model of speech
-
September Paper presented at Rhodes, Greece. Retrieved from
-
Stylianou, Y., Dutoit, T., & Schroeter, J. (1997, September). Diphone concatenation using a harmonic plus noise model of speech. Paper presented at Eurospeech 1997, Rhodes, Greece. Retrieved from: http://www.isca-speech.org/ archive/eurospeech-1997/e97-0613. html
-
(1997)
Eurospeech 1997
-
-
Stylianou, Y.1
Dutoit, T.2
Schroeter, J.3
-
97
-
-
84946753271
-
VTLNbased cross-language voice conversion
-
December Paper presented at St. Thomas, Virgin Islands. doi:10.1109/ASRU.2003.1318521
-
Sunderman, D., Ney, H, & Hoge, H. (2003, December). VTLNbased cross-language voice conversion. Paper presented at the 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), St. Thomas, Virgin Islands. doi:10.1109/ASRU.2003.1318521
-
(2003)
The 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
-
-
Sunderman, D.1
Ney, H.2
Hoge, H.3
-
98
-
-
84878394226
-
Text-to-speech intelligibility across speech rates
-
September Paper presented at Portland, OR. Retrieved from
-
Syrdal, A. K., Bunnell, H. T., Hertz, S. R., Mishra, T., Spiegel, M., Bickley, C., Makashay, M. J. (2012, September). Text-to-speech intelligibility across speech rates. Paper presented at InterSpeech 2012, Portland, OR. Retrieved from : http://www.isca-speech.org/archive/interspeech-2012/i12-0623. html
-
(2012)
InterSpeech 2012
-
-
Syrdal, A.K.1
Bunnell, H.T.2
Hertz, S.R.3
Mishra, T.4
Spiegel, M.5
Bickley, C.6
Makashay, M.J.7
-
99
-
-
0003058857
-
On the basic scheme and algorithms in non-uniform unit speech synthesis
-
G. Bailly, C. Benoît, & T. R. Sawallis (Eds.) Amsterdam, The Netherlands: North-Holland Publishing Co
-
Takeda, K., Abe, K., & Sagisaka, Y. (1992). On the basic scheme and algorithms in non-uniform unit speech synthesis. In G. Bailly, C. Benoît, & T. R. Sawallis (Eds.), Talking machines: Theories, models, and designs (pp. 93-105). Amsterdam, The Netherlands: North-Holland Publishing Co.
-
(1992)
Talking Machines: Theories, Models, and Designs
, pp. 93-105
-
-
Takeda, K.1
Abe, K.2
Sagisaka, Y.3
-
100
-
-
0034842740
-
Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
-
May Paper presented at Salt Lake City, UT. doi:10.1109/ICASSP.2001.941037
-
Tamura, M., Masuko, T., Tokuda, K., & Kobayashi, T. (2001, May). Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR. Paper presented at the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Salt Lake City, UT. doi:10.1109/ICASSP.2001. 941037
-
(2001)
The 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Tamura, M.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
-
101
-
-
85009069262
-
Straight-based voice conversion algorithm based on Gaussian mixture model
-
October Paper presented at Beijing, China. Retrieved from
-
Toda, T., Lu, J., Saruwatari, H., & Shikano, K. (2000, October). Straight-based voice conversion algorithm based on Gaussian mixture model. Paper presented at the Sixth International Conference on Spoken Language Processing, Beijing, China. Retrieved from: http://hdl.handle.net/10061/8187
-
(2000)
The Sixth International Conference on Spoken Language Processing
-
-
Toda, T.1
Lu, J.2
Saruwatari, H.3
Shikano, K.4
-
102
-
-
34547496175
-
One-to-many and many-to-one voice conversion based on eigenvoices
-
April Paper presented at Honolulu, HI doi:10.1109/ICASSP.2007.367303
-
Toda, T., Ohtani, Y., & Shikano, K. (2007a, April). One-to-many and many-to-one voice conversion based on eigenvoices. Paper presented at the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Honolulu, HI. doi:10.1109/ICASSP.2007.367303
-
(2007)
The 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
103
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
doi:10.1109/tasl.2007.907344
-
Toda, T., Black, A. W., & Tokuda, K. (2007b). Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory. IEEE Transactions on Audio, Speech, and Language Processing, 15, 2222-2235. doi:10.1109/tasl.2007.907344
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
104
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
June Paper presented at Istanbul, Turkey. doi:10.1109/ICASSP.2000.861820
-
Tokuda, K., Yoshimura, T., Masuko, T., Kobayashi, T., & Kitamura, T. (2000, June). Speech parameter generation algorithms for HMM-based speech synthesis. Paper presented at the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Istanbul, Turkey. doi:10.1109/ICASSP.2000.861820
-
(2000)
The 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
105
-
-
0027930431
-
Speaker race identifi cation from acoustic cues in the vocal signal
-
Walton, J., & Orlikoff, R. (1994). Speaker race identifi cation from acoustic cues in the vocal signal. Journal of Speech Language and Hearing Research 37, 4, 738-745.
-
(1994)
Journal of Speech Language and Hearing Research
, vol.37
, Issue.4
, pp. 738-745
-
-
Walton, J.1
Orlikoff, R.2
-
106
-
-
84959174906
-
HMM-based synthesis of child speech
-
October Paper presented at Chania, Greece. Retrieved from
-
Watts, O., Yamagishi, J., Berkling, K., & King, S. (2008, October). HMM-based synthesis of child speech. Paper presented at the First Workshop on Child, Computer and Interaction (ICMI ' 08 post-conference workshop), Chania, Greece. Retrieved from: http://hdl.handle.net/1842/3817
-
(2008)
The First Workshop on Child, Computer and Interaction (ICMI ' 08 Post-conference Workshop)
-
-
Watts, O.1
Yamagishi, J.2
Berkling, K.3
King, S.4
-
107
-
-
70450188371
-
HMM adaptation and voice conversion for the synthesis of child speech: A comparison
-
September Paper presented at Brighton, United Kingdom. Retrieved from
-
Watts, O., Yamagishi, J., King, S., & Berkling, K. (2009, September). HMM adaptation and voice conversion for the synthesis of child speech: A comparison. Paper presented at Interspeech 2009, Brighton, United Kingdom. Retrieved from: http://www.iscaspeech. org/archive/interspeech-2009/i09-2627. html
-
(2009)
Interspeech 2009
-
-
Watts, O.1
Yamagishi, J.2
King, S.3
Berkling, K.4
-
108
-
-
77953723062
-
Synthesis of child speech with HMM adaptation and voice conversion
-
doi:10.1109/TASL.2009.2035029
-
Watts, O., Yamagishi, J., King, S., & Berkling, K. (2010). Synthesis of child speech with HMM adaptation and voice conversion. IEEE Transactions on Audio, Speech, and Language Processing, 18, 1005-1016. doi:10.1109/TASL.2009. 2035029
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, pp. 1005-1016
-
-
Watts, O.1
Yamagishi, J.2
King, S.3
Berkling, K.4
-
109
-
-
33847129573
-
Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
-
Yamagishi, J., & Kobayashi, T. (2007). Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training. IEICE Transactions on Information and Systems, E 90-D, 533-543.
-
(2007)
IEICe Transactions on Information and Systems, e
, vol.90-D
, pp. 533-543
-
-
Yamagishi, J.1
Kobayashi, T.2
-
110
-
-
85008006694
-
A robust speaker-adaptive HMMbased text-to-speech synthesis
-
doi:10.1109/TASL.2009.2016394
-
Yamagishi, J., Nose, T., Zen, H., Ling, Z., Toda, T., Tokuda, K., Renals, S. (2009). A robust speaker-adaptive HMMbased text-to-speech synthesis. IEEE Transactions on Audio, Speech, and Language Processing, 17, 1208-1230. doi:10.1109/TASL.2009.2016394
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, pp. 1208-1230
-
-
Yamagishi, J.1
Nose, T.2
Zen, H.3
Ling, Z.4
Toda, T.5
Tokuda, K.6
Renals, S.7
-
111
-
-
67651002140
-
Statistical parametric speech synthesis
-
Zen, H., Tokuda, K., & Black, A. W. (2009). Statistical parametric speech synthesis. Speech Communication, 51, 1039-1064.
-
(2009)
Speech Communication
, vol.51
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
|