SCOPUS Information Search Platform

Text-to-Speech Synthesis

Volume 9780521899277, 2009, Pages 1-597

Text-to-speech synthesis

(1) Taylor, Paul a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

BUILDING SIGNAL SYSTEMS; ENGINEERING EDUCATION; HIDDEN MARKOV MODELS; LINGUISTICS; MARKOV PROCESSES; SIGNAL PROCESSING; SPEECH; SPEECH SYNTHESIS; STUDENTS;

GRADUATE STUDENTS; HUMAN COMMUNICATIONS; PRACTICAL SYSTEMS; PRIOR KNOWLEDGE; SPEECH SIGNALS; TEXT ANALYSIS; TRADITIONAL TECHNIQUES; UNIT SELECTION;

AUDIO SIGNAL PROCESSING;

EID: 84925160976 PISSN: None EISSN: None Source Type: Book
DOI: 10.1017/CBO9780511816338 Document Type: Book

Times cited : (503)

References (521)

1
- 0002212488
- Chunks and dependencies: Bringing processing evidence to bear on syntax
- J. Cole, G. Green, and J. Morgan, Eds. CSLI
- Abney, S. Chunks and dependencies: Bringing processing evidence to bear on syntax. In Computational Linguistics and the Foundations of Linguistic Theory, J. Cole, G. Green, and J. Morgan, Eds. CSLI (1995), pp. 145-164.
- (1995) Computational Linguistics and the Foundations of Linguistic Theory , pp. 145-164
- Abney, S.¹

2
- 30244435612
- Hybrid sinusoidal modeling of speech without voicing decision
- Abrantes, A. J., Marques, J. S., and Trancoso, I. M. Hybrid sinusoidal modeling of speech without voicing decision. In Proceedings of Eurospeech 1991 (1991).
- (1991) Proceedings of Eurospeech 1991
- Abrantes, A.J.¹ Marques, J.S.² Trancoso, I.M.³

3
- 0038676753
- Source filter models for time-scale pitch-scale modification of speech
- Acero, A. Source filter models for time-scale pitch-scale modification of speech. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Acero, A.¹

4
- 85135264071
- Formant analysis and synthesis using hidden Markov models
- Acero, A. Formant analysis and synthesis using hidden Markov models. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Acero, A.¹

5
- 33645793767
- Toward phone segmentation for concatenative speech synthesis
- Adell, J., and Bonafonte, A. Toward phone segmentation for concatenative speech synthesis. In Proceedings of the 5th ISCA Speech Synthesis Workshop (2004).
- (2004) Proceedings of the 5th ISCA Speech Synthesis Workshop
- Adell, J.¹ Bonafonte, A.²

6
- 84925038210
- New Mexico: University of New Mexico Press
- Adrien, K. Andean Worlds: Indigenous History, Culture and Consciousness. New Mexico: University of New Mexico Press (2001).
- (2001) Andean Worlds: Indigenous History, Culture and Consciousness
- Adrien, K.¹

7
- 51449110207
- Joint extraction and prediction of Fujisaki's intonation model parameters
- Aguero, P., Wimmer, K., and Bonafonte, A. Joint extraction and prediction of Fujisaki's intonation model parameters. Proceedings of International Conference on Spoken Language Processing (2004).
- (2004) Proceedings of International Conference on Spoken Language Processing
- Aguero, P.¹ Wimmer, K.² Bonafonte, A.³

8
- 0015699029
- A system for converting English text into speech
- Ainsworth, W. A system for converting English text into speech. IEEE Transactions on Audio and Electroacoustics 21 (1973), 288-290.
- (1973) IEEE Transactions on Audio and Electroacoustics , vol.21 , pp. 288-290
- Ainsworth, W.¹

9
- 85009210634
- Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis
- Alias, F., and Llora, X. Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Alias, F.¹ Llora, X.²

10
- 0003724033
- Cambridge: Cambridge University Press
- Allen, J., Hunnicut, S., and Klatt, D. From Text to Speech: The MITalk System. Cambridge: Cambridge University Press (1987).
- (1987) From Text to Speech: The MITalk System
- Allen, J.¹ Hunnicut, S.² Klatt, D.³

11
- 33947132973
- On the analysis of communicative action
- Allwood, J. On the analysis of communicative action. Gothenburg Papers in Theoretical Linguistics 38 (1978).
- (1978) Gothenburg Papers in Theoretical Linguistics , vol.38
- Allwood, J.¹

12
- 85009279016
- The reliability of the ITU-T P.85 standard for the evaluation of text-to-speech systems
- Alvarez, Y. V., and Huckvale, M. The reliability of the ITU-T P.85 standard for the evaluation of text-to-speech systems. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Alvarez, Y.V.¹ Huckvale, M.²

13
- 0031055369
- Towards articulatory-acoustic models for liquid consonants based on MRI and EPG data. Part II: The rhotics
- Alwan, A., Narayanan, S., and Haker, K. Towards articulatory-acoustic models for liquid consonants based on MRI and EPG data. Part II: The rhotics. Journal of the Acoustical Society of America 101, 2 (1997), 1078-1089.
- (1997) Journal of the Acoustical Society of America , vol.101 , Issue.2 , pp. 1078-1089
- Alwan, A.¹ Narayanan, S.² Haker, K.³

14
- 6344251260
- Word and syllable concatenation in text-to-speech synthesis
- Lewis, E., and Tatham, M. Word and syllable concatenation in text-to-speech synthesis. In Proceedings of the European Conference on Speech 1999 (1999).
- (1999) Proceedings of the European Conference on Speech 1999
- Lewis, E.¹ Tatham, M.²

15
- 85032415626
- Unit selection synthesis database development using utterance verification
- Amdal, I., and Svendsen, T. Unit selection synthesis database development using utterance verification. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Amdal, I.¹ Svendsen, T.²

16
- 0012406952
- Synthesis by rule of English intonation patterns
- Anderson, M. D., Pierrehumbert, J. B., and Liberman, M. Y. Synthesis by rule of English intonation patterns. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 1984 (1984).
- (1984) Proceedings of the International Conference on Acoustics Speech and Signal Processing 1984
- Anderson, M.D.¹ Pierrehumbert, J.B.² Liberman, M.Y.³

17
- 0004950573
- Chicago, IL: University of Chicago Press
- Anderson, S. R. Phonology in the Twentieth Century. Chicago, IL: University of Chicago Press (1985).
- (1985) Phonology in the Twentieth Century
- Anderson, S.R.¹

18
- 85009153344
- Long vowel detection for letter-to-sound conversion for Japanese sourced words transliterated into the alphabet
- Asano, H., Nakajima, H., Mizuno, H., and Oku, M. Long vowel detection for letter-to-sound conversion for Japanese sourced words transliterated into the alphabet. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Asano, H.¹ Nakajima, H.² Mizuno, H.³ Oku, M.⁴

19
- 0003472415
- Michigan: University of Michigan Press
- Ascher, M., and Ascher, R. Code of the Quipu: A Study in Media, Mathematics, and Culture. Michigan: University of Michigan Press (1980).
- (1980) Code of the Quipu: A Study in Media, Mathematics, and Culture
- Ascher, M.¹ Ascher, R.²

20
- 0015112070
- Speech analysis and synthesis by linear prediction of the speech wave
- Atal, B. S., and Hanauer, L. Speech analysis and synthesis by linear prediction of the speech wave. Journal of the Acoustical Society of America 50 (1971), 637-655.
- (1971) Journal of the Acoustical Society of America , vol.50 , pp. 637-655
- Atal, B.S.¹ Hanauer, L.²

21
- 2942726537
- On the phonetics and phonology of “segmental anchoring” of F0: Evidence from German
- Atterer, M., and Ladd, D. R. On the phonetics and phonology of “segmental anchoring” of F0: Evidence from German. Journal of Phonetics 32 (2004), 177-197.
- (2004) Journal of Phonetics , vol.32 , pp. 177-197
- Atterer, M.¹ Ladd, D.R.²

22
- 78649314799
- Why and how to control the authentic emotional speech corpora
- Auberge, V., Audibert, N., and Rilliard, A. Why and how to control the authentic emotional speech corpora. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Auberge, V.¹ Audibert, N.² Rilliard, A.³

23
- 0001680172
- The dissociation of deaccenting, givenness and syntactic role in spontaneous speech
- Aylett, M. P. The dissociation of deaccenting, givenness and syntactic role in spontaneous speech. In Proceedings of the XIVth International Congress of Phonetic Science 1999, pp. 1753-1756.
- (1999) Proceedings of the XIVth International Congress of Phonetic Science , pp. 1753-1756
- Aylett, M.P.¹

24
- 85009067857
- Stochastic suprasegmentals: Relationships between redundancy, prosodic structure and care of articulation in spontaneous speech
- Aylett, M. P. Stochastic suprasegmentals: Relationships between redundancy, prosodic structure and care of articulation in spontaneous speech. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Aylett, M.P.¹

25
- 85032425410
- Synthesising hyperarticulation in unit selection TTS
- Aylett, M. P. Synthesising hyperarticulation in unit selection TTS. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Aylett, M.P.¹

26
- 84930562270
- A computational grammar of discourse-neutral prosodic phrasing in English
- Bachenko, J., and Fitzpatrick, E. A computational grammar of discourse-neutral prosodic phrasing in English. Computational Linguistics 16, 3 (1990), 155-170.
- (1990) Computational Linguistics , vol.16 , Issue.3 , pp. 155-170
- Bachenko, J.¹ Fitzpatrick, E.²

27
- 0032045825
- Phonemic transcription by analogy in text-to-speech synthesis: Novel word pronunciation and lexicon compression
- Bagshaw, P. Phonemic transcription by analogy in text-to-speech synthesis: Novel word pronunciation and lexicon compression. Computer Speech & Language 12, 2 (1998), 119-142.
- (1998) Computer Speech & Language , vol.12 , Issue.2 , pp. 119-142
- Bagshaw, P.¹

28
- 85093707396
- Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching
- Bagshaw, P. C., Hiller, S. M., and Jack, M. A. Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching. In Proceedings of Eurospeech 1993 (1993), pp. 1003-1006.
- (1993) Proceedings of Eurospeech 1993 , pp. 1003-1006
- Bagshaw, P.C.¹ Hiller, S.M.² Jack, M.A.³

29
- 21844464777
- No future for comprehensive models of intonation?
- Y. Sagisaka, N. Campbell and N. Higuchi, Eds. Berlin: Springer-Verlag
- Bailly, G. No future for comprehensive models of intonation? In Computing Prosody: Computational Models for Processing Spontaneous Speech, Y. Sagisaka, N. Campbell and N. Higuchi, Eds. Berlin: Springer-Verlag (1997), pp. 157-164.
- (1997) Computing Prosody: Computational Models for Processing Spontaneous Speech , pp. 157-164
- Bailly, G.¹

30
- 0142216141
- Audiovisual speech synthesis
- Bailly, G., Berar, M., Elisei, F., and Odisio, M. Audiovisual speech synthesis. International Journal of Speech Technology 6 (2003), 331-346.
- (2003) International Journal of Speech Technology , vol.6 , pp. 331-346
- Bailly, G.¹ Berar, M.² Elisei, F.³ Odisio, M.⁴

31
- 21844440585
- SFC: A trainable prosodic model
- Bailly, G., and Holm, B. SFC: A trainable prosodic model. Speech Communication 46, 3-4 (2005), 348-364.
- (2005) Speech Communication , vol.46 , Issue.3-4 , pp. 348-364
- Bailly, G.¹ Holm, B.²

32
- 0342484597
- Compost: A rule compiler for speech synthesis
- Bailly, G., and Tran, A. Compost: A rule compiler for speech synthesis. In Proceedings of Eurospeech 1989 (1989), pp. 136-139.
- (1989) Proceedings of Eurospeech 1989 , pp. 136-139
- Bailly, G.¹ Tran, A.²

33
- 0016663359
- The DRAGON system - an overview
- Baker, J. K. The DRAGON system - an overview. IEEE Transactions on Acoustics, Speech, and Signal Processing 23, 1 (1975), 24-29.
- (1975) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.23 , Issue.1 , pp. 24-29
- Baker, J.K.¹

34
- 34748917145
- Is there an emotion signature in intonational patterns? And can it be used in synthesis?
- Banziner, T., Morel, M., and Scherer, K. Is there an emotion signature in intonational patterns? And can it be used in synthesis? In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Banziner, T.¹ Morel, M.² Scherer, K.³

35
- 0028531866
- Characterization of rhythmic patterns for text-to-speech synthesis
- Barbosa, P., and Bailly, G. Characterization of rhythmic patterns for text-to-speech synthesis. Speech Communication 15, 1 (1994), 127-137.
- (1994) Speech Communication , vol.15 , Issue.1 , pp. 127-137
- Barbosa, P.¹ Bailly, G.²

36
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chain
- Baum, L. E., Peterie, T., Souled, G., and Weiss, N. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chain. The Annals of Mathematical Statistics 41, 1 (1970), 249-336.
- (1970) The Annals of Mathematical Statistics , vol.41 , Issue.1 , pp. 249-336
- Baum, L.E.¹ Peterie, T.² Souled, G.³ Weiss, N.⁴

37
- 84925038190
- Adaptation of prosodic phrasing models
- Bell, P., Burrows, T., and Taylor, P. Adaptation of prosodic phrasing models. In Proceedings of Speech Prosody 2006 (2006).
- (2006) Proceedings of Speech Prosody 2006
- Bell, P.¹ Burrows, T.² Taylor, P.³

38
- 0141703269
- Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy
- Bellegarda, J. Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy. In Acoustics, Speech, and Signal Processing, 2003. Proceedings (ICASSP’03). 2003 IEEE International Conference (2003).
- (2003) Acoustics, Speech, and Signal Processing, 2003. Proceedings (ICASSP’03). 2003 IEEE International Conference
- Bellegarda, J.¹

39
- 85032421249
- A novel discontinuity metric for unit selection text-to-speech synthesis
- Bellegarda, J. R. A novel discontinuity metric for unit selection text-to-speech synthesis. In 5th ISCA Workshop on Speech Synthesis (2004).
- (2004) 5th ISCA Workshop on Speech Synthesis
- Bellegarda, J.R.¹

40
- 33846442606
- Large scale evaluation of corpus-based synthesizers: Results and lessons from the Blizzard Challenge 2005
- Bennett, C. L. Large scale evaluation of corpus-based synthesizers: Results and lessons from the Blizzard Challenge 2005. In Proceedings of Interspeech 2006 (2005).
- (2005) Proceedings of Interspeech 2006
- Bennett, C.L.¹

41
- 84883424118
- Rule-based visual speech synthesis
- Beskow, J. Rule-based visual speech synthesis. In Proceedings of Eurospeech 1995 (1995).
- (1995) Proceedings of Eurospeech 1995
- Beskow, J.¹

42
- 85133503504
- Diphone synthesis using unit selection
- Beutnagel, M., Conkie, A., and Syrdal, A. Diphone synthesis using unit selection. In Proceedings of the Third ISCA Workshop on Speech Synthesis (1998).
- (1998) Proceedings of the Third ISCA Workshop on Speech Synthesis
- Beutnagel, M.¹ Conkie, A.² Syrdal, A.³

43
- 84937296364
- Cambridge: Cambridge University Press
- Bird, S. Computational Phonology: A Constraint-Based Approach. Cambridge: Cambridge University Press, 1995.
- (1995) Computational Phonology: A Constraint-Based Approach
- Bird, S.¹

44
- 85009250482
- Investigations on joint-multigram models for grapheme-to-phoneme conversion
- Bisani, M., and Ney, H. Investigations on joint-multigram models for grapheme-to-phoneme conversion. Proceedings of the International Conference on Spoken Language Processing 1 (2002), pp. 105-108.
- (2002) Proceedings of the International Conference on Spoken Language Processing , vol.1 , pp. 105-108
- Bisani, M.¹ Ney, H.²

45
- 0003487601
- Oxford: Oxford University Press
- Bishop, C. M. Neural Networks for Pattern Recognition. Oxford: Oxford University Press (1995).
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

46
- 33846516584
- Berlin: Springer-Verlag
- Bishop, C. M. Pattern Recognition and Machine Learning. Berlin: Springer-Verlag (2006).
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

47
- 85006631929
- Unit selection and emotional speech
- Black, A. Unit selection and emotional speech. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Black, A.¹

48
- 85032409979
- Blizzard Challenge 2006
- Black, A., and Bennett, C. L. Blizzard Challenge 2006. In Proceedings of Interspeech 2006 (2006).
- (2006) Proceedings of Interspeech 2006
- Black, A.¹ Bennett, C.L.²

49
- 85133526552
- Automatically clustering similar units for unit selection in speech synthesis
- Black, A., and Taylor, P. Automatically clustering similar units for unit selection in speech synthesis. In Proceedings of Eurospeech 1997 (1997), vol. 2, pp. 601-604.
- (1997) Proceedings of Eurospeech 1997 , vol.2 , pp. 601-604
- Black, A.¹ Taylor, P.²

50
- 33947682675
- Blizzard Challenge 2006: Evaluating corpus-based speech synthesis on common datasets
- Black, A., and Tokuda, K. Blizzard Challenge 2006: Evaluating corpus-based speech synthesis on common datasets. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Black, A.¹ Tokuda, K.²

51
- 0030355540
- Generation F0 contours from ToBI labels using linear regression
- Black, A. W., and Hunt, A. J. Generation F0 contours from ToBI labels using linear regression. In Computer Speech and Language (1996).
- (1996) Computer Speech and Language
- Black, A.W.¹ Hunt, A.J.²

52
- 84966301419
- Limited domain synthesis
- Black, A. W., and Lenzo, K. A. Limited domain synthesis. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Black, A.W.¹ Lenzo, K.A.²

53
- 0342918775
- CHATR: A generic speech synthesis system
- Black, A. W., and Taylor, P. CHATR: A generic speech synthesis system. In COLING 1994 (1994), pp. 983-986.
- (1994) COLING 1994 , pp. 983-986
- Black, A.W.¹ Taylor, P.²

54
- 85126702268
- Assigning phrase breaks from part-of-speech sequences
- Black, A. W., and Taylor, P. Assigning phrase breaks from part-of-speech sequences. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Black, A.W.¹ Taylor, P.²

55
- 84925038181
- The Festival Speech Synthesis System
- Black, A. W., Taylor, P., and Caley, R. The Festival Speech Synthesis System. Manual and source code available at http://www.cstr.ed.ac.uk/projects/festival.html, 1996-2006.
- (1996)
- Black, A.W.¹ Taylor, P.² Caley, R.³

56
- 85032424788
- A framework for generating prosody from high level linguistics descriptions
- Black, A. W., and Taylor, P. A. A framework for generating prosody from high level linguistics descriptions. In Spring Meeting, Acoustical Society of Japan (1994).
- (1994) Spring Meeting, Acoustical Society of Japan
- Black, A.W.¹ Taylor, P.A.²

57
- 0004123567
- New York: Henry Holt
- Bloomfield, L. Language. New York: Henry Holt (1933).
- (1933) Language
- Bloomfield, L.¹

58
- 84925038179
- Speech perception: Phonetic aspects
- W. J. Frawley, Ed., Oxford: Oxford University Press
- Blumstein, S., and Cutler, A. Speech perception: Phonetic aspects. In International Encyclopedia of Language, W. J. Frawley, Ed., vol. 4. Oxford: Oxford University Press (2003).
- (2003) International Encyclopedia of Language , vol.4
- Blumstein, S.¹ Cutler, A.²

59
- 33745089688
- Cambridge, MA: MIT Press
- Bod, R., Hay, J., and Jannedy, S. Probabilistic Linguistics. Cambridge, MA: MIT Press (1999).
- (1999) Probabilistic Linguistics
- Bod, R.¹ Hay, J.² Jannedy, S.³

60
- 0039326092
- PhD thesis, University of Amsterdam
- Boersma, P. Functional Phonology Formalizing the Interactions between Articulatory and Perceptual Drives. PhD thesis, University of Amsterdam (1998).
- (1998) Functional Phonology Formalizing the Interactions between Articulatory and Perceptual Drives
- Boersma, P.¹

61
- 0004350698
- London: Everyman's Library
- Boswell, J. The Life of Samuel Johnson. London: Everyman's Library (1791).
- (1791) The Life of Samuel Johnson
- Boswell, J.¹

62
- 0042868018
- Evaluation of grapheme-to-phoneme conversion for text-to-speech synthesis in French
- Boula de Mareuil, P., Yvon, F., d’Alessandro, C. et al. Evaluation of grapheme-to-phoneme conversion for text-to-speech synthesis in French. Proceedings of First International Conference on Language Resources & Evaluation (1998), pp. 641-645.
- (1998) Proceedings of First International Conference on Language Resources & Evaluation , pp. 641-645
- Boula De Mareuil, P.¹ Yvon, F.² D’Alessandro, C.³

63
- 0030142722
- Towards increasing speech recognition error rates
- Boulard, H., Hermansky, H., and Morgan, N. Towards increasing speech recognition error rates. Speech Communication 18 (1996), 205-255.
- (1996) Speech Communication , vol.18 , pp. 205-255
- Boulard, H.¹ Hermansky, H.² Morgan, N.³

64
- 0004316316
- Princeton, NJ: Princeton University Press
- Boyer, C. B. History of Mathematics. Princeton, NJ: Princeton University Press (1985).
- (1985) History of Mathematics
- Boyer, C.B.¹

65
- 85128404760
- A phonologically motivated method of selecting nonuniform units
- Breen, A. P., and Jackson, P. A phonologically motivated method of selecting nonuniform units. In International Conference on Speech and Language Processing (1998).
- (1998) International Conference on Speech and Language Processing
- Breen, A.P.¹ Jackson, P.²

66
- 85032405974
- Video rewrite: Visual speech synthesis from video
- Bregler, C., Covell, M., and Slaney, M. Video rewrite: Visual speech synthesis from video. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Bregler, C.¹ Covell, M.² Slaney, M.³

67
- 84867919822
- Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging
- Brill, E. Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics 21, 4 (1995) 543-565.
- (1995) Computational Linguistics , vol.21 , Issue.4 , pp. 543-565
- Brill, E.¹

68
- 0032688795
- Modelling energy flow in the vocal tract with applications to glottal closure and opening detection
- Brookes, D. M., and Loke, H. P. Modelling energy flow in the vocal tract with applications to glottal closure and opening detection. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1999 (1999).
- (1999) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1999
- Brookes, D.M.¹ Loke, H.P.²

69
- 84955548400
- Towards an articulatory phonology
- Browman, C. P., and Goldstein, L. Towards an articulatory phonology. In Phonology Yearbook 3 (1986), pp. 219-252.
- (1986) Phonology Yearbook , vol.3 , pp. 219-252
- Browman, C.P.¹ Goldstein, L.²

70
- 0027024362
- Articulatory phonology: An overview
- Browman, C. P., and Goldstein, L. Articulatory phonology: an overview. Phonetica 49 (1992), 155-180.
- (1992) Phonetica , vol.49 , pp. 155-180
- Browman, C.P.¹ Goldstein, L.²

71
- 32244434943
- An automatic extraction method of F0 generation model parameters
- Bu, S., Yamamoto, M., and Itahashi, S. An automatic extraction method of F0 generation model parameters. IEICE Transactions on Information and Systems 89, 1 (2006), 305.
- (2006) IEICE Transactions on Information and Systems , vol.89 , Issue.1 , pp. 305
- Bu, S.¹ Yamamoto, M.² Itahashi, S.³

72
- 85009062747
- Data driven intonation modelling of 6 languages
- Buhmann, J., Vereecken, H., Fackrell, J., Martens, J. P., and Coile, B. V. Data driven intonation modelling of 6 languages. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Buhmann, J.¹ Vereecken, H.² Fackrell, J.³ Martens, J.P.⁴ Coile, B.V.⁵

73
- 85032397724
- Investigating the role of phoneme-level modifications in emotional speech resynthesis
- Bult, M., Busso, C., Tildim, S. et al. Investigating the role of phoneme-level modifications in emotional speech resynthesis. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Bult, M.¹ Busso, C.² Tildim, S.³

74
- 85009102972
- Unit selection for speech synthesis using splicing costs with weighted finite state transducers
- Bulyko, I., and Ostendorf, M. Unit selection for speech synthesis using splicing costs with weighted finite state transducers. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Bulyko, I.¹ Ostendorf, M.²

75
- 0012643736
- Prosodic vs segmental contributions to naturalness in a diphone synthesizer
- Bunnell, H. T., Hoskins, S. R., and Yarrington, D. Prosodic vs segmental contributions to naturalness in a diphone synthesizer. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Bunnell, H.T.¹ Hoskins, S.R.² Yarrington, D.³

76
- 85032404317
- Machine learning of word pronunciation: The case against abstraction
- Busser, B., Daelemans, W., and van den Bosch, A. Machine learning of word pronunciation: The case against abstraction. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Busser, B.¹ Daelemans, W.² Van Den Bosch, A.³

77
- 84994292873
- Predicting phrase breaks with memory-based learning
- Busser, B., Daelemans, W., and van den Bosch, A. Predicting phrase breaks with memory-based learning. In 4th ISCA Tutorial and Research Workshop on Speech Synthesis (2001).
- (2001) 4th ISCA Tutorial and Research Workshop on Speech Synthesis
- Busser, B.¹ Daelemans, W.² Van Den Bosch, A.³

78
- 84936824214
- Regularity and idiomaticity in grammatical constructions: The case of let alone
- Fillmore, C. J., Kay, P., and O’Connor, C. Regularity and idiomaticity in grammatical constructions: The case of let alone. Language 64 (1988), 501-538.
- (1988) Language , vol.64 , pp. 501-538
- Fillmore, C.J.¹ Kay, P.² O’Connor, C.³

79
- 0002515370
- The generation of affect in synthesized speech
- Cahn, J. The generation of affect in synthesized speech. Journal of the American Voice I/O Society 8 (1990), 1-19.
- (1990) Journal of the American Voice I/O Society , vol.8 , pp. 1-19
- Cahn, J.¹

80
- 79951864468
- A computational memory and processing model for prosody
- Cahn, J. A computational memory and processing model for prosody. In International Conference on Speech and Language Processing (1998).
- (1998) International Conference on Speech and Language Processing
- Cahn, J.¹

81
- 85009207979
- Towards synthesizing expressive speech; designing and collecting expressive speech data
- Campbell, N. Towards synthesizing expressive speech; designing and collecting expressive speech data. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Campbell, N.¹

82
- 34047268342
- Conversational speech synthesis and the need for some laughter
- Campbell, N. Conversational speech synthesis and the need for some laughter. IEEE Transactions on Audio, Speech and Language Processing 14, 4 (2006), 1171-1178.
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1171-1178
- Campbell, N.¹

83
- 0001717383
- Syllable-based segmental duration
- C. B. G. Bailly and T. R. Sawallis, Eds. Amsterdam: Elsevier Science Publishers
- Campbell, W. N. Syllable-based segmental duration. In Talking Machines: Theories, Models and Designs, C. B. G. Bailly and T. R. Sawallis, Eds. Amsterdam: Elsevier Science Publishers (1992), pp. 211-224.
- (1992) Talking Machines: Theories, Models and Designs , pp. 211-224
- Campbell, W.N.¹

84
- 85032415312
- A high-definition speech re-sequencing system
- Campbell, W. N. A high-definition speech re-sequencing system. In Proceedings of the Third ASA/ASJ Joint Meeting (1996), pp. 373-376.
- (1996) Proceedings of the Third ASA/ASJ Joint Meeting , pp. 373-376
- Campbell, W.N.¹

85
- 0033677157
- Speech reconstruction from mel frequency cepstral coefficients and pitch
- Chazan, D., Hoory, R., Cohen, G., and Zibulsk, M. Speech reconstruction from mel frequency cepstral coefficients and pitch. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2000
- Chazan, D.¹ Hoory, R.² Cohen, G.³ Zibulsk, M.⁴

86
- 85009227369
- Conditional and joint models for grapheme-to-phoneme conversion
- Chen, S. F. Conditional and joint models for grapheme-to-phoneme conversion. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Chen, S.F.¹

87
- 85009128704
- Training prosodic phrasing rules for Chinese TTS systems
- Chen, W., Lin, F., and Zhang, J. L. B. Training prosodic phrasing rules for Chinese TTS systems. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Chen, W.¹ Lin, F.² Zhang, J.³

88
- 0004291783
- The Hague: Mouton
- Chomsky, N. Syntactic Structures. The Hague: Mouton (1957).
- (1957) Syntactic Structures
- Chomsky, N.¹

89
- 0003647888
- Cambridge, MA: MIT Press
- Chomsky, N. Aspects of the Theory of Syntax. Cambridge, MA: MIT Press (1965).
- (1965) Aspects of the Theory of Syntax
- Chomsky, N.¹

90
- 0003793394
- New York: Praeger
- Chomsky, N. Knowledge of Language: Its Nature, Origin and Use. New York: Praeger (1986).
- (1986) Knowledge of Language: Its Nature, Origin and Use
- Chomsky, N.¹

91
- 0004119259
- London: Harper and Row
- Chomsky, N., and Halle, M. The Sound Pattern of English. London: Harper and Row (1968).
- (1968) The Sound Pattern of English
- Chomsky, N.¹ Halle, M.²

92
- 0034840906
- Selecting non-uniform units from a very large corpus for concatenative speech synthesizer
- Chu, M., Peng, H., Yang, H. Y., and Chang, E. Selecting non-uniform units from a very large corpus for concatenative speech synthesizer. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2001 (2001).
- (2001) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2001
- Chu, M.¹ Peng, H.² Yang, H.Y.³ Chang, E.⁴

93
- 0141480034
- Microsoft Mulan - a bilingual TTS system
- Chu, M., Peng, H., Zhao, Y., Niu, Z., and Chan, E. Microsoft Mulan - a bilingual TTS system. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2003 (2003).
- (2003) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2003
- Chu, M.¹ Peng, H.² Zhao, Y.³ Niu, Z.⁴ Chan, E.⁵

94
- 0025750735
- A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams
- Church, K. W., and Gale, W. A. A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams. Computer Speech and Language 5 (1991), 19-54.
- (1991) Computer Speech and Language , vol.5 , pp. 19-54
- Church, K.W.¹ Gale, W.A.²

95
- 1842653460
- Cambridge: Cambridge University Press
- Clark, H. H. Using Language. Cambridge: Cambridge University Press (1996).
- (1996) Using Language
- Clark, H.H.¹

96
- 0003861855
- London: Harcourt Brace Jovanovich
- Clark, H. H., and Clark, E. V. Psychology and Language: An Introduction to Psycholinguistics. London: Harcourt Brace Jovanovich (1977).
- (1977) Psychology and Language: An Introduction to Psycholinguistics
- Clark, H.H.¹ Clark, E.V.²

97
- 0004147298
- Oxford: Basil Blackwell
- Clark, J., and Yallop, C. An Introduction to Phonetics and Phonology. Oxford: Basil Blackwell (1990).
- (1990) An Introduction to Phonetics and Phonology
- Clark, J.¹ Yallop, C.²

98
- 85135268904
- Objective methods for evaluating synthetic intonation
- Clark, R., and Dusterhoff, K. Objective methods for evaluating synthetic intonation. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Clark, R.¹ Dusterhoff, K.²

99
- 84973381105
- Festival 2 - build your own general purpose unit selection speech synthesiser
- Clark, R. A. J., Richmond, K., and King, S. Festival 2 - build your own general purpose unit selection speech synthesiser. In 5th ISCA Workshop on Speech Synthesis (2004).
- (2004) 5th ISCA Workshop on Speech Synthesis
- Clark, R.¹ Richmond, K.² King, S.³

100
- 84958907242
- The geometry of phonological features
- Clements, G. N. The geometry of phonological features. Phonology Yearbook 2 (1985), pp. 225-252.
- (1985) Phonology Yearbook , vol.2 , pp. 225-252
- Clements, G.N.¹

101
- 0001514782
- Modeling coarticulation in synthetic visual speech
- Cohen, M. M., and Massaro, D. W. Modeling coarticulation in synthetic visual speech. In Models and Techniques in Computer Animation (1993), pp. 141-155.
- (1993) Models and Techniques in Computer Animation , pp. 141-155
- Cohen, M.M.¹ Massaro, D.W.²

102
- 85009168875
- Speculations on the future of speech technology research
- Cole, R. Roadmaps, journeys and destinations: Speculations on the future of speech technology research. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Cole, R.R.¹

103
- 79959826505
- Linguistic features weighting for a text-to-speech system without prosody model
- Colotte, V., and Beaufort, R. Linguistic features weighting for a text-to-speech system without prosody model. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Colotte, V.¹ Beaufort, R.²

104
- 0001921478
- A robust unit selection system for speech synthesis
- Conkie, A. A robust unit selection system for speech synthesis. In 137th Meeting of the Acoustical Society of America (1999).
- (1999) 137th Meeting of the Acoustical Society of America
- Conkie, A.¹

105
- 84871393401
- Optimal coupling of diphones
- Conkie, A. D., and Isard, S. Optimal coupling of diphones. In Proceedings of Eurospeech 1995 (1995).
- (1995) Proceedings of Eurospeech 1995
- Conkie, A.D.¹ Isard, S.²

106
- 0003677575
- The interconversion of audible and visible patterns as a basis for research in the perception of speech
- Cooper, F. S., Liberman, A. M., and Borst, J. M. The interconversion of audible and visible patterns as a basis for research in the perception of speech. Proceedings of the National Academy of Science 37, 5 (1951), 318-325.
- (1951) Proceedings of the National Academy of Science , vol.37 , Issue.5 , pp. 318-325
- Cooper, F.S.¹ Liberman, A.M.² Borst, J.M.³

107
- 0003950758
- Berlin: Springer-Verlag
- Cooper, W. E., and Sorensen, J. M. Fundamental Frequency in Sentence Production. Berlin: Springer-Verlag (1981).
- (1981) Fundamental Frequency in Sentence Production
- Cooper, W.E.¹ Sorensen, J.M.²

108
- 84985926077
- Segment selection in the lh realspeak laboratory tts system
- Coorman, G., Fackrell, J., Rutten, P., and Coile, B. V. Segment selection in the LH RealSpeak Laboratory TTS system. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Coorman, G.¹ Fackrell, J.² Rutten, P.³ Coile, B.V.⁴

109
- 0012236013
- Automatic modeling of duration in a spanish text-to-speech system using neural networks
- Cordoba, R., Vallejo, J. A., Montero, J. M. et al. Automatic modeling of duration in a Spanish text-to-speech system using neural networks. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Cordoba, R.¹ Vallejo, J.²

110
- 0038674461
- Theoretical approaches to emotion
- Cornelius, R. Theoretical approaches to emotion. In ISCA Workshop on Speech and Emotion (2000).
- (2000) ISCA Workshop on Speech and Emotion
- Cornelius, R.¹

111
- 34249753618
- Support-vector networks
- Cortes, C., and Vapnik, V. Support-vector networks. Machine Learning 20, 3 (1995), 273-297.
- (1995) Machine Learning , vol.20 , Issue.3 , pp. 273-297
- Cortes, C.¹ Vapnik, V.²

112
- 85032408336
- Multimodal databases of everyday emotion: Facing up to complexity
- Cowie, R., Devillers, L., Martin, J.-C. et al. Multimodal databases of everyday emotion: Facing up to complexity. In Proceedings of Eurospeech, Interspeech 2005 (2005).
- (2005) Proceedings of Eurospeech, Interspeech 2005
- Cowie, R.¹ Devillers, L.² Martin, J.-C.³

113
- 85032751766
- Emotion recognition in human-computer interaction
- Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N. et al. Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine (2001), 32-80.
- (2001) IEEE Signal Processing Magazine , pp. 32-80
- Cowie, R.¹ Douglas-Cowie, E.² Tsapatsoulis, N.³

114
- 0003798635
- Cambridge: Cambridge University Press
- Cristianini, N., and Shawe-Taylor, J. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge: Cambridge University Press (2000).
- (2000) An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods
- Cristianini, N.¹ Shawe-Taylor, J.²

115
- 0038243738
- Optimized stopping criteria for tree-based unit selection in concatenative synthesis
- Cronk, A., and Macon, M. Optimized stopping criteria for tree-based unit selection in concatenative synthesis. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Cronk, A.¹ Macon, M.²

116
- 0023419762
- A globally optimising formant tracker using generalised centroids
- Crowe, A., and Jack, M. A. A globally optimising formant tracker using generalised centroids. Electronics Letters 23 (1987), 1019-1020.
- (1987) Electronics Letters , vol.23 , pp. 1019-1020
- Crowe, A.¹ Jack, M.A.²

117
- 0003596359
- Cambridge: Cambridge University Press
- Crystal, D. Prosodic Systems and Intonation in English. Cambridge: Cambridge University Press (1969).
- (1969) Prosodic Systems and Intonation in English
- Crystal, D.¹

118
- 0029342671
- Automatic pitch contour stylization using a model of tonal perception
- d’Alessandro, C., and Mertens, P. Automatic pitch contour stylization using a model of tonal perception. In Computer Speech and Language (1995).
- (1995) Computer Speech and Language
- D’Alessandro, C.¹ Mertens, P.²

119
- 0032624182
- Forgetting exceptions is harmful in language learning
- Daelemans, W., Van Den Bosch, A., and Zavrel, J. Forgetting exceptions is harmful in language learning. Machine Learning 34, 1 (1999), 11-41.
- (1999) Machine Learning , vol.34 , Issue.1 , pp. 11-41
- Daelemans, W.¹ Van Den Bosch, A.² Zavrel, J.³

120
- 85009204356
- Voice quality modification for emotional speech synthesis
- d’Alessandro, C., and Doval, B. Voice quality modification for emotional speech synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- D’Alessandro, C.¹ Doval, B.²

121
- 0029342671
- Automatic pitch contour stylization using a model of tonal perception
- d’Alessandro, C., and Mertens, P. Automatic pitch contour stylization using a model of tonal perception. Computer Speech & Language 9, 3 (1995), 257-288.
- (1995) Computer Speech & Language , vol.9 , Issue.3 , pp. 257-288
- D’Alessandro, C.¹ Mertens, P.²

122
- 0039666139
- Pronunciation By Analogy: Impact of implementational choices on performance
- Damper, R., and Eastmond, J. Pronunciation by analogy: Impact of implementational choices on performance. Language and Speech 40, 1 (1997), 1-23.
- (1997) Language and Speech , vol.40 , Issue.1 , pp. 1-23
- Damper, R.¹ Eastmond, J.²

123
- 0033106614
- Evaluating the pronunciation component of text-to-speech systems for english: A performance comparison of different approaches
- Damper, R., Marchand, Y., Adamson, M., and Gustafson, K. Evaluating the pronunciation component of text-to-speech systems for English: A performance comparison of different approaches. Computer Speech and Language 13, 2 (1999), 155-176.
- (1999) Computer Speech and Language , vol.13 , Issue.2 , pp. 155-176
- Damper, R.¹ Marchand, Y.² Adamson, M.³

124
- 0003774595
- London: John Murray
- Darwin, C. The Expression of the Emotions in Man and Animals. London: John Murray (1872).
- (1872) The Expression of the Emotions in Man and Animals
- Darwin, C.¹

125
- 0002218499
- A sequential algorithm for training text classifiers
- Lewis, D. D., and Gale, W. A. A sequential algorithm for training text classifiers. In 17th ACM International Conference on Research and Development in Information Retrieval (1994).
- (1994) 17th ACM International Conference on Research and Development in Information Retrieval
- Lewis, D.D.¹ Gale, W.A.²

126
- 0002402445
- Auditory correlates of vocal expression of emotional feeling
- Davitz, J. Auditory correlates of vocal expression of emotional feeling. In The Communication of Emotional Meaning. New York: McGraw-Hill (1964).
- (1964) The Communication of Emotional Meaning. New York: McGraw-Hill
- Davitz, J.¹

127
- 0025796358
- Pronounce: A program for pronunciation by analogy
- Dedina, M., and Nusbaum, H. Pronounce: A program for pronunciation by analogy. Computer Speech & Language (Print) 5, 1 (1991), 55-64.
- (1991) Computer Speech & Language (Print) , vol.5 , Issue.1 , pp. 55-64
- Dedina, M.¹ Nusbaum, H.P.²

128
- 0003424145
- New York: John Wiley and Sons
- Deller, J. R., and Proakis, J. Discrete-Time Processing of Speech Signals. New York: John Wiley and Sons (2000).
- (2000) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.²

129
- 85009211881
- Tracking vocal tract resonances using an analytical nonlinear predictor and a target guided temporal constraint
- Deng, L., Bazzi, I., and Acero, A. Tracking vocal tract resonances using an analytical nonlinear predictor and a target guided temporal constraint. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Deng, L.¹ Bazzi, I.² Acero, A.³

130
- 84979899767
- Prosodic cues for emotion characterization in real-life spoken dialogs
- Devillers, L., and Vasilescu, I. Prosodic cues for emotion characterization in real-life spoken dialogs. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Devillers, L.¹ Vasilescu, I.²

131
- 85032421967
- A neural network approach for the design of the target cost function in unit-selection speech synthesis
- Diaz, F. C., Alba, J. L., and Banga, E. R. A neural network approach for the design of the target cost function in unit-selection speech synthesis. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Diaz, F.C.¹ Alba, J.L.² Banga, E.R.³

132
- 14844352803
- Alignment of l and h in bitonal pitch accents: Testing two hypotheses
- Dilley, L., Ladd, D., and Schepman, A. Alignment of L and H in bitonal pitch accents: Testing two hypotheses. Journal of Phonetics 33, 1 (2005), 115-119.
- (2005) Journal of Phonetics , vol.33 , Issue.1 , pp. 115-119
- Dilley, L.¹ Ladd, D.² Schepman, A.³

133
- 0034854701
- Trainable speech with trended hidden markov models
- Dines, J., and Sridharan, S. Trainable speech with trended hidden Markov models. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2001 (2001).
- (2001) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2001
- Dines, J.¹ Sridharan, S.²

134
- 85009143690
- Application of the trended hidden markov model to speech synthesis
- Dines, J., Sridharan, S., and Moody, M. Application of the trended hidden Markov model to speech synthesis. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Dines, J.¹ Sridharan, S.² Moody, M.³

135
- 85041486134
- Optimising unit selection with voice source and formants in the chatr speech synthesis system
- Ding, W., and Campbell, N. Optimising unit selection with voice source and formants in the Chatr speech synthesis system. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Ding, W.¹ Campbell, N.²

136
- 0012266740
- A computational grammar of discourse-neutral prosodic phrasing in english
- Divay, M., and Vitale, A. J. A computational grammar of discourse-neutral prosodic phrasing in English. Computational Linguistics 23, 4 (1997), 495-523.
- (1997) Computational Linguistics , vol.23 , Issue.4 , pp. 495-523
- Divay, M.¹ Vitale, A.J.²

137
- 0002869769
- The study of natural phonology
- D. Dinnsen, Ed. Indiana: Indiana University Press
- Donegan, P. J., and Stampe, D. The study of natural phonology. In Current Approaches to Phonological Theory, D. Dinnsen, Ed. Indiana: Indiana University Press (1979), pp. 126-173.
- (1979) Current Approaches to Phonological Theory , pp. 126-173
- Donegan, P.J.¹ Stampe, D.²

138
- 84944962517
- The ibm trainable speech synthesis system
- Donovan, R. E., and Eide, E. The IBM trainable speech synthesis system. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Donovan, R.E.¹ Eide, E.²

139
- 0032665403
- Phrase splicing and variable substitution using the ibm trainable speech synthesis system
- Donovan, R. E., Franz, M., Sorensen, J. S., and Roukos, S. Phrase splicing and variable substitution using the IBM trainable speech synthesis system. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1999 (1999), pp. 373-376.
- (1999) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1999 , pp. 373-376
- Donovan, R.E.¹ Franz, M.² Sorensen, J.S.³ Roukos, S.⁴

140
- 0028996983
- Automatic speech synthesiser parameter estimation using hmms
- Donovan, R. E., and Woodland, P. C. Automatic speech synthesiser parameter estimation using HMMs. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1995 (1995).
- (1995) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1995
- Donovan, R.E.¹ Woodland, P.C.²

141
- 85128419522
- Maximum a posteriori pitch tracking
- Droppo, J., and Acero, A. Maximum a posteriori pitch tracking. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1998 (1998).
- (1998) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1998
- Droppo, J.¹ Acero, A.²

142
- 0003922190
- New York: John Wiley and Sons
- Duda, R. O., Hart, P. E., and Stork, D. G. Pattern Classification. New York: John Wiley and Sons (2000).
- (2000) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

143
- 84942494747
- Remaking speech
- Dudley, H. Remaking speech. Journal of the Acoustical Society of America 11 (1939), 169-177.
- (1939) Journal of the Acoustical Society of America , vol.11 , pp. 169-177
- Dudley, H.¹

144
- 85032410801
- Dependency and non-linear phonology
- Durand, J. Dependency and Non-Linear Phonology. Croom Helm (1986).
- (1986) Croom Helm
- Durand, J.¹

145
- 85032411920
- Explorations in dependency phonology
- Durand, J., and Anderson, J. Explorations in Dependency Phonology. Foris (1987).
- (1987) Foris
- Durand, J.¹ Anderson, J.²

146
- 85032407598
- Generating f0 contours for speech synthesis using the tilt intonation theory
- Dusterhoff, K., and Black, A. Generating f0 contours for speech synthesis using the tilt intonation theory. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Dusterhoff, K.¹ Black, A.²

147
- 84860832249
- Using decision trees within the tilt intonation model to predict f0 contours
- Dusterhoff, K. E., Black, A. W., and Taylor, P. Using decision trees within the tilt intonation model to predict F0 contours. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Dusterhoff, K.E.¹ Black, A.W.² Taylor, P.³

148
- 0003834176
- Dordrecht: Kluwer Academic Publishers
- Dutoit, T. An Introduction to Text to Speech Synthesis. Dordrecht: Kluwer Academic Publishers (1997).
- (1997) An Introduction to Text to Speech Synthesis
- Dutoit, T.¹

149
- 0027839344
- Text-to-speech synthesis based on an mbe re-synthesis of the segments database
- Dutoit, T., and Leich, H. Text-to-speech synthesis based on an MBE re-synthesis of the segments database. Speech Communication 13 (1993), 435-440.
- (1993) Speech Communication , vol.13 , pp. 435-440
- Dutoit, T.¹ Leich, H.²

150
- 0141702290
- Recent improvements to the ibm trainable speech synthesis system
- Eide, E., Aaron, A., Bakis, R. et al. Recent improvements to the IBM trainable speech synthesis system. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2003 (2003).
- (2003) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2003
- Eide, E.¹ Aaron, A.² Bakis, R.³

151
- 33947635494
- A corpus-based approach to ahem expressive speech synthesis
- Eide, E., Aaron, A., Bakis, R., Hamza, W., and Picheny, M. J. A corpus-based approach to AHEM expressive speech synthesis. In Proceedings of the 5th ISCA Workshop on Speech Synthesis (2005).
- (2005) Proceedings of the 5th ISCA Workshop on Speech Synthesis
- Eide, E.¹ Aaron, A.² Bakis, R.³ Hamza, W.⁴ Picheny, M.J.⁵

152
- 29344435444
- Universals and cultural differences in facial expressions of emotion
- Ekman, P. Universals and cultural differences in facial expressions of emotion. In Nebraska Symposium on Motivation. University of Nebraska Press (1972).
- (1972) Nebraska Symposium on Motivation. University of Nebraska Press
- Ekman, P.¹

153
- 33646772493
- New York: John Wiley
- Ekman, P. Basic Emotions. In The Handbook of Cognition and Emotion. New York: John Wiley (1999).
- (1999) Basic Emotions. In The Handbook of Cognition and Emotion
- Ekman, P.¹

154
- 0004167520
- London: Consulting Psychologists Press
- Ekman, P., and Friesen, W. V. Unmasking the Face. London: Consulting Psychologists Press (1975).
- (1975) Unmasking the Face
- Ekman, P.¹ Friesen, W.V.²

155
- 0017269304
- Letter-to-sound rules for automatic translation of english text to phonetics
- Elovitz, H. S., Johnson, R., McHugh, A., and Shore, J. Letter-to-sound rules for automatic translation of English text to phonetics. IEEE Transactions on Acoustics, Speech, and Signal Processing 24 (1976), 446-459.
- (1976) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.24 , pp. 446-459
- Elovitz, H.S.¹ Johnson, R.² McHugh, A.³ Shore, J.⁴

156
- 0002540664
- The patterns of silence: Performance structures in sentence production
- Grosjean, F., Grosjean, L., and Lane, H. The patterns of silence: Performance structures in sentence production. Cognitive Psychology 11 (1979), 58-81.
- (1979) Cognitive Psychology , vol.11 , pp. 58-81
- Grosjean, F.¹ Grosjean, L.² Lane, H.³

157
- 85135273903
- Multilingual prosody modelling using cascades of regression trees and neural networks
- Fackrell, J. W. A., Vereecken, H., Martens, J. P., and Coile, B. V. Multilingual prosody modelling using cascades of regression trees and neural networks. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Fackrell, J.¹ Vereecken, H.² Martens, J.P.³ Coile, B.V.⁴

158
- 0003418124
- The Hague: Mouton
- Fant, G. Acoustic Theory of Speech Production. The Hague: Mouton (1960).
- (1960) Acoustic Theory of Speech Production
- Fant, G.¹

159
- 84928451959
- Glottal flow: Models and interaction
- Fant, G. Glottal flow: models and interaction. Journal of Phonetics 14 (1986), 393-399.
- (1986) Journal of Phonetics , vol.14 , pp. 393-399
- Fant, G.¹

160
- 0000764772
- The use of multiple measures in taxonomic problems
- Fisher, R. A. The use of multiple measures in taxonomic problems. Annals of Eugenics 7 (1936), 179-188.
- (1936) Annals of Eugenics , vol.7 , pp. 179-188
- Fisher, R.A.¹

161
- 85032419628
- The generation of regional pronunciations of english for speech synthesis
- Fitt, S., and Isard, S. The generation of regional pronunciations of English for speech synthesis. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Fitt, S.¹ Isard, S.²

162
- 84902066232
- Representing the environments for phonological processes in an accent-independent lexicon for synthesis of english
- Fitt, S., and Isard, S. Representing the environments for phonological processes in an accent-independent lexicon for synthesis of English. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Fitt, S.¹ Isard, S.²

163
- 85032418675
- The treatment of vowels preceding ‘r’ in a keyword lexicon of english
- Fitt, S., and Isard, S. The treatment of vowels preceding ‘r’ in a keyword lexicon of English. In Proceedings of ICPhS 99 (1999).
- (1999) Proceedings of ICPhS 99
- Fitt, S.¹ Isard, S.²

164
- 0003757962
- Berlin: Springer-Verlag
- Flanagan, J. L. Speech Analysis, Synthesis and Perception. Berlin: Springer-Verlag (1972).
- (1972) Speech Analysis, Synthesis and Perception
- Flanagan, J.L.¹

165
- 85011187169
- Analysis of voice fundamental frequency contours for declarative sentences of japanese
- Fujisaki, H., and Hirose, K. Analysis of voice fundamental frequency contours for declarative sentences of Japanese. Journal of the Acoustical Society of Japan 5, 4 (1984), 233-241.
- (1984) Journal of the Acoustical Society of Japan , vol.5 , Issue.4 , pp. 233-241
- Fujisaki, H.¹ Hirose, K.²

166
- 0010987926
- Modeling the dynamic characteristics of voice fundamental frequency with applications to analysis and synthesis of intonation
- Fujisaki, H., and Kawai, H. Modeling the dynamic characteristics of voice fundamental frequency with applications to analysis and synthesis of intonation. In Working Group on Intonation, 13th International Congress of Linguists (1982).
- (1982) Working Group on Intonation, 13th International Congress of Linguists
- Fujisaki, H.¹ Kawai, H.²

167
- 0023799410
- Realization of linguistic information in the voice fundamental frequency contour of the spoken japanese
- Fujisaki, H., and Kawai, H. Realization of linguistic information in the voice fundamental frequency contour of the spoken Japanese. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1988 (1988).
- (1988) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1988
- Fujisaki, H.¹ Kawai, H.²

168
- 0004072715
- New York: Marcel Dekker
- Furui, S. Digital Speech Processing, Synthesis and Recognition. New York: Marcel Dekker (2001).
- (2001) Digital Speech Processing, Synthesis and Recognition
- Furui, S.¹

169
- 0003128462
- Using bilingual materials to develop word sense disambiguation methods
- Gale, W. A., Church, K. W., and Yarowsky, D. Using bilingual materials to develop word sense disambiguation methods. In International Conference on Theoretical and Methodological Issues in Machine Translation (1992), pp. 101-112.
- (1992) International Conference on Theoretical and Methodological Issues in Machine Translation , pp. 101-112
- Gale, W.A.¹ Church, K.W.² Yarowsky, D.³

170
- 33646821390
- Development of the cu-htk 2004 mandarin conversational telephone speech transcription system
- Gales, M. J. F., Jia, B., Liu, X. et al. Development of the CU-HTK 2004 Mandarin conversational telephone speech transcription system. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2005 (2005).
- (2005) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2005
- Gales, M.¹ Jia, B.² Liu, X.³

171
- 34047266379
- Progress in the cu-htk broadcast news transcription system
- Gales, M. J. F., Kim, D. Y., Woodland, P. C. et al. Progress in the CU-HTK Broadcast News transcription system. IEEE Transactions on Audio, Speech, and Language Processing 14, 5 (2006), 1513-1525.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.5 , pp. 1513-1525
- Gales, M.¹ Kim, D.Y.² Woodland, P.C.³

172
- 85009238150
- Name pronunciation with a joint n-gram model for bidirectional grapheme-to-phoneme conversion
- Galescu, L., and Allen, J. Name pronunciation with a joint N-gram model for bidirectional grapheme-to-phoneme conversion. Proceedings of ICSLP (2002), pp. 109-112.
- (2002) Proceedings of ICSLP , pp. 109-112
- Galescu, L.¹ Allen, J.²

173
- 0003548585
- The darpa-timit acoustic-phonetic continuous speech corpus
- Garofolo, J. S., Lamel, L. F., Fisher, W. M. et al. The DARPA-TIMIT acoustic-phonetic continuous speech corpus. Technical report, US Department of Commerce, Gaithersburg, MD (CD-ROM, 1990).
- (1990) The DARPA-TIMIT Acoustic-Phonetic Continuous Speech Corpus. Technical Report, US Department of Commerce
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³

174
- 0003764625
- Cambridge: Cambridge University Press
- Giegerich, H. J. English Phonology: An Introduction. Cambridge: Cambridge University Press (1992).
- (1992) English Phonology: An Introduction
- Giegerich, H.J.¹

175
- 0001518475
- The organization and activation of orthographic knowledge in reading aloud
- Glushko, R. The organization and activation of orthographic knowledge in reading aloud. Journal of Experimental Psychology: Human Perception and Performance 5, 4 (1979), 674-691.
- (1979) Journal of Experimental Psychology: Human Perception and Performance , vol.5 , Issue.4 , pp. 674-691
- Glushko, R.¹

176
- 84891583348
- New York: John Wiley and Sons
- Gold, B., and Morgan, N. Speech and Audio Signal Processing: Processing and Perception of Speech and Music. New York: John Wiley and Sons (1999).
- (1999) Speech and Audio Signal Processing: Processing and Perception of Speech and Music
- Gold, B.¹ Morgan, N.²

177
- 84863883293
- Chicago, IL: University of Chicago Press
- Goldberg, A. Constructions: A Construction Grammar Approach to Argument Structure. Chicago, IL: University of Chicago Press (1995).
- (1995) Constructions: A Construction Grammar Approach to Argument Structure
- Goldberg, A.¹

178
- 0004077483
- Oxford: Clarendon Press
- Goldfarb, C. F. The SGML Handbook. Oxford: Clarendon Press (1990).
- (1990) The SGML Handbook
- Goldfarb, C.F.¹

179
- 84895711707
- An overview of autosegmental phonology
- Goldsmith, J. An overview of autosegmental phonology. Linguistic Analysis 2, 1 (1976), 23-68.
- (1976) Linguistic Analysis , vol.2 , Issue.1 , pp. 23-68
- Goldsmith, J.¹

180
- 0003773641
- Oxford: Blackwell
- Goldsmith, J. Autosegmental and Metrical Phonology. Oxford: Blackwell (1990).
- (1990) Autosegmental and Metrical Phonology
- Goldsmith, J.¹

181
- 0029292169
- Classification of methods used for the assessment of text-to-speech systems according to the demands placed on the listener
- Goldstein, M. Classification of methods used for the assessment of text-to-speech systems according to the demands placed on the listener. Speech Communication 16 (1995), 225-244.
- (1995) Speech Communication , vol.16 , pp. 225-244
- Goldstein, M.¹

182
- 85032409161
- Predicting consonant duration with bayesian belief networks
- Goubanova, O., and King, S. Predicting consonant duration with Bayesian belief networks. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Goubanova, O.¹ King, S.²

183
- 0026940107
- The use of speech synthesis in exploring different speaking styles
- Granstrom, B. The use of speech synthesis in exploring different speaking styles. Speech Communication 11, 4-5 (1992), 347-355.
- (1992) Speech Communication , vol.11 , Issue.4-5 , pp. 347-355
- Granstrom, B.¹

184
- 0000534475
- Logic and conversation
- P. Cole and J. Morgan, Eds. New York: Academic Press
- Grice, H. P. Logic and conversation. In Syntax and Semantics: Speech Acts, P. Cole and J. Morgan, Eds. New York: Academic Press (1975), vol. 3, pp. 41-58.
- (1975) Syntax and Semantics: Speech Acts , vol.3 , pp. 41-58
- Grice, H.P.¹

185
- 0024060644
- Multiband excitation vocoder
- Griffin, D., and Lim, J. Multiband excitation vocoder. IEEE Transactions on Acoustics, Speech, and Signal Processing 36, 8 (1988).
- (1988) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.36 , Issue.8
- Griffin, D.¹ Lim, J.²

186
- 0023304697
- Prosodic structure and spoken word recognition
- Grosjean, F., and Gee, J. P. Prosodic structure and spoken word recognition. Cognition, 156 (1987).
- (1987) Cognition , pp. 156
- Grosjean, F.¹ Gee, J.P.²

187
- 6344250064
- Designing prosodic databases for automatic modelling in 6 languages
- Grover, C., Fackrell, J., Vereecken, H., Martens, J., and Van Coile, B. Designing prosodic databases for automatic modelling in 6 languages. In Proceedings of ICSLP 1998 (1998).
- (1998) Proceedings of ICSLP 1998
- Grover, C.¹ Fackrell, J.² Vereecken, H.³ Martens, J.⁴ Van Coile, B.⁵

188
- 84987791079
- The august spoken dialogue system
- Gustafson, J., Lindberg, N., and Lundeberg, M. The August spoken dialogue system. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Gustafson, J.¹ Lindberg, N.² Lundeberg, M.³

189
- 0032673049
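- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds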
- Kawahara, I. M.-K., and de Cheveigne, A. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Communication 27, 187-207.
- Speech Communication , vol.27 , pp. 187-207
- Kawahara, I.M.¹ De Cheveigne, A.²

190
- 85009288219
- CU VOCAL: Corpus-based syllable concatenation for chinese speech synthesis across domains and dialects
- Meng, C. K. K., Siu, T. Y. F., and Ching, P. C. CU VOCAL: Corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Meng, C.¹ Siu, T.² Ching, P.C.³

191
- 34547326497
- A hybrid approach for grapheme-to-phoneme conversion based on a combination of partial string matching and a neural network
- Hain, H.-U. A hybrid approach for grapheme-to-phoneme conversion based on a combination of partial string matching and a neural network. In Proceedings of the International Conference on Speech and Language Processing (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing
- Hain, H.-U.¹

192
- 27744599401
- Automatic transcription of conversational telephone speech - development of the cu-htk 2002 system
- Hain, T., Woodland, P. C., Evermann, G. et al. Automatic transcription of conversational telephone speech - development of the CU-HTK 2002 system. IEEE Transactions on Audio, Speech, and Language Processing (2005).
- (2005) IEEE Transactions on Audio, Speech, and Language Processing
- Hain, T.¹ Woodland, P.C.² Evermann, G.³

193
- 85032406374
- Intonation and grammar in british english
- Halliday, M. A. Intonation and Grammar in British English. Mouton (1967).
- (1967) Mouton
- Halliday, M.A.¹

194
- 0024906968
- A diphone synthesis system based on time-domain modifications of speech
- Hamon, C., Moulines, E., and Charpentier, F. A diphone synthesis system based on time-domain modifications of speech. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing 1989 (1989).
- (1989) Proceedings of International Conference on Acoustics, Speech, and Signal Processing 1989
- Hamon, C.¹ Moulines, E.² Charpentier, F.³

195
- 56149096472
- The ibm expressive speech synthesis system
- Hamza, W., Bakis, R., Eide, E. M., Picheny, M. A., and Pitrelli, J. F. The IBM expressive speech synthesis system. In Proceedings of the International Conference on Spoken Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Spoken Language Processing 2004
- Hamza, W.¹ Bakis, R.² Eide, E.M.³ Picheny, M.A.⁴ Pitrelli, J.F.⁵

196
- 85032414131
- On building a concatenative speech synthesis system from the blizzard challenge speech databases
- Hamza, W., Bakis, R., Shuang, Z. W., and Zen, H. On building a concatenative speech synthesis system from the blizzard challenge speech databases. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Hamza, W.¹ Bakis, R.² Shuang, Z.W.³ Zen, H.⁴

197
- 0034854642
- A quantitative method for modelling context in concatenative synthesis using large speech database
- Hamza, W., Rashwan, M., and Afify, M. A quantitative method for modelling context in concatenative synthesis using large speech database. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 2001 (2001).
- (2001) Proceedings of the International Conference on Acoustics Speech and Signal Processing 2001
- Hamza, W.¹ Rashwan, M.² Afify, M.³

198
- 85032403628
- Letter-to-sound for small-footprint multilingual tts engine
- Han, K., and Chen, G. Letter-to-sound for small-footprint multilingual TTS engine. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Han, K.¹ Chen, G.²

199
- 0004221439
- London: Routledge
- Harris, R. Signs of Writing. London: Routledge (1996).
- (1996) Signs of Writing
- Harris, R.¹

200
- 85009062928
- Transformation-based learning of danish stress assignment
- Henrichsen, P. J. Transformation-based learning of Danish stress assignment. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Henrichsen, P.J.¹

201
- 0348211761
- Cambridge: Cambridge University Press
- Hertz, S. R. Papers in Laboratory Phonology I: Between the Grammar and the Physics of Speech. Cambridge: Cambridge University Press (1990).
- (1990) Papers in Laboratory Phonology I: Between the Grammar and the Physics of Speech
- Hertz, S.R.¹

202
- 0003391579
- Berlin: Springer-Verlag
- Hess, W. Pitch Determination of Speech Signals. Berlin: Springer-Verlag (1983).
- (1983) Pitch Determination of Speech Signals
- Hess, W.¹

203
- 85009080408
- Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis
- Hirai, T., Tenpaku, A., and Shikano, K. Manipulating speech pitch periods according to optimal insertion/deletion position in residual signal for intonation control in speech synthesis. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Hirai, T.¹ Tenpaku, A.² Shikano, K.³

204
- 34547576827
- Using 5 ms segments in concatenative speech synthesis
- Hirai, T., and Tenpaku, S. Using 5 ms segments in concatenative speech synthesis. In 5th ISCA Workshop on Speech Synthesis (2005).
- (2005) 5th ISCA Workshop on Speech Synthesis
- Hirai, T.¹ Tenpaku, S.²

205
- 85009291529
- Improved corpus-based synthesis of fundamental frequency contours using generation process model
- Hirose, K., Eto, M., and Minematsu, N. Improved corpus-based synthesis of fundamental frequency contours using generation process model. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2002
- Hirose, K.¹ Eto, M.² Minematsu, N.³

206
- 21844459140
- Corpus-based synthesis of fundamental frequency contours based on a generation process model
- Hirose, K., Eto, M., Minematsu, N., and Sakurai, A. Corpus-based synthesis of fundamental frequency contours based on a generation process model. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Hirose, K.¹ Eto, M.² Minematsu, N.³ Sakurai, A.⁴

207
- 0027684991
- Pitch accent in context: Predicting intonational prominence from text
- Hirschberg, J. Pitch accent in context: Predicting intonational prominence from text. Artificial Intelligence 63 (1993), 305-340.
- (1993) Artificial Intelligence , vol.63 , pp. 305-340
- Hirschberg, J.¹

208
- 0036027583
- Communication and prosody: Functional aspects of prosody
- Hirschberg, J. Communication and prosody: Functional aspects of prosody. Speech Communication 36 (2002), 31-43.
- (2002) Speech Communication , vol.36 , pp. 31-43
- Hirschberg, J.C.¹

209
- 84871623487
- Learning prosodic features using a tree representation
- Hirschberg, J., and Rambow, O. Learning prosodic features using a tree representation. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Hirschberg, J.¹ Rambow, O.²

210
- 85009080541
- Comparing static and dynamic features for segmental cost function calculation in concatenative speech synthesis
- Hirschfeld, D. Comparing static and dynamic features for segmental cost function calculation in concatenative speech synthesis. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Hirschfeld, D.¹

211
- 17444384443
- Automatic analysis of prosody for multilingual speech corpora
- Hirst, D. Automatic analysis of prosody for multilingual speech corpora. In Improvements in Speech Synthesis. Chichester: Wiley (2001).
- (2001) Improvements in Speech Synthesis. Chichester: Wiley
- Hirst, D.¹

212
- 0003665661
- Cambridge: Cambridge University Press
- Hirst, D., and Di Cristo, A. Intonation Systems: A Survey of Twenty Languages. Cambridge: Cambridge University Press (1998).
- (1998) Intonation Systems: A Survey of Twenty Languages
- Hirst, D.¹ Di Cristo, A.²

213
- 0142241561
- Dordrecht: Kluwer Academic Publishers
- Hirst, D., Di Cristo, A., and Espesser, R. Levels of representation and levels of analysis for the description of intonation systems. Prosody: Theory and Experiment. Dordrecht: Kluwer Academic Publishers (2000).
- (2000) Levels of Representation and Levels of Analysis for the Description of Intonation Systems. Prosody: Theory and Experiment
- Hirst, D.¹ Di Cristo, A.² Espesser, R.³

214
- 72949135793
- The origin of speech
- Hockett, C. F. The origin of speech. Scientific American 203 (1960), 88-96.
- (1960) Scientific American , vol.203 , pp. 88-96
- Hockett, C.F.¹

215
- 33645587401
- Data driven formant synthesis
- Hogberg, J. Data driven formant synthesis. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Hogberg, J.¹

216
- 85009083816
- Generating prosody by superposing multi-parametric overlapping contours
- Holm, B., and Bailly, G. Generating prosody by superposing multi-parametric overlapping contours. Proceedings of the International Conference on Speech and Language Processing (2000), pp. 203-206.
- (2000) Proceedings of the International Conference on Speech and Language Processing , pp. 203-206
- Holm, B.¹ Bailly, G.²

217
- 21844457336
- Learning the hidden structure of intonation: Implementing various functions of prosody
- Holm, B., and Bailly, G. Learning the hidden structure of intonation: Implementing various functions of prosody. In Speech Prosody (2002), 399-402.
- (2002) Speech Prosody , pp. 399-402
- Holm, B.¹ Bailly, G.²

218
- 0015699693
- The influence of the glottal waveform on the naturalness of speech from a parallel formant synthesizer
- Holmes, J. N. The influence of the glottal waveform on the naturalness of speech from a parallel formant synthesizer. IEEE Transactions on Audio Electroacoustics 21 (1980), 298-305.
- (1980) IEEE Transactions on Audio Electroacoustics , vol.21 , pp. 298-305
- Holmes, J.N.¹

219
- 84964175806
- Speech synthesis by rule
- Holmes, J. N., Mattingly, I. G., and Shearme, J. N. Speech synthesis by rule. Language and Speech 7 (1964), 127-143.
- (1964) Language and Speech , vol.7 , pp. 127-143
- Holmes, J.N.¹ Mattingly, I.G.² Shearme, J.N.³

220
- 0031642265
- Automatic generation of synthesis units for trainable text-to-speech systems
- Hon, H., Acero, A., Huang, X., Liu, J., and Plumpe, M. Automatic generation of synthesis units for trainable text-to-speech systems. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1998 (1998).
- (1998) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 1998
- Hon, H.¹ Acero, A.² Huang, X.³ Liu, J.⁴ Plumpe, M.⁵

221
- 0001562208
- Articulation-Testing Methods: Consonantal differentiation with a closed-response set
- House, A., Williams, C., Hecker, M., and Kryter, K. Articulation-testing methods: Consonantal differentiation with a closed-response set. The Journal of the Acoustical Society of America 37 (1965), 158.
- (1965) The Journal of the Acoustical Society of America , vol.37 , pp. 158
- House, A.¹ Williams, C.² Hecker, M.³ Kryter, K.⁴

222
- 85032418829
- Duration embedded bi-hmm for expressive voice conversion
- Hsia, C.-C., We, C. H., and Liu, T.-H. Duration embedded bi-HMM for expressive voice conversion. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Hsia, C.-C.¹ We, C.H.² Liu, T.-H.³

223
- 85032407217
- System and method for performing a grapheme-to-phoneme conversion
- Huang, J., Abrego, G., and Olorenshaw, L. System and method for performing a grapheme-to-phoneme conversion. In Proceedings of Interspeech 2006 (2006).
- (2006) Proceedings of Interspeech 2006
- Huang, J.¹ Abrego, G.² Olorenshaw, L.³

224
- 0004056285
- Englewood Cliffs, NJ: Prentice-Hall
- Huang, X., Acero, A., and Hon, H.-W. Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Englewood Cliffs, NJ: Prentice-Hall (2001).
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

225
- 84925057379
- Speech synthesis, speech simulation and speech science
- Huckvale, M. Speech synthesis, speech simulation and speech science. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002), pp. 1261-1264.
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002 , pp. 1261-1264
- Huckvale, M.¹

226
- 84925903345
- Phonological rules for a text-to-speech system
- Hunnicut, S. Phonological rules for a text-to-speech system. American Journal of Computational Linguistics 57 (1976), 1-72.
- (1976) American Journal of Computational Linguistics , vol.57 , pp. 1-72
- Hunnicut, S.¹

227
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- Hunt, A. J., and Black, A. W. Unit selection in a concatenative speech synthesis system using a large speech database. In Proceedings of the International Conference on Speech and Language Processing 1996 (1996), pp. 373-376.
- (1996) Proceedings of the International Conference on Speech and Language Processing 1996 , pp. 373-376
- Hunt, A.J.¹ Black, A.W.²

228
- 70450167047
- Issues in high quality lpc analysis and synthesis
- Hunt, M., Zwierynski, D., and Carr, R. Issues in high quality LPC analysis and synthesis. In Proceedings of Eurospeech 1989 (1989), pp. 348-351.
- (1989) Proceedings of Eurospeech 1989 , pp. 348-351
- Hunt, M.¹ Zwierynski, D.² Carr, R.³

229
- 0020596154
- Cepstral analysis synthesis on the mel frequency scale
- Imai, S. Cepstral analysis synthesis on the mel frequency scale. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 1983 (1983), pp. 93-96.
- (1983) Proceedings of the International Conference on Acoustics Speech and Signal Processing 1983 , pp. 93-96
- Imai, S.¹

230
- 33644609617
- Emotive Alert: Hmm-based emotion detection in voicemail messages
- Inanoglu, Z., and Caneel, R. Emotive alert: HMM-based emotion detection in voicemail messages. In Proceedings of the 10th International Conference on Intelligent User Interfaces (2005), pp. 251-253.
- (2005) Proceedings of the 10th International Conference on Intelligent User Interfaces , pp. 251-253
- Inanoglu, Z.¹ Caneel, R.²

231
- 85032405743
- A repertoire of british english contours for speech synthesis
- Isard, S. D., and Pearson, M. A repertoire of British English contours for speech synthesis. In Proceedings of the 7th FASE Symposium, Speech 1988 (1988).
- (1988) Proceedings of the 7th FASE Symposium, Speech 1988
- Isard, S.D.¹ Pearson, M.²

232
- 85032424085
- Model adaptation and adaptive training using esat algorithm for hmm-based speech synthesis
- Isogai, J., Yamagishi, J., and Kobayashi, T. Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Isogai, J.¹ Yamagishi, J.² Kobayashi, T.³

233
- 0014698310
- Analysis synthesis telephony based on the maximum likelihood method
- Itakura, F., and Saito, S. Analysis synthesis telephony based on the maximum likelihood method. In Reports of the 6th International Conference on Acoustics (1968).
- (1968) Reports of the 6th International Conference on Acoustics
- Itakura, F.¹ Saito, S.²

234
- 0027699809
- Speech segment selection for concatenative synthesis based on spectral distortion minimization
- Iwahashi, N., Kaiki, N., and Sagisaka, Y. Speech segment selection for concatenative synthesis based on spectral distortion minimization. Transactions of the Institute of Electronics, Information and Communication Engineers E76A (1993), 1942-1948.
- (1993) Transactions of the Institute of Electronics, Information and Communication Engineers , vol.E76A , pp. 1942-1948
- Iwahashi, N.¹ Kaiki, N.² Sagisaka, Y.³

235
- 85009152114
- Automatic segmentation for czech concatenative speech synthesis using statistical approach with boundary-specific correction
- Matousek, D. T., and Psutka, J. Automatic segmentation for Czech concatenative speech synthesis using statistical approach with boundary-specific correction. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Matousek, D.T.¹ Psutka, J.²

236
- 85032396493
- Hybrid syllable/triphone speech synthesis
- Matousek, Z. H., and Tihelka, D. Hybrid syllable/triphone speech synthesis. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Matousek, Z.H.¹ Tihelka, D.²

237
- 0016939124
- Continuous speech recognition by statistical methods
- Jelinek, F. Continuous speech recognition by statistical methods. Proceedings of the IEEE 64 (1976), 532-556.
- (1976) Proceedings of the IEEE , vol.64 , pp. 532-556
- Jelinek, F.¹

238
- 85032398983
- Talk
- Jelinek, F. Talk. In Workshop on Evaluation of NLP Systems, Wayne, Pennsylvania (1988).
- (1988) Workshop on Evaluation of NLP Systems, Wayne, Pennsylvania
- Jelinek, F.¹

239
- 0003786003
- Cambridge, MA: MIT Press
- Jelinek, F. Statistical Methods for Speech Recognition. Cambridge, MA: MIT Press (1998).
- (1998) Statistical Methods for Speech Recognition
- Jelinek, F.¹

240
- 85009143810
- Self-organizing letter code-book for text-to-phoneme neural network model
- Jensen, K. J., and Riis, S. Self-organizing letter code-book for text-to-phoneme neural network model. In Proceedings of the International Conference on Speech and Language Processing (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing
- Jensen, K.J.¹ Riis, S.²

241
- 0345580812
- Rules for the generation of tobi-based american english intonation
- Jilka, M., Mohler, G., and Dogil, G. Rules for the generation of ToBI-based American English intonation. In Speech Communication 28 (1999), 83-108.
- (1999) Speech Communication , vol.28 , pp. 83-108
- Jilka, M.¹ Mohler, G.² Dogil, G.³

242
- 0004030841
- 8th edn. Cambridge: Heffer & Sons
- Jones, D. An Outline of English Phonetics, 8th edn. Cambridge: Heffer & Sons (1957).
- (1957) An Outline of English Phonetics
- Jones, D.¹

243
- 0003847769
- Englewood Cliffs, NJ: Prentice-Hall
- Jurafsky, D., and Martin, J. H. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall (2000).
- (2000) Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
- Jurafsky, D.¹ Martin, J.H.²

244
- 0034841948
- Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
- Kain, A., and Macon, M. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 2001 (2001).
- (2001) Proceedings of the International Conference on Acoustics Speech and Signal Processing 2001
- Kain, A.¹ Macon, M.²

245
- 0009765464
- London: Longman
- Katamba, F. An Introduction to Phonology. London: Longman (1989).
- (1989) An Introduction to Phonology
- Katamba, F.¹

246
- 84876497245
- Gmmbase voice conversion applied to emotional speech synthesis
- Kawanami, H., Iwami, Y., Toda, T., Saruwatarai, H., and Shikano, K. GMMbase voice conversion applied to emotional speech synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Kawanami, H.¹ Iwami, Y.² Toda, T.³ Saruwatarai, H.⁴ Shikano, K.⁵

247
- 84946981738
- One process, not two, in reading aloud: Lexical analogies do the work of non-lexical rules
- Kay, J., and Marcel, A. One process, not two, in reading aloud: Lexical analogies do the work of non-lexical rules. Quarterly Journal of Experimental Psychology 33a (1981), 397-413.
- (1981) Quarterly Journal of Experimental Psychology , vol.33 , pp. 397-413
- Kay, J.¹ Marcel, A.²

248
- 84928223726
- The internal structure of phonological elements: A theory of charm and government
- Kaye, J., Lowenstamm, J., and Vergnaud, J. R. The internal structure of phonological elements: A theory of charm and government. Phonology Yearbook 2 (1985), pp. 305-328.
- (1985) Phonology Yearbook , vol.2 , pp. 305-328
- Kaye, J.¹ Lowenstamm, J.² Vergnaud, J.R.³

249
- 34249324336
- Designing very compact decision trees for grapheme-to-phoneme transcription
- Kienappel, A. K., and Kneser, R. Designing very compact decision trees for grapheme-to-phoneme transcription. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Kienappel, A.K.¹ Kneser, R.²

250
- 81155150210
- On the reduction of concatenation artefacts in diphone synthesis
- Klabbers, E., and Veldhuis, R. On the reduction of concatenation artefacts in diphone synthesis. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Klabbers, E.¹ Veldhuis, R.²

251
- 0035127353
- Reducing audible spectral discontinuities
- Klabbers, E., and Veldhuis, R. Reducing audible spectral discontinuities. IEEE Transactions on Speech and Audio Processing 9, 1 (2001), 39-51.
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.1 , pp. 39-51
- Klabbers, E.¹ Veldhuis, R.²

252
- 84855929084
- Acoustic theory of terminal analog speech synthesis
- Klatt, D. H. Acoustic theory of terminal analog speech synthesis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1972 (1972), vol. 1, pp. 131-135.
- (1972) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 131-135
- Klatt, D.H.¹

253
- 0015676852
- Interaction between two factors that influence vowel duration
- Klatt, D. H. Interaction between two factors that influence vowel duration. Journal of the Acoustical Society of America 5 (1973), 1102-1104.
- (1973) Journal of the Acoustical Society of America , vol.5 , pp. 1102-1104
- Klatt, D.H.¹

254
- 0018986665
- Software for a cascade/parallel formant synthesizer
- Klatt, D. H. Software for a cascade/parallel formant synthesizer. Journal of the Acoustical Society of America 67 (1980), 971-995.
- (1980) Journal of the Acoustical Society of America , vol.67 , pp. 971-995
- Klatt, D.H.¹

255
- 0023407575
- Review of text-to-speech conversion for english
- Klatt, D. H. Review of text-to-speech conversion for English. Journal of the Acoustical Society of America 82, 3 (1987), 793-850.
- (1987) Journal of the Acoustical Society of America , vol.82 , Issue.3 , pp. 793-850
- Klatt, D.H.¹

256
- 0003637864
- Amsterdam: Elsevier
- Kleijn, W. B., and Paliwal, K. K. Speech Coding and Synthesis. Amsterdam: Elsevier (1995).
- (1995) Speech Coding and Synthesis
- Kleijn, W.B.¹ Paliwal, K.K.²

257
- 0033719622
- Improving intonational phrasing with syntactic information
- Koehn, P., Abney, S., Hirschberg, J., and Collins, M. Improving intonational phrasing with syntactic information. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2000
- Koehn, P.¹ Abney, S.² Hirschberg, J.³ Collins, M.⁴

258
- 29744447077
- A model of german intonation
- K. J. Kohler, Ed. Kiel: Universitat Kiel
- Kohler, K. J. A model of German intonation. In Studies in German Intonation, K. J. Kohler, Ed. Kiel: Universitat Kiel (1991).
- (1991) Studies in German Intonation
- Kohler, K.J.¹

259
- 33744686495
- The perception of accents: Peak height versus peak position
- K. J. Kohler, Ed. Kiel: Universitat Kiel
- Kohler, K. J. The perception of accents: Peak height versus peak position. In Studies in German Intonation, K. J. Kohler, Ed. Kiel: Universitat Kiel (1991), pp. 72-96.
- (1991) Studies in German Intonation , pp. 72-96
- Kohler, K.J.¹

260
- 0006552509
- Terminal intonation patterns in single-accent utterances of german: Phonetics, phonology and semantics
- K. J. Kohler, Ed. Kiel: Universitat Kiel
- Kohler, K. J. Terminal intonation patterns in single-accent utterances of German: Phonetics, phonology and semantics. In Studies in German Intonation, K. J. Kohler, Ed. Kiel: Universitat Kiel (1991), pp. 53-71.
- (1991) Studies in German Intonation , pp. 53-71
- Kohler, K.J.¹

261
- 0028996842
- Celp coding based on mel cepstral analysis
- Koishida, K., Tokuda, K., and Imai, S. CELP coding based on mel cepstral analysis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1995 (1995).
- (1995) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1995
- Koishida, K.¹ Tokuda, K.² Imai, S.³

262
- 85009168667
- Evaluating and correcting phoneme segmentation for unit selection synthesis
- Kominek, J., Bennett, C., and Black, A. M. Evaluating and correcting phoneme segmentation for unit selection synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Kominek, J.¹ Bennett, C.² Black, A.M.³

263
- 85009091555
- A family-of-models approach to hmm-based segmentation for unit selection speech synthesis
- Kominek, J., and Black, A. W. A family-of-models approach to HMM-based segmentation for unit selection speech synthesis. In Proceedings of the International Conference on Spoken Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Spoken Language Processing 2004
- Kominek, J.¹ Black, A.W.²

264
- 85009064374
- Duration modeling for hindi text-to-speech synthesis system
- Krishna, N. S., Talukdar, P. P., Bali, K., and Ramakrishnam, A. G. Duration modeling for Hindi text-to-speech synthesis system. In Proceedings of the International Conference on Speech and Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Speech and Language Processing 2004
- Krishna, N.S.¹ Talukdar, P.P.² Bali, K.³ Ramakrishnam, A.G.⁴

265
- 0004100714
- Kubrick, S. 2001: A Space Odyssey (1968).
- (1968) 2001: A Space Odyssey
- Kubrick, S.¹

266
- 0034108523
- Phonological conditioning of peak alignment in rising pitch accents in dutch
- Ladd, D., Mennen, I., and Schepman, A. Phonological conditioning of peak alignment in rising pitch accents in Dutch. The Journal of the Acoustical Society of America 107 (2000), 2685.
- (2000) The Journal of the Acoustical Society of America , vol.107 , pp. 2685
- Ladd, D.¹ Mennen, I.² Schepman, A.³

267
- 0023715072
- Declination reset and the hierarchical organization of utterances
- Ladd, D. R. Declination reset and the hierarchical organization of utterances. Journal of the Acoustical Society of America 84, 2 (1988), 530-544.
- (1988) Journal of the Acoustical Society of America , vol.84 , Issue.2 , pp. 530-544
- Ladd, D.R.¹

268
- 84885471105
- Compound prosodic domains
- Ladd, D. R. Compound prosodic domains. Edinburgh University Linguistics Department Occasional Paper (1992).
- (1992) Edinburgh University Linguistics Department Occasional Paper
- Ladd, D.R.¹

269
- 0004190969
- Cambridge: Cambridge University Press
- Ladd, D. R. Intonational Phonology. Cambridge: Cambridge University Press (1996).
- (1996) Intonational Phonology
- Ladd, D.R.¹

270
- 84927457556
- Vowel intrinsic pitch in connected speech
- Ladd, D. R., and Silverman, K. E. A. Vowel intrinsic pitch in connected speech. Phonetica 41 (1984), 31-40.
- (1984) Phonetica , vol.41 , pp. 31-40
- Ladd, D.R.¹ Silverman, K.²

271
- 0004145667
- London: Thompson
- Ladefoged, P. A Course in Phonetics. London: Thompson (2003).
- (2003) A Course in Phonetics
- Ladefoged, P.¹

272
- 84925137519
- Oxford: Blackwell Publishing
- Ladefoged, P. An Introduction to Phonetic Fieldwork and Instrumental Techniques. Oxford: Blackwell Publishing (2003).
- (2003) An Introduction to Phonetic Fieldwork and Instrumental Techniques
- Ladefoged, P.¹

273
- 84976230467
- A database design for a tts synthesis system using lexical diphones
- Lambert, T., and Breen, A. A database design for a TTS synthesis system using lexical diphones. In Proceedings of the International Conference on Speech and Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Speech and Language Processing 2004
- Lambert, T.¹ Breen, A.²

274
- 0009069640
- Cambridge: Cambridge University Press
- Lass, R. An Introduction to Basic Concepts. Cambridge: Cambridge University Press (1984).
- (1984) An Introduction to Basic Concepts
- Lass, R.¹

275
- 80054370614
- Cambridge: Cambridge University Press
- Laver, J. Principles of Phonetics. Cambridge: Cambridge University Press (1995).
- (1995) Principles of Phonetics
- Laver, J.¹

276
- 85032401207
- Speech segmentation criteria for the atr/cstr database. Technical report, centre for speech technology research
- Laver, J., Alexander, M., Bennet, C. et al. Speech segmentation criteria for the ATR/CSTR database. Technical report, Centre for Speech Technology Research, University of Edinburgh (1988).
- (1988) University of Edinburgh
- Laver, J.¹ Alexander, M.² Bennet, C.³

277
- 85009151673
- Using cross-syllable units for cantonese speech synthesis
- Law, K. M., and Lee, T. Using cross-syllable units for Cantonese speech synthesis. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Law, K.M.¹ Lee, T.²

278
- 33646639815
- The synthesis of speech from signals which have a low information rate
- W. Jackson, Ed. London: Butterworth & Co., Ltd
- Lawrence, W. The synthesis of speech from signals which have a low information rate. In Communication Theory, W. Jackson, Ed. London: Butterworth & Co., Ltd (1953), pp. 460-469.
- (1953) Communication Theory , pp. 460-469
- Lawrence, W.¹

279
- 0141703272
- Context-adaptive phone boundary refining for a tts database
- Lee, K.-S., and Kim, J. S. Context-adaptive phone boundary refining for a TTS database. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 2003 (2003).
- (2003) Proceedings of the International Conference on Acoustics Speech and Signal Processing 2003
- Lee, K.-S.¹ Kim, J.S.²

280
- 0005282123
- A computational algorithm for f0 contour generation in korean developed with prosodically labeled databases using k-tobi system
- Lee, Y. J., Lee, S., Kim, J. J., and Ko, H. J. A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system. In Proceedings of the International Conference on Spoken Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Spoken Language Processing 1998
- Lee, Y.J.¹ Lee, S.² Kim, J.J.³ Ko, H.J.⁴

281
- 85032421246
- A new quantization technique for lsp parameters and its application to low bit rate multi-band excited vocoders
- Leich, H., Deketelaere, S., Dbman, I., Dothey, M., and Wery, B. A new quantization technique for LSP parameters and its application to low bit rate multi-band excited vocoders. In EUSIPCO (1992).
- (1992) EUSIPCO
- Leich, H.¹ Deketelaere, S.² Dbman, I.³ Dothey, M.⁴

282
- 0022685753
- Continuously variable duration hidden markov models for automatic speech recognition
- Levinson, S. Continuously variable duration hidden Markov models for automatic speech recognition. Computer Speech and Language 1 (1986), 29-45.
- (1986) Computer Speech and Language , vol.1 , pp. 29-45
- Levinson, S.¹

283
- 85009151475
- A graphical model approach to pitch tracking
- Li, X., Malkin, J., and Bilmes, J. A graphical model approach to pitch tracking. In Proceedings of the International Conference on Spoken Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Spoken Language Processing 2004
- Li, X.¹ Malkin, J.² Bilmes, J.³

284
- 0003903906
- MIT, Published by Indiana University Linguistics Club
- Liberman, M. The Intonational System of English. PhD thesis, MIT (1975). Published by Indiana University Linguistics Club.
- (1975) The Intonational System of English. PhD thesis
- Liberman, M.¹

285
- 0001469109
- Intonational invariance under changes in pitch range and length
- M. Aronoff and R. T. Oehrle, Eds. Cambridge, MA: MIT Press
- Liberman, M., and Pierrehumbert, J. Intonational invariance under changes in pitch range and length. In Language Sound Structure, M. Aronoff and R. T. Oehrle, Eds. Cambridge, MA: MIT Press (1984), pp. 157-233.
- (1984) Language Sound Structure , pp. 157-233
- Liberman, M.¹ Pierrehumbert, J.²

286
- 0000106333
- On stress and linguistic rhythm
- Liberman, M. Y., and Prince, A. On stress and linguistic rhythm. Linguistic Inquiry 8 (1977), 249-336.
- (1977) Linguistic Inquiry , vol.8 , pp. 249-336
- Liberman, M.Y.¹ Prince, A.²

287
- 0004251776
- Cambridge MA: MIT Press
- Lieberman, P. Intonation, Perception and Language. Cambridge MA: MIT Press (1967).
- (1967) Intonation, Perception and Language
- Lieberman, P.¹

288
- 84892159255
- PhD thesis, Royal Institution of Technology, Stockholm
- Liljencrants, J. Reflection-type Line Analog Synthesis. PhD thesis, Royal Institution of Technology, Stockholm (1985).
- (1985) Reflection-type Line Analog Synthesis
- Liljencrants, J.¹

289
- 84995620255
- Knowledge of language origin improves pronunciation accuracy of proper names
- Llitjos, A., and Black, A. Knowledge of language origin improves pronunciation accuracy of proper names. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Llitjos, A.¹ Black, A.²

290
- 0003664659
- Cambridge: Cambridge University Press
- Lyons, J. Introduction to Theoretical Linguistics. Cambridge: Cambridge University Press (1968).
- (1968) Introduction to Theoretical Linguistics
- Lyons, J.¹

291
- 84966366503
- Rapid unit selection from a large speech corpus for concatenative speech synthesis
- Beutnagel, M. M., and Riley, M. Rapid unit selection from a large speech corpus for concatenative speech synthesis. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Beutnagel, M.M.¹ Riley, M.²

292
- 33745216020
- A text-to-speech platform for variable length optimal unit searching using perceptual cost functions
- Lee, D. P. L., and Olive, J. P. A text-to-speech platform for variable length optimal unit searching using perceptual cost functions. In Proceedings of the Fourth ISCA Workshop on Speech Synthesis (2001).
- (2001) Proceedings of the Fourth ISCA Workshop on Speech Synthesis
- Lee, D.¹ Olive, J.P.²

293
- 0005500345
- PhD thesis, Georgia Tech
- Macon, M. W. Speech Synthesis Based on Sinusoidal Modeling. PhD thesis, Georgia Tech (1996).
- (1996) Speech Synthesis Based on Sinusoidal Modeling
- Macon, M.W.¹

294
- 0015699037
- Spectral analysis of speech by linear prediction
- Makhoul, J. Spectral analysis of speech by linear prediction. IEEE Transactions on Audio and Electroacoustics 3 (1973), 140-148.
- (1973) IEEE Transactions on Audio and Electroacoustics , vol.3 , pp. 140-148
- Makhoul, J.¹

295
- 0016495091
- Linear prediction: A tutorial review
- Makhoul, J. Linear prediction: A tutorial review. Proceedings of the IEEE 63, 4 (1975), 561-580.
- (1975) Proceedings of the IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

296
- 85089846433
- Automatic prosody generation using suprasegmental unit selection
- Malfrere, F., Dutoit, T., and Mertens, P. Automatic prosody generation using suprasegmental unit selection. In International Conference on Speech and Language Processing (1998).
- (1998) International Conference on Speech and Language Processing
- Malfrere, F.¹ Dutoit, T.² Mertens, P.³

297
- 33646790880
- A graphical model for formant tracking
- Malkin, J., Li, X., and Bilmes, J. A graphical model for formant tracking. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2005 (2005).
- (2005) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 2005
- Malkin, J.¹ Li, X.² Bilmes, J.³

298
- 85096855936
- One-class SVMs for document classification
- Manevitz, L. M., and Yousef, M. One-class SVMs for document classification. Journal of Machine Learning Research 2 (2001), 139-154.
- (2001) Journal of Machine Learning Research , vol.2 , pp. 139-154
- Manevitz, L.M.¹ Yousef, M.²

299
- 0003612818
- Cambridge, MA: MIT Press
- Manning, C. D., and Schutze, H. Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press (1999).
- (1999) Foundations of Statistical Natural Language Processing
- Manning, C.D.¹ Schutze, H.²

300
- 0039255896
- A multistrategy approach to improving pronunciation by analogy
- Marchand, Y., and Damper, R. A multistrategy approach to improving pronunciation by analogy. Computational Linguistics 26, 2 (2000), 195-219.
- (2000) Computational Linguistics , vol.26 , Issue.2 , pp. 195-219
- Marchand, Y.¹ Damper, R.²

301
- 0015488387
- The SIFT algorithm for fundamental frequency estimation
- Markel, J. D. The SIFT algorithm for fundamental frequency estimation. IEEE Transactions on Audio and Electroacoustics 20 (1972), 367-377.
- (1972) IEEE Transactions on Audio and Electroacoustics , vol.20 , pp. 367-377
- Markel, J.D.¹

302
- 0003874959
- Berlin: Springer-Verlag
- Markel, J. D., and Gray, A. H. Linear Prediction of Speech. Berlin: Springer-Verlag (1976).
- (1976) Linear Prediction of Speech
- Markel, J.D.¹ Gray, A.H.²

303
- 85009275117
- Combining information sources for memory-based pitch accent placement
- Marsi, E., Busser, B., Daelemans, W. et al. Combining information sources for memory-based pitch accent placement. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Marsi, E.¹ Busser, B.² Daelemans, W.³

304
- 85128388568
- Categorical perception: Important phenomena or lasting myth
- Massaro, D. W. Categorical perception: Important phenomena or lasting myth. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998), pp. 2275-2278.
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998 , pp. 2275-2278
- Massaro, D.W.¹

305
- 0029725605
- Speech synthesis using HMMs with dynamic features
- Masuko, T., Tokuda, K., Kobayashi, T., and Imai, S. Speech synthesis using HMMs with dynamic features. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 1996 (1996).
- (1996) Proceedings of the International Conference on Acoustics Speech and Signal Processing 1996
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

306
- 84940803687
- Feature geometry and dependency: A review
- McCarthy, J. Feature geometry and dependency: A review. Phonetica 43 (1988), 84-108.
- (1988) Phonetica , vol.43 , pp. 84-108
- McCarthy, J.¹

307
- 84925218474
- Technical report, Bell System Technical Journal
- McIlroy, D. M. Synthetic English speech by rule. Technical report, Bell System Technical Journal (1973).
- (1973) Synthetic English Speech by Rule
- McIlroy, D.M.¹

308
- 85032414384
- User attitudes to concatenated natural speech and text-to-speech synthesis in an automated information service
- McInnes, F. R., Attwater, D. J., Edgington, M. D., Schmidt, M. S., and Jack, M. A. User attitudes to concatenated natural speech and text-to-speech synthesis in an automated information service. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- McInnes, F.R.¹ Attwater, D.J.² Edgington, M.D.³ Schmidt, M.S.⁴ Jack, M.A.⁵

309
- 0025807353
- Super resolution pitch determination of speech signals
- Medan, Y., Yair, E., and Chazan, D. Super resolution pitch determination of speech signals. IEEE Transactions on Signal Processing 39 (1991), 40-48.
- (1991) IEEE Transactions on Signal Processing , vol.39 , pp. 40-48
- Medan, Y.¹ Yair, E.² Chazan, D.³

310
- 85039172650
- Prosodic unit selection using an imitation speech database
- Meron, J. Prosodic unit selection using an imitation speech database. In Proceedings of the Fourth ISCA ITRW on Speech Synthesis (2001).
- (2001) Proceedings of the Fourth ISCA ITRW on Speech Synthesis
- Meron, J.¹

311
- 3042694687
- Applying fallback to prosodic unit selection from a small imitation database
- Meron, J. Applying fallback to prosodic unit selection from a small imitation database. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Meron, J.¹

312
- 0000241903
- Toward a construction-based model of language function: The case of nominal extraposition
- Michaelis, L. A., and Lambrecht, K. Toward a construction-based model of language function: The case of nominal extraposition. Language 72 (1996), 215-247.
- (1996) Language , vol.72 , pp. 215-247
- Michaelis, L.A.¹ Lambrecht, K.²

313
- 85032407297
- Bigraphs as a model for mobile interaction
- Milner, R. Bigraphs as a model for mobile interaction. In Proceedings of the International Conference on Graph Transformation 2002 (2002).
- (2002) Proceedings of the International Conference on Graph Transformation 2002
- Milner, R.¹

314
- 0033692964
- A novel approach to the fully automatic extraction of Fujisaki-model parameters
- Mixdorff, H. A novel approach to the fully automatic extraction of Fujisaki-model parameters. In International Conference on Acoustics, Speech, and Signal Processing 2000 (2000).
- (2000) International Conference on Acoustics, Speech, and Signal Processing 2000
- Mixdorff, H.¹

315
- 85119338514
- Parametric modeling of intonation using vector quantization
- Mohler, G., and Conkie, A. Parametric modeling of intonation using vector quantization. In Proceedings of the Third ESCA/IEEE Workshop on Speech Synthesis (1998), pp. 311-314.
- (1998) Proceedings of the Third ESCA/IEEE Workshop on Speech Synthesis , pp. 311-314
- Mohler, G.¹ Conkie, A.²

316
- 0020816083
- Suggested formulae for calculating auditory-filter bandwidths and excitation patterns
- Moore, B. C. J., and Glasberg, B. R. Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. Journal of the Acoustical Society of America 74 (1983), 750-753.
- (1983) Journal of the Acoustical Society of America , vol.74 , pp. 750-753
- Moore, B.C.J.¹ Glasberg, B.R.²

317
- 85032411560
- Twenty things we still don’t know about speech
- Moore, R. K. Twenty things we still don’t know about speech. In Proceedings of Eurospeech 1999 (1995).
- (1995) Proceedings of Eurospeech 1999
- Moore, R.K.¹

318
- 85032413749
- Research challenges in the automation of spoken language interaction
- Moore, R. K. Research challenges in the automation of spoken language interaction. In ISCA Tutorial and Research Workshop on Applied Spoken Language Interaction in Distributed Environments (2005).
- (2005) ISCA Tutorial and Research Workshop on Applied Spoken Language Interaction in Distributed Environments
- Moore, R.K.¹

319
- 85032425439
- Exploratory analysis of linguistic data based on genetic algorithm for robust modeling of the segmental duration of speech
- Morais, E., and Violaro, F. Exploratory analysis of linguistic data based on genetic algorithm for robust modeling of the segmental duration of speech. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Morais, E.¹ Violaro, F.²

320
- 85009156064
- A data-driven approach to source-formant type text-to-speech system
- Mori, H., Ohtsuka, T., and Kasuya, H. A data-driven approach to source-formant type text-to-speech system. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Mori, H.¹ Ohtsuka, T.² Kasuya, H.³

321
- 0035283592
- Generating prosodic attitudes in French: Data, model and evaluation
- Morlec, Y., Bailly, G., and Aubergé, V. Generating prosodic attitudes in French: data, model and evaluation. Speech Communication 33, 4 (2001), 357-371.
- (2001) Speech Communication , vol.33 , Issue.4 , pp. 357-371
- Morlec, Y.¹ Bailly, G.² Aubergé, V.³

322
- 0009151070
- Time-domain and frequency-domain techniques for prosodic modification of speech
- W.B. Kleijn and K. K. Paliwal, Eds. Amsterdam: Elsevier Science B.V
- Moulines, E., and Verhelst, W. Time-domain and frequency-domain techniques for prosodic modification of speech. In Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam: Elsevier Science B.V. (1995), pp. 519-555.
- (1995) Speech Coding and Synthesis , pp. 519-555
- Moulines, E.¹ Verhelst, W.²

323
- 85009110171
- Accent label prediction by time delay neural networks using gating clusters
- Muller, A. F., and Hoffmann, R. Accent label prediction by time delay neural networks using gating clusters. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Muller, A.F.¹ Hoffmann, R.²

324
- 0027447292
- Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
- Murray, I. R., and Arnott, J. L. Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America 93, 2 (1993), 1097-1108.
- (1993) Journal of the Acoustical Society of America , vol.93 , Issue.2 , pp. 1097-1108
- Murray, I.R.¹ Arnott, J.L.²

325
- 78649285978
- A stochastic approach to phoneme and accent estimation
- Nagano, T., Shinsuke, M., and Nishimura, M. A stochastic approach to phoneme and accent estimation. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Nagano, T.¹ Shinsuke, M.² Nishimura, M.³

326
- 0038243732
- English speech synthesis based on multi-layered context oriented clustering
- Nakajima, S. English speech synthesis based on multi-layered context oriented clustering. In Proceedings of the Third European Conference on Speech Communication and Technology, Eurospeech 1993 (1993).
- (1993) Proceedings of the Third European Conference on Speech Communication and Technology, Eurospeech 1993
- Nakajima, S.¹

327
- 0023749306
- Automatic generation of synthesis units based on context oriented clustering
- Nakajima, S., and Hamada, H. Automatic generation of synthesis units based on context oriented clustering. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1988 (1988).
- (1988) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1988
- Nakajima, S.¹ Hamada, H.²

328
- 84933479446
- PhD thesis, Harvard University
- Nakatani, C. H. The Computational Processing of Intonational Prominence: A Functional Prosody Perspective. PhD thesis, Harvard University (1997).
- (1997) The Computational Processing of Intonational Prominence: A Functional Prosody Perspective
- Nakatani, C.H.¹

329
- 85032418470
- Prominence variation beyond given/new
- Nakatani, C. H. Prominence variation beyond given/new. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Nakatani, C.H.¹

330
- 85009115356
- Coupling dialogue and prosody computation in spoken dialogue generation
- Nakatani, C. H. Coupling dialogue and prosody computation in spoken dialogue generation. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Nakatani, C.H.¹

331
- 0029097108
- An articulatory study of fricative consonants using MRI
- Narayanan, S., Alwan, A., and Haker, K. An articulatory study of fricative consonants using MRI. Journal of the Acoustical Society of America 98, 3 (1995), 1325-1347.
- (1995) Journal of the Acoustical Society of America , vol.98 , Issue.3 , pp. 1325-1347
- Narayanan, S.¹ Alwan, A.² Haker, K.³

332
- 0031042761
- Towards articulatory-acoustic models for liquid consonants based on MRI and EPG data. Part I: The laterals
- Narayanan, S., Alwan, A., and Haker, K. Towards articulatory-acoustic models for liquid consonants based on MRI and EPG data. Part I: The laterals. Journal of the Acoustical Society of America 101, 2 (1997), 1064-1077.
- (1997) Journal of the Acoustical Society of America , vol.101 , Issue.2 , pp. 1064-1077
- Narayanan, S.¹ Alwan, A.² Haker, K.³

333
- 85032420350
- O’Connor, J. D., and Arnold, G. F. Intonation of Colloquial English. Longman, 1973.
- (1973) Intonation of Colloquial English
- O’Connor, J.D.¹ Arnold, G.F.²

334
- 0034224125
- Prosynth: An integrated prosodic approach to device-independent, natural-sounding speech synthesis
- Ogden, R., Hawkins, S., House, J. et al. Prosynth: An integrated prosodic approach to device-independent, natural-sounding speech synthesis. Computer Speech and Language 14 (2000), 177-210.
- (2000) Computer Speech and Language , vol.14 , pp. 177-210
- Ogden, R.¹ Hawkins, S.² House, J.³

335
- 0008904919
- Word and sentence intonation: A quantitative model
- Ohman, S. Word and sentence intonation: A quantitative model. STL-Quarterly Progress and Status Report 8, 2-3 (1967), 20-54.
- (1967) STL-Quarterly Progress and Status Report , vol.8 , Issue.2-3 , pp. 20-54
- Ohman, S.¹

336
- 85009063812
- An improved speech analysis-synthesis algorithm based on the autoregressive with exogenous input speech production model
- Ohtsuka, T., and Kasuya, H. An improved speech analysis-synthesis algorithm based on the autoregressive with exogenous input speech production model. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Ohtsuka, T.¹ Kasuya, H.²

337
- 85009065136
- Aperiodicity control in ARX-based speech analysis-synthesis method
- Ohtsuka, T., and Kasuya, H. Aperiodicity control in ARX-based speech analysis-synthesis method. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Ohtsuka, T.¹ Kasuya, H.²

338
- 0017632039
- Rule synthesis of speech from diadic units
- Olive, J. P. Rule synthesis of speech from diadic units. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1977 (1977).
- (1977) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1977
- Olive, J.P.¹

339
- 0016940635
- Speech resynthesis from phoneme-related parameters
- Olive, J. P., and Spickenagle, N. Speech resynthesis from phoneme-related parameters. Journal of the Acoustical Society of America 59 (1976), 993-996.
- (1976) Journal of the Acoustical Society of America , vol.59 , pp. 993-996
- Olive, J.P.¹ Spickenagle, N.²

340
- 85032422534
- Modelling pitch accent types for Polish speech synthesis
- Oliver, D., and Clark, R. Modelling pitch accent types for Polish speech synthesis. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Oliver, D.¹ Clark, R.²

341
- 0014568991
- IEEE recommended practices for speech quality measurements
- IEEE Subcommittee on Subjective Measurements. IEEE recommended practices for speech quality measurements. IEEE Transactions on Audio and Electroacoustics 17 (1969), 227-246.
- (1969) IEEE Transactions on Audio and Electroacoustics , vol.17 , pp. 227-246

342
- 0003793552
- Englewood Cliffs, NJ: Prentice-Hall
- Oppenheim, A. V., and Schafer, R. W. Digital Signal Processing. Englewood Cliffs, NJ: Prentice-Hall (1975).
- (1975) Digital Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.²

343
- 85044582416
- Letter to sound rules for accented lexicon compression
- Pagel, V., Lenzo, K., and Black, A. Letter to sound rules for accented lexicon compression. In Proceedings of the ICSLP 1998 (1998).
- (1998) Proceedings of the ICSLP 1998
- Pagel, V.¹ Lenzo, K.² Black, A.³

344
- 0041166435
- Cambridge: Cambridge University Press
- Palmer, H. English Intonation with Systematic Exercises. Cambridge: Cambridge University Press (1922).
- (1922) English Intonation with Systematic Exercises
- Palmer, H.¹

345
- 77956899791
- Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis
- Pantazis, Y., Stylianou, Y., and Klabbers, E. Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis. In Proceedings of Eurospeech, Interspeech 2005 (2005).
- (2005) Proceedings of Eurospeech, Interspeech 2005
- Pantazis, Y.¹ Stylianou, Y.² Klabbers, E.³

346
- 85018094829
- Computer generated animation of faces
- Parke, F. I. Computer generated animation of faces. In ACM National Conference (1972).
- (1972) ACM National Conference
- Parke, F.I.¹

347
- 0020202671
- A parametrized model for facial animations
- Parke, F. I. A parametrized model for facial animations. IEEE Transactions on Computer Graphics and Animations 2, 9 (1982), 61-70.
- (1982) IEEE Transactions on Computer Graphics and Animations , vol.2 , Issue.9 , pp. 61-70
- Parke, F.I.¹

348
- 85009168738
- DTW-based phonetic alignment using multiple acoustic features
- Paulo, S., and Oliveira, L. C. DTW-based phonetic alignment using multiple acoustic features. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Paulo, S.¹ Oliveira, L.C.²

349
- 56149094130
- A synthesis method based on concatenation of demisyllables and a residual excited vocal tract model
- Pearson, S., Kibre, N., and Niedzielski, N. A synthesis method based on concatenation of demisyllables and a residual excited vocal tract model. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Pearson, S.¹ Kibre, N.² Niedzielski, N.³

350
- 0003614867
- Indiana: Indiana University Press
- Peirce, C. S. The Essential Peirce, Selected Philosophical Writings, Indiana: Indiana University Press (1998), vol. 2.
- (1998) The Essential Peirce, Selected Philosophical Writings , vol.2
- Peirce, C.S.¹

351
- 0003754220
- MIT, Published by Indiana University Linguistics Club
- Pierrehumbert, J. B. The Phonology and Phonetics of English Intonation. PhD thesis, MIT (1980). Published by Indiana University Linguistics Club.
- (1980) The Phonology and Phonetics of English Intonation. PhD Thesis
- Pierrehumbert, J.B.¹

352
- 85032395705
- Michigan: University of Michigan
- Pike, K. L. The Intonation of American English. Michigan: University of Michigan (1945).
- (1945) The Intonation of American English
- Pike, K.L.¹

353
- 0019221277
- Perceptual evaluation of MITalk: The MIT unrestricted text-to-speech system
- Pisoni, D., and Hunnicutt, S. Perceptual evaluation of MITalk: The MIT unrestricted text-to-speech system. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1980 (1980).
- (1980) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1980
- Pisoni, D.¹ Hunnicutt, S.²

354
- 0019603077
- Animating facial expression
- Platt, S. M., and Badler, N. I. Animating facial expression. Computer Graphics 15, 3 (1981), 245-252.
- (1981) Computer Graphics , vol.15 , Issue.3 , pp. 245-252
- Platt, S.M.¹ Badler, N.I.²

355
- 0347463071
- Which is more important in a concatenative speech synthesis system-pitch duration or spectral discontinuity?
- Plumpe, M., and Meredith, S. Which is more important in a concatenative speech synthesis system-pitch duration or spectral discontinuity? In Third ESCA/IEEE Workshop on Speech Synthesis (1998).
- (1998) Third ESCA/IEEE Workshop on Speech Synthesis
- Plumpe, M.¹ Meredith, S.²

356
- 0032595183
- Modeling of the glottal flow derivative waveform with application to speaker identification
- Plumpe, M. D., Quatieri, T. F., and Reynolds, D. A. Modeling of the glottal flow derivative waveform with application to speaker identification. IEEE Transactions on Speech and Audio Processing 1, 5 (1999), 569-586.
- (1999) IEEE Transactions on Speech and Audio Processing , vol.1 , Issue.5 , pp. 569-586
- Plumpe, M.D.¹ Quatieri, T.F.² Reynolds, D.A.³

357
- 85032425634
- Statistical corpus-based speech segmentation
- Pollet, V., and Coorman, G. Statistical corpus-based speech segmentation. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Pollet, V.¹ Coorman, G.²

358
- 84925037988
- Voice quality of synthetic speech: Representation and evaluation
- Pols, L. Voice quality of synthetic speech: Representation and evaluation. In Proceedings of the International Conference on Speech and Language Processing 1994 (1994).
- (1994) Proceedings of the International Conference on Speech and Language Processing 1994
- Pols, L.¹

359
- 0030362834
- Generation of multiple synthesis inventories by a bootstrapping procedure
- Portele, T., Stober, K. H., Meyer, H., and Hess, W. Generation of multiple synthesis inventories by a bootstrapping procedure. In Proceedings of the International Conference on Speech and Language Processing 1996 (1996).
- (1996) Proceedings of the International Conference on Speech and Language Processing 1996
- Portele, T.¹ Stober, K.H.² Meyer, H.³ Hess, W.⁴

360
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- Povey, D., and Woodland, P. Minimum phone error and I-smoothing for improved discriminative training. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2002
- Povey, D.¹ Woodland, P.²

361
- 84941609666
- The organisation of purposeful dialogues
- Power, R. J. D. The organisation of purposeful dialogues. Linguistics 17 (1979), 107-152.
- (1979) Linguistics , vol.17 , pp. 107-152
- Power, R.J.D.¹

362
- 0003487673
- Oxford: Blackwell
- Prince, A., and Smolensky, P. Optimality Theory: Constraint Interaction in Generative Grammar. Oxford: Blackwell (2004).
- (2004) Optimality Theory: Constraint Interaction in Generative Grammar
- Prince, A.¹ Smolensky, P.²

363
- 0026850770
- Automatic classification of intonational phrase boundaries
- Wang, M. Q., and Hirschberg, J. Automatic classification of intonational phrase boundaries. Computer Speech and Language 6 (1992), 175-196.
- (1992) Computer Speech and Language , vol.6 , pp. 175-196
- Wang, M.Q.¹ Hirschberg, J.²

364
- 0003927842
- Englewood Cliffs, NJ: Prentice-Hall
- Quatieri, T. F. Speech Signal Processing. Englewood Cliffs, NJ: Prentice-Hall (2002).
- (2002) Speech Signal Processing
- Quatieri, T.F.¹

365
- 0026771806
- The derivation of prosody for text-to-speech from prosodic sentence structure
- Quene, H. The derivation of prosody for text-to-speech from prosodic sentence structure. Computer Speech & Language 6, 1 (1992), 77-98.
- (1992) Computer Speech & Language , vol.6 , Issue.1 , pp. 77-98
- Quene, H.¹

366
- 85032413247
- Multisyn voices from ARCTIC data for Blizzard Challenge
- Clark, K. R., and King, S. Multisyn voices from ARCTIC data for Blizzard Challenge. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Clark, K.R.¹ King, S.²

367
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- Rabiner, L., and Juang, B.-H. Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall (1993).
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

368
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- Rabiner, L. R., and Schafer, R. W. Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall (1978).
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

369
- 85032417079
- Stochastic and syntactic techniques for predicting phrase breaks
- Read, I., and Cox, S. Stochastic and syntactic techniques for predicting phrase breaks. In Proceedings of Eurospeech 2005 (2005).
- (2005) Proceedings of Eurospeech 2005
- Read, I.¹ Cox, S.²

370
- 85032423350
- Improving data driven part-of-speech tagging by morphologic knowledge induction
- Reichel, U. Improving data driven part-of-speech tagging by morphologic knowledge induction. In Proceedings of Advances in Speech Technology (2005).
- (2005) Proceedings of Advances in Speech Technology
- Reichel, U.¹

371
- 84870292720
- Mother: A new generation of talking heads providing flexible articulatory control for video-realistic speech animation
- Reveret, L., Bailly, G., and Badin, P. Mother: A new generation of talking heads providing flexible articulatory control for video-realistic speech animation. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Reveret, L.¹ Bailly, G.² Badin, P.³

372
- 0002069313
- Tree-based modelling of segmental duration
- G. Bailly, C. Benoit, and T. R. Sawallis, Eds. Amsterdam: Elsevier Science Publishers
- Riley, M. Tree-based modelling of segmental duration. In Talking Machines: Theories, Models and Designs, G. Bailly, C. Benoit, and T. R. Sawallis, Eds. Amsterdam: Elsevier Science Publishers (1992), pp. 265-273.
- (1992) Talking Machines: Theories, Models and Designs , pp. 265-273
- Riley, M.¹

373
- 1442267080
- Learning decision lists
- Rivest, R. L. Learning decision lists. Machine Learning 2 (1987), 229-246.
- (1987) Machine Learning , vol.2 , pp. 229-246
- Rivest, R.L.¹

374
- 0028532054
- Conversion between prosodic transcription systems: “Standard British” and ToBI
- Roach, P. Conversion between prosodic transcription systems: “Standard British” and ToBI. Speech Communication 15, 1-2 (1994), 91-99.
- (1994) Speech Communication , vol.15 , Issue.1-2 , pp. 91-99
- Roach, P.¹

375
- 0000329355
- A recurrent error propagation network speech recognition system
- Robinson, T., and Fallside, F. A recurrent error propagation network speech recognition system. Computer Speech and Language 5, 3 (1991).
- (1991) Computer Speech and Language , vol.5 , Issue.3
- Robinson, T.¹ Fallside, F.²

376
- 0015008817
- Effect of glottal pulse shape on the quality of natural vowels
- Rosenberg, A. E. Effect of glottal pulse shape on the quality of natural vowels. Journal of the Acoustical Society of America 49 (1970), 583-590.
- (1970) Journal of the Acoustical Society of America , vol.49 , pp. 583-590
- Rosenberg, A.E.¹

377
- 84925193833
- Johns Hopkins University, Baltimore, MD
- Rosenberg, C., and Sejnowski, T. NETtalk: A parallel network that learns to read aloud. EE & CS Technical Report no JHU-EECS-86/01. Johns Hopkins University, Baltimore, MD (1986).
- (1986) NETtalk: A Parallel Network That Learns to Read Aloud. EE & CS Technical Report No JHU-EECS-86/01
- Rosenberg, C.¹ Sejnowski, T.²

378
- 0030181584
- Prediction of abstract prosodic labels for speech synthesis
- Ross, K., and Ostendorf, M. Prediction of abstract prosodic labels for speech synthesis. Computer Speech and Language 10, 3 (1996), 155-185.
- (1996) Computer Speech and Language , vol.10 , Issue.3 , pp. 155-185
- Ross, K.¹ Ostendorf, M.²

379
- 4544238841
- A method for automatic extraction of Fujisaki-model parameters
- Rossi, P., Palmieri, F., and Cutugno, F. A method for automatic extraction of Fujisaki-model parameters. In Proceedings of Speech Prosody 2002 (2002), pp. 615-618.
- (2002) Proceedings of Speech Prosody 2002 , pp. 615-618
- Rossi, P.¹ Palmieri, F.² Cutugno, F.³

380
- 85032414478
- Reliability of EMA data acquired simultaneously with EPG
- Rouco, A. and Recasens, D. Reliability of EMA data acquired simultaneously with EPG. In Proceedings of the ACCOR Workshop on Articulatory Databases (1995).
- (1995) Proceedings of the ACCOR Workshop on Articulatory Databases
- Rouco, A.¹ Recasens, D.²

381
- 85032405806
- Unit selection for speech synthesis based on a new acoustic target cost
- Rouibia, S., and Rosec, O. Unit selection for speech synthesis based on a new acoustic target cost. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Rouibia, S.¹ Rosec, O.²

382
- 0019606728
- An articulatory synthesizer for perceptual research
- Rubin, P., Baer, T., and Mermelstein, P. An articulatory synthesizer for perceptual research. Journal of the Acoustical Society of America 70 (1981), 321-328.
- (1981) Journal of the Acoustical Society of America , vol.70 , pp. 321-328
- Rubin, P.¹ Baer, T.² Mermelstein, P.³

383
- 85009273523
- A statistically motivated database pruning technique for unit selection synthesis
- Rutten, P., Aylett, M., Fackrell, J., and Taylor, P. A statistically motivated database pruning technique for unit selection synthesis. In Proceedings of the ICSLP (2002), pp. 125-128.
- (2002) Proceedings of the ICSLP , pp. 125-128
- Rutten, P.¹ Aylett, M.² Fackrell, J.³ Taylor, P.⁴

384
- 85128350155
- Techniques for accurate automatic annotation of speech waveforms
- Cox, R. B., and Jackson, P. Techniques for accurate automatic annotation of speech waveforms. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Cox, R.B.¹ Jackson, P.²

385
- 85009231318
- Discriminative weight training for unit-selection based speech synthesis
- Park, C. K. K., and Kim, N. S. Discriminative weight training for unit-selection based speech synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Park, C.K.K.¹ Kim, N.S.²

386
- 0004128726
- PhD thesis, MIT
- Sagey, E. The Representation of Features and Relations in Non-linear Phonology. PhD thesis, MIT (1986).
- (1986) The Representation of Features and Relations in Non-Linear Phonology
- Sagey, E.¹

387
- 0023756465
- Speech synthesis by rule using an optimal selection of non-uniform synthesis units
- Sagisaka, Y. Speech synthesis by rule using an optimal selection of non-uniform synthesis units. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1988 (1988).
- (1988) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1988
- Sagisaka, Y.¹

388
- 0029747042
- High-quality speech synthesis using context-dependent syllabic units
- Saito, T., Hashimoto, Y., and Sakamoto, M. High-quality speech synthesis using context-dependent syllabic units. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1996 (1996).
- (1996) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1996
- Saito, T.¹ Hashimoto, Y.² Sakamoto, M.³

389
- 85009143747
- Stress assignment in Spanish proper names
- San-Segundo, R., Montero, J. M., Cordoba, R., and Gutierrez-Arriola, J. Stress assignment in Spanish proper names. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- San-Segundo, R.¹ Montero, J.M.² Cordoba, R.³ Gutierrez-Arriola, J.⁴

390
- 85009084534
- Two features to check phonetic transcriptions in text to speech systems
- Sandri, S., and Zovato, E. Two features to check phonetic transcriptions in text to speech systems. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Sandri, S.¹ Zovato, E.²

391
- 84924515323
- Cours de linguistique generale
- Saussure, F. de. Cours de linguistique generale. In Cours de linguistique generale, C. Bally and A. Sechehaye, Eds. Dordrecht: Kluwer Academic Publishers (1916).
- Cours de linguistique generale, C. Bally and A. Sechehaye, Eds. Dordrecht: Kluwer Academic Publishers (1916)
- Saussure, F.D.¹

392
- 33750219846
- Amsterdam: Elsevier
- Saussure, F. de. Saussure's Second Course of Lectures on General Linguistics (1908-09). Amsterdam: Elsevier (1997).
- (1997) Saussure's Second Course of Lectures on General Linguistics (1908-09)
- Saussure, F.D.¹

393
- 0002697853
- Three dimensions of emotion
- Schlossberg, H. Three dimensions of emotion. Psychological Review 61, 2 (1954), 81-88.
- (1954) Psychological Review , vol.61 , Issue.2 , pp. 81-88
- Schlossberg, H.¹

394
- 85032409834
- Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- Schweitzer, A. Speech parameter generation algorithm considering global variance for HMM-based speech synthesis. In Proceedings of Eurospeech, Interspeech 2005 (2005).
- (2005) Proceedings of Eurospeech, Interspeech 2005
- Schweitzer, A.¹

395
- 0004097793
- Cambridge: Cambridge University Press
- Searle, J. R. Speech Acts. Cambridge: Cambridge University Press (1969).
- (1969) Speech Acts
- Searle, J.R.¹

396
- 0000383868
- Parallel networks that learn to pronounce English text
- Sejnowski, T., and Rosenberg, C. Parallel networks that learn to pronounce English text. Complex Systems 1, 1 (1987), 145-168.
- (1987) Complex Systems , vol.1 , Issue.1 , pp. 145-168
- Sejnowski, T.¹ Rosenberg, C.²

397
- 0004081889
- Cambridge, MA: MIT Press
- Sejnowski, T., and Rosenberg, C. NETtalk: A Parallel Network that Learns to Read Aloud. Cambridge, MA: MIT Press (1988).
- (1988) NETtalk: A Parallel Network that Learns to Read Aloud
- Sejnowski, T.¹ Rosenberg, C.²

398
- 0003899959
- Cambridge, MA: MIT Press
- Selkirk, E. O. Phonology and Syntax. Cambridge, MA: MIT Press (1984).
- (1984) Phonology and Syntax
- Selkirk, E.O.¹

399
- 85009233137
- Refined speech segmentation for concatenative speech synthesis
- Sethy, A., and Narayanan, S. Refined speech segmentation for concatenative speech synthesis. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Sethy, A.¹ Narayanan, S.²

400
- 0000163989
- A mathematical model of communication
- Shannon, C. E. A mathematical model of communication. Technical report, Bell System Technical Journal (1948).
- (1948) Bell System Technical Journal
- Shannon, C.E.¹

401
- 9744261285
- Urbana, IL: University of Illinois Press
- Shannon, C. E., and Weaver, W. A Mathematical Model of Communication. Urbana, IL: University of Illinois Press (1949).
- (1949) A Mathematical Model of Communication
- Shannon, C.E.¹ Weaver, W.²

402
- 85032406881
- Prosodic phrasing with inductive learning
- Sheng, Z., Jianhua, T., and Lianhong, C. Prosodic phrasing with inductive learning. In Proceedings of the International Conference on Spoken Language Processing, Interspeech 2002 (2002).
- (2002) Proceedings of the International Conference on Spoken Language Processing, Interspeech 2002
- Sheng, Z.¹ Jianhua, T.² Lianhong, C.³

403
- 85009106967
- A comparison of statistical methods and features for the prediction of prosodic structures
- Shi, Q., and Fischer, V. A comparison of statistical methods and features for the prediction of prosodic structures. In Proceedings of the International Conference on Speech and Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Speech and Language Processing 2004
- Shi, Q.¹ Fischer, V.²

404
- 85009257840
- Eigenvoices for HMM-based speech synthesis
- Shichiri, K., Sawabe, A., Yoshimura, T. et al. Eigenvoices for HMM-based speech synthesis. In Proceedings of the 8th International Conference on Spoken Language Processing (2002).
- (2002) Proceedings of the 8th International Conference on Spoken Language Processing
- Shichiri, K.¹ Sawabe, A.² Yoshimura, T.³

405
- 0003945319
- PhD thesis, University of Cambridge
- Silverman, K. The Structure and Processing of Fundamental Frequency Contours. PhD thesis, University of Cambridge (1987).
- (1987) The Structure and Processing of Fundamental Frequency Contours
- Silverman, K.¹

406
- 0001556085
- ToBI: A standard scheme for labelling prosody
- Silverman, K., Beckman, M., Pierrehumbert, J. et al. ToBI: A standard scheme for labelling prosody. In Proceedings of the International Conference on Speech and Language Processing 1992 (1992).
- (1992) Proceedings of the International Conference on Speech and Language Processing 1992
- Silverman, K.¹ Beckman, M.² Pierrehumbert, J.³

407
- 33947703664
- The CU-HTK Mandarin Broadcast News transcription system
- Sinha, R., Gales, M. J. F., Kim, D. Y. et al. The CU-HTK Mandarin Broadcast News transcription system. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2006 (2006).
- (2006) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2006
- Sinha, R.¹ Gales, M.² Kim, D.Y.³

408
- 12344260444
- Comparative evaluation of synthetic prosody with the PURR method
- Sonntag, G. P., and Portele, T. Comparative evaluation of synthetic prosody with the PURR method. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Sonntag, G.P.¹ Portele, T.²

409
- 84966441141
- Proper name pronunciations for speech technology applications
- Spiegel, M. F. Proper name pronunciations for speech technology applications. In Proceedings of the 2002 IEEE Workshop on Speech Synthesis (2002).
- (2002) Proceedings of the 2002 IEEE Workshop on Speech Synthesis
- Spiegel, M.F.¹

410
- 0028405433
- English noun-phrase accent prediction for text-to-speech
- Sproat, R. English noun-phrase accent prediction for text-to-speech. In Computer Speech and Language (1994), vol. 8, pp. 79-94.
- (1994) Computer Speech and Language , vol.8 , pp. 79-94
- Sproat, R.¹

411
- 0004161686
- Dordrecht: Kluwer Academic Publishers
- Sproat, R. Multilingual Text-to-Speech Synthesis: The Bell Labs Approach. Dordrecht: Kluwer Academic Publishers (1997).
- (1997) Multilingual Text-To-Speech Synthesis: The Bell Labs Approach
- Sproat, R.¹

412
- 85009106548
- Corpus-based methods and hand-built methods
- Sproat, R. Corpus-based methods and hand-built methods. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Sproat, R.¹

413
- 33646670439
- PMtools: A pronunciation modeling toolkit
- Sproat, R. PMtools: A pronunciation modeling toolkit. In Proceedings of the Fourth ISCA Tutorial and Research Workshop on Speech Synthesis (2001).
- (2001) Proceedings of the Fourth ISCA Tutorial and Research Workshop on Speech Synthesis
- Sproat, R.P.¹

414
- 85009212430
- Experimental tools to evaluate intelligibility of text-to-speech (TTS) synthesis: Effects of voice gender and signal quality
- Stevens, C., Lees, N., and Vonwiller, J. Experimental tools to evaluate intelligibility of text-to-speech (TTS) synthesis: Effects of voice gender and signal quality. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Stevens, C.¹ Lees, N.² Vonwiller, J.³

415
- 84955035459
- A scale for the measurement of the psychological magnitude of pitch
- Stevens, S., Volkman, J., and Newman, E. A scale for the measurement of the psychological magnitude of pitch. Journal of the Acoustical Society of America 8 (1937), 185-190.
- (1937) Journal of the Acoustical Society of America , vol.8 , pp. 185-190
- Stevens, S.¹ Volkman, J.² Newman, E.³

416
- 0001441770
- Synthesis by word concatenation
- Stober, K., Portele, T., Wagner, P., and Hess, W. Synthesis by word concatenation. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Stober, K.¹ Portele, T.² Wagner, P.³ Hess, W.⁴

417
- 0026881761
- On the relation between voice source parameters and prosodic features in connected speech
- Strik, H., and Boves, L. On the relation between voice source parameters and prosodic features in connected speech. Speech Communication 11, 2 (1992), 167-174.
- (1992) Speech Communication , vol.11 , Issue.2 , pp. 167-174
- Strik, H.¹ Boves, L.²

418
- 21844462761
- From text to prosody without ToBI
- Strom, V. From text to prosody without ToBI. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Strom, V.¹

419
- 85133551970
- Concatenative speech synthesis using a harmonic plus noise model
- Stylianou, Y. Concatenative speech synthesis using a harmonic plus noise model. In Proceedings of the Third ESCA Speech Synthesis Workshop (1998).
- (1998) Proceedings of the Third ESCA Speech Synthesis Workshop
- Stylianou, Y.¹

420
- 0034854702
- Perceptual and objective detection of discontinuities in concatenative speech synthesis
- Stylianou, Y., and Syrdal, A. K. Perceptual and objective detection of discontinuities in concatenative speech synthesis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2001 (2001).
- (2001) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2001
- Stylianou, Y.¹ Syrdal, A.K.²

421
- 0034854702
- Perceptual and objective detection of discontinuities in concatenative speech synthesis
- Stylianou, Y., and Syrdal, A. K. Perceptual and objective detection of discontinuities in concatenative speech synthesis. In International Conference on Acoustics, Speech, and Signal Processing (2001).
- (2001) International Conference on Acoustics, Speech, and Signal Processing
- Stylianou, Y.¹ Syrdal, A.K.²

422
- 0027813728
- Novel-word pronunciation: A cross-language study
- Sullivan, K., and Damper, R. Novel-word pronunciation: A cross-language study. Speech Communication 13, 3-4 (1993), 441-452.
- (1993) Speech Communication , vol.13 , Issue.3-4 , pp. 441-452
- Sullivan, K.¹ Damper, R.²

423
- 85009106023
- Intonational phrase break prediction using decision tree and n-gram model
- Sun, X., and Applebaum, T. H. Intonational phrase break prediction using decision tree and n-gram model. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Sun, X.¹ Applebaum, T.H.²

424
- 0035283546
- The effect of speech melody on voice quality
- Swerts, M., and Veldhuis, R. The effect of speech melody on voice quality. Speech Communication 33, 4 (2001), 297-303.
- (2001) Speech Communication , vol.33 , Issue.4 , pp. 297-303
- Swerts, M.¹ Veldhuis, R.²

425
- 0026880274
- Evaluation of speech synthesis techniques in a comprehension task
- Sydeserff, H. A., Caley, R. J., Isard, S. D. et al. Evaluation of speech synthesis techniques in a comprehension task. Speech Communication 11, 2-3 (1992), 189-194.
- (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 189-194
- Sydeserff, H.A.¹ Caley, R.J.² Isard, S.D.³

426
- 85009151528
- Prosodic effects on listener detection of vowel concatenation
- Syrdal, A. K. Prosodic effects on listener detection of vowel concatenation. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Syrdal, A.K.¹

427
- 85032395010
- Data-driven perceptually based join costs
- Syrdal, A. K., and Conkie, A. D. Data-driven perceptually based join costs. In International Conference on Spoken Language Processing 2004 (2004).
- (2004) International Conference on Spoken Language Processing 2004
- Syrdal, A.K.¹ Conkie, A.D.²

428
- 44949171964
- Perceptually-based data-driven join costs: Comparing join types
- Syrdal, A. K., and Conkie, A. D. Perceptually-based data-driven join costs: Comparing join types. In Proceedings of Eurospeech, Interspeech 2005 (2005).
- (2005) Proceedings of Eurospeech, Interspeech 2005
- Syrdal, A.K.¹ Conkie, A.D.²

429
- 29144475179
- Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing
- Tachibana, M., Yamagishi, J., Masuko, T., and Kobayashi, T. Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing. IEICE Transactions on Information and Systems 2005 E88-D11 (2004), 2484-2491.
- (2004) IEICE Transactions on Information and Systems 2005 , pp. 2484-2491
- Tachibana, M.¹ Yamagishi, J.² Masuko, T.³ Kobayashi, T.⁴

430
- 44949120030
- Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database
- Takahashi, T., Takeshi, F., Nishi, M. et al. Voice and emotional expression transformation based on statistics of vowel parameters in an emotional speech database. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Takahashi, T.¹ Takeshi, F.² Nishi, M.³

431
- 0001455934
- A robust algorithm for pitch tracking RAPT
- W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam: Elsevier
- Talkin, D. A robust algorithm for pitch tracking RAPT. In Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam: Elsevier (1995), pp. 495-518.
- (1995) Speech Coding and Synthesis , pp. 495-518
- Talkin, D.¹

432
- 84919662542
- Oxford: Oxford University Press
- Tatham, M., and Morton, K. Expression in Speech: Analysis and Synthesis. Oxford: Oxford University Press (2004).
- (2004) Expression in Speech: Analysis and Synthesis
- Tatham, M.¹ Morton, K.²

433
- 34547547622
- Hidden Markov models for grapheme to phoneme conversion
- Taylor, P. Hidden Markov models for grapheme to phoneme conversion. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Taylor, P.¹

434
- 0042514318
- University of Edinburgh, Published by Indiana University Linguistics Club
- Taylor, P. A. A Phonetic Model of English Intonation. PhD thesis, University of Edinburgh (1992). Published by Indiana University Linguistics Club.
- (1992) A Phonetic Model of English Intonation. PhD thesis
- Taylor, P.A.¹

435
- 0028529843
- The rise/fall/connection model of intonation
- Taylor, P. A. The rise/fall/connection model of intonation. Speech Communication 15 (1995), 169-186.
- (1995) Speech Communication , vol.15 , pp. 169-186
- Taylor, P.A.¹

436
- 0034008810
- Analysis and synthesis of intonation using the tilt model
- Taylor, P. A. Analysis and synthesis of intonation using the tilt model. Journal of the Acoustical Society of America 107, 4 (2000), 1697-1714.
- (2000) Journal of the Acoustical Society of America , vol.107 , Issue.4 , pp. 1697-1714
- Taylor, P.A.¹

437
- 44949153641
- The target cost formulation in unit selection speech synthesis
- Taylor, P. A. The target cost formulation in unit selection speech synthesis. In Proceedings of the International Conference on Speech and Language Processing, Interspeech 2006 (2006).
- (2006) Proceedings of the International Conference on Speech and Language Processing, Interspeech 2006
- Taylor, P.A.¹

438
- 34547539669
- Unifying unit selection and hidden Markov model speech synthesis
- Taylor, P. A. Unifying unit selection and hidden Markov model speech synthesis. In Proceedings of the International Conference on Speech and Language Processing, Interspeech 2006 (2006).
- (2006) Proceedings of the International Conference on Speech and Language Processing, Interspeech 2006
- Taylor, P.A.¹

439
- 85093895234
- Synthesizing conversational intonation from a linguistically rich input
- Taylor, P. A., and Black, A. W. Synthesizing conversational intonation from a linguistically rich input. In Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, pp. 175-178.
- In Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis , pp. 175-178
- Taylor, P.A.¹ Black, A.W.²

440
- 85135272129
- Speech synthesis by phonological structure matching
- Taylor, P. A., and Black, A. W. Speech synthesis by phonological structure matching. In Proceedings of Eurospeech 1999 (1999), pp. 623-626.
- (1999) Proceedings of Eurospeech 1999 , pp. 623-626
- Taylor, P.A.¹ Black, A.W.²

441
- 0035155093
- Heterogeneous relation graphs as a mechanism for representing linguistic information
- Taylor, P. A., Black, A. W., and Caley, R. J. Heterogeneous relation graphs as a mechanism for representing linguistic information. Speech Communication special issue on annotation, 1-2 (2000), 153-174.
- (2000) Speech Communication special issue on annotation , Issue.1-2 , pp. 153-174
- Taylor, P.A.¹ Black, A.W.² Caley, R.J.³

442
- 0347128737
- Intonation and dialogue context as constraints for speech recognition
- Taylor, P. A., King, S., Isard, S. D., and Wright, H. Intonation and dialogue context as constraints for speech recognition. Language and Speech 41, 3-4 (1998), 491-512.
- (1998) Language and Speech , vol.41 , Issue.3-4 , pp. 491-512
- Taylor, P.A.¹ King, S.² Isard, S.D.³ Wright, H.⁴

443
- 85135152214
- Using intonation to constrain language models in speech recognition
- Taylor, P. A., King, S., Isard, S. D., Wright, H., and Kowtko, J. Using intonation to constrain language models in speech recognition. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Taylor, P.A.¹ King, S.² Isard, S.D.³ Wright, H.⁴ Kowtko, J.⁵

444
- 85032400113
- A real time speech synthesis system
- Taylor, P. A., Nairn, I. A., Sutherland, A. M., and Jack, M. A. A real time speech synthesis system. In Proceedings of Eurospeech 1991 (1991).
- (1991) Proceedings of Eurospeech 1991
- Taylor, P.A.¹ Nairn, I.A.² Sutherland, A.M.³ Jack, M.A.⁴

445
- 0002872229
- Intonation by rule: A perceptual quest
- ’t Hart, J., and Cohen, A. Intonation by rule: A perceptual quest. Journal of Phonetics 1 (1973), 309-327.
- (1973) Journal of Phonetics , vol.1 , pp. 309-327
- ’t Hart, J.¹ Cohen, A.²

446
- 0000372476
- Integrating different levels of intonation analysis
- ’t Hart, J., and Collier, R. Integrating different levels of intonation analysis. Journal of Phonetics 3 (1975), 235-255.
- (1975) Journal of Phonetics , vol.3 , pp. 235-255
- ’t Hart, J.¹ Collier, R.²

447
- 85032420364
- Symbolic prosody driven unit selection for highly natural synthetic speech
- Tihelka, D. Symbolic prosody driven unit selection for highly natural synthetic speech. In Proceedings of Eurospeech, Interspeech 2005 (2005).
- (2005) Proceedings of Eurospeech, Interspeech 2005
- Tihelka, D.¹

448
- 4544270859
- Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis
- Toda, T., Kawai, H., and Tsuzaki, M. Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2004
- Toda, T.¹ Kawai, H.² Tsuzaki, M.³

449
- 0141590580
- Segment selection considering local degradation of naturalness in concatenative speech synthesis
- Toda, T., Kawai, H., Tsuzaki, M., and Shikano, K. Segment selection considering local degradation of naturalness in concatenative speech synthesis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2003 (2003).
- (2003) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2003
- Toda, T.¹ Kawai, H.² Tsuzaki, M.³ Shikano, K.⁴

450
- 33846410497
- Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- Toda, T., and Tokuda, K. Speech parameter generation algorithm considering global variance for HMM-based speech synthesis. In Proceedings of Eurospeech, Interspeech 2005 (2005).
- (2005) Proceedings of Eurospeech, Interspeech 2005
- Toda, T.¹ Tokuda, K.²

451
- 0028996993
- Speech parameter generation from HMM using dynamic features
- Tokuda, K., Kobayashi, T., and Imai, S. Speech parameter generation from HMM using dynamic features. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1995 (1995).
- (1995) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1995
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

452
- 85031628788
- An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
- Tokuda, K., Masuko, T., and Yamada, T. An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features. In Proceedings of Eurospeech 1995 (1995).
- (1995) Proceedings of Eurospeech 1995
- Tokuda, K.¹ Masuko, T.² Yamada, T.³

453
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- Tokuda, K., Yoshimura, T., Masuko, T., Kobayashi, T., and Kitamura, T. Speech parameter generation algorithms for HMM-based speech synthesis. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2000
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

454
- 85009231267
- Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features
- Tokuda, K., Zen, H., and Kitamura, T. Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Tokuda, K.¹ Zen, H.² Kitamura, T.³

455
- 85032401166
- Automatic transcription of intonation using an identified prosodic alphabet
- Touremire, D. S. Automatic transcription of intonation using an identified prosodic alphabet. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Touremire, D.S.¹

456
- 85009083859
- Feature extraction by auditory modeling for unit selection in concatenative speech synthesis
- Tsuzaki, M. Feature extraction by auditory modeling for unit selection in concatenative speech synthesis. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Tsuzaki, M.¹

457
- 85032413724
- Constructing emotional speech synthesizers with limited speech database
- Tszuki, R., Zen, H., Tokuda, K. et al. Constructing emotional speech synthesizers with limited speech database. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Tszuki, R.¹ Zen, H.² Tokuda, K.³

458
- 85089833692
- Voice quality interpolation for emotional speech synthesis
- Turk, O., Schroder, M., Bozkurt, B., and Arslan, L. Voice quality interpolation for emotional speech synthesis. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Turk, O.¹ Schroder, M.² Bozkurt, B.³ Arslan, L.⁴

459
- 85032410013
- Grapheme-to-phoneme conversion using pseudo-morphological units
- Uebler, U. Grapheme-to-phoneme conversion using pseudo-morphological units. In Proceedings of Interspeech 2002 (2002).
- (2002) Proceedings of Interspeech 2002
- Uebler, U.¹

460
- 85032410315
- A method for fully automatic analysis and modelling of voice source characteristics
- Darsinos, D. G., and Kokkinakis, G. A method for fully automatic analysis and modelling of voice source characteristics. In Proceedings of Eurospeech 1995 (1995).
- (1995) Proceedings of Eurospeech 1995
- Darsinos, D.G.¹ Kokkinakis, G.²

461
- 0032118314
- Towards a blackboard model of accenting
- Van Deemter, K. Towards a blackboard model of accenting. Computer Speech and Language 12, 3 (1998), 143-164.
- (1998) Computer Speech and Language , vol.12 , Issue.3 , pp. 143-164
- Van Deemter, K.¹

462
- 85009080290
- Evaluation of pros-3 for the assignment of prosodic structure, compared to assignment by human experts
- van Herwijnen, O., and Terken, J. Evaluation of pros-3 for the assignment of prosodic structure, compared to assignment by human experts. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Van Herwijnen, O.¹ Terken, J.²

463
- 0028405296
- Assignment of segmental duration in text-to-speech synthesis
- van Santen, J. Assignment of segmental duration in text-to-speech synthesis. Computer Speech and Language 8 (1994), 95-128.
- (1994) Computer Speech and Language , vol.8 , pp. 95-128
- Van Santen, J.¹

464
- 21844436737
- Quantitative modeling of pitch accent alignment
- van Santen, J. Quantitative modeling of pitch accent alignment. In International Conference on Speech Prosody (2002), pp. 107-112.
- (2002) International Conference on Speech Prosody , pp. 107-112
- Van Santen, J.¹

465
- 85009187535
- Applications and computer generated expressive speech for communication disorders
- van Santen, J., Black, L., Cohen, G. et al. Applications and computer generated expressive speech for communication disorders. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Van Santen, J.¹ Black, L.² Cohen, G.³

466
- 21844466234
- Synthesis of prosody using multi-level unit sequences
- van Santen, J., Kain, A., Klabbers, E., and Mishra, T. Synthesis of prosody using multi-level unit sequences. Speech Communication 46 (2005), 365-375.
- (2005) Speech Communication , vol.46 , pp. 365-375
- Van Santen, J.¹ Kain, A.² Klabbers, E.³ Mishra, T.⁴

467
- 85032406625
- Unified physiological model of audible-visible speech production
- Vatikiotis-Bateson, E., and Yehia, H. Unified physiological model of audible-visible speech production. In Proceedings of Eurospeech 1997 (1997).
- (1997) Proceedings of Eurospeech 1997
- Vatikiotis-Bateson, E.¹ Yehia, H.²

468
- 85009085843
- Consistent pitch marking
- Veldhuis, R. Consistent pitch marking. In Proceedings of the International Conference on Speech and Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Speech and Language Processing 2000
- Veldhuis, R.¹

469
- 85009167944
- Kalman-filter based join cost for unit-selection speech synthesis
- Vepa, J., and King, S. Kalman-filter based join cost for unit-selection speech synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Vepa, J.¹ King, S.²

470
- 85009063825
- Subjective evaluation of join cost functions used in unit selection speech synthesis
- Vepa, J., and King, S. Subjective evaluation of join cost functions used in unit selection speech synthesis. In Proceedings of the International Conference on Speech and Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Speech and Language Processing 2004
- Vepa, J.¹ King, S.²

471
- 85009279358
- Objective distance measures for spectral discontinuities in concatenative speech synthesis
- Vepa, J., King, S., and Taylor, P. Objective distance measures for spectral discontinuities in concatenative speech synthesis. In Proceedings of the International Conference on Speech and Language Processing 2002 (2002).
- (2002) Proceedings of the International Conference on Speech and Language Processing 2002
- Vepa, J.¹ King, S.² Taylor, P.³

472
- 85128348481
- Automatic prosodic labeling of 6 languages
- Vereecken, H., Martens, J., Grover, C., Fackrell, J., and Van Coile, B. Automatic prosodic labeling of 6 languages. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998), pp. 1399-1402.
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998 , pp. 1399-1402
- Vereecken, H.¹ Martens, J.² Grover, C.³ Fackrell, J.⁴ Van Coile, B.⁵

473
- 85009061166
- A unified view on synchronized overlap-add methods for prosodic modification of speech
- Verhelst, W., Compernolle, D. V., and Wambacq, P. A unified view on synchronized overlap-add methods for prosodic modification of speech. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000), vol. 2, pp. 63-66.
- (2000) Proceedings of the International Conference on Spoken Language Processing , vol.2 , pp. 63-66
- Verhelst, W.¹ Compernolle, D.V.² Wambacq, P.³

474
- 0032296808
- A stochastic model of intonation for text-to-speech synthesis
- Véronis, J., Di Cristo, P., Courtois, F., and Chaumette, C. A stochastic model of intonation for text-to-speech synthesis. Speech Communication 26, 4 (1998), 233-244.
- (1998) Speech Communication , vol.26 , Issue.4 , pp. 233-244
- Véronis, J.¹ Di Cristo, P.² Courtois, F.³ Chaumette, C.⁴

475
- 33745214458
- Estimation of LF glottal source parameters based on an ARX model
- Vincent, D., Rosec, O., and Chonavel, T. Estimation of LF glottal source parameters based on an ARX model. In Proceedings of Eurospeech, Interspeech 2005 (2005), pp. 333-336.
- (2005) Proceedings of Eurospeech, Interspeech 2005 , pp. 333-336
- Vincent, D.¹ Rosec, O.² Chonavel, T.³

476
- 84935113569
- Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
- Viterbi, A. J. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13, 2 (1967), 260-269.
- (1967) IEEE Transactions on Information Theory , vol.13 , Issue.2 , pp. 260-269
- Viterbi, A.J.¹

477
- 4243052080
- Vienna: J. V. Degen
- von Kempelen, W. Mechanismus der menschlichen Sprache nebst Beschreibung einer sprechenden Maschine and Le Mecanisme de la parole, suivi de la description d’une machine parlante. Vienna: J. V. Degen (1791).
- (1791) Mechanismus der menschlichen Sprache nebst Beschreibung einer sprechenden Maschine and Le Mecanisme de la parole, suivi de la description d’une machine parlante
- Von Kempelen, W.¹

478
- 85032419083
- Comprehension of prosody in synthesized speech
- Vonwiller, J. P., King, R. W., Stevens, K., and Latimer, C. R. Comprehension of prosody in synthesized speech. In Proceedings of the Third International Australian Conference on Speech Science and Technology (1990).
- (1990) Proceedings of the Third International Australian Conference on Speech Science and Technology
- Vonwiller, J.P.¹ King, R.W.² Stevens, K.³ Latimer, C.R.⁴

479
- 85009070758
- Use of clustering information for coarticulation compensation in speech synthesis by word concatenation
- Vosnidis, C., and Digalakis, V. Use of clustering information for coarticulation compensation in speech synthesis by word concatenation. In Proceedings of Eurospeech 2001 (2001).
- (2001) Proceedings of Eurospeech 2001
- Vosnidis, C.¹ Digalakis, V.²

480
- 34147153739
- Piecewise linear stylization of pitch via wavelet analysis
- Wang, D., and Narayanan, S. Piecewise linear stylization of pitch via wavelet analysis. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Wang, D.¹ Narayanan, S.²

481
- 4544373879
- Refining segmental boundaries for TTS database using fine contextual-dependent boundary models
- Wang, L., Zhao, Y., Chu, M., Zhou, J., and Cao, Z. Refining segmental boundaries for TTS database using fine contextual-dependent boundary models. In Proceedings of the International Conference on Acoustics Speech and Signal Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Acoustics Speech and Signal Processing 2004
- Wang, L.¹ Zhao, Y.² Chu, M.³ Zhou, J.⁴ Cao, Z.⁵

482
- 85032401540
- Improving letter-to-pronunciation accuracy with automatic morphologically-based stress prediction
- Webster, G. Improving letter-to-pronunciation accuracy with automatic morphologically-based stress prediction. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Webster, G.¹

483
- 0004287328
- London: Pearson Education Limited
- Wells, J. C. Longman Pronunciation Dictionary. London: Pearson Education Limited (2000).
- (2000) Longman Pronunciation Dictionary
- Wells, J.C.¹

484
- 85032421589
- A new parametric speech analysis and synthesis technique in the frequency domain
- Wer, B. R., Leroux, A., Delbrouck, H. P., and Leclercs, J. A new parametric speech analysis and synthesis technique in the frequency domain. In Proceedings of Eurospeech 1995 (1995).
- (1995) Proceedings of Eurospeech 1995
- Wer, B.R.¹ Leroux, A.² Delbrouck, H.P.³ Leclercs, J.⁴

485
- 0019145834
- An integrated circuit for speech synthesis
- Wiggins, R. An integrated circuit for speech synthesis. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 1980 (1980), pp. 398-401.
- (1980) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing , vol.1980 , pp. 398-401
- Wiggins, R.¹

486
- 21844471192
- ToBI or not ToBI
- Wightman, C. ToBI or not ToBI. Speech Prosody (2002), 25-29.
- (2002) Speech Prosody , pp. 25-29
- Wightman, C.¹

487
- 84888214524
- Evaluation of an efficient prosody labeling system for spontaneous speech utterances
- Wightman, C. W., and Rose, R. C. Evaluation of an efficient prosody labeling system for spontaneous speech utterances. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop 1 (1999), pp. 333-336.
- (1999) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop , vol.1 , pp. 333-336
- Wightman, C.W.¹ Rose, R.C.²

488
- 84925037939
- Main page - Wikipedia, the free encyclopedia, [Online; accessed 30 January 2007]
- Wikipedia. Main page - Wikipedia, the free encyclopedia (2007). [Online; accessed 30 January 2007].
- (2007)

489
- 0029000585
- Physiological modeling of speech production: Methods for modeling soft-tissue articulators
- Wilhelms-Tricarico, R. Physiological modeling of speech production: Methods for modeling soft-tissue articulators. Journal of the Acoustical Society of America 97, 5 (1995), 3085-3098.
- (1995) Journal of the Acoustical Society of America , vol.97 , Issue.5 , pp. 3085-3098
- Wilhelms-Tricarico, R.¹

490
- 0029704037
- A biomechanical and physiologically-based vocal tract model and its control
- Wilhelms-Tricarico, R. A biomechanical and physiologically-based vocal tract model and its control. Journal of Phonetics 24 (1996), 23-28.
- (1996) Journal of Phonetics , vol.24 , pp. 23-28
- Wilhelms-Tricarico, R.¹

491
- 85032394935
- Biomechanical and physiologically based speech modeling
- Wilhelms-Tricarico, R., and Perkell, J. S. Biomechanical and physiologically based speech modeling. In Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis (1994), pp. 17-20.
- (1994) Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis , pp. 17-20
- Wilhelms-Tricarico, R.¹ Perkell, J.S.²

492
- 85032405550
- Towards a physiological model of speech production
- Wilhelms-Tricarico, R., and Perkell, J. S. Towards a physiological model of speech production. In Proceedings of the XIVth International Congress of Phonetic Science 1999, pp. 1753-1756.
- (1999) Proceedings of the XIVth International Congress of Phonetic Science , pp. 1753-1756
- Wilhelms-Tricarico, R.¹ Perkell, J.S.²

493
- 84926270981
- A model of standard English intonation patterns
- Willems, N. J. A model of standard English intonation patterns. IPO Annual Progress Report (1983).
- (1983) IPO Annual Progress Report
- Willems, N.J.¹

494
- 0036461035
- Large scale discriminative training of hidden Markov models for speech recognition
- Woodland, P. C., and Povey, D. Large scale discriminative training of hidden Markov models for speech recognition. Computer Speech and Language 16 (2002), 25-47.
- (2002) Computer Speech and Language , vol.16 , pp. 25-47
- Woodland, P.C.¹ Povey, D.²

495
- 81155152572
- A perceptual evaluation of distance measures for concatenative speech
- Wouters, J., and Macon, M. W. A perceptual evaluation of distance measures for concatenative speech. In Proceedings of the International Conference on Speech and Language Processing 1998 (1998).
- (1998) Proceedings of the International Conference on Speech and Language Processing 1998
- Wouters, J.¹ Macon, M.W.²

496
- 85009080327
- Unit fusion for concatenative speech synthesis
- Wouters, J., and Macon, M. W. Unit fusion for concatenative speech synthesis. In Proceedings of the International Conference on Spoken Language Processing 2000 (2000).
- (2000) Proceedings of the International Conference on Spoken Language Processing 2000
- Wouters, J.¹ Macon, M.W.²

497
- 0040494557
- An investigation of sagittal velar movement and its correlation with lip, tongue and jaw movement
- Wrench, A. A. An investigation of sagittal velar movement and its correlation with lip, tongue and jaw movement. In Proceedings of the International Congress of Phonetic Sciences (1999), pp. 435-438.
- (1999) Proceedings of the International Congress of Phonetic Sciences , pp. 435-438
- Wrench, A.A.¹

498
- 4544270860
- Minimum segmentation error based discriminative training for speech synthesis application
- Wu, Y., Kawai, H., Ni, J., and Wang, R.-H. Minimum segmentation error based discriminative training for speech synthesis application. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Acoustics, Speech, and Signal Processing 2004
- Wu, Y.¹ Kawai, H.² Ni, J.³ Wang, R.-H.⁴

499
- 85009228508
- On unit analysis for Cantonese corpus-based TTS
- Xu, J., Choy, T., Dong, M., Guan, C., and Li, H. On unit analysis for Cantonese corpus-based TTS. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Xu, J.¹ Choy, T.² Dong, M.³ Guan, C.⁴ Li, H.⁵

500
- 21844447847
- Speech melody as articulatory implemented communicative functions
- Xu, Y. Speech melody as articulatory implemented communicative functions. Speech Communication 46 (2005), 220-251.
- (2005) Speech Communication , vol.46 , pp. 220-251
- Xu, Y.¹

501
- 79959816265
- Speech prosody as articulated communicative functions
- Xu, Y. Speech prosody as articulated communicative functions. In Proceedings of Speech Prosody 2006 (2006).
- (2006) Proceedings of Speech Prosody 2006
- Xu, Y.¹

502
- 85135109865
- ATR - v-TALK speech synthesis system
- Sagisaka, Y. Kaiki, N. I., and Mimura, K. ATR - v-TALK speech synthesis system. In Proceedings of the International Conference on Speech and Language Processing 1992 (1992), vol. 1, pp. 483-486.
- (1992) Proceedings of the International Conference on Speech and Language Processing , vol.1 , pp. 483-486
- Sagisaka, Y.K.¹ Mimura, K.²

503
- 85009080581
- MLLR adaptation for hidden semi-Markov model based speech synthesis
- Yamagishi, J., Masuko, T., and Kobayashi, T. MLLR adaptation for hidden semi-Markov model based speech synthesis. In Proceedings of the 8th International Conference on Spoken Language Processing (2004).
- (2004) Proceedings of the 8th International Conference on Spoken Language Processing
- Yamagishi, J.¹ Masuko, T.² Kobayashi, T.³

504
- 85009177437
- Modeling of various speaking styles and emotions for HMM-based speech synthesis
- Yamagishi, J., Onishi, K., Masuko, T., and Kobayashi, T. Modeling of various speaking styles and emotions for HMM-based speech synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Yamagishi, J.¹ Onishi, K.² Masuko, T.³ Kobayashi, T.⁴

505
- 0343846863
- Homograph disambiguation in speech synthesis
- Yarowsky, D. Homograph disambiguation in speech synthesis. In Second ESCA/IEEE Workshop on Speech Synthesis (1994).
- (1994) Second ESCA/IEEE Workshop on Speech Synthesis
- Yarowsky, D.¹

506
- 85032399236
- Homograph disambiguation in text-to-speech synthesis
- Yarowsky, D. Homograph disambiguation in text-to-speech synthesis. In Computer Speech and Language (1996).
- (1996) Computer Speech and Language
- Yarowsky, D.¹

507
- 85009064449
- A prosodic phrasing model for a Korean text-to-speech synthesis system
- Yoon, K. A prosodic phrasing model for a Korean text-to-speech synthesis system. In Proceedings of the International Conference on Speech and Language Processing 2004 (2004).
- (2004) Proceedings of the International Conference on Speech and Language Processing 2004
- Yoon, K.¹

508
- 85009139544
- Simultaneous modelling of spectrum, pitch and duration in HMM-based speech synthesis
- Yoshimura, T., Tokuda, K., Masuko, T., Kobayashi, T., and Kitamura, T. Simultaneous modelling of spectrum, pitch and duration in HMM-based speech synthesis. In Proceedings of Eurospeech 1999 (1999).
- (1999) Proceedings of Eurospeech 1999
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

509
- 85009097254
- Mixed excitation for HMM-based speech synthesis
- Yoshimura, T., Tokuda, K., Masuko, T., Kobayashi, T., and Kitamura, T. Mixed excitation for HMM-based speech synthesis. In Proceedings of the European Conference on Speech Communication and Technology 2001, vol. 3 (2001), pp. 2259-2262.
- (2001) Proceedings of the European Conference on Speech Communication and Technology , vol.3 , pp. 2259-2262
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

510
- 85032400139
- Young, S. J., Evermann, G., Kershaw, D. et al. The HTK Book (1995-2006).
- (1995) The HTK Book
- Young, S.J.¹ Evermann, G.² Kershaw, D.³

511
- 85032414707
- Data pruning approach to unit selection for inventory generation of concatenative embeddable Chinese TTS systems
- Yu, Z.-L., Wang, K.-Z., Zu, Y.-Q., Yue, D.-J., and Chen, G.-L. Data pruning approach to unit selection for inventory generation of concatenative embeddable Chinese TTS systems. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Yu, Z.-L.¹ Wang, K.-Z.² Zu, Y.-Q.³ Yue, D.-J.⁴ Chen, G.-L.⁵

512
- 48749130217
- Pitch accent prediction: Effects of genre and speaker
- Yuan, J., Brenier, J., and Jurafsky, D. Pitch accent prediction: Effects of genre and speaker. In Proceedings of Interspeech (2005).
- (2005) Proceedings of Interspeech
- Yuan, J.¹ Brenier, J.² Jurafsky, D.³

513
- 33846446896
- An overview of Nitech HMM-based speech synthesis system for Blizzard Challenge 2005
- Zen, H., and Toda, T. An overview of Nitech HMM-based speech synthesis system for Blizzard Challenge 2005. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Zen, H.¹ Toda, T.²

514
- 85009111560
- Hidden semi-Markov model based speech synthesis
- Zen, H., Tokuda, K., Masuko, T., Kobayashi, T., and Kitamura, T. Hidden semi-Markov model based speech synthesis. In Proceedings of the 8th International Conference on Spoken Language Processing, Interspeech 2004 (2004).
- (2004) Proceedings of the 8th International Conference on Spoken Language Processing, Interspeech 2004
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

515
- 85009103324
- Bayesian induction of intonational phrase breaks
- Zervas, P., Maragoudakis, M., Fakotakis, N., and Kokkinakis, G. Bayesian induction of intonational phrase breaks. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Zervas, P.¹ Maragoudakis, M.² Fakotakis, N.³ Kokkinakis, G.⁴

516
- 12344252665
- Total quality evaluation of speech synthesis systems
- Zhang, J., Dong, S., and Yu, G. Total quality evaluation of speech synthesis systems. In International Conference on Speech and Language Processing (1998).
- (1998) International Conference on Speech and Language Processing
- Zhang, J.¹ Dong, S.² Yu, G.³

517
- 85032412721
- Refining phoneme segmentations using speaker-adaptive context dependent boundary models
- Zhao, Y., Wang, L., Chu, M., Soong, F. K., and Cao, Z. Refining phoneme segmentations using speaker-adaptive context dependent boundary models. In Proceedings of Interspeech 2005 (2005).
- (2005) Proceedings of Interspeech 2005
- Zhao, Y.¹ Wang, L.² Chu, M.³ Soong, F.K.⁴ Cao, Z.⁵

518
- 26944460802
- Grapheme-to-phoneme conversion based on a fast TBL algorithm in Mandarin TTS systems
- Berlin: Springer-Verlag
- Zheng, M., Shi, Q., Zhang, W., and Cai, L. Grapheme-to-phoneme conversion based on a fast TBL algorithm in Mandarin TTS systems. Lecture Notes in Computer Science. Berlin: Springer-Verlag (2005), p. 600.
- (2005) Lecture Notes in Computer Science , pp. 600
- Zheng, M.¹ Shi, Q.² Zhang, W.³ Cai, L.⁴

519
- 4544350112
- Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis
- Zolfaghari, P., Nakatani, T., and Irino, T. Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis. In Proceedings of Eurospeech 2003 (2003).
- (2003) Proceedings of Eurospeech 2003
- Zolfaghari, P.¹ Nakatani, T.² Irino, T.³

520
- 85009092350
- Prosodic analysis of a multi-style corpus in the perspective of emotional speech synthesis
- Zovato, E., Sandri, S., Quazza, S., and Badino, L. Prosodic analysis of a multi-style corpus in the perspective of emotional speech synthesis. In Proceedings of Interspeech 2004 (2004).
- (2004) Proceedings of Interspeech 2004
- Zovato, E.¹ Sandri, S.² Quazza, S.³ Badino, L.⁴

521
- 0004236521
- Berlin: Springer-Verlag
- Zwicker, E., and Fastl, H. Psychoacoustics, Facts and Models. Berlin: Springer-Verlag (1990).
- (1990) Psychoacoustics, Facts and Models
- Zwicker, E.¹ Fastl, H.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.