SCOPUS 정보 검색 플랫폼

Springer Handbooks

Volumn , Issue , 2008, Pages 437-456

Corpus-Based Speech Synthesis

(1) Dutoit, Thierry a

a FACULTÉ POLYTECHNIQUE DE MONS (Belgium)

Author keywords

Automatic Speech Recognition; Automatic Speech Recognition System; Spectral Envelope; Speech Corpus; Speech Synthesis

Indexed keywords

EID: 68949165450 PISSN: 25228692 EISSN: 25228706 Source Type: Book Series
DOI: 10.1007/978-3-540-49127-9_21 Document Type: Chapter

Times cited : (21)

References (43)

1
- 0013634269
- A study of the building blocks in speech
- C.M. Harris: A study of the building blocks in speech, J. Acoust. Soc. Am. 25, 962–969 (1953)
- (1953) J. Acoust. Soc. Am. , vol.25 , pp. 962-969
- Harris, C.M.¹

2
- 0028823541
- Speech recognition with primarily temporal cues
- R.V. Shannon, F.G. Zeng, V. Kamath, J. Wygon-ski, M. Ekelid: Speech recognition with primarily temporal cues, Science 13(5234), 270 (1995)
- (1995) Science , vol.13 , Issue.5234
- Shannon, R.V.¹ Zeng, F.G.² Kamath, V.³ Wygon-Ski, J.⁴ Ekelid, M.⁵

3
- 0345093720
- Terminal analog synthesis of continuous speech using the diphone method of segment assembly
- N.R. Dixon, H.D. Maxey: Terminal analog synthesis of continuous speech using the diphone method of segment assembly, IEEE Trans. ASSP AU-16(1), 40–50 (1968)
- (1968) IEEE Trans. ASSP AU- , vol.16 , Issue.1 , pp. 40-50
- Dixon, N.R.¹ Maxey, H.D.²

4
- 0003834176
- Kluwer Academic, Dordrecht
- T. Dutoit: An Introduction to Text-To-Speech Synthesis (Kluwer Academic, Dordrecht 1997)
- (1997) An Introduction to Text-To-Speech Synthesis
- Dutoit, T.¹

5
- 85075932515
- Improving the quality of MBROLA synthesis for non-uniform units synthesis
- ed. by S. Narayanan, A. Alwan (Prentice-Hall, Upper Saddle River
- B. Bozkurt, T. Dutoit, R. Prudon, C. d’Alessandro, V. Pagel: Improving the quality of MBROLA synthesis for non-uniform units synthesis. In: Text to Speech Synthesis: New Paradigms and Advances, ed. by S. Narayanan, A. Alwan (Prentice-Hall, Upper Saddle River 2004)
- (2004) Text to Speech Synthesis: New Paradigms and Advances
- Bozkurt, B.¹ Dutoit, T.² Prudon, R.³ D’Alessandro, C.⁴ Pagel, V.⁵

6
- 0025543906
- Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines, F. Charpentier: Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun. 9, 5–6 (1990)
- (1990) Speech Commun , vol.9 , pp. 5-6
- Moulines, E.¹ Charpentier, F.²

7
- 0035127703
- Applying the harmonic plus noise model in concatenative synthesis
- Y. Stylianou: Applying the harmonic plus noise model in concatenative synthesis, IEEE Trans. Speech Audio Process. 9(1), 21–29 (2001)
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.1 , pp. 21-29
- Stylianou, Y.¹

8
- 0027252181
- An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech
- Vol., pp
- W. Verhelst, M. Roelands: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech, Proc. ICASSP 93, Vol. 2 (1993) pp. 554–557
- (1993) Proc. ICASSP 93 , vol.2 , pp. 554-557
- Verhelst, W.¹ Roelands, M.²

9
- 0008006444
- Time-scale modification algorithm for speech by use of pointer interval control overlap and add (PICOLA) and its evaluation
- Vol.,) pp
- N. Morita, F. Itakura: Time-scale modification algorithm for speech by use of pointer interval control overlap and add (PICOLA) and its evaluation, Proc. Annu. Meeting of Acoust. Soc. Jpn., Vol. 86 (1986) pp. 9–16
- (1986) Proc. Annu. Meeting of Acoust. Soc. Jpn , vol.86 , pp. 9-16
- Morita, N.¹ Itakura, F.²

10
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- R.J. Mac Aulay, T.F. Quatieri: Speech analysis/synthesis based on a sinusoidal representation, IEEE Trans. Acoust. Speech Signal Process. 34, 744– 754 (1986)
- (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , pp. 744-754
- Mac Aulay, R.J.¹ Quatieri, T.F.²

11
- 0024665683
- Frequency-varying sinusoidal modeling of speech
- J. Marques, L. Almeida: Frequency-varying sinusoidal modeling of speech, IEEE Trans. Acoust. Speech Signal Process. 37(5), 763–765 (1989)
- (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , Issue.5 , pp. 763-765
- Marques, J.¹ Almeida, L.²

12
- 0005500345
- Ph.D. Dissertation, Georgia Institute of Technology, Atlanta
- M.W. Macon: Speech Synthesis Based on Sinusoidal Modeling, Ph.D. Dissertation (Georgia Institute of Technology, Atlanta 1996)
- (1996) Speech Synthesis Based on Sinusoidal Modeling
- Macon, M.W.¹

13
- 0027839344
- MBR-PSOLA: Text-to-speech synthesis based on an MBE resynthesis of the segments database
- T. Dutoit, H. Leich: MBR-PSOLA: Text-to-speech synthesis based on an MBE resynthesis of the segments database, Speech Commun. 13, 435–440 (1993)
- (1993) Speech Commun , vol.13 , pp. 435-440
- Dutoit, T.¹ Leich, H.²

14
- 84941167756
- Optimal coupling of diphones
- ed. by J. Olive
- A. Conkie, S. Isard: Optimal coupling of diphones, Proc. 2nd ESCA/IEEE Workshop On Speech Synthesis Mohonk, ed. by J. Olive (1994)
- (1994) Proc. 2Nd ESCA/IEEE Workshop on Speech Synthesis Mohonk
- Conkie, A.¹ Isard, S.²

15
- 0035124445
- Control of spectral dynamics in concatenative speech synthesis
- J. Wouters, M.W. Macon: Control of spectral dynamics in concatenative speech synthesis, IEEE Trans. Speech Audio Process. 9(1), 30–38 (2001)
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.1 , pp. 30-38
- Wouters, J.¹ Macon, M.W.²

16
- 25344440024
- Ph.D. Dissertation (Institut de la Communication Parlée, Grenoble, in French
- V. Aubergé: La synthèse de la parole: des règles aux lexiques, Ph.D. Dissertation (Institut de la Communication Parlée, Grenoble 1991), in French
- (1991) La synthèse De La Parole: Des règles Aux Lexiques
- Aubergé, V.¹

17
- 0027228898
- Multilingual PSOLA text-to-speech system
- Vol.,) pp
- D. Bigorne, O. Boeffard, B. Cherbonnel, F. Emerard, D. Larreur, J.L. Le Saint-Milon, I. Metayer, C. Sorin, S. White: Multilingual PSOLA text-to-speech system, Proc. Int. Conf. Acoust. Speech Signal Process., Vol. 2 (1993) pp. 187–190
- (1993) Proc. Int. Conf. Acoust. Speech Signal Process , vol.2 , pp. 187-190
- Bigorne, D.¹ Boeffard, O.² Cherbonnel, B.³ Emerard, F.⁴ Larreur, D.⁵ Le Saint-Milon, J.L.I.⁶ Metayer, C.⁷ Sorin, S.W.⁸

18
- 0012356658
- HADIFIX, a system for German speech synthesis based on demisyllables, diphones, and suffixes
- T. Portele, W. Sendlemeier, W. Hess: HADIFIX, a system for German speech synthesis based on demisyllables, diphones, and suffixes, Proc. First ESCA Workshop on Speech Synthesis (1990) pp. 161– 164
- (1990) Proc. First ESCA Workshop on Speech Synthesis , pp. 161-164
- Portele, T.¹ Sendlemeier, W.² Hess, W.³

19
- 0028499480
- Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering
- S. Nakajima: Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering, Speech Commun. 14, 313–324 (1994)
- (1994) Speech Commun , vol.14 , pp. 313-324
- Nakajima, S.¹

20
- 85135181226
- Improvements in an HMM-based speech synthesizer
- Vol.,) pp
- R. Donovan, P. Woodland: Improvements in an HMM-based speech synthesizer, Proc. Eurospeech 95, Vol. 1 (1995) pp. 573–576
- (1995) Proc. Eurospeech 95 , vol.1 , pp. 573-576
- Donovan, R.¹ Woodland, P.²

21
- 85068112784
- Rule synthesis of speech from diadic units
- Vol.,) pp
- J.P. Olive: Rule synthesis of speech from diadic units, Proc. ICASSP, Vol. 77 (1977) pp. 568–570
- (1977) Proc. ICASSP , vol.77 , pp. 568-570
- Olive, J.P.¹

22
- 85135109865
- ATR ν-TALK speech synthesis system
- pp
- Y. Sagisaka, N. Kaiki, N. Iwahashi, K. Mimura: ATR ν-TALK speech synthesis system, Proc. ICSLP 92, Vol. 1 (1992) pp. 483–486
- (1992) Proc. ICSLP 92 , vol.1 , pp. 483-486
- Sagisaka, Y.¹ Kaiki, N.² Iwahashi, N.³ Mimura, K.⁴

23
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- Vol.,) pp
- A.J. Hunt, A.W. Black: Unit selection in a concatenative speech synthesis system using a large speech database, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP ’96), Vol. 1 (1996) pp. 373–376
- (1996) Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP ’96) , vol.1 , pp. 373-376
- Hunt, A.J.¹ Black, A.W.²

24
- 0000237685
- Prosody and the selection of source units for concatenative synthesis
- ed. by J. van Santen, R. Sproat, J. Olive, J. Hirshberg (Springer, Berlin, Heidelberg
- N. Campbell, A. Black: Prosody and the selection of source units for concatenative synthesis. In: Progress in Speech Synthesis, ed. by J. van Santen, R. Sproat, J. Olive, J. Hirshberg (Springer, Berlin, Heidelberg 1995)
- (1995) Progress in Speech Synthesis
- Campbell, N.¹ Black, A.²

25
- 0002425861
- The AT&T next-gen TTS system
- M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, A. Syrdal: The AT&T next-gen TTS system, Proc. Joint Meeting of ASA (1999)
- (1999) Proc. Joint Meeting of ASA
- Beutnagel, M.¹ Conkie, A.² Schroeter, J.³ Stylianou, Y.⁴ Syrdal, A.⁵

26
- 85135259940
- Choose the best to modify the least: A new generation concatenative synthesis system
- Vol.,) pp
- M. Balestri, A. Paechiotti, S. Quazza, P.L. Salza, S. Sandri: Choose the best to modify the least: a new generation concatenative synthesis system, Proc. Eurospeech, Vol. 99 (1999) pp. 2291–2294
- (1999) Proc. Eurospeech , vol.99 , pp. 2291-2294
- Balestri, M.¹ Paechiotti, A.² Quazza, S.³ Salza, P.L.⁴ Sandri, S.⁵

27
- 85135272129
- Speech synthesis by phonological structure matching
- Vol.,) pp
- P. Taylor, A.W. Black: Speech synthesis by phonological structure matching, Proc. Eurospeech, Vol. 99 (1999) pp. 623–626
- (1999) Proc. Eurospeech , vol.99 , pp. 623-626
- Taylor, P.¹ Black, A.W.²

28
- 70349848071
- Join cost for unit selection speech synthesis
- ed. by A. Alwan, S. Narayanan (Prentice-Hall, Upper Saddle River
- J. Vepa, S. King: Join cost for unit selection speech synthesis. In: Speech Synthesis, ed. by A. Alwan, S. Narayanan (Prentice-Hall, Upper Saddle River 2004)
- (2004) Speech Synthesis
- Vepa, J.¹ King, S.²

29
- 0037236894
- Rare events and closed domains: Two delicate concepts in speech synthesis
- B. Möbius: Rare events and closed domains: Two delicate concepts in speech synthesis, Int. J. Speech Technol. 6(1), 57–71 (2003)
- (2003) Int. J. Speech Technol. , vol.6 , Issue.1 , pp. 57-71
- Möbius, B.¹

30
- 85135154775
- Combinatorial issues in text-to-speech synthesis
- Vol.,) pp
- J.P.H. van Santen: Combinatorial issues in text-to-speech synthesis, Proc. Euro. Conf. Speech Commun. Technol., Vol. 5 (1997) pp. 2511–2514
- (1997) Proc. Euro. Conf. Speech Commun. Technol , vol.5 , pp. 2511-2514
- van Santen, J.P.H.¹

31
- 84966301419
- Limited domain synthesis
- pp
- A. Black, K. Lenzo: Limited domain synthesis, Proc. ICSLP (2000) pp. 411–414
- (2000) Proc. ICSLP , pp. 411-414
- Black, A.¹ Lenzo, K.²

32
- 68249083782
- The Blizzard Challenge 2006
- C. Bennett, A. Black: The Blizzard Challenge 2006, Proc. Blizzard Challenge 2006 (2006)
- (2006) Proc. Blizzard Challenge , vol.2006
- Bennett, C.¹ Black, A.²

33
- 85075951335
- Whistler: A trainable text-to-speech system
- Vol.,) pp
- X. Huang, A. Acero, J. Adcock, H. Hon, J. Goldsmith, J. Liu, M. Plumpe: Whistler: A trainable text-to-speech system, Proc. ICSLP, Vol. 96 (1996) pp. 659– 662
- (1996) Proc. ICSLP , vol.96 , pp. 659-662
- Huang, X.¹ Acero, A.² Adcock, J.³ Hon, H.⁴ Goldsmith, J.⁵ Liu, J.⁶ Plumpe, M.⁷

34
- 33645758767
- An HMM-based approach to multilingual speech synthesis
- ed. by S. Narayanan, A. AlwanPrentice Hall, Upper Saddle River
- K. Tokuda, H. Zen, A. Black: An HMM-based approach to multilingual speech synthesis. In: Text to Speech Synthesis: New Paradigms and Advances, ed. by S. Narayanan, A. Alwan (Prentice Hall, Upper Saddle River 2004) pp. 135–153
- (2004) Text to Speech Synthesis: New Paradigms and Advances , pp. 135-153
- Tokuda, K.¹ Zen, H.² Black, A.³

35
- 85009231020
- Custom-tailoring TTS voice font – keeping the naturalness when reducing database size
- pp
- Y. Zhao, M. Chu, H. Peng, E. Chang: Custom-tailoring TTS voice font – keeping the naturalness when reducing database size, Proc. Eurospeech, Vol. 2003 (2003) pp. 2957–2960
- (2003) Proc. Eurospeech , vol.2003 , pp. 2957-2960
- Zhao, Y.¹ Chu, M.² Peng, H.³ Chang, E.⁴

36
- 33745216013
- Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling
- Vol.,) pp
- D. Chazan, R. Hoory, Z. Kons, A. Sagi, S. Shechtman, A. Sorin: Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling, Proc. Interspeech, Vol. 2005 (2005) pp. 2569–2572
- (2005) Proc. Interspeech , vol.2005 , pp. 2569-2572
- Chazan, D.¹ Hoory, R.² Kons, Z.³ Sagi, A.⁴ Shechtman, S.⁵ Sorin, A.⁶

37
- 85133526552
- Automatically clustering similar units for unit selection in speech synthesis
- Vol
- A.W. Black, P. Taylor: Automatically clustering similar units for unit selection in speech synthesis, Proc. Eurospeech, Vol. 2 (1997)
- (1997) Proc. Eurospeech , vol.2
- Black, A.W.¹ Taylor, P.²

38
- 84944962517
- The IBM trainable speech synthesis system
- Vol.,) pp
- R.E. Donovan, E.M. Eide: The IBM trainable speech synthesis system, Proc. Int. Conf. Spoken Lang. Process., Vol. 5 (1998) pp. 1703–1706
- (1998) Proc. Int. Conf. Spoken Lang. Process. , vol.5 , pp. 1703-1706
- Donovan, R.E.¹ Eide, E.M.²

39
- 84966366503
- Rapid unit selection from a large speech corpus for concatenative speech synthesis
- Vol.,) pp
- M. Beutnagel, M. Mohri, M. Riley: Rapid unit selection from a large speech corpus for concatenative speech synthesis, Proc. Eurospeech ’99, Vol. 2 (1999) pp. 607–610
- (1999) Proc. Eurospeech ’99 , vol.2 , pp. 607-610
- Beutnagel, M.¹ Mohri, M.² Riley, M.³

40
- 33745205007
- An introduction of trajectory model into hmm-based speech synthesis
- H. Zen, K. Tokuda, T. Kitamura: An introduction of trajectory model into hmm-based speech synthesis, Proc. Speech Synthesis Workshop (2005)
- (2005) Proc. Speech Synthesis Workshop
- Zen, H.¹ Tokuda, K.² Kitamura, T.³

41
- 85134885765
- A hidden Markov model approach to speech synthesis
- Vol.,) pp
- A. Falaschi, M. Giustiniani, M. Verola: A hidden Markov model approach to speech synthesis, Proc. Eurospeech, Vol. 1989 (1989) pp. 2187– 2190
- (1989) Proc. Eurospeech , vol.1989 , pp. 2187-2190
- Falaschi, A.¹ Giustiniani, M.² Verola, M.³

42
- 42649146508
- On the use of phonetic information for mapping from articulatory movements to vocal tract spectrum
- Vol.,) pp
- K. Nakamura, T. Toda, Y. Nankaku, K. Tokuda: On the use of phonetic information for mapping from articulatory movements to vocal tract spectrum, Proc. ICASSP, Vol. 06 (2006) pp. 93–96
- (2006) Proc. ICASSP , vol.6 , pp. 93-96
- Nakamura, K.¹ Toda, T.² Nankaku, Y.³ Tokuda, K.⁴

43
- 85123861026
- XIMERA: A new TTS from ATR based on corpus-based technologies
- pp
- H. Kawai, T. Toda, J. Ni, M. Tsuzaki, K. Tokuda: XIMERA: A new TTS from ATR based on corpus-based technologies, Proc. 5th ISCA Speech Synthesis Workshop (2004) pp. 179–184
- (2004) Proc. 5Th ISCA Speech Synthesis Workshop , pp. 179-184
- Kawai, H.¹ Toda, T.² Ni, J.³ Tsuzaki, M.⁴ Tokuda, K.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.