메뉴 건너뛰기




Volumn , Issue , 2008, Pages 437-456

Corpus-Based Speech Synthesis

Author keywords

Automatic Speech Recognition; Automatic Speech Recognition System; Spectral Envelope; Speech Corpus; Speech Synthesis

Indexed keywords


EID: 68949165450     PISSN: 25228692     EISSN: 25228706     Source Type: Book Series    
DOI: 10.1007/978-3-540-49127-9_21     Document Type: Chapter
Times cited : (21)

References (43)
  • 1
    • 0013634269 scopus 로고
    • A study of the building blocks in speech
    • C.M. Harris: A study of the building blocks in speech, J. Acoust. Soc. Am. 25, 962–969 (1953)
    • (1953) J. Acoust. Soc. Am. , vol.25 , pp. 962-969
    • Harris, C.M.1
  • 3
    • 0345093720 scopus 로고
    • Terminal analog synthesis of continuous speech using the diphone method of segment assembly
    • N.R. Dixon, H.D. Maxey: Terminal analog synthesis of continuous speech using the diphone method of segment assembly, IEEE Trans. ASSP AU-16(1), 40–50 (1968)
    • (1968) IEEE Trans. ASSP AU- , vol.16 , Issue.1 , pp. 40-50
    • Dixon, N.R.1    Maxey, H.D.2
  • 6
    • 0025543906 scopus 로고
    • Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines, F. Charpentier: Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun. 9, 5–6 (1990)
    • (1990) Speech Commun , vol.9 , pp. 5-6
    • Moulines, E.1    Charpentier, F.2
  • 7
    • 0035127703 scopus 로고    scopus 로고
    • Applying the harmonic plus noise model in concatenative synthesis
    • Y. Stylianou: Applying the harmonic plus noise model in concatenative synthesis, IEEE Trans. Speech Audio Process. 9(1), 21–29 (2001)
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.1 , pp. 21-29
    • Stylianou, Y.1
  • 8
    • 0027252181 scopus 로고
    • An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech
    • Vol., pp
    • W. Verhelst, M. Roelands: An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech, Proc. ICASSP 93, Vol. 2 (1993) pp. 554–557
    • (1993) Proc. ICASSP 93 , vol.2 , pp. 554-557
    • Verhelst, W.1    Roelands, M.2
  • 9
    • 0008006444 scopus 로고
    • Time-scale modification algorithm for speech by use of pointer interval control overlap and add (PICOLA) and its evaluation
    • Vol.,) pp
    • N. Morita, F. Itakura: Time-scale modification algorithm for speech by use of pointer interval control overlap and add (PICOLA) and its evaluation, Proc. Annu. Meeting of Acoust. Soc. Jpn., Vol. 86 (1986) pp. 9–16
    • (1986) Proc. Annu. Meeting of Acoust. Soc. Jpn , vol.86 , pp. 9-16
    • Morita, N.1    Itakura, F.2
  • 13
    • 0027839344 scopus 로고
    • MBR-PSOLA: Text-to-speech synthesis based on an MBE resynthesis of the segments database
    • T. Dutoit, H. Leich: MBR-PSOLA: Text-to-speech synthesis based on an MBE resynthesis of the segments database, Speech Commun. 13, 435–440 (1993)
    • (1993) Speech Commun , vol.13 , pp. 435-440
    • Dutoit, T.1    Leich, H.2
  • 15
    • 0035124445 scopus 로고    scopus 로고
    • Control of spectral dynamics in concatenative speech synthesis
    • J. Wouters, M.W. Macon: Control of spectral dynamics in concatenative speech synthesis, IEEE Trans. Speech Audio Process. 9(1), 30–38 (2001)
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.1 , pp. 30-38
    • Wouters, J.1    Macon, M.W.2
  • 18
    • 0012356658 scopus 로고
    • HADIFIX, a system for German speech synthesis based on demisyllables, diphones, and suffixes
    • T. Portele, W. Sendlemeier, W. Hess: HADIFIX, a system for German speech synthesis based on demisyllables, diphones, and suffixes, Proc. First ESCA Workshop on Speech Synthesis (1990) pp. 161– 164
    • (1990) Proc. First ESCA Workshop on Speech Synthesis , pp. 161-164
    • Portele, T.1    Sendlemeier, W.2    Hess, W.3
  • 19
    • 0028499480 scopus 로고
    • Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering
    • S. Nakajima: Automatic synthesis unit generation for English speech synthesis based on multi-layered context oriented clustering, Speech Commun. 14, 313–324 (1994)
    • (1994) Speech Commun , vol.14 , pp. 313-324
    • Nakajima, S.1
  • 20
    • 85135181226 scopus 로고
    • Improvements in an HMM-based speech synthesizer
    • Vol.,) pp
    • R. Donovan, P. Woodland: Improvements in an HMM-based speech synthesizer, Proc. Eurospeech 95, Vol. 1 (1995) pp. 573–576
    • (1995) Proc. Eurospeech 95 , vol.1 , pp. 573-576
    • Donovan, R.1    Woodland, P.2
  • 21
    • 85068112784 scopus 로고
    • Rule synthesis of speech from diadic units
    • Vol.,) pp
    • J.P. Olive: Rule synthesis of speech from diadic units, Proc. ICASSP, Vol. 77 (1977) pp. 568–570
    • (1977) Proc. ICASSP , vol.77 , pp. 568-570
    • Olive, J.P.1
  • 23
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • Vol.,) pp
    • A.J. Hunt, A.W. Black: Unit selection in a concatenative speech synthesis system using a large speech database, Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP ’96), Vol. 1 (1996) pp. 373–376
    • (1996) Proc. Int. Conf. Acoust. Speech Signal Process. (ICASSP ’96) , vol.1 , pp. 373-376
    • Hunt, A.J.1    Black, A.W.2
  • 24
    • 0000237685 scopus 로고
    • Prosody and the selection of source units for concatenative synthesis
    • ed. by J. van Santen, R. Sproat, J. Olive, J. Hirshberg (Springer, Berlin, Heidelberg
    • N. Campbell, A. Black: Prosody and the selection of source units for concatenative synthesis. In: Progress in Speech Synthesis, ed. by J. van Santen, R. Sproat, J. Olive, J. Hirshberg (Springer, Berlin, Heidelberg 1995)
    • (1995) Progress in Speech Synthesis
    • Campbell, N.1    Black, A.2
  • 26
    • 85135259940 scopus 로고    scopus 로고
    • Choose the best to modify the least: A new generation concatenative synthesis system
    • Vol.,) pp
    • M. Balestri, A. Paechiotti, S. Quazza, P.L. Salza, S. Sandri: Choose the best to modify the least: a new generation concatenative synthesis system, Proc. Eurospeech, Vol. 99 (1999) pp. 2291–2294
    • (1999) Proc. Eurospeech , vol.99 , pp. 2291-2294
    • Balestri, M.1    Paechiotti, A.2    Quazza, S.3    Salza, P.L.4    Sandri, S.5
  • 27
    • 85135272129 scopus 로고    scopus 로고
    • Speech synthesis by phonological structure matching
    • Vol.,) pp
    • P. Taylor, A.W. Black: Speech synthesis by phonological structure matching, Proc. Eurospeech, Vol. 99 (1999) pp. 623–626
    • (1999) Proc. Eurospeech , vol.99 , pp. 623-626
    • Taylor, P.1    Black, A.W.2
  • 28
    • 70349848071 scopus 로고    scopus 로고
    • Join cost for unit selection speech synthesis
    • ed. by A. Alwan, S. Narayanan (Prentice-Hall, Upper Saddle River
    • J. Vepa, S. King: Join cost for unit selection speech synthesis. In: Speech Synthesis, ed. by A. Alwan, S. Narayanan (Prentice-Hall, Upper Saddle River 2004)
    • (2004) Speech Synthesis
    • Vepa, J.1    King, S.2
  • 29
    • 0037236894 scopus 로고    scopus 로고
    • Rare events and closed domains: Two delicate concepts in speech synthesis
    • B. Möbius: Rare events and closed domains: Two delicate concepts in speech synthesis, Int. J. Speech Technol. 6(1), 57–71 (2003)
    • (2003) Int. J. Speech Technol. , vol.6 , Issue.1 , pp. 57-71
    • Möbius, B.1
  • 30
    • 85135154775 scopus 로고    scopus 로고
    • Combinatorial issues in text-to-speech synthesis
    • Vol.,) pp
    • J.P.H. van Santen: Combinatorial issues in text-to-speech synthesis, Proc. Euro. Conf. Speech Commun. Technol., Vol. 5 (1997) pp. 2511–2514
    • (1997) Proc. Euro. Conf. Speech Commun. Technol , vol.5 , pp. 2511-2514
    • van Santen, J.P.H.1
  • 31
    • 84966301419 scopus 로고    scopus 로고
    • Limited domain synthesis
    • pp
    • A. Black, K. Lenzo: Limited domain synthesis, Proc. ICSLP (2000) pp. 411–414
    • (2000) Proc. ICSLP , pp. 411-414
    • Black, A.1    Lenzo, K.2
  • 34
    • 33645758767 scopus 로고    scopus 로고
    • An HMM-based approach to multilingual speech synthesis
    • ed. by S. Narayanan, A. AlwanPrentice Hall, Upper Saddle River
    • K. Tokuda, H. Zen, A. Black: An HMM-based approach to multilingual speech synthesis. In: Text to Speech Synthesis: New Paradigms and Advances, ed. by S. Narayanan, A. Alwan (Prentice Hall, Upper Saddle River 2004) pp. 135–153
    • (2004) Text to Speech Synthesis: New Paradigms and Advances , pp. 135-153
    • Tokuda, K.1    Zen, H.2    Black, A.3
  • 35
    • 85009231020 scopus 로고    scopus 로고
    • Custom-tailoring TTS voice font – keeping the naturalness when reducing database size
    • pp
    • Y. Zhao, M. Chu, H. Peng, E. Chang: Custom-tailoring TTS voice font – keeping the naturalness when reducing database size, Proc. Eurospeech, Vol. 2003 (2003) pp. 2957–2960
    • (2003) Proc. Eurospeech , vol.2003 , pp. 2957-2960
    • Zhao, Y.1    Chu, M.2    Peng, H.3    Chang, E.4
  • 36
    • 33745216013 scopus 로고    scopus 로고
    • Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling
    • Vol.,) pp
    • D. Chazan, R. Hoory, Z. Kons, A. Sagi, S. Shechtman, A. Sorin: Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling, Proc. Interspeech, Vol. 2005 (2005) pp. 2569–2572
    • (2005) Proc. Interspeech , vol.2005 , pp. 2569-2572
    • Chazan, D.1    Hoory, R.2    Kons, Z.3    Sagi, A.4    Shechtman, S.5    Sorin, A.6
  • 37
    • 85133526552 scopus 로고    scopus 로고
    • Automatically clustering similar units for unit selection in speech synthesis
    • Vol
    • A.W. Black, P. Taylor: Automatically clustering similar units for unit selection in speech synthesis, Proc. Eurospeech, Vol. 2 (1997)
    • (1997) Proc. Eurospeech , vol.2
    • Black, A.W.1    Taylor, P.2
  • 39
    • 84966366503 scopus 로고    scopus 로고
    • Rapid unit selection from a large speech corpus for concatenative speech synthesis
    • Vol.,) pp
    • M. Beutnagel, M. Mohri, M. Riley: Rapid unit selection from a large speech corpus for concatenative speech synthesis, Proc. Eurospeech ’99, Vol. 2 (1999) pp. 607–610
    • (1999) Proc. Eurospeech ’99 , vol.2 , pp. 607-610
    • Beutnagel, M.1    Mohri, M.2    Riley, M.3
  • 41
    • 85134885765 scopus 로고
    • A hidden Markov model approach to speech synthesis
    • Vol.,) pp
    • A. Falaschi, M. Giustiniani, M. Verola: A hidden Markov model approach to speech synthesis, Proc. Eurospeech, Vol. 1989 (1989) pp. 2187– 2190
    • (1989) Proc. Eurospeech , vol.1989 , pp. 2187-2190
    • Falaschi, A.1    Giustiniani, M.2    Verola, M.3
  • 42
    • 42649146508 scopus 로고    scopus 로고
    • On the use of phonetic information for mapping from articulatory movements to vocal tract spectrum
    • Vol.,) pp
    • K. Nakamura, T. Toda, Y. Nankaku, K. Tokuda: On the use of phonetic information for mapping from articulatory movements to vocal tract spectrum, Proc. ICASSP, Vol. 06 (2006) pp. 93–96
    • (2006) Proc. ICASSP , vol.6 , pp. 93-96
    • Nakamura, K.1    Toda, T.2    Nankaku, Y.3    Tokuda, K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.