



Volume 48, Issue 10, 2006, Pages 1227-1242

New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer

Author keywords

Cross language synthesis; Multilingual; Phone mapping; Polyglot synthesis; Voice adaptation

Indexed keywords

FORMAL LANGUAGES; MAPPING; MARKOV PROCESSES; SPEECH COMMUNICATION; SPEECH PRODUCTION AIDS; TELEPHONE;

EID: 33748468338     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2006.05.003     Document Type: Article
Times cited: 59

References (24)
  • 1
    • 85009085813
    • Badino, L., Barolo, C., Quazza, S., 2004. A general approach to TTS reading of mixed-language texts. In: Proc. ICSLP, Jeju Island, Korea, pp. 849-852.
  • 2
    • 4544375503
    • Black, A., Lenzo, K., 2004. Multilingual text-to-speech synthesis. In: Proc. ICASSP, Montreal, Canada, pp. 761-764.
  • 3
    • 33748449843
    • Bonaventura, P., Gallochio, F., Micca, G., 1997. Multilingual speech recognition for flexible vocabularies. In: Proc. Eurospeech, Rhodes, Greece, pp. 355-358.
  • 4
    • 33748459240
    • Campbell, N., 1998. Foreign-language speech synthesis. In: Proc. ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Australia.
  • 5
    • 85009072569
    • Campbell, N., 2001. Talking foreign. Concatenative speech synthesis and the language barrier. In: Proc. Eurospeech, Aalborg, Denmark, pp. 337-340.
  • 6
    • 33748475496
    • Dijkstra, J., Pols, L.C.W., van Son, R.J.J.H., 2004. Frisian TTS, an example of bootstrapping TTS for minority languages. In: Proc. 5th ISCA Speech Synthesis Workshop, Pittsburgh, USA, pp. 97-102.
  • 7
    • 1442305910
    • Graddol, D., 2004. The future of language. Science 303, 1329-1331.
  • 8
    • 0020596154
    • Imai, S., 1983. Cepstral analysis synthesis on the Mel frequency scale. In: Proc. ICASSP, Boston, USA, pp. 93-96.
  • 9
    • 0029288633
    • Leggetter, C.J., Woodland, P.C., 1995. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9 (2), 171-185.
  • 10
    • 33745184223
    • Liu, C., Melnar, L., 2005. An automated linguistic knowledge-based cross-language transfer method for building acoustic models for a language without native training data. In: Proc. Eurospeech, Lisbon, Portugal, pp. 1365-1368.
  • 11
    • 0030363025
    • Mak, B., Barnard, E., 1996. Phone clustering using the Bhattacharyya distance. In: Proc. ICSLP, Philadelphia, USA, pp. 2005-2008.
  • 12
    • 85009129569
    • Mashimo, M., Toda, T., Kawanami, H., Kashioka, H., Shikano, K., Campbell, N., 2001. Evaluation of cross-language voice conversion based on GMM and STRAIGHT. In: Proc. Eurospeech, Aalborg, Denmark, pp. 361-364.
  • 13
    • 0029725605
    • Masuko, T., Tokuda, K., Kobayashi, T., Imai, S., 1996. Speech synthesis using HMMs with dynamic features. In: Proc. ICASSP, Atlanta, USA, pp. 389-392.
  • 14
    • 85009115843
    • Moberg, M., Pärssinen, K., Iso-Sipilä, J., 2004. Cross-lingual phoneme mapping for multilingual synthesis systems. In: Proc. ICSLP, Jeju Island, Korea, pp. 1029-1032.
  • 15
    • 85009274666
    • Schultz, T., 2002. Globalphone: a multilingual speech and text database developed at Karlsruhe University. In: Proc. ICSLP, Denver, USA, pp. 345-348.
  • 16
    • 85009101138
    • Schultz, T., Waibel, A., 2001. Experiments on cross-language acoustic modeling. In: Proc. Eurospeech, Aalborg, Denmark, pp. 2721-2724.
  • 17
    • 33748472121
    • Shin, H., Bruno, R., 2003. Language use and English speaking ability. In USA Census 2000, US Census Bureau census.gov/prod/2003pubs/c2kbr-29.pdf.
  • 18
    • 0033906251
    • Shinoda, K., Watanabe, T., 2000. MDL-based context-dependent subword modeling for speech recognition. J. Acoust. Soc. Jpn. (English) 21, 79-86.
  • 19
    • 33748457707
    • Tamura, M., Masuko, T., Tokuda, K., Kobayashi, T., 1998. Speaker adaptation for HMM-based speech synthesis system using MLLR. In: Proc. Third ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Australia, pp. 273-276.
  • 20
    • 85009100521
    • Tamura, M., Masuko, T., Tokuda, K., Kobayashi, T., 2001. Text-to-speech synthesis with arbitrary speaker's voice from average voice. In: Proc. Eurospeech 2001, Aalborg, Denmark, pp. 345-348.
  • 21
    • 33748474989
    • Tokuda, K., Masuko, T., Yamada, T., Kobayashi, T., Imai, S., 1995a. An algorithm for speech parameter generation from continuous mixture HMM with dynamic features. In: Proc. Eurospeech 1995, Madrid, Spain, pp. 757-760.
  • 22
    • 33748448870
    • Tokuda, K., Masuko, T., Imai, S., 1995b. Speech parameter generation from continuous mixture HMMs with dynamic features. In: Proc. ICASSP, Detroit, USA, pp. 660-663.
  • 23
    • 33748461042
    • Traber, C., Huber, K., Nedir, K., Pfister, B., Keller, E., Zellner, B., 1999. From multilingual to polyglot speech synthesis. In: Proc. Eurospeech, Budapest, Hungary, pp. 835-838.
  • 24
    • 85009168839
    • Yu, H., Schultz, T., 2003. Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition. In: Proc. Eurospeech, Geneva, Switzerland, pp. 1869-1872.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.