메뉴 건너뛰기




Volumn 4, Issue , 2007, Pages

Statistical parametric speech synthesis

Author keywords

Hidden Markov models; Speech synthesis

Indexed keywords

GENERATION SYNTHESIS; HMM-BASED SYNTHESIS; STATISTICAL PARAMETRIC SPEECH SYNTHESIS; UNIT SELECTION TECHNOLOGY;

EID: 34547526960     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2007.367298     Document Type: Conference Paper
Times cited : (182)

References (64)
  • 2
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in ICASSP, 1996, pp. 373-376.
    • (1996) ICASSP , pp. 373-376
    • Hunt, A.1    Black, A.2
  • 4
    • 84966301419 scopus 로고    scopus 로고
    • Limited domain synthesis
    • A. Black and K. Lenzo, "Limited domain synthesis," in ICSLP, 2000, pp. 411-414.
    • (2000) ICSLP , pp. 411-414
    • Black, A.1    Lenzo, K.2
  • 5
    • 34547535865 scopus 로고    scopus 로고
    • A corpus-based approach to 〈AHEM/〉 expressive speech syndiesis authors
    • E. Eide, A. Aaron, R. Bakis, W. Hamza, M. Picheny, and J. Pitrelli, "A corpus-based approach to 〈AHEM/〉 expressive speech syndiesis authors," in ISCA SSW5, 2004.
    • (2004) ISCA , vol.SSW5
    • Eide, E.1    Aaron, A.2    Bakis, R.3    Hamza, W.4    Picheny, M.5    Pitrelli, J.6
  • 6
    • 33745206749 scopus 로고    scopus 로고
    • Large scale evaluation of corpus-based synthesizers: Results and lessons from the Blizzard Challenge 2005
    • C.Bennett, "Large scale evaluation of corpus-based synthesizers: Results and lessons from the Blizzard Challenge 2005," in Interspeech, 2005, pp. 105-108.
    • (2005) Interspeech , pp. 105-108
    • Bennett, C.1
  • 8
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 9
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Eurospeech, 1999, pp. 2347-2350.
    • (1999) Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 11
    • 85135181226 scopus 로고
    • Improvements in an HMM-based speech synthesiser
    • R. Donovan and P. Woodland, "Improvements in an HMM-based speech synthesiser" in Eurospeech, 1995, pp. 573-576.
    • (1995) Eurospeech , pp. 573-576
    • Donovan, R.1    Woodland, P.2
  • 12
    • 85133526552 scopus 로고    scopus 로고
    • Automatically clustering similar units for unit selection in speech synthesis
    • A. Black and P. Taylor, "Automatically clustering similar units for unit selection in speech synthesis," in Eurospeech, 1997, pp. 601-604.
    • (1997) Eurospeech , pp. 601-604
    • Black, A.1    Taylor, P.2
  • 13
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, Kobayashi T., and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," in ICASSP, 1992, pp. 137-140.
    • (1992) ICASSP , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 16
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP, 2000, pp. 1315-1318.
    • (2000) ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 17
    • 0020596154 scopus 로고
    • Cepstral analysis synthesis on the mel frequency scale
    • S. Imai, "Cepstral analysis synthesis on the mel frequency scale," in ICASSP 83, 1983, pp. 93-96.
    • (1983) ICASSP 83 , pp. 93-96
    • Imai, S.1
  • 19
    • 34547542349 scopus 로고    scopus 로고
    • Improving Arabic HMM based speech synthesis quality
    • O. Abdel-Hamid, S. Abdou, and M. Rashwan, "Improving Arabic HMM based speech synthesis quality," in Interspeech, 2006, pp. 1332-1335.
    • (2006) Interspeech , pp. 1332-1335
    • Abdel-Hamid, O.1    Abdou, S.2    Rashwan, M.3
  • 21
    • 33846406459 scopus 로고    scopus 로고
    • Two-band excitation for HMM-based speech synthesis
    • S.-J. Kim and M.-S. Hahn, "Two-band excitation for HMM-based speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 378-381, 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.1 , pp. 378-381
    • Kim, S.-J.1    Hahn, M.-S.2
  • 23
    • 34547552746 scopus 로고    scopus 로고
    • The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006
    • H. Zen, T. Toda, and K. Tokuda, "The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006," in Blizzard Challenge Workshop, 2006.
    • (2006) Blizzard Challenge Workshop
    • Zen, H.1    Toda, T.2    Tokuda, K.3
  • 25
    • 85133439657 scopus 로고    scopus 로고
    • An introduction of trajectory model into HMM-based speech synthesis
    • H. Zen, K. Tokuda, and T. Kitamura, "An introduction of trajectory model into HMM-based speech synthesis," in ISCA SSW5, 2004.
    • (2004) ISCA , vol.SSW5
    • Zen, H.1    Tokuda, K.2    Kitamura, T.3
  • 26
    • 0036297838 scopus 로고    scopus 로고
    • Robust splicing costs and efficient search with BMM models for concatenative speech synthesis
    • I. Bulyko, M. Ostendorf, and J. Bilmes, "Robust splicing costs and efficient search with BMM models for concatenative speech synthesis," in ICASSP, 2002, pp. 461-464.
    • (2002) ICASSP , pp. 461-464
    • Bulyko, I.1    Ostendorf, M.2    Bilmes, J.3
  • 27
    • 0034854701 scopus 로고    scopus 로고
    • Trainable speech synthesis with trended hidden Markov models
    • J. Dines and S. Sridharan, "Trainable speech synthesis with trended hidden Markov models," in ICASSP, 2001, pp. 833-837.
    • (2001) ICASSP , pp. 833-837
    • Dines, J.1    Sridharan, S.2
  • 28
    • 0034842551 scopus 로고    scopus 로고
    • Speech synthesis using stochastic Markov graphs
    • M. Eichner, M. Wolff, S. Ohnewald, and R. Hoffman, "Speech synthesis using stochastic Markov graphs," in ICASSP, 2001, pp. 829-832.
    • (2001) ICASSP , pp. 829-832
    • Eichner, M.1    Wolff, M.2    Ohnewald, S.3    Hoffman, R.4
  • 29
    • 33846429403 scopus 로고    scopus 로고
    • Minimum generation error training for HMM-based speech synthesis
    • Y.-J. Wu and R.-H. Wang, "Minimum generation error training for HMM-based speech synthesis," in ICASSP, 2006, pp. 89-92.
    • (2006) ICASSP , pp. 89-92
    • Wu, Y.-J.1    Wang, R.-H.2
  • 31
    • 34547553049 scopus 로고    scopus 로고
    • A study on conditional parameter generation from HMM based on maximum likelihood criterion
    • T. Masuko, K. Tokuda, and T. Kobayashi, "A study on conditional parameter generation from HMM based on maximum likelihood criterion," in Autumn Meeting of ASJ, 2003, pp. 209-210.
    • (2003) Autumn Meeting of ASJ , pp. 209-210
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3
  • 32
    • 33745200051 scopus 로고    scopus 로고
    • Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Eurospeech, 2001, pp. 2801-2804.
    • (2001) Eurospeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2
  • 33
    • 0030696416 scopus 로고    scopus 로고
    • Voice characteristics conversion for HMM-based speech synthesis system
    • T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Voice characteristics conversion for HMM-based speech synthesis system," in ICASSP, 1997, pp. 1611-1614.
    • (1997) ICASSP , pp. 1611-1614
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 34
    • 0034842740 scopus 로고    scopus 로고
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
    • M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," in ICASSP, 2001, pp. 805-808.
    • (2001) ICASSP , pp. 805-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 37
    • 34547520859 scopus 로고    scopus 로고
    • HMM-based prosody modeling and synthesis for Japanese and Chinese speech synthesis
    • Tech. Rep. TR-SLT-0032, ATR-SLT
    • H. Zen, J. Lu, J. Ni, K. Tokuda, and H. Kawai, "HMM-based prosody modeling and synthesis for Japanese and Chinese speech synthesis," Tech. Rep. TR-SLT-0032, ATR-SLT, 2003.
    • (2003)
    • Zen, H.1    Lu, J.2    Ni, J.3    Tokuda, K.4    Kawai, H.5
  • 38
    • 38149113842 scopus 로고    scopus 로고
    • An HMM-based Mandarin Chinese text-to-speech system
    • Y. Qian, F. Soong, Y. Chen, and M. Chu, "An HMM-based Mandarin Chinese text-to-speech system," in ISCSLP, 2006.
    • (2006) ISCSLP
    • Qian, Y.1    Soong, F.2    Chen, Y.3    Chu, M.4
  • 39
    • 33645755910 scopus 로고    scopus 로고
    • Implementation and evaluation of an HMM-based Korean speech synthesis system
    • S.-J. Kim, J.-J. Kim, and M.-S. Hahn, "Implementation and evaluation of an HMM-based Korean speech synthesis system," IEICE Trans. Inf. & Syst., vol. E89-D, pp. 1116-1119, 2006.
    • (2006) IEICE Trans. Inf. & Syst , vol.E89-D , pp. 1116-1119
    • Kim, S.-J.1    Kim, J.-J.2    Hahn, M.-S.3
  • 41
    • 56149086860 scopus 로고    scopus 로고
    • Low resource HMM-based speech synthesis applied to German
    • C. Weiss, R. Maia, K. Tokuda, and W. Hess, "Low resource HMM-based speech synthesis applied to German," in ESSP, 2005.
    • (2005) ESSP
    • Weiss, C.1    Maia, R.2    Tokuda, K.3    Hess, W.4
  • 42
    • 85009182084 scopus 로고    scopus 로고
    • Towards the development of a Brazilian Portuguese text-to-speech system based on HMM
    • R. Maia, H. Zen, K. Tokuda, T. Kitamura, and F. Resende Jr., "Towards the development of a Brazilian Portuguese text-to-speech system based on HMM," in Eurospeech, 2003, pp. 2465-2468.
    • (2003) Eurospeech , pp. 2465-2468
    • Maia, R.1    Zen, H.2    Tokuda, K.3    Kitamura, T.4    Resende Jr., F.5
  • 47
    • 22944466413 scopus 로고    scopus 로고
    • Evaluation of the Slovenian HMM-based speech synthesis system
    • B. Vesnicer and F. Mihelic, "Evaluation of the Slovenian HMM-based speech synthesis system," in TSD, 2004, pp. 513-520.
    • (2004) TSD , pp. 513-520
    • Vesnicer, B.1    Mihelic, F.2
  • 49
    • 34547507468 scopus 로고    scopus 로고
    • M. Homayounpour and S. Mehdi, Farsi speech synthesis using hidden Markov model and decision trees, The CSI Journal on Computer Science and Engineering, 2, no. 1&3 (a), 2004.
    • M. Homayounpour and S. Mehdi, "Farsi speech synthesis using hidden Markov model and decision trees," The CSI Journal on Computer Science and Engineering, vol. 2, no. 1&3 (a), 2004.
  • 50
    • 33646769932 scopus 로고    scopus 로고
    • Polyglot synthesis using a mixture of monolingual corpora
    • J. Latorre, K. Iwano, and S. Furui, "Polyglot synthesis using a mixture of monolingual corpora," in ICASSP, 2005, vol. 1, pp. 1-4.
    • (2005) ICASSP , vol.1 , pp. 1-4
    • Latorre, J.1    Iwano, K.2    Furui, S.3
  • 51
    • 85009177437 scopus 로고    scopus 로고
    • Modeling of various speaking styles and emotions for HMM-based speech synthesis
    • J. Yamagishi, K. Onishi, T. Masuko, and T. Kobayashi, "Modeling of various speaking styles and emotions for HMM-based speech synthesis," in Interspeech, 2003, pp. 2461-2464.
    • (2003) Interspeech , pp. 2461-2464
    • Yamagishi, J.1    Onishi, K.2    Masuko, T.3    Kobayashi, T.4
  • 53
    • 33645768204 scopus 로고    scopus 로고
    • A style adaptation technique for speech synthesis using HSMM and suprasegmental features
    • M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi, "A style adaptation technique for speech synthesis using HSMM and suprasegmental features," IEICE Trans. Inf. & Syst., vol. E89-D, no. 3, pp. 1092-1099, 2006.
    • (2006) IEICE Trans. Inf. & Syst , vol.E89-D , Issue.3 , pp. 1092-1099
    • Tachibana, M.1    Yamagishi, J.2    Masuko, T.3    Kobayashi, T.4
  • 54
    • 33846405723 scopus 로고    scopus 로고
    • Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
    • H. Zen, T. Toda, M. Nakamura, and T. Tokuda, "Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, T.4
  • 55
    • 85123861026 scopus 로고    scopus 로고
    • XIMERA: A new TTS from ATR based on corpus-based technologies
    • H. Kawai, T. Toda, J. Ni, M. Tsuzaki, and K. Tokuda, "XIMERA: A new TTS from ATR based on corpus-based technologies," in ISCA SSW5, 2004.
    • (2004) ISCA , vol.SSW5
    • Kawai, H.1    Toda, T.2    Ni, J.3    Tsuzaki, M.4    Tokuda, K.5
  • 56
    • 33745202041 scopus 로고    scopus 로고
    • Unit selection for speech synthesis based on a new acoustic target cost
    • S. Rouibia and Rosec, "Unit selection for speech synthesis based on a new acoustic target cost," in Interspeech, 2005, pp. 2565-2568.
    • (2005) Interspeech , pp. 2565-2568
    • Rouibia, S.1    Rosec2
  • 57
    • 85063141494 scopus 로고    scopus 로고
    • Using 5 ms segments in concatenative speech synthesis
    • T. Hirai and S. Tenpaku, "Using 5 ms segments in concatenative speech synthesis," in ISCA SSW5, 2004.
    • (2004) ISCA , vol.SSW5
    • Hirai, T.1    Tenpaku, S.2
  • 60
    • 34547503417 scopus 로고    scopus 로고
    • HMM-based unit selection using frame sized speech segments
    • Z. Ling and R. Wang, "HMM-based unit selection using frame sized speech segments," in Interspeech, 2006, pp. 2034-2037.
    • (2006) Interspeech , pp. 2034-2037
    • Ling, Z.1    Wang, R.2
  • 61
    • 67650851756 scopus 로고    scopus 로고
    • The Blizzard Challenge 2006 CMU entry introducing hybrid trajectory-selection synthesis
    • J. Kominek and A. Black, "The Blizzard Challenge 2006 CMU entry introducing hybrid trajectory-selection synthesis," in Blizzard Challenge Workshop, 2006.
    • (2006) Blizzard Challenge Workshop
    • Kominek, J.1    Black, A.2
  • 62
    • 0003162919 scopus 로고    scopus 로고
    • HMM-based smoothing for concatenative speech synthesis
    • M. Plumpe, A. Acero, H. Hon, and X. Huang, "HMM-based smoothing for concatenative speech synthesis," in ICSLP, 1998, pp. 2751-2754.
    • (1998) ICSLP , pp. 2751-2754
    • Plumpe, M.1    Acero, A.2    Hon, H.3    Huang, X.4
  • 63
    • 85009080327 scopus 로고    scopus 로고
    • Unit fusion for concatenative speech synthesis
    • J. Wouters and M. Macon, "Unit fusion for concatenative speech synthesis," in ICSLP, 2000, pp. 302-305.
    • (2000) ICSLP , pp. 302-305
    • Wouters, J.1    Macon, M.2
  • 64
    • 34547539669 scopus 로고    scopus 로고
    • Unifying unit selection and hidden Markov model speech synthesis
    • P. Taylor, "Unifying unit selection and hidden Markov model speech synthesis," in Interspeech, 2006, pp. 1758-1761.
    • (2006) Interspeech , pp. 1758-1761
    • Taylor, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.