메뉴 건너뛰기




Volumn 14, Issue 5, 2006, Pages 1763-1771

Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis

Author keywords

Join cost; Linear dynamic models (LDM); Perceptual listening tests; Smoothing; Speech synthesis; Unit selection

Indexed keywords

JOIN COSTS; LINEAR DYNAMIC MODELS (LDM); PERCEPTUAL LISTENING TESTS; SMOOTHING; UNIT SELECTION;

EID: 34047258869     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.858548     Document Type: Article
Times cited : (17)

References (25)
  • 1
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in P roc. ICASSP, 1996, pp. 373-376.
    • (1996) P roc. ICASSP , pp. 373-376
    • Hunt, A.1    Black, A.2
  • 2
    • 84944962517 scopus 로고    scopus 로고
    • The IBM trainable speech synthesis system
    • Sydney, Australia
    • R. E. Donovan and E. M. Eide, "The IBM trainable speech synthesis system," in Proc. ICSLP, Sydney, Australia, 1998.
    • (1998) Proc. ICSLP
    • Donovan, R.E.1    Eide, E.M.2
  • 4
    • 84985926077 scopus 로고    scopus 로고
    • Segment selection in the L & H RealSpeak laboratory TTS system
    • Beijing, China
    • G. Coorman, J. Fackrell, P. Rutten, and B. van Coile, "Segment selection in the L & H RealSpeak laboratory TTS system," in Proc. ICSLP, Beijing, China, 2000.
    • (2000) Proc. ICSLP
    • Coorman, G.1    Fackrell, J.2    Rutten, P.3    van Coile, B.4
  • 5
    • 81155150210 scopus 로고    scopus 로고
    • On the reduction of concatenation artefacts in diphone synthesis
    • Sydney, Australia
    • E. Klabbers and R. Veldhuis, "On the reduction of concatenation artefacts in diphone synthesis," in Proc. ICSLP, vol. 6, Sydney, Australia, 1998, pp. 1983-1986.
    • (1998) Proc. ICSLP , vol.6 , pp. 1983-1986
    • Klabbers, E.1    Veldhuis, R.2
  • 6
    • 81155152572 scopus 로고    scopus 로고
    • Perceptual evaluation of distance measures for concatenative speech synthesis
    • Sydney, Australia
    • J. Wouters and M. Macon, "Perceptual evaluation of distance measures for concatenative speech synthesis," in Proc. ICSLP, vol. 6, Sydney, Australia, 1998, pp. 2747-2750.
    • (1998) Proc. ICSLP , vol.6 , pp. 2747-2750
    • Wouters, J.1    Macon, M.2
  • 7
    • 0034854702 scopus 로고    scopus 로고
    • Perceptual and objective detection of discontinuities in concatenative speech synthesis
    • Salt Lake City, UT
    • Y. Stylianou and A. K. Syrdal, "Perceptual and objective detection of discontinuities in concatenative speech synthesis," in Proc. ICASSP, Salt Lake City, UT, 2001.
    • (2001) Proc. ICASSP
    • Stylianou, Y.1    Syrdal, A.K.2
  • 8
    • 80051612889 scopus 로고    scopus 로고
    • A new distance measure for costing spectral discontinuities in concatenative speech synthesisers
    • Perthshire, U.K
    • R. E. Donovan, "A new distance measure for costing spectral discontinuities in concatenative speech synthesisers," in Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis, Perthshire, U.K., 2001, pp. 59-62.
    • (2001) Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis , pp. 59-62
    • Donovan, R.E.1
  • 9
    • 0023419762 scopus 로고
    • Globally optimising formant tracker using generalized centroids
    • A. Crowe and M. A. Jack, "Globally optimising formant tracker using generalized centroids," Electron. Lett., vol. 23, no. 19, pp. 1019-1020, 1987.
    • (1987) Electron. Lett , vol.23 , Issue.19 , pp. 1019-1020
    • Crowe, A.1    Jack, M.A.2
  • 10
    • 34047270177 scopus 로고
    • Analysis of fricatives using multiple centres of gravity
    • A. A. Wrench, "Analysis of fricatives using multiple centres of gravity," in Proc. Int. Congr. Phonetic Sciences, vol. 4, 1995, pp. 460-463.
    • (1995) Proc. Int. Congr. Phonetic Sciences , vol.4 , pp. 460-463
    • Wrench, A.A.1
  • 11
    • 85009279358 scopus 로고    scopus 로고
    • Objective distance measures for spectral discontinuities in concatenative speech synthesis
    • Denver, CO
    • J. Vepa, S. King, and P. Taylor, "Objective distance measures for spectral discontinuities in concatenative speech synthesis," in Proc. ICSLP, Denver, CO, 2002.
    • (2002) Proc. ICSLP
    • Vepa, J.1    King, S.2    Taylor, P.3
  • 12
    • 84966318856 scopus 로고    scopus 로고
    • New objective distance measures for spectral discontinuities in concatenative speech synthesis
    • Santa Monica, CA, Sep
    • _, "New objective distance measures for spectral discontinuities in concatenative speech synthesis," in Proc. IEEE 2002 Workshop on Speech Synthesis, Santa Monica, CA, Sep. 2002.
    • (2002) Proc. IEEE 2002 Workshop on Speech Synthesis
    • Vepa, J.1    King, S.2    Taylor, P.3
  • 13
    • 85009167944 scopus 로고    scopus 로고
    • Kalman-filter based join cost for unit-selection speech synthesis
    • Geneva, Switzerland, Sep
    • J. Vepa and S. King, "Kalman-filter based join cost for unit-selection speech synthesis," in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003.
    • (2003) Proc. Eurospeech
    • Vepa, J.1    King, S.2
  • 14
    • 34047252102 scopus 로고    scopus 로고
    • _, Join cost for unit selection speech synthesis, in Text to Speech Synthesis: New Paradigms and Advances, A. Alwan and S. Narayanan, Eds. Upper Saddle River, NJ: Prentice-Hall, 2004.
    • _, "Join cost for unit selection speech synthesis," in Text to Speech Synthesis: New Paradigms and Advances, A. Alwan and S. Narayanan, Eds. Upper Saddle River, NJ: Prentice-Hall, 2004.
  • 16
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 17
    • 33751539300 scopus 로고
    • Line spectrum representation of linear predictor coefficients of speech signals
    • A
    • F. Itakura, "Line spectrum representation of linear predictor coefficients of speech signals," J. Acoust. Soc. Amer., vol. 57, p. S35(A), 1975.
    • (1975) J. Acoust. Soc. Amer , vol.57
    • Itakura, F.1
  • 18
    • 34047246080 scopus 로고    scopus 로고
    • F. K. Soong and B. H. Juang, Line spectrum pairs (LSP) and speech data compression, in Proc. ICASSP, 1984, pp. 1.10.1-1.10.4.
    • F. K. Soong and B. H. Juang, "Line spectrum pairs (LSP) and speech data compression," in Proc. ICASSP, 1984, pp. 1.10.1-1.10.4.
  • 19
    • 33846687633 scopus 로고    scopus 로고
    • Linear Dynamic Models for Automatic Speech Recognition,
    • Ph.D. dissertation, Univ. of Edinburgh, Edinburgh, U.K
    • J. Frankel, "Linear Dynamic Models for Automatic Speech Recognition," Ph.D. dissertation, Univ. of Edinburgh, Edinburgh, U.K., 2003.
    • (2003)
    • Frankel, J.1
  • 20
    • 85024429815 scopus 로고
    • A new approach to linear filtering and prediction problems, Trans. Amer. Soc. Mech. Eng., Series D
    • R. E. Kalman, "A new approach to linear filtering and prediction problems," Trans. Amer. Soc. Mech. Eng., Series D, J. Basic Eng., vol. 82, pp. 35-15, 1960.
    • (1960) J. Basic Eng , vol.82 , pp. 35-15
    • Kalman, R.E.1
  • 21
    • 0002077742 scopus 로고
    • Quantization of LPC parameters
    • W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier
    • K. K. Paliwal and W. B. Kleijn, "Quantization of LPC parameters," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier, 1995, pp. 433-466.
    • (1995) Speech Coding and Synthesis , pp. 433-466
    • Paliwal, K.K.1    Kleijn, W.B.2
  • 23
    • 0036497601 scopus 로고    scopus 로고
    • A comparison of spectral smoothing methods for segment concatenation based speech synthesis
    • D. T. Chappell and J. H. Hansen, "A comparison of spectral smoothing methods for segment concatenation based speech synthesis," Speech Commun., vol. 36, pp. 343-374, 2002.
    • (2002) Speech Commun , vol.36 , pp. 343-374
    • Chappell, D.T.1    Hansen, J.H.2
  • 24
    • 34047253695 scopus 로고    scopus 로고
    • A. Black and P. Taylor, The Festival Speech Synthesis System: System Documentation, Human Communication Research Centre, Univ. of Edinburgh, Edinburgh, U.K., Tech. Rep. HCRC/TR-83, 1997.
    • A. Black and P. Taylor, "The Festival Speech Synthesis System: System Documentation," Human Communication Research Centre, Univ. of Edinburgh, Edinburgh, U.K., Tech. Rep. HCRC/TR-83, 1997.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.