SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 14, Issue 5, 2006, Pages 1763-1771

Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis

Authors (2): Vepa, Jithendra (b); King, Simon (a, c)

a IEEE (United Kingdom)

b IDIAP RESEARCH INSTITUTE (Switzerland)

c UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Join cost; Linear dynamic models (LDM); Perceptual listening tests; Smoothing; Speech synthesis; Unit selection

Indexed keywords

JOIN COSTS; LINEAR DYNAMIC MODELS (LDM); PERCEPTUAL LISTENING TESTS; SMOOTHING; UNIT SELECTION;

COST FUNCTIONS; PARAMETER ESTIMATION; SPEECH PROCESSING; SPEECH RECOGNITION;

SPEECH SYNTHESIS;

EID: 34047258869 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.858548 Document Type: Article

Times cited: 17

References (25)

1
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, 1996, pp. 373-376.
- (1996) Proc. ICASSP , pp. 373-376
- Hunt, A.¹ Black, A.²

2
- 84944962517
- The IBM trainable speech synthesis system
- Sydney, Australia
- R. E. Donovan and E. M. Eide, "The IBM trainable speech synthesis system," in Proc. ICSLP, Sydney, Australia, 1998.
- (1998) Proc. ICSLP
- Donovan, R.E.¹ Eide, E.M.²

3
- 0002425861
- The AT&T next-gen TTS system
- Berlin, Germany
- M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, and A. Syrdal, "The AT&T next-gen TTS system," in Proc. Joint Meeting of ASA, EAA, and DEGA. Berlin, Germany, 1999.
- (1999) Proc. Joint Meeting of ASA, EAA, and DEGA
- Beutnagel, M.¹ Conkie, A.² Schroeter, J.³ Stylianou, Y.⁴ Syrdal, A.⁵

4
- 84985926077
- Segment selection in the L & H RealSpeak laboratory TTS system
- Beijing, China
- G. Coorman, J. Fackrell, P. Rutten, and B. van Coile, "Segment selection in the L & H RealSpeak laboratory TTS system," in Proc. ICSLP, Beijing, China, 2000.
- (2000) Proc. ICSLP
- Coorman, G.¹ Fackrell, J.² Rutten, P.³ van Coile, B.⁴

5
- 81155150210
- On the reduction of concatenation artefacts in diphone synthesis
- Sydney, Australia
- E. Klabbers and R. Veldhuis, "On the reduction of concatenation artefacts in diphone synthesis," in Proc. ICSLP, vol. 6, Sydney, Australia, 1998, pp. 1983-1986.
- (1998) Proc. ICSLP , vol.6 , pp. 1983-1986
- Klabbers, E.¹ Veldhuis, R.²

6
- 81155152572
- Perceptual evaluation of distance measures for concatenative speech synthesis
- Sydney, Australia
- J. Wouters and M. Macon, "Perceptual evaluation of distance measures for concatenative speech synthesis," in Proc. ICSLP, vol. 6, Sydney, Australia, 1998, pp. 2747-2750.
- (1998) Proc. ICSLP , vol.6 , pp. 2747-2750
- Wouters, J.¹ Macon, M.²

7
- 0034854702
- Perceptual and objective detection of discontinuities in concatenative speech synthesis
- Salt Lake City, UT
- Y. Stylianou and A. K. Syrdal, "Perceptual and objective detection of discontinuities in concatenative speech synthesis," in Proc. ICASSP, Salt Lake City, UT, 2001.
- (2001) Proc. ICASSP
- Stylianou, Y.¹ Syrdal, A.K.²

8
- 80051612889
- A new distance measure for costing spectral discontinuities in concatenative speech synthesisers
- Perthshire, U.K
- R. E. Donovan, "A new distance measure for costing spectral discontinuities in concatenative speech synthesisers," in Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis, Perthshire, U.K., 2001, pp. 59-62.
- (2001) Proc. 4th ISCA Tutorial and Research Workshop on Speech Synthesis , pp. 59-62
- Donovan, R.E.¹

9
- 0023419762
- Globally optimising formant tracker using generalized centroids
- A. Crowe and M. A. Jack, "Globally optimising formant tracker using generalized centroids," Electron. Lett., vol. 23, no. 19, pp. 1019-1020, 1987.
- (1987) Electron. Lett , vol.23 , Issue.19 , pp. 1019-1020
- Crowe, A.¹ Jack, M.A.²

10
- 34047270177
- Analysis of fricatives using multiple centres of gravity
- A. A. Wrench, "Analysis of fricatives using multiple centres of gravity," in Proc. Int. Congr. Phonetic Sciences, vol. 4, 1995, pp. 460-463.
- (1995) Proc. Int. Congr. Phonetic Sciences , vol.4 , pp. 460-463
- Wrench, A.A.¹

11
- 85009279358
- Objective distance measures for spectral discontinuities in concatenative speech synthesis
- Denver, CO
- J. Vepa, S. King, and P. Taylor, "Objective distance measures for spectral discontinuities in concatenative speech synthesis," in Proc. ICSLP, Denver, CO, 2002.
- (2002) Proc. ICSLP
- Vepa, J.¹ King, S.² Taylor, P.³

12
- 84966318856
- New objective distance measures for spectral discontinuities in concatenative speech synthesis
- Santa Monica, CA, Sep
- _, "New objective distance measures for spectral discontinuities in concatenative speech synthesis," in Proc. IEEE 2002 Workshop on Speech Synthesis, Santa Monica, CA, Sep. 2002.
- (2002) Proc. IEEE 2002 Workshop on Speech Synthesis
- Vepa, J.¹ King, S.² Taylor, P.³

13
- 85009167944
- Kalman-filter based join cost for unit-selection speech synthesis
- Geneva, Switzerland, Sep
- J. Vepa and S. King, "Kalman-filter based join cost for unit-selection speech synthesis," in Proc. Eurospeech, Geneva, Switzerland, Sep. 2003.
- (2003) Proc. Eurospeech
- Vepa, J.¹ King, S.²

14
- 34047252102
- Join cost for unit selection speech synthesis
- _, "Join cost for unit selection speech synthesis," in Text to Speech Synthesis: New Paradigms and Advances, A. Alwan and S. Narayanan, Eds. Upper Saddle River, NJ: Prentice-Hall, 2004.

15
- 0003991331
- New York: Springer
- J. Olive, A. Greenwood, and J. Coleman, Acoustics of American English Speech: A Dynamic Approach. New York: Springer, 1993.
- (1993) Acoustics of American English Speech: A Dynamic Approach
- Olive, J.¹ Greenwood, A.² Coleman, J.³

16
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

17
- 33751539300
- Line spectrum representation of linear predictor coefficients of speech signals
- F. Itakura, "Line spectrum representation of linear predictor coefficients of speech signals," J. Acoust. Soc. Amer., vol. 57, p. S35(A), 1975.
- (1975) J. Acoust. Soc. Amer , vol.57
- Itakura, F.¹

18
- 34047246080
- Line spectrum pairs (LSP) and speech data compression
- F. K. Soong and B. H. Juang, "Line spectrum pairs (LSP) and speech data compression," in Proc. ICASSP, 1984, pp. 1.10.1-1.10.4.

19
- 33846687633
- Linear Dynamic Models for Automatic Speech Recognition
- Ph.D. dissertation, Univ. of Edinburgh, Edinburgh, U.K
- J. Frankel, "Linear Dynamic Models for Automatic Speech Recognition," Ph.D. dissertation, Univ. of Edinburgh, Edinburgh, U.K., 2003.
- (2003)
- Frankel, J.¹

20
- 85024429815
- A new approach to linear filtering and prediction problems
- R. E. Kalman, "A new approach to linear filtering and prediction problems," Trans. Amer. Soc. Mech. Eng., Series D, J. Basic Eng., vol. 82, pp. 35-45, 1960.
- (1960) J. Basic Eng , vol.82 , pp. 35-45
- Kalman, R.E.¹

21
- 0002077742
- Quantization of LPC parameters
- W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier
- K. K. Paliwal and W. B. Kleijn, "Quantization of LPC parameters," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier, 1995, pp. 433-466.
- (1995) Speech Coding and Synthesis , pp. 433-466
- Paliwal, K.K.¹ Kleijn, W.B.²

22
- 0003834176
- Utrecht, The Netherlands: Kluwer
- T. Dutoit, An Introduction to Text-to-Speech Synthesis. Utrecht, The Netherlands: Kluwer, 1997.
- (1997) An Introduction to Text-to-Speech Synthesis
- Dutoit, T.¹

23
- 0036497601
- A comparison of spectral smoothing methods for segment concatenation based speech synthesis
- D. T. Chappell and J. H. Hansen, "A comparison of spectral smoothing methods for segment concatenation based speech synthesis," Speech Commun., vol. 36, pp. 343-374, 2002.
- (2002) Speech Commun , vol.36 , pp. 343-374
- Chappell, D.T.¹ Hansen, J.H.²

24
- 34047253695
- The Festival Speech Synthesis System: System Documentation
- A. Black and P. Taylor, "The Festival Speech Synthesis System: System Documentation," Human Communication Research Centre, Univ. of Edinburgh, Edinburgh, U.K., Tech. Rep. HCRC/TR-83, 1997.

25
- 0004062335
- St. Paul, MN: West
- W. J. McGhee, Introductory Statistics. St. Paul, MN: West, 1985.
- (1985) Introductory Statistics
- McGhee, W.J.¹

* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.