SCOPUS Information Retrieval Platform - View Article

Speech Communication

Volume 50, Issue 1, 2008, Pages 67-80

A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis

Authors (3): Jarifi, Safaa (a); Pastor, Dominique (a); Rosec, Olivier (b)

a ECOLE DES MINES DE NANTES (France)

b ORANGE LABS (France)

Author keywords

Automatic speech segmentation; Boundary model; Brandt's GLR algorithm; Hard supervision; HMM; Soft supervision; Speech synthesis

Indexed keywords

ALGORITHMS; GAUSSIAN DISTRIBUTION; HIDDEN MARKOV MODELS; SIGNAL DETECTION;

AUTOMATIC SPEECH SEGMENTATION; BRANDT'S GLR ALGORITHM; HARD SUPERVISION; SOFT SUPERVISION;

SPEECH SYNTHESIS;

EID: 35348856844; PISSN: 01676393; EISSN: None; Source Type: Journal
DOI: 10.1016/j.specom.2007.07.001; Document Type: Article

Times cited : (22)

References (18)

1
- 35348888548
- Adell, J., Bonafonte, A., 2004. Towards phone segmentation for concatenative speech synthesis. In: Proc. 5th ISCA Workshop on Speech Synthesis, June, pp. 139-144.

2
- 0023831656
- André-Obrecht, R., 1988. A new statistical approach for the automatic segmentation of continuous speech signals. IEEE Trans. Acoust. Speech Signal Process. 36 (January), 29-40.

3
- 0020497768
- Brandt, A.V., 1983. Detecting and estimating parameters jumps using ladder algorithms and likelihood ratio test. In: IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 1983), November, pp. 1017-1020.

4
- 0027646354
- Brugnara, F., Falavigna, D., Omologo, M., 1993. Automatic segmentation and labelling of speech based on Hidden Markov Models. Speech Comm. 12 (August), 357-370.

5
- 35348928855
- ITU-T Recommendation P.800.1. Mean opinion score (MOS) terminology. 2003.

6
- 35348880475
- Jarifi, S., 2007. Segmentation automatique de corpus de parole continue dédiés à la synthèse vocale. Ph.D. thesis, École Nationale Supérieure Des Télécommunications de Bretagne and University of Rennes I.

7
- 84863683402
- Jarifi, S., Pastor, D., Rosec, O., 2005. Brandt's GLR method & refined HMM segmentation for TTS synthesis application. In: 13th European Signal Processing Conf. (EUSIPCO 2005), September.

8
- 44949094565
- Jarifi, S., Pastor, D., Rosec, O., 2006. Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora. In: 9th Internat. Conf. on Spoken Language Processing (ICSLP 2006), September.

9
- 85009241673
- Kim, Y.J., Conkie, A., 2002. Automatic segmentation combining an HMM-based approach and spectral boundary correction. In: 7th Internat. Conf. on Spoken Language Processing (ICSLP 2002), September, pp. 145-148.

10
- 85009152114
- Matousek, J., Tihelka, D., Psutka, J., 2003. Automatic segmentation for Czech concatenative speech synthesis using statistical approach with boundary-specific correction. In: 8th European Conf. on Speech Communication and Technology (Eurospeech 2003), September, pp. 301-304.

11
- 35348841053
- Nefti, S., 2004. Segmentation automatique de la parole en phones. Correction d'étiquetage par l'introduction de mesures de confiance. Ph.D. thesis, University of Rennes I.

12
- 35348900904
- Odell, J.J., 1995. The use of context in large vocabulary speech recognition. Ph.D. thesis, The University of Cambridge.

13
- 33749336407
- Park, S.S., Kim, N.S., 2006. Automatic speech segmentation based on boundary-type candidate selection. IEEE Signal Process. Lett. 13 (September), 640-643.

14
- 35348848607
- Park, S.S., Shin, J.W., Kim, N.S., 2006. Automatic speech segmentation with multiple statistical models. In: 9th Internat. Conf. on Spoken Language Processing (ICSLP 2006), September.

15
- 35348828704
- Torre Toledano, D., Rodríguez Crespo, M.A., Escalada Sardina, J.G., 1998. Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules. In: Third ESCA/COSCOSDA Internat. Workshop on Speech Synthesis, November, pp. 26-29.

16
- 0347968276
- Torre Toledano, D., Hernández Gómez, L.A., Villarubia Grande, L., 2003. Automatic phonetic segmentation. IEEE Trans. Speech Audio Process. 11 (November), 617-625.

17
- 4544373879
- Wang, L., Zhao, Y., Chu, M., Zhou, J., Cao, Z., 2004. Refining Segmental boundaries for TTS Database using fine contextual-dependent boundary models. In: IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2004), May, Vol. I, pp. 641-644.

18
- 35348893956
- Young, S., Evermann, G., Hain, T., Kershaw, D., Moore, G., Odell, J., 2002. The HTK Book for HTK V 3.2.1.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.