메뉴 건너뛰기




Volumn 50, Issue 1, 2008, Pages 67-80

A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis

Author keywords

Automatic speech segmentation; Boundary model; Brandt's GLR algorithm; Hard supervision; HMM; Soft supervision; Speech synthesis

Indexed keywords

ALGORITHMS; GAUSSIAN DISTRIBUTION; HIDDEN MARKOV MODELS; SIGNAL DETECTION;

EID: 35348856844     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.07.001     Document Type: Article
Times cited : (22)

References (18)
  • 1
    • 35348888548 scopus 로고    scopus 로고
    • Adell, J., Bonafonte, A., 2004. Towards phone segmentation for concatenative speech synthesis. In: Proc. 5th ISCA Workshop on Speech Synthesis, June, pp. 139-144.
  • 2
    • 0023831656 scopus 로고
    • A new statistical approach for the automatic segmentation of continuous speech signals
    • André-Obrecht R. A new statistical approach for the automatic segmentation of continuous speech signals. IEEE Trans. Acoust. Speech Signal Process. 36 January (1988) 29-40
    • (1988) IEEE Trans. Acoust. Speech Signal Process. , vol.36 , Issue.January , pp. 29-40
    • André-Obrecht, R.1
  • 3
    • 0020497768 scopus 로고    scopus 로고
    • Brandt, A.V., 1983. Detecting and estimating parameters jumps using ladder algorithms and likelihood ratio test. In: IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 1983), November, pp. 1017-1020.
  • 4
    • 0027646354 scopus 로고
    • Automatic segmentation and labelling of speech based on Hidden Markov Models
    • Brugnara F., Falavigna D., and Omologo M. Automatic segmentation and labelling of speech based on Hidden Markov Models. Speech Comm. 12 August (1993) 357-370
    • (1993) Speech Comm. , vol.12 , Issue.August , pp. 357-370
    • Brugnara, F.1    Falavigna, D.2    Omologo, M.3
  • 5
    • 35348928855 scopus 로고    scopus 로고
    • ITU-TRecommendation P.800.1. Mean opinion score (MOS) terminology. 2003.
  • 6
    • 35348880475 scopus 로고    scopus 로고
    • Jarifi, S., 2007. Segmentation automatique de corpus de parole continue dédiés à la synthèse vocale. Ph.D. thesis, École Nationale Supérieure Des Télécommunications de Bretagne and University of Rennes I.
  • 7
    • 84863683402 scopus 로고    scopus 로고
    • Jarifi, S., Pastor, D., Rosec, O., 2005. Brandt's GLR method & refined HMM segmentation for TTS synthesis application. In: 13th European Signal Processing Conf. (EUSIPCO 2005), September.
  • 8
    • 44949094565 scopus 로고    scopus 로고
    • Jarifi, S., Pastor, D., Rosec, O., 2006. Cooperation between global and local methods for the automatic segmentation of speech synthesis corpora. In: 9th Internat. Conf. on Spoken Language Processing (ICSLP 2006), September.
  • 9
    • 85009241673 scopus 로고    scopus 로고
    • Kim, Y.J., Conkie, A., 2002. Automatic segmentation combining an HMM-based approach and spectral boundary correction. In: 7th Internat. Conf. on Spoken Language Processing (ICSLP 2002), September, pp. 145-148.
  • 10
    • 85009152114 scopus 로고    scopus 로고
    • Matousek, J., Tihelka, D., Psutka, J., 2003. Automatic segmentation for czech concatenative speech synthesis using statistical approach with boundary-specific correction. In: 8th European Conf. on Speech Communication and Technology (Eurospeech 2003), September, pp. 301-304.
  • 11
    • 35348841053 scopus 로고    scopus 로고
    • Nefti, S., 2004. Segmentation automatique de la parole en phones. Correction d'étiquetage par l'introduction de mesures de confiance. Ph.D. thesis, University of Rennes I.
  • 12
    • 35348900904 scopus 로고    scopus 로고
    • Odell, J.J., 1995. The use of context in large vocabulary speech recognition. Ph.D. thesis, The University of Cambridge.
  • 13
    • 33749336407 scopus 로고    scopus 로고
    • Automatic speech segmentation based on boundary-type candidate selection
    • Park S.S., and Kim N.S. Automatic speech segmentation based on boundary-type candidate selection. IEEE Signal Process. Lett. 13 September (2006) 640-643
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.September , pp. 640-643
    • Park, S.S.1    Kim, N.S.2
  • 14
    • 35348848607 scopus 로고    scopus 로고
    • Park, S.S., Shin, J.W., Kim, N.S., 2006. Automatic speech segmentation with multiple statistical models. In: 9th Internat. Conf. on Spoken Language Processing (ICSLP 2006), September.
  • 15
    • 35348828704 scopus 로고    scopus 로고
    • Torre Toledano, D., Rodríguez Crespo, M.A., Escalada Sardina, J.G., 1998. Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules. In: Third ESCA/COSCOSDA Internat. Workshop on Speech Synthesis, November, pp. 26-29.
  • 17
    • 4544373879 scopus 로고    scopus 로고
    • Wang, L., Zhao, Y., Chu, M., Zhou, J., Cao, Z., 2004. Refining Segmental boundaries for TTS Database using fine contextual-dependent boundary models. In: IEEE Internat. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2004), May, Vol. I, pp. 641-644.
  • 18
    • 35348893956 scopus 로고    scopus 로고
    • Young, S., Evermann, G., Hain, T., Kershaw, D., Moore, G., Odell, J., 2002. The HTK Book for HTK V 3.2.1.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.