Volume 40, Issue 4, 2003, Pages 503-515

Phonetic alignment: Speech synthesis-based vs. Viterbi-based

Author keywords

Hidden Markov models; Hybrid HMM ANN systems; Large speech corpora; Speech segmentation; Speech synthesis

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; DATABASE SYSTEMS; MARKOV PROCESSES; SPEECH ANALYSIS

EID: 0037850986     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(02)00131-0     Document Type: Article
Times cited: 44

References (36)
  • 3
    • 0001862769
    • An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes
    • Baum L.E. An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes. Inequalities. 3:1972;1-8.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.E.1
  • 5
    • 0027646354
    • Automatic segmentation and labeling of speech based on hidden Markov models
    • Brugnara B., Falavigna D., Omologo M. Automatic segmentation and labeling of speech based on hidden Markov models. Speech Commun. 1993;357-370.
    • (1993) Speech Commun. , pp. 357-370
    • Brugnara, B.1    Falavigna, D.2    Omologo, M.3
  • 12
    • 0028464214
    • Context-dependent connectionist probability estimation in a hybrid hidden Markov model-neural net speech recognition system
    • Franco H., Cohen M., Morgan N., Rumelhart D., Abrash V. Context-dependent connectionist probability estimation in a hybrid hidden Markov model-neural net speech recognition system. Comput. Speech Lang. 1994;211-222.
    • (1994) Comput. Speech Lang. , pp. 211-222
    • Franco, H.1    Cohen, M.2    Morgan, N.3    Rumelhart, D.4    Abrash, V.5
  • 13
    • 0025041264
    • Perceptual linear predictive analysis of speech
    • Hermansky H. Perceptual linear predictive analysis of speech. J. Acoust. Soc. Am. 1990.
    • (1990) J. Acoust. Soc. Am
    • Hermansky, H.1
  • 15
    • 0038133213
    • Automatic speech segmentation based on DTW with the application of the Czech TTS system
    • E. Keller, G. Bailly, A. Monaghan, J. Terken, & M. Huckvale (Eds.). John Wiley and Sons Ltd.
    • Horak P. Automatic speech segmentation based on DTW with the application of the Czech TTS system. Keller E., Bailly G., Monaghan A., Terken J., Huckvale M. Improvements in Speech Synthesis. 2001;331-340 John Wiley and Sons Ltd.
    • (2001) Improvements in Speech Synthesis , pp. 331-340
    • Horak, P.1
  • 17
    • 0016939124
    • Continuous speech recognition by statistical methods
    • Jelinek F. Continuous speech recognition by statistical methods. Proc. IEEE. 1976;532-536.
    • (1976) Proc. IEEE , pp. 532-536
    • Jelinek, F.1
  • 21
    • 0031647824
    • A frequency warping approach to speaker normalization
    • Lee L., Rose R. A frequency warping approach to speaker normalization. IEEE Trans. Speech Audio Process. 6(1):1998;49-60.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.1 , pp. 49-60
    • Lee, L.1    Rose, R.2
  • 27
    • 0012330750
    • The design for the Wall Street Journal-based CSR Corpus
    • Morgan Kaufmann Publishers
    • Paul D.B., Baker J. The design for the Wall Street Journal-based CSR Corpus. DARPA Speech and Language Workshop. 1992;Morgan Kaufmann Publishers.
    • (1992) DARPA Speech and Language Workshop
    • Paul, D.B.1    Baker, J.2
  • 29
    • 0028392167
    • An application of recurrent nets to phone probability estimation
    • Robinson A.J. An application of recurrent nets to phone probability estimation. Proc. IEEE Trans. Neural Network. 1994;298-305.
    • (1994) Proc. IEEE Trans. Neural Network , pp. 298-305
    • Robinson, A.J.1
  • 30
    • 0000329355
    • A recurrent error propagation network speech recognition system
    • Robinson A.J., Fallside F. A recurrent error propagation network speech recognition system. Comput. Speech Lang. 1991;257-286.
    • (1991) Comput. Speech Lang. , pp. 257-286
    • Robinson, A.J.1    Fallside, F.2
  • 36
    • 0025477640
    • Speech database development: TIMIT and beyond
    • Zue V., Seneff S., Glass J. Speech Database Development: TIMIT and Beyond. Speech Commun. 1990;351-356.
    • (1990) Speech Commun. , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.