메뉴 건너뛰기




Volumn 11, Issue 6, 2003, Pages 617-625

Automatic Phonetic Segmentation

Author keywords

Speech analysis; Speech recognition; Speech synthesis

Indexed keywords

FUZZY SETS; MARKOV PROCESSES; MATHEMATICAL MODELS; NEURAL NETWORKS; SPEECH RECOGNITION; STATISTICAL METHODS;

EID: 0347968276     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2003.813579     Document Type: Article
Times cited : (143)

References (36)
  • 1
    • 85078422187 scopus 로고
    • A preliminary statistical evaluation of manual and automatic segmentation discrepancies
    • P. Cosi, D. Falavigna, and M. Omologo, "A preliminary statistical evaluation of manual and automatic segmentation discrepancies," in Proceedings EUROSPEECH, 1991, pp. 693-696.
    • (1991) Proceedings EUROSPEECH , pp. 693-696
    • Cosi, P.1    Falavigna, D.2    Omologo, M.3
  • 2
    • 0346892469 scopus 로고    scopus 로고
    • Automatic speech segmentation for concatenative inventory selection
    • J. P. H. Van Santen, Ed: Springer
    • A. Ljolje, J. Hirschberg, and J. P. H. Van Santen, "Automatic speech segmentation for concatenative inventory selection," in Progress in Speech Synthesis, J. P. H. Van Santen, Ed: Springer, 1997, pp. 305-311.
    • (1997) Progress in Speech Synthesis , pp. 305-311
    • Ljolje, A.1    Hirschberg, J.2    Van Santen, J.P.H.3
  • 3
    • 0344612854 scopus 로고
    • Automatic segmentation of speech for ITS
    • A. Ljolje and M. D. Riley, "Automatic segmentation of speech for ITS," in Proceedings EUROSPEECH, 1993, pp. 1445-1448.
    • (1993) Proceedings EUROSPEECH , pp. 1445-1448
    • Ljolje, A.1    Riley, M.D.2
  • 6
    • 0346126969 scopus 로고    scopus 로고
    • The aligner: Text-to-speech alignment using markov models
    • J. P. H. Van Santen, Ed: Springer
    • C. W. Wightman and D. T. Talkin, "The aligner: Text-to-speech alignment using markov models," in Progress in Speech Synthesis, J. P. H. Van Santen, Ed: Springer, 1997, pp. 313-323.
    • (1997) Progress in Speech Synthesis , pp. 313-323
    • Wightman, C.W.1    Talkin, D.T.2
  • 9
    • 0346262149 scopus 로고    scopus 로고
    • Automatic diphone extraction for an Italian text-to-speech synthesis system
    • B. Angelini, C. Barolo, D. Falavigna, M. Omologo, and S. Sandri, "Automatic diphone extraction for an Italian text-to-speech synthesis system," in Proceedings EUROSPEECH, vol. II, 1997, pp. 581-584.
    • (1997) Proceedings EUROSPEECH , vol.2 , pp. 581-584
    • Angelini, B.1    Barolo, C.2    Falavigna, D.3    Omologo, M.4    Sandri, S.5
  • 10
    • 0346892468 scopus 로고
    • Labeller - A system for automatic labeling of speech continuous signal
    • R. Gubrynowicz and A. Wrzoskowicz, "Labeller - A system for automatic labeling of speech continuous signal," in Proceedings EUROSPEECH, 1993, pp. 297-300.
    • (1993) Proceedings EUROSPEECH , pp. 297-300
    • Gubrynowicz, R.1    Wrzoskowicz, A.2
  • 11
    • 0348153019 scopus 로고
    • A segmentai approach versus a centisecond one for automatic phonetic time-alignment
    • A. Farhat, G. Pérennou, and R. André-Obrecht, "A segmentai approach versus a centisecond one for automatic phonetic time-alignment," in Proceedings EUROSPEECH, 1993, pp. 657-660.
    • (1993) Proceedings EUROSPEECH , pp. 657-660
    • Farhat, A.1    Pérennou, G.2    André-Obrecht, R.3
  • 12
    • 0347522318 scopus 로고
    • A nonlinear filtering method applied to automatic segmentation of multilingual speech corpora
    • H. Kabré, G. Pérennou, and N. Vigouroux, "A nonlinear filtering method applied to automatic segmentation of multilingual speech corpora," in Proceedings EUROSPEECH, 1991, pp. 689-702.
    • (1991) Proceedings EUROSPEECH , pp. 689-702
    • Kabré, H.1    Pérennou, G.2    Vigouroux, N.3
  • 14
    • 0346892465 scopus 로고
    • Automatic labeling of speech using an acoustic-phonetic knowledge base
    • C. G. J. Houben, "Automatic labeling of speech using an acoustic-phonetic knowledge base," in Proceedings EUROSPEECH, vol. 2, 1989, pp. 104-107.
    • (1989) Proceedings EUROSPEECH , vol.2 , pp. 104-107
    • Houben, C.G.J.1
  • 16
    • 0346262153 scopus 로고    scopus 로고
    • On the use of F0 features in automatic segmentation for speech synthesis
    • Sydney, NSW, Australia
    • T. Saito, "On the use of F0 features in automatic segmentation for speech synthesis," in Proceedings of the International Conference on Spoken Language Processing, vol. VII, Sydney, NSW, Australia, 1998, pp. 2839-2842.
    • (1998) Proceedings of the International Conference on Spoken Language Processing , vol.7 , pp. 2839-2842
    • Saito, T.1
  • 17
    • 0348153016 scopus 로고
    • Robust automatic extraction of diphones with variable boundaries
    • Madrid, Spain
    • D. Yarrington, H. Timothy, and G. Ball, "Robust automatic extraction of diphones with variable boundaries," in Proceedings EUROSPEECH, Madrid, Spain, 1995, pp. 1845-1848.
    • (1995) Proceedings EUROSPEECH , pp. 1845-1848
    • Yarrington, D.1    Timothy, H.2    Ball, G.3
  • 19
    • 0346262154 scopus 로고    scopus 로고
    • Automatic segmentation: Data-driven units of speech
    • Rhodes, Greece
    • W. Beet and L. Baghay-Ravary, "Automatic segmentation: Data-driven units of speech," in Proceedings EUROSPEECH, Rhodes, Greece, 1997, pp. 505-508.
    • (1997) Proceedings EUROSPEECH , pp. 505-508
    • Beet, W.1    Baghay-Ravary, L.2
  • 23
    • 0347387926 scopus 로고
    • Fast automatic segmentation and labeling: Results on TIMIT and EUROMO
    • Madrid, Spain
    • A. Vorstermans, J. M. Martens, and B. Van Colle, "Fast automatic segmentation and labeling: Results on TIMIT and EUROMO," in Proceedings EUROSPEECH, Madrid, Spain, 1995, pp. 1397-1400.
    • (1995) Proceedings EUROSPEECH , pp. 1397-1400
    • Vorstermans, A.1    Martens, J.M.2    Van Colle, B.3
  • 24
    • 0004565879 scopus 로고    scopus 로고
    • High-quality speech synthesis for phonetic speech segmentation
    • F. Malfrère and T. Dutoit, "High-quality speech synthesis for phonetic speech segmentation," in Proceedings EUROSPEECH, 1997, pp. 2631-2634.
    • (1997) Proceedings EUROSPEECH , pp. 2631-2634
    • Malfrère, F.1    Dutoit, T.2
  • 25
    • 0346892467 scopus 로고
    • Automatic segmentation and quality evaluation of speech unit inventories for concatenation-based multilingual PSOLA text-to-speech systems
    • O. Boëffard, B. Cherbonnel, F. Emerard, and S. White, "Automatic segmentation and quality evaluation of speech unit inventories for concatenation-based multilingual PSOLA text-to-speech systems," in Proceedings EUROSPEECH, 1993, pp. 1449-1452.
    • (1993) Proceedings EUROSPEECH , pp. 1449-1452
    • Boëffard, O.1    Cherbonnel, B.2    Emerard, F.3    White, S.4
  • 29
    • 0348153018 scopus 로고    scopus 로고
    • Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules
    • N. Compbell, Ed., to be published
    • D. T. Toledano, M. A. Rodríguez, J. G. Escalada, and L. A. Hernández, Trying to mimic human segmentation of speech using HMM and fuzzy logic post-correction rules, in Progress in Speech Synthesis, N. Compbell, Ed., to be published.
    • Progress in Speech Synthesis
    • Toledano, D.T.1    Rodríguez, M.A.2    Escalada, J.G.3    Hernández, L.A.4
  • 31
    • 50549091068 scopus 로고    scopus 로고
    • Local refinement of phonetic boundaries: A general framework and its application using different transition models
    • Aalborg, Denmark, Sept.
    • D. T. Toledano and L. A. Hernández, "Local refinement of phonetic boundaries: A general framework and its application using different transition models," in Proceedings EUSOSPEECH, Aalborg, Denmark, Sept. 2001.
    • (2001) Proceedings EUSOSPEECH
    • Toledano, D.T.1    Hernández, L.A.2
  • 34
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
    • Apr.
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models," Computer, Speech and Language, vol. 9, no. 2, pp. 171-185, Apr. 1995.
    • (1995) Computer, Speech and Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 35
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate gaussian observations of markov chains
    • J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate gaussian observations of markov chains," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 2, pp. 291-298, 1994.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 36
    • 0030372637 scopus 로고    scopus 로고
    • A probabilistic framework for feature-based speech recognition
    • Philadelphia, PA, Oct.
    • J. Glass, J. Chang, and M. McCandless, "A probabilistic framework for feature-based speech recognition," in Proc. Int. Conf. Speech Language Processing 96, Philadelphia, PA, Oct. 1996, pp. 2277-2280.
    • (1996) Proc. Int. Conf. Speech Language Processing 96 , pp. 2277-2280
    • Glass, J.1    Chang, J.2    McCandless, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.