



Volume , Issue , 2008, Pages

Audiovisual speech inversion by switching dynamical modeling governed by a Hidden Markov process

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE APPEARANCE MODELS; AUDIO-VISUAL SPEECH; CLASSIFICATION ANALYSIS; CORRELATION COEFFICIENT; DYNAMICAL MODELING; EVALUATION SCHEME; HIDDEN MARKOV PROCESS; INVERSION PROBLEMS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; PREDICTION ERRORS; RADIAL BASIS FUNCTIONS; ROOT MEAN SQUARED ERRORS; STATE SEQUENCES; SWITCHING LINEAR DYNAMICAL SYSTEMS; UNIFIED FRAMEWORK; VISUAL ANALYSIS;

EID: 84863731362     PISSN: 22195491     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (4)

References (23)
  • 1
    • 84966270944
    • Articulatory modeling: A possible role in concatenative text-to-speech synthesis
    • M. Sondhi, "Articulatory modeling: a possible role in concatenative text-to-speech synthesis," in IEEE Workshop on Speech Synthesis, Santa Monica, USA, 2002.
    • (2002) IEEE Workshop on Speech Synthesis, Santa Monica, USA
    • Sondhi, M.1
  • 5
    • 22144465830
    • Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion
    • S. Ouni and Y. Laprie, "Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion," Journal of the Acoustical Society of America, vol. 118, no. 1, pp. 444-460, 2005.
    • (2005) Journal of the Acoustical Society of America , vol.118 , Issue.1 , pp. 444-460
    • Ouni, S.1    Laprie, Y.2
  • 6
    • 0038359547
    • Modelling the uncertainty in recovering articulation from acoustics
    • K. Richmond, S. King, and P. Taylor, "Modelling the uncertainty in recovering articulation from acoustics," Computer Speech and Language, vol. 17, pp. 153-172, 2003.
    • (2003) Computer Speech and Language , vol.17 , pp. 153-172
    • Richmond, K.1    King, S.2    Taylor, P.3
  • 7
    • 38649140222
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • T. Toda, A. W. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model," Speech Communication, vol. 50, pp. 215-227, 2008.
    • (2008) Speech Communication , vol.50 , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 8
    • 0010424152
    • Acoustic-to-articulatory inversion using dynamical and phonological constraints
    • S. Dusan and L. Deng, "Acoustic-to-articulatory inversion using dynamical and phonological constraints," in Proceedings of Seminar on Speech Production, 2000, pp. 237-240.
    • (2000) Proceedings of Seminar on Speech Production , pp. 237-240
    • Dusan, S.1    Deng, L.2
  • 9
    • 2142659020
    • Estimation of articulatory movements from speech acoustics using an HMM-based speech production model
    • S. Hiroya and M. Honda, "Estimation of articulatory movements from speech acoustics using an HMM-based speech production model," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 2, pp. 175-185, March 2004.
    • (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.2 , pp. 175-185
    • Hiroya, S.1    Honda, M.2
  • 10
    • 0032178592
    • Quantitative association of vocal-tract and facial behavior
    • H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Speech Communication, vol. 26, pp. 23-43, 1998.
    • (1998) Speech Communication , vol.26 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3
  • 12
    • 33745183111
    • Introducing visual cues in acoustic-to-articulatory inversion
    • O. Engwall, "Introducing visual cues in acoustic-to-articulatory inversion," in Interspeech, 2005, pp. 3205-3208.
    • (2005) Interspeech , pp. 3205-3208
    • Engwall, O.1
  • 13
    • 34548378893
    • Reconstructing tongue movements from audio and video
    • H. Kjellström, O. Engwall, and O. Bälter, "Reconstructing tongue movements from audio and video," in Interspeech, 2006, pp. 2238-2241.
    • (2006) Interspeech , pp. 2238-2241
    • Kjellström, H.1    Engwall, O.2    Bälter, O.3
  • 14
    • 51449089369
    • Audiovisual-to-articulatory speech inversion using active appearance models for the face and hidden Markov models for the dynamics
    • A. Katsamanis, G. Papandreou, and P. Maragos, "Audiovisual-to-articulatory speech inversion using active appearance models for the face and hidden Markov models for the dynamics," in Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, 2008.
    • (2008) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing
    • Katsamanis, A.1    Papandreou, G.2    Maragos, P.3
  • 15
    • 21844452845
    • Resynthesis of facial and intraoral motion from simultaneous measurements
    • J. Beskow, O. Engwall, and B. Granström, "Resynthesis of facial and intraoral motion from simultaneous measurements," in Proc. of the 15th ICPhS, 2003, pp. 431-434.
    • (2003) Proc. of the 15th ICPhS , pp. 431-434
    • Beskow, J.1    Engwall, O.2    Granström, B.3
  • 18
    • 0034170950
    • Variational learning for switching state-space models
    • Z. Ghahramani and G. E. Hinton, "Variational learning for switching state-space models," Neural Computation, vol. 12, no. 4, pp. 831-864, 2000.
    • (2000) Neural Computation , vol.12 , Issue.4 , pp. 831-864
    • Ghahramani, Z.1    Hinton, G.E.2
  • 20
  • 22
    • 0037503670
    • A multichannel articulatory speech database and its application for automatic speech recognition
    • A. Wrench and W. Hardcastle, "A multichannel articulatory speech database and its application for automatic speech recognition," in Proc. 5th Seminar on Speech Production, Kloster Seeon, Bavaria, 2000, pp. 305-308. [Online]. Available: http://www.cstr.ed.ac.uk/artic
    • (2000) Proc. 5th Seminar on Speech Production, Kloster Seeon, Bavaria , pp. 305-308
    • Wrench, A.1    Hardcastle, W.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.