메뉴 건너뛰기




Volumn , Issue , 2007, Pages 457-460

Audiovisual-to-articulatory speech inversion using HMMs

Author keywords

[No Author keywords available]

Indexed keywords

HIDDEN MARKOV MODELS; MARKOV PROCESSES; SIGNAL PROCESSING; TECHNICAL PRESENTATIONS;

EID: 48149084421     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/MMSP.2007.4412915     Document Type: Conference Paper
Times cited : (9)

References (11)
  • 1
    • 34548378893 scopus 로고    scopus 로고
    • Reconstructing tongue movements from audio and video
    • H. Kjellstrom, O. Engwall. and O. Balter, "Reconstructing tongue movements from audio and video," in Interspeech, 2006, pp. 2238-2241.
    • (2006) Interspeech , pp. 2238-2241
    • Kjellstrom, H.1    Engwall, O.2    Balter, O.3
  • 2
    • 33745183111 scopus 로고    scopus 로고
    • Introducing visual cues in acoustic-to-articulatory inversion
    • O. Engwall, "Introducing visual cues in acoustic-to-articulatory inversion," in INTERSPEECH, 2005, pp. 3205-3208.
    • (2005) INTERSPEECH , pp. 3205-3208
    • Engwall, O.1
  • 4
    • 0032178592 scopus 로고    scopus 로고
    • Quantitative association of vocal-tract and facial behavior
    • H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Sp. Comm., vol. 26, pp. 23-43, 1998.
    • (1998) Sp. Comm , vol.26 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3
  • 5
    • 0038359547 scopus 로고    scopus 로고
    • Modelling the uncertainty in recovering articulation from acoustics
    • K. Richmond, S. King, and P. Taylor, "Modelling the uncertainty in recovering articulation from acoustics." Computer Speech and Language, vol. 17. pp. 153-172, 2003.
    • (2003) Computer Speech and Language , vol.17 , pp. 153-172
    • Richmond, K.1    King, S.2    Taylor, P.3
  • 6
    • 2142659020 scopus 로고    scopus 로고
    • Estimation of articulatory movements from speech acoustics using an hmm-based speech production model
    • March
    • S. Hiroya and M. Honda, "Estimation of articulatory movements from speech acoustics using an hmm-based speech production model," IEEE TSAP, vol. 12. no. 2, pp. 175-185. March 2004.
    • (2004) IEEE TSAP , vol.12 , Issue.2 , pp. 175-185
    • Hiroya, S.1    Honda, M.2
  • 7
    • 48149088768 scopus 로고    scopus 로고
    • Resynthesis of 3d tongue movements from facial data
    • O. Engwall and J. Beskow, "Resynthesis of 3d tongue movements from facial data." in EUROSPEECH, 2003.
    • (2003) EUROSPEECH
    • Engwall, O.1    Beskow, J.2
  • 8
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Tr. Multimedia, vol. 2. no. 3. pp. 141-151, 2000.
    • (2000) IEEE Tr. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 10
    • 0032023788 scopus 로고    scopus 로고
    • Wiener filters in canonical coordinates for transform coding, filtering, and quantizing
    • L. L. Scharf and J. K. Thomas, "Wiener filters in canonical coordinates for transform coding, filtering, and quantizing," IEEE TSAP, vol. 46, no. 3, pp. 647-654, 1998.
    • (1998) IEEE TSAP , vol.46 , Issue.3 , pp. 647-654
    • Scharf, L.L.1    Thomas, J.K.2
  • 11
    • 0000927638 scopus 로고    scopus 로고
    • Predicting multivariate responses in multiple linear regression
    • L. Breiman and J. H. Friedman, "Predicting multivariate responses in multiple linear regression," Journal of the Royal Stat. Soc. (B), vol. 59, no. 1, pp. 3-54, 1997.
    • (1997) Journal of the Royal Stat. Soc. (B) , vol.59 , Issue.1 , pp. 3-54
    • Breiman, L.1    Friedman, J.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.