메뉴 건너뛰기




Volumn , Issue , 2008, Pages 2237-2240

Audiovisual-to-articulatory speech inversion using active appearance models for the face and Hidden Markov Models for the dynamics

Author keywords

Articulatory; Audiovisual; Fusion; Hidden markov models; Speech inversion

Indexed keywords

ACOUSTICS; COMPUTATIONAL GRAMMARS; COMPUTER NETWORKS; DYNAMICS; FEATURE EXTRACTION; LEARNING SYSTEMS; MARKOV PROCESSES; OBJECT RECOGNITION; SIGNAL PROCESSING; SPEECH; SPEECH RECOGNITION;

EID: 51449089369     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2008.4518090     Document Type: Conference Paper
Times cited : (10)

References (14)
  • 1
    • 34548378893 scopus 로고    scopus 로고
    • Reconstructing tongue movements from audio and video
    • H. Kjellstrom, O. Engwall, and O. Balter, "Reconstructing tongue movements from audio and video," in Interspeech, 2006, pp. 2238-2241.
    • (2006) Interspeech , pp. 2238-2241
    • Kjellstrom, H.1    Engwall, O.2    Balter, O.3
  • 2
    • 33745183111 scopus 로고    scopus 로고
    • Introducing visual cues in acoustic-to-articulatory inversion
    • O. Engwall, "Introducing visual cues in acoustic-to-articulatory inversion," in INTERSPEECH, 2005, pp. 3205-3208.
    • (2005) INTERSPEECH , pp. 3205-3208
    • Engwall, O.1
  • 4
    • 0032178592 scopus 로고    scopus 로고
    • Quantitative association of vocaltract and facial behavior
    • H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of vocaltract and facial behavior," Sp. Comm., vol. 26, pp. 23-43, 1998.
    • (1998) Sp. Comm , vol.26 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3
  • 5
    • 0038359547 scopus 로고    scopus 로고
    • Modelling the uncertainty in recovering articulation from acoustics
    • K. Richmond, S. King, and P. Taylor, "Modelling the uncertainty in recovering articulation from acoustics," Computer Speech and Language, vol. 17, pp. 153-172, 2003.
    • (2003) Computer Speech and Language , vol.17 , pp. 153-172
    • Richmond, K.1    King, S.2    Taylor, P.3
  • 6
    • 2142659020 scopus 로고    scopus 로고
    • Estimation of articulatory movements from speech acoustics using an hmm-based speech production model
    • March
    • S. Hiroya and M. Honda, "Estimation of articulatory movements from speech acoustics using an hmm-based speech production model," IEEE TSAP, vol. 12, no. 2, pp. 175-185, March 2004.
    • (2004) IEEE TSAP , vol.12 , Issue.2 , pp. 175-185
    • Hiroya, S.1    Honda, M.2
  • 8
    • 48149088768 scopus 로고    scopus 로고
    • Resynthesis of 3d tongue movements from facial data
    • O. Engwall and J. Beskow, "Resynthesis of 3d tongue movements from facial data," in EUROSPEECH, 2003.
    • (2003) EUROSPEECH
    • Engwall, O.1    Beskow, J.2
  • 10
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Tr. Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
    • (2000) IEEE Tr. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 13
    • 0035680116 scopus 로고    scopus 로고
    • Rapid object detection using a boosted cascade of simple features
    • P. Viola and M.J. Jones, "Rapid object detection using a boosted cascade of simple features," in Proc. IEEE Conf. on Comp. Vision and Pat. Recog., 2001, vol. I, pp. 511-518.
    • (2001) Proc. IEEE Conf. on Comp. Vision and Pat. Recog , vol.1 , pp. 511-518
    • Viola, P.1    Jones, M.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.