SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2008, Pages 2237-2240

Audiovisual-to-articulatory speech inversion using active appearance models for the face and Hidden Markov Models for the dynamics

(3) Katsamanis, Athanassios a Papandreou, George a Maragos, Petros a

a NATIONAL TECHNICAL UNIVERSITY OF ATHENS (Greece)

Author keywords

Articulatory; Audiovisual; Fusion; Hidden markov models; Speech inversion

Indexed keywords

ACOUSTICS; COMPUTATIONAL GRAMMARS; COMPUTER NETWORKS; DYNAMICS; FEATURE EXTRACTION; LEARNING SYSTEMS; MARKOV PROCESSES; OBJECT RECOGNITION; SIGNAL PROCESSING; SPEECH; SPEECH RECOGNITION;

ARTICULATORY; AUDIOVISUAL; FUSION; INTERNATIONAL CONFERENCES; SPEECH INVERSION;

HIDDEN MARKOV MODELS;

EID: 51449089369 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2008.4518090 Document Type: Conference Paper

Times cited : (10)

References (14)

1
- 34548378893
- Reconstructing tongue movements from audio and video
- H. Kjellstrom, O. Engwall, and O. Balter, "Reconstructing tongue movements from audio and video," in Interspeech, 2006, pp. 2238-2241.
- (2006) Interspeech , pp. 2238-2241
- Kjellstrom, H.¹ Engwall, O.² Balter, O.³

2
- 33745183111
- Introducing visual cues in acoustic-to-articulatory inversion
- O. Engwall, "Introducing visual cues in acoustic-to-articulatory inversion," in INTERSPEECH, 2005, pp. 3205-3208.
- (2005) INTERSPEECH , pp. 3205-3208
- Engwall, O.¹

3
- 0036874551
- On the relationship between face movements, tongue movements, and speech acoustics
- J. Jiang, A. Alwan, P. A. Keating, E. T. Auer Jr., and L. E. Bernstein, "On the relationship between face movements, tongue movements, and speech acoustics," EURASIP Journal on Applied Signal Processing, vol. 11, pp. 1174-1188, 2002.
- (2002) EURASIP Journal on Applied Signal Processing , vol.11 , pp. 1174-1188
- Jiang, J.¹ Alwan, A.² Keating, P.A.³ Auer Jr., E.T.⁴ Bernstein, L.E.⁵

4
- 0032178592
- Quantitative association of vocaltract and facial behavior
- H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of vocaltract and facial behavior," Sp. Comm., vol. 26, pp. 23-43, 1998.
- (1998) Sp. Comm , vol.26 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis-Bateson, E.³

5
- 0038359547
- Modelling the uncertainty in recovering articulation from acoustics
- K. Richmond, S. King, and P. Taylor, "Modelling the uncertainty in recovering articulation from acoustics," Computer Speech and Language, vol. 17, pp. 153-172, 2003.
- (2003) Computer Speech and Language , vol.17 , pp. 153-172
- Richmond, K.¹ King, S.² Taylor, P.³

6
- 2142659020
- Estimation of articulatory movements from speech acoustics using an hmm-based speech production model
- March
- S. Hiroya and M. Honda, "Estimation of articulatory movements from speech acoustics using an hmm-based speech production model," IEEE TSAP, vol. 12, no. 2, pp. 175-185, March 2004.
- (2004) IEEE TSAP , vol.12 , Issue.2 , pp. 175-185
- Hiroya, S.¹ Honda, M.²

7
- 48149084421
- Audiovisual-to- articulatory speech inversion using hmms
- A. Katsamanis, G. Papandreou, and P. Maragos, "Audiovisual-to- articulatory speech inversion using hmms," in Proceedings of IEEE Int'l Workshop on Multimedia Signal Processing (MMSP 2007).
- Proceedings of IEEE Int'l Workshop on Multimedia Signal Processing (MMSP 2007)
- Katsamanis, A.¹ Papandreou, G.² Maragos, P.³

8
- 48149088768
- Resynthesis of 3d tongue movements from facial data
- O. Engwall and J. Beskow, "Resynthesis of 3d tongue movements from facial data," in EUROSPEECH, 2003.
- (2003) EUROSPEECH
- Engwall, O.¹ Beskow, J.²

9
- 0003607151
- Acad. Press
- K. V. Mardia, J. T. Kent, and J. M. Bibby, Multivariate Analysis, Acad. Press, 1979.
- (1979) Multivariate Analysis
- Mardia, K.V.¹ Kent, J.T.² Bibby, J.M.³

10
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Tr. Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
- (2000) IEEE Tr. Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

11
- 84883424118
- Rule-based visual speech synthesis
- J. Beskow, "Rule-based visual speech synthesis," in Proc. of the 4th European Conference on Speech Communication and Technology (EUROSPEECH 95), 1995.
- (1995) Proc. of the 4th European Conference on Speech Communication and Technology (EUROSPEECH 95)
- Beskow, J.¹

12
- 0035363218
- Active appearance models
- T. F. Cootes, G. J. Edwards, and C. J. Taylor, "Active appearance models," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 23, no. 6, pp. 681-685, 2001.
- (2001) IEEE Trans. on Pattern Analysis and Machine Intelligence , vol.23 , Issue.6 , pp. 681-685
- Cootes, T.F.¹ Edwards, G.J.² Taylor, C.J.³

13
- 0035680116
- Rapid object detection using a boosted cascade of simple features
- P. Viola and M.J. Jones, "Rapid object detection using a boosted cascade of simple features," in Proc. IEEE Conf. on Comp. Vision and Pat. Recog., 2001, vol. I, pp. 511-518.
- (2001) Proc. IEEE Conf. on Comp. Vision and Pat. Recog , vol.1 , pp. 511-518
- Viola, P.¹ Jones, M.J.²

14
- 0034842342
- Asynchronous stream modeling for large vocabulary audio-visual speech recognition
- J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, 2001.
- (2001) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing
- Luettin, J.¹ Potamianos, G.² Neti, C.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.