SCOPUS Information Retrieval Platform

European Signal Processing Conference

Volume , Issue , 2008, Pages

Audiovisual speech inversion by switching dynamical modeling governed by a Hidden Markov process

(5) Katsamanis, A. (a); Ananthakrishnan, G. (b); Papandreou, G. (a); Maragos, P. (a); Engwall, O. (b)

a NATIONAL TECHNICAL UNIVERSITY OF ATHENS (Greece)

b ROYAL INSTITUTE OF TECHNOLOGY (Sweden)

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVE APPEARANCE MODELS; AUDIO-VISUAL SPEECH; CLASSIFICATION ANALYSIS; CORRELATION COEFFICIENT; DYNAMICAL MODELING; EVALUATION SCHEME; HIDDEN MARKOV PROCESS; INVERSION PROBLEMS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; PREDICTION ERRORS; RADIAL BASIS FUNCTIONS; ROOT MEAN SQUARED ERRORS; STATE SEQUENCES; SWITCHING LINEAR DYNAMICAL SYSTEMS; UNIFIED FRAMEWORK; VISUAL ANALYSIS;

HIDDEN MARKOV MODELS; LINEAR CONTROL SYSTEMS; RADIAL BASIS FUNCTION NETWORKS; SIGNAL PROCESSING;

IMAGE SEGMENTATION;

EID: 84863731362
PISSN: 22195491
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (4)

References (23)

1
- 84966270944
- Articulatory modeling: A possible role in concatenative text-to-speech synthesis
- M. Sondhi, "Articulatory modeling: a possible role in concatenative text-to-speech synthesis," in IEEE Workshop on Speech Synthesis, Santa Monica, USA, 2002.
- (2002) IEEE Workshop on Speech Synthesis, Santa Monica, USA
- Sondhi, M.¹

2
- 33846680938
- Speech production knowledge in automatic speech recognition
- February
- S. King, J. Frankel, K. Livescu, E. McDermott, K. Richmond, and M. Wester, "Speech production knowledge in automatic speech recognition," Journal of the Acoustical Society of America, vol. 121, no. 2, pp. 723-742, February 2007.
- (2007) Journal of the Acoustical Society of America , vol.121 , Issue.2 , pp. 723-742
- King, S.¹ Frankel, J.² Livescu, K.³ McDermott, E.⁴ Richmond, K.⁵ Wester, M.⁶

3
- 0001736204
- New York: Marcel Dekker Inc, ch. 8
- J. Schroeter and M. M. Sondhi, Speech coding based on physiological models of speech production. New York: Marcel Dekker Inc, 1992, ch. 8.
- (1992) Speech Coding Based on Physiological Models of Speech Production
- Schroeter, J.¹ Sondhi, M.M.²

4
- 84894560828
- Designing the user interface of the computer-based speech training system ARTUR based on early user tests
- O. Engwall, O. Bälter, A.-M. Öster, and H. Sidenbladh-Kjellström, "Designing the user interface of the computer-based speech training system ARTUR based on early user tests," Journal of Behaviour and Information Technology, vol. 25, no. 4, pp. 353-365, 2006.
- (2006) Journal of Behaviour and Information Technology , vol.25 , Issue.4 , pp. 353-365
- Engwall, O.¹ Bälter, O.² Öster, A.-M.³ Sidenbladh-Kjellström, H.⁴

5
- 22144465830
- Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion
- S. Ouni and Y. Laprie, "Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion," Journal of the Acoustical Society of America, vol. 118, no. 1, pp. 444-460, 2005.
- (2005) Journal of the Acoustical Society of America , vol.118 , Issue.1 , pp. 444-460
- Ouni, S.¹ Laprie, Y.²

6
- 0038359547
- Modelling the uncertainty in recovering articulation from acoustics
- K. Richmond, S. King, and P. Taylor, "Modelling the uncertainty in recovering articulation from acoustics," Computer Speech and Language, vol. 17, pp. 153-172, 2003.
- (2003) Computer Speech and Language , vol.17 , pp. 153-172
- Richmond, K.¹ King, S.² Taylor, P.³

7
- 38649140222
- Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
- T. Toda, A. W. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model," Speech Communication, vol. 50, pp. 215-227, 2008.
- (2008) Speech Communication , vol.50 , pp. 215-227
- Toda, T.¹ Black, A.W.² Tokuda, K.³

8
- 0010424152
- Acoustic-to-articulatory inversion using dynamical and phonological constraints
- S. Dusan and L. Deng, "Acoustic-to-articulatory inversion using dynamical and phonological constraints," in Proceedings of Seminar on Speech Production, 2000, pp. 237-240.
- (2000) Proceedings of Seminar on Speech Production , pp. 237-240
- Dusan, S.¹ Deng, L.²

9
- 2142659020
- Estimation of articulatory movements from speech acoustics using an HMM-based speech production model
- March
- S. Hiroya and M. Honda, "Estimation of articulatory movements from speech acoustics using an HMM-based speech production model," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 2, pp. 175-185, March 2004.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.2 , pp. 175-185
- Hiroya, S.¹ Honda, M.²

10
- 0032178592
- Quantitative association of vocal-tract and facial behavior
- H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Speech Communication, vol. 26, pp. 23-43, 1998.
- (1998) Speech Communication , vol.26 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis-Bateson, E.³

11
- 0036874551
- On the relationship between face movements, tongue movements, and speech acoustics
- J. Jiang, A. Alwan, P. A. Keating, E. T. Auer Jr., and L. E. Bernstein, "On the relationship between face movements, tongue movements, and speech acoustics," EURASIP Journal on Applied Signal Processing, vol. 11, pp. 1174-1188, 2002.
- (2002) EURASIP Journal on Applied Signal Processing , vol.11 , pp. 1174-1188
- Jiang, J.¹ Alwan, A.² Keating, P.A.³ Auer Jr., E.T.⁴ Bernstein, L.E.⁵

12
- 33745183111
- Introducing visual cues in acoustic-to-articulatory inversion
- O. Engwall, "Introducing visual cues in acoustic-to-articulatory inversion," in Interspeech, 2005, pp. 3205-3208.
- (2005) Interspeech , pp. 3205-3208
- Engwall, O.¹

13
- 34548378893
- Reconstructing tongue movements from audio and video
- H. Kjellström, O. Engwall, and O. Bälter, "Reconstructing tongue movements from audio and video," in Interspeech, 2006, pp. 2238-2241.
- (2006) Interspeech , pp. 2238-2241
- Kjellström, H.¹ Engwall, O.² Bälter, O.³

14
- 51449089369
- Audiovisual-to-articulatory speech inversion using active appearance models for the face and hidden Markov models for the dynamics
- A. Katsamanis, G. Papandreou, and P. Maragos, "Audiovisual-to-articulatory speech inversion using active appearance models for the face and hidden Markov models for the dynamics," in Proc. Int'l Conf. Acoustics, Speech, and Signal Processing, 2008.
- (2008) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing
- Katsamanis, A.¹ Papandreou, G.² Maragos, P.³

15
- 21844452845
- Resynthesis of facial and intraoral motion from simultaneous measurements
- J. Beskow, O. Engwall, and B. Granström, "Resynthesis of facial and intraoral motion from simultaneous measurements," in Proc. of the 15th ICPhS, 2003, pp. 431-434.
- (2003) Proc. of the 15th ICPhS , pp. 431-434
- Beskow, J.¹ Engwall, O.² Granström, B.³

16
- 79952359539
- Evaluation of speech inversion using an articulatory classifier
- O. Engwall, "Evaluation of speech inversion using an articulatory classifier," in Proc. of the Seventh International Seminar on Speech Production, 2006, pp. 431-434.
- (2006) Proc. of the Seventh International Seminar on Speech Production , pp. 431-434
- Engwall, O.¹

17
- 0004093046
- Prentice Hall
- B. D. O. Anderson and J. B. Moore, Optimal Filtering. Prentice Hall, 1979.
- (1979) Optimal Filtering
- Anderson, B.D.O.¹ Moore, J.B.²

18
- 0034170950
- Variational learning for switching state-space models
- Z. Ghahramani and G. E. Hinton, "Variational learning for switching state-space models," Neural Computation, vol. 12, no. 4, pp. 831-864, 2000.
- (2000) Neural Computation , vol.12 , Issue.4 , pp. 831-864
- Ghahramani, Z.¹ Hinton, G.E.²

19
- 48149084421
- Audiovisual-to-articulatory speech inversion using HMMs
- A. Katsamanis, G. Papandreou, and P. Maragos, "Audiovisual-to-articulatory speech inversion using HMMs," in Proceedings of International Workshop on Multimedia Signal Processing (MMSP), 2007.
- (2007) Proceedings of International Workshop on Multimedia Signal Processing (MMSP)
- Katsamanis, A.¹ Papandreou, G.² Maragos, P.³

20
- 84863732214
- Master's thesis, The Graduate School of the University of Florida
- S. Youn, Eun, "Feature selection in support vector machines," Master's thesis, The Graduate School of the University of Florida, 2002.
- (2002) Feature Selection in Support Vector Machines
- Youn, S.¹ Eun²

21
- 0037695279
- World Scientific, Singapore
- J. Suykens, T. Van Gestel, J. De Brabanter, B. De Moor, and J. Vandewalle, Least Squares Support Vector Machines. World Scientific, Singapore, 2002.
- (2002) Least Squares Support Vector Machines
- Suykens, J.¹ Van Gestel, T.² De Brabanter, J.³ De Moor, B.⁴ Vandewalle, J.⁵

22
- 0037503670
- A multichannel articulatory speech database and its application for automatic speech recognition
- [Online]
- A. Wrench and W. Hardcastle, "A multichannel articulatory speech database and its application for automatic speech recognition," in Proc. 5th Seminar on Speech Production, Kloster Seeon, Bavaria, 2000, pp. 305-308. [Online]. Available: http://www.cstr.ed.ac.uk/artic
- (2000) Proc. 5th Seminar on Speech Production, Kloster Seeon, Bavaria , pp. 305-308
- Wrench, A.¹ Hardcastle, W.²

23
- 0003822743
- Cambridge University
- S. Young, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book. Cambridge University, 1997.
- (1997) The HTK Book
- Young, S.¹ Odell, J.² Ollason, D.³ Valtchev, V.⁴ Woodland, P.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.