SCOPUS 정보 검색 플랫폼

INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP

Volumn 5, Issue , 2006, Pages 2470-2473

Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise

(3) Almajai, Ibrahim a Milner, Ben a Darch, Jonathan a

a UNIVERSITY OF EAST ANGLIA (United Kingdom)

Author keywords

AAM; Audio visual speech; Correlation; Formants

Indexed keywords

ACOUSTIC NOISE; CORRELATION METHODS; FORECASTING; LINEAR REGRESSION; SPEECH ENHANCEMENT;

ACTIVE APPEARANCE MODELS; AUDIO-VISUAL SPEECH; CORRELATION MEASUREMENT; FORMANTS; MAXIMUM A POSTERIORI; MULTIPLE LINEAR REGRESSIONS; ORIGINAL AUDIO SIGNAL; VISUAL SPEECH FEATURES;

AUDIO ACOUSTICS;

EID: 44949122735 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (17)

References (13)

1
- 0001048664
- Visual contribution to speech intelligibility in noise
- W.H. Sumby and I. Pollack, "Visual contribution to speech intelligibility in noise", JASA, 26(2):212-215, 1954
- (1954) JASA , vol.26 , Issue.2 , pp. 212-215
- Sumby, W.H.¹ Pollack, I.²

2
- 0004052871
- Audio-visual speech recognition
- Technical Report, Center for Language and Speech Processing, Baltimore, Maryland
- C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sison, A. Mashari and J. Zhou, "Audio-visual speech recognition", Technical Report, Center for Language and Speech Processing, Baltimore, Maryland, 2000
- (2000)
- Neti, C.¹ Potamianos, G.² Luettin, J.³ Matthews, I.⁴ Glotin, H.⁵ Vergyri, D.⁶ Sison, J.⁷ Mashari, A.⁸ Zhou, J.⁹

3
- 1842854571
- Continuous audio-visual digit recognition using N-best decision fusion
- G. Meyer, J. Mulligan and S. Wuerger, "Continuous audio-visual digit recognition using N-best decision fusion", Information Fusion, 5:91-101, 2003
- (2003) Information Fusion , vol.5 , pp. 91-101
- Meyer, G.¹ Mulligan, J.² Wuerger, S.³

4
- 0034974093
- Audio-visual enhancement of speech in noise
- L. Girin, J.L. Schwartz and G. Feng, "Audio-visual enhancement of speech in noise", JASA, 6(109):3007-3020, 2001
- (2001) JASA , vol.6 , Issue.109 , pp. 3007-3020
- Girin, L.¹ Schwartz, J.L.² Feng, G.³

5
- 84901700762
- Audiovisual speech enhancement based on association between speech envelope and video features
- F. Berthommier, "Audiovisual speech enhancement based on association between speech envelope and video features", Proc. Eurospeech, 2003
- (2003) Proc. Eurospeech
- Berthommier, F.¹

6
- 0032178592
- Quantitative association of vocal-tract and facial behaviour
- H. Yehia, P. Rubin and E. Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behaviour", Speech Communication, 26(1):23-43, 1998
- (1998) Speech Communication , vol.26 , Issue.1 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis-Bateson, E.³

7
- 0036874551
- On the relationship between face movements, tongue movements and speech acoustics
- J. Jiang, A. Alwan, P.A. Keating, E.T. Auer and L.E. Bemstein, "On the relationship between face movements, tongue movements and speech acoustics", EURASIP Journal on Applied Sig. Proc., 11:1174-1188, 2002
- (2002) EURASIP Journal on Applied Sig. Proc , vol.11 , pp. 1174-1188
- Jiang, J.¹ Alwan, A.² Keating, P.A.³ Auer, E.T.⁴ Bemstein, L.E.⁵

8
- 0003417748
- Statistical models of appearance for computer vision,
- Draft report, University of Manchester, UK
- T.F. Cootes and C.J. Taylor, "Statistical models of appearance for computer vision," Draft report, University of Manchester, UK, 2001, http://www.isbe.man.ac.uk
- (2001)
- Cootes, T.F.¹ Taylor, C.J.²

9
- 23744446244
- Predicting fundamental frequency from MFCCs to enable speech reconstruction
- X. Shao and B.P. Milner, "Predicting fundamental frequency from MFCCs to enable speech reconstruction", JASA 118(2): 1134-1143, 2005
- (2005) JASA , vol.118 , Issue.2 , pp. 1134-1143
- Shao, X.¹ Milner, B.P.²

10
- 85022117615
- A. Sorin and T. Ramabadran, Extended advanced front end algorithm description, Version 1.1, ETSI STQ Aurora DSR Working Group, Tech. Rep. ES 202 212, 2003
- A. Sorin and T. Ramabadran, "Extended advanced front end algorithm description, Version 1.1", ETSI STQ Aurora DSR Working Group, Tech. Rep. ES 202 212, 2003

11
- 33947125734
- A formant tracking LP model for speech processing in car/train noise
- Q. Yan, E. Zavarehei, S. Vaseghi and D. Rentzos, "A formant tracking LP model for speech processing in car/train noise", Proc. ICSLP, 2004.
- (2004) Proc. ICSLP
- Yan, Q.¹ Zavarehei, E.² Vaseghi, S.³ Rentzos, D.⁴

12
- 0003729218
- John Wiley and Sons, Canada
- S. Chatterjee, A.S. Hadi, and B. Price, "Regression Analysis By Example", John Wiley and Sons, Canada, 2000
- (2000) Regression Analysis By Example
- Chatterjee, S.¹ Hadi, A.S.² Price, B.³

13
- 10444259183
- Visual speech synthesis using shape and appearance models,
- PhD Thesis. School of Computing Sciences, University of East Anglia, Norwich, UK
- B. Theobald. "Visual speech synthesis using shape and appearance models," PhD Thesis. School of Computing Sciences, University of East Anglia, Norwich, UK, 2003.
- (2003)
- Theobald, B.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.