메뉴 건너뛰기




Volumn 5, Issue , 2006, Pages 2470-2473

Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise

Author keywords

AAM; Audio visual speech; Correlation; Formants

Indexed keywords

ACOUSTIC NOISE; CORRELATION METHODS; FORECASTING; LINEAR REGRESSION; SPEECH ENHANCEMENT;

EID: 44949122735     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (17)

References (13)
  • 1
    • 0001048664 scopus 로고
    • Visual contribution to speech intelligibility in noise
    • W.H. Sumby and I. Pollack, "Visual contribution to speech intelligibility in noise", JASA, 26(2):212-215, 1954
    • (1954) JASA , vol.26 , Issue.2 , pp. 212-215
    • Sumby, W.H.1    Pollack, I.2
  • 3
    • 1842854571 scopus 로고    scopus 로고
    • Continuous audio-visual digit recognition using N-best decision fusion
    • G. Meyer, J. Mulligan and S. Wuerger, "Continuous audio-visual digit recognition using N-best decision fusion", Information Fusion, 5:91-101, 2003
    • (2003) Information Fusion , vol.5 , pp. 91-101
    • Meyer, G.1    Mulligan, J.2    Wuerger, S.3
  • 4
    • 0034974093 scopus 로고    scopus 로고
    • Audio-visual enhancement of speech in noise
    • L. Girin, J.L. Schwartz and G. Feng, "Audio-visual enhancement of speech in noise", JASA, 6(109):3007-3020, 2001
    • (2001) JASA , vol.6 , Issue.109 , pp. 3007-3020
    • Girin, L.1    Schwartz, J.L.2    Feng, G.3
  • 5
    • 84901700762 scopus 로고    scopus 로고
    • Audiovisual speech enhancement based on association between speech envelope and video features
    • F. Berthommier, "Audiovisual speech enhancement based on association between speech envelope and video features", Proc. Eurospeech, 2003
    • (2003) Proc. Eurospeech
    • Berthommier, F.1
  • 6
    • 0032178592 scopus 로고    scopus 로고
    • Quantitative association of vocal-tract and facial behaviour
    • H. Yehia, P. Rubin and E. Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behaviour", Speech Communication, 26(1):23-43, 1998
    • (1998) Speech Communication , vol.26 , Issue.1 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3
  • 8
    • 0003417748 scopus 로고    scopus 로고
    • Statistical models of appearance for computer vision,
    • Draft report, University of Manchester, UK
    • T.F. Cootes and C.J. Taylor, "Statistical models of appearance for computer vision," Draft report, University of Manchester, UK, 2001, http://www.isbe.man.ac.uk
    • (2001)
    • Cootes, T.F.1    Taylor, C.J.2
  • 9
    • 23744446244 scopus 로고    scopus 로고
    • Predicting fundamental frequency from MFCCs to enable speech reconstruction
    • X. Shao and B.P. Milner, "Predicting fundamental frequency from MFCCs to enable speech reconstruction", JASA 118(2): 1134-1143, 2005
    • (2005) JASA , vol.118 , Issue.2 , pp. 1134-1143
    • Shao, X.1    Milner, B.P.2
  • 10
    • 85022117615 scopus 로고    scopus 로고
    • A. Sorin and T. Ramabadran, Extended advanced front end algorithm description, Version 1.1, ETSI STQ Aurora DSR Working Group, Tech. Rep. ES 202 212, 2003
    • A. Sorin and T. Ramabadran, "Extended advanced front end algorithm description, Version 1.1", ETSI STQ Aurora DSR Working Group, Tech. Rep. ES 202 212, 2003
  • 11
    • 33947125734 scopus 로고    scopus 로고
    • A formant tracking LP model for speech processing in car/train noise
    • Q. Yan, E. Zavarehei, S. Vaseghi and D. Rentzos, "A formant tracking LP model for speech processing in car/train noise", Proc. ICSLP, 2004.
    • (2004) Proc. ICSLP
    • Yan, Q.1    Zavarehei, E.2    Vaseghi, S.3    Rentzos, D.4
  • 13
    • 10444259183 scopus 로고    scopus 로고
    • Visual speech synthesis using shape and appearance models,
    • PhD Thesis. School of Computing Sciences, University of East Anglia, Norwich, UK
    • B. Theobald. "Visual speech synthesis using shape and appearance models," PhD Thesis. School of Computing Sciences, University of East Anglia, Norwich, UK, 2003.
    • (2003)
    • Theobald, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.