메뉴 건너뛰기




Volumn 19, Issue 6, 2011, Pages 1642-1651

Visually Derived Wiener Filters for Speech Enhancement

Author keywords

Audio visual; maximum a posteriori; speech enhancement; Wiener filter

Indexed keywords


EID: 85008034388     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2010.2096212     Document Type: Article
Times cited : (85)

References (19)
  • 1
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audio-visual speech
    • Sep.
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior “Recent advances in the automatic recognition of audio-visual speech,” Proc. IEEE, vol. 91, no. 9, pp. 1306–1326, Sep. 2003.
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.5
  • 3
    • 44949122735 scopus 로고    scopus 로고
    • Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise
    • Sep.
    • I. Almajai, B. Milner, and J. Darch, “Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise,” in Proc. Interspeech, Sep. 2006, pp. 2470–2473.
    • (2006) Proc. Interspeech , pp. 2470-2473
    • Almajai, I.1    Milner, B.2    Darch, J.3
  • 4
    • 0032178592 scopus 로고    scopus 로고
    • Quantitative association of vocal tract and facial behavior
    • Oct.
    • H. Yehia, P. Rubin, and E. Vatikiotis-Bateson “Quantitative association of vocal tract and facial behavior,” Speech Commun., vol. 26, no. 1, pp. 23–43, Oct. 1998.
    • (1998) Speech Commun. , vol.26 , Issue.1 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3
  • 5
    • 33744532633 scopus 로고    scopus 로고
    • Voice activity detection based on multiple statistical models
    • Jun.
    • J. Chang, N. Kim, and S. Mitra “Voice activity detection based on multiple statistical models,” IEEE Trans. Signal Process., vol. 54, no. 6, pp. 1965–1976, Jun. 2006.
    • (2006) IEEE Trans. Signal Process. , vol.54 , Issue.6 , pp. 1965-1976
    • Chang, J.1    Kim, N.2    Mitra, S.3
  • 6
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul.
    • R. Martin “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504–512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 7
    • 0018320733 scopus 로고
    • Enhancement of speech corrupted by acoustic noise
    • Apr.
    • M. Berouti, R. Schwartz, and J. Makhoul, “Enhancement of speech corrupted by acoustic noise,” in Proc. ICASSP, Apr. 1979, vol. 4, pp. 208–211.
    • (1979) Proc. ICASSP , vol.4 , pp. 208-211
    • Berouti, M.1    Schwartz, R.2    Makhoul, J.3
  • 8
    • 0024909863 scopus 로고
    • On the application of hidden Markov models for enhancing noisy speech
    • Dec.
    • Y. Ephraim, D. Malah, and B.-H. Juang “On the application of hidden Markov models for enhancing noisy speech,” IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 12, pp. 1846–1856, Dec. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.12 , pp. 1846-1856
    • Ephraim, Y.1    Malah, D.2    Juang, B.-H.3
  • 9
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Apr.
    • Y. Ephraim and D. Malah “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. 33, no. 2, pp. 443–445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 11
    • 0034974093 scopus 로고    scopus 로고
    • Audio-visual enhancement of speech in noise
    • Jun.
    • L. Girin, J.-L. Schwartz, and G. Fang “Audio-visual enhancement of speech in noise,” J. Acoust. Soc. Amer., vol. 109, no. 6, pp. 3007–3020, Jun. 2001.
    • (2001) J. Acoust. Soc. Amer. , vol.109 , Issue.6 , pp. 3007-3020
    • Girin, L.1    Schwartz, J.-L.2    Fang, G.3
  • 12
    • 34547539737 scopus 로고    scopus 로고
    • Visually-derived Wiener filters for speech enhancement
    • Apr.
    • I. Almajai, B. Milner, J. Darch, and S. Vaseghi, “Visually-derived Wiener filters for speech enhancement,” in Proc. ICASSP, Apr. 2007, vol. 4, pp. 585–588.
    • (2007) Proc. ICASSP , vol.4 , pp. 585-588
    • Almajai, I.1    Milner, B.2    Darch, J.3    Vaseghi, S.4
  • 13
    • 85008063585 scopus 로고    scopus 로고
    • Speech processing, transmission and quality aspects (STQ); distributed speech recognition; extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm
    • ES 202 212, ver. 1.1.1, Nov.
    • ETSI, “Speech processing, transmission and quality aspects (STQ); distributed speech recognition; extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm,” ETSI STQ-Aurora DSR Working Group, ES 202 212, ver. 1.1.1, Nov. 2003.
    • (2003) ETSI STQ-Aurora DSR Working Group
  • 14
    • 0035363218 scopus 로고    scopus 로고
    • Active appearance models
    • Jun.
    • T. Cootes, G. Edwards, and C. Taylor “Active appearance models,” IEEE Trans. PAMI, vol. 23, no. 6, pp. 681–685, Jun. 2001.
    • (2001) IEEE Trans. PAMI , vol.23 , Issue.6 , pp. 681-685
    • Cootes, T.1    Edwards, G.2    Taylor, C.3
  • 15
    • 1842854571 scopus 로고    scopus 로고
    • Continuous audio-visual digit recognition using N-best decision fusion
    • June
    • G. Meyer, J. Mulligan, and S. Wuerger “Continuous audio-visual digit recognition using N-best decision fusion,” Inf. Fusion, vol. 5, no. 2, pp. 91–101, June 2004.
    • (2004) Inf. Fusion , vol.5 , Issue.2 , pp. 91-101
    • Meyer, G.1    Mulligan, J.2    Wuerger, S.3
  • 17
    • 70450164887 scopus 로고    scopus 로고
    • Using audio-visual features for robust voice activity detection in clean and noisy speech
    • Lausanne, Switzerland, Aug.
    • I. Almajai and B. Milner, “Using audio-visual features for robust voice activity detection in clean and noisy speech,” in Proc. EUSIPCO, Lausanne, Switzerland, Aug. 2008.
    • (2008) Proc. EUSIPCO
    • Almajai, I.1    Milner, B.2
  • 18
    • 10444256499 scopus 로고    scopus 로고
    • Near-videore-alistic synthetic talking faces: Implementation and evaluation
    • Oct.
    • B. Theobald, J. Bangham, I. Matthews, and G. Cawley “Near-videore-alistic synthetic talking faces: Implementation and evaluation,” Speech Commun., vol. 44, pp. 127–140, Oct. 2004.
    • (2004) Speech Commun. , vol.44 , pp. 127-140
    • Theobald, B.1    Bangham, J.2    Matthews, I.3    Cawley, G.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.