SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

IEEE Transactions on Audio, Speech and Language Processing

Volumn 19, Issue 6, 2011, Pages 1642-1651

Visually Derived Wiener Filters for Speech Enhancement

(2) Almajai, Ibrahim a Milner, Ben b

a UNIVERSITY OF SURREY (United Kingdom)

b UNIVERSITY OF EAST ANGLIA (United Kingdom)

Author keywords

Audio visual; maximum a posteriori; speech enhancement; Wiener filter

Indexed keywords

EID: 85008034388 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2010.2096212 Document Type: Article

Times cited : (85)

References (19)

1
- 4544290191
- Recent advances in the automatic recognition of audio-visual speech
- Sep.
- G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior “Recent advances in the automatic recognition of audio-visual speech,” Proc. IEEE, vol. 91, no. 9, pp. 1306–1326, Sep. 2003.
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.⁵

2
- 0036472941
- Extraction of visual features for lipreading
- Feb.
- I. Matthews, T. Cootes, J. Bangham, S. Cox, and R. Harvey “Extraction of visual features for lipreading,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 2, pp. 198–213, Feb. 2002.
- (2002) IEEE Trans. Pattern Anal. Mach. Intell. , vol.24 , Issue.2 , pp. 198-213
- Matthews, I.¹ Cootes, T.² Bangham, J.³ Cox, S.⁴ Harvey, R.⁵

3
- 44949122735
- Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise
- Sep.
- I. Almajai, B. Milner, and J. Darch, “Analysis of correlation between audio and visual speech features for clean audio feature prediction in noise,” in Proc. Interspeech, Sep. 2006, pp. 2470–2473.
- (2006) Proc. Interspeech , pp. 2470-2473
- Almajai, I.¹ Milner, B.² Darch, J.³

4
- 0032178592
- Quantitative association of vocal tract and facial behavior
- Oct.
- H. Yehia, P. Rubin, and E. Vatikiotis-Bateson “Quantitative association of vocal tract and facial behavior,” Speech Commun., vol. 26, no. 1, pp. 23–43, Oct. 1998.
- (1998) Speech Commun. , vol.26 , Issue.1 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis-Bateson, E.³

5
- 33744532633
- Voice activity detection based on multiple statistical models
- Jun.
- J. Chang, N. Kim, and S. Mitra “Voice activity detection based on multiple statistical models,” IEEE Trans. Signal Process., vol. 54, no. 6, pp. 1965–1976, Jun. 2006.
- (2006) IEEE Trans. Signal Process. , vol.54 , Issue.6 , pp. 1965-1976
- Chang, J.¹ Kim, N.² Mitra, S.³

6
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul.
- R. Martin “Noise power spectral density estimation based on optimal smoothing and minimum statistics,” IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504–512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

7
- 0018320733
- Enhancement of speech corrupted by acoustic noise
- Apr.
- M. Berouti, R. Schwartz, and J. Makhoul, “Enhancement of speech corrupted by acoustic noise,” in Proc. ICASSP, Apr. 1979, vol. 4, pp. 208–211.
- (1979) Proc. ICASSP , vol.4 , pp. 208-211
- Berouti, M.¹ Schwartz, R.² Makhoul, J.³

8
- 0024909863
- On the application of hidden Markov models for enhancing noisy speech
- Dec.
- Y. Ephraim, D. Malah, and B.-H. Juang “On the application of hidden Markov models for enhancing noisy speech,” IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 12, pp. 1846–1856, Dec. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.12 , pp. 1846-1856
- Ephraim, Y.¹ Malah, D.² Juang, B.-H.³

9
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Apr.
- Y. Ephraim and D. Malah “Speech enhancement using a minimum mean-square error log-spectral amplitude estimator,” IEEE Trans. Acoust., Speech, Signal Process., vol. 33, no. 2, pp. 443–445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

10
- 34447100796
- Boca Raton, FL: CRC
- P. Loizou, Speech Enhancement: Theory and Practice. Boca Raton, FL: CRC, 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.¹

11
- 0034974093
- Audio-visual enhancement of speech in noise
- Jun.
- L. Girin, J.-L. Schwartz, and G. Fang “Audio-visual enhancement of speech in noise,” J. Acoust. Soc. Amer., vol. 109, no. 6, pp. 3007–3020, Jun. 2001.
- (2001) J. Acoust. Soc. Amer. , vol.109 , Issue.6 , pp. 3007-3020
- Girin, L.¹ Schwartz, J.-L.² Fang, G.³

12
- 34547539737
- Visually-derived Wiener filters for speech enhancement
- Apr.
- I. Almajai, B. Milner, J. Darch, and S. Vaseghi, “Visually-derived Wiener filters for speech enhancement,” in Proc. ICASSP, Apr. 2007, vol. 4, pp. 585–588.
- (2007) Proc. ICASSP , vol.4 , pp. 585-588
- Almajai, I.¹ Milner, B.² Darch, J.³ Vaseghi, S.⁴

13
- 85008063585
- Speech processing, transmission and quality aspects (STQ); distributed speech recognition; extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm
- ES 202 212, ver. 1.1.1, Nov.
- ETSI, “Speech processing, transmission and quality aspects (STQ); distributed speech recognition; extended advanced front-end feature extraction algorithm; compression algorithms; back-end speech reconstruction algorithm,” ETSI STQ-Aurora DSR Working Group, ES 202 212, ver. 1.1.1, Nov. 2003.
- (2003) ETSI STQ-Aurora DSR Working Group

14
- 0035363218
- Active appearance models
- Jun.
- T. Cootes, G. Edwards, and C. Taylor “Active appearance models,” IEEE Trans. PAMI, vol. 23, no. 6, pp. 681–685, Jun. 2001.
- (2001) IEEE Trans. PAMI , vol.23 , Issue.6 , pp. 681-685
- Cootes, T.¹ Edwards, G.² Taylor, C.³

15
- 1842854571
- Continuous audio-visual digit recognition using N-best decision fusion
- June
- G. Meyer, J. Mulligan, and S. Wuerger “Continuous audio-visual digit recognition using N-best decision fusion,” Inf. Fusion, vol. 5, no. 2, pp. 91–101, June 2004.
- (2004) Inf. Fusion , vol.5 , Issue.2 , pp. 91-101
- Meyer, G.¹ Mulligan, J.² Wuerger, S.³

16
- 0004218547
- San Rafael, CA: Morgan-Kaufmann
- K. Sayood, Introduction to Data Compression. San Rafael, CA: Morgan-Kaufmann, 2000.
- (2000) Introduction to Data Compression
- Sayood, K.¹

17
- 70450164887
- Using audio-visual features for robust voice activity detection in clean and noisy speech
- Lausanne, Switzerland, Aug.
- I. Almajai and B. Milner, “Using audio-visual features for robust voice activity detection in clean and noisy speech,” in Proc. EUSIPCO, Lausanne, Switzerland, Aug. 2008.
- (2008) Proc. EUSIPCO
- Almajai, I.¹ Milner, B.²

18
- 10444256499
- Near-videore-alistic synthetic talking faces: Implementation and evaluation
- Oct.
- B. Theobald, J. Bangham, I. Matthews, and G. Cawley “Near-videore-alistic synthetic talking faces: Implementation and evaluation,” Speech Commun., vol. 44, pp. 127–140, Oct. 2004.
- (2004) Speech Commun. , vol.44 , pp. 127-140
- Theobald, B.¹ Bangham, J.² Matthews, I.³ Cawley, G.⁴

19
- 59849104058
- Nov. 2003, Tech. Rep.
- ITU, “P.835: Subjective test methodology for evaluating speech communication systems that include noise suppression algorithms,” Nov. 2003, Tech. Rep.
- P.835: Subjective test methodology for evaluating speech communication systems that include noise suppression algorithms

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.