메뉴 건너뛰기




Volumn 5, Issue 4, 2013, Pages 504-516

A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios

Author keywords

Blind channel identification; Noisy and reverberated environments; Real time signal processing; Speaker diarization; Speech dereverberation; Speech enhancement

Indexed keywords

BLIND CHANNEL IDENTIFICATION; NOISY AND REVERBERATED ENVIRONMENTS; REAL-TIME SIGNAL PROCESSING; SPEAKER DIARIZATION; SPEECH DEREVERBERATION;

EID: 84890118085     PISSN: 18669956     EISSN: 18669964     Source Type: Journal    
DOI: 10.1007/s12559-012-9176-x     Document Type: Article
Times cited : (15)

References (46)
  • 4
    • 51449111990 scopus 로고    scopus 로고
    • Overlapped speech detection for improved speaker diarization in multiparty meetings
    • ICASSP 2008. IEEE international conference on, IEEE
    • Boakye K, Trueba-Hornero B, Vinyals O, Friedland G. Overlapped speech detection for improved speaker diarization in multiparty meetings. In: Acoustics, speech and signal processing, 2008. ICASSP 2008. IEEE international conference on; 2008. p. 4353-6. IEEE.
    • (2008) In: Acoustics, speech and signal processing, 2008 , pp. 4353-4356
    • Boakye, K.1    Trueba-Hornero, B.2    Vinyals, O.3    Friedland, G.4
  • 5
    • 80052264769 scopus 로고    scopus 로고
    • Extracting and associating meta-features for understanding peoples emotional behaviour: face and speech
    • Bourbakis N, Esposito A, Kavraki D. Extracting and associating meta-features for understanding peoples emotional behaviour: face and speech. Cognit Comput. 2011;3(3): 436-48.
    • (2011) Cognit Comput , vol.3 , Issue.3 , pp. 436-448
    • Bourbakis, N.1    Esposito, A.2    Kavraki, D.3
  • 7
    • 77951470786 scopus 로고    scopus 로고
    • Time-scale feature extractions for emotional speech characterization
    • Chetouani M, Mahdhaoui A, Ringeval F. Time-scale feature extractions for emotional speech characterization. Cognit Comput. 2009;1(2): 194-201.
    • (2009) Cognit Comput , vol.1 , Issue.2 , pp. 194-201
    • Chetouani, M.1    Mahdhaoui, A.2    Ringeval, F.3
  • 9
    • 18744381309 scopus 로고    scopus 로고
    • Tikhonov regularization applied to the inverse problem of option pricing: convergence analysis and rates
    • Egger H, Engl H. Tikhonov regularization applied to the inverse problem of option pricing: convergence analysis and rates. Inverse Probl. 2005;21(3): 1027-45.
    • (2005) Inverse Probl , vol.21 , Issue.3 , pp. 1027-1045
    • Egger, H.1    Engl, H.2
  • 10
    • 77955707186 scopus 로고    scopus 로고
    • A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech
    • Falk T, Zheng C, Chan W. A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech. IEEE Trans Audio Speech Lang Processing. 2010;18(7): 1766-1774.
    • (2010) IEEE Trans Audio Speech Lang Processing , vol.18 , Issue.7 , pp. 1766-1774
    • Falk, T.1    Zheng, C.2    Chan, W.3
  • 14
    • 38149064545 scopus 로고    scopus 로고
    • Energy constrained frequency-domain normalized lms algorithm for blind channel identification
    • Haque M, Bashar M, Naylor P, Hirose K, Hasan M. Energy constrained frequency-domain normalized lms algorithm for blind channel identification. Signal Image Video Process. 2007;1: 203-213.
    • (2007) Signal Image Video Process , vol.1 , pp. 203-213
    • Haque, M.1    Bashar, M.2    Naylor, P.3    Hirose, K.4    Hasan, M.5
  • 15
    • 67650124186 scopus 로고    scopus 로고
    • Noise robust multichannel frequency-domain lms algorithms for blind channel identification
    • Haque M, Hasan M. Noise robust multichannel frequency-domain lms algorithms for blind channel identification. IEEE Signal Process Lett. 2008;15: 305-8.
    • (2008) IEEE Signal Process Lett , vol.15 , pp. 305-308
    • Haque, M.1    Hasan, M.2
  • 17
    • 34247241719 scopus 로고    scopus 로고
    • Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations
    • Hikichi T, Delcroix M, Miyoshi M. Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations. EURASIP J Adv Signal Process. 2007;1: 1-12.
    • (2007) EURASIP J Adv Signal Process , vol.2007 , Issue.1 , pp. 1-12
    • Hikichi, T.1    Delcroix, M.2    Miyoshi, M.3
  • 18
    • 0037235030 scopus 로고    scopus 로고
    • A class of frequency-domain adaptive approaches to blind multichannel identification
    • Huang Y, Benesty J. A class of frequency-domain adaptive approaches to blind multichannel identification. IEEE Trans Speech Audio Process. 2003;51(1): 11-24.
    • (2003) IEEE Trans Speech Audio Process , vol.51 , Issue.1 , pp. 11-24
    • Huang, Y.1    Benesty, J.2
  • 21
    • 0035303151 scopus 로고    scopus 로고
    • Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise
    • IET
    • Hussain A, Campbell D. Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise. In: Vision, image and signal processing, IEE proceedings-; 2001. vol 148, p. 127-32. IET.
    • (2001) In: Vision, image and signal processing, IEE proceedings , vol.148 , pp. 127-132
    • Hussain, A.1    Campbell, D.2
  • 23
    • 35348904453 scopus 로고    scopus 로고
    • Speech intelligibility improvement using convolutive blind source separation assisted by denoising algorithms
    • Kocinski J. Speech intelligibility improvement using convolutive blind source separation assisted by denoising algorithms. Speech Commun. 2008;50(1): 29-37.
    • (2008) Speech Commun , vol.50 , Issue.1 , pp. 29-37
    • Kocinski, J.1
  • 26
    • 77957725494 scopus 로고    scopus 로고
    • Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions
    • Loizou P, Kim G. Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions. IEEE Trans Audio Speech Lang Processing. 2011;19(1): 47-56.
    • (2011) IEEE Trans Audio Speech Lang Processing , vol.19 , Issue.1 , pp. 47-56
    • Loizou, P.1    Kim, G.2
  • 27
    • 0023961145 scopus 로고
    • Inverse filtering of room acoustics
    • Miyoshi M, Kaneda Y. Inverse filtering of room acoustics. IEEE Trans Signal Process. 1988;36(2): 145-52.
    • (1988) IEEE Trans Signal Process , vol.36 , Issue.2 , pp. 145-152
    • Miyoshi, M.1    Kaneda, Y.2
  • 28
    • 0032123981 scopus 로고    scopus 로고
    • On the evaluation of estimated impulse responses
    • Morgan D, Benesty J, Sondhi M. On the evaluation of estimated impulse responses. IEEE Signal Process Lett. 1998;5(7): 174-76.
    • (1998) IEEE Signal Process Lett , vol.5 , Issue.7 , pp. 174-176
    • Morgan, D.1    Benesty, J.2    Sondhi, M.3
  • 31
    • 78751665098 scopus 로고    scopus 로고
    • Comparative evaluation of single-channel mmse-based noise reduction schemes for speech recognition
    • doi: 10. 1155/2010/962103
    • Principi E, Cifani S, Rotili R, Squartini S, Piazza F. Comparative evaluation of single-channel mmse-based noise reduction schemes for speech recognition. J Electr Comput Eng. 2010; p. 1-7. doi: 10. 1155/2010/962103. http://www. hindawi. com/journals/jece/2010/962103. html.
    • (2010) J Electr Comput Eng , pp. 1-7
    • Principi, E.1    Cifani, S.2    Rotili, R.3    Squartini, S.4    Piazza, F.5
  • 32
    • 84870412815 scopus 로고    scopus 로고
    • Real-time activity detection in a multi-talker reverberated environment
    • doi: 10. 1007/s12559-012-9133-8
    • Principi E, Rotili R, Wöllmer M, Eyben F, Squartini S, Schuller B. Real-time activity detection in a multi-talker reverberated environment. Cognit Comput. p. 1-12. doi: 10. 1007/s12559-012-9133-8.
    • Cognit Comput , pp. 1-12
    • Principi, E.1    Rotili, R.2    Wöllmer, M.3    Eyben, F.4    Squartini, S.5    Schuller, B.6
  • 34
    • 62949218524 scopus 로고    scopus 로고
    • A robust iterative inverse filtering approach for speech dereverberation in presence of disturbances
    • Rotili R, Cifani S, Principi E, Squartini S, Piazza F. A robust iterative inverse filtering approach for speech dereverberation in presence of disturbances. In: Proceedings of IEEE APCCAS; 2008. p. 434-7.
    • (2008) In: Proceedings of IEEE APCCAS , pp. 434-437
    • Rotili, R.1    Cifani, S.2    Principi, E.3    Squartini, S.4    Piazza, F.5
  • 35
    • 77957007047 scopus 로고    scopus 로고
    • Joint multichannel blind speech separation and dereverberation: a real-time algorithmic implementation
    • Rotili R, De Simone C, Perelli A, Cifani A, Squartini S. Joint multichannel blind speech separation and dereverberation: a real-time algorithmic implementation. In: Proceedings of ICIC; 2010. p. 85-93.
    • (2010) In: Proceedings of ICIC , pp. 85-93
    • Rotili, R.1    De Simone, C.2    Perelli, A.3    Cifani, A.4    Squartini, S.5
  • 38
    • 79960846940 scopus 로고    scopus 로고
    • Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge
    • Schuller B, Batliner A, Steidl S, Seppi D. Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Commun. (2011);53(9/10): 1062-87.
    • (2011) Speech Commun , vol.53 , Issue.9-10 , pp. 1062-1087
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 39
    • 77956647570 scopus 로고    scopus 로고
    • Non-linear and non-conventional speech processing: alternative techniques
    • Solé-Casals J, Zaiats V, Monte-Moreno E. Non-linear and non-conventional speech processing: alternative techniques. Cognit Comput. 2010;2(3): 133-4.
    • (2010) Cognit Comput , vol.2 , Issue.3 , pp. 133-134
    • Solé-Casals, J.1    Zaiats, V.2    Monte-Moreno, E.3
  • 40
    • 82655173885 scopus 로고    scopus 로고
    • Environmental robust speech and speaker recognition through multi-channel histogram equalization
    • Squartini S, Principi E, Rotili R, Piazza F. Environmental robust speech and speaker recognition through multi-channel histogram equalization. Neurocomputing. 2012;78(1): 111-120.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 111-120
    • Squartini, S.1    Principi, E.2    Rotili, R.3    Piazza, F.4
  • 42
    • 79955034745 scopus 로고    scopus 로고
    • Recognition of nonprototypical emotions in reverberated and noisy speech by nonnegative matrix factorization
    • Weninger F, Schuller B, Batliner A, Steidl S, Seppi D Recognition of nonprototypical emotions in reverberated and noisy speech by nonnegative matrix factorization. EURASIP J Adv Signal Process. 2011;11: 1-16.
    • (2011) EURASIP J Adv Signal Process , vol.11 , pp. 1-16
    • Weninger, F.1    Schuller, B.2    Batliner, A.3    Steidl, S.4    Seppi, D.5
  • 43
    • 78651563436 scopus 로고    scopus 로고
    • Bidirectional lstm networks for context-sensitive keyword detection in a cognitive virtual agent framework
    • Wöllmer M, Eyben F, Graves A, Schuller B, Rigoll G. Bidirectional lstm networks for context-sensitive keyword detection in a cognitive virtual agent framework. Cognit Comput. 2010;2(3): 180-90.
    • (2010) Cognit Comput , vol.2 , Issue.3 , pp. 180-190
    • Wöllmer, M.1    Eyben, F.2    Graves, A.3    Schuller, B.4    Rigoll, G.5
  • 44
    • 81355147535 scopus 로고    scopus 로고
    • Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
    • Wöllmer M, Marchi E, Squartini S, Schuller B. Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting. Cogn Neurodyn. 2011;5(3): 253-64.
    • (2011) Cogn Neurodyn , vol.5 , Issue.3 , pp. 253-264
    • Wöllmer, M.1    Marchi, E.2    Squartini, S.3    Schuller, B.4
  • 46
    • 0029532509 scopus 로고
    • A least-squares approach to blind channel identification
    • Xu G, Liu H, Tong L, Kailath T. A least-squares approach to blind channel identification. IEEE Trans Signal Process. 1995;43(12): 2982-93.
    • (1995) IEEE Trans Signal Process , vol.43 , Issue.12 , pp. 2982-2993
    • Xu, G.1    Liu, H.2    Tong, L.3    Kailath, T.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.