SCOPUS 정보 검색 플랫폼

Volumn 5, Issue 4, 2013, Pages 504-516

A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios

(4) Rotili, Rudy a Principi, Emanuele a Squartini, Stefano a Schuller, Björn b

a UNIVERSITÀ POLITECNICA DELLE MARCHE (Italy)

b TECHNICAL UNIVERSITY OF MUNICH (Germany)

Author keywords

Blind channel identification; Noisy and reverberated environments; Real time signal processing; Speaker diarization; Speech dereverberation; Speech enhancement

Indexed keywords

BLIND CHANNEL IDENTIFICATION; NOISY AND REVERBERATED ENVIRONMENTS; REAL-TIME SIGNAL PROCESSING; SPEAKER DIARIZATION; SPEECH DEREVERBERATION;

ALGORITHMS; COMPUTER SIMULATION; SIGNAL PROCESSING; SPEECH ENHANCEMENT;

REVERBERATION;

EID: 84890118085 PISSN: 18669956 EISSN: 18669964 Source Type: Journal
DOI: 10.1007/s12559-012-9176-x Document Type: Article

Times cited : (15)

References (46)

1
- 79957991229
- Online meeting recognizer with multichannel speaker diarization
- IEEE
- Araki S, Hori T, Fujimoto M, Watanabe S, Yoshioka T, Nakatani T, Nakamura A. Online meeting recognizer with multichannel speaker diarization. In: Signals, systems and computers (ASILOMAR), 2010 conference record of the forty fourth asilomar conference on. 2010. p. 1697-701. IEEE.
- (2010) In: Signals, systems and computers (ASILOMAR), 2010 conference record of the forty fourth asilomar conference on , pp. 1697-1701
- Araki, S.¹ Hori, T.² Fujimoto, M.³ Watanabe, S.⁴ Yoshioka, T.⁵ Nakatani, T.⁶ Nakamura, A.⁷

2
- 77950296082
- 1st edn. Springer Publishing Company, Incorporated
- Benesty J, Chen J, Huang Y, Cohen I. Noise reduction in speech processing. 1st edn. Springer Publishing Company, Incorporated. 2009.
- (2009) Noise reduction in speech processing
- Benesty, J.¹ Chen, J.² Huang, Y.³ Cohen, I.⁴

3
- 84866496272
- NU-Tech: implementing DSP algorithms in a plug-in based software platform for real time audio applications
- Paper number 6389
- Bettarelli F, Ciavattini E, Lattanzi A, Zallocco D, Squartini S, Piazza F. NU-Tech: implementing DSP algorithms in a plug-in based software platform for real time audio applications. In: Proceedings of 118th convention of the AES; 2005. p. 1-12. Paper number 6389.
- (2005) In: Proceedings of 118th convention of the AES , pp. 1-12
- Bettarelli, F.¹ Ciavattini, E.² Lattanzi, A.³ Zallocco, D.⁴ Squartini, S.⁵ Piazza, F.⁶

4
- 51449111990
- Overlapped speech detection for improved speaker diarization in multiparty meetings
- ICASSP 2008. IEEE international conference on, IEEE
- Boakye K, Trueba-Hornero B, Vinyals O, Friedland G. Overlapped speech detection for improved speaker diarization in multiparty meetings. In: Acoustics, speech and signal processing, 2008. ICASSP 2008. IEEE international conference on; 2008. p. 4353-6. IEEE.
- (2008) In: Acoustics, speech and signal processing, 2008 , pp. 4353-4356
- Boakye, K.¹ Trueba-Hornero, B.² Vinyals, O.³ Friedland, G.⁴

5
- 80052264769
- Extracting and associating meta-features for understanding peoples emotional behaviour: face and speech
- Bourbakis N, Esposito A, Kavraki D. Extracting and associating meta-features for understanding peoples emotional behaviour: face and speech. Cognit Comput. 2011;3(3): 436-48.
- (2011) Cognit Comput , vol.3 , Issue.3 , pp. 436-448
- Bourbakis, N.¹ Esposito, A.² Kavraki, D.³

6
- 33745530242
- The AMI meeting corpus: a pre-announcement
- Carletta J, Ashby S, Bourban S, Flynn M, Guillemot M, Hain T, et al. The AMI meeting corpus: a pre-announcement. Machine Learning for Multimodal Interaction; 2006. p. 28-39.
- (2006) Machine Learning for Multimodal Interaction , pp. 28-39
- Carletta, J.¹ Ashby, S.² Bourban, S.³ Flynn, M.⁴ Guillemot, M.⁵ Hain, T.⁶

7
- 77951470786
- Time-scale feature extractions for emotional speech characterization
- Chetouani M, Mahdhaoui A, Ringeval F. Time-scale feature extractions for emotional speech characterization. Cognit Comput. 2009;1(2): 194-201.
- (2009) Cognit Comput , vol.1 , Issue.2 , pp. 194-201
- Chetouani, M.¹ Mahdhaoui, A.² Ringeval, F.³

8
- 84863662539
- Springer: Springer Topics in Signal Processing
- Cohen I, Benesty J, Gannot S. Speech processing in modern communication: challenges and perspectives. Springer Topics in Signal Processing: Springer; 2010.
- (2010) Speech Processing in Modern Communication: Challenges and Perspectives
- Cohen, I.¹ Benesty, J.² Gannot, S.³

9
- 18744381309
- Tikhonov regularization applied to the inverse problem of option pricing: convergence analysis and rates
- Egger H, Engl H. Tikhonov regularization applied to the inverse problem of option pricing: convergence analysis and rates. Inverse Probl. 2005;21(3): 1027-45.
- (2005) Inverse Probl , vol.21 , Issue.3 , pp. 1027-1045
- Egger, H.¹ Engl, H.²

10
- 77955707186
- A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech
- Falk T, Zheng C, Chan W. A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech. IEEE Trans Audio Speech Lang Processing. 2010;18(7): 1766-1774.
- (2010) IEEE Trans Audio Speech Lang Processing , vol.18 , Issue.7 , pp. 1766-1774
- Falk, T.¹ Zheng, C.² Chan, W.³

11
- 79956279915
- The LIA-EURECOM RT'09 speaker diarization system
- Fredouille C, Bozonnet S, Evans N. The LIA-EURECOM RT'09 speaker diarization system. In: RT'09, NIST rich transcription workshop. Melbourne, Florida; 2009. p. 1-10.
- (2009) In: RT'09, NIST rich transcription workshop. Melbourne, Florida , pp. 1-10
- Fredouille, C.¹ Bozonnet, S.² Evans, N.³

12
- 33646776397
- Iterative algorithms for multichannel equalization in sound reproduction systems
- p. iii/269-iii/272
- Guillaume M, Grenier Y, Richard G. Iterative algorithms for multichannel equalization in sound reproduction systems. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing. 2005. vol 3, p. iii/269-iii/272.
- (2005) In: Proceedings of IEEE international conference on acoustics, speech, and signal processing , vol.3
- Guillaume, M.¹ Grenier, Y.² Richard, G.³

13
- 62749179361
- Accessed 2 Oct 2011
- Habets E. Room impulse response (RIR) generator. 2008. http://home. tiscali. nl/ehabets/rirgenerator. html. Accessed 2 Oct 2011.
- (2008) Room impulse response (RIR) generator
- Habets, E.¹

14
- 38149064545
- Energy constrained frequency-domain normalized lms algorithm for blind channel identification
- Haque M, Bashar M, Naylor P, Hirose K, Hasan M. Energy constrained frequency-domain normalized lms algorithm for blind channel identification. Signal Image Video Process. 2007;1: 203-213.
- (2007) Signal Image Video Process , vol.1 , pp. 203-213
- Haque, M.¹ Bashar, M.² Naylor, P.³ Hirose, K.⁴ Hasan, M.⁵

15
- 67650124186
- Noise robust multichannel frequency-domain lms algorithms for blind channel identification
- Haque M, Hasan M. Noise robust multichannel frequency-domain lms algorithms for blind channel identification. IEEE Signal Process Lett. 2008;15: 305-8.
- (2008) IEEE Signal Process Lett , vol.15 , pp. 305-308
- Haque, M.¹ Hasan, M.²

16
- 84863709147
- Improving robustness of blind adaptive multichannel identification algorithms using constraints
- Hasan M, Benesty J, Naylor P, Ward D. Improving robustness of blind adaptive multichannel identification algorithms using constraints. In: Proceedings of European signal processing conference (EUSIPCO), Antalya, Turkey; 2005. vol 1, p. 11-4.
- (2005) In: Proceedings of European signal processing conference (EUSIPCO), Antalya, Turkey , vol.1 , pp. 11-14
- Hasan, M.¹ Benesty, J.² Naylor, P.³ Ward, D.⁴

17
- 34247241719
- Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations
- Hikichi T, Delcroix M, Miyoshi M. Inverse filtering for speech dereverberation less sensitive to noise and room transfer function fluctuations. EURASIP J Adv Signal Process. 2007;1: 1-12.
- (2007) EURASIP J Adv Signal Process , vol.2007 , Issue.1 , pp. 1-12
- Hikichi, T.¹ Delcroix, M.² Miyoshi, M.³

18
- 0037235030
- A class of frequency-domain adaptive approaches to blind multichannel identification
- Huang Y, Benesty J. A class of frequency-domain adaptive approaches to blind multichannel identification. IEEE Trans Speech Audio Process. 2003;51(1): 11-24.
- (2003) IEEE Trans Speech Audio Process , vol.51 , Issue.1 , pp. 11-24
- Huang, Y.¹ Benesty, J.²

19
- 70450179489
- Speech overlap detection in a two-pass speaker diarization system
- Huijbregts M, van Leeuwen DA, de Jong FMG. Speech overlap detection in a two-pass speaker diarization system. In: INTERSPEECH'09; 2009. p. 1063-6.
- (2009) In: INTERSPEECH'09 , pp. 1063-1066
- Huijbregts, M.¹ van Leeuwen, D.A.² de Jong, F.M.G.³

20
- 79951587324
- Estimating dominance in multi-party meetings using speaker diarization
- Hung H, Huang Y, Friedland G, Gatica-Perez D. Estimating dominance in multi-party meetings using speaker diarization. IEEE Trans Audio Speech Lang Processing. 2011;19(4): 847-60.
- (2011) IEEE Trans Audio Speech Lang Processing , vol.19 , Issue.4 , pp. 847-860
- Hung, H.¹ Huang, Y.² Friedland, G.³ Gatica-Perez, D.⁴

21
- 0035303151
- Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise
- IET
- Hussain A, Campbell D. Intelligibility improvements using binaural diverse sub-band processing applied to speech corrupted with automobile noise. In: Vision, image and signal processing, IEE proceedings-; 2001. vol 148, p. 127-32. IET.
- (2001) In: Vision, image and signal processing, IEE proceedings , vol.148 , pp. 127-132
- Hussain, A.¹ Campbell, D.²

22
- 39149087385
- Nonlinear speech enhancement: an overview
- doi: 10. 1007/978-3-540-71505-4_12
- Hussain A, Chetouani M, Squartini S, Bastari A, Piazza F. Nonlinear speech enhancement: an overview. In: Progress in non-linear speech processing, Lecture notes in computer science; 2007. vol 4391, p. 217-48. doi: 10. 1007/978-3-540-71505-4_12.
- (2007) In: Progress in non-linear speech processing, Lecture notes in computer science , vol.4391 , pp. 217-248
- Hussain, A.¹ Chetouani, M.² Squartini, S.³ Bastari, A.⁴ Piazza, F.⁵

23
- 35348904453
- Speech intelligibility improvement using convolutive blind source separation assisted by denoising algorithms
- Kocinski J. Speech intelligibility improvement using convolutive blind source separation assisted by denoising algorithms. Speech Commun. 2008;50(1): 29-37.
- (2008) Speech Commun , vol.50 , Issue.1 , pp. 29-37
- Kocinski, J.¹

24
- 84890129629
- Joint noise and reverberation suppression for speech applications
- Kokkinis EK, Tsilfidis A, Georganti E, Mourjopoulos J. Joint noise and reverberation suppression for speech applications. In: Proceedings of the 130th convention of the audio engineering society; 2011. vol 9, p. 10-62.
- (2011) In: Proceedings of the 130th convention of the audio engineering society , vol.9 , pp. 10-62
- Kokkinis, E.K.¹ Tsilfidis, A.² Georganti, E.³ Mourjopoulos, J.⁴

25
- 34447100796
- Loizou P. Speech enhancement: theory and practice (Signal processing and communications). CRC; 2007.
- (2007) Speech enhancement: Theory and practice (Signal processing and communications). CRC
- Loizou, P.¹

26
- 77957725494
- Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions
- Loizou P, Kim G. Reasons why current speech-enhancement algorithms do not improve speech intelligibility and suggested solutions. IEEE Trans Audio Speech Lang Processing. 2011;19(1): 47-56.
- (2011) IEEE Trans Audio Speech Lang Processing , vol.19 , Issue.1 , pp. 47-56
- Loizou, P.¹ Kim, G.²

27
- 0023961145
- Inverse filtering of room acoustics
- Miyoshi M, Kaneda Y. Inverse filtering of room acoustics. IEEE Trans Signal Process. 1988;36(2): 145-52.
- (1988) IEEE Trans Signal Process , vol.36 , Issue.2 , pp. 145-152
- Miyoshi, M.¹ Kaneda, Y.²

28
- 0032123981
- On the evaluation of estimated impulse responses
- Morgan D, Benesty J, Sondhi M. On the evaluation of estimated impulse responses. IEEE Signal Process Lett. 1998;5(7): 174-76.
- (1998) IEEE Signal Process Lett , vol.5 , Issue.7 , pp. 174-176
- Morgan, D.¹ Benesty, J.² Sondhi, M.³

29
- 80051618981
- Heidelberg: Springer
- Naylor P, Gaubitch N. Speech dereverberation. Signals and communication technology. Heidelberg: Springer; 2010.
- (2010) Speech Dereverberation. Signals and Communication Technology
- Naylor, P.¹ Gaubitch, N.²

30
- 0003513556
- Upper Saddle River: Prentice Hall
- Oppenheim AV, Schafer RW, Buck JR. Discrete-time signal processing, 2 edn. Upper Saddle River: Prentice Hall; 1999.
- (1999) Discrete-Time Signal Processing, 2 Edn
- Oppenheim, A.V.¹ Schafer, R.W.² Buck, J.R.³

31
- 78751665098
- Comparative evaluation of single-channel mmse-based noise reduction schemes for speech recognition
- doi: 10. 1155/2010/962103
- Principi E, Cifani S, Rotili R, Squartini S, Piazza F. Comparative evaluation of single-channel mmse-based noise reduction schemes for speech recognition. J Electr Comput Eng. 2010; p. 1-7. doi: 10. 1155/2010/962103. http://www. hindawi. com/journals/jece/2010/962103. html.
- (2010) J Electr Comput Eng , pp. 1-7
- Principi, E.¹ Cifani, S.² Rotili, R.³ Squartini, S.⁴ Piazza, F.⁵

32
- 84870412815
- Real-time activity detection in a multi-talker reverberated environment
- doi: 10. 1007/s12559-012-9133-8
- Principi E, Rotili R, Wöllmer M, Eyben F, Squartini S, Schuller B. Real-time activity detection in a multi-talker reverberated environment. Cognit Comput. p. 1-12. doi: 10. 1007/s12559-012-9133-8.
- Cognit Comput , pp. 1-12
- Principi, E.¹ Rotili, R.² Wöllmer, M.³ Eyben, F.⁴ Squartini, S.⁵ Schuller, B.⁶

33
- 84865145664
- Dominance detection in a reverberated acoustic scenario
- Springer
- Principi E, Rotili R, Wöllmer M, Squartini S, Schuller B. Dominance detection in a reverberated acoustic scenario. In: Advances in neural networks-ISNN2012, Lecture notes in computer science, vol 7368. Springer; 2012.
- (2012) In: Advances in neural networks-ISNN2012, Lecture notes in computer science , vol.7368
- Principi, E.¹ Rotili, R.² Wöllmer, M.³ Squartini, S.⁴ Schuller, B.⁵

34
- 62949218524
- A robust iterative inverse filtering approach for speech dereverberation in presence of disturbances
- Rotili R, Cifani S, Principi E, Squartini S, Piazza F. A robust iterative inverse filtering approach for speech dereverberation in presence of disturbances. In: Proceedings of IEEE APCCAS; 2008. p. 434-7.
- (2008) In: Proceedings of IEEE APCCAS , pp. 434-437
- Rotili, R.¹ Cifani, S.² Principi, E.³ Squartini, S.⁴ Piazza, F.⁵

35
- 77957007047
- Joint multichannel blind speech separation and dereverberation: a real-time algorithmic implementation
- Rotili R, De Simone C, Perelli A, Cifani A, Squartini S. Joint multichannel blind speech separation and dereverberation: a real-time algorithmic implementation. In: Proceedings of ICIC; 2010. p. 85-93.
- (2010) In: Proceedings of ICIC , pp. 85-93
- Rotili, R.¹ De Simone, C.² Perelli, A.³ Cifani, A.⁴ Squartini, S.⁵

36
- 79957857634
- Real-time joint blind speech separation and dereverberation in presence of overlapping speakers
- Rotili R, Principi E, Squartini S, Piazza F. Real-time joint blind speech separation and dereverberation in presence of overlapping speakers. In: Proceedings of ISNN. Berlin: Springer; 2011. p. 437-46.
- (2011) In: Proceedings of ISNN. Berlin: Springer , pp. 437-446
- Rotili, R.¹ Principi, E.² Squartini, S.³ Piazza, F.⁴

37
- 84855668163
- Real-time speech recognition in a multi-talker reverberated acoustic scenario
- In: Huang DS, Gan Y, Gupta P, Gromiha M, editors, Berlin: Springer
- Rotili R, Principi E, Squartini S, Schuller B Real-time speech recognition in a multi-talker reverberated acoustic scenario. In: Huang DS, Gan Y, Gupta P, Gromiha M, editors. Advanced intelligent computing theories and applications. With aspects of artificial intelligence, Lecture notes in computer science. Berlin: Springer; 2012. p. 379-86.
- (2012) Advanced intelligent computing theories and applications. With aspects of artificial intelligence, Lecture notes in computer science , pp. 379-386
- Rotili, R.¹ Principi, E.² Squartini, S.³ Schuller, B.⁴

38
- 79960846940
- Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge
- Schuller B, Batliner A, Steidl S, Seppi D. Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Commun. (2011);53(9/10): 1062-87.
- (2011) Speech Commun , vol.53 , Issue.9-10 , pp. 1062-1087
- Schuller, B.¹ Batliner, A.² Steidl, S.³ Seppi, D.⁴

39
- 77956647570
- Non-linear and non-conventional speech processing: alternative techniques
- Solé-Casals J, Zaiats V, Monte-Moreno E. Non-linear and non-conventional speech processing: alternative techniques. Cognit Comput. 2010;2(3): 133-4.
- (2010) Cognit Comput , vol.2 , Issue.3 , pp. 133-134
- Solé-Casals, J.¹ Zaiats, V.² Monte-Moreno, E.³

40
- 82655173885
- Environmental robust speech and speaker recognition through multi-channel histogram equalization
- Squartini S, Principi E, Rotili R, Piazza F. Environmental robust speech and speaker recognition through multi-channel histogram equalization. Neurocomputing. 2012;78(1): 111-120.
- (2012) Neurocomputing , vol.78 , Issue.1 , pp. 111-120
- Squartini, S.¹ Principi, E.² Rotili, R.³ Piazza, F.⁴

41
- 52149094528
- Towards semantic analysis of conversations: a system for the live identification of speakers in meetings
- Vinyals O, Friedland G. Towards semantic analysis of conversations: a system for the live identification of speakers in meetings. In: Proceedings of IEEE international conference on semantic computing; 2008. p. 426 -31.
- (2008) In: Proceedings of IEEE international conference on semantic computing , pp. 426-431
- Vinyals, O.¹ Friedland, G.²

42
- 79955034745
- Recognition of nonprototypical emotions in reverberated and noisy speech by nonnegative matrix factorization
- Weninger F, Schuller B, Batliner A, Steidl S, Seppi D Recognition of nonprototypical emotions in reverberated and noisy speech by nonnegative matrix factorization. EURASIP J Adv Signal Process. 2011;11: 1-16.
- (2011) EURASIP J Adv Signal Process , vol.11 , pp. 1-16
- Weninger, F.¹ Schuller, B.² Batliner, A.³ Steidl, S.⁴ Seppi, D.⁵

43
- 78651563436
- Bidirectional lstm networks for context-sensitive keyword detection in a cognitive virtual agent framework
- Wöllmer M, Eyben F, Graves A, Schuller B, Rigoll G. Bidirectional lstm networks for context-sensitive keyword detection in a cognitive virtual agent framework. Cognit Comput. 2010;2(3): 180-90.
- (2010) Cognit Comput , vol.2 , Issue.3 , pp. 180-190
- Wöllmer, M.¹ Eyben, F.² Graves, A.³ Schuller, B.⁴ Rigoll, G.⁵

44
- 81355147535
- Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
- Wöllmer M, Marchi E, Squartini S, Schuller B. Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting. Cogn Neurodyn. 2011;5(3): 253-64.
- (2011) Cogn Neurodyn , vol.5 , Issue.3 , pp. 253-264
- Wöllmer, M.¹ Marchi, E.² Squartini, S.³ Schuller, B.⁴

45
- 47749119617
- The ICSI RT07s speaker diarization system
- In: Stiefelhagen R, Bowers R, Fiscus J, editors, Berlin: Springer
- Wooters C, Huijbregts M. The ICSI RT07s speaker diarization system. In: Stiefelhagen R, Bowers R, Fiscus J, editors. Multimodal technologies for perception of humans, Lecture notes in computer science. Berlin: Springer; 2008. p. 509-19.
- (2008) Multimodal technologies for perception of humans, Lecture notes in computer science , pp. 509-519
- Wooters, C.¹ Huijbregts, M.²

46
- 0029532509
- A least-squares approach to blind channel identification
- Xu G, Liu H, Tong L, Kailath T. A least-squares approach to blind channel identification. IEEE Trans Signal Process. 1995;43(12): 2982-93.
- (1995) IEEE Trans Signal Process , vol.43 , Issue.12 , pp. 2982-2993
- Xu, G.¹ Liu, H.² Tong, L.³ Kailath, T.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.