SCOPUS Information Retrieval Platform

IEEE Journal on Selected Topics in Signal Processing

Volume 4, Issue 5, 2010, Pages 785-797

Automatic beamforming for blind extraction of speech from music environment using variance of spectral flux-inspired criterion

(2) Yu, Tao (a); Hansen, John H. L. (a)

(a) University of Texas at Dallas (United States)

Author keywords

Array signal processing; blind beamforming; blind source extraction (BSE); speech enhancement

Indexed keywords

ACOUSTIC SOUND; ARRAY GEOMETRIES; ARRAY SIGNAL PROCESSING; BLIND BEAMFORMING; BLIND EXTRACTION; BLIND SOURCE EXTRACTION; DIFFUSE NOISE; FIXED-POINT ALGORITHMS; INTERFERENCE NOISE; KEY FEATURE; NOISE COVARIANCE MATRIX; NOISY OBSERVATIONS; NONGAUSSIANITY; PRIORI KNOWLEDGE; SECOND ORDERS; SPECTRAL FLUX; SUBSPACE DECOMPOSITION; TAYLOR SERIES EXPANSIONS; TIME CORRELATIONS; TIME FREQUENCY DOMAIN;

BEAMFORMING; COVARIANCE MATRIX; FEATURE EXTRACTION; OPTIMIZATION; SIGNAL PROCESSING; SPEECH ENHANCEMENT; TAYLOR SERIES; UNDERWATER ACOUSTICS;

BLIND SOURCE SEPARATION;

EID: 77956737752 PISSN: 19324553 EISSN: None Source Type: Journal
DOI: 10.1109/JSTSP.2010.2069790 Document Type: Article

Times cited : (4)

References (31)

1
- 0009590598
- New York: Springer
- M. Brandstein and D. Ward, Microphone Arrays. New York: Springer, 2001.
- (2001) Microphone Arrays
- Brandstein, M.¹ Ward, D.²

2
- 0036725739
- GSVD-based optimal filtering for single and multimicrophone speech enhancement
- Sep
- S. Doclo and M. Moonen, "GSVD-based optimal filtering for single and multimicrophone speech enhancement," IEEE Trans. Signal Process., vol.50, no.9, pp. 2230-2244, Sep. 2002.
- (2002) IEEE Trans. Signal Process. , vol.50 , Issue.9 , pp. 2230-2244
- Doclo, S.¹ Moonen, M.²

3
- 41049101164
- Frequency domain multi-channel noise reduction based on the spatial subspace decomposition and noise eigenvalue modification
- Sep
- G. Kim and N. I. N. Cho, "Frequency domain multi-channel noise reduction based on the spatial subspace decomposition and noise eigenvalue modification," Speech Commun., vol.50, pp. 382-391, Sep. 2008.
- (2008) Speech Commun , vol.50 , pp. 382-391
- Kim, G.¹ Cho, N.I.N.²

4
- 51449123556
- Blind acoustic beamforming based on generalized eigenvalue decomposition
- Jul.
- E. Warsitz and R. Haeb-Umbach, "Blind acoustic beamforming based on generalized eigenvalue decomposition," IEEE Trans. Audio, Speech, Language Process., vol.15, no.5, pp. 1529-1539, Jul. 2007.
- (2007) IEEE Trans. Audio, Speech, Language Process. , vol.15 , Issue.5 , pp. 1529-1539
- Warsitz, E.¹ Haeb-Umbach, R.²

5
- 0027812550
- Blind beamforming for non-Gaussian signals
- J. F. Cardoso and A. Souloumiac, "Blind beamforming for non-Gaussian signals," IEE Proc. Radar Signal Process., vol.140, pp. 362-370, 1993.
- (1993) IEE Proc. Radar Signal Process. , vol.140 , pp. 362-370
- Cardoso, J.F.¹ Souloumiac, A.²

6
- 0042761032
- A new algorithm for automatic beamforming
- Z. Ding, "A new algorithm for automatic beamforming," in Proc. Conf. Signals, Syst., Comput., 1991, vol.2, pp. 689-693.
- (1991) Proc. Conf. Signals, Syst., Comput , vol.2 , pp. 689-693
- Ding, Z.¹

7
- 0036753896
- Geometric source separation: Merging convolutive source separation with geometric beamforming
- Sep
- L. Parra and C. Alvino, "Geometric source separation: merging convolutive source separation with geometric beamforming," IEEE Trans. Speech Audio Process., vol.10, no.6, pp. 352-362, Sep. 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 352-362
- Parra, L.¹ Alvino, C.²

8
- 65249179051
- Blind spatial subtraction array for speech enhancement in noisy environment
- May
- Y. Takahashi, T. Takatani, K. Osako, H. Saruwatari, and K. Shikano, "Blind spatial subtraction array for speech enhancement in noisy environment," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.4, pp. 650-664, May 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 650-664
- Takahashi, Y.¹ Takatani, T.² Osako, K.³ Saruwatari, H.⁴ Shikano, K.⁵

9
- 44649121939
- Geometrically constrained independent component analysis
- Feb.
- M. Knaak, S. Araki, and S. Makino, "Geometrically constrained independent component analysis," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.2, pp. 715-726, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 715-726
- Knaak, M.¹ Araki, S.² Makino, S.³

10
- 33749553258
- Blind source separation based on a fast-convergence algorithm combining ICA and beamforming
- Mar
- H. Saruwatari, T. Kawamura, T. Nishikawa, A. Lee, and K. Shikano, "Blind source separation based on a fast-convergence algorithm combining ICA and beamforming," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.2, pp. 666-678, Mar. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 666-678
- Saruwatari, H.¹ Kawamura, T.² Nishikawa, T.³ Lee, A.⁴ Shikano, K.⁵

11
- 70349197558
- Combining independent component analysis with geometric information and its application to speech processing
- Apr
- W. Zhang and B. D. Rao, "Combining independent component analysis with geometric information and its application to speech processing," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Apr. 2009, pp. 3065-3068.
- (2009) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 3065-3068
- Zhang, W.¹ Rao, B.D.²

12
- 0036816475
- Content analysis for audio classification and segmentation
- Oct
- L. Lu, H. J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. Speech Audio Process., vol.10, no.7, pp. 504-516, Oct. 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.7 , pp. 504-516
- Lu, L.¹ Zhang, H.J.² Jiang, H.³

13
- 34047274787
- Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora
- May
- R. Huang and J. H. L. Hansen, "Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.3, pp. 907-919, May 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 907-919
- Huang, R.¹ Hansen, J.H.L.²

14
- 0003964055
- New York: Wiley
- H. L. Van Trees, Optimum Array Processing. New York: Wiley, 2002.
- (2002) Optimum Array Processing
- Van Trees, H.L.¹

15
- 0031234613
- A signal subspace tracking algorithm for microphone array processing of speech
- Sep
- S. Affes and Y. Grenier, "A signal subspace tracking algorithm for microphone array processing of speech," IEEE Trans. Speech Audio Process., vol.5, no.5, pp. 425-437, Sep. 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.5 , pp. 425-437
- Affes, S.¹ Grenier, Y.²

16
- 0003905759
- New York: Wiley
- A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis. New York: Wiley, 2001.
- (2001) Independent Component Analysis
- Hyvärinen, A.¹ Karhunen, J.² Oja, E.³

17
- 41049111573
- A class of complex ICA algorithms based on the kurtosis cost function
- H. Li and T. Adalı, "A class of complex ICA algorithms based on the kurtosis cost function," IEEE Trans. Neural Netw., vol.19, no.3, pp. 408-420, 2008.
- (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.3 , pp. 408-420
- Li, H.¹ Adalı, T.²

18
- 0021645331
- Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
- Dec
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Audio Process., vol.ASSP-32, no.12, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Audio Process. , vol.ASSP-32 , Issue.12 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

19
- 0034136037
- A fast fixed-point algorithm for independent component analysis of complex valued signals
- E. Bingham and A. Hyvärinen, "A fast fixed-point algorithm for independent component analysis of complex valued signals," Int. J. Neural Syst., vol.10, pp. 1-8, 2000.
- (2000) Int. J. Neural Syst. , vol.10 , pp. 1-8
- Bingham, E.¹ Hyvärinen, A.²

20
- 0037367812
- The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
- Mar
- S. Araki, R. Mukai, S. Makino, T. Nishikawa, and H. Saruwatari, "The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech," IEEE Trans. Speech Audio Process., vol.11, no.2, pp. 109-116, Mar. 2003.
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.2 , pp. 109-116
- Araki, S.¹ Mukai, R.² Makino, S.³ Nishikawa, T.⁴ Saruwatari, H.⁵

21
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- Apr
- E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Apr. 1997, vol.2, pp. 1331-1334.
- (1997) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.2 , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

22
- 85006586861
- Speech and music classification and separation: A review
- A. I. Al-Shoshan, "Speech and music classification and separation: A review," J. King Saud Univ., vol.19, pp. 95-133, 2006.
- (2006) J. King Saud Univ. , vol.19 , pp. 95-133
- Al-Shoshan, A.I.¹

23
- 70350573623
- Complex-valued independent component analysis for online blind speech extraction
- Nov
- B. Sällberg, N. Grbić, and I. Claesson, "Complex-valued independent component analysis for online blind speech extraction," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.8, pp. 1624-1632, Nov. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.8 , pp. 1624-1632
- Sällberg, B.¹ Grbić, N.² Claesson, I.³

24
- 0020706753
- A complex gradient operator and its application in adaptive array theory
- Feb.
- D. H. Brandwood, "A complex gradient operator and its application in adaptive array theory," in Proc. IEE, Special Iss. Adaptive Arrays, Feb. 1983, vol.130, pp. 11-17.
- (1983) Proc. IEE, Special Iss. Adaptive Arrays , vol.130 , pp. 11-17
- Brandwood, D.H.¹

25
- 0003982971
- New York: Springer
- J. Nocedal and S. J. Wright, Numerical Optimization. New York: Springer, 2006.
- (2006) Numerical Optimization
- Nocedal, J.¹ Wright, S.J.²

26
- 0025516799
- Speech enhancement for mobile telephony
- Nov.
- M. M. Goulding and J. S. Bird, "Speech enhancement for mobile telephony," IEEE Trans. Veh. Technol., vol.39, no.4, pp. 316-326, Nov. 1990.
- (1990) IEEE Trans. Veh. Technol. , vol.39 , Issue.4 , pp. 316-326
- Goulding, M.M.¹ Bird, J.S.²

27
- 77956715553
- [Online]. Available
- [Online]. Available: http://www.utdallas.edu/research/utdrive/UTDrive-Website.htm

28
- 33645589775
- New York: Springer
- H. Abut, J. Hansen, and K. Takeda, DSP for In-Vehicle and Mobile Systems. New York: Springer, 2004.
- (2004) DSP for In-Vehicle and Mobile Systems
- Abut, H.¹ Hansen, J.² Takeda, K.³

29
- 4544265401
- Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs
- ITU-T Rec. 862
- "Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs," ITU, 2000, ITU-T Rec. 862.
- (2000) ITU

30
- 0003419545
- Gaithersburg, MD: NIST
- J. S. Garofolo, Getting Started with the DARPA TIMIT CD-ROM: An Acoustic Phonetic Continuous Speech Database. Gaithersburg, MD: NIST, 1988.
- (1988) Getting Started with the DARPA TIMIT CD-ROM: An Acoustic Phonetic Continuous Speech Database
- Garofolo, J.S.¹

31
- 0033904494
- A Newton-like algorithm for complex variables with applications in blind equalization
- Feb.
- G. Yan and H. Fan, "A Newton-like algorithm for complex variables with applications in blind equalization," IEEE Trans. Signal Process., vol.48, no.2, pp. 553-556, Feb. 2000.
- (2000) IEEE Trans. Signal Process. , vol.48 , Issue.2 , pp. 553-556
- Yan, G.¹ Fan, H.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.