Volume 4, Issue 5, 2010, Pages 785-797

Automatic beamforming for blind extraction of speech from music environment using variance of spectral flux-inspired criterion

Author keywords

Array signal processing; blind beamforming; blind source extraction (BSE); speech enhancement

Indexed keywords

ACOUSTIC SOUND; ARRAY GEOMETRIES; ARRAY SIGNAL PROCESSING; BLIND BEAMFORMING; BLIND EXTRACTION; BLIND SOURCE EXTRACTION; DIFFUSE NOISE; FIXED-POINT ALGORITHMS; INTERFERENCE NOISE; KEY FEATURE; NOISE COVARIANCE MATRIX; NOISY OBSERVATIONS; NONGAUSSIANITY; PRIORI KNOWLEDGE; SECOND ORDERS; SPECTRAL FLUX; SUBSPACE DECOMPOSITION; TAYLOR SERIES EXPANSIONS; TIME CORRELATIONS; TIME FREQUENCY DOMAIN;

EID: 77956737752     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2069790     Document Type: Article
Times cited : (4)

References (31)
  • 2
    • 0036725739
    • GSVD-based optimal filtering for single and multimicrophone speech enhancement
    • Sep
    • S. Doclo and M. Moonen, "GSVD-based optimal filtering for single and multimicrophone speech enhancement," IEEE Trans. Signal Process., vol.50, no.9, pp. 2230-2244, Sep. 2002.
    • (2002) IEEE Trans. Signal Process. , vol.50 , Issue.9 , pp. 2230-2244
    • Doclo, S.1    Moonen, M.2
  • 3
    • 41049101164
    • Frequency domain multi-channel noise reduction based on the spatial subspace decomposition and noise eigenvalue modification
    • Sep
    • G. Kim and N. I. N. Cho, "Frequency domain multi-channel noise reduction based on the spatial subspace decomposition and noise eigenvalue modification," Speech Commun., vol.50, pp. 382-391, Sep. 2008.
    • (2008) Speech Commun , vol.50 , pp. 382-391
    • Kim, G.1    Cho, N.I.N.2
  • 4
    • 51449123556
    • Blind acoustic beamforming based on generalized eigenvalue decomposition
    • Jul.
    • E. Warsitz and R. Haeb-Umbach, "Blind acoustic beamforming based on generalized eigenvalue decomposition," IEEE Trans. Audio, Speech, Language Process., vol.15, no.5, pp. 1529-1539, Jul. 2007.
    • (2007) IEEE Trans. Audio, Speech, Language Process. , vol.15 , Issue.5 , pp. 1529-1539
    • Warsitz, E.1    Haeb-Umbach, R.2
  • 6
    • 0042761032
    • A new algorithm for automatic beamforming
    • Z. Ding, "A new algorithm for automatic beamforming," in Proc. Conf. Signals, Syst., Comput., 1991, vol.2, pp. 689-693.
    • (1991) Proc. Conf. Signals, Syst., Comput , vol.2 , pp. 689-693
    • Ding, Z.1
  • 7
    • 0036753896
    • Geometric source separation: Merging convolutive source separation with geometric beamforming
    • Sep
    • L. Parra and C. Alvino, "Geometric source separation: merging convolutive source separation with geometric beamforming," IEEE Trans. Speech Audio Process., vol.10, no.6, pp. 352-362, Sep. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 352-362
    • Parra, L.1    Alvino, C.2
  • 9
    • 44649121939
    • Geometrically constrained independent component analysis
    • Feb.
    • M. Knaak, S. Araki, and S. Makino, "Geometrically constrained independent component analysis," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.2, pp. 715-726, Feb. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 715-726
    • Knaak, M.1    Araki, S.2    Makino, S.3
  • 11
    • 70349197558
    • Combining independent component analysis with geometric information and its application to speech processing
    • Apr
    • W. Zhang and B. D. Rao, "Combining independent component analysis with geometric information and its application to speech processing," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Apr. 2009, pp. 3065-3068.
    • (2009) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 3065-3068
    • Zhang, W.1    Rao, B.D.2
  • 12
    • 0036816475
    • Content analysis for audio classification and segmentation
    • Oct
    • L. Lu, H. J. Zhang, and H. Jiang, "Content analysis for audio classification and segmentation," IEEE Trans. Speech Audio Process., vol.10, no.7, pp. 504-516, Oct. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.7 , pp. 504-516
    • Lu, L.1    Zhang, H.J.2    Jiang, H.3
  • 13
    • 34047274787
    • Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora
    • May
    • R. Huang and J. H. L. Hansen, "Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.3, pp. 907-919, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 907-919
    • Huang, R.1    Hansen, J.H.L.2
  • 15
    • 0031234613
    • A signal subspace tracking algorithm for microphone array processing of speech
    • Sep
    • S. Affes and Y. Grenier, "A signal subspace tracking algorithm for microphone array processing of speech," IEEE Trans. Speech Audio Process., vol.5, no.5, pp. 425-437, Sep. 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.5 , pp. 425-437
    • Affes, S.1    Grenier, Y.2
  • 17
    • 41049111573
    • A class of complex ICA algorithms based on the kurtosis cost function
    • H. Li and T. Adalı, "A class of complex ICA algorithms based on the kurtosis cost function," IEEE Trans. Neural Netw., vol.19, no.3, pp. 408-420, 2008.
    • (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.3 , pp. 408-420
    • Li, H.1    Adalı, T.2
  • 18
    • 0021645331
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Audio Process., vol.ASSP-32, no.12, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Audio Process. , vol.ASSP-32 , Issue.12 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 19
    • 0034136037
    • A fast fixed-point algorithm for independent component analysis of complex valued signals
    • E. Bingham and A. Hyvärinen, "A fast fixed-point algorithm for independent component analysis of complex valued signals," Int. J. Neural Syst., vol.10, pp. 1-8, 2000.
    • (2000) Int. J. Neural Syst. , vol.10 , pp. 1-8
    • Bingham, E.1    Hyvärinen, A.2
  • 20
    • 0037367812
    • The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech
    • Mar
    • S. Araki, R. Mukai, S. Makino, T. Nishikawa, and H. Saruwatari, "The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech," IEEE Trans. Speech Audio Process., vol.11, no.2, pp. 109-116, Mar. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.2 , pp. 109-116
    • Araki, S.1    Mukai, R.2    Makino, S.3    Nishikawa, T.4    Saruwatari, H.5
  • 21
    • 0030648077
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • Apr
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Apr. 1997, vol.2, pp. 1331-1334.
    • (1997) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.2 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 22
    • 85006586861
    • Speech and music classification and separation: A review
    • A. I. Al-Shoshan, "Speech and music classification and separation: A review," J. King Saud Univ., vol.19, pp. 95-133, 2006.
    • (2006) J. King Saud Univ. , vol.19 , pp. 95-133
    • Al-Shoshan, A.I.1
  • 23
    • 70350573623
    • Complex-valued independent component analysis for online blind speech extraction
    • Nov
    • B. Sällberg, N. Grbić, and I. Claesson, "Complex-valued independent component analysis for online blind speech extraction," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.8, pp. 1624-1632, Nov. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.8 , pp. 1624-1632
    • Sällberg, B.1    Grbić, N.2    Claesson, I.3
  • 24
    • 0020706753
    • A complex gradient operator and its application in adaptive array theory
    • Feb.
    • D. H. Brandwood, "A complex gradient operator and its application in adaptive array theory," in Proc. IEE, Special Iss. Adaptive Arrays, Feb. 1983, vol.130, pp. 11-17.
    • (1983) Proc. IEE, Special Iss. Adaptive Arrays , vol.130 , pp. 11-17
    • Brandwood, D.H.1
  • 26
    • 0025516799
    • Speech enhancement for mobile telephony
    • Nov.
    • M. M. Goulding and J. S. Bird, "Speech enhancement for mobile telephony," IEEE Trans. Veh. Technol., vol.39, no.4, pp. 316-326, Nov. 1990.
    • (1990) IEEE Trans. Veh. Technol. , vol.39 , Issue.4 , pp. 316-326
    • Goulding, M.M.1    Bird, J.S.2
  • 27
    • 77956715553
    • [Online]. Available
    • [Online]. Available: http://www.utdallas.edu/research/utdrive/UTDrive-Website.htm
  • 29
    • 4544265401
    • Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs
    • ITU-T Rec. 862
    • "Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs," ITU, 2000, ITU-T Rec. 862.
    • (2000) ITU
  • 31
    • 0033904494
    • A Newton-like algorithm for complex variables with applications in blind equalization
    • Feb.
    • G. Yan and H. Fan, "A Newton-like algorithm for complex variables with applications in blind equalization," IEEE Trans. Signal Process., vol.48, no.2, pp. 553-556, Feb. 2000.
    • (2000) IEEE Trans. Signal Process. , vol.48 , Issue.2 , pp. 553-556
    • Yan, G.1    Fan, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.