2017, Pages 16-20

An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures

Author keywords

Audio source separation; local Gaussian model; speaker diarisation


EID: 85023750194     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2017.7951789     Document Type: Conference Paper
Times cited: 16

References (24)
  • 2
    • 34047261805
    • An overview of automatic speaker diarization systems
    • S. Tranter and D. Reynolds, "An overview of automatic speaker diarization systems," IEEE TASLP, Vol. 14, no. 5, pp. 1557-1565, 2006.
    • (2006) IEEE TASLP , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.1    Reynolds, D.2
  • 5
    • 76949094445
    • Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
    • A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE TASLP, Vol. 18, no. 3, pp. 550-563, 2010.
    • (2010) IEEE TASLP , vol.18 , Issue.3 , pp. 550-563
    • Ozerov, A.1    Févotte, C.2
  • 6
    • 77955675017
    • Under-determined reverberant audio source separation using a full-rank spatial covariance model
    • N. Duong, E. Vincent, and R. Gribonval, "Under-determined reverberant audio source separation using a full-rank spatial covariance model," IEEE TASLP, Vol. 18, no. 7, pp. 1830-1840, 2010.
    • (2010) IEEE TASLP , vol.18 , Issue.7 , pp. 1830-1840
    • Duong, N.1    Vincent, E.2    Gribonval, R.3
  • 7
    • 84897584695
    • A general flexible framework for the handling of prior information in audio source separation
    • A. Ozerov, E. Vincent, and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE TASLP, Vol. 20, no. 4, pp. 1118-1133, 2012.
    • (2012) IEEE TASLP , vol.20 , Issue.4 , pp. 1118-1133
    • Ozerov, A.1    Vincent, E.2    Bimbot, F.3
  • 9
    • 84910071152
    • A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models
    • Singapore
    • T. Higuchi, H. Takeda, N. Tomohiko, and H. Kameoka, "A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models," in Interspeech, Singapore, 2014.
    • (2014) Interspeech
    • Higuchi, T.1    Takeda, H.2    Tomohiko, N.3    Kameoka, H.4
  • 10
    • 85023765413
    • Underdetermined blind separation and tracking of moving sources based on DOA-HMM
    • Florence, Italy
    • T. Higuchi, N. Takamune, N. Tomohiko, and H. Kameoka, "Underdetermined blind separation and tracking of moving sources based on DOA-HMM," in IEEE ICASSP, Florence, Italy, 2014.
    • (2014) IEEE ICASSP
    • Higuchi, T.1    Takamune, N.2    Tomohiko, N.3    Kameoka, H.4
  • 11
    • 84963959904
    • Unified approach for audio source separation with multichannel HMM and DOA mixture model
    • Nice, France
    • T. Higuchi and H. Kameoka, "Unified approach for audio source separation with multichannel HMM and DOA mixture model," in Eusipco, Nice, France, 2015.
    • (2015) Eusipco
    • Higuchi, T.1    Kameoka, H.2
  • 12
    • 0000914334
    • Convolutive blind separation of non-stationary sources
    • L. Parra and C. Spence, "Convolutive blind separation of non-stationary sources," IEEE TASLP, Vol. 8, no. 3, pp. 320-327, 2000.
    • (2000) IEEE TASLP , vol.8 , Issue.3 , pp. 320-327
    • Parra, L.1    Spence, C.2
  • 13
    • 0027634633
    • Proper complex random processes with applications to information theory
    • F. Neeser and J. Massey, "Proper complex random processes with applications to information theory," IEEE Trans. Info. Theory, Vol. 39, no. 4, pp. 1293-1302, 1993.
    • (1993) IEEE Trans. Info. Theory , vol.39 , Issue.4 , pp. 1293-1302
    • Neeser, F.1    Massey, J.2
  • 14
    • 84976411473
    • A variational EM algorithm for the separation of time-varying convolutive audio mixtures
    • D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, and R. Horaud, "A variational EM algorithm for the separation of time-varying convolutive audio mixtures," IEEE TASLP, Vol. 24, no. 8, pp. 1408-1423, 2016.
    • (2016) IEEE TASLP , vol.24 , Issue.8 , pp. 1408-1423
    • Kounades-Bastian, D.1    Girin, L.2    Alameda-Pineda, X.3    Gannot, S.4    Horaud, R.5
  • 15
    • 78650302778
    • Non-negative matrix factorization and spatial covariance model for under-determined reverberant audio source separation
    • S. Arberet, A. Ozerov, N. Q. K. Duong, E. Vincent, R. Gribonval, F. Bimbot, and P. Vandergheynst, "Non-negative matrix factorization and spatial covariance model for under-determined reverberant audio source separation," in ISSPA, 2010.
    • (2010) ISSPA
    • Arberet, S.1    Ozerov, A.2    Duong, N.Q.K.3    Vincent, E.4    Gribonval, R.5    Bimbot, F.6    Vandergheynst, P.7
  • 16
    • 0141630475
    • Non negative sparse representation for Wiener based source separation with a single sensor
    • L. Benaroya, L. Donagh, F. Bimbot, and R. Gribonval, "Non negative sparse representation for Wiener based source separation with a single sensor," in IEEE ICASSP, Vol. 6, 2003, pp. 613-616.
    • (2003) IEEE ICASSP , vol.6 , pp. 613-616
    • Benaroya, L.1    Donagh, L.2    Bimbot, F.3    Gribonval, R.4
  • 17
    • 63249085556
    • Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis
    • C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis," Neural Computation, Vol. 21, no. 3, pp. 793-830, 2009.
    • (2009) Neural Computation , vol.21 , Issue.3 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.-L.3
  • 18
    • 84881053943
    • Supervised and unsupervised speech enhancement using nonnegative matrix factorization
    • N. Mohammadiha, P. Smaragdis, and A. Leijon, "Supervised and unsupervised speech enhancement using nonnegative matrix factorization," IEEE TASLP, Vol. 21, no. 10, pp. 2140-2151, 2013.
    • (2013) IEEE TASLP , vol.21 , Issue.10 , pp. 2140-2151
    • Mohammadiha, N.1    Smaragdis, P.2    Leijon, A.3
  • 22
    • 84976377039
    • A comparison of computational precedence models for source separation in reverberant environments
    • C. Hummersone, R. Mason, and T. Brookes, "A comparison of computational precedence models for source separation in reverberant environments," J. Audio Eng. Soc, Vol. 61, no. 7/8, pp. 508-520, 2013.
    • (2013) J. Audio Eng. Soc , vol.61 , Issue.7-8 , pp. 508-520
    • Hummersone, C.1    Mason, R.2    Brookes, T.3
  • 23
    • 33744975847
    • Performance measurement in blind audio source separation
    • E. Vincent, R. Gribonval, and C. Févotte, "Performance measurement in blind audio source separation," IEEE TASLP, Vol. 14, no. 4, pp. 1462-1469, 2006.
    • (2006) IEEE TASLP , vol.14 , Issue.4 , pp. 1462-1469
    • Vincent, E.1    Gribonval, R.2    Févotte, C.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.