SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2017, Pages 16-20

An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures

(5) Kounades-Bastian, Dionyssos (a); Girin, Laurent (a,b); Alameda-Pineda, Xavier (c); Gannot, Sharon (d); Horaud, Radu (a)

a INRIA RHÔNE ALPES (France)

b UNIV GRENOBLE ALPES (France)

c UNIVERSITY OF TRENTO (Italy)

d BAR ILAN UNIVERSITY (Israel)

Author keywords

Audio source separation; local Gaussian model; speaker diarisation

Indexed keywords

EID: 85023750194 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2017.7951789 Document Type: Conference Paper

Times cited : (16)

References (24)

1
- 85013730549
- Academic Press
- P. Comon and C. Jutten, Eds., Handbook of Blind Source Separation - Independent Component Analysis and Applications. Academic Press, 2010.
- (2010) Handbook of Blind Source Separation - Independent Component Analysis and Applications
- Comon, P.¹ Jutten, C.²

2
- 34047261805
- An overview of automatic speaker diarization systems
- S. Tranter and D. Reynolds, "An overview of automatic speaker diarization systems," IEEE TASLP, Vol. 14, no. 5, pp. 1557-1565, 2006.
- (2006) IEEE TASLP , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.¹ Reynolds, D.²

3
- 85008530405
- Speaker diarization: A review of recent research
- X. Anguera Miro, S. Bozonnet, N. Evans, C. Fredouille, G. Friedland, and O. Vinyals, "Speaker diarization: A review of recent research," IEEE TASLP, Vol. 20, no. 2, pp. 356-371, 2012.
- (2012) IEEE TASLP , vol.20 , Issue.2 , pp. 356-371
- Anguera Miro, X.¹ Bozonnet, S.² Evans, N.³ Fredouille, C.⁴ Friedland, G.⁵ Vinyals, O.⁶

4
- 77955698250
- Probabilistic modeling paradigms for audio source separation
- E. Vincent, M. G. Jafari, S. A. Abdallah, M. D. Plumbley, and M. E. Davies, "Probabilistic modeling paradigms for audio source separation," Machine Audition: Principles, Algorithms and Systems, pp. 162-185, 2010.
- (2010) Machine Audition: Principles, Algorithms and Systems , pp. 162-185
- Vincent, E.¹ Jafari, M.G.² Abdallah, S.A.³ Plumbley, M.D.⁴ Davies, M.E.⁵

5
- 76949094445
- Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
- A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE TASLP, Vol. 18, no. 3, pp. 550-563, 2010.
- (2010) IEEE TASLP , vol.18 , Issue.3 , pp. 550-563
- Ozerov, A.¹ Févotte, C.²

6
- 77955675017
- Under-determined reverberant audio source separation using a full-rank spatial covariance model
- N. Duong, E. Vincent, and R. Gribonval, "Under-determined reverberant audio source separation using a full-rank spatial covariance model," IEEE TASLP, Vol. 18, no. 7, pp. 1830-1840, 2010.
- (2010) IEEE TASLP , vol.18 , Issue.7 , pp. 1830-1840
- Duong, N.¹ Vincent, E.² Gribonval, R.³

7
- 84897584695
- A general flexible framework for the handling of prior information in audio source separation
- A. Ozerov, E. Vincent, and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE TASLP, Vol. 20, no. 4, pp. 1118-1133, 2012.
- (2012) IEEE TASLP , vol.20 , Issue.4 , pp. 1118-1133
- Ozerov, A.¹ Vincent, E.² Bimbot, F.³

8
- 80052714549
- Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features
- D. Vijayasenan, F. Valente, and H. Bourlard, "Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features," Springer handbook on speech processing and speech communication, Vol. 54, no. 1, 2012.
- (2012) Springer Handbook on Speech Processing and Speech Communication , vol.54 , Issue.1
- Vijayasenan, D.¹ Valente, F.² Bourlard, H.³

9
- 84910071152
- A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models
- Singapore
- T. Higuchi, H. Takeda, N. Tomohiko, and H. Kameoka, "A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models," in Interspeech, Singapore, 2014.
- (2014) Interspeech
- Higuchi, T.¹ Takeda, H.² Tomohiko, N.³ Kameoka, H.⁴

10
- 85023765413
- Underdetermined blind separation and tracking of moving sources based on DOA-HMM
- Florence, Italy
- T. Higuchi, N. Takamune, N. Tomohiko, and H. Kameoka, "Underdetermined blind separation and tracking of moving sources based on DOA-HMM," in IEEE ICASSP, Florence, Italy, 2014.
- (2014) IEEE ICASSP
- Higuchi, T.¹ Takamune, N.² Tomohiko, N.³ Kameoka, H.⁴

11
- 84963959904
- Unified approach for audio source separation with multichannel HMM and DOA mixture model
- Nice, France
- T. Higuchi and H. Kameoka, "Unified approach for audio source separation with multichannel HMM and DOA mixture model," in Eusipco, Nice, France, 2015.
- (2015) Eusipco
- Higuchi, T.¹ Kameoka, H.²

12
- 0000914334
- Convolutive blind separation of non-stationary sources
- L. Parra and C. Spence, "Convolutive blind separation of non-stationary sources," IEEE TASLP, Vol. 8, no. 3, pp. 320-327, 2000.
- (2000) IEEE TASLP , vol.8 , Issue.3 , pp. 320-327
- Parra, L.¹ Spence, C.²

13
- 0027634633
- Proper complex random processes with applications to information theory
- F. Neeser and J. Massey, "Proper complex random processes with applications to information theory," IEEE Trans. Info. Theory, Vol. 39, no. 4, pp. 1293-1302, 1993.
- (1993) IEEE Trans. Info. Theory , vol.39 , Issue.4 , pp. 1293-1302
- Neeser, F.¹ Massey, J.²

14
- 84976411473
- A variational EM algorithm for the separation of time-varying convolutive audio mixtures
- D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, and R. Horaud, "A variational EM algorithm for the separation of time-varying convolutive audio mixtures," IEEE TASLP, Vol. 24, no. 8, pp. 1408-1423, 2016.
- (2016) IEEE TASLP , vol.24 , Issue.8 , pp. 1408-1423
- Kounades-Bastian, D.¹ Girin, L.² Alameda-Pineda, X.³ Gannot, S.⁴ Horaud, R.⁵

15
- 78650302778
- Non-negative matrix factorization and spatial covariance model for under-determined reverberant audio source separation
- S. Arberet, A. Ozerov, N. Q. K. Duong, E. Vincent, R. Gribonval, F. Bimbot, and P. Vandergheynst, "Non-negative matrix factorization and spatial covariance model for under-determined reverberant audio source separation," in ISSPA, 2010.
- (2010) ISSPA
- Arberet, S.¹ Ozerov, A.² Duong, N.Q.K.³ Vincent, E.⁴ Gribonval, R.⁵ Bimbot, F.⁶ Vandergheynst, P.⁷

16
- 0141630475
- Non negative sparse representation for Wiener based source separation with a single sensor
- L. Benaroya, L. Donagh, F. Bimbot, and R. Gribonval, "Non negative sparse representation for Wiener based source separation with a single sensor," in IEEE ICASSP, Vol. 6, 2003, pp. 613-616.
- (2003) IEEE ICASSP , vol.6 , pp. 613-616
- Benaroya, L.¹ Donagh, L.² Bimbot, F.³ Gribonval, R.⁴

17
- 63249085556
- Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis
- C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis," Neural Computation, Vol. 21, no. 3, pp. 793-830, 2009.
- (2009) Neural Computation , vol.21 , Issue.3 , pp. 793-830
- Févotte, C.¹ Bertin, N.² Durrieu, J.-L.³

18
- 84881053943
- Supervised and unsupervised speech enhancement using nonnegative matrix factorization
- N. Mohammadiha, P. Smaragdis, and A. Leijon, "Supervised and unsupervised speech enhancement using nonnegative matrix factorization," IEEE TASLP, Vol. 21, no. 10, pp. 2140-2151, 2013.
- (2013) IEEE TASLP , vol.21 , Issue.10 , pp. 2140-2151
- Mohammadiha, N.¹ Smaragdis, P.² Leijon, A.³

19
- 33846516584
- Springer
- C. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.¹

20
- 84867497700
- Linear mixing models for active listening of music productions in realistic studio conditions
- N. Sturmel, A. Liutkus, J. Pinel, L. Girin, S. Marchand, G. Richard, R. Badeau, and L. Daudet, "Linear mixing models for active listening of music productions in realistic studio conditions," in Proc. Audio Eng. Soc, 2012.
- (2012) Proc. Audio Eng. Soc
- Sturmel, N.¹ Liutkus, A.² Pinel, J.³ Girin, L.⁴ Marchand, S.⁵ Richard, G.⁶ Badeau, R.⁷ Daudet, L.⁸

21
- 3042518464
- TIMIT acoustic-phonetic continuous speech corpus
- Philadelphia
- J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, N. L. Dahlgren, and V. Zue, "TIMIT acoustic-phonetic continuous speech corpus," 1993, Linguistic Data Consortium, Philadelphia.
- (1993) Linguistic Data Consortium
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶ Zue, V.⁷

22
- 84976377039
- A comparison of computational precedence models for source separation in reverberant environments
- C. Hummersone, R. Mason, and T. Brookes, "A comparison of computational precedence models for source separation in reverberant environments," J. Audio Eng. Soc, Vol. 61, no. 7/8, pp. 508-520, 2013.
- (2013) J. Audio Eng. Soc , vol.61 , Issue.7-8 , pp. 508-520
- Hummersone, C.¹ Mason, R.² Brookes, T.³

23
- 33744975847
- Performance measurement in blind audio source separation
- E. Vincent, R. Gribonval, and C. Févotte, "Performance measurement in blind audio source separation," IEEE TASLP, Vol. 14, no. 4, pp. 1462-1469, 2006.
- (2006) IEEE TASLP , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Févotte, C.³

24
- 84873425853
- Non-negative hidden Markov modeling of audio with application to source separation
- St. Malo, France
- G. J. Mysore, P. Smaragdis, and R. Bliksha, "Non-negative hidden Markov modeling of audio with application to source separation," in Proc. Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA), St. Malo, France, 2010.
- (2010) Proc. Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA)
- Mysore, G.J.¹ Smaragdis, P.² Bliksha, R.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.