SCOPUS 정보 검색 플랫폼

2014 IEEE Workshop on Spoken Language Technology, SLT 2014 - Proceedings

Volumn , Issue , 2014, Pages 390-395

Constrained speaker diarization of TV series based on visual patterns

(2) Bost, Xavier a Linares, Georges a

a UNIVERSITY OF AVIGNON (France)

Author keywords

Kagglomerative clustering; Speaker diarization; Video structuration

Indexed keywords

CLUSTERING ALGORITHMS;

ACOUSTIC CONDITIONS; ACOUSTIC VARIABILITY; AGGLOMERATIVE CLUSTERING; BACKGROUND MUSICS; KAGGLOMERATIVE CLUSTERING; SPEAKER DIARIZATION; STRUCTURATION; TWO STEP METHOD;

SPEECH RECOGNITION;

EID: 84946685118 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SLT.2014.7078606 Document Type: Conference Paper

Times cited : (17)

References (14)

1
- 80051623447
- Speaker diarization of heterogeneous web video files: A preliminary study
- Pierre Clement, Thierry Bazillon, and Corinne Fredouille, "Speaker diarization of heterogeneous web video files: A preliminary study, " in Acoustics, Speech and Signal Process-ing (ICASSP), 2011 IEEE International Conference on. IEEE, 201 1, pp. 4432-443 5.
- (2011) Acoustics, Speech and Signal Process-ing (ICASSP), 2011 IEEE International Conference On. IEEE , pp. 4432-4435
- Clement, P.¹ Bazillon, T.² Fredouille, C.³

2
- 70349214881
- Multi-modal speaker diarization of real-world meetings using compresseddomain video features
- April
- G. Friedland, H. Hung, and Chuohao Yeo, "Multi-modal speaker diarization of real-world meetings using compresseddomain video features, " in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, April 2009, pp. 4069-4072.
- (2009) Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on , pp. 4069-4072
- Friedland, G.¹ Hung, H.² Yeo, C.³

3
- 33745530242
- The ami meeting corpus: A pre-announcement
- Springer-Verlag
- Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, lain McCowan, Wilfried Post, Dennis Reidsma, and Pierre Wellner, "The ami meeting corpus: A pre-announcement, " in Proceedings of the Second International Conference on Machine Learning for Multimodal Interaction, Berlin, Heidelberg, 2006, MLMI' 05, p p. 28-39, Springer-Verlag.
- Proceedings of the Second International Conference on Machine Learning for Multimodal Interaction, Berlin, Heidelberg, 2006, MLMI' 05 , pp. 28-39
- Carletta, J.¹ Ashby, S.² Bourban, S.³ Flynn, M.⁴ Guillemot, M.⁵ Hain, T.⁶ Kadlec, J.⁷ Karaiskos, V.⁸ Kraaij, W.⁹ Kronenthal, M.¹⁰ Lathoud, G.¹¹ Lincoln, M.¹² Lisowska, A.¹³ McCowan, L.¹⁴ Post, W.¹⁵ Reidsma, D.¹⁶ Wellner, P.¹⁷

4
- 84865776156
- Comparing multi-stage approaches for cross-show speaker diarization
- Viet-Anh Tran, Viet B ac Le, Claude B arras, and Lori Lamel, " Comparing multi-stage approaches for cross-show speaker diarization., " in INTERSPEECH, 20 1 1, pp. 1 053-1056.
- (2011) INTERSPEECH , pp. 1053-1056
- Tran, V.¹ Le Ac, V.B.² Arras, C.B.³ Lamel, L.⁴

5
- 84883313615
- Unsupervised face identification in tv content using audio-visual sources
- Meriem Bendris, Benoit Favre, Delphine Charlet, Geraldine Damnati, Gregory Senay, Remi Auguste, and Jean Martinet, "Unsupervised face identification in tv content using audio-visual sources, " in Content Based Multimedia Indexing (CBMI), 2013 1 1 th International Workshop on. IEEE, 20 1 3, pp. 243-249.
- (2013) Content Based Multimedia Indexing (CBMI), 201311 Th International Workshop On. IEEE , pp. 243-249
- Bendris, M.¹ Favre, B.² Charlet, D.³ Damnati, G.⁴ Senay, G.⁵ Auguste, R.⁶ Martinet, J.⁷

6
- 84867605264
- Segmentation of tv shows into scenes using speaker diarization and speech recognition
- Herve Bredin, "Segmentation of tv shows into scenes using speaker diarization and speech recognition, " in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. IEEE, 20 1 2, pp. 2377-23 80.
- (2012) Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference On. IEEE , pp. 2377-2380
- Bredin, H.¹

7
- 0035093241
- Temporal video segmentation: A survey
- 2001
- Irena Koprinska and Sergio Carrato, "Temporal video segmentation: A survey, " Signal processing: Image communication, vol. 1 6, no. 5, pp. 477-500, 200 1.
- Signal Processing: Image Communication , vol.16 , Issue.5 , pp. 477-500
- Koprinska, I.¹ Carrato, S.²

8
- 79951609039
- Front-end factor analysis for speaker verification
- 2011
- Najim Dehak, Patrick Kenny, Recta Dehak, Pierre Dumouchel, and Pierre Ouellet, "Front-end factor analysis for speaker verification, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 4, pp. 7 8 8-798, 201 1.
- Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

9
- 84865753339
- Intersession compensation and scoring methods in the i-vectors space for speaker recognition
- Pierre-Michel Bousquet, Driss Matrouf, and Jean-Frano;ois Bonastre, "Intersession compensation and scoring methods in the i-vectors space for speaker recognition., " in INTERSPEECH, 20 1 1, pp. 485-4 8 8.
- (2011) INTERSPEECH , pp. 485-488
- Bousquet, P.¹ Matrouf, D.² Jean-Frano³ Bonastre, O.⁴

10
- 0023453329
- Silhouettes: A graphical aid t o the interpretation and validation of cluster analysis
- Peter J Rousseeuw, "Silhouettes: A graphical aid t o the interpretation and validation of cluster analysis, " Journal of computational and applied mathematics, vol. 20, pp. 5 3-D5, 1987.
- (1987) Journal of Computational and Applied Mathematics , vol.20 , pp. 53-D5
- Rousseeuw, P.J.¹

11
- 0001687441
- Comparison of video shot boundary detection techniques
- John S Boreczky and Lawrence A Rowe, "Comparison of video shot boundary detection techniques, " Journal of Electronic Imaging, vol. 5, no. 2, pp. 1 22-l 28, 1 996.
- (1996) Journal of Electronic Imaging , vol.5 , Issue.2 , pp. 122-128
- Boreczky, J.S.¹ Rowe, L.A.²

12
- 84906274473
- An open-source state-of-the-art toolbox for broadcast news diarization
- number EPFL-CONF-192762
- Mickael Rouvier, Gregor Dupuy, Paul Gay, Elie Khoury, Teva Merlin, and Sylvain Meignier, "An open-source state-of-the-art toolbox for broadcast news diarization, " in INTERSPEECH, 20 1 3, number EPFL-CONF-1 92762.
- (2013) INTERSPEECH
- Rouvier, M.¹ Dupuy, G.² Gay, P.³ Khoury, E.⁴ Merlin, T.⁵ Meignier, S.⁶

13
- 78049378635
- The lia-eurecom rt' 09 speaker diarization system: Enhancements in speaker modelling and cluster purification
- Simon Bozonnet, Nicholas WD Evans, and Corinne Fredouille, "The lia-eurecom rt' 09 speaker diarization system: enhancements in speaker modelling and cluster purification, " in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 495 8-496 1.
- (2010) Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference On. IEEE , pp. 4958-4961
- Bozonnet, S.¹ Evans, N.W.D.² Fredouille, C.³

14
- 78650898482
- Lium spkdiarization: An open source toolkit for diarization
- Sylvain Meignier and Teva Merlin, "Lium spkdiarization: An open source toolkit for diarization, " in CMU SPUD Workshop, 2010, vol. 20 1 0.
- (2010) CMU SPUD Workshop , vol.2010
- Meignier, S.¹ Merlin, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.