-
1
-
-
84889324982
-
Clustering speakers by their voices
-
Seattle, WA, May
-
A. Solomonoff, A. Mielke, M. Schmidt, and H. Gish, "Clustering speakers by their voices," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, vol. 2, pp. 75-760.
-
(1998)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.2
, pp. 75-760
-
-
Solomonoff, A.1
Mielke, A.2
Schmidt, M.3
Gish, H.4
-
2
-
-
33745200276
-
The Cambridge University March 2005 speaker diarisation system
-
Lisbon, Portugal, Sep.
-
R. Sinha, S. E. Tranter,M. J. F. Gales, and P. C.Woodland, "The Cambridge University March 2005 speaker diarisation system," in Proc. Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2437-2440.
-
(2005)
Proc. Eur. Conf. Speech Commun. Technol.
, pp. 2437-2440
-
-
Sinha, R.1
Tranter, S.E.2
Gales, M.J.F.3
Woodland, P.C.4
-
4
-
-
4544361760
-
Comparison of MPEG-7 audio spectrum projection features and MFCC applied to speaker recognition, sound classification and audio segmentation
-
Montreal, QC, Canada May
-
H. G. Kim and T. Sikora, "Comparison of MPEG-7 audio spectrum projection features and MFCC applied to speaker recognition, sound classification and audio segmentation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, vol. 5, pp. 925-928.
-
(2004)
Proc.IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.5
, pp. 925-928
-
-
Kim, H.G.1
Sikora, T.2
-
5
-
-
84979955147
-
Audio spectrum projection based on several basis decomposition algorithms applied to general sound recognition and audio segmentation
-
Vienna, Austria Sep.
-
H. G. Kim and T. Sikora, "Audio spectrum projection based on several basis decomposition algorithms applied to general sound recognition and audio segmentation," in Proc. 12th Eur. Signal Process. Conf., Vienna, Austria, Sep. 2004, pp. 1047-1050.
-
(2004)
Proc. 12th Eur. Signal Process. Conf.
, pp. 1047-1050
-
-
Kim, H.G.1
Sikora, T.2
-
6
-
-
34547324377
-
Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme
-
Kos, Greece May
-
M. Kotti, E. Benetos, and C. Kotropoulos, "Automatic speaker change detection with the Bayesian information criterion using MPEG-7 features and a fusion scheme," in Proc. IEEE Int. Symp. Circuits Syst., Kos, Greece, May 2006.
-
(2006)
Proc. IEEE Int. Symp. Circuits Syst.
-
-
Kotti, M.1
Benetos, E.2
Kotropoulos, C.3
-
7
-
-
34247559206
-
Automatic speaker segmentation using multiple features and distance measures: A comparison of three approaches
-
Toronto, ON, Canada, Jul.
-
M. Kotti, L. G. P. M. Martins, E. Benetos, J. S. Cardoso, and C. Kotropoulos, "Automatic speaker segmentation using multiple features and distance measures: A comparison of three approaches," in Proc. IEEE Int. Conf. Multimedia Expo, Toronto, ON, Canada, Jul. 2006, pp. 1101-1104.
-
(2006)
Proc. IEEE Int. Conf. Multimedia Expo
, pp. 1101-1104
-
-
Kotti, M.1
Martins, L.G.P.M.2
Benetos, E.3
Cardoso, J.S.4
Kotropoulos, C.5
-
8
-
-
34047261805
-
An overview of automatic speaker diarization systems
-
Sep.
-
S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.14
, Issue.5
, pp. 1557-1565
-
-
Tranter, S.E.1
Reynolds, D.A.2
-
9
-
-
4544247119
-
Online speaker clustering
-
Montreal, QC, Canada May
-
D. Liu and F. Kubala, "Online speaker clustering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, vol. 1, pp. 333-336.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 333-336
-
-
Liu, D.1
Kubala, F.2
-
10
-
-
64149092838
-
Speaker clustering of speech utterances using a voice characteristic reference space
-
Jeju Island, Korea Oct.
-
W. H. Tsai, S. S. Cheng, and H. M. Wang, "Speaker clustering of speech utterances using a voice characteristic reference space," in Proc. 8th Int. Conf. Spoken Lang. Process., Jeju Island, Korea, Oct. 2004.
-
(2004)
Proc. 8th Int. Conf. Spoken Lang. Process.
-
-
Tsai, W.H.1
Cheng, S.S.2
Wang, H.M.3
-
11
-
-
67349120575
-
Speaker diarization using autoassociative neural networks
-
S. Jothilakshmi, V. Ramalingam, and S. Palanivel, "Speaker diarization using autoassociative neural networks," Eng. Applicat. Artif. Intell., vol. 22, pp. 667-675, 2009.
-
(2009)
Eng. Applicat. Artif. Intell.
, vol.22
, pp. 667-675
-
-
Jothilakshmi, S.1
Ramalingam, V.2
Palanivel, S.3
-
12
-
-
0141809272
-
E-HMM approach for learning and adapting sound models for speaker indexing
-
Crete, Greece Jun.
-
S. Meignier, J. F. Bonastre, and S. Igounet, "E-HMM approach for learning and adapting sound models for speaker indexing," in Proc. Odyssey Speaker Lang. Recognition Workshop, Crete, Greece, Jun. 2001, pp. 175-180.
-
(2001)
Proc. Odyssey Speaker Lang. Recognition Workshop
, pp. 175-180
-
-
Meignier, S.1
Bonastre, J.F.2
Igounet, S.3
-
13
-
-
85009289298
-
Unknown-multiple speaker clustering using HMM
-
Sep.
-
J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan, "Unknown- multiple speaker clustering using HMM," in Proc. 7th Int. Conf. Spoken Lang. Process., Sep. 2002, pp. 573-576.
-
(2002)
Proc. 7th Int. Conf. Spoken Lang. Process.
, pp. 573-576
-
-
Ajmera, J.1
Bourlard, H.2
Lapidot, I.3
McCowan, I.4
-
14
-
-
29044442235
-
Step-by-step and integrated approaches in broadcast news speaker diarization
-
Apr.-July
-
S. Meignier, D. Moraru, C. Fredouille, J. F. Bonastre, and L. Besacier, "Step-by-step and integrated approaches in broadcast news speaker diarization," Comput. Speech Lang., vol. 20, no. 2-3, pp. 303-330, Apr.-July 2006.
-
(2006)
Comput. Speech Lang.
, vol.20
, Issue.2-3
, pp. 303-330
-
-
Meignier, S.1
Moraru, D.2
Fredouille, C.3
Bonastre, J.F.4
Besacier, L.5
-
15
-
-
34047266609
-
Multistage speaker diarization of broadcast news
-
Sep.
-
C. Barras, X. Zhu, S. Meignier, and J.-L. Gauvain, "Multistage speaker diarization of broadcast news," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1505-1512, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.14
, Issue.5
, pp. 1505-1512
-
-
Barras, C.1
Zhu, X.2
Meignier, S.3
Gauvain, J.-L.4
-
16
-
-
34047266379
-
Progress in the CU-HTK broadcast news transcription system
-
DOI 10.1109/TASL.2006.878264
-
M. J. F. Gales, D. Y. Kim, P. C. Woodland, H. Y. Chan, D. Mrva, R. Sinha, and S. E. Tranter, "Progress in the CU-HTK broadcast news transcription system," IEEE Trans. Speech Audio Process., vol. 14, no. 5, pp. 1513-1525, Sep. 2006. (Pubitemid 46547578)
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing
, vol.14
, Issue.5
, pp. 1513-1525
-
-
Gales, M.J.F.1
Kim, D.Y.2
Woodland, P.C.3
Chan, H.Y.4
Mrva, D.5
Sinha, R.6
Tranter, S.E.7
-
17
-
-
77956497615
-
-
[Online]. Available:
-
[Online]. Available: http://www.itl.nist.gov/iad/mig/tests/rt/
-
-
-
-
19
-
-
47749119617
-
The ICSI RT07S speaker diarization system
-
Berlin, Germany: Springer, vol. LNCS 4625
-
C. Wooters and M. Huijbregts, "The ICSI RT07S speaker diarization system," in Multimodal Technologies for Perception of Humans. Berlin, Germany: Springer, 2009, vol. LNCS 4625, pp. 509-519.
-
(2009)
Multimodal Technologies for Perception of Humans
, pp. 509-519
-
-
Wooters, C.1
Huijbregts, M.2
-
20
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
Oct.
-
D. A. Reynolds, T. F. Quatiery, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, Oct. 2000.
-
(2000)
Digital Signal Process.
, vol.10
, pp. 19-41
-
-
Reynolds, D.A.1
Quatiery, T.F.2
Dunn, R.B.3
-
21
-
-
0031233424
-
Speaker recognition: A tutorial
-
Sep.
-
J. P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
-
(1997)
Proc. IEEE
, vol.85
, Issue.9
, pp. 1437-1462
-
-
Campbell, J.P.1
-
22
-
-
38949122754
-
Speaker segmentation and clustering
-
May
-
M. Kotti, V. Moschou, and C. Kotropoulos, "Speaker segmentation and clustering," Signal Process., vol. 88, no. 5, pp. 1091-1124, May 2008.
-
(2008)
Signal Process.
, vol.88
, Issue.5
, pp. 1091-1124
-
-
Kotti, M.1
Moschou, V.2
Kotropoulos, C.3
-
23
-
-
0031331636
-
Unsupervised speaker classification using self-organizing maps
-
Amelia Island, FL Sep.
-
I. Voitovetsky, H. Guterman, and A. Cohen, "Unsupervised speaker classification using self-organizing maps," in Proc. IEEE Workshop Neural Netw. Signal Process., Amelia Island, FL, Sep. 1997, pp. 578-587.
-
(1997)
Proc. IEEE Workshop Neural Netw. Signal Process.
, pp. 578-587
-
-
Voitovetsky, I.1
Guterman, H.2
Cohen, A.3
-
24
-
-
0036650810
-
Unsupervised speaker recognition based on competition between self-organizing maps
-
Jul.
-
I. Lapidot, H. Guterman, and A. Cohen, "Unsupervised speaker recognition based on competition between self-organizing maps," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 877-887, Jul. 2002.
-
(2002)
IEEE Trans. Neural Netw.
, vol.13
, Issue.4
, pp. 877-887
-
-
Lapidot, I.1
Guterman, H.2
Cohen, A.3
-
25
-
-
33745185104
-
Combining speaker identification and BIC for speaker diarization
-
Lisbon, Portugal Sep.
-
X. Zhu, C. Barras, S. Meignier, and J.-L. Gauvain, "Combining speaker identification and BIC for speaker diarization," in Proc. Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2441-2444.
-
(2005)
Proc. Eur. Conf. Speech Commun. Technol.
, pp. 2441-2444
-
-
Zhu, X.1
Barras, C.2
Meignier, S.3
Gauvain, J.-L.4
-
26
-
-
0003128649
-
Automatic speaker clustering
-
Chantilly, VA
-
H. Jin, F. Kubala, and R. Schwartz, "Automatic speaker clustering," in Proc. Speech Recognition Workshop, Chantilly, VA, 1997, pp. 108-111.
-
(1997)
Proc. Speech Recognition Workshop
, pp. 108-111
-
-
Jin, H.1
Kubala, F.2
Schwartz, R.3
-
27
-
-
0026400244
-
Segregation of speakers for speech recognition and speaker identification
-
Toronto, ON, Canada, Apr.
-
H. Gish, M. H. Siu, and R. Rohlicek, "Segregation of speakers for speech recognition and speaker identification," in Proc. 1991 IEEE Int. Conf. Acoust., Speech, Signal Process., Toronto, ON, Canada, Apr. 1991, pp. 873-876.
-
(1991)
Proc. 1991 IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 873-876
-
-
Gish, H.1
Siu, M.H.2
Rohlicek, R.3
-
28
-
-
17444365032
-
Unsupervised speaker segmentation and tracking in real-time audio content analysis
-
Apr.
-
L. Lu and H. Zhang, "Unsupervised speaker segmentation and tracking in real-time audio content analysis," Multimedia Syst., vol. 10, no. 4, pp. 332-343, Apr. 2005.
-
(2005)
Multimedia Syst.
, vol.10
, Issue.4
, pp. 332-343
-
-
Lu, L.1
Zhang, H.2
-
29
-
-
85128356454
-
Partitioning and transcription of broadcast news data
-
Sydney, Australia Dec.
-
J.-L. Gauvain, L. Lamel, and G. Adda, "Partitioning and transcription of broadcast news data," in Proc. 5th Int. Conf. Spoken Lang. Process., Sydney, Australia, Dec. 1998, pp. 1335-1338.
-
(1998)
Proc. 5th Int. Conf. Spoken Lang. Process.
, pp. 1335-1338
-
-
Gauvain, J.-L.1
Lamel, L.2
Adda, G.3
-
31
-
-
84875953283
-
Clustering via the Bayesian information criterion with applications in speech recognition
-
Seattle, WA May
-
S. S. Chen and P. S. Gopalakrishnan, "Clustering via the Bayesian information criterion with applications in speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, vol. 2, pp. 645-648.
-
(1998)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.2
, pp. 645-648
-
-
Chen, S.S.1
Gopalakrishnan, P.S.2
-
32
-
-
78650540904
-
Improved speaker segmentation and segments clustering using the Bayesian information criterion
-
Sep.
-
A. Tritschler and R. Gopinath, "Improved speaker segmentation and segments clustering using the Bayesian information criterion," in Proc. 6th Eur. Conf. Speech Commun. Techol., Sep. 1999, pp. 679-682.
-
(1999)
Proc. 6th Eur. Conf. Speech Commun. Techol.
, pp. 679-682
-
-
Tritschler, A.1
Gopinath, R.2
-
33
-
-
0034273195
-
DISTBIC:Aspeaker-based segmentation for audio data indexing
-
Sep.
-
P. Delacourt and C. J.Wellekens, "DISTBIC:Aspeaker-based segmentation for audio data indexing," Speech Commun., vol. 32, pp. 111-126, Sep. 2000.
-
(2000)
Speech Commun.
, vol.32
, pp. 111-126
-
-
Delacourt, P.1
Wellekens, C.J.2
-
35
-
-
77956526830
-
-
[Online].Available:
-
[Online]. Available: http://www.praat.org
-
-
-
-
36
-
-
0001835850
-
Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
-
P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound," in Proc. Inst. Phon. Sci., 1993, vol. 17, pp. 97-110.
-
(1993)
Proc. Inst. Phon. Sci.
, vol.17
, pp. 97-110
-
-
Boersma, P.1
-
37
-
-
66149116378
-
Computationally efficient and robust BIC-based speaker segmentation
-
Jul.
-
M. Kotti, E. Benetos, and C. Kotropoulos, "Computationally efficient and robust BIC-based speaker segmentation," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 5, pp. 920-933, Jul. 2008.
-
(2008)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.16
, Issue.5
, pp. 920-933
-
-
Kotti, M.1
Benetos, E.2
Kotropoulos, C.3
-
38
-
-
70350451584
-
Robust detection of phone segments in continuous speech using model selection criteria with few observations
-
Feb.
-
G. Almpanidis, M. Kotti, and C. Kotropoulos, "Robust detection of phone segments in continuous speech using model selection criteria with few observations," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 287-298, Feb. 2009.
-
(2009)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.17
, Issue.2
, pp. 287-298
-
-
Almpanidis, G.1
Kotti, M.2
Kotropoulos, C.3
-
40
-
-
21244468777
-
Combining multiple clusterings using evidence accumulation
-
Jun.
-
L. N. Fred and A. K. Jain, "Combining multiple clusterings using evidence accumulation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 6, pp. 835-850, Jun. 2005.
-
(2005)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.27
, Issue.6
, pp. 835-850
-
-
Fred, L.N.1
Jain, A.K.2
-
41
-
-
27544491443
-
A CLUE for CLUster ensembles
-
Sep.
-
K. Hornik, "A CLUE for CLUster ensembles," J. Statist. Software, vol. 14, no. 12, Sep. 2005.
-
(2005)
J. Statist. Software
, vol.14
, Issue.12
-
-
Hornik, K.1
-
42
-
-
33745772326
-
Cluster ensemble and its applications in gene expression analysis
-
X. Hu and I. Yoo, "Cluster ensemble and its applications in gene expression analysis," in ACM Int. Conf. Proc. Series, 2004, vol. 55, pp. 297-302.
-
(2004)
ACM Int. Conf. Proc. Series
, vol.55
, pp. 297-302
-
-
Hu, X.1
Yoo, I.2
-
43
-
-
84957012677
-
Finding consistent clusters in data partitions
-
New York: Springer, LNCS 2096
-
A. Fred, "Finding consistent clusters in data partitions," in Multiple Classifier Systems. New York: Springer, 2001, vol. LNCS 2096, pp. 309-318.
-
(2001)
Multiple Classifier Systems
, pp. 309-318
-
-
Fred, A.1
-
44
-
-
0038391443
-
Bagging to improve the accuracy of a clustering procedure
-
S. Dudoit and J. Fridlyand, "Bagging to improve the accuracy of a clustering procedure," BioInformatics, vol. 19, no. 9, pp. 1090-1099, 2003.
-
(2003)
BioInformatics
, vol.19
, Issue.9
, pp. 1090-1099
-
-
Dudoit, S.1
Fridlyand, J.2
-
45
-
-
0442296539
-
Bagging for path-based clustering
-
Nov.
-
B. Fischer and J. M. Buhmann, "Bagging for path-based clustering," IEEE Trans. Pattern Anal. Mach. Intell., vol. 25, no. 11, pp. 1411-1415, Nov. 2003.
-
(2003)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.25
, Issue.11
, pp. 1411-1415
-
-
Fischer, B.1
Buhmann, J.M.2
-
47
-
-
0003882879
-
-
Providence, RI: American Mathematical Society
-
F. R. K. Chung, Spectral Graph Theory. Providence, RI: American Mathematical Society, 1997.
-
(1997)
Spectral Graph Theory
-
-
Chung, F.R.K.1
-
48
-
-
34548583274
-
A tutorial on spectral clustering
-
U. von Luxburg, "A tutorial on spectral clustering," Statist. Comput., vol. 17, no. 4, pp. 395-416, 2007.
-
(2007)
Statist. Comput.
, vol.17
, Issue.4
, pp. 395-416
-
-
Von Luxburg, U.1
-
49
-
-
44949264065
-
A spectral clustering approach to speaker diarization
-
Pittsburgh, PA
-
H. Ning, M. Liu, H. Tang, and T. S. Huang, "A spectral clustering approach to speaker diarization," in Proc. 9th Int. Conf. Spoken Lang. Process. (ICSLP), Pittsburgh, PA, 2006.
-
(2006)
Proc. 9th Int. Conf. Spoken Lang. Process. (ICSLP)
-
-
Ning, H.1
Liu, M.2
Tang, H.3
Huang, T.S.4
-
50
-
-
77956548258
-
2004 RT-03 MDE training data speech
-
Philadelphia, PA [Online]. Available:
-
S. Strassel, C. Walker, and H. Lee, "2004 RT-03 MDE training data speech," in Linguist. Data Consortium, Philadelphia, PA [Online]. Available: http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId= LDC2004S08
-
Linguist. Data Consortium
-
-
Strassel, S.1
Walker, C.2
Lee, H.3
-
51
-
-
77956499309
-
MUSCLE movie-database: A multimodal corpus with rich annotation for dialogue and saliency detection
-
Marrakech, Morocco May
-
D. Spachos, A. Zlantintsi, V. Moschou, P. Antonopoulos, K. Tzimouli, E. Benetos, M. Kotti, C. Kotropoulos, N. Nikolaidis, P. Maragos, and I. Pitas, "MUSCLE movie-database: A multimodal corpus with rich annotation for dialogue and saliency detection," in Proc. LREC 2008 Workshop Multimodal Corpora, Marrakech, Morocco, May 26-27, 2008.
-
(2008)
Proc. LREC 2008 Workshop Multimodal Corpora
, pp. 26-27
-
-
Spachos, D.1
Zlantintsi, A.2
Moschou, V.3
Antonopoulos, P.4
Tzimouli, K.5
Benetos, E.6
Kotti, M.7
Kotropoulos, C.8
Nikolaidis, N.9
Maragos, P.10
Pitas, I.11
|