-
1
-
-
34047261805
-
An overview of automatic speaker diarization systems
-
S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Transactions on Audio, Speech &Language Processing, vol. 14, no. 5, pp. 1557-1565, 2006
-
(2006)
IEEE Transactions on Audio, Speech &Language Processing
, vol.14
, Issue.5
, pp. 1557-1565
-
-
Tranter, S.E.1
Reynolds, D.A.2
-
2
-
-
85008530405
-
Speaker diarization: A review of recent research
-
X. A. Miró, S. Bozonnet, N.W. D. Evans, C. Fredouille, G. Friedland, and O. Vinyals, "Speaker diarization: A review of recent research," IEEE Transactions on Audio, Speech &Language Processing, vol. 20, no. 2, pp. 356-370, 2012
-
(2012)
IEEE Transactions on Audio, Speech &Language Processing
, vol.20
, Issue.2
, pp. 356-370
-
-
Miró, X.A.1
Bozonnet, S.2
Evans, N.W.D.3
Fredouille, C.4
Friedland, G.5
Vinyals, O.6
-
3
-
-
84875953283
-
Clustering via the Bayesian information criterion with applications in speech recognition
-
(Seattle,WA)
-
S. S. Chen and P. S. Gopalakrishnan, "Clustering via the Bayesian information criterion with applications in speech recognition," in ICASSP, (Seattle,WA), pp. 645-648, 1998
-
(1998)
ICASSP
, pp. 645-648
-
-
Chen, S.S.1
Gopalakrishnan, P.S.2
-
4
-
-
84865770392
-
Speaker linking in large data sets
-
Brno, Czech Republic, June 28-July 1, 2010
-
D. A. Leeuwen, "Speaker linking in large data sets," in Odyssey 2010, Brno, Czech Republic, June 28-July 1, 2010, p. 35, 2010
-
(2010)
Odyssey 2010
, pp. 35
-
-
Leeuwen, D.A.1
-
5
-
-
84865759467
-
The speaker partitioning problem
-
Brno, Czech Republic, June 28-July 1, 2010
-
N. Brummer and E. Villiers, "The speaker partitioning problem," in Odyssey 2010, Brno, Czech Republic, June 28-July 1, 2010, p. 34, 2010
-
(2010)
Odyssey 2010
, pp. 34
-
-
Brummer, N.1
Villiers, E.2
-
6
-
-
84865729834
-
Partitioning of two-speaker conversation datasets
-
Florence, Italy, August 27-31, 2011
-
C. Vaquero, A. Ortega, and E. Lleida, "Partitioning of two-speaker conversation datasets," in INTERSPEECH, Florence, Italy, August 27-31, 2011, pp. 385-388, 2011
-
(2011)
INTERSPEECH
, pp. 385-388
-
-
Vaquero, C.1
Ortega, A.2
Lleida, E.3
-
7
-
-
84867606902
-
Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach
-
Kyoto, Japan, March 25-30, 2012
-
H. Ghaemmaghami, D. Dean, R. Vogt, and S. Sridharan, "Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach," in ICASSP 2012, Kyoto, Japan, March 25-30, 2012, pp. 4185-4188, 2012
-
(2012)
ICASSP 2012
, pp. 4185-4188
-
-
Ghaemmaghami, H.1
Dean, D.2
Vogt, R.3
Sridharan, S.4
-
8
-
-
84874227906
-
Speaker diarization and linking of large corpora
-
Miami, FL, USA, December 2-5, 2012
-
M. Ferras and H. Boudard, "Speaker diarization and linking of large corpora," in IEEE SLT, Miami, FL, USA, December 2-5, 2012, pp. 280-285, 2012
-
(2012)
IEEE SLT
, pp. 280-285
-
-
Ferras, M.1
Boudard, H.2
-
9
-
-
84865776156
-
Comparing multi-stage approaches for cross-show speaker diarization
-
Florence, Italy, August 27-31, 2011
-
V. Tran, V. B. Le, C. Barras, and L. Lamel, "Comparing multi-stage approaches for cross-show speaker diarization," in INTERSPEECH, Florence, Italy, August 27-31, 2011, pp. 1053-1056, 2011
-
(2011)
INTERSPEECH
, pp. 1053-1056
-
-
Tran, V.1
Le, V.B.2
Barras, C.3
Lamel, L.4
-
10
-
-
84865734172
-
Investigation of crossshow speaker diarization
-
Italy, August 27-31, 2011
-
Q. Yang, Q. Jin, and T. Schultz, "Investigation of crossshow speaker diarization," in INTERSPEECH Florence, Italy, August 27-31, 2011, pp. 2925-2928, 2011
-
(2011)
INTERSPEECH Florence
, pp. 2925-2928
-
-
Yang, Q.1
Jin, Q.2
Schultz, T.3
-
11
-
-
84906274473
-
An open-source state-of-The-art toolbox for broadcast news diarization
-
Lyon, France, August 25-29, 2013
-
M. Rouvier, G. Dupuy, P. Gay, E. el Khoury, T. Merlin, and S. Meignier, "An open-source state-of-The-art toolbox for broadcast news diarization," in INTERSPEECH, Lyon, France, August 25-29, 2013, pp. 1477-1481, 2013
-
(2013)
INTERSPEECH
, pp. 1477-1481
-
-
Rouvier, M.1
Dupuy, G.2
Gay, P.3
El Khoury, E.4
Merlin, T.5
Meignier, S.6
-
12
-
-
84973386174
-
Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news
-
(Genoa, Italy)
-
S. Galliano, E. Geoffrois, G. Gravier, J. F. Bonastre, D. Mostefa, and K. Choukri, "Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news," in LREC, (Genoa, Italy), pp. 139-142, 2006
-
(2006)
LREC
, pp. 139-142
-
-
Galliano, S.1
Geoffrois, E.2
Gravier, G.3
Bonastre, J.F.4
Mostefa, D.5
Choukri, K.6
-
13
-
-
84910061411
-
The first official REPERE evaluation
-
O. Galibert and J. Kahn, "The first official REPERE evaluation," in SLAM, 2013
-
(2013)
SLAM
-
-
Galibert, O.1
Kahn, J.2
-
14
-
-
84873834890
-
Speaker diarization of broadcast news in Albayzin 2010 evaluation campaign
-
M. Zelenak, H. Schulz, and J. Hernando, "Speaker diarization of broadcast news in Albayzin 2010 evaluation campaign," EURASIP Journal on Audio, Speech and Music Processing, vol. 19, pp. 1-9, 2012
-
(2012)
EURASIP Journal on Audio, Speech and Music Processing
, vol.19
, pp. 1-9
-
-
Zelenak, M.1
Schulz, H.2
Hernando, J.3
-
15
-
-
85010742974
-
The MGB challenge: Evaluating multi-genre broadcast media transcription
-
Scottsdale, AZ, 2015
-
P. Bell, M. J. F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M. Webster, and P. Woodland, "The MGB Challenge: Evaluating Multi-genre Broadcast Media Transcription," in ASRU 2015, Scottsdale, AZ, 2015, 2015
-
(2015)
ASRU 2015
-
-
Bell, P.1
Gales, M.J.F.2
Hain, T.3
Kilgour, J.4
Lanchantin, P.5
Liu, X.6
McParland, A.7
Renals, S.8
Saz, O.9
Webster, M.10
Woodland, P.11
-
16
-
-
84964437387
-
-
Accessed: 08-07-2015
-
"Diarisation error rate scoring code, NIST." http://www.itl.nist.gov/iad/mig/tests/rt/2006-spring/code/md-eval-v21.pl. Accessed: 08-07-2015
-
Diarisation Error Rate Scoring Code
-
-
-
17
-
-
84905238677
-
-
Brno University Accessed: 08-07-2015
-
"Neural Network Trainer TNet, Brno University." http://speech.fit.vutbr.cz/software/neural-networktrainer-tnet. Accessed: 08-07-2015
-
Neural Network Trainer TNet
-
-
-
18
-
-
85009254284
-
TRAPS-classifiers of temporal patterns
-
Sydney, Australia, 30th November-4th December 1998
-
H. Hermansky and S. Sharma, "TRAPS-classifiers of temporal patterns," in The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November-4th December 1998, 1998
-
(1998)
The 5th International Conference on Spoken Language Processing, Incorporating the 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre
-
-
Hermansky, H.1
Sharma, S.2
-
19
-
-
84946687643
-
Semi-supervised DNN training in meeting recognition
-
(South Lake Tahoe, CA)
-
P. Zhang, Y. Liu, and T. Hain, "Semi-supervised DNN training in meeting recognition," in Proceedings of SLT, (South Lake Tahoe, CA), 2014
-
(2014)
Proceedings of SLT
-
-
Zhang, P.1
Liu, Y.2
Hain, T.3
-
20
-
-
40249083942
-
The segmentation of multi-channel meeting recordings for automatic speech recognition
-
J. Dines, J. Vepa, and T. Hain, "The segmentation of multi-channel meeting recordings for automatic speech recognition," in Interspeech'06, 2006
-
(2006)
Interspeech'06
-
-
Dines, J.1
Vepa, J.2
Hain, T.3
-
21
-
-
70450152040
-
SHoUT, the university of twente submission to the n-best 2008 speech recognition evaluation for Dutch
-
Brighton, United Kingdom, September 6-10, 2009
-
M. Huijbregts, R. Ordelman, L. Werff, and F. M. G. Jong, "SHoUT, the university of twente submission to the n-best 2008 speech recognition evaluation for dutch," in INTERSPEECH, Brighton, United Kingdom, September 6-10, 2009, pp. 2575-2578, 2009
-
(2009)
INTERSPEECH
, pp. 2575-2578
-
-
Huijbregts, M.1
Ordelman, R.2
Werff, L.3
Jong, F.M.G.4
-
23
-
-
85008520364
-
Transcribing meetings with the AMIDA systems
-
T. Hain, L. Burget, J. Dines, P. Garner, F. Grezl, A. Hannani, M. Huijbregts, M. Karafiat, M. Lincoln, and V. Wan, "Transcribing meetings with the AMIDA systems," IEEE Transactions on Audio, Speech &Language Processing, vol. 20, no. 2, pp. 486-498, 2012
-
(2012)
IEEE Transactions on Audio, Speech &Language Processing
, vol.20
, Issue.2
, pp. 486-498
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garner, P.4
Grezl, F.5
Hannani, A.6
Huijbregts, M.7
Karafiat, M.8
Lincoln, M.9
Wan, V.10
-
24
-
-
84905269643
-
Using neural network front-ends on far field multiple microphones based speech recognition
-
May
-
Y. Liu, P. Zhang, and T. Hain, "Using neural network front-ends on far field multiple microphones based speech recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 5542-5546, May 2014
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
, pp. 5542-5546
-
-
Liu, Y.1
Zhang, P.2
Hain, T.3
-
25
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in INTERSPEECH, 2012
-
(2012)
INTERSPEECH
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
|