-
1
-
-
0030247355
-
Robust speaker recognition: A feature-based approach
-
Sept.
-
R. J. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition: a feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sept. 1996.
-
(1996)
IEEE Signal Process. Mag.
, vol.13
, Issue.5
, pp. 58-71
-
-
Mammone, R.J.1
Zhang, X.2
Ramachandran, R.3
-
2
-
-
0031233424
-
Speaker recognition: A tutorial
-
Sep.
-
J. P. Campbell, "Speaker recognition: a tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1436-1462, Sep. 1997.
-
(1997)
Proc. IEEE
, vol.85
, Issue.9
, pp. 1436-1462
-
-
Campbell, J.P.1
-
3
-
-
85009282223
-
Speaker change detection using a new weighted distance measure
-
S. Kwon and S. Narayanan, "Speaker change detection using a new weighted distance measure," in Proc. Int. Conf. Spoken Language Processing, vol. 4, 2002, pp. 2537-2540.
-
(2002)
Proc. Int. Conf. Spoken Language Processing
, vol.4
, pp. 2537-2540
-
-
Kwon, S.1
Narayanan, S.2
-
4
-
-
0033279185
-
Multi-modal people ID for a multimedia meeting browser
-
J. Yang, X. Zhu, R. Gross, J. Kominek, Y. Pan, and A. Waibel, "Multi-modal people ID for a multimedia meeting browser," in Proc. 7th ACM Int. Conf. Multimedia, Part 1, 1999, pp. 159-168.
-
(1999)
Proc. 7th ACM Int. Conf. Multimedia, Part 1
, pp. 159-168
-
-
Yang, J.1
Zhu, X.2
Gross, R.3
Kominek, J.4
Pan, Y.5
Waibel, A.6
-
5
-
-
85009266843
-
Unsupervised speaker segmentation of telephone conversations
-
A. Rosenberg, A. Gorin, and S. Parthasarathy, "Unsupervised speaker segmentation of telephone conversations," in Proc. Int. Conf. Spoken Language Processing, vol. 1, 2002, pp. 565-568.
-
(2002)
Proc. Int. Conf. Spoken Language Processing
, vol.1
, pp. 565-568
-
-
Rosenberg, A.1
Gorin, A.2
Parthasarathy, S.3
-
6
-
-
33846278175
-
A method for on-line speaker indexing using generic reference models
-
S. Kwon and S. Narayanan, "A method for on-line speaker indexing using generic reference models," in Proc. Eurospeech 2003, 2003, pp. 2653-2656.
-
(2003)
Proc. Eurospeech 2003
, pp. 2653-2656
-
-
Kwon, S.1
Narayanan, S.2
-
8
-
-
0036816475
-
Content analysis for audio classification and segmemtation
-
L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmemtation," IEEE Trans. Speech Audio Process., vol. 10, no. 7, pp. 504-516, 2002.
-
(2002)
IEEE Trans. Speech Audio Process.
, vol.10
, Issue.7
, pp. 504-516
-
-
Lu, L.1
Zhang, H.-J.2
Jiang, H.3
-
9
-
-
0032659936
-
Speaker indexing for news articles, debates, and drama in broadcasted TV programs
-
M. Nishida and Y. Ariki, "Speaker indexing for news articles, debates, and drama in broadcasted TV programs," in Proc. IEEE Int. Conf. Multimedia Computing and Systems, vol. 2, 1999, pp. 466-471.
-
(1999)
Proc. IEEE Int. Conf. Multimedia Computing and Systems
, vol.2
, pp. 466-471
-
-
Nishida, M.1
Ariki, Y.2
-
10
-
-
84889324982
-
Clustering speakers by their voices
-
A. Solomonoff, A. Mielke, M. Schmidt, and H. Gish, "Clustering speakers by their voices," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 2, 1998, pp. 12-15.
-
(1998)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.2
, pp. 12-15
-
-
Solomonoff, A.1
Mielke, A.2
Schmidt, M.3
Gish, H.4
-
11
-
-
0141478771
-
UBM-based real-time speaker segmentation for broadcasting news
-
T. Wu, L. Lu, K. Chen, and H. Zhang, "UBM-based real-time speaker segmentation for broadcasting news," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 2, 2003, pp. 193-196.
-
(2003)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.2
, pp. 193-196
-
-
Wu, T.1
Lu, L.2
Chen, K.3
Zhang, H.4
-
12
-
-
0141814603
-
Online speaker clustering
-
D. Liu and F. Kubala, "Online speaker clustering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, 2003, pp. 572-575.
-
(2003)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 572-575
-
-
Liu, D.1
Kubala, F.2
-
13
-
-
0001341735
-
Introduction to Monte Carlo methods
-
M. I. Jordan, Ed. Cambridge, MA: MIT Press
-
D. J. C. MacKay, "Introduction to Monte Carlo methods," in Learning in Graphical Models, M. I. Jordan, Ed. Cambridge, MA: MIT Press, 1999, pp. 175-204.
-
(1999)
Learning in Graphical Models
, pp. 175-204
-
-
MacKay, D.J.C.1
-
14
-
-
0033709111
-
Supervised classification using MCMC methods
-
M. Davy, C. Doncarli, and J. Tourneret, "Supervised classification using MCMC methods," in Proc. Int. Conf. Acoustics, Speech and Signal Processing ( ICASSP'2000), 2000, pp. 33-36.
-
(2000)
Proc. Int. Conf. Acoustics, Speech and Signal Processing ( ICASSP'2000)
, pp. 33-36
-
-
Davy, M.1
Doncarli, C.2
Tourneret, J.3
-
17
-
-
0002615167
-
Speaker adaptation: Techniques and challenges
-
Keystone, CO, Dec.
-
P. C. Woodland, "Speaker adaptation: techniques and challenges," in Proc. IEEE Workshop Automatic Speech Recognition and Understanding, Keystone, CO, Dec. 1999, pp. 85-90.
-
(1999)
Proc. IEEE Workshop Automatic Speech Recognition and Understanding
, pp. 85-90
-
-
Woodland, P.C.1
-
18
-
-
0009577929
-
Cohorts based custom models for rapid speaker and dialect adaptation
-
J. Wu and E. Chang, "Cohorts based custom models for rapid speaker and dialect adaptation," in Proc. Eurospeech, 2001, pp. 1261-1264.
-
(2001)
Proc. Eurospeech
, pp. 1261-1264
-
-
Wu, J.1
Chang, E.2
-
19
-
-
0033707070
-
A speaker tracking system based on speaker turn detection for NIST evaluation
-
J.-F. Bonastre, C. Delacourt, T. Fredouille, T. Merlin, and C. Wellekens, "A speaker tracking system based on speaker turn detection for NIST evaluation," in Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, 2000, pp. 1177-1180.
-
(2000)
Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing
, pp. 1177-1180
-
-
Bonastre, J.-F.1
Delacourt, C.2
Fredouille, T.3
Merlin, T.4
Wellekens, C.5
-
20
-
-
85009251523
-
Hierarchical Gaussian mixture model for speaker verification
-
M. Liu, E. Chang, and B.-Q. Dai, "Hierarchical Gaussian mixture model for speaker verification," in Proc. Int. Conf. Spoken Language Processing, vol. 2, 2002, pp. 1353-1356.
-
(2002)
Proc. Int. Conf. Spoken Language Processing
, vol.2
, pp. 1353-1356
-
-
Liu, M.1
Chang, E.2
Dai, B.-Q.3
-
21
-
-
0034857759
-
Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
-
K. Mori and S. Nakagawa, "Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2001, pp. 413-416.
-
(2001)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 413-416
-
-
Mori, K.1
Nakagawa, S.2
-
22
-
-
0002595416
-
Speaker, environment, and channel change detection and clustering via the Bayesian information criterion
-
S. Chen and P. Gopalakrishnan, "Speaker, environment, and channel change detection and clustering via the Bayesian information criterion," in Proc. DARPA Speech Recognition Workshop, 1998, pp. 127-132.
-
(1998)
Proc. DARPA Speech Recognition Workshop
, pp. 127-132
-
-
Chen, S.1
Gopalakrishnan, P.2
-
23
-
-
85009089453
-
Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
-
B. Zhou and J. H. L. Hansen, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 714-717.
-
(2000)
Proc. Int. Conf. Spoken Language Processing
, pp. 714-717
-
-
Zhou, B.1
Hansen, J.H.L.2
|