-
3
-
-
0024220237
-
Auto association by multi layer perceptrons and singular value decomposition
-
Bourlard H., and Kamp Y. Auto association by multi layer perceptrons and singular value decomposition. Biol. Cybernet. 59 (1988) 291-294
-
(1988)
Biol. Cybernet.
, vol.59
, pp. 291-294
-
-
Bourlard, H.1
Kamp, Y.2
-
6
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Davis S.B., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28 (1980) 357-366
-
(1980)
IEEE Trans. Acoust. Speech Signal Process.
, vol.28
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
7
-
-
0034273195
-
DISTBIC: a speaker based segmentation for audio data indexing
-
Delacourt P., and Wellekens C. DISTBIC: a speaker based segmentation for audio data indexing. Speech Commun. 32 (2000) 111-126
-
(2000)
Speech Commun.
, vol.32
, pp. 111-126
-
-
Delacourt, P.1
Wellekens, C.2
-
8
-
-
41149119412
-
Speaker diarization using one-class support vector machines
-
Fergani B., Davy M., and Houacine A. Speaker diarization using one-class support vector machines. Speech Commun. 50 (2008) 355-365
-
(2008)
Speech Commun.
, vol.50
, pp. 355-365
-
-
Fergani, B.1
Davy, M.2
Houacine, A.3
-
13
-
-
0026113980
-
Nonlinear principal component analysis using auto associative neural networks
-
Kramer M.A. Nonlinear principal component analysis using auto associative neural networks. AIChE 37 (1991) 233-243
-
(1991)
AIChE
, vol.37
, pp. 233-243
-
-
Kramer, M.A.1
-
15
-
-
29044442235
-
Step by step and integrated approaches in broadcast news speaker diarization
-
Meignier S., Moraru D., Fredouille C., Bonastre J.F., and Besacier L. Step by step and integrated approaches in broadcast news speaker diarization. Comput. Speech Lang. 20 (2006) 303-330
-
(2006)
Comput. Speech Lang.
, vol.20
, pp. 303-330
-
-
Meignier, S.1
Moraru, D.2
Fredouille, C.3
Bonastre, J.F.4
Besacier, L.5
-
17
-
-
67349254637
-
-
Fall
-
NIST, 2004. Fall 2004 Rich Transcription (RT-04F) 〈www.nist.gov/speech/tests/rt/rt2004/fall/docs/ rto4feval-plan-v14.pdf〉.
-
(2004)
2004 Rich Transcription (RT-04F)
-
-
-
18
-
-
67349141382
-
-
Ph.D. Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras
-
Palanivel, S., 2004. Person authentication using speech, face and visual speech. Ph.D. Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras.
-
(2004)
Person authentication using speech, face and visual speech
-
-
Palanivel, S.1
-
19
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
special issue on NIST 1999 Speaker Recognition Workshop
-
Reynolds D.A., Quatieri T.F., and Dunn R.B. Speaker verification using adapted Gaussian mixture models. Digital Signal Process. Rev. J. 10 1-3 (2000) 19-41 special issue on NIST 1999 Speaker Recognition Workshop
-
(2000)
Digital Signal Process. Rev. J.
, vol.10
, Issue.1-3
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
20
-
-
0002782496
-
Automatic segmentation, classification and clustering of broadcast news audio
-
Sieglar M., Jain U., Raj B., and Stern R. Automatic segmentation, classification and clustering of broadcast news audio. Proceedings of the DARPA Speech Recognition Workshop (1997) 97-99
-
(1997)
Proceedings of the DARPA Speech Recognition Workshop
, pp. 97-99
-
-
Sieglar, M.1
Jain, U.2
Raj, B.3
Stern, R.4
-
22
-
-
85009265801
-
An unsupervised, sequential learning algorithm for segmentation of speech waveforms with multi speakers
-
Siu M.H., Rohlicek R., and Gish H. An unsupervised, sequential learning algorithm for segmentation of speech waveforms with multi speakers. Proc. of the IEEE International Conference on Acoustic, Speech, and Signal Processing (1992) 189-192
-
(1992)
Proc. of the IEEE International Conference on Acoustic, Speech, and Signal Processing
, pp. 189-192
-
-
Siu, M.H.1
Rohlicek, R.2
Gish, H.3
-
23
-
-
84889324982
-
Clustering speakers by their voices
-
Solomonoff A., Mielke A., Schmidt M., and Gish H. Clustering speakers by their voices. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (1998) 757-760
-
(1998)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 757-760
-
-
Solomonoff, A.1
Mielke, A.2
Schmidt, M.3
Gish, H.4
-
27
-
-
0035989168
-
AANN: an alternative to GMM for pattern recognition
-
Yegnanarayana B., and Kishore S.P. AANN: an alternative to GMM for pattern recognition. Neural Networks 15 (2002) 459-469
-
(2002)
Neural Networks
, vol.15
, pp. 459-469
-
-
Yegnanarayana, B.1
Kishore, S.P.2
|