SCOPUS Information Retrieval Platform

Engineering Applications of Artificial Intelligence

Volume 22, Issue 4-5, 2009, Pages 667-675

Speaker diarization using autoassociative neural networks

(3) Jothilakshmi, S.; Ramalingam, V.; Palanivel, S.

Author keywords

Autoassociative neural networks; Mel frequency cepstral coefficients; Speaker clustering; Speaker diarization; Speaker segmentation

Indexed keywords

AUTOASSOCIATIVE NEURAL NETWORKS; MEL FREQUENCY CEPSTRAL COEFFICIENTS; SPEAKER CLUSTERING; SPEAKER DIARIZATION; SPEAKER SEGMENTATION;

FEATURE EXTRACTION; SPEECH RECOGNITION;

NEURAL NETWORKS;

EID: 67349120575 PISSN: 09521976 EISSN: None Source Type: Journal
DOI: 10.1016/j.engappai.2009.01.012 Document Type: Article

Times cited: (24)

References (27)

1
- 84946742526
- A robust speaker clustering algorithm
- Ajmera J., and Wooters C. A robust speaker clustering algorithm. Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (2003) 411-416
- (2003) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 411-416
- Ajmera, J.¹ Wooters, C.²

2
- 0029267621
- Learning in multilayered networks used as autoassociators
- Bianchini M., Frasconi P., and Gori M. Learning in multilayered networks used as autoassociators. IEEE Trans. Neural Networks 6 (1995) 512-515
- (1995) IEEE Trans. Neural Networks , vol.6 , pp. 512-515
- Bianchini, M.¹ Frasconi, P.² Gori, M.³

3
- 0024220237
- Auto association by multi layer perceptrons and singular value decomposition
- Bourlard H., and Kamp Y. Auto association by multi layer perceptrons and singular value decomposition. Biol. Cybernet. 59 (1988) 291-294
- (1988) Biol. Cybernet. , vol.59 , pp. 291-294
- Bourlard, H.¹ Kamp, Y.²

4
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- Chen S.S., and Gopalakrishnan P. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop (1998) 127-132
- (1998) Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
- Chen, S.S.¹ Gopalakrishnan, P.²

5
- 85009128756
- Metric SEQDAC: a hybrid approach for audio segmentation
- Cheng S., and Wang H. Metric SEQDAC: a hybrid approach for audio segmentation. Proceedings of the 8th International Conference on Spoken Language Processing (2004) 1617-1620
- (2004) Proceedings of the 8th International Conference on Spoken Language Processing , pp. 1617-1620
- Cheng, S.¹ Wang, H.²

6
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis S.B., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28 (1980) 357-366
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

7
- 0034273195
- DISTBIC: a speaker based segmentation for audio data indexing
- Delacourt P., and Wellekens C. DISTBIC: a speaker based segmentation for audio data indexing. Speech Commun. 32 (2000) 111-126
- (2000) Speech Commun. , vol.32 , pp. 111-126
- Delacourt, P.¹ Wellekens, C.²

8
- 41149119412
- Speaker diarization using one-class support vector machines
- Fergani B., Davy M., and Houacine A. Speaker diarization using one-class support vector machines. Speech Commun. 50 (2008) 355-365
- (2008) Speech Commun. , vol.50 , pp. 355-365
- Fergani, B.¹ Davy, M.² Houacine, A.³

9
- 85128356454
- Partitioning and transcription of broadcast news data
- Gauvain J.L., Lamel L., and Adda G. Partitioning and transcription of broadcast news data. International Conference on Spoken Language Processing (1998) 1335-1338
- (1998) International Conference on Spoken Language Processing , pp. 1335-1338
- Gauvain, J.L.¹ Lamel, L.² Adda, G.³

10
- 0003413187
- Neural Networks: A Comprehensive Foundation
- Haykin S. Neural Networks: A Comprehensive Foundation (1999), Prentice-Hall, Englewood Cliffs, NJ
- (1999) Neural Networks: A Comprehensive Foundation
- Haykin, S.¹

11
- 0003998592
- Speaker verification using autoassociative neural networks model
- Kishore, S.P., 2000. Speaker verification using autoassociative neural networks model. M.S. Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras.
- (2000) Speaker verification using autoassociative neural networks model
- Kishore, S.P.¹

12
- 38949122754
- Speaker segmentation and clustering
- Kotti M., Moschou V., and Kotropoulos C. Speaker segmentation and clustering. Signal Process. 88 (2007) 1091-1124
- (2007) Signal Process. , vol.88 , pp. 1091-1124
- Kotti, M.¹ Moschou, V.² Kotropoulos, C.³

13
- 0026113980
- Nonlinear principal component analysis using auto associative neural networks
- Kramer M.A. Nonlinear principal component analysis using auto associative neural networks. AIChE J. 37 (1991) 233-243
- (1991) AIChE J. , vol.37 , pp. 233-243
- Kramer, M.A.¹

14
- 0141809272
- E-HMM approach for learning and adapting sound models for speaker indexing
- Meignier S., Bonastre J.F., and Igounet S. E-HMM approach for learning and adapting sound models for speaker indexing. Proceedings of the Speaker Odyssey-The Speaker Recognition Workshop (2001) 175-180
- (2001) Proceedings of the Speaker Odyssey-The Speaker Recognition Workshop , pp. 175-180
- Meignier, S.¹ Bonastre, J.F.² Igounet, S.³

15
- 29044442235
- Step by step and integrated approaches in broadcast news speaker diarization
- Meignier S., Moraru D., Fredouille C., Bonastre J.F., and Besacier L. Step by step and integrated approaches in broadcast news speaker diarization. Comput. Speech Lang. 20 (2006) 303-330
- (2006) Comput. Speech Lang. , vol.20 , pp. 303-330
- Meignier, S.¹ Moraru, D.² Fredouille, C.³ Bonastre, J.F.⁴ Besacier, L.⁵

16
- 33947623018
- Using a priori information for speaker diarization
- Moraru D., Besacier L., and Castelli E. Using a priori information for speaker diarization. Proceedings of the Odyssey 2004 Workshop on Speaker Recognition (2004) 355-362
- (2004) Proceedings of the Odyssey 2004 Workshop on Speaker Recognition , pp. 355-362
- Moraru, D.¹ Besacier, L.² Castelli, E.³

17
- 67349254637
- Fall 2004 Rich Transcription (RT-04F)
- NIST, 2004. Fall 2004 Rich Transcription (RT-04F) 〈www.nist.gov/speech/tests/rt/rt2004/fall/docs/rto4feval-plan-v14.pdf〉.
- (2004) Fall 2004 Rich Transcription (RT-04F)

18
- 67349141382
- Person authentication using speech, face and visual speech
- Palanivel, S., 2004. Person authentication using speech, face and visual speech. Ph.D. Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras.
- (2004) Person authentication using speech, face and visual speech
- Palanivel, S.¹

19
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- special issue on NIST 1999 Speaker Recognition Workshop
- Reynolds D.A., Quatieri T.F., and Dunn R.B. Speaker verification using adapted Gaussian mixture models. Digital Signal Process. Rev. J. 10 1-3 (2000) 19-41 special issue on NIST 1999 Speaker Recognition Workshop
- (2000) Digital Signal Process. Rev. J. , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

20
- 0002782496
- Automatic segmentation, classification and clustering of broadcast news audio
- Siegler M., Jain U., Raj B., and Stern R. Automatic segmentation, classification and clustering of broadcast news audio. Proceedings of the DARPA Speech Recognition Workshop (1997) 97-99
- (1997) Proceedings of the DARPA Speech Recognition Workshop , pp. 97-99
- Siegler, M.¹ Jain, U.² Raj, B.³ Stern, R.⁴

21
- 33745200276
- The Cambridge University March 2005 speaker diarization system
- Sinha R., Tranter S.E., Gales M.J.F., and Woodland P.C. The Cambridge University March 2005 speaker diarization system. Proceedings of the European Conference on Speech Communications and Technology (2005) 2437-2440
- (2005) Proceedings of the European Conference on Speech Communications and Technology , pp. 2437-2440
- Sinha, R.¹ Tranter, S.E.² Gales, M.J.F.³ Woodland, P.C.⁴

22
- 85009265801
- An unsupervised, sequential learning algorithm for segmentation of speech waveforms with multi speakers
- Siu M.H., Rohlicek R., and Gish H. An unsupervised, sequential learning algorithm for segmentation of speech waveforms with multi speakers. Proc. of the IEEE International Conference on Acoustic, Speech, and Signal Processing (1992) 189-192
- (1992) Proc. of the IEEE International Conference on Acoustic, Speech, and Signal Processing , pp. 189-192
- Siu, M.H.¹ Rohlicek, R.² Gish, H.³

23
- 84889324982
- Clustering speakers by their voices
- Solomonoff A., Mielke A., Schmidt M., and Gish H. Clustering speakers by their voices. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (1998) 757-760
- (1998) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 757-760
- Solomonoff, A.¹ Mielke, A.² Schmidt, M.³ Gish, H.⁴

24
- 34047261805
- An overview of automatic speaker diarization systems
- Tranter S.E., and Reynolds D.A. An overview of automatic speaker diarization systems. IEEE Trans. Audio Speech Lang. Process. 14 5 (2006) 1557-1565
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

25
- 85076109151
- Audio indexing using speaker identification
- Wilcox L., Kimber D., and Chen F. Audio indexing using speaker identification. Proceedings of the SPIE Conference on Automatic Systems for the Inspection and Identification of Humans (1994) 149-157
- (1994) Proceedings of the SPIE Conference on Automatic Systems for the Inspection and Identification of Humans , pp. 149-157
- Wilcox, L.¹ Kimber, D.² Chen, F.³

26
- 0004312284
- Artificial Neural Networks
- Yegnanarayana B. Artificial Neural Networks (1999), Prentice-Hall, New Delhi
- (1999) Artificial Neural Networks
- Yegnanarayana, B.¹

27
- 0035989168
- AANN: an alternative to GMM for pattern recognition
- Yegnanarayana B., and Kishore S.P. AANN: an alternative to GMM for pattern recognition. Neural Networks 15 (2002) 459-469
- (2002) Neural Networks , vol.15 , pp. 459-469
- Yegnanarayana, B.¹ Kishore, S.P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.