SCOPUS 정보 검색 플랫폼

Volumn 1, Issue , 2006, Pages

Fast and robust speaker clustering using the earth mover's distance and MIXMAX models

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC WAVE VELOCITY; HIERARCHICAL SYSTEMS; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD; VIDEO SIGNAL PROCESSING;

LIKELIHOOD RATIOS; SPEAKER CLUSTERING; SPEAKER MODELS;

SPEECH RECOGNITION;

EID: 33947649029 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (14)

References (13)

1
- 0034313871
- The Earth Mover's Distance as a Metric for Image Retrieval
- Y. Rubner, C. Tomasi, and L.J. Guibas, "The Earth Mover's Distance as a Metric for Image Retrieval," International Journal of Computer Vision, vol. 40, pp. 99-121, 2000.
- (2000) International Journal of Computer Vision , vol.40 , pp. 99-121
- Rubner, Y.¹ Tomasi, C.² Guibas, L.J.³

2
- 0028420014
- Integrated Models of Signal and Background with Application to Speaker Identification in Noise
- R.C. Rose, E.M. Hofstetter, and D.A. Reynolds, "Integrated Models of Signal and Background with Application to Speaker Identification in Noise," IEEE Trans. on Speech and Audio Processing, vol. 2, pp. 245-258, 1994.
- (1994) IEEE Trans. on Speech and Audio Processing , vol.2 , pp. 245-258
- Rose, R.C.¹ Hofstetter, E.M.² Reynolds, D.A.³

3
- 0003128649
- Automatic Speaker Clustering
- H. Jin, F. Kubala, and R. Schwartz, "Automatic Speaker Clustering," in Proc. of the DARPA Speech Recognition Workshop, 1997. pp. 108-111.
- (1997) Proc. of the DARPA Speech Recognition Workshop , pp. 108-111
- Jin, H.¹ Kubala, F.² Schwartz, R.³

4
- 84889324982
- Clustering Speakers by Their Voices
- A. Solomonoff, A. Mielke, M. Schmidt, and H. Gish, "Clustering Speakers by Their Voices," in IEEE Proc. of ICASSP, 1998. pp. 757-760.
- (1998) IEEE Proc. of ICASSP , pp. 757-760
- Solomonoff, A.¹ Mielke, A.² Schmidt, M.³ Gish, H.⁴

5
- 84946742526
- A Robust Speaker Clustering Algorithm
- J. Ajmera and C. Wooters, "A Robust Speaker Clustering Algorithm," in IEEE ASRU Workshop, 2003, pp. 411-416.
- (2003) IEEE ASRU Workshop , pp. 411-416
- Ajmera, J.¹ Wooters, C.²

6
- 4544247119
- Online speaker clustering
- D. Liu and F. Kubala, "Online speaker clustering," in IEEE Proc. of ICASSP, 2004, vol. 1, pp. 333-336.
- (2004) IEEE Proc. of ICASSP , vol.1 , pp. 333-336
- Liu, D.¹ Kubala, F.²

7
- 33745207347
- A Distance Measure Between GMMs Based on the Unscented Transform and its Application to Speaker Recognition
- J. Goldberger and H. Aronowitz, "A Distance Measure Between GMMs Based on the Unscented Transform and its Application to Speaker Recognition," in Proc. of Interspeech, 2005, pp. 1985-1989.
- (2005) Proc. of Interspeech , pp. 1985-1989
- Goldberger, J.¹ Aronowitz, H.²

8
- 84892185828
- A Distance Measure Between Collections of Distributions and its Application to Speaker Recognition
- H.S.M. Beigi, S.H. Maes, and J.S. Sorensen, "A Distance Measure Between Collections of Distributions and its Application to Speaker Recognition," in IEEE Proc. of ICASSP, 1998, vol. 2, pp. 753-756.
- (1998) IEEE Proc. of ICASSP , vol.2 , pp. 753-756
- Beigi, H.S.M.¹ Maes, S.H.² Sorensen, J.S.³

9
- 27844537742
- Ph.D. thesis, Technical University of Kaiserslautern, Germany
- S. Baumann, Artifcial Listening Systems: Modellierung und Approximation der individuellen Perception von Musikähnlichkeit, Ph.D. thesis, Technical University of Kaiserslautern, Germany, 2005.
- (2005) Artifcial Listening Systems: Modellierung und Approximation der individuellen Perception von Musikähnlichkeit
- Baumann, S.¹

10
- 32844474859
- Language-Adaptive Persian Speech Recognition
- N. Srinivasamurthy and S. Narayanan, "Language-Adaptive Persian Speech Recognition," in Proc. of EUROSPEECH, 2003, pp. 3137-3140.
- (2003) Proc. of EUROSPEECH , pp. 3137-3140
- Srinivasamurthy, N.¹ Narayanan, S.²

11
- 33947649429
- Description of MPEG-7 Content Set
- MPEG-7 Requirement Group
- MPEG-7 Requirement Group, "Description of MPEG-7 Content Set," ISO/IEC JTC1/SC29/WG11/N2467, 1998.
- (1998) ISO/IEC JTC1/SC29/WG11/N2467

12
- 0034229795
- A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress
- S.E. Bou-Ghazale and J.H.L. Hansen, "A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress," IEEE Trans. on Speech and Audio Processing, vol. 8, pp. 429-442, 2000.
- (2000) IEEE Trans. on Speech and Audio Processing , vol.8 , pp. 429-442
- Bou-Ghazale, S.E.¹ Hansen, J.H.L.²

13
- 4043050072
- Content-Based Movie Analysis and Indexing Based on Audiovisual Cues
- Y. Li, S.S. Narayanan, and C.C.J. Kuo, "Content-Based Movie Analysis and Indexing Based on Audiovisual Cues," IEEE Trans. on Circuits and Systems for Video Technology, vol. 14, pp. 1073-1085, 2004.
- (2004) IEEE Trans. on Circuits and Systems for Video Technology , vol.14 , pp. 1073-1085
- Li, Y.¹ Narayanan, S.S.² Kuo, C.C.J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.