Volume 1, 2006

Fast and robust speaker clustering using the earth mover's distance and MIXMAX models

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC WAVE VELOCITY; HIERARCHICAL SYSTEMS; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD; VIDEO SIGNAL PROCESSING

EID: 33947649029; PISSN: 15206149; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper
Times cited: 14

References (13)
  • 2
    • 0028420014
    • Integrated Models of Signal and Background with Application to Speaker Identification in Noise
    • R.C. Rose, E.M. Hofstetter, and D.A. Reynolds, "Integrated Models of Signal and Background with Application to Speaker Identification in Noise," IEEE Trans. on Speech and Audio Processing, vol. 2, pp. 245-258, 1994.
    • (1994) IEEE Trans. on Speech and Audio Processing , vol.2 , pp. 245-258
    • Rose, R.C.; Hofstetter, E.M.; Reynolds, D.A.
  • 5
    • 84946742526
    • A Robust Speaker Clustering Algorithm
    • J. Ajmera and C. Wooters, "A Robust Speaker Clustering Algorithm," in IEEE ASRU Workshop, 2003, pp. 411-416.
    • (2003) IEEE ASRU Workshop , pp. 411-416
    • Ajmera, J.; Wooters, C.
  • 6
    • 4544247119
    • Online speaker clustering
    • D. Liu and F. Kubala, "Online speaker clustering," in IEEE Proc. of ICASSP, 2004, vol. 1, pp. 333-336.
    • (2004) IEEE Proc. of ICASSP , vol.1 , pp. 333-336
    • Liu, D.; Kubala, F.
  • 7
    • 33745207347
    • A Distance Measure Between GMMs Based on the Unscented Transform and its Application to Speaker Recognition
    • J. Goldberger and H. Aronowitz, "A Distance Measure Between GMMs Based on the Unscented Transform and its Application to Speaker Recognition," in Proc. of Interspeech, 2005, pp. 1985-1989.
    • (2005) Proc. of Interspeech , pp. 1985-1989
    • Goldberger, J.; Aronowitz, H.
  • 8
    • 84892185828
    • A Distance Measure Between Collections of Distributions and its Application to Speaker Recognition
    • H.S.M. Beigi, S.H. Maes, and J.S. Sorensen, "A Distance Measure Between Collections of Distributions and its Application to Speaker Recognition," in IEEE Proc. of ICASSP, 1998, vol. 2, pp. 753-756.
    • (1998) IEEE Proc. of ICASSP , vol.2 , pp. 753-756
    • Beigi, H.S.M.; Maes, S.H.; Sorensen, J.S.
  • 10
    • 32844474859
    • Language-Adaptive Persian Speech Recognition
    • N. Srinivasamurthy and S. Narayanan, "Language-Adaptive Persian Speech Recognition," in Proc. of EUROSPEECH, 2003, pp. 3137-3140.
    • (2003) Proc. of EUROSPEECH , pp. 3137-3140
    • Srinivasamurthy, N.; Narayanan, S.
  • 11
    • 33947649429
    • Description of MPEG-7 Content Set
    • MPEG-7 Requirement Group
    • MPEG-7 Requirement Group, "Description of MPEG-7 Content Set," ISO/IEC JTC1/SC29/WG11/N2467, 1998.
    • (1998) ISO/IEC JTC1/SC29/WG11/N2467
  • 12
    • 0034229795
    • A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress
    • S.E. Bou-Ghazale and J.H.L. Hansen, "A Comparative Study of Traditional and Newly Proposed Features for Recognition of Speech Under Stress," IEEE Trans. on Speech and Audio Processing, vol. 8, pp. 429-442, 2000.
    • (2000) IEEE Trans. on Speech and Audio Processing , vol.8 , pp. 429-442
    • Bou-Ghazale, S.E.; Hansen, J.H.L.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.