메뉴 건너뛰기




Volumn 13, Issue 5, 2005, Pages 1004-1013

Unsupervised speaker indexing using generic models

Author keywords

Generic models; Localized search algorithm (LSA); Markov chain Monte Carlo (MCMC) method; Maximum a posteriori (MAP); Sample speaker models (SSM); Universal background model (UBM); Universal gender models (UGM); Unsupervised speaker indexing

Indexed keywords

GENERIC MODELS; LOCALIZED SEARCH ALGORITHMS (LSA); MARKOV CHAIN MONTE CARLO (MCMC) METHODS; MAXIMUM A POSTERIORI; SAMPLE SPEAKER MODELS (SSM); UNSUPERVISED SPEAKER INDEXING; UNVERSAL BACKGROUND MODELS (UBM); UNVERSAL GENDER MODELS (UGM);

EID: 27644599375     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.851981     Document Type: Article
Times cited : (38)

References (23)
  • 1
    • 0030247355 scopus 로고    scopus 로고
    • Robust speaker recognition: A feature-based approach
    • Sept.
    • R. J. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition: a feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sept. 1996.
    • (1996) IEEE Signal Process. Mag. , vol.13 , Issue.5 , pp. 58-71
    • Mammone, R.J.1    Zhang, X.2    Ramachandran, R.3
  • 2
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • Sep.
    • J. P. Campbell, "Speaker recognition: a tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1436-1462, Sep. 1997.
    • (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1436-1462
    • Campbell, J.P.1
  • 3
    • 85009282223 scopus 로고    scopus 로고
    • Speaker change detection using a new weighted distance measure
    • S. Kwon and S. Narayanan, "Speaker change detection using a new weighted distance measure," in Proc. Int. Conf. Spoken Language Processing, vol. 4, 2002, pp. 2537-2540.
    • (2002) Proc. Int. Conf. Spoken Language Processing , vol.4 , pp. 2537-2540
    • Kwon, S.1    Narayanan, S.2
  • 6
    • 33846278175 scopus 로고    scopus 로고
    • A method for on-line speaker indexing using generic reference models
    • S. Kwon and S. Narayanan, "A method for on-line speaker indexing using generic reference models," in Proc. Eurospeech 2003, 2003, pp. 2653-2656.
    • (2003) Proc. Eurospeech 2003 , pp. 2653-2656
    • Kwon, S.1    Narayanan, S.2
  • 8
    • 0036816475 scopus 로고    scopus 로고
    • Content analysis for audio classification and segmemtation
    • L. Lu, H.-J. Zhang, and H. Jiang, "Content analysis for audio classification and segmemtation," IEEE Trans. Speech Audio Process., vol. 10, no. 7, pp. 504-516, 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.7 , pp. 504-516
    • Lu, L.1    Zhang, H.-J.2    Jiang, H.3
  • 9
    • 0032659936 scopus 로고    scopus 로고
    • Speaker indexing for news articles, debates, and drama in broadcasted TV programs
    • M. Nishida and Y. Ariki, "Speaker indexing for news articles, debates, and drama in broadcasted TV programs," in Proc. IEEE Int. Conf. Multimedia Computing and Systems, vol. 2, 1999, pp. 466-471.
    • (1999) Proc. IEEE Int. Conf. Multimedia Computing and Systems , vol.2 , pp. 466-471
    • Nishida, M.1    Ariki, Y.2
  • 13
    • 0001341735 scopus 로고    scopus 로고
    • Introduction to Monte Carlo methods
    • M. I. Jordan, Ed. Cambridge, MA: MIT Press
    • D. J. C. MacKay, "Introduction to Monte Carlo methods," in Learning in Graphical Models, M. I. Jordan, Ed. Cambridge, MA: MIT Press, 1999, pp. 175-204.
    • (1999) Learning in Graphical Models , pp. 175-204
    • MacKay, D.J.C.1
  • 18
    • 0009577929 scopus 로고    scopus 로고
    • Cohorts based custom models for rapid speaker and dialect adaptation
    • J. Wu and E. Chang, "Cohorts based custom models for rapid speaker and dialect adaptation," in Proc. Eurospeech, 2001, pp. 1261-1264.
    • (2001) Proc. Eurospeech , pp. 1261-1264
    • Wu, J.1    Chang, E.2
  • 20
    • 85009251523 scopus 로고    scopus 로고
    • Hierarchical Gaussian mixture model for speaker verification
    • M. Liu, E. Chang, and B.-Q. Dai, "Hierarchical Gaussian mixture model for speaker verification," in Proc. Int. Conf. Spoken Language Processing, vol. 2, 2002, pp. 1353-1356.
    • (2002) Proc. Int. Conf. Spoken Language Processing , vol.2 , pp. 1353-1356
    • Liu, M.1    Chang, E.2    Dai, B.-Q.3
  • 21
    • 0034857759 scopus 로고    scopus 로고
    • Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
    • K. Mori and S. Nakagawa, "Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2001, pp. 413-416.
    • (2001) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 413-416
    • Mori, K.1    Nakagawa, S.2
  • 22
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment, and channel change detection and clustering via the Bayesian information criterion
    • S. Chen and P. Gopalakrishnan, "Speaker, environment, and channel change detection and clustering via the Bayesian information criterion," in Proc. DARPA Speech Recognition Workshop, 1998, pp. 127-132.
    • (1998) Proc. DARPA Speech Recognition Workshop , pp. 127-132
    • Chen, S.1    Gopalakrishnan, P.2
  • 23
    • 85009089453 scopus 로고    scopus 로고
    • Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
    • B. Zhou and J. H. L. Hansen, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 714-717.
    • (2000) Proc. Int. Conf. Spoken Language Processing , pp. 714-717
    • Zhou, B.1    Hansen, J.H.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.