메뉴 건너뛰기




Volumn 15, Issue 7, 2007, Pages 2033-2043

Efficient speaker recognition using approximated cross entropy (ACE)

Author keywords

Speaker identification; Speaker indexing; Speaker recognition; Speaker retrieval; Speaker verification

Indexed keywords

SPEAKER IDENTIFICATION; SPEAKER INDEXING; SPEAKER RECOGNITION; SPEAKER RETRIEVAL; SPEAKER VERIFICATION;

EID: 64249095146     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.902059     Document Type: Article
Times cited : (31)

References (43)
  • 1
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, no. 1-3, pp. 19-41, 2000.
    • (2000) Digital Signal Process , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 2
    • 0025682333 scopus 로고
    • Text-independent speaker identification using automatic acoustic segmentation
    • R. C. Rose and D. A. Reynolds, "Text-independent speaker identification using automatic acoustic segmentation," in Proc. ICASSP, 1990, pp. 293-296.
    • (1990) Proc. ICASSP , pp. 293-296
    • Rose, R.C.1    Reynolds, D.A.2
  • 3
    • 0003988385 scopus 로고
    • A Gaussian mixture modeling approach to text-independent speaker identification,
    • Ph.D. dissertation, Georgia Inst. Technol, Atlanta
    • D. A. Reynolds, "A Gaussian mixture modeling approach to text-independent speaker identification," Ph.D. dissertation, Georgia Inst. Technol., Atlanta, 1992.
    • (1992)
    • Reynolds, D.A.1
  • 4
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 6
    • 17344377138 scopus 로고    scopus 로고
    • Speaker verification using text-constrained Gaussian mixture models
    • D. E. Sturim, D. A. Reynolds, R. B. Dunn, and T. F. Quatieri, "Speaker verification using text-constrained Gaussian mixture models," in Proc. ICASSP, 2002, pp. 677-680.
    • (2002) Proc. ICASSP , pp. 677-680
    • Sturim, D.E.1    Reynolds, D.A.2    Dunn, R.B.3    Quatieri, T.F.4
  • 8
    • 85009152786 scopus 로고    scopus 로고
    • Text independent speaker recognition using speaker dependent word spotting
    • Aronowitz, D. Burshtein, and A. Amir, "Text independent speaker recognition using speaker dependent word spotting," in Proc. Interspeech, 2004, pp. 1789-1792.
    • (2004) Proc. Interspeech , pp. 1789-1792
    • Aronowitz, D.B.1    Amir, A.2
  • 9
    • 33646348224 scopus 로고    scopus 로고
    • Improved phonetic speaker recognition using lattice decoding
    • A. Hatch, B. Peskin, and A. Stolcke, "Improved phonetic speaker recognition using lattice decoding," in Proc. ICASSP, 2005, pp. 169-172.
    • (2005) Proc. ICASSP , pp. 169-172
    • Hatch, A.1    Peskin, B.2    Stolcke, A.3
  • 11
    • 33646785082 scopus 로고    scopus 로고
    • A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification
    • H. Aronowitz, D. Burshtein, and A. Amir, "A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification," in Proc. ICASSP, 2005, pp. 729-732.
    • (2005) Proc. ICASSP , pp. 729-732
    • Aronowitz, H.1    Burshtein, D.2    Amir, A.3
  • 12
    • 33745188228 scopus 로고    scopus 로고
    • Modeling intra-speaker variability for speaker recognition
    • H. Aronowitz, D. Irony, and D. Burshtein, "Modeling intra-speaker variability for speaker recognition," in Proc. Interspeech, 2005, pp. 2177-2180.
    • (2005) Proc. Interspeech , pp. 2177-2180
    • Aronowitz, H.1    Irony, D.2    Burshtein, D.3
  • 13
    • 33947670889 scopus 로고    scopus 로고
    • Experiments in session variability modeling for speaker verification
    • R. Vogt and S. Sridharan, "Experiments in session variability modeling for speaker verification," in Proc. ICASSP, 2006, pp. 897-900.
    • (2006) Proc. ICASSP , pp. 897-900
    • Vogt, R.1    Sridharan, S.2
  • 14
    • 33947696754 scopus 로고    scopus 로고
    • SVM based speaker verification using a GMMsupervector kernel and NAP variability compensation
    • W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMMsupervector kernel and NAP variability compensation," in Proc. ICASSP, 2006, pp. 97-100.
    • (2006) Proc. ICASSP , pp. 97-100
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3    Solomonoff, A.4
  • 15
    • 37649026844 scopus 로고    scopus 로고
    • Efficient language identification using Anchor models and support vector machines
    • E. Noor and H. Aronowitz, "Efficient language identification using Anchor models and support vector machines," in Proc. ISCA Odyssey Workshop, 2006, pp. 1-6.
    • (2006) Proc. ISCA Odyssey Workshop , pp. 1-6
    • Noor, E.1    Aronowitz, H.2
  • 16
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for text-independent speaker verification systems
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
    • (2000) Digital Signal Process , vol.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 17
    • 1642346107 scopus 로고    scopus 로고
    • Audio indexing: What has been accomplished and the road ahead
    • I. M. Chagolleau and N. P. Vallès, "Audio indexing: What has been accomplished and the road ahead," in Proc. 6th Joint Conf. Inf. Sci., 2002, pp. 911-914.
    • (2002) Proc. 6th Joint Conf. Inf. Sci , pp. 911-914
    • Chagolleau, I.M.1    Vallès, N.P.2
  • 18
    • 33646914432 scopus 로고    scopus 로고
    • Speech and language technologies for audio indexing and retrieval
    • Aug
    • J. Makhoul, F. Kubala, T. Leek, L. Daben, N. Long, R. Schwartz, and A. Srivastava, "Speech and language technologies for audio indexing and retrieval," Proc. IEEE, vol. 88, no. 8, pp. 1338-1353, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1338-1353
    • Makhoul, J.1    Kubala, F.2    Leek, T.3    Daben, L.4    Long, N.5    Schwartz, R.6    Srivastava, A.7
  • 19
    • 0001355166 scopus 로고    scopus 로고
    • A study of computation speed-ups of the GMM-UBM speaker recognition system
    • J. McLaughlin, D. A. Reynolds, and T. Gleason, "A study of computation speed-ups of the GMM-UBM speaker recognition system," in Proc. Eurospeech, 1999, pp. 1215-1218.
    • (1999) Proc. Eurospeech , pp. 1215-1218
    • McLaughlin, J.1    Reynolds, D.A.2    Gleason, T.3
  • 21
    • 0141591546 scopus 로고    scopus 로고
    • Speaker identification by anchor models with PCA/LDA post-processing
    • Y. Mami, D. Charlet, and F. Lannion, "Speaker identification by anchor models with PCA/LDA post-processing," in Proc. ICASSP, 2004, pp. 180-183.
    • (2004) Proc. ICASSP , pp. 180-183
    • Mami, Y.1    Charlet, D.2    Lannion, F.3
  • 22
    • 33646769986 scopus 로고    scopus 로고
    • A correlation metric for speaker tracking using Anchor models
    • M. Collet, D. Charlet, and F. Bimbot, "A correlation metric for speaker tracking using Anchor models," in Proc. ICASSP, 2005, pp. 713-716.
    • (2005) Proc. ICASSP , pp. 713-716
    • Collet, M.1    Charlet, D.2    Bimbot, F.3
  • 23
    • 33745191867 scopus 로고    scopus 로고
    • Probabilistic Anchor models approach for speaker verification
    • M. Collet, Y. Mami, D. Charlet, and F. Bimbot, "Probabilistic Anchor models approach for speaker verification," in Proc. Interspeech, 2005, pp. 2005-2008.
    • (2005) Proc. Interspeech , pp. 2005-2008
    • Collet, M.1    Mami, Y.2    Charlet, D.3    Bimbot, F.4
  • 24
    • 85009103726 scopus 로고    scopus 로고
    • Speaker indexing in audio archives using test utterance Gaussian mixture modeling
    • H. Aronowitz, D. Burshtein, and A. Amir, "Speaker indexing in audio archives using test utterance Gaussian mixture modeling," in Proc. ICSLP, 2004, pp. 609-612.
    • (2004) Proc. ICSLP , pp. 609-612
    • Aronowitz, H.1    Burshtein, D.2    Amir, A.3
  • 25
    • 24144462539 scopus 로고    scopus 로고
    • Speaker indexing in audio archives using Gaussian mixture scoring simulation
    • MLMI: Proceedings of the Workshop on Machine Learning for Multimodal Interaction. New York: Springer-Verlag
    • H. Aronowitz, D. Burshtein, and A. Amir, "Speaker indexing in audio archives using Gaussian mixture scoring simulation," in MLMI: Proceedings of the Workshop on Machine Learning for Multimodal Interaction. New York: Springer-Verlag LNCS, 2004, pp. 243-252.
    • (2004) LNCS , pp. 243-252
    • Aronowitz, H.1    Burshtein, D.2    Amir, A.3
  • 26
    • 33745186891 scopus 로고    scopus 로고
    • Efficient speaker identification and retrieval
    • H. Aronowitz and D. Burshtein, "Efficient speaker identification and retrieval," in Proc. Interspeech, 2005, pp. 2433-2436.
    • (2005) Proc. Interspeech , pp. 2433-2436
    • Aronowitz, H.1    Burshtein, D.2
  • 27
    • 0028996950 scopus 로고
    • Covariance estimation methods for channel robust text-independent speaker identification
    • M. Schmidt, H. Gish, and A. Mielke, "Covariance estimation methods for channel robust text-independent speaker identification," in Proc. ICASSP, 1995, pp. 333-336.
    • (1995) Proc. ICASSP , pp. 333-336
    • Schmidt, M.1    Gish, H.2    Mielke, A.3
  • 28
    • 85009090964 scopus 로고    scopus 로고
    • Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification
    • W. H. Tsai, W. W. Chang, Y. C. Chu, and C. S. Huang, "Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification," in Proc. Eurospeech, 2001, pp. 771-774.
    • (2001) Proc. Eurospeech , pp. 771-774
    • Tsai, W.H.1    Chang, W.W.2    Chu, Y.C.3    Huang, C.S.4
  • 29
    • 34547516258 scopus 로고    scopus 로고
    • Approximating the Kullback Leibler divergence between Gaussian mixture models
    • J. Hershey and P. Olsen, "Approximating the Kullback Leibler divergence between Gaussian mixture models," in Proc. ICASSP, 2007, pp. 317-320.
    • (2007) Proc. ICASSP , pp. 317-320
    • Hershey, J.1    Olsen, P.2
  • 31
    • 85009112348 scopus 로고    scopus 로고
    • Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems
    • A. Chan, J. Sherwani, R. Mosur, and A. Rudnicky, "Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems," in Proc. ICSLP, 2004, pp. 289-292.
    • (2004) Proc. ICSLP , pp. 289-292
    • Chan, A.1    Sherwani, J.2    Mosur, R.3    Rudnicky, A.4
  • 32
    • 0041360472 scopus 로고    scopus 로고
    • Efficient text-independent speaker verification with structural Gaussian mixture models and neural network
    • B. Xiang and T. Berger, "Efficient text-independent speaker verification with structural Gaussian mixture models and neural network," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 447-456, 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.5 , pp. 447-456
    • Xiang, B.1    Berger, T.2
  • 33
    • 64249128435 scopus 로고    scopus 로고
    • quot;Switchboard 2 Phase II, Univ. Pennsylvania, Philadelphia, PA. [Online]. Available: http://www.ldc.upenn.edu/Catalog/docs/Switchboard2-Phase2
    • quot;Switchboard 2 Phase II," Univ. Pennsylvania, Philadelphia, PA. [Online]. Available: http://www.ldc.upenn.edu/Catalog/docs/Switchboard2-Phase2
  • 34
    • 64249153927 scopus 로고    scopus 로고
    • quot;The NIST Year 2004 Speaker Recognition Evaluation Plan, NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2003
    • quot;The NIST Year 2004 Speaker Recognition Evaluation Plan," NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2003
  • 35
    • 64249162255 scopus 로고    scopus 로고
    • quot;The NIST Year 2004 Speaker Recognition Evaluation Plan, NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2004
    • quot;The NIST Year 2004 Speaker Recognition Evaluation Plan," NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2004
  • 37
    • 85075924869 scopus 로고    scopus 로고
    • Comparison of background normalization methods for text-independent speaker verification
    • D. A. Reynolds, "Comparison of background normalization methods for text-independent speaker verification," in Proc. Eurospeech, 1997, pp. 963-966.
    • (1997) Proc. Eurospeech , pp. 963-966
    • Reynolds, D.A.1
  • 39
  • 41
    • 85009165992 scopus 로고    scopus 로고
    • Model compression for GMM based speaker recognition systems
    • D. A. Reynolds, "Model compression for GMM based speaker recognition systems," in Proc. Eurospeech, 2003, pp. 2005-2008.
    • (2003) Proc. Eurospeech , pp. 2005-2008
    • Reynolds, D.A.1
  • 42
    • 64249126167 scopus 로고    scopus 로고
    • Trainable speaker diarization
    • to be published
    • H. Aronowitz, "Trainable speaker diarization," in Proc. Interspeech, 2007, to be published.
    • (2007) Proc. Interspeech
    • Aronowitz, H.1
  • 43
    • 64249157596 scopus 로고    scopus 로고
    • Speaker recognition using kernel-PCA and intersession variability modeling
    • to be published
    • H. Aronowitz, "Speaker recognition using kernel-PCA and intersession variability modeling," in Proc. Interspeech, 2007, to be published.
    • (2007) Proc. Interspeech
    • Aronowitz, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.