SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 7, 2007, Pages 2033-2043

Efficient speaker recognition using approximated cross entropy (ACE)

(2) Aronowitz, Hagai a,b Burshtein, David c

a BAR ILAN UNIVERSITY (Israel)

b IBM T J WATSON RESEARCH CENTER (United States)

c TEL AVIV UNIVERSITY (Israel)

Author keywords

Speaker identification; Speaker indexing; Speaker recognition; Speaker retrieval; Speaker verification

Indexed keywords

SPEAKER IDENTIFICATION; SPEAKER INDEXING; SPEAKER RECOGNITION; SPEAKER RETRIEVAL; SPEAKER VERIFICATION;

INDEXING (OF INFORMATION); LOUDSPEAKERS; MAGNETOSTRICTIVE DEVICES; TRELLIS CODES;

SPEECH RECOGNITION;

EID: 64249095146 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.902059 Document Type: Article

Times cited : (31)

References (43)

1
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, no. 1-3, pp. 19-41, 2000.
- (2000) Digital Signal Process , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

2
- 0025682333
- Text-independent speaker identification using automatic acoustic segmentation
- R. C. Rose and D. A. Reynolds, "Text-independent speaker identification using automatic acoustic segmentation," in Proc. ICASSP, 1990, pp. 293-296.
- (1990) Proc. ICASSP , pp. 293-296
- Rose, R.C.¹ Reynolds, D.A.²

3
- 0003988385
- A Gaussian mixture modeling approach to text-independent speaker identification,
- Ph.D. dissertation, Georgia Inst. Technol, Atlanta
- D. A. Reynolds, "A Gaussian mixture modeling approach to text-independent speaker identification," Ph.D. dissertation, Georgia Inst. Technol., Atlanta, 1992.
- (1992)
- Reynolds, D.A.¹

4
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

5
- 29044444825
- Support vector machines for speaker and language recognition
- W. M. Campbell, J. P. Campbell, D. A. Reynolds, E. Singer, and P. A. Torres-Carrasquillo, "Support vector machines for speaker and language recognition," Comput. Speech Lang., vol. 20, no. 2-3, pp. 210-229, 2006.
- (2006) Comput. Speech Lang , vol.20 , Issue.2-3 , pp. 210-229
- Campbell, W.M.¹ Campbell, J.P.² Reynolds, D.A.³ Singer, E.⁴ Torres-Carrasquillo, P.A.⁵

6
- 17344377138
- Speaker verification using text-constrained Gaussian mixture models
- D. E. Sturim, D. A. Reynolds, R. B. Dunn, and T. F. Quatieri, "Speaker verification using text-constrained Gaussian mixture models," in Proc. ICASSP, 2002, pp. 677-680.
- (2002) Proc. ICASSP , pp. 677-680
- Sturim, D.E.¹ Reynolds, D.A.² Dunn, R.B.³ Quatieri, T.F.⁴

7
- 37649032100
- Improvements in MLLRtransform- based speaker recognition
- A. Stolcke, L. Ferrer, and S. Kajarekar, "Improvements in MLLRtransform- based speaker recognition," in Proc. ISCA Odyssey Workshop, 2006.
- (2006) Proc. ISCA Odyssey Workshop
- Stolcke, A.¹ Ferrer, L.² Kajarekar, S.³

8
- 85009152786
- Text independent speaker recognition using speaker dependent word spotting
- Aronowitz, D. Burshtein, and A. Amir, "Text independent speaker recognition using speaker dependent word spotting," in Proc. Interspeech, 2004, pp. 1789-1792.
- (2004) Proc. Interspeech , pp. 1789-1792
- Aronowitz, D.B.¹ Amir, A.²

9
- 33646348224
- Improved phonetic speaker recognition using lattice decoding
- A. Hatch, B. Peskin, and A. Stolcke, "Improved phonetic speaker recognition using lattice decoding," in Proc. ICASSP, 2005, pp. 169-172.
- (2005) Proc. ICASSP , pp. 169-172
- Hatch, A.¹ Peskin, B.² Stolcke, A.³

10
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 4, pp. 1435-1447, May 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process , vol.15 , Issue.4 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

11
- 33646785082
- A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification
- H. Aronowitz, D. Burshtein, and A. Amir, "A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification," in Proc. ICASSP, 2005, pp. 729-732.
- (2005) Proc. ICASSP , pp. 729-732
- Aronowitz, H.¹ Burshtein, D.² Amir, A.³

12
- 33745188228
- Modeling intra-speaker variability for speaker recognition
- H. Aronowitz, D. Irony, and D. Burshtein, "Modeling intra-speaker variability for speaker recognition," in Proc. Interspeech, 2005, pp. 2177-2180.
- (2005) Proc. Interspeech , pp. 2177-2180
- Aronowitz, H.¹ Irony, D.² Burshtein, D.³

13
- 33947670889
- Experiments in session variability modeling for speaker verification
- R. Vogt and S. Sridharan, "Experiments in session variability modeling for speaker verification," in Proc. ICASSP, 2006, pp. 897-900.
- (2006) Proc. ICASSP , pp. 897-900
- Vogt, R.¹ Sridharan, S.²

14
- 33947696754
- SVM based speaker verification using a GMMsupervector kernel and NAP variability compensation
- W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMMsupervector kernel and NAP variability compensation," in Proc. ICASSP, 2006, pp. 97-100.
- (2006) Proc. ICASSP , pp. 97-100
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³ Solomonoff, A.⁴

15
- 37649026844
- Efficient language identification using Anchor models and support vector machines
- E. Noor and H. Aronowitz, "Efficient language identification using Anchor models and support vector machines," in Proc. ISCA Odyssey Workshop, 2006, pp. 1-6.
- (2006) Proc. ISCA Odyssey Workshop , pp. 1-6
- Noor, E.¹ Aronowitz, H.²

16
- 0033884857
- Score normalization for text-independent speaker verification systems
- R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
- (2000) Digital Signal Process , vol.10 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Thomas, H.³

17
- 1642346107
- Audio indexing: What has been accomplished and the road ahead
- I. M. Chagolleau and N. P. Vallès, "Audio indexing: What has been accomplished and the road ahead," in Proc. 6th Joint Conf. Inf. Sci., 2002, pp. 911-914.
- (2002) Proc. 6th Joint Conf. Inf. Sci , pp. 911-914
- Chagolleau, I.M.¹ Vallès, N.P.²

18
- 33646914432
- Speech and language technologies for audio indexing and retrieval
- Aug
- J. Makhoul, F. Kubala, T. Leek, L. Daben, N. Long, R. Schwartz, and A. Srivastava, "Speech and language technologies for audio indexing and retrieval," Proc. IEEE, vol. 88, no. 8, pp. 1338-1353, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1338-1353
- Makhoul, J.¹ Kubala, F.² Leek, T.³ Daben, L.⁴ Long, N.⁵ Schwartz, R.⁶ Srivastava, A.⁷

19
- 0001355166
- A study of computation speed-ups of the GMM-UBM speaker recognition system
- J. McLaughlin, D. A. Reynolds, and T. Gleason, "A study of computation speed-ups of the GMM-UBM speaker recognition system," in Proc. Eurospeech, 1999, pp. 1215-1218.
- (1999) Proc. Eurospeech , pp. 1215-1218
- McLaughlin, J.¹ Reynolds, D.A.² Gleason, T.³

20
- 0034855361
- Speaker indexing in large audio databases using anchor models
- D. E. Sturim, D. A. Reynolds, E. Singer, and J. P. Campbell, "Speaker indexing in large audio databases using anchor models," in Proc. IEEE ICASSP, 2001, pp. 429-432.
- (2001) Proc. IEEE ICASSP , pp. 429-432
- Sturim, D.E.¹ Reynolds, D.A.² Singer, E.³ Campbell, J.P.⁴

21
- 0141591546
- Speaker identification by anchor models with PCA/LDA post-processing
- Y. Mami, D. Charlet, and F. Lannion, "Speaker identification by anchor models with PCA/LDA post-processing," in Proc. ICASSP, 2004, pp. 180-183.
- (2004) Proc. ICASSP , pp. 180-183
- Mami, Y.¹ Charlet, D.² Lannion, F.³

22
- 33646769986
- A correlation metric for speaker tracking using Anchor models
- M. Collet, D. Charlet, and F. Bimbot, "A correlation metric for speaker tracking using Anchor models," in Proc. ICASSP, 2005, pp. 713-716.
- (2005) Proc. ICASSP , pp. 713-716
- Collet, M.¹ Charlet, D.² Bimbot, F.³

23
- 33745191867
- Probabilistic Anchor models approach for speaker verification
- M. Collet, Y. Mami, D. Charlet, and F. Bimbot, "Probabilistic Anchor models approach for speaker verification," in Proc. Interspeech, 2005, pp. 2005-2008.
- (2005) Proc. Interspeech , pp. 2005-2008
- Collet, M.¹ Mami, Y.² Charlet, D.³ Bimbot, F.⁴

24
- 85009103726
- Speaker indexing in audio archives using test utterance Gaussian mixture modeling
- H. Aronowitz, D. Burshtein, and A. Amir, "Speaker indexing in audio archives using test utterance Gaussian mixture modeling," in Proc. ICSLP, 2004, pp. 609-612.
- (2004) Proc. ICSLP , pp. 609-612
- Aronowitz, H.¹ Burshtein, D.² Amir, A.³

25
- 24144462539
- Speaker indexing in audio archives using Gaussian mixture scoring simulation
- MLMI: Proceedings of the Workshop on Machine Learning for Multimodal Interaction. New York: Springer-Verlag
- H. Aronowitz, D. Burshtein, and A. Amir, "Speaker indexing in audio archives using Gaussian mixture scoring simulation," in MLMI: Proceedings of the Workshop on Machine Learning for Multimodal Interaction. New York: Springer-Verlag LNCS, 2004, pp. 243-252.
- (2004) LNCS , pp. 243-252
- Aronowitz, H.¹ Burshtein, D.² Amir, A.³

26
- 33745186891
- Efficient speaker identification and retrieval
- H. Aronowitz and D. Burshtein, "Efficient speaker identification and retrieval," in Proc. Interspeech, 2005, pp. 2433-2436.
- (2005) Proc. Interspeech , pp. 2433-2436
- Aronowitz, H.¹ Burshtein, D.²

27
- 0028996950
- Covariance estimation methods for channel robust text-independent speaker identification
- M. Schmidt, H. Gish, and A. Mielke, "Covariance estimation methods for channel robust text-independent speaker identification," in Proc. ICASSP, 1995, pp. 333-336.
- (1995) Proc. ICASSP , pp. 333-336
- Schmidt, M.¹ Gish, H.² Mielke, A.³

28
- 85009090964
- Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification
- W. H. Tsai, W. W. Chang, Y. C. Chu, and C. S. Huang, "Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identification," in Proc. Eurospeech, 2001, pp. 771-774.
- (2001) Proc. Eurospeech , pp. 771-774
- Tsai, W.H.¹ Chang, W.W.² Chu, Y.C.³ Huang, C.S.⁴

29
- 34547516258
- Approximating the Kullback Leibler divergence between Gaussian mixture models
- J. Hershey and P. Olsen, "Approximating the Kullback Leibler divergence between Gaussian mixture models," in Proc. ICASSP, 2007, pp. 317-320.
- (2007) Proc. ICASSP , pp. 317-320
- Hershey, J.¹ Olsen, P.²

30
- 34147133261
- An investigation of Gaussian shortlists
- D. B. Paul, "An investigation of Gaussian shortlists," in Proc. IEEE Workshop Automatic Speech Recognition and Understanding, 1999, pp. 209-212.
- (1999) Proc. IEEE Workshop Automatic Speech Recognition and Understanding , pp. 209-212
- Paul, D.B.¹

31
- 85009112348
- Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems
- A. Chan, J. Sherwani, R. Mosur, and A. Rudnicky, "Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems," in Proc. ICSLP, 2004, pp. 289-292.
- (2004) Proc. ICSLP , pp. 289-292
- Chan, A.¹ Sherwani, J.² Mosur, R.³ Rudnicky, A.⁴

32
- 0041360472
- Efficient text-independent speaker verification with structural Gaussian mixture models and neural network
- B. Xiang and T. Berger, "Efficient text-independent speaker verification with structural Gaussian mixture models and neural network," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 447-456, 2003.
- (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.5 , pp. 447-456
- Xiang, B.¹ Berger, T.²

33
- 64249128435
- quot;Switchboard 2 Phase II, Univ. Pennsylvania, Philadelphia, PA. [Online]. Available: http://www.ldc.upenn.edu/Catalog/docs/Switchboard2-Phase2
- quot;Switchboard 2 Phase II," Univ. Pennsylvania, Philadelphia, PA. [Online]. Available: http://www.ldc.upenn.edu/Catalog/docs/Switchboard2-Phase2

34
- 64249153927
- quot;The NIST Year 2004 Speaker Recognition Evaluation Plan, NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2003
- quot;The NIST Year 2004 Speaker Recognition Evaluation Plan," NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2003

35
- 64249162255
- quot;The NIST Year 2004 Speaker Recognition Evaluation Plan, NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2004
- quot;The NIST Year 2004 Speaker Recognition Evaluation Plan," NIST, Gaithersburg, MD. [Online]. Available: http://www.nist.gov/speech/tests/spk/2004

36
- 85046873967
- The DET curve in assessment of detection task performance
- A. Martin, D. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eurospeech, 1997, pp. 1895-1898.
- (1997) Proc. Eurospeech , pp. 1895-1898
- Martin, A.¹ Doddington, D.² Kamm, T.³ Ordowski, M.⁴ Przybocki, M.⁵

37
- 85075924869
- Comparison of background normalization methods for text-independent speaker verification
- D. A. Reynolds, "Comparison of background normalization methods for text-independent speaker verification," in Proc. Eurospeech, 1997, pp. 963-966.
- (1997) Proc. Eurospeech , pp. 963-966
- Reynolds, D.A.¹

38
- 0009589650
- Online, Available
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, [Online]. Available: http://www.etsi.org/stq
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms

39
- 85073258179
- Feature warping for robust speaker verification
- J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in Proc. ISCA Odyssey Workshop, 2001, pp. 213-218.
- (2001) Proc. ISCA Odyssey Workshop , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

40
- 21844434603
- SRI's 2004 NIST speaker recognition evaluation system
- S. S. Kajarekar, L. Ferrer, E. Shriberg, K. Sonmez, A. Stolcke, A. Venkataraman, and J. Zheng, "SRI's 2004 NIST speaker recognition evaluation system," in Proc. ICASSP, 2005, pp. 173-176.
- (2005) Proc. ICASSP , pp. 173-176
- Kajarekar, S.S.¹ Ferrer, L.² Shriberg, E.³ Sonmez, K.⁴ Stolcke, A.⁵ Venkataraman, A.⁶ Zheng, J.⁷

41
- 85009165992
- Model compression for GMM based speaker recognition systems
- D. A. Reynolds, "Model compression for GMM based speaker recognition systems," in Proc. Eurospeech, 2003, pp. 2005-2008.
- (2003) Proc. Eurospeech , pp. 2005-2008
- Reynolds, D.A.¹

42
- 64249126167
- Trainable speaker diarization
- to be published
- H. Aronowitz, "Trainable speaker diarization," in Proc. Interspeech, 2007, to be published.
- (2007) Proc. Interspeech
- Aronowitz, H.¹

43
- 64249157596
- Speaker recognition using kernel-PCA and intersession variability modeling
- to be published
- H. Aronowitz, "Speaker recognition using kernel-PCA and intersession variability modeling," in Proc. Interspeech, 2007, to be published.
- (2007) Proc. Interspeech
- Aronowitz, H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.