SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 131, Issue 5, 2012, Pages EL368-EL374

Spectro-temporal modulation energy based mask for robust speaker identification

(3) Chi, Tai Shih a Lin, Ting Han a Hsu, Chung Chien a

a NATIONAL CHIAO TUNG UNIVERSITY (Taiwan)

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO ACOUSTICS; LOUDSPEAKERS; MODULATION; SIGNAL TO NOISE RATIO; SPEECH; TENSORS;

CEPSTRAL COEFFICIENTS; MEL FREQUENCY CEPSTRAL CO-EFFICIENT; ROBUST SPEAKER IDENTIFICATION; ROBUST SPEAKER RECOGNITION; SPARSE REPRESENTATION; SPEAKER CHARACTERISTICS; SPEAKER IDENTIFICATION; SPECTRO-TEMPORAL MODULATIONS;

SPEECH RECOGNITION;

ALGORITHM; ARTICLE; BIOLOGICAL MODEL; COCHLEA; COMPUTER SIMULATION; FEMALE; HEARING; HUMAN; IMMUNOLOGY; MALE; NERVE CELL; NOISE; PERCEPTION; PHYSIOLOGY; SOUND DETECTION; SPEECH INTELLIGIBILITY; SPEECH PERCEPTION;

ALGORITHMS; COCHLEA; COMPUTER SIMULATION; FEMALE; HEARING; HUMANS; MALE; MODELS, BIOLOGICAL; NEURONS; NOISE; PERCEPTUAL MASKING; SOUND SPECTROGRAPHY; SPEECH INTELLIGIBILITY; SPEECH PERCEPTION;

EID: 84863799485 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.3697534 Document Type: Article

Times cited : (15)

References (19)

1
- 0031233424
- Speaker recognition: A tutorial
- J. P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE 85, 1437-1462 (1997).
- (1997) Proc. IEEE , vol.85 , pp. 1437-1462
- Campbell, J.P.¹

2
- 0029355999
- Speaker identification and verification using Gaussian mixture speaker models
- D. A. Reynolds, "Speaker identification and verification using gaussian mixture speaker models," Speech Commun. 17, 91-108 (1995).
- (1995) Speech Commun. , vol.17 , pp. 91-108
- Reynolds, D.A.¹

3
- 63249107289
- Robust speaker recognition in noisy conditions
- M. Ji, T. J. Hazen, J. R. Glass, and D. A. Reynolds, "Robust speaker recognition in noisy conditions," IEEE Trans. Audio, Speech, Lang. Process. 15, 1711-1723 (2007).
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , pp. 1711-1723
- Ji, M.¹ Hazen, T.J.² Glass, J.R.³ Reynolds, D.A.⁴

4
- 51449101666
- Robust speaker identification using auditory features and computational auditory scene analysis
- Y. Shao and D. L. Wang, "Robust speaker identification using auditory features and computational auditory scene analysis," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2008), pp. 1589-1592.
- (2008) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 1589-1592
- Shao, Y.¹ Wang, D.L.²

5
- 57349117784
- Auditory sparse representation for robust speaker recognition based on tensor structure
- Q. Wu and L. Zhang, "Auditory sparse representation for robust speaker recognition based on tensor structure," EURASIP J. Audio, Speech, Music Process. 2008, 578612 (2008).
- (2008) EURASIP J. Audio, Speech, Music Process. , vol.2008
- Wu, Q.¹ Zhang, L.²

6
- 70449360175
- Modulation spectral features for robust far-field speaker identification
- T. H. Falk and W.-Y. Chan, "Modulation spectral features for robust far-field speaker identification," IEEE Trans. Audio, Speech, Lang. Process. 18, 90-100 (2010).
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , pp. 90-100
- Falk, T.H.¹ Chan, W.-Y.²

7
- 0031187171
- Speech recognition by machines and humans
- R. P. Lippmann, "Speech recognition by machines and humans," Speech Commun. 22, 1-15 (1997).
- (1997) Speech Commun. , vol.22 , pp. 1-15
- Lippmann, R.P.¹

8
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- edited by P. Divenyi (Kluwer, Norwell, MA)
- D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, edited by P. Divenyi (Kluwer, Norwell, MA, 2005), pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

9
- 33845354768
- Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
- D. S. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Am. 120 (6), 4007-4018 (2006).
- (2006) J. Acoust. Soc. Am. , vol.120 , Issue.6 , pp. 4007-4018
- Brungart, D.S.¹ Chang, P.S.² Simpson, B.D.³ Wang, D.L.⁴

10
- 64649103540
- Speech intelligibility in background noise with ideal binary time-frequency masking
- D. L. Wang, U. Kjems, M. S. Pedersen, J. B. Boldt, and T. Lunner, "Speech intelligibility in background noise with ideal binary time-frequency masking," J. Acoust. Soc. Am. 125 (4), 2336-2347 (2009).
- (2009) J. Acoust. Soc. Am. , vol.125 , Issue.4 , pp. 2336-2347
- Wang, D.L.¹ Kjems, U.² Pedersen, M.S.³ Boldt, J.B.⁴ Lunner, T.⁵

11
- 70349093614
- An algorithm that improves speech intelligibility in noise for normal-hearing listeners
- G. Kim, Y. Lu, Y. Hu, and P. C. Loizou, "An algorithm that improves speech intelligibility in noise for normal-hearing listeners," J. Acoust. Soc. Am. 126 (3), 1486-1494 (2009).
- (2009) J. Acoust. Soc. Am. , vol.126 , Issue.3 , pp. 1486-1494
- Kim, G.¹ Lu, Y.² Hu, Y.³ Loizou, P.C.⁴

12
- 23744508888
- Multi-resolution spectro-temporal analysis of complex sounds
- T. Chi, P. Ru, and S. A. Shamma, "Multi-resolution spectro-temporal analysis of complex sounds," J. Acoust. Soc. Am. 118 (2), 887-906 (2005).
- (2005) J. Acoust. Soc. Am. , vol.118 , Issue.2 , pp. 887-906
- Chi, T.¹ Ru, P.² Shamma, S.A.³

13
- 67651044226
- Spectro-temporal analysis of speech using 2-D Gabor filters
- T. Ezzat, J. Bouvrie, and T. Poggio, "Spectro-temporal analysis of speech using 2-D Gabor filters," in Proceedings of the International Conference on Spoken Language Processing (2007), pp. 506-509.
- (2007) Proceedings of the International Conference on Spoken Language Processing , pp. 506-509
- Ezzat, T.¹ Bouvrie, J.² Poggio, T.³

14
- 0038711696
- A spectro-temporal modulation index (STMI) for assessment of speech intelligibility
- M. Elhilali, T. Chi, and S. A. Shamma, "A spectro-temporal modulation index (STMI) for assessment of speech intelligibility," Speech Commun. 41, 331-348 (2003).
- (2003) Speech Commun. , vol.41 , pp. 331-348
- Elhilali, M.¹ Chi, T.² Shamma, S.A.³

15
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- M. Cooke, J. Barker, S. Cunningham, and X. Shao, "An audio-visual corpus for speech perception and automatic speech recognition," J. Acoust. Soc. Am. 120 (5), 2421-2424 (2006).
- (2006) J. Acoust. Soc. Am. , vol.120 , Issue.5 , pp. 2421-2424
- Cooke, M.¹ Barker, J.² Cunningham, S.³ Shao, X.⁴

16
- 84890477287
- Robust emotion recognition by spectro-temporal modulation statistic features
- T.-S. Chi, L.-Y. Yeh, and C.-C. Hsu, "Robust emotion recognition by spectro-temporal modulation statistic features," J. Ambient Intell. Human. Comput. 3 (2), 47-60 (2012).
- (2012) J. Ambient Intell. Human. Comput. , vol.3 , Issue.2 , pp. 47-60
- Chi, T.-S.¹ Yeh, L.-Y.² Hsu, C.-C.³

17
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process. 2 (4), 578-589 (1994).
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

18
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reyolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process. 10, 19-41 (2000).
- (2000) Digital Signal Process. , vol.10 , pp. 19-41
- Reyolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

19
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans. Audio, Speech Lang. Process. 15 (7), 2130-2140 (2007).
- (2007) IEEE Trans. Audio, Speech Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.