SCOPUS 정보 검색 플랫폼

Eurasip Journal on Audio, Speech, and Music Processing

Volumn 2012, Issue 1, 2012, Pages

Biomimetic multi-resolution analysis for robust speaker recognition

(4) Nemala, Sridhar Krishna a Zotkin, Dmitry N b Duraiswami, Ramani b Elhilali, Mounya a

a Johns Hopkins University (United States)

b UNIVERSITY OF MARYLAND (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPEECH PROCESSING SYSTEMS; BACKGROUND DISTORTIONS; BIOMIMETIC APPROACHES; INFORMATION REPRESENTATION; NONSTATIONARY NOISE; ROBUST SPEAKER RECOGNITION; SPEAKER RECOGNITION; SPEAKER VERIFICATION;

BIOMIMETICS; FEATURE EXTRACTION; MACHINERY; SPEECH PROCESSING;

SPEECH RECOGNITION;

EID: 84881313414 PISSN: 16874714 EISSN: 16874722 Source Type: Journal
DOI: 10.1186/1687-4722-2012-22 Document Type: Article

Times cited : (5)

References (29)

1
- 70449615870
- Springer Berlin 1250.68001 10.1007/978-0-387-77592-0
- Beigi H: Fundamentals of Speaker Recognition. Springer, Berlin; 2011.
- (2011) Fundamentals of Speaker Recognition
- Beigi, H.¹

2
- 33947655538
- Springer Berlin
- Greenberg S, Popper A, Ainsworth W: Speech Processing in the Auditory System. Springer, Berlin; 2004.
- (2004) Speech Processing in the Auditory System
- Greenberg, S.¹ Popper, A.² Ainsworth, W.³

3
- 79957903950
- Complex spectral interactions encoded by auditory cortical neurons: Relationship between bandwidth and pattern
- O'Connor K, Yin P, Petkov C, Sutter M: Complex spectral interactions encoded by auditory cortical neurons: relationship between bandwidth and pattern. Front Syst. Neurosci 2010, 4:4-145.
- (2010) Front Syst. Neurosci , vol.4 , pp. 4-145
- O'Connor, K.¹ Yin, P.² Petkov, C.³ Sutter, M.⁴

4
- 45549100188
- Speech analysis in a model of the central auditory system
- 10.1109/TASL.2007.900102
- Woojay J, Juang B: Speech analysis in a model of the central auditory system. IEEE Trans. Speech Audio Process 2007, 15:1802-1817.
- (2007) IEEE Trans. Speech Audio Process , vol.15 , pp. 1802-1817
- Woojay, J.¹ Juang, B.²

5
- 70349206338
- Robust speech feature extraction based on Gabor filtering and tensor factorization
- Wu Q, Zhang L, Shi G: Robust speech feature extraction based on Gabor filtering and tensor factorization. Proc. IEEE Intl. Conf. Acoust. Speech Signal Proc., Taipei, Taiwan 2009, 4649-4652.
- (2009) Proc. IEEE Intl. Conf. Acoust. Speech Signal Proc., Taipei, Taiwan , pp. 4649-4652
- Wu, Q.¹ Zhang, L.² Shi, G.³

6
- 0038711696
- A spectro-temporal modulation index (STMI) for assessment of speech intelligibility
- 10.1016/S0167-6393(02)00134-6
- Elhilali M, Chi T, Shamma SA: A spectro-temporal modulation index (STMI) for assessment of speech intelligibility. Speech Commun 2003, 41:331-348.
- (2003) Speech Commun , vol.41 , pp. 331-348
- Elhilali, M.¹ Chi, T.² Shamma, S.A.³

7
- 63549114783
- The modulation transfer function for speech intelligibility
- 10.1371/journal.pcbi.1000302
- Elliott T, Theunissen F: The modulation transfer function for speech intelligibility. PLoS Comput. Biol 2009, 5:e1000302.
- (2009) PLoS Comput. Biol , vol.5 , pp. 1000302
- Elliott, T.¹ Theunissen, F.²

8
- 84887048542
- NIST 2010 speaker recognition evaluation
- NIST 2010 speaker recognition evaluation http://www.nist.gov/speech/ tests/sre/2010

9
- 0026626445
- Auditory representations of acoustic signals
- 10.1109/18.119739
- Yang X, Wang K, Shamma SA: Auditory representations of acoustic signals. IEEE Trans. Inf. Theory 1992, 38:824-839.
- (1992) IEEE Trans. Inf. Theory , vol.38 , pp. 824-839
- Yang, X.¹ Wang, K.² Shamma, S.A.³

10
- 0028462212
- Self-normalization noise-robustness in early auditory representations
- 10.1109/89.294356
- Wang K, Shamma SA: Self-normalization noise-robustness in early auditory representations. IEEE Trans. Speech Audio Process 1994, 2:421-435.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , pp. 421-435
- Wang, K.¹ Shamma, S.A.²

11
- 0003045511
- Spectral envelope coding in cat primary auditory cortex: Properties of ripple transfer functions
- Schreiner C, Calhoun B: Spectral envelope coding in cat primary auditory cortex: properties of ripple transfer functions. J. Aud. Neurosc 1995, 1:39-61.
- (1995) J. Aud. Neurosc , vol.1 , pp. 39-61
- Schreiner, C.¹ Calhoun, B.²

12
- 0028859810
- Ripple analysis in ferret primary auditory cortex. Iii. Topographic distribution of ripple response parameters
- Versnel H, Kowalski N, Shamma SA: Ripple analysis in ferret primary auditory cortex. iii. topographic distribution of ripple response parameters. J. Aud. Neurosc 1995, 1:271-286.
- (1995) J. Aud. Neurosc , vol.1 , pp. 271-286
- Versnel, H.¹ Kowalski, N.² Shamma, S.A.³

13
- 0003548585
- vol LDC93S1 Linguistic Data Consortium Philadelphia
- Garofolo JS, Lamel LF, Fisher WM, Fiscus JG, Pallett DS, Dahlgren NL: DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus. vol LDC93S1 Linguistic Data Consortium, Philadelphia; 1993.
- (1993) DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶

14
- 0036082510
- Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex
- Miller L, Escabi M, Read H, Schreiner C: Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex. J. Neurophysiol 2002,87(1):516-527.
- (2002) J. Neurophysiol , vol.87 , Issue.1 , pp. 516-527
- Miller, L.¹ Escabi, M.² Read, H.³ Schreiner, C.⁴

15
- 84889281816
- 2 Wiley-Interscience New York 1140.94001
- Cover T, Thomas J: Elements of Information Theory. 2nd edition. Wiley-Interscience, New York; 2006.
- (2006) Elements of Information Theory
- Cover, T.¹ Thomas, J.²

16
- 0028517164
- RASTA processing of speech
- 10.1109/89.326616
- Hermansky H, Morgan N: RASTA processing of speech. IEEE Trans. Speech Audio Process 1994,2(4):382-395.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 382-395
- Hermansky, H.¹ Morgan, N.²

17
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- 10.1016/j.specom.2009.08.009
- Kinnunen T, Lib H: An overview of text-independent speaker recognition: from features to supervectors. Speech Commun 2010, 52:12-40.
- (2010) Speech Commun , vol.52 , pp. 12-40
- Kinnunen, T.¹ Lib, H.²

18
- 84867584657
- The UMD-JHU 2011 speaker recognition system
- Kyoto Japan
- Garcia-Romero D, et al.: The UMD-JHU 2011 speaker recognition system. In Proc. IEEE Intl. Conf. Acoust. Speech Signal Proc. Kyoto, Japan; 2012:4229-4232.
- (2012) Proc. IEEE Intl. Conf. Acoust. Speech Signal Proc , pp. 4229-4232
- Garcia-Romero, D.¹

19
- 43249091937
- Speaker and session variability in gmm-based speaker verification
- 10.1109/TASL.2007.894527
- Kenny P, Boulianne G, Ouellet P, Dumouchel P: Speaker and session variability in gmm-based speaker verification. IEEE Trans. Audio Speech Lang. Process 2007, 15:1448-1460.
- (2007) IEEE Trans. Audio Speech Lang. Process , vol.15 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

20
- 84921385643
- Joint factor analysis for speaker recognition reinterpreted as signal coding using overcomplete dictionaries
- Brno Czech Republic
- Garcia-Romero D, Espy-Wilson C: Joint factor analysis for speaker recognition reinterpreted as signal coding using overcomplete dictionaries. In Proc. Odyssey Speaker and Language Recognition Workshop. Brno, Czech Republic; 2010:117-124.
- (2010) Proc. Odyssey Speaker and Language Recognition Workshop , pp. 117-124
- Garcia-Romero, D.¹ Espy-Wilson, C.²

21
- 0033884857
- Score normalization for text-independent speaker verification system
- 10.1006/dspr.1999.0360
- Auckenthaler R, Carey M, Lloyd-Thomas H: Score normalization for text-independent speaker verification system. Digit. Signal Proc 2000,1(10):42-54.
- (2000) Digit. Signal Proc , vol.1 , Issue.10 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Thomas, H.³

22
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- 4 Beijing China
- Hirsch H, Pearce D: The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In ISCA ITRW ASR2000. vol. 4 Beijing, China; 2000:29-32.
- (2000) ISCA ITRW ASR2000 , pp. 29-32
- Hirsch, H.¹ Pearce, D.²

23
- 84887049796
- NIST 2011 speaker recognition evaluation
- NIST 2011 speaker recognition evaluation http://www.nist.gov/itl/iad/mig/ best.cfm

24
- 27844598720
- Neuromimetic sound representation for percept detection and manipulation
- 1138.94388 10.1155/ASP.2005.1350
- Zotkin D, Chi T, Shamma SA, Duraiswami R: Neuromimetic sound representation for percept detection and manipulation. EURASIP J. App. Sig. Process 2005, 2005:1350-1364.
- (2005) EURASIP J. App. Sig. Process , vol.2005 , pp. 1350-1364
- Zotkin, D.¹ Chi, T.² Shamma, S.A.³ Duraiswami, R.⁴

25
- 0018906941
- A physical method for measuring speech-transmission quality
- 10.1121/1.384464
- Steeneken H, Houtgast T: A physical method for measuring speech-transmission quality. J. Acoust. Soc. Am 1979, 67:318-326.
- (1979) J. Acoust. Soc. Am , vol.67 , pp. 318-326
- Steeneken, H.¹ Houtgast, T.²

26
- 0027957839
- Effect of temporal envelope smearing on speech reception
- 10.1121/1.408467
- Drullman R, Festen J, Plomp R: Effect of temporal envelope smearing on speech reception. J. Acoust. Soc. Am 1994, 95:1053-1064.
- (1994) J. Acoust. Soc. Am , vol.95 , pp. 1053-1064
- Drullman, R.¹ Festen, J.² Plomp, R.³

27
- 0032945152
- Syllable intelligibility for temporally filtered lpc cepstral trajectories
- 10.1121/1.426895
- Arai T, Pavel M, Hermansky H, Avendano C: Syllable intelligibility for temporally filtered lpc cepstral trajectories. J. Acoust. Soc. Am 1999, 105:2783-2791.
- (1999) J. Acoust. Soc. Am , vol.105 , pp. 2783-2791
- Arai, T.¹ Pavel, M.² Hermansky, H.³ Avendano, C.⁴

28
- 77950864085
- The Role of Temporal Dynamics in Understanding Spoken Language
- IOS Press Amsterdam
- Greenberg S, Arai T, Grant K: The Role of Temporal Dynamics in Understanding Spoken Language. In NATO Science Series: Life and Behavioural Sciences. IOS Press, Amsterdam; 2006:171-190.
- (2006) NATO Science Series: Life and Behavioural Sciences , pp. 171-190
- Greenberg, S.¹ Arai, T.² Grant, K.³

29
- 34447100796
- CRC Press Boca Raton
- Loizou P: Speech Enhancement: Theory and Practice. CRC Press, Boca Raton; 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.