메뉴 건너뛰기




Volumn 2012, Issue 1, 2012, Pages

Biomimetic multi-resolution analysis for robust speaker recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPEECH PROCESSING SYSTEMS; BACKGROUND DISTORTIONS; BIOMIMETIC APPROACHES; INFORMATION REPRESENTATION; NONSTATIONARY NOISE; ROBUST SPEAKER RECOGNITION; SPEAKER RECOGNITION; SPEAKER VERIFICATION;

EID: 84881313414     PISSN: 16874714     EISSN: 16874722     Source Type: Journal    
DOI: 10.1186/1687-4722-2012-22     Document Type: Article
Times cited : (5)

References (29)
  • 3
    • 79957903950 scopus 로고    scopus 로고
    • Complex spectral interactions encoded by auditory cortical neurons: Relationship between bandwidth and pattern
    • O'Connor K, Yin P, Petkov C, Sutter M: Complex spectral interactions encoded by auditory cortical neurons: relationship between bandwidth and pattern. Front Syst. Neurosci 2010, 4:4-145.
    • (2010) Front Syst. Neurosci , vol.4 , pp. 4-145
    • O'Connor, K.1    Yin, P.2    Petkov, C.3    Sutter, M.4
  • 4
    • 45549100188 scopus 로고    scopus 로고
    • Speech analysis in a model of the central auditory system
    • 10.1109/TASL.2007.900102
    • Woojay J, Juang B: Speech analysis in a model of the central auditory system. IEEE Trans. Speech Audio Process 2007, 15:1802-1817.
    • (2007) IEEE Trans. Speech Audio Process , vol.15 , pp. 1802-1817
    • Woojay, J.1    Juang, B.2
  • 6
    • 0038711696 scopus 로고    scopus 로고
    • A spectro-temporal modulation index (STMI) for assessment of speech intelligibility
    • 10.1016/S0167-6393(02)00134-6
    • Elhilali M, Chi T, Shamma SA: A spectro-temporal modulation index (STMI) for assessment of speech intelligibility. Speech Commun 2003, 41:331-348.
    • (2003) Speech Commun , vol.41 , pp. 331-348
    • Elhilali, M.1    Chi, T.2    Shamma, S.A.3
  • 7
    • 63549114783 scopus 로고    scopus 로고
    • The modulation transfer function for speech intelligibility
    • 10.1371/journal.pcbi.1000302
    • Elliott T, Theunissen F: The modulation transfer function for speech intelligibility. PLoS Comput. Biol 2009, 5:e1000302.
    • (2009) PLoS Comput. Biol , vol.5 , pp. 1000302
    • Elliott, T.1    Theunissen, F.2
  • 8
    • 84887048542 scopus 로고    scopus 로고
    • NIST 2010 speaker recognition evaluation
    • NIST 2010 speaker recognition evaluation http://www.nist.gov/speech/ tests/sre/2010
  • 9
    • 0026626445 scopus 로고
    • Auditory representations of acoustic signals
    • 10.1109/18.119739
    • Yang X, Wang K, Shamma SA: Auditory representations of acoustic signals. IEEE Trans. Inf. Theory 1992, 38:824-839.
    • (1992) IEEE Trans. Inf. Theory , vol.38 , pp. 824-839
    • Yang, X.1    Wang, K.2    Shamma, S.A.3
  • 10
    • 0028462212 scopus 로고
    • Self-normalization noise-robustness in early auditory representations
    • 10.1109/89.294356
    • Wang K, Shamma SA: Self-normalization noise-robustness in early auditory representations. IEEE Trans. Speech Audio Process 1994, 2:421-435.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , pp. 421-435
    • Wang, K.1    Shamma, S.A.2
  • 11
    • 0003045511 scopus 로고
    • Spectral envelope coding in cat primary auditory cortex: Properties of ripple transfer functions
    • Schreiner C, Calhoun B: Spectral envelope coding in cat primary auditory cortex: properties of ripple transfer functions. J. Aud. Neurosc 1995, 1:39-61.
    • (1995) J. Aud. Neurosc , vol.1 , pp. 39-61
    • Schreiner, C.1    Calhoun, B.2
  • 12
    • 0028859810 scopus 로고
    • Ripple analysis in ferret primary auditory cortex. Iii. Topographic distribution of ripple response parameters
    • Versnel H, Kowalski N, Shamma SA: Ripple analysis in ferret primary auditory cortex. iii. topographic distribution of ripple response parameters. J. Aud. Neurosc 1995, 1:271-286.
    • (1995) J. Aud. Neurosc , vol.1 , pp. 271-286
    • Versnel, H.1    Kowalski, N.2    Shamma, S.A.3
  • 14
    • 0036082510 scopus 로고    scopus 로고
    • Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex
    • Miller L, Escabi M, Read H, Schreiner C: Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex. J. Neurophysiol 2002,87(1):516-527.
    • (2002) J. Neurophysiol , vol.87 , Issue.1 , pp. 516-527
    • Miller, L.1    Escabi, M.2    Read, H.3    Schreiner, C.4
  • 17
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • 10.1016/j.specom.2009.08.009
    • Kinnunen T, Lib H: An overview of text-independent speaker recognition: from features to supervectors. Speech Commun 2010, 52:12-40.
    • (2010) Speech Commun , vol.52 , pp. 12-40
    • Kinnunen, T.1    Lib, H.2
  • 20
    • 84921385643 scopus 로고    scopus 로고
    • Joint factor analysis for speaker recognition reinterpreted as signal coding using overcomplete dictionaries
    • Brno Czech Republic
    • Garcia-Romero D, Espy-Wilson C: Joint factor analysis for speaker recognition reinterpreted as signal coding using overcomplete dictionaries. In Proc. Odyssey Speaker and Language Recognition Workshop. Brno, Czech Republic; 2010:117-124.
    • (2010) Proc. Odyssey Speaker and Language Recognition Workshop , pp. 117-124
    • Garcia-Romero, D.1    Espy-Wilson, C.2
  • 21
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for text-independent speaker verification system
    • 10.1006/dspr.1999.0360
    • Auckenthaler R, Carey M, Lloyd-Thomas H: Score normalization for text-independent speaker verification system. Digit. Signal Proc 2000,1(10):42-54.
    • (2000) Digit. Signal Proc , vol.1 , Issue.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 22
    • 0038669544 scopus 로고    scopus 로고
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • 4 Beijing China
    • Hirsch H, Pearce D: The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In ISCA ITRW ASR2000. vol. 4 Beijing, China; 2000:29-32.
    • (2000) ISCA ITRW ASR2000 , pp. 29-32
    • Hirsch, H.1    Pearce, D.2
  • 23
    • 84887049796 scopus 로고    scopus 로고
    • NIST 2011 speaker recognition evaluation
    • NIST 2011 speaker recognition evaluation http://www.nist.gov/itl/iad/mig/ best.cfm
  • 24
    • 27844598720 scopus 로고    scopus 로고
    • Neuromimetic sound representation for percept detection and manipulation
    • 1138.94388 10.1155/ASP.2005.1350
    • Zotkin D, Chi T, Shamma SA, Duraiswami R: Neuromimetic sound representation for percept detection and manipulation. EURASIP J. App. Sig. Process 2005, 2005:1350-1364.
    • (2005) EURASIP J. App. Sig. Process , vol.2005 , pp. 1350-1364
    • Zotkin, D.1    Chi, T.2    Shamma, S.A.3    Duraiswami, R.4
  • 25
    • 0018906941 scopus 로고
    • A physical method for measuring speech-transmission quality
    • 10.1121/1.384464
    • Steeneken H, Houtgast T: A physical method for measuring speech-transmission quality. J. Acoust. Soc. Am 1979, 67:318-326.
    • (1979) J. Acoust. Soc. Am , vol.67 , pp. 318-326
    • Steeneken, H.1    Houtgast, T.2
  • 26
    • 0027957839 scopus 로고
    • Effect of temporal envelope smearing on speech reception
    • 10.1121/1.408467
    • Drullman R, Festen J, Plomp R: Effect of temporal envelope smearing on speech reception. J. Acoust. Soc. Am 1994, 95:1053-1064.
    • (1994) J. Acoust. Soc. Am , vol.95 , pp. 1053-1064
    • Drullman, R.1    Festen, J.2    Plomp, R.3
  • 27
    • 0032945152 scopus 로고    scopus 로고
    • Syllable intelligibility for temporally filtered lpc cepstral trajectories
    • 10.1121/1.426895
    • Arai T, Pavel M, Hermansky H, Avendano C: Syllable intelligibility for temporally filtered lpc cepstral trajectories. J. Acoust. Soc. Am 1999, 105:2783-2791.
    • (1999) J. Acoust. Soc. Am , vol.105 , pp. 2783-2791
    • Arai, T.1    Pavel, M.2    Hermansky, H.3    Avendano, C.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.