메뉴 건너뛰기




Volumn 21, Issue 7, 2012, Pages 1765-1773

Using DTW neural-based MFCC warping to improve emotional speech recognition

Author keywords

Dynamic time warping; Emotion; Frequency warping; Neural network; Speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; CALCULATION PROCESS; COMBINED STRUCTURE; DYNAMIC TIME WARPING; EMOTION; EMOTIONAL SPEECH; EMOTIONAL SPEECH RECOGNITION; EMOTIONAL STATE; FREQUENCY RANGES; FREQUENCY WARPING; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MULTI LAYER PERCEPTRON; RECOGNITION RATES; WARPING FACTORS;

EID: 84866447267     PISSN: 09410643     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00521-011-0620-8     Document Type: Article
Times cited : (40)

References (34)
  • 1
    • 0033335618 scopus 로고    scopus 로고
    • Modeling pronunciation variation for ASR: a survey of the literature
    • Strik H, Cucchiarini C (1999) Modeling pronunciation variation for ASR: a survey of the literature. Speech Commun 29: 225-246.
    • (1999) Speech Commun , vol.29 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 4
    • 33746410556 scopus 로고    scopus 로고
    • Emotional speech recognition: resources, features, and methods
    • Ververidis D, Kotropoulos C (2006) Emotional speech recognition: resources, features, and methods. Speech Commun 48: 1162-1181.
    • (2006) Speech Commun , vol.48 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 16
    • 84864948871 scopus 로고    scopus 로고
    • Pitch in emotional speech and emotional speech recognition using pitch frequency
    • Gharavian D, Sheikhan M, Janipour M (2010) Pitch in emotional speech and emotional speech recognition using pitch frequency. Majlesi J Electr Eng 4(1): 19-24.
    • (2010) Majlesi J Electr Eng , vol.4 , Issue.1 , pp. 19-24
    • Gharavian, D.1    Sheikhan, M.2    Janipour, M.3
  • 17
    • 0037382560 scopus 로고    scopus 로고
    • Emotions, speech and the ASR framework
    • Bosch LT (2003) Emotions, speech and the ASR framework. Speech Commun 40: 213-225.
    • (2003) Speech Commun , vol.40 , pp. 213-225
    • Bosch, L.T.1
  • 18
    • 79955539267 scopus 로고    scopus 로고
    • Contextual invariant-integration features for improved speaker-independent speech recognition
    • doi: 10. 1016/j. specom. 2011. 02. 002 Article in Press
    • Müller F, Mertins A (2011) Contextual invariant-integration features for improved speaker-independent speech recognition. Speech Commun. doi: 10. 1016/j. specom. 2011. 02. 002 Article in Press.
    • (2011) Speech Commun
  • 21
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales MJF (1998) Maximum likelihood linear transformations for HMM-based speech recognition. Comput Speech Lang 12: 75-98.
    • (1998) Comput Speech Lang , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 28
    • 26844479120 scopus 로고    scopus 로고
    • Warped discrete cosine transform-based noisy speech enhancement
    • Chang J-H (2005) Warped discrete cosine transform-based noisy speech enhancement. IEEE Trans Circuits Syst II 52: 535-539.
    • (2005) IEEE Trans Circuits Syst II , vol.52 , pp. 535-539
    • Chang, J.-H.1
  • 29
    • 44949157762 scopus 로고    scopus 로고
    • Frequency warping by linear transformation of standard MFCC
    • Panchapagesan S (2006) Frequency warping by linear transformation of standard MFCC. Proceedings of interspeech, pp 397-400.
    • (2006) Proceedings of Interspeech , pp. 397-400
    • Panchapagesan, S.1
  • 31
    • 77955423547 scopus 로고    scopus 로고
    • Fiction support for realistic portrayals of fear-type emotional manifestations
    • Clavel C, Vasilescu I, Devillers L (2011) Fiction support for realistic portrayals of fear-type emotional manifestations. Comput Speech Lang 25: 63-83.
    • (2011) Comput Speech Lang , vol.25 , pp. 63-83
    • Clavel, C.1    Vasilescu, I.2    Devillers, L.3
  • 34
    • 0016049328 scopus 로고
    • An Algorithm for formant extraction using linear prediction spectra
    • McCandless SS (1974) An Algorithm for formant extraction using linear prediction spectra. IEEE Trans Acoustics Speech Signal Process 22: 135-141.
    • (1974) IEEE Trans Acoustics Speech Signal Process , vol.22 , pp. 135-141
    • McCandless, S.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.