Volume 88, 2017, Pages 39-64

Empirical Mode Decomposition for adaptive AM-FM analysis of Speech: A Review

Author keywords

AM FM; EMD; LP; MFCC; Speech Processing; Wavelet

Indexed keywords

FOURIER ANALYSIS; FREQUENCY MODULATION; SIGNAL ANALYSIS; SPEECH PROCESSING

EID: 85009788345     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2016.12.004     Document Type: Review
Times cited: 75

References (150)
  • 1
    • 33646255447
    • Further intelligibility results from human listening tests using the short-time phase spectrum
    • Alsteris, L.D., Paliwal, K.K., Further intelligibility results from human listening tests using the short-time phase spectrum. Speech Commun. 48:6 (2006), 727–736.
    • (2006) Speech Commun. , vol.48 , Issue.6 , pp. 727-736
    • Alsteris, L.D.1    Paliwal, K.K.2
  • 2
    • 0018656516
    • Epoch extraction from linear prediction residual for identification of closed glottis interval
    • Ananthapadmanabha, T., Yegnanarayana, B., Epoch extraction from linear prediction residual for identification of closed glottis interval. Acoust. Speech Signal Process. IEEE Trans. 27:4 (1979), 309–319.
    • (1979) Acoust. Speech Signal Process. IEEE Trans. , vol.27 , Issue.4 , pp. 309-319
    • Ananthapadmanabha, T.1    Yegnanarayana, B.2
  • 3
    • 0003435075
    • Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms
    • Oxford University Press Oxford, UK
    • Bäck, T., Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms. 1996, Oxford University Press, Oxford, UK.
    • (1996)
    • Bäck, T.1
  • 4
    • 38749108114
    • Private emotions versus social interaction: a data-driven approach towards analysing emotion in speech
    • Batliner, A., Steidl, S., Hacker, C., Nöth, E., Private emotions versus social interaction: a data-driven approach towards analysing emotion in speech. User Model. User-Adapt. Interact. 18:1–2 (2008), 175–206, 10.1007/s11257-007-9039-4.
    • (2008) User Model. User-Adapt. Interact. , vol.18 , Issue.1-2 , pp. 175-206
    • Batliner, A.1    Steidl, S.2    Hacker, C.3    Nöth, E.4
  • 5
    • 33846449094
    • Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients’ speech signal with unilateral vocal fold paralysis
    • Behroozmand, R., Almasganj, F., Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients’ speech signal with unilateral vocal fold paralysis. Comput. Biol. Med. 37:4 (2007), 474–485, 10.1016/j.compbiomed.2006.08.016.
    • (2007) Comput. Biol. Med. , vol.37 , Issue.4 , pp. 474-485
    • Behroozmand, R.1    Almasganj, F.2
  • 6
    • 67650565075
    • Springer Handbook of Speech Processing
    • Springer Science & Business Media
    • Benesty, J., Sondhi, M.M., Huang, Y., Springer Handbook of Speech Processing. 2008, Springer Science & Business Media.
    • (2008)
    • Benesty, J.1    Sondhi, M.M.2    Huang, Y.3
  • 7
    • 0037776118
    • Time Frequency Analysis
    • Gulf Professional Publishing
    • Boashash, B., Time Frequency Analysis. 2003, Gulf Professional Publishing.
    • (2003)
    • Boashash, B.1
  • 10
    • 38549125582
    • Voiced speech analysis by empirical mode decomposition
    • Springer
    • Bouzid, A., Ellouze, N., Voiced speech analysis by empirical mode decomposition. Advances in Nonlinear Speech Processing, 2007, Springer, 213–220.
    • (2007) Advances in Nonlinear Speech Processing , pp. 213-220
    • Bouzid, A.1    Ellouze, N.2
  • 11
    • 0027874671
    • AM-FM energy detection and separation in noise using multiband energy operators
    • Bovik, A.C., Maragos, P., Quatieri, T.F., AM-FM energy detection and separation in noise using multiband energy operators. Signal Process. IEEE Trans. 41:12 (1993), 3245–3265.
    • (1993) Signal Process. IEEE Trans. , vol.41 , Issue.12 , pp. 3245-3265
    • Bovik, A.C.1    Maragos, P.2    Quatieri, T.F.3
  • 12
    • 79953288449
    • Data driven design of filter bank for speech recognition
    • Springer Lecture Notes in Computer Science
    • Burget, L., Heřmanský, H., Data driven design of filter bank for speech recognition. Text, Speech and Dialogue, 2001, Springer, 299–304 Lecture Notes in Computer Science.
    • (2001) Text, Speech and Dialogue , pp. 299-304
    • Burget, L.1    Heřmanský, H.2
  • 14
    • 44949251671
    • Data-driven design of front-end filter bank for Lombard speech recognition
    • Pittsburgh, Pennsylvania
    • Bořil, H., Fousek, P., Pollák, P., Data-driven design of front-end filter bank for Lombard speech recognition. Proceedings of INTERSPEECH 2006 - ICSLP, 2006, 381–384 Pittsburgh, Pennsylvania.
    • (2006) Proceedings of INTERSPEECH 2006 - ICSLP , pp. 381-384
    • Bořil, H.1    Fousek, P.2    Pollák, P.3
  • 15
    • 67649119677
    • Optimizing feature complementarity by evolution strategy: application to automatic speaker verification
    • Charbuillet, C., Gas, B., Chetouani, M., Zarader, J., Optimizing feature complementarity by evolution strategy: application to automatic speaker verification. Speech Commun. 51:9 (2009), 724–731.
    • (2009) Speech Commun. , vol.51 , Issue.9 , pp. 724-731
    • Charbuillet, C.1    Gas, B.2    Chetouani, M.3    Zarader, J.4
  • 16
    • 84857276262
    • EMD-based filtering (EMDF) of low-frequency noise for speech enhancement
    • Chatlani, N., Soraghan, J.J., EMD-based filtering (EMDF) of low-frequency noise for speech enhancement. Audio Speech Lang. Process. IEEE Trans. 20:4 (2012), 1158–1166.
    • (2012) Audio Speech Lang. Process. IEEE Trans. , vol.20 , Issue.4 , pp. 1158-1166
    • Chatlani, N.1    Soraghan, J.J.2
  • 17
    • 4544254654
    • A technique to improve the empirical mode decomposition in the Hilbert-Huang transform
    • Chen, Y., Feng, M.Q., A technique to improve the empirical mode decomposition in the Hilbert-Huang transform. Earthquake Eng. Eng. Vib. 2:1 (2003), 75–85.
    • (2003) Earthquake Eng. Eng. Vib. , vol.2 , Issue.1 , pp. 75-85
    • Chen, Y.1    Feng, M.Q.2
  • 18
    • 0003733873
    • Time-Frequency Analysis
    • Prentice Hall PTR, Englewood Cliffs, NJ
    • Cohen, L., Time-Frequency Analysis. 1406, 1995, Prentice Hall PTR, Englewood Cliffs, NJ.
    • (1995) , vol.1406
    • Cohen, L.1
  • 19
    • 0026686048
    • Entropy-based algorithms for best basis selection
    • Coifman, R., Wickerhauser, M.V., Entropy-based algorithms for best basis selection. IEEE Trans. Inf. Theory 38:2 (1992), 713–718.
    • (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 713-718
    • Coifman, R.1    Wickerhauser, M.V.2
  • 20
    • 84904579720
    • Improved complete ensemble EMD: a suitable tool for biomedical signal processing
    • Colominas, M.A., Schlotthauer, G., Torres, M.E., Improved complete ensemble EMD: a suitable tool for biomedical signal processing. Biomed. Signal Process. Control 14 (2014), 19–29.
    • (2014) Biomed. Signal Process. Control , vol.14 , pp. 19-29
    • Colominas, M.A.1    Schlotthauer, G.2    Torres, M.E.3
  • 21
    • 84933676946
    • An unconstrained optimization approach to empirical mode decomposition
    • Colominas, M.A., Schlotthauer, G., Torres, M.E., An unconstrained optimization approach to empirical mode decomposition. Digit. Signal Process. 40 (2015), 164–175.
    • (2015) Digit. Signal Process. , vol.40 , pp. 164-175
    • Colominas, M.A.1    Schlotthauer, G.2    Torres, M.E.3
  • 25
    • 0031012371
    • Acoustic characteristics of the piriform fossa in models and humans
    • Dang, J., Honda, K., Acoustic characteristics of the piriform fossa in models and humans. J. Acoust. Soc. Am. 101:1 (1997), 456–465.
    • (1997) J. Acoust. Soc. Am. , vol.101 , Issue.1 , pp. 456-465
    • Dang, J.1    Honda, K.2
  • 27
    • 0003424145
    • Discrete-Time Processing of Speech Signals
    • Macmillan Publishing, New York
    • Deller, J.R., Proakis, J.G., Hansen, J.H., Discrete-Time Processing of Speech Signals. 1993, Macmillan Publishing, New York.
    • (1993)
    • Deller, J.R.1    Proakis, J.G.2    Hansen, J.H.3
  • 32
    • 70450198169
    • Glottal closure and opening instant detection from speech signals.
    • Drugman, T., Dutoit, T., Glottal closure and opening instant detection from speech signals. Interspeech, 2009, 2891–2894.
    • (2009) Interspeech , pp. 2891-2894
    • Drugman, T.1    Dutoit, T.2
  • 34
    • 0037363455
    • Approximations with evolutionary pursuit
    • Ferreira da Silva, A.R., Approximations with evolutionary pursuit. Signal Process. 83:3 (2003), 465–481.
    • (2003) Signal Process. , vol.83 , Issue.3 , pp. 465-481
    • Ferreira da Silva, A.R.1
  • 36
    • 23344453279
    • Empirical mode decompositions as data-driven wavelet-like expansions
    • Flandrin, P., Goncalves, P., Empirical mode decompositions as data-driven wavelet-like expansions. Int. J. Wavelets Multiresolution Inf. Process. 2:04 (2004), 477–496.
    • (2004) Int. J. Wavelets Multiresolution Inf. Process. , vol.2 , Issue.4 , pp. 477-496
    • Flandrin, P.1    Goncalves, P.2
  • 38
    • 34547520971
    • Detrending and Denoising with Empirical Mode Decompositions
    • Citeseer
    • Flandrin, P., Gonçalves, P., Rilling, G., et al. Detrending and Denoising with Empirical Mode Decompositions. 2004, Citeseer.
    • (2004)
    • Flandrin, P.1    Gonçalves, P.2    Rilling, G.3
  • 40
    • 47049116566
    • Comparative evaluation of various MFCC implementations on the speaker verification task
    • Ganchev, T., Fakotakis, N., Kokkinakis, G., Comparative evaluation of various MFCC implementations on the speaker verification task. Proceedings of the SPECOM, 1, 2005, 191–194.
    • (2005) Proceedings of the SPECOM , vol.1 , pp. 191-194
    • Ganchev, T.1    Fakotakis, N.2    Kokkinakis, G.3
  • 42
    • 66149120614
    • Speaker identification using instantaneous frequencies
    • Grimaldi, M., Cummins, F., Speaker identification using instantaneous frequencies. Audio Speech Lang. Process. IEEE Trans. 16:6 (2008), 1097–1111.
    • (2008) Audio Speech Lang. Process. IEEE Trans. , vol.16 , Issue.6 , pp. 1097-1111
    • Grimaldi, M.1    Cummins, F.2
  • 43
    • 84888271524
    • Speech denoising based on empirical mode decomposition and improved thresholding
    • Springer
    • Hadhami, I., Bouzid, A., Speech denoising based on empirical mode decomposition and improved thresholding. Advances in Nonlinear Speech Processing, 2013, Springer, 200–207.
    • (2013) Advances in Nonlinear Speech Processing , pp. 200-207
    • Hadhami, I.1    Bouzid, A.2
  • 44
    • 79956708123
    • Speech Production and Speech Modelling
    • Springer Science & Business Media
    • Hardcastle, W.J., Marchal, A., Speech Production and Speech Modelling. 55, 1990, Springer Science & Business Media.
    • (1990) , vol.55
    • Hardcastle, W.J.1    Marchal, A.2
  • 45
    • 84865743286
    • Robust speaker recognition in non-stationary room environments based on empirical mode decomposition.
    • Hasan, T., Hansen, J.H., Robust speaker recognition in non-stationary room environments based on empirical mode decomposition. INTERSPEECH, 2011, 2733–2736.
    • (2011) INTERSPEECH , pp. 2733-2736
    • Hasan, T.1    Hansen, J.H.2
  • 46
    • 58049207757
    • Suppression of residual noise from speech signals using empirical mode decomposition
    • Hasan, T., Hasan, M.K., Suppression of residual noise from speech signals using empirical mode decomposition. Signal Process. Lett. IEEE 16:1 (2009), 2–5.
    • (2009) Signal Process. Lett. IEEE , vol.16 , Issue.1 , pp. 2-5
    • Hasan, T.1    Hasan, M.K.2
  • 48
    • 79952707334
    • Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech
    • He, L., Lech, M., Maddage, N.C., Allen, N.B., Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech. Biomed. Signal Process. Control 6:2 (2011), 139–146.
    • (2011) Biomed. Signal Process. Control , vol.6 , Issue.2 , pp. 139-146
    • He, L.1    Lech, M.2    Maddage, N.C.3    Allen, N.B.4
  • 49
    • 84979784313
    • Advances in Non-Linear Modeling for Speech Processing
    • Springer Science & Business Media
    • Holambe, R.S., Deshpande, M.S., Advances in Non-Linear Modeling for Speech Processing. 2012, Springer Science & Business Media.
    • (2012)
    • Holambe, R.S.1    Deshpande, M.S.2
  • 52
    • 33846169080
    • Speech formant frequency estimation based on Hilbert-Huang transform
    • Huang, H., Chen, X.-x., Speech formant frequency estimation based on Hilbert-Huang transform. J. Zhejiang Univ. Eng. Sci., 40(11), 2006, 1926.
    • (2006) J. Zhejiang Univ. Eng. Sci. , vol.40 , Issue.11 , pp. 1926
    • Huang, H.1    Chen, X.-X.2
  • 53
    • 32644438199
    • Speech pitch determination based on Hilbert-Huang transform
    • Huang, H., Pan, J., Speech pitch determination based on Hilbert-Huang transform. Signal Process. 86:4 (2006), 792–803.
    • (2006) Signal Process. , vol.86 , Issue.4 , pp. 792-803
    • Huang, H.1    Pan, J.2
  • 54
    • 85009732552
    • Empirical mode decomposition and Hilbert spectral analysis
    • Huang, N.E., Empirical mode decomposition and Hilbert spectral analysis. 1998.
    • (1998)
    • Huang, N.E.1
  • 55
    • 85115665605
    • Hilbert-Huang Transform and Its Applications
    • World Scientific
    • Huang, N.E., Shen, S.S., Hilbert-Huang Transform and Its Applications. 5, 2005, World Scientific.
    • (2005) , vol.5
    • Huang, N.E.1    Shen, S.S.2
  • 56
    • 5444236478
    • The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis
    • Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., Yen, N.-C., Tung, C.C., Liu, H.H., The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. London. Ser.A 454:1971 (1998), 903–995.
    • (1998) Proc. R. Soc. London. Ser.A , vol.454 , Issue.1971 , pp. 903-995
    • Huang, N.E.1    Shen, Z.2    Long, S.R.3    Wu, M.C.4    Shih, H.H.5    Zheng, Q.6    Yen, N.-C.7    Tung, C.C.8    Liu, H.H.9
  • 58
    • 85009773294
    • Empirical mode decomposition for advanced speech signal processing
    • Islam Molla, M.K., Das, S., Hamid, M.E., Hirose, K., Empirical mode decomposition for advanced speech signal processing. J. Signal Process. 17:6 (2013), 215–229.
    • (2013) J. Signal Process. , vol.17 , Issue.6 , pp. 215-229
    • Islam Molla, M.K.1    Das, S.2    Hamid, M.E.3    Hirose, K.4
  • 59
    • 0033328948
    • Teager energy based feature parameters for speech recognition in car noise
    • Jabloun, F., Cetin, A.E., Erzin, E., Teager energy based feature parameters for speech recognition in car noise. Signal Process. Lett. IEEE 6:10 (1999), 259–261.
    • (1999) Signal Process. Lett. IEEE , vol.6 , Issue.10 , pp. 259-261
    • Jabloun, F.1    Cetin, A.E.2    Erzin, E.3
  • 61
    • 84979272271
    • Classification of environmental background noise sources using Hilbert-Huang transform
    • Jhanwar, D., Sharma, K.K., Modani, S., Classification of environmental background noise sources using Hilbert-Huang transform. Int. J. Signal Process. Syst., 1, 2013.
    • (2013) Int. J. Signal Process. Syst. , vol.1
    • Jhanwar, D.1    Sharma, K.K.2    Modani, S.3
  • 63
    • 0001059592
    • Some observations on vocal tract operation from a fluid flow point of view
    • Kaiser, J.F., Some observations on vocal tract operation from a fluid flow point of view. Vocal Fold Physiol., 1983, 358–386.
    • (1983) Vocal Fold Physiol. , pp. 358-386
    • Kaiser, J.F.1
  • 65
    • 84879074231
    • Pathological speech signal analysis and classification using empirical mode decomposition
    • Kaleem, M., Ghoraani, B., Guergachi, A., Krishnan, S., Pathological speech signal analysis and classification using empirical mode decomposition. Med. Biol. Eng. Comput. 51:7 (2013), 811–821.
    • (2013) Med. Biol. Eng. Comput. , vol.51 , Issue.7 , pp. 811-821
    • Kaleem, M.1    Ghoraani, B.2    Guergachi, A.3    Krishnan, S.4
  • 67
    • 12844282873
    • Individual variation of the hypopharyngeal cavities and its acoustic effects
    • Kitamura, T., Honda, K., Takemoto, H., Individual variation of the hypopharyngeal cavities and its acoustic effects. Acoust. Sci. Technol. 26:1 (2005), 16–26.
    • (2005) Acoust. Sci. Technol. , vol.26 , Issue.1 , pp. 16-26
    • Kitamura, T.1    Honda, K.2    Takemoto, H.3
  • 68
    • 43949145296
    • Improved EMD using doubly-iterative sifting and high order spline interpolation
    • Kopsinis, Y., McLaughlin, S., Improved EMD using doubly-iterative sifting and high order spline interpolation. EURASIP J. Adv. Signal Process., 2008, 2008, 120.
    • (2008) EURASIP J. Adv. Signal Process. , vol.2008 , pp. 120
    • Kopsinis, Y.1    McLaughlin, S.2
  • 69
    • 63449122839
    • Development of EMD-based denoising methods inspired by wavelet thresholding
    • Kopsinis, Y., McLaughlin, S., Development of EMD-based denoising methods inspired by wavelet thresholding. Signal Process. IEEE Trans. 57:4 (2009), 1351–1362.
    • (2009) Signal Process. IEEE Trans. , vol.57 , Issue.4 , pp. 1351-1362
    • Kopsinis, Y.1    McLaughlin, S.2
  • 70
    • 79955928904
    • Speech emotion recognition using novel HHT-TEO based features
    • Li, X., Li, X., Speech emotion recognition using novel HHT-TEO based features. J. Comput. 6:5 (2011), 989–998.
    • (2011) J. Comput. , vol.6 , Issue.5 , pp. 989-998
    • Li, X.1    Li, X.2
  • 72
    • 40249090511
    • An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
    • Lu, X., Dang, J., An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification. Speech Commun. 50:4 (2008), 312–322.
    • (2008) Speech Commun. , vol.50 , Issue.4 , pp. 312-322
    • Lu, X.1    Dang, J.2
  • 75
    • 0027676955
    • Energy separation in signal modulations with application to speech analysis
    • Maragos, P., Kaiser, J.F., Quatieri, T.F., Energy separation in signal modulations with application to speech analysis. Signal Process. IEEE Trans. 41:10 (1993), 3024–3051.
    • (1993) Signal Process. IEEE Trans. , vol.41 , Issue.10 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.F.2    Quatieri, T.F.3
  • 76
    • 0028460895
    • Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM's
    • Matsui, T., Furui, S., Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM's. Speech Audio Process. IEEE Trans. 2:3 (1994), 456–459.
    • (1994) Speech Audio Process. IEEE Trans. , vol.2 , Issue.3 , pp. 456-459
    • Matsui, T.1    Furui, S.2
  • 77
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • McAulay, R., Quatieri, T.F., Speech analysis/synthesis based on a sinusoidal representation. Acoust. Speech Signal Process. IEEE Trans. 34:4 (1986), 744–754.
    • (1986) Acoust. Speech Signal Process. IEEE Trans. , vol.34 , Issue.4 , pp. 744-754
    • McAulay, R.1    Quatieri, T.F.2
  • 80
    • 84867211310
    • Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model.
    • Molla, M.K.I., Hirose, K., Minematsu, N., Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model. INTERSPEECH, 2008, 2530–2533.
    • (2008) INTERSPEECH , pp. 2530-2533
    • Molla, M.K.I.1    Hirose, K.2    Minematsu, N.3
  • 81
    • 85032750831
    • Auditory perception and cognition
    • Munkong, R., Juang, B.-H., Auditory perception and cognition. Signal Process. Mag. IEEE 25:3 (2008), 98–117, 10.1109/MSP.2008.918418.
    • (2008) Signal Process. Mag. IEEE , vol.25 , Issue.3 , pp. 98-117
    • Munkong, R.1    Juang, B.-H.2
  • 82
    • 0037751491
    • PhD dissertation, Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India
    • Murthy, H.A., Algorithms for Processing Fourier Transform Phase of Signals, 1992, PhD dissertation, Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India.
    • (1992) Algorithms for Processing Fourier Transform Phase of Signals
    • Murthy, H.A.1
  • 83
    • 0024681756
    • Effectiveness of representation of signals through group delay functions
    • Murthy, K.M., Yegnanarayana, B., Effectiveness of representation of signals through group delay functions. Signal Process. 17:2 (1989), 141–150.
    • (1989) Signal Process. , vol.17 , Issue.2 , pp. 141-150
    • Murthy, K.M.1    Yegnanarayana, B.2
  • 86
    • 84899710850
    • A new approach of audio emotion recognition
    • Ooi, C.S., Seng, K.P., Ang, L.-M., Chew, L.W., A new approach of audio emotion recognition. Expert Syst. Appl. 41:13 (2014), 5858–5869, 10.1016/j.eswa.2014.03.026.
    • (2014) Expert Syst. Appl. , vol.41 , Issue.13 , pp. 5858-5869
    • Ooi, C.S.1    Seng, K.P.2    Ang, L.-M.3    Chew, L.W.4
  • 87
    • 85009100883
    • Usefulness of phase spectrum in human speech perception.
    • Paliwal, K.K., Alsteris, L.D., Usefulness of phase spectrum in human speech perception. INTERSPEECH, 2003.
    • (2003) INTERSPEECH
    • Paliwal, K.K.1    Alsteris, L.D.2
  • 88
    • 13544259544
    • On the usefulness of STFT phase spectrum in human listening tests
    • Paliwal, K.K., Alsteris, L.D., On the usefulness of STFT phase spectrum in human listening tests. Speech Commun. 45:2 (2005), 153–170.
    • (2005) Speech Commun. , vol.45 , Issue.2 , pp. 153-170
    • Paliwal, K.K.1    Alsteris, L.D.2
  • 89
    • 78049294305
    • Adaptive AM–FM signal decomposition with application to speech analysis
    • Pantazis, Y., Rosec, O., Stylianou, Y., Adaptive AM–FM signal decomposition with application to speech analysis. IEEE Trans. Audio Speech Lang. Process. 19:2 (2011), 290–300.
    • (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.2 , pp. 290-300
    • Pantazis, Y.1    Rosec, O.2    Stylianou, Y.3
  • 90
    • 0141626061
    • The wavelet tutorial
    • Polikar, R., The wavelet tutorial. 1996.
    • (1996)
    • Polikar, R.1
  • 91
    • 0030008906
    • Speech formant frequency and bandwidth tracking using multiband energy demodulation
    • Potamianos, A., Maragos, P., Speech formant frequency and bandwidth tracking using multiband energy demodulation. J. Acoust. Soc. Am. 99:6 (1996), 3795–3806.
    • (1996) J. Acoust. Soc. Am. , vol.99 , Issue.6 , pp. 3795-3806
    • Potamianos, A.1    Maragos, P.2
  • 93
    • 84893027328
    • A bag-of-tones model with MFCC features for musical genre classification
    • Motoda, H., Wu, Z., Cao, L., Zaiane, O., Yao, M., Wang, W. (eds.), Springer Berlin Heidelberg
    • Qin, Z., Liu, W., Wan, T., A bag-of-tones model with MFCC features for musical genre classification. Motoda, H., Wu, Z., Cao, L., Zaiane, O., Yao, M., Wang, W. (eds.), Advanced Data Mining and Applications, Lecture Notes in Computer Science, 8346, 2013, Springer Berlin Heidelberg, 564–575, 10.1007/978-3-642-53914-5_48.
    • (2013) Advanced Data Mining and Applications, Lecture Notes in Computer Science , vol.8346 , pp. 564-575
    • Qin, Z.1    Liu, W.2    Wan, T.3
  • 95
    • 0003425258
    • Digital Processing of Speech Signals
    • Prentice-hall Englewood Cliffs
    • Rabiner, L.R., Schafer, R.W., Digital Processing of Speech Signals. 100, 1978, Prentice-hall Englewood Cliffs.
    • (1978) , vol.100
    • Rabiner, L.R.1    Schafer, R.W.2
  • 96
  • 98
    • 84876432523
    • PhD thesis, Ecole normale supérieure de Lyon
    • Rilling, G., Décompositions modales empiriques, 2007, PhD thesis, Ecole normale supérieure de Lyon.
    • (2007) Décompositions modales empiriques
    • Rilling, G.1
  • 99
    • 33947638915
    • On the influence of sampling on the empirical mode decomposition.
    • Rilling, G., Flandrin, P., On the influence of sampling on the empirical mode decomposition. ICASSP (3), 2006, 444–447.
    • (2006) ICASSP (3) , pp. 444-447
    • Rilling, G.1    Flandrin, P.2
  • 100
    • 85008018510
    • One or two frequencies? The empirical mode decomposition answers
    • Rilling, G., Flandrin, P., One or two frequencies? The empirical mode decomposition answers. Signal Process. IEEE Trans. 56:1 (2008), 85–95.
    • (2008) Signal Process. IEEE Trans. , vol.56 , Issue.1 , pp. 85-95
    • Rilling, G.1    Flandrin, P.2
  • 101
    • 33646819710
    • Empirical mode decomposition, fractional Gaussian noise and Hurst exponent estimation.
    • Rilling, G., Flandrin, P., Gonçalves, P., Empirical mode decomposition, fractional Gaussian noise and Hurst exponent estimation. ICASSP (4), 2005, 489–492.
    • (2005) ICASSP (4) , pp. 489-492
    • Rilling, G.1    Flandrin, P.2    Gonçalves, P.3
  • 104
    • 84877703493
    • Wavelet adaptation for automatic voice disorders sorting
    • Saeedi, N.E., Almasganj, F., Wavelet adaptation for automatic voice disorders sorting. Comput. Biol. Med. 43:6 (2013), 699–704, 10.1016/j.compbiomed.2013.03.006.
    • (2013) Comput. Biol. Med. , vol.43 , Issue.6 , pp. 699-704
    • Saeedi, N.E.1    Almasganj, F.2
  • 105
    • 84872166152
    • A novel windowing technique for efficient computation of MFCC for speaker recognition
    • Sahidullah, M., Saha, G., A novel windowing technique for efficient computation of MFCC for speaker recognition. Signal Process. Lett. IEEE 20:2 (2013), 149–152, 10.1109/LSP.2012.2235067.
    • (2013) Signal Process. Lett. IEEE , vol.20 , Issue.2 , pp. 149-152
    • Sahidullah, M.1    Saha, G.2
  • 107
    • 0000453879
    • Local discriminant bases and their applications
    • Saito, N., Coifman, R., Local discriminant bases and their applications. J. Math. Imaging Vis. 5:4 (1995), 337–358, 10.1007/BF01250288.
    • (1995) J. Math. Imaging Vis. , vol.5 , Issue.4 , pp. 337-358
    • Saito, N.1    Coifman, R.2
  • 112
    • 84979763198
    • A better decomposition of speech obtained using modified empirical mode decomposition
    • Sharma, R., Prasanna, S.M., A better decomposition of speech obtained using modified empirical mode decomposition. Digit. Signal Process. 58 (2016), 26–39 http://dx.doi.org/10.1016/j.dsp.2016.07.012.
    • (2016) Digit. Signal Process. , vol.58 , pp. 26-39
    • Sharma, R.1    Prasanna, S.M.2
  • 114
  • 116
    • 4444368779
    • Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition
    • Skowronski, M., Harris, J., Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition. J. Acoust. Soc. Am. 116:3 (2004), 1774–1780.
    • (2004) J. Acoust. Soc. Am. , vol.116 , Issue.3 , pp. 1774-1780
    • Skowronski, M.1    Harris, J.2
  • 117
    • 0029375490
    • Determination of instants of significant excitation in speech using group delay function
    • Smits, R., Yegnanarayana, B., Determination of instants of significant excitation in speech using group delay function. Speech Audio Process. IEEE Trans. 3:5 (1995), 325–333.
    • (1995) Speech Audio Process. IEEE Trans. , vol.3 , Issue.5 , pp. 325-333
    • Smits, R.1    Yegnanarayana, B.2
  • 118
    • 34548794790
    • Determination of instants of significant excitation in speech using Hilbert envelope and group delay function
    • Sreenivasa Rao, K., Prasanna, S., Yegnanarayana, B., Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. Signal Process. Lett. IEEE 14:10 (2007), 762–765.
    • (2007) Signal Process. Lett. IEEE , vol.14 , Issue.10 , pp. 762-765
    • Sreenivasa Rao, K.1    Prasanna, S.2    Yegnanarayana, B.3
  • 119
    • 70449388050
    • Automatic Classification of Emotion-Related User States in Spontaneous Children's Speech
    • Logos Verlag
    • Steidl, S., Automatic Classification of Emotion-Related User States in Spontaneous Children's Speech. 2009, Logos Verlag.
    • (2009)
    • Steidl, S.1
  • 120
    • 84922740791
    • Joint variable frame rate and length analysis for speech recognition under adverse conditions
    • Tan, Z.-H., Kraljevski, I., Joint variable frame rate and length analysis for speech recognition under adverse conditions. Comput. Electr. Eng. 40:7 (2014), 2139–2149.
    • (2014) Comput. Electr. Eng. , vol.40 , Issue.7 , pp. 2139-2149
    • Tan, Z.-H.1    Kraljevski, I.2
  • 121
    • 0019075685
    • Some observations on oral air flow during phonation
    • Teager, H., Some observations on oral air flow during phonation. Acoust. Speech Signal Process. IEEE Trans. 28:5 (1980), 599–601.
    • (1980) Acoust. Speech Signal Process. IEEE Trans. , vol.28 , Issue.5 , pp. 599-601
    • Teager, H.1
  • 122
    • 0003236089
    • Evidence for nonlinear sound production mechanisms in the vocal tract
    • Springer
    • Teager, H., Teager, S., Evidence for nonlinear sound production mechanisms in the vocal tract. Speech Production and Speech Modelling, 1990, Springer, 241–261.
    • (1990) Speech Production and Speech Modelling , pp. 241-261
    • Teager, H.1    Teager, S.2
  • 123
    • 85008529793
    • Estimation of glottal closing and opening instants in voiced speech using the YAGA algorithm
    • Thomas, M.R., Gudnason, J., Naylor, P.A., Estimation of glottal closing and opening instants in voiced speech using the YAGA algorithm. Audio Speech Lang. Process. IEEE Trans. 20:1 (2012), 82–91.
    • (2012) Audio Speech Lang. Process. IEEE Trans. , vol.20 , Issue.1 , pp. 82-91
    • Thomas, M.R.1    Gudnason, J.2    Naylor, P.A.3
  • 125
    • 0034443356
    • Automatic speaker identification by means of mel cepstrum, wavelets and wavelets packets
    • Paper No. TU–E201–02
    • Torres, H.M., Rufiner, H.L., Automatic speaker identification by means of mel cepstrum, wavelets and wavelets packets. Proceedings of the Chicago 2000 World Congress IEEE EMBS, 2000 Paper No. TU–E201–02.
    • (2000) Proceedings of the Chicago 2000 World Congress IEEE EMBS
    • Torres, H.M.1    Rufiner, H.L.2
  • 128
    • 33746410556
    • Emotional speech recognition: resources, features, and methods
    • Ververidis, D., Kotropoulos, C., Emotional speech recognition: resources, features, and methods. Speech Commun. 48:9 (2006), 1162–1181, 10.1016/j.specom.2006.04.003.
    • (2006) Speech Commun. , vol.48 , Issue.9 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 129
    • 84979622932
    • Multi-objective optimisation of wavelet features for phoneme recognition
    • Vignolo, L.D., Rufiner, H.L., Milone, D.H., Multi-objective optimisation of wavelet features for phoneme recognition. IET Signal Proc. 10:6 (2016), 685–691, 10.1049/iet-spr.2015.0568.
    • (2016) IET Signal Proc. , vol.10 , Issue.6 , pp. 685-691
    • Vignolo, L.D.1    Rufiner, H.L.2    Milone, D.H.3
  • 131
    • 84872842796
    • Genetic wavelet packets for speech recognition
    • Vignolo, L.D., Milone, D.H., Rufiner, H.L., Genetic wavelet packets for speech recognition. Expert Syst. Appl. 40:6 (2013), 2350–2359, 10.1016/j.eswa.2012.10.050.
    • (2013) Expert Syst. Appl. , vol.40 , Issue.6 , pp. 2350-2359
    • Vignolo, L.D.1    Milone, D.H.2    Rufiner, H.L.3
  • 134
    • 79953280364
    • Evolutionary splines for Cepstral filterbank optimization in phoneme classification
    • Vignolo, L.D., Rufiner, H.L., Milone, D.H., Goddard, J.C., Evolutionary splines for Cepstral filterbank optimization in phoneme classification. EURASIP J. Adv. Signal Process. 2011 (2011), 8:1–8:14.
    • (2011) EURASIP J. Adv. Signal Process. , vol.2011 , pp. 8:1-8:14
    • Vignolo, L.D.1    Rufiner, H.L.2    Milone, D.H.3    Goddard, J.C.4
  • 135
    • 79959978998
    • Best basis-based wavelet packet entropy feature extraction and hierarchical EEG classification for epileptic detection
    • Wang, D., Miao, D., Xie, C., Best basis-based wavelet packet entropy feature extraction and hierarchical EEG classification for epileptic detection. Expert Syst. Appl. 38:11 (2011), 14314–14320, 10.1016/j.eswa.2011.05.096.
    • (2011) Expert Syst. Appl. , vol.38 , Issue.11 , pp. 14314-14320
    • Wang, D.1    Miao, D.2    Xie, C.3
  • 137
    • 79151483819
    • Speaker identification system using empirical mode decomposition and an artificial neural network
    • Wu, J.-D., Tsai, Y.-J., Speaker identification system using empirical mode decomposition and an artificial neural network. Expert Syst. Appl. 38:5 (2011), 6112–6117.
    • (2011) Expert Syst. Appl. , vol.38 , Issue.5 , pp. 6112-6117
    • Wu, J.-D.1    Tsai, Y.-J.2
  • 138
    • 17944381277
    • Improved MFCC-based feature for robust speaker identification
    • Wu, Z., Cao, Z., Improved MFCC-based feature for robust speaker identification. Tsinghua Sci. Technol. 10:2 (2005), 158–161.
    • (2005) Tsinghua Sci. Technol. , vol.10 , Issue.2 , pp. 158-161
    • Wu, Z.1    Cao, Z.2
  • 139
    • 2542525254
    • A study of the characteristics of white noise using the empirical mode decomposition method
    • Wu, Z., Huang, N.E., A study of the characteristics of white noise using the empirical mode decomposition method. Proc. R. Soc. London. Ser. A 460:2046 (2004), 1597–1611.
    • (2004) Proc. R. Soc. London. Ser. A , vol.460 , Issue.2046 , pp. 1597-1611
    • Wu, Z.1    Huang, N.E.2
  • 140
    • 80052078099
    • Ensemble empirical mode decomposition: a noise-assisted data analysis method
    • Wu, Z., Huang, N.E., Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv. Adapt. Data Anal. 1:01 (2009), 1–41.
    • (2009) Adv. Adapt. Data Anal. , vol.1 , Issue.1 , pp. 1-41
    • Wu, Z.1    Huang, N.E.2
  • 141
    • 28444477853
    • A novel pitch period detection algorithm based on Hilbert-Huang transform
    • Springer
    • Yang, Z., Huang, D., Yang, L., A novel pitch period detection algorithm based on Hilbert-Huang transform. Advances in Biometric Person Authentication, 2005, Springer, 586–593.
    • (2005) Advances in Biometric Person Authentication , pp. 586-593
    • Yang, Z.1    Huang, D.2    Yang, L.3
  • 144
    • 22544440896
    • Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
    • Yegnanarayana, B., Prasanna, S., Zachariah, J.M., Gupta, C.S., Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. Speech Audio Process. IEEE Trans. 13:4 (2005), 575–582.
    • (2005) Speech Audio Process. IEEE Trans. , vol.13 , Issue.4 , pp. 575-582
    • Yegnanarayana, B.1    Prasanna, S.2    Zachariah, J.M.3    Gupta, C.S.4
  • 145
    • 79956369785
    • Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method
    • Yeh, J.-R., Shieh, J.-S., Huang, N.E., Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method. Adv. Adapt. Data Anal. 2:02 (2010), 135–156.
    • (2010) Adv. Adapt. Data Anal. , vol.2 , Issue.2 , pp. 135-156
    • Yeh, J.-R.1    Shieh, J.-S.2    Huang, N.E.3
  • 146
    • 79952092015
    • Optimized discriminative transformations for speech features based on minimum classification error
    • Zamani, B., Akbari, A., Nasersharif, B., Jalalvand, A., Optimized discriminative transformations for speech features based on minimum classification error. Pattern Recognit. Lett. 32:7 (2011), 948–955, 10.1016/j.patrec.2011.01.017.
    • (2011) Pattern Recognit. Lett. , vol.32 , Issue.7 , pp. 948-955
    • Zamani, B.1    Akbari, A.2    Nasersharif, B.3    Jalalvand, A.4
  • 147
    • 0035506942
    • Comparison of different implementations of MFCC
    • Zheng, F., Zhang, G., Song, Z., Comparison of different implementations of MFCC. J. Comput. Sci. Technol. 16:6 (2001), 582–589, 10.1007/BF02943243.
    • (2001) J. Comput. Sci. Technol. , vol.16 , Issue.6 , pp. 582-589
    • Zheng, F.1    Zhang, G.2    Song, Z.3
  • 148
    • 84897134590
    • A novel speech emotion recognition method via incomplete sparse least square regression
    • 1–1
    • Zheng, W., Xin, M., Wang, X., Wang, B., A novel speech emotion recognition method via incomplete sparse least square regression. Signal Process. Lett. IEEE, PP(99), 2014, 10.1109/LSP.2014.2308954 1–1.
    • (2014) Signal Process. Lett. IEEE , vol.PP , Issue.99
    • Zheng, W.1    Xin, M.2    Wang, X.3    Wang, B.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.