메뉴 건너뛰기




Volumn 51, Issue 12, 2009, Pages 1206-1223

Analysis and classification of speech signals by generalized fractal dimension features

Author keywords

Broad class phoneme classification; Feature extraction; Generalized fractal dimensions

Indexed keywords

BROAD CLASS PHONEME CLASSIFICATION; CLASSIFICATION OF SPEECH; FEATURE VECTORS; FRACTAL FEATURE; FRACTAL THEORY; GENERALIZED FRACTAL DIMENSIONS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; NON-LINEAR SIGNAL PROCESSING; PHASE SPACES; PHONEME CLASSIFICATION; QUALITATIVE ASPECTS; RAW MEASUREMENTS; SPECTRAL CONTENT; SPEECH SIGNALS; SPEECH SOUNDS; STATISTICAL PARAMETERS;

EID: 69849087531     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2009.06.005     Document Type: Article
Times cited : (53)

References (48)
  • 2
    • 0030676943 scopus 로고    scopus 로고
    • Adeyemi, O., Boudreaux-Bartels, F.G., 1997. Improved accuracy in the singularity spectrum of multifractal chaotic time series. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-97, Munich, Germany, pp. 2377-2380.
    • Adeyemi, O., Boudreaux-Bartels, F.G., 1997. Improved accuracy in the singularity spectrum of multifractal chaotic time series. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-97, Munich, Germany, pp. 2377-2380.
  • 4
    • 0001463465 scopus 로고
    • Statistical description of chaotic attractors: the dimension function
    • Badii R., and Politi A. Statistical description of chaotic attractors: the dimension function. J. Statist. Phys. 40 5-6 (1985) 725-750
    • (1985) J. Statist. Phys. , vol.40 , Issue.5-6 , pp. 725-750
    • Badii, R.1    Politi, A.2
  • 6
    • 18844405278 scopus 로고
    • On the multifractal nature of fully developed turbulence and chaotic systems
    • Benzi R., Paladin G., Parisi G., and Vulpiani A. On the multifractal nature of fully developed turbulence and chaotic systems. J. Phys. A 17 (1984) 3521-3531
    • (1984) J. Phys. A , vol.17 , pp. 3521-3531
    • Benzi, R.1    Paladin, G.2    Parisi, G.3    Vulpiani, A.4
  • 8
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.B., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust., Speech, Signal Process. 28 4 (1980) 357-366
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 9
    • 0031198059 scopus 로고    scopus 로고
    • Production models as a structural basis for automatic speech recognition
    • Deng L., Ramsay G., and Sun D. Production models as a structural basis for automatic speech recognition. Speech Commun. 22 (1997) 93-111
    • (1997) Speech Commun. , vol.22 , pp. 93-111
    • Deng, L.1    Ramsay, G.2    Sun, D.3
  • 11
    • 40749093037 scopus 로고
    • Measuring the strangeness of strange attractors
    • Grassberger P., and Procaccia I. Measuring the strangeness of strange attractors. Physica D 9 1-2 (1983) 189-208
    • (1983) Physica D , vol.9 , Issue.1-2 , pp. 189-208
    • Grassberger, P.1    Procaccia, I.2
  • 12
    • 0030720034 scopus 로고    scopus 로고
    • Characterization of attractors in speech signals
    • Greenwood G.W. Characterization of attractors in speech signals. BioSystems 44 (1997) 161-165
    • (1997) BioSystems , vol.44 , pp. 161-165
    • Greenwood, G.W.1
  • 13
    • 0001699778 scopus 로고
    • Fractal nature of turbulence as manifested in turbulent diffusion strange attractors
    • Hentschel H.G.E., and Procaccia I. Fractal nature of turbulence as manifested in turbulent diffusion strange attractors. Phys. Rev. A 27 (1983) 1266-1269
    • (1983) Phys. Rev. A , vol.27 , pp. 1266-1269
    • Hentschel, H.G.E.1    Procaccia, I.2
  • 14
    • 0346372923 scopus 로고
    • The infinite number of generalized dimensions of fractals and strange attractors
    • Hentschel H.G.E., and Procaccia I. The infinite number of generalized dimensions of fractals and strange attractors. Physica D 8 3 (1983) 435-444
    • (1983) Physica D , vol.8 , Issue.3 , pp. 435-444
    • Hentschel, H.G.E.1    Procaccia, I.2
  • 15
    • 33847442992 scopus 로고
    • Analysis of vocal disorders with methods from nonlinear dynamics
    • Herzel H., Berry D., Titze I., and Saleh M. Analysis of vocal disorders with methods from nonlinear dynamics. NCVS Status Progress Rep. 4 (1993) 177-193
    • (1993) NCVS Status Progress Rep. , vol.4 , pp. 177-193
    • Herzel, H.1    Berry, D.2    Titze, I.3    Saleh, M.4
  • 16
    • 85055273953 scopus 로고
    • Some fluid dynamic aspects of speech
    • Hirschberg A. Some fluid dynamic aspects of speech. Bull. Commun. Parlée 2 (1992) 7-30
    • (1992) Bull. Commun. Parlée , vol.2 , pp. 7-30
    • Hirschberg, A.1
  • 18
    • 0004088074 scopus 로고
    • Efficient algorithms for computing fractal dimensions
    • Mayer-Kress G. (Ed), Springer-Verlag, Berlin
    • Hunt F., and Sullivan F. Efficient algorithms for computing fractal dimensions. In: Mayer-Kress G. (Ed). Dimensions and Entropies in Chaotic Systems (1986), Springer-Verlag, Berlin
    • (1986) Dimensions and Entropies in Chaotic Systems
    • Hunt, F.1    Sullivan, F.2
  • 20
    • 0001059592 scopus 로고
    • Some observations on vocal tract operation from a fluid flow point of view
    • Titze, I.R, Scherer, R.C, Eds, Denver Center for Performing Arts, Denver, CO, pp
    • Kaiser, J.F., 1983. Some observations on vocal tract operation from a fluid flow point of view. In: Titze, I.R., Scherer, R.C. (Eds.), Vocal Fold Physiology: Biomechanics, Acoustics and Phonatory Control, Denver Center for Performing Arts, Denver, CO, pp. 358-386.
    • (1983) Vocal Fold Physiology: Biomechanics, Acoustics and Phonatory Control , pp. 358-386
    • Kaiser, J.F.1
  • 22
  • 23
    • 0029764970 scopus 로고    scopus 로고
    • Kubin, G., 1996. Synthesis and coding of continuous speech with the nonlinear oscillator model. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-96, 1, Atlanta, USA, p. 267.
    • Kubin, G., 1996. Synthesis and coding of continuous speech with the nonlinear oscillator model. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-96, Vol. 1, Atlanta, USA, p. 267.
  • 24
    • 0030037756 scopus 로고    scopus 로고
    • Nonlinear dynamical analysis of speech
    • Kumar A., and Mullick S.K. Nonlinear dynamical analysis of speech. J. Acoust. Soc. Am. 100 1 (1996) 615-629
    • (1996) J. Acoust. Soc. Am. , vol.100 , Issue.1 , pp. 615-629
    • Kumar, A.1    Mullick, S.K.2
  • 27
    • 0026392349 scopus 로고    scopus 로고
    • Maragos, P., 1991. Fractal aspects of speech signals: dimension and interpolation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-91, pp. 417-420.
    • Maragos, P., 1991. Fractal aspects of speech signals: dimension and interpolation. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-91, pp. 417-420.
  • 28
    • 0033006667 scopus 로고    scopus 로고
    • Fractal dimensions of speech sounds: computation and application to automatic speech recognition
    • Maragos P., and Potamianos A. Fractal dimensions of speech sounds: computation and application to automatic speech recognition. J. Acoust. Soc. Am. 105 3 (1999) 1925-1932
    • (1999) J. Acoust. Soc. Am. , vol.105 , Issue.3 , pp. 1925-1932
    • Maragos, P.1    Potamianos, A.2
  • 29
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • Maragos P., Kaiser J.F., and Quatieri T.F. Energy separation in signal modulations with application to speech analysis. IEEE Trans. Signal Process. 41 10 (1993) 3024-3051
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.10 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.F.2    Quatieri, T.F.3
  • 30
    • 0026123844 scopus 로고
    • The multifractal nature of turbulent energy dissipation
    • Meneveau C., and Sreenivasan K.R. The multifractal nature of turbulent energy dissipation. J. Fluid Mech. 224 (1991) 429-484
    • (1991) J. Fluid Mech. , vol.224 , pp. 429-484
    • Meneveau, C.1    Sreenivasan, K.R.2
  • 31
    • 0028919849 scopus 로고
    • A nonlinear dynamical systems analysis of fricative consonants
    • Narayanan S., and Alwan A. A nonlinear dynamical systems analysis of fricative consonants. J. Acoust. Soc. Am. 97 4 (1995) 2511-2524
    • (1995) J. Acoust. Soc. Am. , vol.97 , Issue.4 , pp. 2511-2524
    • Narayanan, S.1    Alwan, A.2
  • 33
    • 69849103259 scopus 로고    scopus 로고
    • Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition
    • Papandreou G., Katsamanis A., Pitsikalis V., and Maragos P. Adaptive multimodal fusion by uncertainty compensation with application to audiovisual speech recognition. IEEE Trans. Audio, Speech Language Process. 17 3 (2009) 423-435
    • (2009) IEEE Trans. Audio, Speech Language Process. , vol.17 , Issue.3 , pp. 423-435
    • Papandreou, G.1    Katsamanis, A.2    Pitsikalis, V.3    Maragos, P.4
  • 35
    • 0036289924 scopus 로고    scopus 로고
    • Pitsikalis, V., Maragos, P., 2002. Speech analysis and feature extraction using chaotic models. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-02, Orlando, USA, pp. 533-536.
    • Pitsikalis, V., Maragos, P., 2002. Speech analysis and feature extraction using chaotic models. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-02, Orlando, USA, pp. 533-536.
  • 36
    • 33750102668 scopus 로고    scopus 로고
    • Filtered dynamics and fractal dimensions for noisy speech recognition
    • Pitsikalis V., and Maragos P. Filtered dynamics and fractal dimensions for noisy speech recognition. IEEE Signal Proc. Lett. 13 11 (2006) 711-714
    • (2006) IEEE Signal Proc. Lett. , vol.13 , Issue.11 , pp. 711-714
    • Pitsikalis, V.1    Maragos, P.2
  • 38
    • 15044345504 scopus 로고    scopus 로고
    • Audio-visual automatic speech recognition: an overview
    • Bailly G., Vatikiotis-Bateson E., and Perrier P. (Eds), MIT Press (Chapter 10)
    • Potamianos G., Neti C., Luettin J., and Matthews I. Audio-visual automatic speech recognition: an overview. In: Bailly G., Vatikiotis-Bateson E., and Perrier P. (Eds). Issues in Visual and Audio-Visual Speech Processing (2004), MIT Press (Chapter 10)
    • (2004) Issues in Visual and Audio-Visual Speech Processing
    • Potamianos, G.1    Neti, C.2    Luettin, J.3    Matthews, I.4
  • 41
    • 0000779360 scopus 로고
    • Detecting strange attractors in turbulence
    • Takens F. Detecting strange attractors in turbulence. Dynam. Systems Turbulence 898 (1981) 366-381
    • (1981) Dynam. Systems Turbulence , vol.898 , pp. 366-381
    • Takens, F.1
  • 42
    • 69849090992 scopus 로고    scopus 로고
    • Teager, H.M., Teager, S.M., 1989. Evidence for nonlinear sound production mechanisms in the vocal tract. In: Hardcastle, W.J., Marchal (Eds.), Speech Production and Speech Modelling NATO ASI Series D, 55.
    • Teager, H.M., Teager, S.M., 1989. Evidence for nonlinear sound production mechanisms in the vocal tract. In: Hardcastle, W.J., Marchal (Eds.), Speech Production and Speech Modelling NATO ASI Series D, Vol. 55.
  • 43
    • 0003278399 scopus 로고
    • Infinite-dimensional dynamical systems in mechanics and physics
    • Springer-Verlag, New York
    • Temam R. Infinite-dimensional dynamical systems in mechanics and physics. Applied Mathematical Science Vol. 68 (1993), Springer-Verlag, New York
    • (1993) Applied Mathematical Science , vol.68
    • Temam, R.1
  • 44
    • 85011603315 scopus 로고
    • A finite element model of fluid flow in the vocal tract
    • Thomas T.J. A finite element model of fluid flow in the vocal tract. Comput. Speech Language 1 (1986) 131-151
    • (1986) Comput. Speech Language , vol.1 , pp. 131-151
    • Thomas, T.J.1
  • 45
    • 0035205717 scopus 로고    scopus 로고
    • Surogate analysis for detecting nonlinear dynamics in normal vowels
    • Tokuda I., Miyano T., and Aihara K. Surogate analysis for detecting nonlinear dynamics in normal vowels. J. Acoust. Soc. Am. 110 6 (2001) 3207-3217
    • (2001) J. Acoust. Soc. Am. , vol.110 , Issue.6 , pp. 3207-3217
    • Tokuda, I.1    Miyano, T.2    Aihara, K.3
  • 46
    • 0026385270 scopus 로고    scopus 로고
    • Townshend, B., 1991. Nonlinear prediction of speech signals. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-91, pp. 425-428.
    • Townshend, B., 1991. Nonlinear prediction of speech signals. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP-91, pp. 425-428.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.