메뉴 건너뛰기




Volumn 53, Issue 10, 2006, Pages 1943-1953

Dimensionality reduction of a pathological voice quality assessment system based on gaussian mixture models and short-term cepstral parameters

Author keywords

Cepstral parameters; F Ratio; Fisher's discriminant ratio; Gaussian mixture models; Short term analysis; Voice disorders

Indexed keywords

CEPSTRAL PARAMETERS; F-RATIO; FISHER'S DISCRIMINANT RATIO; GAUSSIAN MIXTURE MODELS; GAUSSIAN MIXTURES; SHORT-TERM ANALYSIS; VOICE DISORDERS;

EID: 33749525148     PISSN: 00189294     EISSN: None     Source Type: Journal    
DOI: 10.1109/TBME.2006.871883     Document Type: Article
Times cited : (289)

References (44)
  • 1
    • 0031193932 scopus 로고    scopus 로고
    • Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases
    • Jul./Aug.
    • B. Boyanov and S. Hadjitodorov, "Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases," IEEE Eng. Med. Biol. Mag., vol. 16, no. 4, pp. 74-82, Jul./Aug. 1997.
    • (1997) IEEE Eng. Med. Biol. Mag. , vol.16 , Issue.4 , pp. 74-82
    • Boyanov, B.1    Hadjitodorov, S.2
  • 2
    • 0022966946 scopus 로고
    • Normalized noise energy as an acoustic measure to evaluate pathologic voice
    • Nov.
    • H. Kasuya, S. Ogawa, K. Mashima, and S. Ebihara, "Normalized noise energy as an acoustic measure to evaluate pathologic voice," J. Acoust. Soc. Am., vol. 80, no. 5, pp. 1329-1334, Nov. 1986.
    • (1986) J. Acoust. Soc. Am. , vol.80 , Issue.5 , pp. 1329-1334
    • Kasuya, H.1    Ogawa, S.2    Mashima, K.3    Ebihara, S.4
  • 3
    • 33544464436 scopus 로고
    • An acoustic analysis of pathological voice and its application to the evaluation of laryngeal pathology
    • H. Kasuya, S. Ogawa, Y. Kikuchi, and S. Ebihara, "An acoustic analysis of pathological voice and its application to the evaluation of laryngeal pathology," Speech Commun., vol. 5, pp. 171-181, 1986.
    • (1986) Speech Commun. , vol.5 , pp. 171-181
    • Kasuya, H.1    Ogawa, S.2    Kikuchi, Y.3    Ebihara, S.4
  • 4
    • 0034332006 scopus 로고    scopus 로고
    • Adaptive noise energy estimation in pathological speech signals
    • Nov.
    • C. Manfredi, "Adaptive noise energy estimation in pathological speech signals," IEEE Trans. Biomed, Eng., vol. 47, no. 11, pp. 1538-1543, Nov. 2000.
    • (2000) IEEE Trans. Biomed, Eng. , vol.47 , Issue.11 , pp. 1538-1543
    • Manfredi, C.1
  • 5
    • 0030793150 scopus 로고    scopus 로고
    • Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals
    • Y. Qi and R. E. Hillman, "Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals," J. Acoust. Soc. Am., vol. 102, no. 1, pp. 537-543, 1997.
    • (1997) J. Acoust. Soc. Am. , vol.102 , Issue.1 , pp. 537-543
    • Qi, Y.1    Hillman, R.E.2
  • 6
    • 0020319209 scopus 로고
    • Harmonics-to-noise ratio as an index of the degree of hoarseness
    • E. Yumoto, W. Gould, and T. Baer, "Harmonics-to-noise ratio as an index of the degree of hoarseness," J. Acoust. Soc. Am., vol. 71, no. 6, pp. 1544-1550, 1982.
    • (1982) J. Acoust. Soc. Am. , vol.71 , Issue.6 , pp. 1544-1550
    • Yumoto, E.1    Gould, W.2    Baer, T.3
  • 7
    • 0031187694 scopus 로고    scopus 로고
    • Glottal-to-noise excitation ratio - A new measure for describing pathological voices
    • D. Michaelis, T. Gramss, and H. W. Strube, "Glottal-to-noise excitation ratio - A new measure for describing pathological voices," Acustica/Acta Acustica, vol. 83, pp. 700-706, 1997.
    • (1997) Acustica/Acta Acustica , vol.83 , pp. 700-706
    • Michaelis, D.1    Gramss, T.2    Strube, H.W.3
  • 8
    • 0027285715 scopus 로고
    • A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals
    • Apr.
    • G. de Krom, "A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals," J. Speech, Hearing Res., vol. 36, no. 2, pp. 254-266, Apr. 1993.
    • (1993) J. Speech, Hearing Res. , vol.36 , Issue.2 , pp. 254-266
    • De Krom, G.1
  • 9
    • 0025346441 scopus 로고
    • Short-term stability measures for the evaluation of vocal quality
    • Jun.
    • S. Feijoo and C. Hernández, "Short-term stability measures for the evaluation of vocal quality," J. Speech, Hearing Res., vol. 33, pp. 324-334, Jun. 1990.
    • (1990) J. Speech, Hearing Res. , vol.33 , pp. 324-334
    • Feijoo, S.1    Hernández, C.2
  • 10
    • 0026708151 scopus 로고
    • Vocal tremor analysis with the vocal demodulator
    • W. Winholtz, "Vocal tremor analysis with the vocal demodulator," J. Speech, Hearing Res., no. 35, pp. 562-563, 1992.
    • (1992) J. Speech, Hearing Res. , Issue.35 , pp. 562-563
    • Winholtz, W.1
  • 12
    • 0033624779 scopus 로고    scopus 로고
    • Laryngeal pathology detection by means of class-specific neural maps
    • Mar.
    • S. Hadjitodorov, B. Boyanov, and B. Teston, "Laryngeal pathology detection by means of class-specific neural maps," IEEE Trans. Inform. Technol. Biomed., vol. 4, pp. 68-73, Mar. 2000.
    • (2000) IEEE Trans. Inform. Technol. Biomed. , vol.4 , pp. 68-73
    • Hadjitodorov, S.1    Boyanov, B.2    Teston, B.3
  • 13
    • 0021274794 scopus 로고
    • Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness
    • Mar.
    • E. Yumoto, Y. Sasaki, and H. Okamura, "Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness," J. Speech, Hearing Res., vol. 27, no. 1, pp. 2-6, Mar. 1984.
    • (1984) J. Speech, Hearing Res. , vol.27 , Issue.1 , pp. 2-6
    • Yumoto, E.1    Sasaki, Y.2    Okamura, H.3
  • 14
    • 0034168475 scopus 로고    scopus 로고
    • Identification of pathological voices using glottal noise measures
    • Apr.
    • V. Parsa and D. G. Jamieson, "Identification of pathological voices using glottal noise measures," J. Speech, Language, Hearing Res., vol. 43, no. 2, pp. 469-485, Apr. 2000.
    • (2000) J. Speech, Language, Hearing Res. , vol.43 , Issue.2 , pp. 469-485
    • Parsa, V.1    Jamieson, D.G.2
  • 16
    • 2142763511 scopus 로고    scopus 로고
    • Pitch estimation for noise retrieval in time and frequency domain
    • C. Manfredi, L. Pierazzi, and P. Bruscaglioni, "Pitch estimation for noise retrieval in time and frequency domain," Med. Biol. Eng. Comput., vol. 37, no. 2, I, pp. 532-533, 1999.
    • (1999) Med. Biol. Eng. Comput. , vol.37 , Issue.1-2 , pp. 532-533
    • Manfredi, C.1    Pierazzi, L.2    Bruscaglioni, P.3
  • 17
    • 0030053943 scopus 로고    scopus 로고
    • A noninvasive technique for detecting hypernasal speech using a nonlinear operator
    • D. Cairns, J. H. Hansen, and J. Riski, "A noninvasive technique for detecting hypernasal speech using a nonlinear operator," IEEE Trans. Biomed. Eng., vol. 43, pp. 33-45, 1996.
    • (1996) IEEE Trans. Biomed. Eng. , vol.43 , pp. 33-45
    • Cairns, D.1    Hansen, J.H.2    Riski, J.3
  • 18
    • 0030130858 scopus 로고    scopus 로고
    • Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection
    • Apr.
    • L. Gavidia-Ceballos and J. H. L. Hansen, "Direct speech feature estimation using an iterative EM algorithm for vocal fold pathology detection," IEEE Trans. Biomed, Eng., vol. 43, no. 4, pp. 373-383, Apr. 1996.
    • (1996) IEEE Trans. Biomed, Eng. , vol.43 , Issue.4 , pp. 373-383
    • Gavidia-Ceballos, L.1    Hansen, J.H.L.2
  • 19
    • 0032030556 scopus 로고    scopus 로고
    • A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
    • Mar.
    • J. H. L. Hansen, L. Gavidia-Ceballos, and J. F. Kaiser, "A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment," IEEE Trans. Biomed. Eng., vol. 45, no. 3, pp. 300-313, Mar. 1998.
    • (1998) IEEE Trans. Biomed. Eng. , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 20
    • 0036753078 scopus 로고    scopus 로고
    • Pathological voice quality assessment using artificial neural networks
    • T. Ritchings, M. McGillion, and C. Moore, "Pathological voice quality assessment using artificial neural networks," Med. Eng. Phys., vol. 24, no. 8, pp. 561-564, 2002.
    • (2002) Med. Eng. Phys. , vol.24 , Issue.8 , pp. 561-564
    • Ritchings, T.1    McGillion, M.2    Moore, C.3
  • 21
    • 0026546098 scopus 로고
    • Detection of laryngeal function using speech and electroglottographic data
    • Jan.
    • D. G. Childers and K. Sung-Bae, "Detection of laryngeal function using speech and electroglottographic data," IEEE Trans. Biomed. Eng., vol. 39, no. 1, pp. 19-25, Jan. 1992.
    • (1992) IEEE Trans. Biomed. Eng. , vol.39 , Issue.1 , pp. 19-25
    • Childers, D.G.1    Sung-Bae, K.2
  • 22
    • 0033982157 scopus 로고    scopus 로고
    • Adaptive estimation of residue signal for voice pathology diagnosis
    • Jan.
    • M. Oliveira Rosa, J. C. Pereira, and M. Grellet, "Adaptive estimation of residue signal for voice pathology diagnosis," IEEE Trans. Biomed. Eng., vol. 47, no. 1, pp. 96-104, Jan. 2000.
    • (2000) IEEE Trans. Biomed. Eng. , vol.47 , Issue.1 , pp. 96-104
    • Oliveira Rosa, M.1    Pereira, J.C.2    Grellet, M.3
  • 23
    • 9144269627 scopus 로고    scopus 로고
    • Automatic detection of voice impairments by means of short-term cepstral parameters and neural network-based detectors
    • Feb.
    • J. I. Godino-Llorente and P. Gómez-Vilda, "Automatic detection of voice impairments by means of short-term cepstral parameters and neural network-based detectors," IEEE Trans. Biomed. Eng., vol. 51, no. 2, pp. 380-384, Feb. 2004.
    • (2004) IEEE Trans. Biomed. Eng. , vol.51 , Issue.2 , pp. 380-384
    • Godino-Llorente, J.I.1    Gómez-Vilda, P.2
  • 24
    • 38249012552 scopus 로고
    • Dimensionality reduction of the enhanced feature set for the HMM-based speech recognizer
    • K. Paliwal, "Dimensionality reduction of the enhanced feature set for the HMM-based speech recognizer," Digital Signal Process., vol. 2, pp. 157-173, 1992.
    • (1992) Digital Signal Process. , vol.2 , pp. 157-173
    • Paliwal, K.1
  • 29
    • 0034229795 scopus 로고    scopus 로고
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • Jul.
    • S. E. Bou-Ghazale and J. H. L. Hansen, "A comparative study of traditional and newly proposed features for recognition of speech under stress," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 429-442, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.H.L.2
  • 31
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 33
    • 0026720941 scopus 로고
    • Inverse filtering
    • B. Fritzell, "Inverse filtering," J. Voice, vol. 6, no. 111, p. 114, 1992.
    • (1992) J. Voice , vol.6 , Issue.111 , pp. 114
    • Fritzell, B.1
  • 34
    • 0019555090 scopus 로고
    • Cepstral analysis technique for automatic speaker verification
    • APR.
    • S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Acoust., Speech, Signal Process. Mag., vol. 29, no. 2, pp. 254-272, APR. 1981.
    • (1981) IEEE Acoust., Speech, Signal Process. Mag. , vol.29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 35
    • 0030287048 scopus 로고    scopus 로고
    • The expectation-maximization algorithm
    • Nov.
    • T. K. Moon, "The expectation-maximization algorithm," IEEE Signal Process. Mag., vol. 13, no. 6, pp. 47-60, Nov. 1996.
    • (1996) IEEE Signal Process. Mag. , vol.13 , Issue.6 , pp. 47-60
    • Moon, T.K.1
  • 36
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan.
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 37
    • 0029355999 scopus 로고
    • Speaker identification using Gaussian mixture speaker mode
    • D. A. Reynolds, "Speaker identification using Gaussian mixture speaker mode," Speech Commun., vol. 17, pp. 91-108, 1995.
    • (1995) Speech Commun. , vol.17 , pp. 91-108
    • Reynolds, D.A.1
  • 41
    • 0020083498 scopus 로고
    • The meaning and use of the area under the receiver operating characteristic (ROC) curve
    • J. A. Hanley and B. McNeil, "The meaning and use of the area under the receiver operating characteristic (ROC) curve," Radiology, vol. 143, pp. 29-36, 1982.
    • (1982) Radiology , vol.143 , pp. 29-36
    • Hanley, J.A.1    McNeil, B.2
  • 42
    • 0020524559 scopus 로고
    • A method of comparing the areas under receiver operating characteristics curves derived from the same cases
    • Sep.
    • J. A. Hanley and B. J. McNeil, "A method of comparing the areas under receiver operating characteristics curves derived from the same cases," Radiology, vol. 148, no. 3, pp. 839-843, Sep. 1983.
    • (1983) Radiology , vol.148 , Issue.3 , pp. 839-843
    • Hanley, J.A.1    McNeil, B.J.2
  • 43
    • 85046873967 scopus 로고    scopus 로고
    • The DET curve in assessment of detection task performance
    • Rhodes, Crete
    • A. Martin, G. R. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eurospeech '97, Rhodes, Crete, 1997, vol. IV, pp. 1895-1898.
    • (1997) Proc. Eurospeech '97 , vol.4 , pp. 1895-1898
    • Martin, A.1    Doddington, G.R.2    Kamm, T.3    Ordowski, M.4    Przybocki, M.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.