메뉴 건너뛰기




Volumn 43, Issue 9, 2010, Pages 3100-3112

An improved method for voice pathology detection by means of a HMM-based feature space transformation

Author keywords

Dynamic feature space transformation; Hidden Markov models; Minimum classification error; Pathological voice

Indexed keywords

FEATURE EXTRACTION; HIDDEN MARKOV MODELS; MARKOV PROCESSES;

EID: 79960343078     PISSN: 00313203     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patcog.2010.03.019     Document Type: Article
Times cited : (70)

References (57)
  • 2
    • 0036477056 scopus 로고    scopus 로고
    • Face recognition using kernel principal component analysis
    • DOI 10.1109/97.991133, PII S1070990802034004
    • K. I. Kim, K. Juang, H. J. Kim, Face recognition using kernel principal component analysis, IEEE Signal Processing Letters 9 (2) (2002) 40-42. (Pubitemid 34490978)
    • (2002) IEEE Signal Processing Letters , vol.9 , Issue.2 , pp. 40-42
    • Kim, K.I.1    Jung, K.2    Kim, H.J.3
  • 3
    • 27544443176 scopus 로고    scopus 로고
    • Accounting for probe-level noise in principal component analysis of microarray data
    • DOI 10.1093/bioinformatics/bti617
    • G. Sanguinetti, M. Milo, M. Rattray, N. Lawrence, Accounting for prove-level noise in principal component analysis of microarray data, Bioinformatics 21 (19) (2005) 3748-3754. (Pubitemid 41535523)
    • (2005) Bioinformatics , vol.21 , Issue.19 , pp. 3748-3754
    • Sanguinetti, G.1    Milo, M.2    Rattray, M.3    Lawrence, N.D.4
  • 4
    • 0042198967 scopus 로고    scopus 로고
    • Feature extraction and dimensionality reduction algorithms and their applications in vowel recognition
    • DOI 10.1016/S0031-3203(03)00044-X
    • X. Wang, K. K. Paliwal, Feature extraction and dimensionality reduction algorithms and their applications in vowel recognition, Pattern Recognition 36 (10) (2003) 2429-2439. (Pubitemid 36947223)
    • (2003) Pattern Recognition , vol.36 , Issue.10 , pp. 2429-2439
    • Wang, X.1    Paliwal, K.K.2
  • 6
    • 0031193932 scopus 로고    scopus 로고
    • Acoustic analysis of pathological voices: A voice analysis system for the screening and laryngeal diseases
    • DOI 10.1109/51.603651
    • B. Boyanov, S. Hadjitodorov, Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases, IEEE Engineering in Medicine & Biology Magazine 16 (4) (1997) 74-82. (Pubitemid 27318793)
    • (1997) IEEE Engineering in Medicine and Biology Magazine , vol.16 , Issue.4 , pp. 74-82
    • Boyanov, B.1    Hadjitodorov, S.2
  • 7
    • 0027285715 scopus 로고
    • A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals
    • G. de Krom, A cepstrum-based technique for determining a harmonics-tonoise ratio in speech signals, Journal of Speech and Hearing Research 36 (2) (1993) 254-266. (Pubitemid 23123559)
    • (1993) Journal of Speech and Hearing Research , vol.36 , Issue.2 , pp. 254-266
    • De Krom, G.1
  • 8
    • 0025346441 scopus 로고
    • Short-term stability measures for the evaluation of vocal quality
    • S. Feijoo, C. Hernández-Espinosa, Short-term stability measures for the evaluation of vocal quality, Journal of Speech and Hearing Research 33 (1990) 324-334. (Pubitemid 20200326)
    • (1990) Journal of Speech and Hearing Research , vol.33 , Issue.2 , pp. 324-334
    • Feijoo, S.1    Hernandez, C.2
  • 9
    • 0022966946 scopus 로고
    • Normalized noise energy as an acoustic measure to evaluate pathologic voice
    • DOI 10.1121/1.394384
    • H. Kasuya, S. Ogawa, K. Mashima, S. Ebihara, Normalized noise energy as an acoustic measure to evaluate pathologic voice, Journal of the Acoustical Society of America 80 (5) (1986) 1329-1334. (Pubitemid 17183478)
    • (1986) Journal of the Acoustical Society of America , vol.80 , Issue.5 , pp. 1329-1334
    • Kasuya, H.1    Ogawa, S.2    Mashima, K.3    Ebihara, S.4
  • 10
    • 0031187694 scopus 로고    scopus 로고
    • Glottal-to-noise excitation ratio - A new measure for describing pathological voices
    • D. Michaelis, T. Gramss, H. W. Strube, Glottal-to-noise excitation ratio-a new measure for describing pathological voices, Acustica/Acta Acustica 83 (1997) 700-706.
    • (1997) Acustica/Acta Acustica , vol.83 , pp. 700-706
    • Michaelis, D.1    Gramss, T.2    Strube, H.W.3
  • 11
    • 0034332006 scopus 로고    scopus 로고
    • Adaptive noise energy estimation in pathological speech signals
    • C. Manfredi, Adaptive noise energy estimation in pathological speech signals, IEEE Transactions on Biomedical Engineering 47 (11) (2000) 1538-1543.
    • (2000) IEEE Transactions on Biomedical Engineering , vol.47 , Issue.11 , pp. 1538-1543
    • Manfredi, C.1
  • 12
    • 0030793150 scopus 로고    scopus 로고
    • Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals
    • DOI 10.1121/1.419726
    • Y. Qi, R. E. Hillman, Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals, Journal of the Acoustical Society of America 102 (1) (1997) 537-543. (Pubitemid 27300533)
    • (1997) Journal of the Acoustical Society of America , vol.102 , Issue.1 , pp. 537-543
    • Qi, Y.1    Hillman, R.E.2
  • 13
    • 0020319209 scopus 로고
    • Harmonics-to-noise ratio as an index of the degree of hoarseness
    • DOI 10.1121/1.387808
    • E. Yumoto, W. J. Gould, T. Baer, Harmonics-to-noise ratio as an index of the degree of hoarseness, Journal of the Acoustical Society of America 71 (6) (1982) 1544-1550. (Pubitemid 12019689)
    • (1982) Journal of the Acoustical Society of America , vol.71 , Issue.6 , pp. 1544-1550
    • Yumoto, E.1    Gould, W.J.2    Baer, T.3
  • 14
    • 0023310877 scopus 로고
    • The measurement of the signal-to-noise ratio (SNR) in continuous speech
    • F. Klingholtz, F. Martin, The measurement of the signal-to-noise ratio (SNR) in continuous speech, Speech Communication 6 (1) (1987) 15-26.
    • (1987) Speech Communication , vol.6 , Issue.1 , pp. 15-26
    • Klingholtz, F.1    Martin, F.2
  • 15
    • 85078509029 scopus 로고
    • Acoustic model and evaluation of pathological voice production
    • Berlin, Germany
    • D. Deliyski, Acoustic model and evaluation of pathological voice production, in: Proceedings of Eurospeech'93, vol. 3, Berlin, Germany, 1993, pp. 1969-1972.
    • (1993) Proceedings of Eurospeech'93 , vol.3 , pp. 1969-1972
    • Deliyski, D.1
  • 17
    • 0021274794 scopus 로고
    • Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness
    • E. Yumoto, Y. Sasaki, H. Okamura, Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness, Journal of Speech and Hearing Research 27 (1) (1984) 2-6. (Pubitemid 14135533)
    • (1984) Journal of Speech and Hearing Research , vol.27 , Issue.1 , pp. 2-6
    • Yumoto, E.1    Sasaki, Y.2    Okamura, H.3
  • 18
  • 19
    • 0032030556 scopus 로고    scopus 로고
    • A nonlinear operator-based speech feature analysis method with application to vocal fold pathology assessment
    • DOI 10.1109/10.661155
    • L. Gavidia-Ceballos, J. H. Hansen, J. Kaiser, A nonlinear operator based speech feature analysis methods with application to vocal fold pathological assessment, IEEE Transactions on Biomedical Engineering 45 (3) (1998) 300-313. (Pubitemid 28128655)
    • (1998) IEEE Transactions on Biomedical Engineering , vol.45 , Issue.3 , pp. 300-313
    • Hansen, J.H.L.1    Gavidia-Ceballos, L.2    Kaiser, J.F.3
  • 20
    • 0036077574 scopus 로고    scopus 로고
    • A computer system for acoustic analysis of pathological voices and laryngeal diseases screening
    • DOI 10.1016/S1350-4533(02)00031-0, PII S1350453302000310
    • S. Hadjitodorov, P. Mitev, A computer system for acoustic analysis of pathological voices and laryngeal disease screening, Medical Engineering & Physics 24 (6) (2002) 419-429. (Pubitemid 34808729)
    • (2002) Medical Engineering and Physics , vol.24 , Issue.6 , pp. 419-429
    • Hadjitodorov, S.1    Mitev, P.2
  • 21
    • 33745855023 scopus 로고
    • Massachusetts Eye and Ear Infirmary, Version. 1.03 CD-ROM, Kay Elemetrics Corporation, Lincoln Park, NJ
    • Massachusetts Eye and Ear Infirmary, Voice Disorders Database, Version. 1.03 [CD-ROM], Kay Elemetrics Corporation, Lincoln Park, NJ, 1994.
    • (1994) Voice Disorders Database
  • 22
    • 33750455161 scopus 로고    scopus 로고
    • Methodological issues in the development of automatic systems for voice pathology detection
    • DOI 10.1016/j.bspc.2006.06.003, PII S1746809406000267
    • N. Sáenz-Lechón, J. I. Godino-Llorente, V. Osma-Ruiz, P. Gómez-Vilda, Methodological issues in the development of automatic systems for voice pathology detection, Biomedical Signal Processing and Control 1 (2) (2006) 120-128. (Pubitemid 44644978)
    • (2006) Biomedical Signal Processing and Control , vol.1 , Issue.2 , pp. 120-128
    • Saenz-Lechon, N.1    Godino-Llorente, J.I.2    Osma-Ruiz, V.3    Gomez-Vilda, P.4
  • 23
    • 33344459401 scopus 로고    scopus 로고
    • Chaos in voice, from modeling to measurement
    • DOI 10.1016/j.jvoice.2005.01.001, PII S0892199705000044
    • J. J. Jiang, Y. Zhang, C. McGilligan, Chaos in voice, from modeling to measurement, Journal of Voice 20 (1) (2006) 2-17. (Pubitemid 43289083)
    • (2006) Journal of Voice , vol.20 , Issue.1 , pp. 2-17
    • Jiang, J.J.1    Zhang, Y.2    McGilligan, C.3
  • 24
    • 0035555882 scopus 로고    scopus 로고
    • Automatic detection of pathologies in the voice by HOS based parameters
    • DOI 10.1155/S1110865701000336
    • J. B. Alonso, J. de Leon, I. Alonso, M. A. Ferrer, Automatic detection of pathologies in the voice by HOS based parameters, EURASIP Journal on Advanced Signal Processing 2001 (4) (2001) 275-284. (Pubitemid 34784330)
    • (2001) Eurasip Journal on Applied Signal Processing , vol.2001 , Issue.4 , pp. 275-284
    • Alonso, J.B.1    De Leon, J.2    Alonso, I.3    Ferrer, M.A.4
  • 25
    • 14844284692 scopus 로고    scopus 로고
    • Discrimination of pathological voices using a time-frequency approach
    • DOI 10.1109/TBME.2004.842962
    • K. Umapathy, S. Krishnan, V. Parsa, D. G. Jamieson, Discrimination of pathological voices using a time-frequency approach, IEEE Transactions on Biomedical Engineering 52 (3) (2005) 421-430. (Pubitemid 40343739)
    • (2005) IEEE Transactions on Biomedical Engineering , vol.52 , Issue.3 , pp. 421-430
    • Umapathy, K.1    Krishnan, S.2    Parsa, V.3    Jamieson, D.G.4
  • 26
    • 33846238213 scopus 로고    scopus 로고
    • Study of harmonics-to-noise ratio and critical-band energy spectrum of speech as acoustic indicators of laryngeal and voice pathology
    • ID 85286
    • K. Shama, A. Krishna, N. U. Cholayya, Study of harmonics-to-noise ratio and critical-band energy spectrum of speech as acoustic indicators of laryngeal and voice pathology, EURASIP Journal on Advances in Signal Processing 2007 (2007) 9 ID 85286.
    • (2007) EURASIP Journal on Advances in Signal Processing , vol.2007 , pp. 9
    • Shama, K.1    Krishna, A.2    Cholayya, N.U.3
  • 27
    • 0036753078 scopus 로고    scopus 로고
    • Pathological voice quality assessment using artificial neural networks
    • DOI 10.1016/S1350-4533(02)00064-4, PII S1350453302000644
    • R. T. Ritchings, M. A. McGillion, C. J. Moore, Pathological voice quality assessment using artificial neural networks, Medical Engineering & Physics 24 (8) (2002) 561-564. (Pubitemid 35266697)
    • (2002) Medical Engineering and Physics , vol.24 , Issue.7-8 , pp. 561-564
    • Ritchings, R.T.1    McGillion, M.2    Moore, C.J.3
  • 28
    • 39449138479 scopus 로고    scopus 로고
    • Artificial Neural Network-based Classification to Screen for Dysphonia Using Psychoacoustic Scaling of Acoustic Voice Features
    • DOI 10.1016/j.jvoice.2006.09.003, PII S0892199706001238
    • R. Linder, A. E. Albers, M. Hess, S. J. Pöppl, R. Schönweiler, Artificial neural network-based classification to screen for dysphonia using psychoacoustic scaling of acoustic voice features, Journal of Voice 22 (2) (2008) 155-163. (Pubitemid 351273981)
    • (2008) Journal of Voice , vol.22 , Issue.2 , pp. 155-163
    • Linder, R.1    Albers, A.E.2    Hess, M.3    Poppl, S.J.4    Schonweiler, R.5
  • 29
    • 67651124866 scopus 로고    scopus 로고
    • Automatic detection of laryngeal pathologies in records of sustained vowels by means of mel-frequency cepstral coefficients parameters and differentiation of patients by sex
    • R. Fraile, N. Sáenz-Lechón, J. I. Godino-Llorente, V. Osma-Ruiz, C. Fredouille, Automatic detection of laryngeal pathologies in records of sustained vowels by means of mel-frequency cepstral coefficients parameters and differentiation of patients by sex, Folia Phoniatrica et Logopaedica 61 (3) (2009) 146-152.
    • (2009) Folia Phoniatrica et Logopaedica , vol.61 , Issue.3 , pp. 146-152
    • Fraile, R.1    Sáenz-Lechón, N.2    Godino-Llorente, J.I.3    Osma-Ruiz, V.4    Fredouille, C.5
  • 30
    • 33749525148 scopus 로고    scopus 로고
    • Dimensionality reduction of a pathological voice quality assessment system based on gaussian mixture models and short-term cepstral parameters
    • DOI 10.1109/TBME.2006.871883
    • J. I. Godino-Llorente, P. Gómez-Vilda, M. Blanco-Velasco, Dimensionality reduction of a pathological voice quality assessment system based on Gaussian mixture models and short-term cepstral parameters, IEEE Transactions on Biomedical Engineering 53 (10) (2006) 1943-1953. (Pubitemid 44526241)
    • (2006) IEEE Transactions on Biomedical Engineering , vol.53 , Issue.10 , pp. 1943-1953
    • Godino-Llorente, J.I.1    Gomez-Vilda, P.2    Blanco-Velasco, M.3
  • 38
    • 0035046834 scopus 로고    scopus 로고
    • Neural network based input selection and diagnosis of pathologic voices
    • C. Hernández, M. Fernández, P. Gómez, Neural network based input selection and diagnosis of pathologic voices, Journal on Neural Networks 1 (1) (2001) 49-63.
    • (2001) Journal on Neural Networks , vol.1 , Issue.1 , pp. 49-63
    • Hernández, C.1    Fernández, M.2    Gómez, P.3
  • 41
    • 2642647093 scopus 로고    scopus 로고
    • Selection and combination of acoustic features for description of pathologic voices
    • DOI 10.1121/1.421305
    • D. Michaelis, M. Fröhlich, H. W. Strube, Selection and combination of acoustic features for the description of pathologic voices, Journal of the Acoustical Society of America 103 (1998) 1628-1639. (Pubitemid 28115174)
    • (1998) Journal of the Acoustical Society of America , vol.103 , Issue.3 , pp. 1628-1639
    • Michaelis, D.1    Frohlich, M.2    Strube, H.W.3
  • 44
    • 68949128774 scopus 로고    scopus 로고
    • On the use of the correlation between acoustic descriptors for the normal/pathological voices discrimination
    • doi:10.1155/2009/173967
    • T. Dubuisson, T. Dutoit, B. Gosselin, M. Remacle, On the use of the correlation between acoustic descriptors for the normal/pathological voices discrimination, EURASIP Journal on Advances in Signal Processing, doi:10.1155/2009/173967.
    • EURASIP Journal on Advances in Signal Processing
    • Dubuisson, T.1    Dutoit, T.2    Gosselin, B.3    Remacle, M.4
  • 47
    • 0031146514 scopus 로고    scopus 로고
    • HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features
    • PII S1063667697031842
    • R. Chengalvarayan, L. Deng, HMM-based speech recognition using statedependent, discriminatively derived transforms on mel-warped DFT features, IEEE Transactions on Speech and Audio Processing 5 (3) (1997) 243-256. (Pubitemid 127745997)
    • (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.3 , pp. 243-256
    • Chengalvarayan, R.1    Deng, L.2
  • 48
  • 53
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, IEEE Proceedings 77 (2) (1989) 257-286.
    • (1989) IEEE Proceedings , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 56
    • 0020083498 scopus 로고
    • The meaning and use of the area under a receiver operating characteristic (ROC) curve
    • J. A. Hanley, B. J. McNeil, The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology 143 (1) (1982) 29-36. (Pubitemid 12142173)
    • (1982) Radiology , vol.143 , Issue.1 , pp. 29-36
    • Hanley, J.A.1    McNeil, B.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.