메뉴 건너뛰기




Volumn 14, Issue 6, 2006, Pages 1935-1947

Single-ended speech quality measurement using machine learning methods

Author keywords

Mean opinion score (MOS); Objective quality measurement; Quality model; Single ended measurement; Speech communication; Speech distortions; Speech enhancement; Speech quality; Subjective quality

Indexed keywords

MEAN OPINION SCORE (MOS); OBJECTIVE QUALITY MEASUREMENT; QUALITY MODEL; SINGLE-ENDED MEASUREMENT; SPEECH DISTORTIONS; SPEECH QUALITY; SUBJECTIVE QUALITY;

EID: 34547533971     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.883253     Document Type: Article
Times cited : (92)

References (42)
  • 1
    • 64549152633 scopus 로고    scopus 로고
    • quot;Subjective performance assessment of telephone-band and wideband digital codecs, ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.
    • quot;Subjective performance assessment of telephone-band and wideband digital codecs," ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.
  • 2
    • 64549100891 scopus 로고    scopus 로고
    • quot;Mean opinion score (MOS) terminology, ITU, Geneva, Switzerland, 2003, ITU-T Rec. P.800.1.
    • quot;Mean opinion score (MOS) terminology," ITU, Geneva, Switzerland, 2003, ITU-T Rec. P.800.1.
  • 6
    • 0000008694 scopus 로고
    • An objective measure for predicting subjective quality of speech coders
    • Jun
    • S. Wang, A. Sekey, and A. Gersho, "An objective measure for predicting subjective quality of speech coders," IEEE J. Sel. Areas Commun., vol. 10, no. 5, pp. 819-829, Jun. 1992.
    • (1992) IEEE J. Sel. Areas Commun , vol.10 , Issue.5 , pp. 819-829
    • Wang, S.1    Sekey, A.2    Gersho, A.3
  • 7
    • 64549091222 scopus 로고    scopus 로고
    • quot;Objective quality measurement of telephone-band (300-3400 Hz) speech codecs, ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.861.
    • quot;Objective quality measurement of telephone-band (300-3400 Hz) speech codecs," ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.861.
  • 8
    • 0032636247 scopus 로고    scopus 로고
    • Objective estimation of perceived speech quality-Part I: Development of the measuring normalizing block technique
    • Jul
    • S. Voran, "Objective estimation of perceived speech quality-Part I: Development of the measuring normalizing block technique," IEEE Trans. Speech Audio Process., vol. 7, no. 4, pp. 371-382, Jul. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.4 , pp. 371-382
    • Voran, S.1
  • 9
    • 0032683490 scopus 로고    scopus 로고
    • Objective estimation of perceived speech quality-Part II: Evaluation of the measuring normalizing block technique
    • Jul
    • , "Objective estimation of perceived speech quality-Part II: Evaluation of the measuring normalizing block technique," IEEE Trans. Speech Audio Process., vol. 7, no. 4, pp. 383-390, Jul. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.4 , pp. 383-390
  • 10
    • 27844458923 scopus 로고    scopus 로고
    • Objective speech quality measurement using statistical data mining
    • Jun
    • W. Zha and W.-Y. Chan, "Objective speech quality measurement using statistical data mining," EURASIP J. Appl. Signal Process., vol. 2005, no. 9, pp. 1410-1424, Jun. 2005.
    • (2005) EURASIP J. Appl. Signal Process , vol.2005 , Issue.9 , pp. 1410-1424
    • Zha, W.1    Chan, W.-Y.2
  • 11
    • 64549085122 scopus 로고    scopus 로고
    • quot;Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs, ITU, Geneva, Switzerland, 2001, ITU-T Rec. P.862.
    • quot;Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," ITU, Geneva, Switzerland, 2001, ITU-T Rec. P.862.
  • 12
    • 0028736841 scopus 로고
    • Output-based objective speech quality
    • Jun
    • J. Liang and R. Kubichek, "Output-based objective speech quality," in Proc. IEEE Vehicular Technol. Conf., Jun. 1994, vol. 3, pp. 1719-1723.
    • (1994) Proc. IEEE Vehicular Technol. Conf , vol.3 , pp. 1719-1723
    • Liang, J.1    Kubichek, R.2
  • 13
    • 31344477352 scopus 로고    scopus 로고
    • Nonintrusive speech quality estimation using Gaussian mixture models
    • Feb
    • T. H. Falk and W.-Y. Chan, "Nonintrusive speech quality estimation using Gaussian mixture models," IEEE Signal Process. Lett., vol. 13, no. 2, pp. 108-111, Feb. 2006.
    • (2006) IEEE Signal Process. Lett , vol.13 , Issue.2 , pp. 108-111
    • Falk, T.H.1    Chan, W.-Y.2
  • 15
    • 27644596289 scopus 로고    scopus 로고
    • ANIQUE: An auditory model for single-ended speech quality estimation
    • Sep
    • D.-S. Kim, "ANIQUE: An auditory model for single-ended speech quality estimation," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 821-831, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 821-831
    • Kim, D.-S.1
  • 16
    • 64549086519 scopus 로고    scopus 로고
    • quot;Single-ended method for objective speech quality assessment in narrow-band telephony applications, ITU, Geneva, Switzerland, 2004, ITU-T P.563
    • quot;Single-ended method for objective speech quality assessment in narrow-band telephony applications," ITU, Geneva, Switzerland, 2004, ITU-T P.563.
  • 17
    • 64549135181 scopus 로고    scopus 로고
    • quot;Adaptive multi-rate (AMR) speech codec: Voice activity detector (VAD), release 6, 2004, 3GPP2 TS 26.094.
    • quot;Adaptive multi-rate (AMR) speech codec: Voice activity detector (VAD), release 6," 2004, 3GPP2 TS 26.094.
  • 18
    • 64549162163 scopus 로고    scopus 로고
    • W. B. Kleijn and K. K. Paliwal, Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995, ch. A Robust Algorithm for Pitch Tracking (RAPT), pp. 495-518.
    • W. B. Kleijn and K. K. Paliwal, Speech Coding and Synthesis. Amsterdam, The Netherlands: Elsevier, 1995, ch. A Robust Algorithm for Pitch Tracking (RAPT), pp. 495-518.
  • 19
    • 0025041264 scopus 로고
    • Perceptual linear prediction (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear prediction (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 22
    • 0016059486 scopus 로고
    • Digital coding of speech waveforms: PCM, DPCM, and DM quantizers
    • May
    • N. Jayant, "Digital coding of speech waveforms: PCM, DPCM, and DM quantizers," Proc. IEEE, vol. 62, no. 5, pp. 611-632, May 1974.
    • (1974) Proc. IEEE , vol.62 , Issue.5 , pp. 611-632
    • Jayant, N.1
  • 23
    • 64549152219 scopus 로고    scopus 로고
    • quot;Modulated noise reference unit-MNRU, ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.810.
    • quot;Modulated noise reference unit-MNRU," ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.810.
  • 24
    • 33947626644 scopus 로고    scopus 로고
    • Enhanced non-intrusive speech quality measurement using degradation models
    • May
    • T. H. Falk and W.-Y Chan, "Enhanced non-intrusive speech quality measurement using degradation models," in Proc. Int. Conf. Acoust., Speech, Signal Process., May 2006, vol. I, pp. 837-840.
    • (2006) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 837-840
    • Falk, T.H.1    Chan, W.-Y.2
  • 25
    • 64549140901 scopus 로고    scopus 로고
    • S. Voran, Observations on the t-reference condition for speech coder evaluation, 1992, CCITT SG-12, Document Number SQ.13.92.
    • S. Voran, "Observations on the t-reference condition for speech coder evaluation," 1992, CCITT SG-12, Document Number SQ.13.92.
  • 28
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • L. Breiman, "Random forests," Mach. Learn., vol. 45, no. 1, pp. 5-32, 2001.
    • (2001) Mach. Learn , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 29
    • 0002432565 scopus 로고
    • Multivariate adaptive regression splines
    • Mar
    • J. H. Friedman, "Multivariate adaptive regression splines," Ann. Stat., vol. 19, no. 1, pp. 1-141, Mar. 1991.
    • (1991) Ann. Stat , vol.19 , Issue.1 , pp. 1-141
    • Friedman, J.H.1
  • 30
    • 64549099151 scopus 로고    scopus 로고
    • Perception of temporal discontinuity impairments in coded speech-Proposal for objective estimators and some subjective test results
    • May
    • S. Voran, "Perception of temporal discontinuity impairments in coded speech-Proposal for objective estimators and some subjective test results," in Proc. Int. Conf. Measurement Speech Audio Quality Netw., May 2003.
    • (2003) Proc. Int. Conf. Measurement Speech Audio Quality Netw
    • Voran, S.1
  • 31
    • 64549114333 scopus 로고    scopus 로고
    • J. F. Canny, Finding edges and lines in images, MIT-Artificial Intelligence Laboratory, 1983, Tech Rep. 720.
    • J. F. Canny, "Finding edges and lines in images," MIT-Artificial Intelligence Laboratory, 1983, Tech Rep. 720.
  • 32
    • 0032716108 scopus 로고    scopus 로고
    • Continuous assessment of time-varying speech quality
    • Nov
    • M. Hansen and B. Kollmeier, "Continuous assessment of time-varying speech quality," J. Acoust. Soc. Amer., vol. 106, no. 5, pp. 2888-2899, Nov. 1999.
    • (1999) J. Acoust. Soc. Amer , vol.106 , Issue.5 , pp. 2888-2899
    • Hansen, M.1    Kollmeier, B.2
  • 34
    • 64549146847 scopus 로고    scopus 로고
    • quot;ITU-T coded-speech database, ITU, Geneva, Switzerland, 1998, ITU-T Rec. P.Suppl. 23.
    • quot;ITU-T coded-speech database," ITU, Geneva, Switzerland, 1998, ITU-T Rec. P.Suppl. 23.
  • 35
    • 85013603035 scopus 로고    scopus 로고
    • Performance of current perceptual objective speech quality measures
    • L. Thorpe and W. Yang, "Performance of current perceptual objective speech quality measures," in Proc. IEEE Speech Coding Workshop, 1999, pp. 144-146.
    • (1999) Proc. IEEE Speech Coding Workshop , pp. 144-146
    • Thorpe, L.1    Yang, W.2
  • 36
    • 64549093937 scopus 로고    scopus 로고
    • quot;Mapping function for transforming P.862 raw result scores to MOSLQO, ITU, Geneva, Switzerland, 2003, ITU-T Rec. P.862.1.
    • quot;Mapping function for transforming P.862 raw result scores to MOSLQO," ITU, Geneva, Switzerland, 2003, ITU-T Rec. P.862.1.
  • 37
    • 64549100027 scopus 로고    scopus 로고
    • New objective measures for characterisation of noise suppression algorithms
    • Sep
    • E. Paajanen, B. Ayad, and V. Mattila, "New objective measures for characterisation of noise suppression algorithms," in Proc. IEEE Speech Coding Workshop, Sep. 2000, pp. 23-25.
    • (2000) Proc. IEEE Speech Coding Workshop , pp. 23-25
    • Paajanen, E.1    Ayad, B.2    Mattila, V.3
  • 38
    • 33947667509 scopus 로고    scopus 로고
    • Subjective comparison of speech enhancement algorithms
    • May
    • Y. Hu and P. Loizou, "Subjective comparison of speech enhancement algorithms," in Proc. Int. Conf. Acoust., Speech, Signal Process., May 2006, vol. I, pp. 153-156.
    • (2006) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 153-156
    • Hu, Y.1    Loizou, P.2
  • 39
    • 64549092585 scopus 로고    scopus 로고
    • Subjective comparison and evaluation of speech enhancement algorithms
    • submitted for publication
    • , "Subjective comparison and evaluation of speech enhancement algorithms," in Speech Commun., 2006, submitted for publication.
    • (2006) Speech Commun
  • 40
    • 64549159539 scopus 로고    scopus 로고
    • quot;Subjective test methodology for evaluating speech communication systems that include noise suppression algorithms, ITU, Geneva, Switzerland, 2003, ITU-T Rec, P.835
    • quot;Subjective test methodology for evaluating speech communication systems that include noise suppression algorithms," ITU, Geneva, Switzerland, 2003, ITU-T Rec., P.835.
  • 41
    • 64549137759 scopus 로고    scopus 로고
    • quot;Application guide for objective quality measurement based on Recommendations P.862, P.862.1 and P.862.2, ITU, Geneva, Switzerland, 2005, ITU-T Rec. P.862.3.
    • quot;Application guide for objective quality measurement based on Recommendations P.862, P.862.1 and P.862.2," ITU, Geneva, Switzerland, 2005, ITU-T Rec. P.862.3.
  • 42
    • 44949128271 scopus 로고    scopus 로고
    • Evaluation of objective measures for speech enhancement
    • to be published
    • Y. Hu and P. Loizou, "Evaluation of objective measures for speech enhancement," in Proc. Int. Conf. Spoken Language Process., 2006, to be published.
    • (2006) Proc. Int. Conf. Spoken Language Process
    • Hu, Y.1    Loizou, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.