메뉴 건너뛰기




Volumn 14, Issue 6, 2006, Pages 2014-2023

Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech

Author keywords

Computational auditory scene analysis (CASA); Grouping; Monaural speech separation; Objective quality assessment of speech (OQAS); Segmentation

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA); GROUPING; MONAURAL SPEECH SEPARATION; OBJECTIVE QUALITY ASSESSMENT OF SPEECH (OQAS); SEGMENTATION;

EID: 40949108726     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.883258     Document Type: Article
Times cited : (54)

References (31)
  • 1
    • 80052339383 scopus 로고
    • Some experiments in the recognition of speech with one and two ears
    • C. Cherry, "Some experiments in the recognition of speech with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-981, 1953.
    • (1953) J. Acoust. Soc. Amer , vol.25 , pp. 975-981
    • Cherry, C.1
  • 2
    • 0036649241 scopus 로고    scopus 로고
    • Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
    • Jul
    • A. K. Barros, T. Rutkowski, F. Itakura, and N. Ohnishi, "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 888-893, Jul. 2002.
    • (2002) IEEE Trans. Neural Netw , vol.13 , Issue.4 , pp. 888-893
    • Barros, A.K.1    Rutkowski, T.2    Itakura, F.3    Ohnishi, N.4
  • 3
    • 0030193445 scopus 로고    scopus 로고
    • Two decades of array signal processing research: The parametric approach
    • Jul
    • H. Krim and M. Viberg, "Two decades of array signal processing research: The parametric approach," IEEE Signal Process. Mag., vol. 13, no. 4, pp. 67-94, Jul. 1996.
    • (1996) IEEE Signal Process. Mag , vol.13 , Issue.4 , pp. 67-94
    • Krim, H.1    Viberg, M.2
  • 5
    • 0028531926 scopus 로고
    • Computational auditory scene analysis
    • G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
    • (1994) Comput. Speech Lang , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 7
    • 0003794341 scopus 로고    scopus 로고
    • Prediction-driven computational auditory scene analysis,
    • Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci, Mass. Inst. Technol, Cambridge
    • D. P. W. Ellis, "Prediction-driven computational auditory scene analysis," Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci., Mass. Inst. Technol., Cambridge, 1996.
    • (1996)
    • Ellis, D.P.W.1
  • 9
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • May
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural Netw , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 10
    • 0003982501 scopus 로고
    • A theory and computational model of auditory monaural sound separation,
    • Ph.D. dissertation, Dept. Elect. Eng, Stanford Univ, Stanford, CA
    • M. Weintraub, "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA, 1985.
    • (1985)
    • Weintraub, M.1
  • 11
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Sep
    • G. N. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
    • (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.N.1    Wang, D.L.2
  • 12
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
    • (2003) J. Acoust. Soc. Amer , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 13
    • 0032670621 scopus 로고    scopus 로고
    • A blackboard architecture for computational auditory scene analysis
    • D. Godsmark and G. J. Brown, "A blackboard architecture for computational auditory scene analysis," Speech Commun., vol. 27, pp. 351-366, 1999.
    • (1999) Speech Commun , vol.27 , pp. 351-366
    • Godsmark, D.1    Brown, G.J.2
  • 14
    • 64549131872 scopus 로고    scopus 로고
    • quot;Subjective performance assessment of telephone-band and wideband digital codecs, ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.
    • quot;Subjective performance assessment of telephone-band and wideband digital codecs," ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.
  • 16
    • 0029750932 scopus 로고    scopus 로고
    • Vector quantization techniques for outputbased objective speech quality
    • May
    • C. Jin and R. Kubichek, "Vector quantization techniques for outputbased objective speech quality," in Proc. Int. Conf. Acoust., Speech, Signal Process., May 1996, vol. 1, pp. 491-494.
    • (1996) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 491-494
    • Jin, C.1    Kubichek, R.2
  • 17
    • 27644596289 scopus 로고    scopus 로고
    • ANIQUE: An auditory model for single-ended speech quality estimation
    • Sep
    • D. S. Kim, "ANIQUE: An auditory model for single-ended speech quality estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 13, no. 5, pp. 821-831, Sep. 2005.
    • (2005) IEEE Trans. Audio, Speech, Lang. Process , vol.13 , Issue.5 , pp. 821-831
    • Kim, D.S.1
  • 18
    • 64549163428 scopus 로고    scopus 로고
    • quot;Single-ended method for objective speech quality assessment in narrow-band telephony applications, ITU, Geneva, Switzerland, 2004, ITU-T P.563
    • quot;Single-ended method for objective speech quality assessment in narrow-band telephony applications," ITU, Geneva, Switzerland, 2004, ITU-T P.563.
  • 19
    • 64549133123 scopus 로고    scopus 로고
    • NiQA-product description Psytechnics Limited, Online, Available
    • NiQA-product description Psytechnics Limited, 2003 [Online]. Available: http://www.psytechnics.com/pages/products/niqa.php
    • (2003)
  • 20
    • 64549155841 scopus 로고    scopus 로고
    • NiNA-SwissQual's Non-intrusive algorithm for estimating the subjective quality of live speech Swiss Qual Inc, Online, Available
    • NiNA-SwissQual's Non-intrusive algorithm for estimating the subjective quality of live speech Swiss Qual Inc., 2001 [Online]. Available: http://www.swissqual.com/HTML/ninapage.htm
    • (2001)
  • 22
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary mask as the computational goal of auditory scene analysis
    • P. Divenyi, Ed. Norwell, MA: Kluwer
    • D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, pp. 181-197.
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 23
    • 0037750051 scopus 로고    scopus 로고
    • Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming,
    • Ph.D. dissertation, Dept. Elect. Comput. Eng, Northwestern Univ, Evanston, IL
    • L. A. Drake, "Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming," Ph.D. dissertation, Dept. Elect. Comput. Eng., Northwestern Univ, Evanston, IL, 2001.
    • (2001)
    • Drake, L.A.1
  • 25
    • 64549153752 scopus 로고    scopus 로고
    • Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs ITU, Geneva, Switzerland, 2001, ITU-T P.862.
    • Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs ITU, Geneva, Switzerland, 2001, ITU-T P.862.
  • 26
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Feb
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Feb. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 27
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • Jul
    • Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.5 , pp. 504-512
    • Martin1
  • 28
    • 0032702589 scopus 로고    scopus 로고
    • Temporal coding of periodicity pitch in the auditory system: An overview
    • P. Cariani, "Temporal coding of periodicity pitch in the auditory system: An overview," Neural Plasticity, vol. 6, pp. 147-172, 1999.
    • (1999) Neural Plasticity , vol.6 , pp. 147-172
    • Cariani, P.1
  • 29
    • 0030846123 scopus 로고    scopus 로고
    • A unitary model of pitch perception
    • R. Meddis and L. O'Mard, "A unitary model of pitch perception," J. Acoust. Soc. Amer., vol. 102, pp. 1811-1820, 1997.
    • (1997) J. Acoust. Soc. Amer , vol.102 , pp. 1811-1820
    • Meddis, R.1    O'Mard, L.2
  • 30
    • 0002296637 scopus 로고
    • On the importance of time-A temporal representation of sound
    • M. P. Cooke, S. Beet, and M. Crawford, Eds. New York:Wiley
    • M. Slaney and R. F. Lyon, "On the importance of time-A temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, Eds. New York:Wiley, 1993, pp. 95-116.
    • (1993) Visual Representations of Speech Signals , pp. 95-116
    • Slaney, M.1    Lyon, R.F.2
  • 31
    • 33646786460 scopus 로고    scopus 로고
    • Separation of fricatives and affricates
    • G. Hu and D. L.Wang, "Separation of fricatives and affricates," in Proc. ICASSP, 2005, vol. 1, pp. 1101-1104.
    • (2005) Proc. ICASSP , vol.1 , pp. 1101-1104
    • Hu, G.1    Wang, D.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.