SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 14, Issue 6, 2006, Pages 2014-2023

Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech

(4) Li, Peng; Guan, Yong; Xu, Bo; Liu, Wenju

Author keywords

Computational auditory scene analysis (CASA); Grouping; Monaural speech separation; Objective quality assessment of speech (OQAS); Segmentation

Indexed keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA); GROUPING; MONAURAL SPEECH SEPARATION; OBJECTIVE QUALITY ASSESSMENT OF SPEECH (OQAS); SEGMENTATION;

ACOUSTIC INTENSITY; PATIENT REHABILITATION; SEPARATION; SIGNAL PROCESSING; SIGNAL TO NOISE RATIO; SPEECH ANALYSIS;

QUALITY CONTROL;

EID: 40949108726 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.883258 Document Type: Article

Times cited: 54

References (31)

1
- 80052339383
- Some experiments in the recognition of speech with one and two ears
- C. Cherry, "Some experiments in the recognition of speech with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-981, 1953.
- (1953) J. Acoust. Soc. Amer , vol.25 , pp. 975-981
- Cherry, C.¹

2
- 0036649241
- Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
- Jul
- A. K. Barros, T. Rutkowski, F. Itakura, and N. Ohnishi, "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 888-893, Jul. 2002.
- (2002) IEEE Trans. Neural Netw , vol.13 , Issue.4 , pp. 888-893
- Barros, A.K.¹ Rutkowski, T.² Itakura, F.³ Ohnishi, N.⁴

3
- 0030193445
- Two decades of array signal processing research: The parametric approach
- Jul
- H. Krim and M. Viberg, "Two decades of array signal processing research: The parametric approach," IEEE Signal Process. Mag., vol. 13, no. 4, pp. 67-94, Jul. 1996.
- (1996) IEEE Signal Process. Mag , vol.13 , Issue.4 , pp. 67-94
- Krim, H.¹ Viberg, M.²

4
- 0003684441
- Cambridge, MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis
- Bregman, A.S.¹

5
- 0028531926
- Computational auditory scene analysis
- G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
- (1994) Comput. Speech Lang , vol.8 , pp. 297-336
- Brown, G.J.¹ Cooke, M.P.²

6
- 0003479143
- Cambridge, U.K.: Cambridge Univ. Press
- M. P. Cooke, Modeling Auditory Processing and Organization. Cambridge, U.K.: Cambridge Univ. Press, 1993.
- (1993) Modeling Auditory Processing and Organization
- Cooke, M.P.¹

7
- 0003794341
- Prediction-driven computational auditory scene analysis
- Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci., Mass. Inst. Technol., Cambridge
- D. P. W. Ellis, "Prediction-driven computational auditory scene analysis," Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci., Mass. Inst. Technol., Cambridge, 1996.
- (1996)
- Ellis, D.P.W.¹

8
- 0003444613
- Mahwah, NJ: Lawrence Erlbaum
- D. F. Rosenthal and H. G. Okuno, Computational Auditory Scene Analysis. Mahwah, NJ: Lawrence Erlbaum, 1998.
- (1998) Computational Auditory Scene Analysis
- Rosenthal, D.F.¹ Okuno, H.G.²

9
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- May
- D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
- (1999) IEEE Trans. Neural Netw , vol.10 , Issue.3 , pp. 684-697
- Wang, D.L.¹ Brown, G.J.²

10
- 0003982501
- A theory and computational model of auditory monaural sound separation
- Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA
- M. Weintraub, "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA, 1985.
- (1985)
- Weintraub, M.¹

11
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Sep
- G. N. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
- (2004) IEEE Trans. Neural Netw , vol.15 , Issue.5 , pp. 1135-1150
- Hu, G.N.¹ Wang, D.L.²

12
- 0142026377
- Speech segregation based on sound localization
- N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
- (2003) J. Acoust. Soc. Amer , vol.114 , pp. 2236-2252
- Roman, N.¹ Wang, D.L.² Brown, G.J.³

13
- 0032670621
- A blackboard architecture for computational auditory scene analysis
- D. Godsmark and G. J. Brown, "A blackboard architecture for computational auditory scene analysis," Speech Commun., vol. 27, pp. 351-366, 1999.
- (1999) Speech Commun , vol.27 , pp. 351-366
- Godsmark, D.¹ Brown, G.J.²

14
- 64549131872
- Subjective performance assessment of telephone-band and wideband digital codecs
- "Subjective performance assessment of telephone-band and wideband digital codecs," ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.

15
- 0034428801
- Nonintrusive speech-quality assessment using vocal-tract models
- Dec
- P. Gray, M. P. Hollier, and R. E. Massara, "Nonintrusive speech-quality assessment using vocal-tract models," Proc. Inst. Elect. Eng.-Vision, Image Signal Process., vol. 147, no. 6, pp. 493-501, Dec. 2000.
- (2000) Proc. Inst. Elect. Eng.-Vision, Image Signal Process , vol.147 , Issue.6 , pp. 493-501
- Gray, P.¹ Hollier, M.P.² Massara, R.E.³

16
- 0029750932
- Vector quantization techniques for output-based objective speech quality
- May
- C. Jin and R. Kubichek, "Vector quantization techniques for output-based objective speech quality," in Proc. Int. Conf. Acoust., Speech, Signal Process., May 1996, vol. 1, pp. 491-494.
- (1996) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 491-494
- Jin, C.¹ Kubichek, R.²

17
- 27644596289
- ANIQUE: An auditory model for single-ended speech quality estimation
- Sep
- D. S. Kim, "ANIQUE: An auditory model for single-ended speech quality estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 13, no. 5, pp. 821-831, Sep. 2005.
- (2005) IEEE Trans. Audio, Speech, Lang. Process , vol.13 , Issue.5 , pp. 821-831
- Kim, D.S.¹

18
- 64549163428
- Single-ended method for objective speech quality assessment in narrow-band telephony applications
- "Single-ended method for objective speech quality assessment in narrow-band telephony applications," ITU, Geneva, Switzerland, 2004, ITU-T P.563.

19
- 64549133123
- NiQA-product description, Psytechnics Limited
- NiQA-product description Psytechnics Limited, 2003 [Online]. Available: http://www.psytechnics.com/pages/products/niqa.php
- (2003)

20
- 64549155841
- NiNA-SwissQual's Non-intrusive algorithm for estimating the subjective quality of live speech, Swiss Qual Inc.
- NiNA-SwissQual's Non-intrusive algorithm for estimating the subjective quality of live speech Swiss Qual Inc., 2001 [Online]. Available: http://www.swissqual.com/HTML/ninapage.htm
- (2001)

21
- 0003789815
- 4th ed. San Diego, CA: Academic
- B. C. J. Moore, An Introduction to the Psychology of Hearing, 4th ed. San Diego, CA: Academic, 1997.
- (1997) An Introduction to the Psychology of Hearing
- Moore, B.C.J.¹

22
- 84892233308
- On ideal binary mask as the computational goal of auditory scene analysis
- P. Divenyi, Ed. Norwell, MA: Kluwer
- D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, pp. 181-197.
- (2005) Speech Separation by Humans and Machines , pp. 181-197
- Wang, D.L.¹

23
- 0037750051
- Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming
- Ph.D. dissertation, Dept. Elect. Comput. Eng., Northwestern Univ., Evanston, IL
- L. A. Drake, "Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming," Ph.D. dissertation, Dept. Elect. Comput. Eng., Northwestern Univ., Evanston, IL, 2001.
- (2001)
- Drake, L.A.¹

24
- 0003444613
- Mahwah, NJ: Lawrence Erlbaum
- D. F. Rosenthal and H. G. Okuno, Computational Auditory Scene Analysis. Mahwah, NJ: Lawrence Erlbaum, 1998.
- (1998) Computational Auditory Scene Analysis
- Rosenthal, D.F.¹ Okuno, H.G.²

25
- 64549153752
- Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs
- "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," ITU, Geneva, Switzerland, 2001, ITU-T P.862.

26
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Feb
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Feb. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

27
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

28
- 0032702589
- Temporal coding of periodicity pitch in the auditory system: An overview
- P. Cariani, "Temporal coding of periodicity pitch in the auditory system: An overview," Neural Plasticity, vol. 6, pp. 147-172, 1999.
- (1999) Neural Plasticity , vol.6 , pp. 147-172
- Cariani, P.¹

29
- 0030846123
- A unitary model of pitch perception
- R. Meddis and L. O'Mard, "A unitary model of pitch perception," J. Acoust. Soc. Amer., vol. 102, pp. 1811-1820, 1997.
- (1997) J. Acoust. Soc. Amer , vol.102 , pp. 1811-1820
- Meddis, R.¹ O'Mard, L.²

30
- 0002296637
- On the importance of time-A temporal representation of sound
- M. P. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley
- M. Slaney and R. F. Lyon, "On the importance of time-A temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993, pp. 95-116.
- (1993) Visual Representations of Speech Signals , pp. 95-116
- Slaney, M.¹ Lyon, R.F.²

31
- 33646786460
- Separation of fricatives and affricates
- G. Hu and D. L. Wang, "Separation of fricatives and affricates," in Proc. ICASSP, 2005, vol. 1, pp. 1101-1104.
- (2005) Proc. ICASSP , vol.1 , pp. 1101-1104
- Hu, G.¹ Wang, D.L.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.