-
1
-
-
80052339383
-
Some experiments in the recognition of speech with one and two ears
-
C. Cherry, "Some experiments in the recognition of speech with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-981, 1953.
-
(1953)
J. Acoust. Soc. Amer
, vol.25
, pp. 975-981
-
-
Cherry, C.1
-
2
-
-
0036649241
-
Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
-
Jul
-
A. K. Barros, T. Rutkowski, F. Itakura, and N. Ohnishi, "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 888-893, Jul. 2002.
-
(2002)
IEEE Trans. Neural Netw
, vol.13
, Issue.4
, pp. 888-893
-
-
Barros, A.K.1
Rutkowski, T.2
Itakura, F.3
Ohnishi, N.4
-
3
-
-
0030193445
-
Two decades of array signal processing research: The parametric approach
-
Jul
-
H. Krim and M. Viberg, "Two decades of array signal processing research: The parametric approach," IEEE Signal Process. Mag., vol. 13, no. 4, pp. 67-94, Jul. 1996.
-
(1996)
IEEE Signal Process. Mag
, vol.13
, Issue.4
, pp. 67-94
-
-
Krim, H.1
Viberg, M.2
-
5
-
-
0028531926
-
Computational auditory scene analysis
-
G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
-
(1994)
Comput. Speech Lang
, vol.8
, pp. 297-336
-
-
Brown, G.J.1
Cooke, M.P.2
-
7
-
-
0003794341
-
Prediction-driven computational auditory scene analysis
-
Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci, Mass. Inst. Technol, Cambridge
-
D. P. W. Ellis, "Prediction-driven computational auditory scene analysis," Ph.D. dissertation, Dept. Elect. Eng. Comput. Sci., Mass. Inst. Technol., Cambridge, 1996.
-
(1996)
-
-
Ellis, D.P.W.1
-
9
-
-
0032682770
-
Separation of speech from interfering sounds based on oscillatory correlation
-
May
-
D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
-
(1999)
IEEE Trans. Neural Netw
, vol.10
, Issue.3
, pp. 684-697
-
-
Wang, D.L.1
Brown, G.J.2
-
10
-
-
0003982501
-
A theory and computational model of auditory monaural sound separation
-
Ph.D. dissertation, Dept. Elect. Eng, Stanford Univ, Stanford, CA
-
M. Weintraub, "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Dept. Elect. Eng., Stanford Univ., Stanford, CA, 1985.
-
(1985)
-
-
Weintraub, M.1
-
11
-
-
4644265990
-
Monaural speech segregation based on pitch tracking and amplitude modulation
-
Sep
-
G. N. Hu and D. L. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw., vol. 15, no. 5, pp. 1135-1150, Sep. 2004.
-
(2004)
IEEE Trans. Neural Netw
, vol.15
, Issue.5
, pp. 1135-1150
-
-
Hu, G.N.1
Wang, D.L.2
-
12
-
-
0142026377
-
Speech segregation based on sound localization
-
N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
-
(2003)
J. Acoust. Soc. Amer
, vol.114
, pp. 2236-2252
-
-
Roman, N.1
Wang, D.L.2
Brown, G.J.3
-
13
-
-
0032670621
-
A blackboard architecture for computational auditory scene analysis
-
D. Godsmark and G. J. Brown, "A blackboard architecture for computational auditory scene analysis," Speech Commun., vol. 27, pp. 351-366, 1999.
-
(1999)
Speech Commun
, vol.27
, pp. 351-366
-
-
Godsmark, D.1
Brown, G.J.2
-
14
-
-
64549131872
-
-
Subjective performance assessment of telephone-band and wideband digital codecs, ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.
-
"Subjective performance assessment of telephone-band and wideband digital codecs," ITU, Geneva, Switzerland, 1996, ITU-T Rec. P.830.
-
-
-
-
15
-
-
0034428801
-
Nonintrusive speech-quality assessment using vocal-tract models
-
Dec
-
P. Gray, M. P. Hollier, and R. E. Massara, "Nonintrusive speech-quality assessment using vocal-tract models," Proc. Inst. Elect. Eng.-Vision, Image Signal Process., vol. 147, no. 6, pp. 493-501, Dec. 2000.
-
(2000)
Proc. Inst. Elect. Eng.-Vision, Image Signal Process
, vol.147
, Issue.6
, pp. 493-501
-
-
Gray, P.1
Hollier, M.P.2
Massara, R.E.3
-
16
-
-
0029750932
-
Vector quantization techniques for output-based objective speech quality
-
May
-
C. Jin and R. Kubichek, "Vector quantization techniques for output-based objective speech quality," in Proc. Int. Conf. Acoust., Speech, Signal Process., May 1996, vol. 1, pp. 491-494.
-
(1996)
Proc. Int. Conf. Acoust., Speech, Signal Process
, vol.1
, pp. 491-494
-
-
Jin, C.1
Kubichek, R.2
-
17
-
-
27644596289
-
ANIQUE: An auditory model for single-ended speech quality estimation
-
Sep
-
D. S. Kim, "ANIQUE: An auditory model for single-ended speech quality estimation," IEEE Trans. Audio, Speech, Lang. Process., vol. 13, no. 5, pp. 821-831, Sep. 2005.
-
(2005)
IEEE Trans. Audio, Speech, Lang. Process
, vol.13
, Issue.5
, pp. 821-831
-
-
Kim, D.S.1
-
18
-
-
64549163428
-
-
Single-ended method for objective speech quality assessment in narrow-band telephony applications, ITU, Geneva, Switzerland, 2004, ITU-T P.563
-
"Single-ended method for objective speech quality assessment in narrow-band telephony applications," ITU, Geneva, Switzerland, 2004, ITU-T P.563.
-
-
-
-
19
-
-
64549133123
-
-
NiQA-product description Psytechnics Limited, Online, Available
-
NiQA-product description Psytechnics Limited, 2003 [Online]. Available: http://www.psytechnics.com/pages/products/niqa.php
-
(2003)
-
-
-
20
-
-
64549155841
-
-
NiNA-SwissQual's Non-intrusive algorithm for estimating the subjective quality of live speech Swiss Qual Inc, Online, Available
-
NiNA-SwissQual's Non-intrusive algorithm for estimating the subjective quality of live speech Swiss Qual Inc., 2001 [Online]. Available: http://www.swissqual.com/HTML/ninapage.htm
-
(2001)
-
-
-
22
-
-
84892233308
-
On ideal binary mask as the computational goal of auditory scene analysis
-
P. Divenyi, Ed. Norwell, MA: Kluwer
-
D. L. Wang, "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2005, pp. 181-197.
-
(2005)
Speech Separation by Humans and Machines
, pp. 181-197
-
-
Wang, D.L.1
-
23
-
-
0037750051
-
Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming
-
Ph.D. dissertation, Dept. Elect. Comput. Eng, Northwestern Univ, Evanston, IL
-
L. A. Drake, "Sound source separation via computational auditory scene analysis (CASA)-enhanced beamforming," Ph.D. dissertation, Dept. Elect. Comput. Eng., Northwestern Univ., Evanston, IL, 2001.
-
(2001)
-
-
Drake, L.A.1
-
25
-
-
64549153752
-
-
Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs ITU, Geneva, Switzerland, 2001, ITU-T P.862.
-
"Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," ITU, Geneva, Switzerland, 2001, ITU-T P.862.
-
-
-
-
26
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
Feb
-
S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Feb. 1979.
-
(1979)
IEEE Trans. Acoust., Speech, Signal Process
, vol.27
, Issue.2
, pp. 113-120
-
-
Boll, S.F.1
-
27
-
-
0035396555
-
Noise power spectral density estimation based on optimal smoothing and minimum statistics
-
Jul
-
R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, Issue.5
, pp. 504-512
-
-
Martin, R.1
-
28
-
-
0032702589
-
Temporal coding of periodicity pitch in the auditory system: An overview
-
P. Cariani, "Temporal coding of periodicity pitch in the auditory system: An overview," Neural Plasticity, vol. 6, pp. 147-172, 1999.
-
(1999)
Neural Plasticity
, vol.6
, pp. 147-172
-
-
Cariani, P.1
-
29
-
-
0030846123
-
A unitary model of pitch perception
-
R. Meddis and L. O'Mard, "A unitary model of pitch perception," J. Acoust. Soc. Amer., vol. 102, pp. 1811-1820, 1997.
-
(1997)
J. Acoust. Soc. Amer
, vol.102
, pp. 1811-1820
-
-
Meddis, R.1
O'Mard, L.2
-
30
-
-
0002296637
-
On the importance of time-A temporal representation of sound
-
M. P. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley
-
M. Slaney and R. F. Lyon, "On the importance of time-A temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993, pp. 95-116.
-
(1993)
Visual Representations of Speech Signals
, pp. 95-116
-
-
Slaney, M.1
Lyon, R.F.2
-
31
-
-
33646786460
-
Separation of fricatives and affricates
-
G. Hu and D. L. Wang, "Separation of fricatives and affricates," in Proc. ICASSP, 2005, vol. 1, pp. 1101-1104.
-
(2005)
Proc. ICASSP
, vol.1
, pp. 1101-1104
-
-
Hu, G.1
Wang, D.L.2