-
1
-
-
0018455820
-
Image method for efficiently simulating small-room acoustics
-
Allen, J. B., and Berkley, D. A. (1979). "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Am. 65, 943-950.
-
(1979)
J. Acoust. Soc. Am.
, vol.65
, pp. 943-950
-
-
Allen, J.B.1
Berkley, D.A.2
-
2
-
-
0036649241
-
Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets
-
Barros, A. K., Rutkowski, T., Itakura, F., and Ohnishi, N. (2002). "Estimation of speech embedded in a reverberant and noisy environment by independent component analysis and wavelets," IEEE Trans. Neural Netw. 13, 888-893.
-
(2002)
IEEE Trans. Neural Netw.
, vol.13
, pp. 888-893
-
-
Barros, A.K.1
Rutkowski, T.2
Itakura, F.3
Ohnishi, N.4
-
3
-
-
0038028639
-
AR processes and sources can be reconstructed from degenerate mixtures
-
Balan, R., Jourjine, A., and Rosca, J. (1999). "AR processes and sources can be reconstructed from degenerate mixtures," Proc. 1st Int. Workshop on Independent Component Analysis and Signal Separation, pp. 467-472.
-
(1999)
Proc. 1st Int. Workshop on Independent Component Analysis and Signal Separation
, pp. 467-472
-
-
Balan, R.1
Jourjine, A.2
Rosca, J.3
-
6
-
-
0039334758
-
The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions
-
Bronkhorst, A. (2000). "The cocktail party phenomenon: A review of research on speech intelligibility in multiple-talker conditions," Acustica 86, 117-128.
-
(2000)
Acustica
, vol.86
, pp. 117-128
-
-
Bronkhorst, A.1
-
7
-
-
0028531926
-
Computational auditory scene analysis
-
Brown, G. J., and Cooke, M. (1994). "Computational auditory scene analysis," Comput. Speech Lang. 8, 297-336.
-
(1994)
Comput. Speech Lang.
, vol.8
, pp. 297-336
-
-
Brown, G.J.1
Cooke, M.2
-
8
-
-
33644639591
-
Separation of speech by computational auditory scene analysis
-
J. Benesty, S. Makino, and J. Chen, eds. (Springer, New York)
-
Brown, G. J., and Wang, D. L. (2005). "Separation of speech by computational auditory scene analysis," in Speech Enhancement, J. Benesty, S. Makino, and J. Chen, eds. (Springer, New York), pp. 371-402.
-
(2005)
Speech Enhancement
, pp. 371-402
-
-
Brown, G.J.1
Wang, D.L.2
-
9
-
-
33745741350
-
-
unpublished
-
Brungart, D., Chang, P., Simpson, B., and Wang, D. L. (2006). "Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask," (unpublished).
-
(2006)
Isolating the Energetic Component of Speech-on-speech Masking with An Ideal Binary Time-frequency Mask
-
-
Brungart, D.1
Chang, P.2
Simpson, B.3
Wang, D.L.4
-
10
-
-
0016522748
-
Anthropometric manikin for acoustic research
-
Burkhard, M. D., and Sachs, R. M. (1975). "Anthropometric manikin for acoustic research," J. Acoust. Soc. Am. 58, 214-222.
-
(1975)
J. Acoust. Soc. Am.
, vol.58
, pp. 214-222
-
-
Burkhard, M.D.1
Sachs, R.M.2
-
12
-
-
0035342414
-
Robust automatic speech recognition with missing and unreliable acoustic data
-
Cooke, M. P., Green, P., Josifovski, L., and Vizinho, A. (2001). "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun. 34, 267-285.
-
(2001)
Speech Commun.
, vol.34
, pp. 267-285
-
-
Cooke, M.P.1
Green, P.2
Josifovski, L.3
Vizinho, A.4
-
13
-
-
0242440783
-
Effects of reverberation on perceptual segregation of competing voices
-
Culling, J. F., Hodder, K. I., and Toh, C. Y. (2003). "Effects of reverberation on perceptual segregation of competing voices," J. Acoust. Soc. Am. 114, 2871-2876.
-
(2003)
J. Acoust. Soc. Am.
, vol.114
, pp. 2871-2876
-
-
Culling, J.F.1
Hodder, K.I.2
Toh, C.Y.3
-
14
-
-
0001698589
-
Auditory grouping
-
B. C. J. Moore, ed. (Academic, London)
-
Darwin, C. J., and Carlyon, R. P. (1995). "Auditory grouping," in The Handbook of Perception and Cognition, vol. 6, B. C. J. Moore, ed. (Academic, London), pp. 387-424.
-
(1995)
The Handbook of Perception and Cognition
, vol.6
, pp. 387-424
-
-
Darwin, C.J.1
Carlyon, R.P.2
-
15
-
-
0033939839
-
Effects of reverberation on spatial, prosodic, and vocal-tract size cues to selective attention
-
Darwin, C. J., and Hukin, R. W. (2000). "Effects of reverberation on spatial, prosodic, and vocal-tract size cues to selective attention," J. Acoust. Soc. Am. 108, 335-342.
-
(2000)
J. Acoust. Soc. Am.
, vol.108
, pp. 335-342
-
-
Darwin, C.J.1
Hukin, R.W.2
-
16
-
-
0029345417
-
A signal subspace approach for speech enhancement
-
Ephraim, Y., and Trees, H. L. (1995). "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Process. 3, 251-266.
-
(1995)
IEEE Trans. Speech Audio Process.
, vol.3
, pp. 251-266
-
-
Ephraim, Y.1
Trees, H.L.2
-
17
-
-
0030674105
-
Two-channel blind deconvolution for non-minimum phase impulse responses
-
Furuya, K., and Kaneda, Y. (1997). "Two-channel blind deconvolution for non-minimum phase impulse responses," Proc. ICASSP, pp. 1315-1318.
-
(1997)
Proc. ICASSP
, pp. 1315-1318
-
-
Furuya, K.1
Kaneda, Y.2
-
19
-
-
0036289676
-
Acoustic diversity for improved speech recognition in reverberant environments
-
Gillespie, B. W., and Atlas, L. E. (2002). "Acoustic diversity for improved speech recognition in reverberant environments," Proc. ICASSP, pp. 557-560.
-
(2002)
Proc. ICASSP
, pp. 557-560
-
-
Gillespie, B.W.1
Atlas, L.E.2
-
20
-
-
0034857681
-
Speech dereverberation via maximum-kurtosis subband adaptive filtering
-
Gillespie, B. W., Malvar, H. S., and Florencio, D. A. F. (2001). "Speech dereverberation via maximum-kurtosis subband adaptive filtering," Proc. ICASSP, vol. 6, pp. 3701-3704.
-
(2001)
Proc. ICASSP
, vol.6
, pp. 3701-3704
-
-
Gillespie, B.W.1
Malvar, H.S.2
Florencio, D.A.F.3
-
22
-
-
0141788523
-
Separation of stop consonants
-
Hu, G., and Wang, D. L. (2003). "Separation of stop consonants," Proc. ICASSP, vol. 2, pp. 749-752.
-
(2003)
Proc. ICASSP
, vol.2
, pp. 749-752
-
-
Hu, G.1
Wang, D.L.2
-
23
-
-
4644265990
-
Monaural speech segregation based on pitch tracking and amplitude modulation
-
Hu, G., and Wang, D. L. (2004). "Monaural speech segregation based on pitch tracking and amplitude modulation," IEEE Trans. Neural Netw. 15, 1135-1150.
-
(2004)
IEEE Trans. Neural Netw.
, vol.15
, pp. 1135-1150
-
-
Hu, G.1
Wang, D.L.2
-
24
-
-
33646786460
-
Separation of fricatives and affricates
-
Hu, G., and Wang, D. L. (2005). "Separation of fricatives and affricates," Proc. ICASSP vol. 1, pp. 1101-1104.
-
(2005)
Proc. ICASSP
, vol.1
, pp. 1101-1104
-
-
Hu, G.1
Wang, D.L.2
-
25
-
-
0038630563
-
Single channel signal separation using time-domain basis functions
-
Jang, G.-J., Lee, T.-W., and Oh, Y.-H. (2003). "Single channel signal separation using time-domain basis functions" IEEE Signal Process. Lett. 10(6), 168-171.
-
(2003)
IEEE Signal Process. Lett.
, vol.10
, Issue.6
, pp. 168-171
-
-
Jang, G.-J.1
Lee, T.-W.2
Oh, Y.-H.3
-
26
-
-
0001463644
-
A duplex theory of pitch perception
-
Licklider, J. C. R. (1951). "A duplex theory of pitch perception," Experientia 7, 128-134.
-
(1951)
Experientia
, vol.7
, pp. 128-134
-
-
Licklider, J.C.R.1
-
27
-
-
0342571948
-
A speech separation system that is robust to reverberation
-
Luo, H. Y., and Denbigh, P. N. (1994). "A speech separation system that is robust to reverberation," Proc. ISSIPNN, pp. 339-342.
-
(1994)
Proc. ISSIPNN
, pp. 339-342
-
-
Luo, H.Y.1
Denbigh, P.N.2
-
28
-
-
4544267645
-
Perceptual Kalman filtering for speech enhancement in colored noise
-
Ma, N., Bouchard, M., and Goubran, R. (2004). "Perceptual Kalman filtering for speech enhancement in colored noise," Proc. ICASSP, vol. 1, pp. 717-720.
-
(2004)
Proc. ICASSP
, vol.1
, pp. 717-720
-
-
Ma, N.1
Bouchard, M.2
Goubran, R.3
-
29
-
-
0035396555
-
Noise power spectral density estimation based on optimal smoothing and minimum statistics
-
Martin, R. (2001). "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process. 9, 504-512.
-
(2001)
IEEE Trans. Speech Audio Process.
, vol.9
, pp. 504-512
-
-
Martin, R.1
-
31
-
-
0020325263
-
Monaural and binaural speech perception in reverberation for listeners of various ages
-
Nabelek, A. K., and Robinson, P. K. (1982). "Monaural and binaural speech perception in reverberation for listeners of various ages," J. Acoust. Soc. Am. 71, 1242-1248.
-
(1982)
J. Acoust. Soc. Am.
, vol.71
, pp. 1242-1248
-
-
Nabelek, A.K.1
Robinson, P.K.2
-
32
-
-
0141830958
-
Blind dereverberation of single channel speech signal based on harmonic structure
-
Nakatani, T., and Miyoshi, M. (2003). "Blind dereverberation of single channel speech signal based on harmonic structure," Proc. ICASSP, pp. 92-95.
-
(2003)
Proc. ICASSP
, pp. 92-95
-
-
Nakatani, T.1
Miyoshi, M.2
-
33
-
-
0018494073
-
Invertibility of a room impulse response
-
Neely, S. T., and Allen, J. B. (1979). "Invertibility of a room impulse response," J. Acoust. Soc. Am. 66, 165-169.
-
(1979)
J. Acoust. Soc. Am.
, vol.66
, pp. 165-169
-
-
Neely, S.T.1
Allen, J.B.2
-
35
-
-
4644304197
-
A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
-
Palomaki, K. J., Brown, G. J., and Wang, D. L. (2004). "A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation," Speech Commun. 43, 361-378.
-
(2004)
Speech Commun.
, vol.43
, pp. 361-378
-
-
Palomaki, K.J.1
Brown, G.J.2
Wang, D.L.3
-
36
-
-
0003548690
-
-
Applied Psychology Unit, Cambridge
-
Patterson, R. D., Nimmo-Smith, I., Holdsworth, J., and Price, P. (1988). "APU Report 2341: An efficient auditory interbank based on the gamma-tone function," Applied Psychology Unit, Cambridge.
-
(1988)
APU Report 2341: An Efficient Auditory Interbank Based on the Gamma-tone Function
-
-
Patterson, R.D.1
Nimmo-Smith, I.2
Holdsworth, J.3
Price, P.4
-
37
-
-
0016916948
-
Binaural and monaural speech intelligibility of connected discourse in reverberation as a function of a single competing sound source (speech or noise)
-
Plomp, R. (1976). "Binaural and monaural speech intelligibility of connected discourse in reverberation as a function of a single competing sound source (speech or noise)," Acustica 34, 200-211.
-
(1976)
Acustica
, vol.34
, pp. 200-211
-
-
Plomp, R.1
-
38
-
-
0142026377
-
Speech segregation based on sound localization
-
Roman, N., Wang, D. L., and Brown, G. J. (2003). "Speech segregation based on sound localization," J. Acoust. Soc. Am. 114, 2236-2252.
-
(2003)
J. Acoust. Soc. Am.
, vol.114
, pp. 2236-2252
-
-
Roman, N.1
Wang, D.L.2
Brown, G.J.3
-
39
-
-
0031124228
-
A pitch determination and voice/unvoiced decision algorithm for noisy speech
-
Rouat, J., Liu, Y. C., and Morissette, D. (1997). "A pitch determination and voice/unvoiced decision algorithm for noisy speech," Speech Commun. 21, 191-207.
-
(1997)
Speech Commun.
, vol.21
, pp. 191-207
-
-
Rouat, J.1
Liu, Y.C.2
Morissette, D.3
-
40
-
-
33744996003
-
Model-based sequential organization in cochannel speech
-
Shao, Y., and Wang, D. L. (2006). "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Proc. 14, 289-298.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Proc.
, vol.14
, pp. 289-298
-
-
Shao, Y.1
Wang, D.L.2
-
41
-
-
0035254668
-
A sound segregation algorithm for reverberant conditions
-
Shamsoddini, A., and Denbigh, P. N. (2001). "A sound segregation algorithm for reverberant conditions," Speech Commun. 33, 179-196.
-
(2001)
Speech Commun.
, vol.33
, pp. 179-196
-
-
Shamsoddini, A.1
Denbigh, P.N.2
-
42
-
-
0002296637
-
On the importance of time - A temporal representation of sound
-
M. P. Cooke, S. Beet, and M. Crawford, eds. (Wiley, New York)
-
Slaney, M., and Lyon, R. F. (1993). "On the importance of time - A temporal representation of sound," in Visual Representations of Speech Signals, M. P. Cooke, S. Beet, and M. Crawford, eds. (Wiley, New York), pp. 95-116.
-
(1993)
Visual Representations of Speech Signals
, pp. 95-116
-
-
Slaney, M.1
Lyon, R.F.2
-
43
-
-
85009151411
-
On binary and ratio time-frequency masks for robust speech recognition
-
Srinivasan, S., Roman, N., and Wang, D. L. (2004). "On binary and ratio time-frequency masks for robust speech recognition," Proc. ICSLP, pp. 2541-2544.
-
(2004)
Proc. ICSLP
, pp. 2541-2544
-
-
Srinivasan, S.1
Roman, N.2
Wang, D.L.3
-
44
-
-
84892233308
-
On ideal binary mask as the computational goal of auditory scene analysis
-
P. Divenyi, ed. (Kluwer Academic, Norwell, MA)
-
Wang, D. L. (2005). "On ideal binary mask as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, ed. (Kluwer Academic, Norwell, MA), pp. 181-197.
-
(2005)
Speech Separation by Humans and Machines
, pp. 181-197
-
-
Wang, D.L.1
-
45
-
-
0032682770
-
Separation of speech from interfering sounds based on oscillatory correlation
-
Wang, D. L., and Brown, G. J. (1999). "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw. 10, 684-697.
-
(1999)
IEEE Trans. Neural Netw.
, vol.10
, pp. 684-697
-
-
Wang, D.L.1
Brown, G.J.2
-
48
-
-
33745761716
-
A two-stage algorithm for one-microphone reverberant speech enhancement
-
Wu, M., and Wang, D. L. (2006). "A two-stage algorithm for one-microphone reverberant speech enhancement," IEEE Trans. Audio, Speech, Lang. Proc. 10, 774-784.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Proc.
, vol.10
, pp. 774-784
-
-
Wu, M.1
Wang, D.L.2
-
49
-
-
0037767686
-
A multipitch tracking algorithm for noisy speech
-
Wu, M., Wang, D. L., and Brown, G. J. (2003) "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Process. 11, 229-241.
-
(2003)
IEEE Trans. Speech Audio Process.
, vol.11
, pp. 229-241
-
-
Wu, M.1
Wang, D.L.2
Brown, G.J.3
-
50
-
-
1242316819
-
Blind source separation by sparse decomposition
-
S. J. Roberts and R. M. Everson, eds. (Cambridge University Press, Cambridge)
-
Zibulevsky, M., Pearlmutter, B. A., Bofill, P., and Kisilev, P. (2001). "Blind source separation by sparse decomposition," in Independent Component Analysis: Principles and Practice, S. J. Roberts and R. M. Everson, eds. (Cambridge University Press, Cambridge).
-
(2001)
Independent Component Analysis: Principles and Practice
-
-
Zibulevsky, M.1
Pearlmutter, B.A.2
Bofill, P.3
Kisilev, P.4
|