-
2
-
-
82255178542
-
-
D. L. Wang and G. J. Brown, Eds.. Hoboken, NJ: Wiley/IEEE Press
-
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, D. L. Wang and G. J. Brown, Eds.. Hoboken, NJ: Wiley/IEEE Press, 2006.
-
(2006)
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
-
-
-
3
-
-
85008004589
-
Reverberation
-
D. L. Wang and G. J. Brown, Eds. New York: Wiley/IEEE Press
-
G. J. Brown and K. J. Palomaki, "Reverberation," in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, D. L. Wang and G. J. Brown, Eds. New York: Wiley/IEEE Press, 2006, pp. 209-250.
-
(2006)
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
, pp. 209-250
-
-
Brown, G.J.1
Palomaki, K.J.2
-
4
-
-
33744971131
-
Mask estimation for missing data speech recognition based on statistics of binaural interaction
-
Jan.
-
S. Harding, J. Barker, and G. J. Brown, "Mask estimation for missing data speech recognition based on statistics of binaural interaction," IEEE Trans. Audio, Speech, and Lang. Process., vol.14, no.1, pp. 58-67, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, and Lang. Process
, vol.14
, Issue.1
, pp. 58-67
-
-
Harding, S.1
Barker, J.2
Brown, G.J.3
-
5
-
-
33845361885
-
Binaural segregation in multisource reverberant environments
-
N. Roman, S. Srinivasan, and D. L. Wang, "Binaural segregation in multisource reverberant environments," J. Acoust. Soc. Amer., vol.120, no.6, pp. 4040-4051, 2006.
-
(2006)
J. Acoust. Soc. Amer
, vol.120
, Issue.6
, pp. 4040-4051
-
-
Roman, N.1
Srinivasan, S.2
Wang, D.L.3
-
6
-
-
50249101640
-
Sparseness-based 2CH BSS using the em algorithm in reverberant environment
-
Oct
-
Y. Izumi, N. Ono, and S. Sagayama, "Sparseness-based 2CH BSS using the EM algorithm in reverberant environment," in Proc. WASPAA, Oct. 2007, pp. 147-150.
-
(2007)
Proc. WASPAA
, pp. 147-150
-
-
Izumi, Y.1
Ono, N.2
Sagayama, S.3
-
7
-
-
50249183469
-
EM localization and separation using interaural level and phase cues
-
Oct
-
M. I. Mandel and D. P. W. Ellis, "EM localization and separation using interaural level and phase cues," in Proc. WASPAA, Oct. 2007, pp. 275-278.
-
(2007)
Proc. WASPAA
, pp. 275-278
-
-
Mandel, M.I.1
Ellis, D.P.W.2
-
8
-
-
50249118229
-
A two-state frequency-domain blind source separation method for underdetermined convolutive mixtures
-
Oct
-
H. Sawada, S. Araki, and S. Makino, "A two-state frequency-domain blind source separation method for underdetermined convolutive mixtures," in Proc. WASPAA, Oct. 2007, pp. 139-142.
-
(2007)
Proc. WASPAA
, pp. 139-142
-
-
Sawada, H.1
Araki, S.2
Makino, S.3
-
10
-
-
0029127703
-
Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay
-
J. F. Culling and Q. S. Summerfield, "Perceptual separation of concurrent speech sounds: Absence of across-frequency grouping by common interaural delay," J. Acoust. Soc. Amer., vol.98, pp. 785-797, 1995.
-
(1995)
J. Acoust. Soc. Amer
, vol.98
, pp. 785-797
-
-
Culling, J.F.1
Summerfield, Q.S.2
-
11
-
-
0033144658
-
Auditory objects of attention: The role of interaural time differences
-
C. J. Darwin and R. W. Hukin, "Auditory objects of attention: The role of interaural time differences," J. Exp. Psychol. Hum. Percept. Perform., vol.25, pp. 617-629, 1999.
-
(1999)
J. Exp. Psychol. Hum. Percept. Perform
, vol.25
, pp. 617-629
-
-
Darwin, C.J.1
Hukin, R.W.2
-
12
-
-
0003127954
-
How we localize sounds
-
Nov.
-
W. M. Hartmann, "How we localize sounds," Phys. Today, pp. 24-29, Nov. 1999.
-
(1999)
Phys. Today
, pp. 24-29
-
-
Hartmann, W.M.1
-
13
-
-
56249137775
-
Spatial hearing and perceiving sources
-
W. A. Yost, A. N. Popper, and R. R. Fay, Eds. New York: Springer
-
C. J. Darwin, "Spatial hearing and perceiving sources," in Auditory Perception of Sound Sources, W. A. Yost, A. N. Popper, and R. R. Fay, Eds. New York: Springer, 2007, pp. 215-232.
-
(2007)
Auditory Perception of Sound Sources
, pp. 215-232
-
-
Darwin, C.J.1
-
14
-
-
0035254668
-
A sound segregation algorithm for reverberant conditions
-
A. Shamsoddini and P. N. Denbigh, "A sound segregation algorithm for reverberant conditions," Speech Commun., vol.33, pp. 179-196, 2001.
-
(2001)
Speech Commun
, vol.33
, pp. 179-196
-
-
Shamsoddini, A.1
Denbigh, P.N.2
-
15
-
-
70349210869
-
A speech fragment approach to localising multiple speakers in reverberant environments
-
Apr.
-
H. Christensen, N. Ma, S. N. Wrigley, and J. Barker, "A speech fragment approach to localising multiple speakers in reverberant environments," in Proc. ICASSP, Apr. 2009, pp. 4593-4596.
-
(2009)
Proc. ICASSP
, pp. 4593-4596
-
-
Christensen, H.1
Ma, N.2
Wrigley, S.N.3
Barker, J.4
-
16
-
-
70349216477
-
On the role of localization cues in binaural segregation of reverberant speech
-
Apr.
-
J. Woodruff and D. L. Wang, "On the role of localization cues in binaural segregation of reverberant speech," in Proc. ICASSP, Apr. 2009, pp. 2205-2208.
-
(2009)
Proc. ICASSP
, pp. 2205-2208
-
-
Woodruff, J.1
Wang, D.L.2
-
17
-
-
77955678360
-
Integrating monaural and binaural analysis for localizing multiple reverberant sound sources
-
Mar.
-
J. Woodruff and D. L. Wang, "Integrating monaural and binaural analysis for localizing multiple reverberant sound sources," in Proc. ICASSP, Mar. 2010, pp. 2706-2709.
-
(2010)
Proc. ICASSP
, pp. 2706-2709
-
-
Woodruff, J.1
Wang, D.L.2
-
18
-
-
70349448618
-
An algorithm for speech segregation of co-channel speech
-
Apr.
-
S. Vishnubhotla and C. Y. Epsy-Wilson, "An algorithm for speech segregation of co-channel speech," in Proc. ICASSP, Apr. 2009, pp. 109-112.
-
(2009)
Proc. ICASSP
, pp. 109-112
-
-
Vishnubhotla, S.1
Epsy-Wilson, C.Y.2
-
19
-
-
65249103478
-
A supervised learning approach to monaural segregation of reverberant speech
-
Z. Jin and D. L. Wang, "A supervised learning approach to monaural segregation of reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol.17, pp. 625-638, 2009.
-
(2009)
IEEE Trans. Audio, Speech, Lang. Process
, vol.17
, pp. 625-638
-
-
Jin, Z.1
Wang, D.L.2
-
20
-
-
49249107353
-
Segregation of unvoiced speech from nonspeech interference
-
G. Hu and D. L. Wang, "Segregation of unvoiced speech from nonspeech interference," J. Acoust. Soc. Amer., vol.124, pp. 1306-1319, 2008.
-
(2008)
J. Acoust. Soc. Amer
, vol.124
, pp. 1306-1319
-
-
Hu, G.1
Wang, D.L.2
-
21
-
-
67349134831
-
Sequential organization of speech in computational auditory scene analysis
-
Y. Shao and D. L. Wang, "Sequential organization of speech in computational auditory scene analysis," Speech Commun., vol.51, pp. 657-667, 2009.
-
(2009)
Speech Commun
, vol.51
, pp. 657-667
-
-
Shao, Y.1
Wang, D.L.2
-
23
-
-
0029041417
-
HRTF measurements of a KEMAR
-
W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR," J. Acoust. Soc. Amer., vol.97, pp. 3907-3908, 1995.
-
(1995)
J. Acoust. Soc. Amer
, vol.97
, pp. 3907-3908
-
-
Gardner, W.G.1
Martin, K.D.2
-
24
-
-
0018455820
-
Image method for efficiently simulating small-room acoustics
-
J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, pp. 943-950, 1979.
-
(1979)
J. Acoust. Soc. Amer
, vol.65
, pp. 943-950
-
-
Allen, J.B.1
Berkley, D.A.2
-
25
-
-
0003548585
-
-
J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, "DARPA TIMIT acoustic phonetic continuous speech corpus," 1993.
-
(1993)
DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
Dahlgren, N.L.6
-
26
-
-
0142056390
-
-
Cambridge, U.K., Tech. Rep., MRC Applied Psychology Unit
-
R. D. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function," Cambridge, U.K., Tech. Rep., MRC Applied Psychology Unit, 1988.
-
(1988)
An Efficient Auditory Filterbank Based on the Gammatone Function
-
-
Patterson, R.D.1
Nimmo-Smith, I.2
Holdsworth, J.3
Rice, P.4
-
27
-
-
0025110885
-
Derivation of auditory filter shapes from notched-noise data
-
B. R. Glasberg and B. C. J. Moore, "Derivation of auditory filter shapes from notched-noise data," Hear. Res., vol.47, pp. 103-138, 1990.
-
(1990)
Hear. Res
, vol.47
, pp. 103-138
-
-
Glasberg, B.R.1
Moore, B.C.J.2
-
28
-
-
77955695149
-
A tandem algorithm for pitch estimation and voiced speech segregation
-
to be published
-
G. Hu and D. L. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., 2010, to be published.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process
-
-
Hu, G.1
Wang, D.L.2
-
29
-
-
85045165251
-
-
Ph.D. dissertation, The Ohio State Univ., Columbus, OH
-
G. Hu, "Monaural speech organization and segregation," Ph.D. dissertation, The Ohio State Univ., Columbus, OH, 2006.
-
(2006)
Monaural Speech Organization and Segregation
-
-
Hu, G.1
-
32
-
-
0142026377
-
Speech segregation based on sound localization
-
N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol.114, no.4, pp. 2236-2252, 2003.
-
(2003)
J. Acoust. Soc. Amer
, vol.114
, Issue.4
, pp. 2236-2252
-
-
Roman, N.1
Wang, D.L.2
Brown, G.J.3
-
33
-
-
0032845228
-
The precedence effect
-
R. Y. Litovsky, H. S. Colburn, W. A. Yost, and S. J. Guzman, "The precedence effect," J. Acoust. Soc. Amer., vol.106, pp. 1633-1654, 1999.
-
(1999)
J. Acoust. Soc. Amer
, vol.106
, pp. 1633-1654
-
-
Litovsky, R.Y.1
Colburn, H.S.2
Yost, W.A.3
Guzman, S.J.4
-
34
-
-
9644281074
-
Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
-
C. Faller and J. Merimaa, "Source localization in complex listening situations: Selection of binaural cues based on interaural coherence," J. Acoust. Soc. Amer., vol.116, no.5, pp. 3075-3089, 2004.
-
(2004)
J. Acoust. Soc. Amer
, vol.116
, Issue.5
, pp. 3075-3089
-
-
Faller, C.1
Merimaa, J.2
-
35
-
-
33947155770
-
Learning a precedence effect-like weighting function for the generalized cross-correlation framework
-
Nov.
-
K. W. Wilson and T. Darrell, "Learning a precedence effect-like weighting function for the generalized cross-correlation framework," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 2156-2164, Nov. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.6
, pp. 2156-2164
-
-
Wilson, K.W.1
Darrell, T.2
-
36
-
-
33744996003
-
Model-based sequential organization in cochannel speech
-
Jan.
-
Y. Shao and D. L. Wang, "Model-based sequential organization in cochannel speech," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.1, pp. 289-298, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.1
, pp. 289-298
-
-
Shao, Y.1
Wang, D.L.2
-
37
-
-
0003343412
-
Robust localization in reverberant rooms
-
M. Brandstein and D.Ward, Eds. New York: Springer, ch. 8
-
J. H. DiBiase, H. F. Silverman, and M. S. Brandstein, "Robust localization in reverberant rooms," in Microphone Arrays: Signal Processing Techniques and Applications, M. Brandstein and D.Ward, Eds. New York: Springer, 2001, ch. 8, pp. 157-180.
-
(2001)
Microphone Arrays: Signal Processing Techniques and Applications
, pp. 157-180
-
-
Dibiase, J.H.1
Silverman, H.F.2
Brandstein, M.S.3
-
38
-
-
0033778326
-
Localization of multiple sound sources with two microphones
-
C. Liu, B. C. Wheeler,W. D. O'Brien, R. C. Bilger, C. R. Lansing, and A. S. Feng, "Localization of multiple sound sources with two microphones," J. Acoust. Soc. Amer., vol.108, no.4, pp. 1888-1905, 2000.
-
(2000)
J. Acoust. Soc. Amer
, vol.108
, Issue.4
, pp. 1888-1905
-
-
Liu, C.1
Wheeler, B.C.2
O'Brien, W.D.3
Bilger, R.C.4
Lansing, C.R.5
Feng, A.S.6
-
39
-
-
84892233308
-
On ideal binary masks as the computational goal of auditory scene analysis
-
P. Divenyi, Ed. Boston, MA: Kluwer
-
D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Boston, MA: Kluwer, 2005, pp. 181-197.
-
(2005)
Speech Separation by Humans and Machines
, pp. 181-197
-
-
Wang, D.L.1
-
40
-
-
33845354768
-
Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask
-
D. Brungart, P. S. Chang, B. D. Simpson, and D. L. Wang, "Isolating the energetic component of speech-on-speech masking with an ideal binary time-frequency mask," J. Acoust. Soc. Amer., vol.120, pp. 4007-4018, 2006.
-
(2006)
J. Acoust. Soc. Amer
, vol.120
, pp. 4007-4018
-
-
Brungart, D.1
Chang, P.S.2
Simpson, B.D.3
Wang, D.L.4
|