-
1
-
-
3442876970
-
Phase-based dual-microphone robust speech enhancement
-
Aarabi P. Shi G. (2004). Phase-based dual-microphone robust speech enhancement. IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics, 34, 1763–1773.
-
(2004)
IEEE Transactions on Systems, Man, and Cybernetics—Part B: Cybernetics
, vol.34
, pp. 1763-1773
-
-
Aarabi, P.1
Shi, G.2
-
3
-
-
4544333241
-
Underdetermined blind separation for speech in real environments with sparseness and ICA
-
(May) Montreal, Quebec, Canada.
-
Araki S. Makino S. Blin A. Mukai R. Sawada H. (2004, May). Underdetermined blind separation for speech in real environments with sparseness and ICA. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. III, pp. 881–884), Montreal, Quebec, Canada.
-
(2004)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.3
, pp. 881-884
-
-
Araki, S.1
Makino, S.2
Blin, A.3
Mukai, R.4
Sawada, H.5
-
5
-
-
33646759922
-
Reducing musical noise by a fine-shift overlap-and-add method applied to source separation using a time-frequency mask
-
(March) Philadelphia, PA.
-
Araki S. Makino S. Sawada H. Mukai R. (2005, March). Reducing musical noise by a fine-shift overlap-and-add method applied to source separation using a time-frequency mask. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. III, pp. 81–84), Philadelphia, PA.
-
(2005)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.3
, pp. 81-84
-
-
Araki, S.1
Makino, S.2
Sawada, H.3
Mukai, R.4
-
7
-
-
34247223586
-
Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors
-
Araki S. Sawada H. Mukai R. Makino S. (2007). Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors. Signal Processing, 87, 1833–1847.
-
(2007)
Signal Processing
, vol.87
, pp. 1833-1847
-
-
Araki, S.1
Sawada, H.2
Mukai, R.3
Makino, S.4
-
8
-
-
85009063707
-
Soft decisions in missing data techniques for robust automatic speech recognition
-
(October) Beijing, China.
-
Barker J. Josifovski L. Cooke M. Green P. (2000, October). Soft decisions in missing data techniques for robust automatic speech recognition. In Proceedings of Sixth International Conference on Spoken Language Processing (Vol. 1, pp. 373–376), Beijing, China.
-
(2000)
Proceedings of Sixth International Conference on Spoken Language Processing
, vol.1
, pp. 373-376
-
-
Barker, J.1
Josifovski, L.2
Cooke, M.3
Green, P.4
-
11
-
-
0002706411
-
Modeling human sound-source localization and the cocktail-party-effect
-
Bodden M. (1993). Modeling human sound-source localization and the cocktail-party-effect. Acta Acustica, 1, 43–55.
-
(1993)
Acta Acustica
, vol.1
, pp. 43-55
-
-
Bodden, M.1
-
16
-
-
85008004589
-
Reverberation
-
In Wang D. L. Brown G. J. (Eds.) Hoboken, NJ: Wiley/IEEE Press
-
Brown G. J. Palomäki K. J. (2006). Reverberation. In Wang D. L. Brown G. J. (Eds.), Computational auditory scene analysis: Principles, algorithms, and applications (pp. 209–250). Hoboken, NJ: Wiley/IEEE Press.
-
(2006)
Computational auditory scene analysis: Principles, algorithms, and applications
, pp. 209-250
-
-
Brown, G.J.1
Palomäki, K.J.2
-
20
-
-
0028413241
-
Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor
-
Cappe O. (1994). Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Transactions on Speech and Audio Processing, 2, 345–349.
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, pp. 345-349
-
-
Cappe, O.1
-
22
-
-
33745217651
-
-
Unpublished master's thesis, Department of Computer Science and Engineering, The Ohio State University, Columbus
-
Chang P. (2004). Exploration of behavioral, physiological, and computational approaches to auditory scene analysis. Unpublished master's thesis, Department of Computer Science and Engineering, The Ohio State University, Columbus.
-
(2004)
Exploration of behavioral, physiological, and computational approaches to auditory scene analysis
-
-
Chang, P.1
-
23
-
-
0035342414
-
Robust automatic speech recognition with missing and unreliable acoustic data
-
Cooke M. Green P. Josifovski L. Vizinho A. (2001). Robust automatic speech recognition with missing and unreliable acoustic data. Speech Communication, 34, 267–285.
-
(2001)
Speech Communication
, vol.34
, pp. 267-285
-
-
Cooke, M.1
Green, P.2
Josifovski, L.3
Vizinho, A.4
-
26
-
-
4544247268
-
A method for directionally-disjoint source separation in convolutive environment
-
(May) Montreal, Quebec, Canada.
-
Dubnov S. Tabrikian J. Arnon-Targan M. (2004, May). A method for directionally-disjoint source separation in convolutive environment. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. V, pp. 489–492), Montreal, Quebec, Canada.
-
(2004)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.5
, pp. 489-492
-
-
Dubnov, S.1
Tabrikian, J.2
Arnon-Targan, M.3
-
27
-
-
33746589116
-
Speech source separation in convolutive environments using space-time-frequency analysis
-
Article 38412
-
Dubnov S. Tabrikian J. Arnon-Targan M. (2006). Speech source separation in convolutive environments using space-time-frequency analysis. EURASIP Journal on Applied Signal Processing, 2006, Article 38412, 11 pages.
-
(2006)
EURASIP Journal on Applied Signal Processing
, vol.2006
, pp. 11
-
-
Dubnov, S.1
Tabrikian, J.2
Arnon-Targan, M.3
-
30
-
-
0023922474
-
Excess masking among listeners with a sensorineural hearing loss
-
Gagne J.-P. (1988). Excess masking among listeners with a sensorineural hearing loss. Journal of the Acoustical Society of America, 83, 2311–2321.
-
(1988)
Journal of the Acoustical Society of America
, vol.83
, pp. 2311-2321
-
-
Gagne, J.-P.1
-
32
-
-
33744971131
-
Mask estimation for missing data speech recognition based on statistics of binaural interaction
-
Harding S. Barker J. Brown G. J. (2006). Mask estimation for missing data speech recognition based on statistics of binaural interaction. IEEE Transactions on Audio, Speech, and Language Processing, 14, 58–67.
-
(2006)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.14
, pp. 58-67
-
-
Harding, S.1
Barker, J.2
Brown, G.J.3
-
33
-
-
84998077855
-
-
(Ellis A. J., Trans., 2nd English ed.). New York: Dover
-
Helmholtz H. (1863). On the sensations of tone (Ellis A. J., Trans., 2nd English ed.). New York: Dover.
-
(1863)
On the sensations of tone
-
-
Helmholtz, H.1
-
35
-
-
4644265990
-
Monaural speech segregation based on pitch tracking and amplitude modulation
-
Hu G. Wang D. L. (2004). Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Transactions on Neural Networks, 15, 1135–1150.
-
(2004)
IEEE Transactions on Neural Networks
, vol.15
, pp. 1135-1150
-
-
Hu, G.1
Wang, D.L.2
-
36
-
-
46049084696
-
An auditory scene analysis approach to monaural speech segregation
-
In Hansler E. Schmidt G. (Eds.) Heidelberg, Germany: Springer
-
Hu G. Wang D. L. (2006). An auditory scene analysis approach to monaural speech segregation. In Hansler E. Schmidt G. (Eds.), Topics in acoustic echo and noise control (pp. 485–515). Heidelberg, Germany: Springer.
-
(2006)
Topics in acoustic echo and noise control
, pp. 485-515
-
-
Hu, G.1
Wang, D.L.2
-
37
-
-
49249107353
-
Segregation of unvoiced speech from nonspeech interference
-
Hu G. Wang D. L. (2008). Segregation of unvoiced speech from nonspeech interference. Journal of the Acoustical Society of America, 124, 1306–1319.
-
(2008)
Journal of the Acoustical Society of America
, vol.124
, pp. 1306-1319
-
-
Hu, G.1
Wang, D.L.2
-
39
-
-
0014568991
-
IEEE recommended practice for speech quality measurements
-
IEEE. (1969). IEEE recommended practice for speech quality measurements. IEEE Transactions on Audio and Electroacoustics, 17, 225–246.
-
(1969)
IEEE Transactions on Audio and Electroacoustics
, vol.17
, pp. 225-246
-
-
-
40
-
-
0033692661
-
Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures
-
(June) Istanbul, Turkey.
-
Jourjine A. Rickard S. Yilmaz O. (2000, June). Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. 5, pp. 2985–2988), Istanbul, Turkey.
-
(2000)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.5
, pp. 2985-2988
-
-
Jourjine, A.1
Rickard, S.2
Yilmaz, O.3
-
41
-
-
33751279943
-
Multichannel dynamic-range compression using digital frequency warping
-
Kates J. M. Arehart K. H. (2005). Multichannel dynamic-range compression using digital frequency warping. EURASIP Journal on Applied Signal Processing, 18, 3003–3014.
-
(2005)
EURASIP Journal on Applied Signal Processing
, vol.18
, pp. 3003-3014
-
-
Kates, J.M.1
Arehart, K.H.2
-
43
-
-
33749058582
-
Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
-
(October) New Paltz, NY.
-
Kolossa D. Klimas A. Orglmeister R. (2005, October). Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (pp. 82–85), New Paltz, NY.
-
(2005)
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
, pp. 82-85
-
-
Kolossa, D.1
Klimas, A.2
Orglmeister, R.3
-
46
-
-
41849093721
-
Effect of spectral resolution on the intelligibility of ideal binary masked speech
-
Li N. Loizou P. C. (2008a). Effect of spectral resolution on the intelligibility of ideal binary masked speech. Journal of the Acoustical Society of America, 123, EL59–EL64.
-
(2008)
Journal of the Acoustical Society of America
, vol.123
, pp. EL59-EL64
-
-
Li, N.1
Loizou, P.C.2
-
47
-
-
40749125179
-
Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction
-
Li N. Loizou P. C. (2008b). Factors influencing intelligibility of ideal binary-masked speech: Implications for noise reduction. Journal of the Acoustical Society of America, 123, 1673–1682.
-
(2008)
Journal of the Acoustical Society of America
, vol.123
, pp. 1673-1682
-
-
Li, N.1
Loizou, P.C.2
-
48
-
-
40949108726
-
Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech
-
Li P. Guan Y. Xu B. Liu W. (2006). Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE Transactions on Audio, Speech, and Language Processing, 14, 2014–2023.
-
(2006)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.14
, pp. 2014-2023
-
-
Li, P.1
Guan, Y.2
Xu, B.3
Liu, W.4
-
57
-
-
33749539632
-
Blind separation of acoustic signals combining SIMO-model-based independent component analysis and binary masking
-
Mori Y. Saruwatari H. Takatani T. Ukai S. Shikano K. Hiekata T. et al. (2006). Blind separation of acoustic signals combining SIMO-model-based independent component analysis and binary masking. EURASIP Journal on Applied Signal Processing, 2006(20), 1–17.
-
(2006)
EURASIP Journal on Applied Signal Processing
, vol.2006
, Issue.20
, pp. 1-17
-
-
Mori, Y.1
Saruwatari, H.2
Takatani, T.3
Ukai, S.4
Shikano, K.5
Hiekata, T.6
-
58
-
-
0024753593
-
Speech recognition using noise-adaptive prototypes
-
Nadas A. Nahamoo D. Picheny M. A. (1989). Speech recognition using noise-adaptive prototypes. IEEE Transactions on Acoustics, Speech, and Signal Processing, 37, 1495–1503.
-
(1989)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.37
, pp. 1495-1503
-
-
Nadas, A.1
Nahamoo, D.2
Picheny, M.A.3
-
59
-
-
0028012490
-
Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise
-
Nilsson M. Soli S. Sullivan J. A. (1994). Development of the hearing in noise test for the measurement of speech reception thresholds in quiet and in noise. Journal of the Acoustical Society of America, 95, 1085–1099.
-
(1994)
Journal of the Acoustical Society of America
, vol.95
, pp. 1085-1099
-
-
Nilsson, M.1
Soli, S.2
Sullivan, J.A.3
-
60
-
-
2942539074
-
Techniques for handling convolutional distortion with “missing data” automatic speech recognition
-
Palomäki K. J. Brown G. J. Barker J. (2004). Techniques for handling convolutional distortion with “missing data” automatic speech recognition. Speech Communication, 43, 123–142.
-
(2004)
Speech Communication
, vol.43
, pp. 123-142
-
-
Palomäki, K.J.1
Brown, G.J.2
Barker, J.3
-
63
-
-
35648992055
-
Monophonic sound source separation with an unsupervised network of spiking neurones
-
Pichevar R. Rouat J. (2007). Monophonic sound source separation with an unsupervised network of spiking neurones. Neurocomputing, 71, 109–120.
-
(2007)
Neurocomputing
, vol.71
, pp. 109-120
-
-
Pichevar, R.1
Rouat, J.2
-
64
-
-
33845940172
-
A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation
-
Article 84186
-
Radfar M. H. Dansereau R. M. Sayadiyan A. (2007). A maximum likelihood estimation of vocal-tract-related filter characteristics for single channel speech separation. EURASIP Journal on Audio, Speech, and Music Processing, 2007, Article 84186, 15 pages.
-
(2007)
EURASIP Journal on Audio, Speech, and Music Processing
, vol.2007
, pp. 15
-
-
Radfar, M.H.1
Dansereau, R.M.2
Sayadiyan, A.3
-
65
-
-
56249144712
-
Soft mask methods for single-channel speaker separation
-
Reddy A. M. Raj B. (2007). Soft mask methods for single-channel speaker separation. IEEE Transactions on Audio, Speech, and Language Processing, 15, 1766–1776.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, pp. 1766-1776
-
-
Reddy, A.M.1
Raj, B.2
-
68
-
-
4644328243
-
Binaural sound separation for multisource reverberant environments
-
Montreal, Quebec, Canada
-
Roman N. Wang D. L. (2004). Binaural sound separation for multisource reverberant environments. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. II, pp. 373–376), Montreal, Quebec, Canada.
-
(2004)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.2
, pp. 373-376
-
-
Roman, N.1
Wang, D.L.2
-
71
-
-
34250639628
-
Two-stage blind source separation based on ICA and binary masking for real-time robot audition system
-
(August) Edmonton, Alberta, Canada.
-
Saruwatari H. Mori Y. Takatani T. Ukai S. Shikano K. Hiekata T. et al. (2005, August). Two-stage blind source separation based on ICA and binary masking for real-time robot audition system. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2005 (pp. 2303–2308), Edmonton, Alberta, Canada.
-
(2005)
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2005
, pp. 2303-2308
-
-
Saruwatari, H.1
Mori, Y.2
Takatani, T.3
Ukai, S.4
Shikano, K.5
Hiekata, T.6
-
72
-
-
33645163182
-
Blind extraction of a dominant source signal from mixtures of many sources
-
Philadelphia, PA.
-
Sawada H. Araki S. Mukai R. Makino S. (2005). Blind extraction of a dominant source signal from mixtures of many sources. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (Vol. III, pp. 61–64), Philadelphia, PA.
-
(2005)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.3
, pp. 61-64
-
-
Sawada, H.1
Araki, S.2
Mukai, R.3
Makino, S.4
-
73
-
-
33847771459
-
Blind extraction of dominant target sources using ICA and time-frequency masking
-
Sawada H. Araki S. Mukai R. Makino S. (2006). Blind extraction of dominant target sources using ICA and time-frequency masking. IEEE Transactions on Audio, Speech, and Language Processing, 14, 2165–2173.
-
(2006)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.14
, pp. 2165-2173
-
-
Sawada, H.1
Araki, S.2
Mukai, R.3
Makino, S.4
-
75
-
-
33750311718
-
Binary and ratio time-frequency masks for robust speech recognition
-
Srinivasan S. Roman N. Wang D. L. (2006). Binary and ratio time-frequency masks for robust speech recognition. Speech Communication, 48, 1486–1501.
-
(2006)
Speech Communication
, vol.48
, pp. 1486-1501
-
-
Srinivasan, S.1
Roman, N.2
Wang, D.L.3
-
76
-
-
56249136428
-
Transforming binary uncertainties for robust speech recognition
-
Srinivasan S. Wang D. L. (2007). Transforming binary uncertainties for robust speech recognition. IEEE Transactions on Audio, Speech, and Language Processing, 15, 2130–2140.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, pp. 2130-2140
-
-
Srinivasan, S.1
Wang, D.L.2
-
80
-
-
0023985457
-
Beamforming: A versatile approach to spatial filtering
-
(April)
-
van Veen B. D. Buckley K. M. (1988, April). Beamforming: A versatile approach to spatial filtering. IEEE ASSP Magazine, pp. 4–24.
-
(1988)
IEEE ASSP Magazine
, pp. 4-24
-
-
van Veen, B.D.1
Buckley, K.M.2
-
82
-
-
84892233308
-
On ideal binary mask as the computational goal of auditory scene analysis
-
In Divenyi P. (Ed.) Norwell, MA: Kluwer Academic
-
Wang D. L. (2005). On ideal binary mask as the computational goal of auditory scene analysis. In Divenyi P. (Ed.), Speech separation by humans and machines (pp. 181–197). Norwell, MA: Kluwer Academic.
-
(2005)
Speech separation by humans and machines
, pp. 181-197
-
-
Wang, D.L.1
-
83
-
-
0032682770
-
Separation of speech from interfering sounds based on oscillatory correlation
-
Wang D. L. Brown G. J. (1999). Separation of speech from interfering sounds based on oscillatory correlation. IEEE Transactions on Neural Networks, 10, 684–697.
-
(1999)
IEEE Transactions on Neural Networks
, vol.10
, pp. 684-697
-
-
Wang, D.L.1
Brown, G.J.2
-
88
-
-
3142694930
-
Blind separation of speech mixtures via time-frequency masking
-
Yilmaz O. Rickard S. (2004). Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 52, 1830–1847.
-
(2004)
IEEE Transactions on Signal Processing
, vol.52
, pp. 1830-1847
-
-
Yilmaz, O.1
Rickard, S.2