-
1
-
-
50249115096
-
Evaluating speech separation systems
-
P. Divenyi, Ed. Norwell, MA: Kluwer, ch.
-
D. P. W. Ellis, "Evaluating speech separation systems," in Speech Separation by Humans and Machines, P. Divenyi, Ed. Norwell, MA: Kluwer, 2004, ch. 20, pp. 295-304.
-
(2004)
Speech Separation by Humans and Machines
, vol.20
, pp. 295-304
-
-
Ellis, D.P.W.1
-
2
-
-
33845354768
-
Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation
-
D. S. Brungart, P. S. Chang, B. D. Simpson, and D. Wang, "Isolating the energetic component of speech-on-speech masking with ideal time-frequency segregation," J. Acoust. Soc. Amer., vol.120, no.6, pp. 4007-4018, 2006.
-
(2006)
J. Acoust. Soc. Amer
, vol.120
, Issue.6
, pp. 4007-4018
-
-
Brungart, D.S.1
Chang, P.S.2
Simpson, B.D.3
Wang, D.4
-
3
-
-
53949095896
-
Speech perception of noise with binary gains
-
D. Wang, U. Kjems, M. S. Pedersen, J. B. Boldt, and T. Lunner, "Speech perception of noise with binary gains," J. Acoust. Soc. Amer., vol.124, no.4, pp. 2303-2307, 2008.
-
(2008)
J. Acoust. Soc. Amer
, vol.124
, Issue.4
, pp. 2303-2307
-
-
Wang, D.1
Kjems, U.2
Pedersen, M.S.3
Boldt, J.B.4
Lunner, T.5
-
4
-
-
0003982501
-
-
Ph.D. dissertation, Stanford Univ. Dept. of Elect. Eng., Stanford, CA
-
M. Weintraub, "A theory and computational model of auditory monaural sound separation," Ph.D. dissertation, Stanford Univ. Dept. of Elect. Eng., Stanford, CA, 1985.
-
(1985)
A Theory and Computational Model of Auditory Monaural Sound Separation
-
-
Weintraub, M.1
-
5
-
-
44149106061
-
Evaluation of objective quality measures for speech enhancement
-
Jan.
-
Y. Hu and P. C. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol.16, no.1, pp. 229-238, Jan. 2008.
-
(2008)
IEEE Trans. Audio, Speech, Lang. Process
, vol.16
, Issue.1
, pp. 229-238
-
-
Hu, Y.1
Loizou, P.C.2
-
6
-
-
54949092435
-
Perceptual evaluation of blind source separation for robust speech recognition
-
Oct.
-
L. D. Persia, D. Milone, H. Rufiner, and M.Yanagida, "Perceptual evaluation of blind source separation for robust speech recognition," Signal Process., vol.88, no.10, pp. 2578-2583, Oct. 2008.
-
(2008)
Signal Process
, vol.88
, Issue.10
, pp. 2578-2583
-
-
Persia, L.D.1
Milone, D.2
Rufiner, H.3
Yanagida, M.4
-
7
-
-
0028823541
-
Speech recognition with primarily temporal cues
-
Oct.
-
R. V. Shannon, F.-G. Zeng, V. Kamath, J. Wygonski, and M. Ekelid, "Speech recognition with primarily temporal cues," Science, vol.270, no.5234, pp. 303-304, Oct. 1995.
-
(1995)
Science
, vol.270
, Issue.5234
, pp. 303-304
-
-
Shannon, R.V.1
Zeng, F.-G.2
Kamath, V.3
Wygonski, J.4
Ekelid, M.5
-
8
-
-
0033282527
-
Testing the ability of speech recognizers to measure the effectiveness of encoding algorithms for digital speech transmission
-
C. M. Chernick, S. Leigh, K. L. Mills, and R. Toense, "Testing the ability of speech recognizers to measure the effectiveness of encoding algorithms for digital speech transmission," in Proc. IEEE Military Commun. Conf., 1999, vol.2, pp. 1468-1472.
-
(1999)
Proc. IEEE Military Commun. Conf.
, vol.2
, pp. 1468-1472
-
-
Chernick, C.M.1
Leigh, S.2
Mills, K.L.3
Toense, R.4
-
9
-
-
84948437341
-
Speech recognition performance as an effective perceived quality predictor
-
W. Jiang and H. Schulzrinne, "Speech recognition performance as an effective perceived quality predictor," in Proc. IEEE Int. Workshop Quality of Service, 2002, pp. 269-275.
-
(2002)
Proc. IEEE Int. Workshop Quality of Service
, pp. 269-275
-
-
Jiang, W.1
Schulzrinne, H.2
-
10
-
-
4544238561
-
Monaural speech separation
-
G. Hu and D. Wang, "Monaural speech separation," Adv. Neur. Inf. Process. Syst., vol.15, pp. 1221-1228, 2003.
-
(2003)
Adv. Neur. Inf. Process. Syst
, vol.15
, pp. 1221-1228
-
-
Hu, G.1
Wang, D.2
-
11
-
-
3142694930
-
Blind separation of speech mixtures via time-frequency masking
-
Jul.
-
O. Y?lmaz and S. Rickard, "Blind separation of speech mixtures via time-frequency masking," IEEE Trans. Signal Process., vol.52, no.7, pp. 1830-1847, Jul. 2004.
-
(2004)
IEEE Trans. Signal Process
, vol.52
, Issue.7
, pp. 1830-1847
-
-
Ylmaz, O.1
Rickard, S.2
-
13
-
-
58149196390
-
On the optimality of ideal binary time-frequency masks
-
Mar.
-
Y. Li and D.Wang, "On the optimality of ideal binary time-frequency masks," Speech Commun., vol.51, no.3, pp. 230-239, Mar. 2009.
-
(2009)
Speech Commun
, vol.51
, Issue.3
, pp. 230-239
-
-
Li, Y.1
Wang, D.2
-
14
-
-
77950114888
-
Effects of pitch and spatial separation on selective attention in anechoic and reverberant environments
-
S. Bressler and B. G. Shinn-Cunningham, "Effects of pitch and spatial separation on selective attention in anechoic and reverberant environments," J. Acoust. Soc. Amer., vol.123, no.5, pp. 2978-2978, 2008.
-
(2008)
J. Acoust. Soc. Amer
, vol.123
, Issue.5
, pp. 2978-2978
-
-
Bressler, S.1
Shinn-Cunningham, B.G.2
-
15
-
-
0035106984
-
Informational and energetic masking effects in the perception of two simultaneous talkers
-
D. S. Brungart, "Informational and energetic masking effects in the perception of two simultaneous talkers," J. Acoust. Soc. Amer., vol.109, no.3, pp. 1101-1109, 2001.
-
(2001)
J. Acoust. Soc. Amer
, vol.109
, Issue.3
, pp. 1101-1109
-
-
Brungart, D.S.1
-
16
-
-
27744532546
-
Precedence-based speech segregation in a virtual auditory environment
-
D. S. Brungart, B. D. Simpson, and R. L. Freyman, "Precedence-based speech segregation in a virtual auditory environment," J. Acoust. Soc. Amer., vol.118, no.5, pp. 3241-3251, 2005.
-
(2005)
J. Acoust. Soc. Amer
, vol.118
, Issue.5
, pp. 3241-3251
-
-
Brungart, D.S.1
Simpson, B.D.2
Freyman, R.L.3
-
17
-
-
29244442934
-
The advantage of knowing where to listen
-
G. Kidd, T. L. Arbogast, C. R. Mason, and F. J. Gallun, "The advantage of knowing where to listen," J. Acoust. Soc. Amer., vol.118, no.6, pp. 3804-3815, 2005.
-
(2005)
J. Acoust. Soc. Amer
, vol.118
, Issue.6
, pp. 3804-3815
-
-
Kidd, G.1
Arbogast, T.L.2
Mason, C.R.3
Gallun, F.J.4
-
18
-
-
17344368194
-
Release from masking due to spatial separation of sources in the identification of nonspeech auditory patterns
-
C. R. Mason, T. L. Rohtla, and P. S. Deliwala, "Release from masking due to spatial separation of sources in the identification of nonspeech auditory patterns," J. Acoust. Soc. Amer., vol.104, no.1, pp. 422-431, 1998.
-
(1998)
J. Acoust. Soc. Amer
, vol.104
, Issue.1
, pp. 422-431
-
-
Mason, C.R.1
Rohtla, T.L.2
Deliwala, P.S.3
-
19
-
-
2142813014
-
Effect of number of masking talkers and auditory priming on informational masking in speech recognition
-
R. L. Freyman, U. Balakrishnan, and K. S. Helfer, "Effect of number of masking talkers and auditory priming on informational masking in speech recognition," J. Acoust. Soc. Amer., vol.115, no.5, pp. 2246-2256, 2004.
-
(2004)
J. Acoust. Soc. Amer
, vol.115
, Issue.5
, pp. 2246-2256
-
-
Freyman, R.L.1
Balakrishnan, U.2
Helfer, K.S.3
-
20
-
-
42749094234
-
Object-based auditory and visual attention
-
May
-
B. G. Shinn-Cunningham, "Object-based auditory and visual attention," Trends in Cognitive Sci., vol.12, no.5, pp. 182-186, May 2008.
-
(2008)
Trends in Cognitive Sci
, vol.12
, Issue.5
, pp. 182-186
-
-
Shinn-Cunningham, B.G.1
-
21
-
-
18744392833
-
Localizing nearby sound sources in a classroom: Binaural room impulse responses
-
B. G. Shinn-Cunningham, N. Kopco, and T. J. Martin, "Localizing nearby sound sources in a classroom: Binaural room impulse responses," J. Acoust. Soc. Amer., vol.117, no.5, pp. 3100-3115, 2005.
-
(2005)
J. Acoust. Soc. Amer
, vol.117
, Issue.5
, pp. 3100-3115
-
-
Shinn-Cunningham, B.G.1
Kopco, N.2
Martin, T.J.3
-
22
-
-
85008544097
-
Model-based expectation maximization source separation and localization
-
Feb.
-
M. I. Mandel, R. J.Weiss, and D. P. W. Ellis, "Model-based expectation maximization source separation and localization," IEEE Trans. Audio, Speech, Lang. Process., vol.18, no.2, pp. 382-394, Feb. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process
, vol.18
, Issue.2
, pp. 382-394
-
-
Mandel, M.I.1
Weiss, R.J.2
Ellis, D.P.W.3
-
23
-
-
0033692661
-
Blind separation of disjoint orthogonal signals: Demixing n sources from 2 mixtures
-
A. Jourjine, S. Rickard, and O. Y?lmaz, "Blind separation of disjoint orthogonal signals: Demixing n sources from 2 mixtures," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2000, vol.5, pp. 2985-2988.
-
(2000)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.5
, pp. 2985-2988
-
-
Jourjine, A.1
Rickard, S.2
Ylmaz, O.3
-
24
-
-
50249118229
-
A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures
-
H. Sawada, S. Araki, and S. Makino, "A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust., 2007, pp. 139-142.
-
(2007)
Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust
, pp. 139-142
-
-
Sawada, H.1
Araki, S.2
Makino, S.3
-
25
-
-
84872736510
-
A source localization/separation/respatialization system based on unsupervised classification of interaural cues
-
J. Mouba and S. Marchand, "A source localization/separation/ respatialization system based on unsupervised classification of interaural cues," in Proc. Int. Conf. Digital Audio Effects, 2006, pp. 233-238.
-
(2006)
Proc. Int. Conf. Digital Audio Effects
, pp. 233-238
-
-
Mouba, J.1
Marchand, S.2
-
26
-
-
4644336054
-
Reconstruction of missing features for robust speech recognition
-
Sep.
-
B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of missing features for robust speech recognition," Speech. Commun., vol.43, no.4, pp. 275-296, Sep. 2004.
-
(2004)
Speech. Commun
, vol.43
, Issue.4
, pp. 275-296
-
-
Raj, B.1
Seltzer, M.L.2
Stern, R.M.3
-
27
-
-
0035342414
-
Robust automatic speech recognition with missing and unreliable acoustic data
-
Jun.
-
M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech. Commun., vol.34, no.3, pp. 267-285, Jun. 2001.
-
(2001)
Speech. Commun
, vol.34
, Issue.3
, pp. 267-285
-
-
Cooke, M.1
Green, P.2
Josifovski, L.3
Vizinho, A.4
-
28
-
-
33749058582
-
Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques
-
D. Kolossa, A. Klimas, and R. Orglmeister, "Separation and robust recognition of noisy, convolutive speech mixtures using time-frequency masking and missing data techniques," in Proc. IEEEWorkshop Applicat. Signal Process. Audio Acoust., 2005, pp. 82-85.
-
(2005)
Proc. IEEEWorkshop Applicat. Signal Process. Audio Acoust
, pp. 82-85
-
-
Kolossa, D.1
Klimas, A.2
Orglmeister, R.3
-
29
-
-
0025681008
-
Hidden markov model decomposition of speech and noise
-
A. P. Varga and R. K. Moore, "Hidden markov model decomposition of speech and noise," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 1990, vol.2, pp. 845-848.
-
(1990)
Proc. IEEE Int. Conf. Acoust. Speech, Signal Process
, vol.2
, pp. 845-848
-
-
Varga, A.P.1
Moore, R.K.2
-
30
-
-
50249086925
-
Monaural speech separation using source-adapted models
-
R. J. Weiss and D. P. W. Ellis, "Monaural speech separation using source-adapted models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2007, pp. 114-117.
-
(2007)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 114-117
-
-
Weiss, R.J.1
Ellis, D.P.W.2
-
31
-
-
70450194258
-
Super-human multi-talker speech recognition: A graphical modeling approach
-
Jan.
-
J. R. Hershey, S. J. Rennie, P. A. Olsen, and T. T. Kristjansson, "Super-human multi-talker speech recognition: A graphical modeling approach," Comput. Speech Lang., Jan. 2009.
-
(2009)
Comput. Speech Lang
-
-
Hershey, J.R.1
Rennie, S.J.2
Olsen, P.A.3
Kristjansson, T.T.4
-
32
-
-
4644317224
-
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
-
Sep.
-
M. Seltzer, B. Raj, and R. Stern, "A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition," Speech Commun., vol.43, no.4, pp. 379-393, Sep. 2004.
-
(2004)
Speech Commun
, vol.43
, Issue.4
, pp. 379-393
-
-
Seltzer, M.1
Raj, B.2
Stern, R.3
-
33
-
-
34547543067
-
Missing feature speech recognition using dereverberation and echo suppression in reverberant environments
-
H.-M. Park and R. M. Stern, "Missing feature speech recognition using dereverberation and echo suppression in reverberant environments," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2007, vol.4, pp. 381-384.
-
(2007)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.4
, pp. 381-384
-
-
Park, H.-M.1
Stern, R.M.2
-
34
-
-
0141804849
-
Effects of small room reverberation upon the recognition of some consonant features
-
S. A. Gelfand and S. Silman, "Effects of small room reverberation upon the recognition of some consonant features," J. Acoust. Soc. Amer., vol.66, no.1, pp. 22-29, 1979.
-
(1979)
J. Acoust. Soc. Amer
, vol.66
, Issue.1
, pp. 22-29
-
-
Gelfand, S.A.1
Silman, S.2
-
35
-
-
67149088353
-
The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation
-
E. Vincent, S. Araki, and P. Bofill, "The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation," Ind. Compon. Anal. Signal Separat., pp. 734-741, 2009.
-
(2009)
Ind. Compon. Anal. Signal Separat
, pp. 734-741
-
-
Vincent, E.1
Araki, S.2
Bofill, P.3
-
36
-
-
33744975847
-
Performance measurement in blind audio source separation
-
DOI 10.1109/TSA.2005.858005
-
E. Vincent, R. Gribonval, and C. Fevotte, "Performance measurement in blind audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1462-1469, Jul. 2006. (Pubitemid 46547636)
-
(2006)
IEEE Transactions on Audio, Speech and Language Processing
, vol.14
, Issue.4
, pp. 1462-1469
-
-
Vincent, E.1
Gribonval, R.2
Fevotte, C.3
-
37
-
-
70349204584
-
An em algorithm for localizing multiple sound sources in reverberant environments
-
B. Schölkopf, J. Platt, and T. Hoffman, Eds. Cambridge, MA: MIT Press
-
M. I. Mandel, D. P. W. Ellis, and T. Jebara, "An EM algorithm for localizing multiple sound sources in reverberant environments," in Advances in Neural Information Processing Systems, B. Schölkopf, J. Platt, and T. Hoffman, Eds. Cambridge, MA: MIT Press, 2007, pp. 953-960.
-
(2007)
Advances in Neural Information Processing Systems
, pp. 953-960
-
-
Mandel, M.I.1
Ellis, D.P.W.2
Jebara, T.3
|