-
1
-
-
0032682770
-
Separation of speech from interfering sounds based on oscillatory correlation
-
May
-
D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Netw., vol. 10, no. 3, pp. 684-697, May 1999.
-
(1999)
IEEE Trans. Neural Netw
, vol.10
, Issue.3
, pp. 684-697
-
-
Wang, D.L.1
Brown, G.J.2
-
2
-
-
8344232372
-
A maximum likelihood approach to single-channel source separation
-
G.-J. Jang and T.-W. Lee, "A maximum likelihood approach to single-channel source separation," J. Mach. Learning Res., no. 4, pp. 1365-1392, 2003.
-
(2003)
J. Mach. Learning Res
, Issue.4
, pp. 1365-1392
-
-
Jang, G.-J.1
Lee, T.-W.2
-
3
-
-
4544247508
-
Multiband audio modeling for single-channel acoustic source separation
-
May
-
M. Reyes-Gomez, D. Ellis, and N. Jojic, "Multiband audio modeling for single-channel acoustic source separation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Proc. (ICASSP'04), May 2004, vol. 5, pp. 641-644.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Proc. (ICASSP'04)
, vol.5
, pp. 641-644
-
-
Reyes-Gomez, M.1
Ellis, D.2
Jojic, N.3
-
5
-
-
84898946024
-
One microphone source separation
-
Cambridge, MA: MIT Press
-
S. T. Roweis, "One microphone source separation," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2001, vol. 13, pp. 793-799.
-
(2001)
Advances in Neural Information Processing Systems
, vol.13
, pp. 793-799
-
-
Roweis, S.T.1
-
6
-
-
35048894844
-
Wiener based source separation with HMM/GMM using a single sensor
-
Nara, Japan, Apr
-
L. Benaroya and F. Bimbot, "Wiener based source separation with HMM/GMM using a single sensor," in Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separation (ICA'03), Nara, Japan, Apr. 2003, pp. 957-961.
-
(2003)
Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separation (ICA'03)
, pp. 957-961
-
-
Benaroya, L.1
Bimbot, F.2
-
7
-
-
4644257621
-
Single microphone source separation using high resolution signal reconstruction
-
T. Kristjansson, H. Attias, and J. Hershey, "Single microphone source separation using high resolution signal reconstruction," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04), 2004, vol. 2, pp. 817-820.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04)
, vol.2
, pp. 817-820
-
-
Kristjansson, T.1
Attias, H.2
Hershey, J.3
-
8
-
-
85159664446
-
SINOLA: A new analysis/synthesis method using spectrum peak shape distortion, phase and reassigned spectrum
-
Oct
-
G. Peeters and X. Rodet, "SINOLA: A new analysis/synthesis method using spectrum peak shape distortion, phase and reassigned spectrum," in Proc. Int. Comput. Music Conf. (ICMC'99), Oct. 1999, pp. 153-156.
-
(1999)
Proc. Int. Comput. Music Conf. (ICMC'99)
, pp. 153-156
-
-
Peeters, G.1
Rodet, X.2
-
9
-
-
0036293936
-
On the approximate W-disjoint orthogonality of speech
-
Orlando, FL, May
-
S. Rickard and O. Yilmaz, "On the approximate W-disjoint orthogonality of speech," in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'02), Orlando, FL, May 2002, vol. 3, pp. 3049-3052.
-
(2002)
IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'02)
, vol.3
, pp. 3049-3052
-
-
Rickard, S.1
Yilmaz, O.2
-
11
-
-
35048837133
-
Underdetermined source separation with structured source priors
-
Granada, Spain, Sep
-
E. Vincent and X. Rodet, "Underdetermined source separation with structured source priors," in Int. Conf. Ind. Compon. Anal. Blind Source Separation (ICA'04), Granada, Spain, Sep. 2004, pp. 327-334.
-
(2004)
Int. Conf. Ind. Compon. Anal. Blind Source Separation (ICA'04)
, pp. 327-334
-
-
Vincent, E.1
Rodet, X.2
-
12
-
-
4544386386
-
Low complexity Bayesian single channel source separation
-
T. Beierholm, B. D. Pedersen, and O. Winther, "Low complexity Bayesian single channel source separation," in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04), 2004, vol. 5, pp. 529-532.
-
(2004)
IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04)
, vol.5
, pp. 529-532
-
-
Beierholm, T.1
Pedersen, B.D.2
Winther, O.3
-
13
-
-
33947659500
-
Model-based monaural source separation using a vector-quantized phase-vocoder representation
-
Toulouse, France, May
-
D. Ellis and R.Weiss, "Model-based monaural source separation using a vector-quantized phase-vocoder representation," in IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06), Toulouse, France, May 2006, vol. 5, pp. 957-960.
-
(2006)
IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06)
, vol.5
, pp. 957-960
-
-
Ellis, D.1
Weiss, R.2
-
15
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, pp. 1-38, 1977.
-
(1977)
J. R. Statist. Soc
, vol.39
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
16
-
-
0141743693
-
New EM algorithms for source separation and deconvolution
-
H. Attias, "New EM algorithms for source separation and deconvolution," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'03), 2003, vol. 5, pp. 297-300.
-
(2003)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'03)
, vol.5
, pp. 297-300
-
-
Attias, H.1
-
17
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
-
Apr
-
J. Gauvain and C. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.1
Lee, C.2
-
18
-
-
0000159105
-
On adaptive decision rules and decision parameter adaptation for automatic speech recognition
-
Aug
-
C.-H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, no. 8, pp. 1241-1269, Aug. 2000.
-
(2000)
Proc. IEEE
, vol.88
, Issue.8
, pp. 1241-1269
-
-
Lee, C.-H.1
Huo, Q.2
-
19
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
A. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., no. 10, pp. 19-41, 2000.
-
(2000)
Digital Signal Process
, Issue.10
, pp. 19-41
-
-
Reynolds, A.1
Quatieri, T.2
Dunn, R.3
-
20
-
-
0013288412
-
Dynamic Bayesian networks: Representation, inference and learning,
-
Ph.D. dissertation, Univ. California Berkeley, Berkeley, CA, Jul
-
K. P. Murphy, "Dynamic Bayesian networks: Representation, inference and learning," Ph.D. dissertation, Univ. California Berkeley, Berkeley, CA, Jul. 2002.
-
(2002)
-
-
Murphy, K.P.1
-
21
-
-
0033225865
-
An introduction to variational methods for graphical models
-
M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul, "An introduction to variational methods for graphical models," Learning in Graphical Models, vol. 37, no. 2, pp. 183-233, 1999.
-
(1999)
Learning in Graphical Models
, vol.37
, Issue.2
, pp. 183-233
-
-
Jordan, M.I.1
Ghahramani, Z.2
Jaakkola, T.S.3
Saul, L.K.4
-
22
-
-
0009623939
-
Flexible speaker adaptation using maximum likelihood linear regression
-
C. Leggetter and P.Woodland, "Flexible speaker adaptation using maximum likelihood linear regression," in ARPA Spoken Lang. Technol. Workshop, 1995, pp. 104-109.
-
(1995)
ARPA Spoken Lang. Technol. Workshop
, pp. 104-109
-
-
Leggetter, C.1
Woodland, P.2
-
23
-
-
0030359637
-
Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation
-
Philadelphia, PA
-
M. Gales, D. Pye, and P. Woodland, "Variance compensation within the MLLR framework for robust speech recognition and speaker adaptation," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP'96), Philadelphia, PA, 1996, vol. 3, pp. 1832-1835.
-
(1996)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP'96)
, vol.3
, pp. 1832-1835
-
-
Gales, M.1
Pye, D.2
Woodland, P.3
-
24
-
-
0030640789
-
Structural MAP speaker adaptation using hierarchical priors
-
Santa Barbara, CA, Dec
-
K. Shinoda and C.-H. Lee, "Structural MAP speaker adaptation using hierarchical priors," in Proc. IEEE Workshop Speech Recognition Understanding, Santa Barbara, CA, Dec. 1997, pp. 381-388.
-
(1997)
Proc. IEEE Workshop Speech Recognition Understanding
, pp. 381-388
-
-
Shinoda, K.1
Lee, C.-H.2
-
25
-
-
85009097035
-
Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
-
Beijing, China, Oct
-
K.-T. Chen, W.-W. Liau, H.-M. Wang, and L.-S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP'00), Beijing, China, Oct. 2000, pp. 742-745.
-
(2000)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP'00)
, pp. 742-745
-
-
Chen, K.-T.1
Liau, W.-W.2
Wang, H.-M.3
Lee, L.-S.4
-
26
-
-
33744968614
-
Audio source separation with a single sensor
-
Jan
-
L. Benaroya, F. Bimbot, and R. Gribonval, "Audio source separation with a single sensor," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 191-199, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.1
, pp. 191-199
-
-
Benaroya, L.1
Bimbot, F.2
Gribonval, R.3
-
27
-
-
0028420014
-
Integrated models of signal and background with application to speaker identification in noise
-
Apr
-
R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 245-257, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.2
, pp. 245-257
-
-
Rose, R.C.1
Hofstetter, E.M.2
Reynolds, D.A.3
-
28
-
-
4444245782
-
Blind clustering of popular music recordings based on singer voice characteristics
-
W.-H. Tsai, D. Rogers, and H.-M. Wang, "Blind clustering of popular music recordings based on singer voice characteristics," Comput. Music J., vol. 28, no. 3, pp. 68-78, 2004.
-
(2004)
Comput. Music J
, vol.28
, Issue.3
, pp. 68-78
-
-
Tsai, W.-H.1
Rogers, D.2
Wang, H.-M.3
-
30
-
-
4444229791
-
Singer identification in popular music recordings using voice coding features
-
Oct
-
Y. E. Kim and B. Whitman, "Singer identification in popular music recordings using voice coding features," in Proc. Int. Symp. Music Inf. Retrieval (ISMIR'02), Oct. 2002, pp. 164-169.
-
(2002)
Proc. Int. Symp. Music Inf. Retrieval (ISMIR'02)
, pp. 164-169
-
-
Kim, Y.E.1
Whitman, B.2
-
31
-
-
13444291977
-
Singing voice detection in popular music
-
New York, Oct
-
T. L. Nwe, A. Shenoy, and Y. Wang, "Singing voice detection in popular music," in Proc. ACM Multimedia Conf., New York, Oct. 2004, pp. 324-327.
-
(2004)
Proc. ACM Multimedia Conf
, pp. 324-327
-
-
Nwe, T.L.1
Shenoy, A.2
Wang, Y.3
-
32
-
-
4544255234
-
Automatic detection and tracking of target singer in multi-singer music recordings
-
Montreal, QC, Canada
-
W. H. Tsai and H. M. Wang, "Automatic detection and tracking of target singer in multi-singer music recordings," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04),Montreal, QC, Canada, 2004, vol. 4, pp. 221-224.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04)
, vol.4
, pp. 221-224
-
-
Tsai, W.H.1
Wang, H.M.2
-
33
-
-
0032595188
-
Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition
-
Sep
-
R. Vergin, D. O'Shaughnessy, and A. Farhat, "Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 525-532, Sep. 1999.
-
(1999)
IEEE Trans. Speech Audio Process
, vol.7
, Issue.5
, pp. 525-532
-
-
Vergin, R.1
O'Shaughnessy, D.2
Farhat, A.3
-
34
-
-
33745686986
-
One microphone singing voice separation using source-adapted models
-
Mohonk, NY, Oct
-
A. Ozerov, P. Philippe, R. Gribonval, and F. Bimbot, "One microphone singing voice separation using source-adapted models," in IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA'05), Mohonk, NY, Oct. 2005, pp. 90-93.
-
(2005)
IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA'05)
, pp. 90-93
-
-
Ozerov, A.1
Philippe, P.2
Gribonval, R.3
Bimbot, F.4
-
35
-
-
0028517016
-
Space-alternating generalized expectation- maximization algorithm
-
Oct
-
J. A. Fessler and A. O. Hero, "Space-alternating generalized expectation- maximization algorithm," IEEE Trans. Signal Process., vol. 42, no. 10, pp. 2664-2677, Oct. 1994.
-
(1994)
IEEE Trans. Signal Process
, vol.42
, Issue.10
, pp. 2664-2677
-
-
Fessler, J.A.1
Hero, A.O.2
-
37
-
-
84867608170
-
Low-resource noise-robust feature post-processing on aurora 2.0
-
C.-P. Chen, J. Bilmes, and K. Kirchhoff, "Low-resource noise-robust feature post-processing on aurora 2.0," in Proc. Int. Conf. Spoken Lang. Process. (ICSLP'02), 2002, pp. 2445-2448.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process. (ICSLP'02)
, pp. 2445-2448
-
-
Chen, C.-P.1
Bilmes, J.2
Kirchhoff, K.3
-
38
-
-
85046873967
-
The DET curve in assessment of detection task performance
-
A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eur. Conf. Speech Commun. Technol. (EuroSpeech'97), 1997, pp. 1895-1898.
-
(1997)
Proc. Eur. Conf. Speech Commun. Technol. (EuroSpeech'97)
, pp. 1895-1898
-
-
Martin, A.1
Doddington, G.2
Kamm, T.3
Ordowski, M.4
Przybocki, M.5
-
39
-
-
0020102027
-
Least squares quantization in PCM
-
Mar
-
S. P. Lloyd, "Least squares quantization in PCM," IEEE Trans. Inf. Theory, vol. IT-28, no. 2, pp. 129-137, Mar. 1982.
-
(1982)
IEEE Trans. Inf. Theory
, vol.IT-28
, Issue.2
, pp. 129-137
-
-
Lloyd, S.P.1
-
40
-
-
0348196088
-
Proposals for performance measurement in source separation
-
Apr
-
R. Gribonval, L. Benaroya, E. Vincent, and C. Févotte, "Proposals for performance measurement in source separation," in Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separation (ICA'03), Apr. 2003, pp. 763-768.
-
(2003)
Proc. Int. Conf. Ind. Compon. Anal. Blind Source Separation (ICA'03)
, pp. 763-768
-
-
Gribonval, R.1
Benaroya, L.2
Vincent, E.3
Févotte, C.4
|