SCOPUS Information Search Platform

IEEE Signal Processing Magazine

Volume 32, Issue 2, 2015, Pages 125-144

Compositional models for audio processing: Uncovering the structure of sound mixtures

Authors (4): Virtanen, Tuomas (a); Gemmeke, Jort Florent (b); Raj, Bhiksha (c); Smaragdis, Paris (d)

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

b UNIVERSITY OF LEUVEN (Belgium)

c MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

d UNIVERSITY OF ILLINOIS AT URBANA CHAMPAIGN (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO ACOUSTICS;

AUDIO PROCESSING; COMPOSITIONAL DATA; COMPOSITIONAL MODELS; LINEAR COMBINATIONS; NON NEGATIVES; SOUND MIXTURES; TOTAL COUNTS;

POPULATION STATISTICS;

EID: 85032751297 PISSN: 10535888 EISSN: 15580792 Source Type: Journal
DOI: 10.1109/MSP.2013.2288990 Document Type: Article

Times cited : (68)

References (64)

1
- 84891283756
- Hoboken NJ: Wiley
- A. Cichocki, R. Zdunek, A. H. Phan, and S. Amari, Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation. Hoboken, NJ: Wiley, 2009.
- (2009) Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation
- Cichocki, A.¹ Zdunek, R.² Phan, A.H.³ Amari, S.⁴

2
- 38049021850
- Convolutive speech bases and their application to supervised speech separation
- P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Processing, vol. 15, no. 1, pp. 1-12, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Processing , vol.15 , Issue.1 , pp. 1-12
- Smaragdis, P.¹

3
- 50249152311
- Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
- T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Processing, vol. 15, no. 3, pp. 1066-1074, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Processing , vol.15 , Issue.3 , pp. 1066-1074
- Virtanen, T.¹

4
- 79960657803
- Exemplar-based sparse representations for noise robust automatic speech recognition
- J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Processing, vol. 19, no. 7, pp. 2067-2080, 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Processing , vol.19 , Issue.7 , pp. 2067-2080
- Gemmeke, J.¹ Virtanen, T.² Hurmalainen, A.³

5
- 84873616077
- Musical instrument recognition in polyphonic audio using source-filter model for sound separation
- Kobe, Japan
- T. Heittola, A. Klapuri, and T. Virtanen, "Musical instrument recognition in polyphonic audio using source-filter model for sound separation," in Proc. Int. Conf. Music Information Retrieval, Kobe, Japan, 2009, pp. 327-332.
- (2009) Proc. Int. Conf. Music Information Retrieval , pp. 327-332
- Heittola, T.¹ Klapuri, A.² Virtanen, T.³

6
- 84893675434
- The TUM+TUT+KUL approach to the CHiME Challenge 2013: Multi-stream ASR exploiting BLSTM networks and sparse NMF
- Vancouver, Canada
- J. T. Geiger, F. Weninger, A. Hurmalainen, J. F. Gemmeke, M. Wöllmer, B. Schuller, G. Rigoll, and T. Virtanen, "The TUM+TUT+KUL approach to the CHiME Challenge 2013: Multi-stream ASR exploiting BLSTM networks and sparse NMF," in Proc. 2nd Int. Workshop on Machine Listening in Multisource Environments, Vancouver, Canada, 2013, pp. 25-30.
- (2013) Proc. 2nd Int. Workshop on Machine Listening in Multisource Environments , pp. 25-30
- Geiger, J.T.¹ Weninger, F.² Hurmalainen, A.³ Gemmeke, J.F.⁴ Wöllmer, M.⁵ Schuller, B.⁶ Rigoll, G.⁷ Virtanen, T.⁸

7
- 18444370569
- Nonnegative features of spectro-temporal sounds for classification
- Y.-C. Cho and S. Choi, "Nonnegative features of spectro-temporal sounds for classification," Pattern Recognit. Lett., vol. 26, no. 9, pp. 1327-1336, 2005.
- (2005) Pattern Recognit. Lett. , vol.26 , Issue.9 , pp. 1327-1336
- Cho, Y.-C.¹ Choi, S.²

8
- 76949083547
- Enforcing harmonicity and smoothness in Bayesian nonnegative matrix factorization applied to polyphonic music transcription
- N. Bertin, R. Badeau, and E. Vincent, "Enforcing harmonicity and smoothness in Bayesian nonnegative matrix factorization applied to polyphonic music transcription," IEEE Trans. Audio, Speech, Lang. Processing, vol. 18, no. 3, pp. 538-549, 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Processing , vol.18 , Issue.3 , pp. 538-549
- Bertin, N.¹ Badeau, R.² Vincent, E.³

9
- 33745219863
- Bandwidth expansion of narrowband speech using non-negative matrix factorization
- Lisbon, Portugal
- D. Bansal, B. Raj, and P. Smaragdis, "Bandwidth expansion of narrowband speech using non-negative matrix factorization," in Proc. EUROSPEECH, Lisbon, Portugal, 2005, pp. 1505-1508.
- (2005) Proc. EUROSPEECH , pp. 1505-1508
- Bansal, D.¹ Raj, B.² Smaragdis, P.³

10
- 76949094445
- Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation
- A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. Audio, Speech, Lang. Processing, vol. 18, no. 3, pp. 550-563, 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Processing , vol.18 , Issue.3 , pp. 550-563
- Ozerov, A.¹ Févotte, C.²

11
- 77952744810
- Sparse representations in audio & music: From coding to source separation
- M. D. Plumbley, T. Blumensath, L. Daudet, R. Gribonval, and M. E. Davies, "Sparse representations in audio & music: From coding to source separation," Proc. IEEE, vol. 98, no. 6, pp. 995-1005, 2009.
- (2009) Proc. IEEE , vol.98 , Issue.6 , pp. 995-1005
- Plumbley, M.D.¹ Blumensath, T.² Daudet, L.³ Gribonval, R.⁴ Davies, M.E.⁵

12
- 84866031901
- Object-based audio coding using nonnegative matrix factorization for the spectrogram representation
- J. Nikunen and T. Virtanen, "Object-based audio coding using nonnegative matrix factorization for the spectrogram representation," in Proc. 128th Audio Engineering Society Convention, London, 2010.
- (2010) Proc. 128th Audio Engineering Society Convention, London
- Nikunen, J.¹ Virtanen, T.²

13
- 63249085556
- Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis
- C. Févotte, N. Bertin, and J.-L. Durrieu, "Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis," Neural Computat., vol. 21, no. 3, pp. 793-830, 2009.
- (2009) Neural Computat. , vol.21 , Issue.3 , pp. 793-830
- Févotte, C.¹ Bertin, N.² Durrieu, J.-L.³

14
- 85162005859
- Sparse overcomplete latent variable decomposition of counts data
- M. Shashanka, B. Raj, and P. Smaragdis, "Sparse overcomplete latent variable decomposition of counts data," in Proc. Neural Information Processing Systems, Vancouver, Canada, 2007, pp. 1313-1320.
- (2007) Proc. Neural Information Processing Systems, Vancouver, Canada , pp. 1313-1320
- Shashanka, M.¹ Raj, B.² Smaragdis, P.³

15
- 41249089920
- On the equivalence between nonnegative matrix factorization and probabilistic latent semantic indexing
- C. Ding, T. Li, and W. Peng, "On the equivalence between nonnegative matrix factorization and probabilistic latent semantic indexing," Computat. Stat. Data Anal., vol. 52, no. 8, pp. 3913-3927, 2008.
- (2008) Computat. Stat. Data Anal. , vol.52 , Issue.8 , pp. 3913-3927
- Ding, C.¹ Li, T.² Peng, W.³

16
- 0004236521
- Berlin: Springer-Verlag
- E. Zwicker and H. Fastl, Psychoacoustics: Facts and Models. Berlin: Springer-Verlag, 1990.
- (1990) Psychoacoustics: Facts and Models
- Zwicker, E.¹ Fastl, H.²

17
- 0004712975
- San Diego, CA: Academic Press
- B. C. J. Moore, Ed., Hearing-Handbook of Perception and Cognition, 2nd ed. San Diego, CA: Academic Press, 1995.
- (1995) Hearing-Handbook of Perception and Cognition, 2nd Ed
- Moore, B.C.J.¹

18
- 84870706588
- Optimal cost function and magnitude power for NMF-based speech separation and music interpolation
- Santander, Spain
- B. King, C. Févotte, and P. Smaragdis, "Optimal cost function and magnitude power for NMF-based speech separation and music interpolation," in Proc. IEEE Int. Workshop on Machine Learning for Signal Processing, Santander, Spain, 2012, pp. 1-6.
- (2012) Proc. IEEE Int. Workshop on Machine Learning for Signal Processing , pp. 1-6
- King, B.¹ Févotte, C.² Smaragdis, P.³

19
- 84878114938
- Constrained nonnegative sparse coding using learnt instrument templates for realtime music transcription
- J. Carabias-Orti, F. Rodriguez-Serrano, P. Vera-Candeas, F. Canadas-Quesada, and N. Ruiz-Reyes, "Constrained nonnegative sparse coding using learnt instrument templates for realtime music transcription," in Proc. Engineering Applications of Artificial Intelligence, 2013, pp. 1671-1680.
- (2013) Proc. Engineering Applications of Artificial Intelligence , pp. 1671-1680
- Carabias-Orti, J.¹ Rodriguez-Serrano, F.² Vera-Candeas, P.³ Canadas-Quesada, F.⁴ Ruiz-Reyes, N.⁵

20
- 84866042020
- Optimization and parallelization of monaural source separation algorithms in the openBliSSART toolkit
- F. Weninger and B. Schuller, "Optimization and parallelization of monaural source separation algorithms in the openBliSSART toolkit," J. Signal Process. Syst., vol. 69, no. 3, pp. 267-277, 2012.
- (2012) J. Signal Process. Syst. , vol.69 , Issue.3 , pp. 267-277
- Weninger, F.¹ Schuller, B.²

21
- 80051594038
- Algorithms for nonnegative matrix factorization with the beta-divergence
- C. Févotte and J. Idier, "Algorithms for nonnegative matrix factorization with the beta-divergence," Neural Computat., vol. 23, no. 9, pp. 2421-2456, 2011.
- (2011) Neural Computat. , vol.23 , Issue.9 , pp. 2421-2456
- Févotte, C.¹ Idier, J.²

22
- 0001093042
- Algorithms for nonnegative matrix factorization
- Denver, CO
- D. D. Lee and H. S. Seung, "Algorithms for nonnegative matrix factorization," in Proc. Neural Information Processing Systems, Denver, CO, 2000, pp. 556-562.
- (2000) Proc. Neural Information Processing Systems , pp. 556-562
- Lee, D.D.¹ Seung, H.S.²

23
- 34247173538
- Nonnegative matrix factorization with constrained second-order optimization
- R. Zdunek and A. Cichocki, "Nonnegative matrix factorization with constrained second-order optimization," Signal Process., vol. 87, no. 8, pp. 1904-1916, 2007.
- (2007) Signal Process. , vol.87 , Issue.8 , pp. 1904-1916
- Zdunek, R.¹ Cichocki, A.²

24
- 84863012243
- Fast nonnegative matrix factorization: An active-set-like method and comparisons
- J. Kim and H. Park, "Fast nonnegative matrix factorization: An active-set-like method and comparisons," SIAM J. Sci. Comput., vol. 33, no. 6, pp. 3261-3281, 2011.
- (2011) SIAM J. Sci. Comput. , vol.33 , Issue.6 , pp. 3261-3281
- Kim, J.¹ Park, H.²

25
- 84886818613
- Active-set Newton algorithm for overcomplete nonnegative representations of audio
- T. Virtanen, J. Gemmeke, and B. Raj, "Active-set Newton algorithm for overcomplete nonnegative representations of audio," IEEE Trans. Audio, Speech, Lang. Processing, vol. 21, no. 11, 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Processing , vol.21 , Issue.11
- Virtanen, T.¹ Gemmeke, J.² Raj, B.³

26
- 0034818212
- Unsupervised learning by probabilistic latent semantic analysis
- T. Hofmann, "Unsupervised learning by probabilistic latent semantic analysis," Mach. Learn., vol. 42, no. 1-2, pp. 177-196, 2001.
- (2001) Mach. Learn. , vol.42 , Issue.1-2 , pp. 177-196
- Hofmann, T.¹

27
- 47649133016
- Probabilistic latent variable models as nonnegative factorizations
- M. Shashanka, B. Raj, and P. Smaragdis, "Probabilistic latent variable models as nonnegative factorizations," Computat. Intell. Neurosci., vol. 2008, 2008.
- (2008) Computat. Intell. Neurosci. , vol.2008
- Shashanka, M.¹ Raj, B.² Smaragdis, P.³

28
- 78349237022
- Nonnegative hidden Markov modeling of audio with application to source separation
- St. Malo, France
- G. J. Mysore, P. Smaragdis, and B. Raj, "Nonnegative hidden Markov modeling of audio with application to source separation," in Proc. 9th Int. Conf. Latent Variable Analysis and Signal Separation, St. Malo, France, 2010, pp. 140-148.
- (2010) Proc. 9th Int. Conf. Latent Variable Analysis and Signal Separation , pp. 140-148
- Mysore, G.J.¹ Smaragdis, P.² Raj, B.³

29
- 81855166765
- Missing data imputation for time-frequency representations of audio signals
- P. Smaragdis, B. Raj, and M. Shashanka, "Missing data imputation for time-frequency representations of audio signals," J. Signal Process. Syst., vol. 11, no. 3, pp. 361-370, 2011.
- (2011) J. Signal Process. Syst. , vol.11 , Issue.3 , pp. 361-370
- Smaragdis, P.¹ Raj, B.² Shashanka, M.³

30
- 47649123078
- Theorems on positive data: On the uniqueness of NMF
- H. Laurberg, M. G. Christensen, M. D. Plumbley, L. K. Hansen, and S. H. Jensen, "Theorems on positive data: On the uniqueness of NMF," Computat. Intell. Neurosci., vol. 2008, 2008.
- (2008) Computat. Intell. Neurosci. , vol.2008
- Laurberg, H.¹ Christensen, M.G.² Plumbley, M.D.³ Hansen, L.K.⁴ Jensen, S.H.⁵

31
- 10944227316
- Sparse coding and NMF
- Budapest, Hungary
- J. Eggert and E. Körner, "Sparse coding and NMF," in Proc. IEEE Int. Joint Conf. Neural Networks, Budapest, Hungary, 2004, pp. 2529-2533.
- (2004) Proc. IEEE Int. Joint Conf. Neural Networks , pp. 2529-2533
- Eggert, J.¹ Körner, E.²

32
- 84900510076
- Nonnegative matrix factorization with sparseness constraints
- P. O. Hoyer, "Nonnegative matrix factorization with sparseness constraints," J. Mach. Learn. Res., vol. 5, pp. 1457-1469, 2004.
- (2004) J. Mach. Learn. Res. , vol.5 , pp. 1457-1469
- Hoyer, P.O.¹

33
- 38149085924
- Ph.D. dissertation Natl. Univ. of Ireland, Maynooth
- P. D. O'Grady, "Sparse separation of underdetermined speech mixtures," Ph.D. dissertation, Natl. Univ. of Ireland, Maynooth, 2007.
- (2007) Sparse Separation of Underdetermined Speech Mixtures
- O'Grady, P.D.¹

34
- 84863746770
- Spectral covariance in prior distributions of nonnegative matrix factorization based speech separation
- Glasgow, Scotland
- T. Virtanen, "Spectral covariance in prior distributions of nonnegative matrix factorization based speech separation," in Proc. European Signal Processing Conf., Glasgow, Scotland, 2009, pp. 1933-1937.
- (2009) Proc. European Signal Processing Conf. , pp. 1933-1937
- Virtanen, T.¹

35
- 84858719009
- A sparse non-parametric approach for single channel separation of known sounds
- Vancouver, Canada
- P. Smaragdis, M. Shashanka, and B. Raj, "A sparse non-parametric approach for single channel separation of known sounds," in Proc. Neural Information Processing Systems, Vancouver, Canada, 2009, pp. 1705-1713.
- (2009) Proc. Neural Information Processing Systems , pp. 1705-1713
- Smaragdis, P.¹ Shashanka, M.² Raj, B.³

36
- 0021407831
- Signal estimation from modified short-time Fourier transform
- D. Griffin and J. Lim, "Signal estimation from modified short-time Fourier transform," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 32, no. 2, pp. 236-242, 1984.
- (1984) IEEE Trans. Acoustics, Speech, Signal Processing , vol.32 , Issue.2 , pp. 236-242
- Griffin, D.¹ Lim, J.²

37
- 84873346243
- Consistent Wiener filtering for audio source separation
- J. Le Roux and E. Vincent, "Consistent Wiener filtering for audio source separation," IEEE Signal Processing Lett., vol. 20, no. 3, pp. 217-220, 2013.
- (2013) IEEE Signal Processing Lett. , vol.20 , Issue.3 , pp. 217-220
- Le Roux, J.¹ Vincent, E.²

38
- 30844440076
- K-SVD and its nonnegative variant for dictionary design
- San Diego, CA
- M. Aharon, M. Elad, and A. Bruckstein, "K-SVD and its nonnegative variant for dictionary design," in Proc. SPIE Conf. Wavelet Applications in Signal and Image Processing XI, San Diego, CA, 2005, pp. 327-339.
- (2005) Proc. SPIE Conf. Wavelet Applications in Signal and Image Processing XI , pp. 327-339
- Aharon, M.¹ Elad, M.² Bruckstein, A.³

39
- 85032751965
- Compressive sensing
- R. G. Baraniuk, "Compressive sensing," IEEE Signal Processing Mag., vol. 24, no. 4, pp. 118-121, 2007.
- (2007) IEEE Signal Processing Mag. , vol.24 , Issue.4 , pp. 118-121
- Baraniuk, R.G.¹

40
- 80051647466
- Itakura-Saito nonnegative matrix factorization with group sparsity
- Prague, Czech Republic
- A. Lefèvre, F. Bach, and C. Févotte, "Itakura-Saito nonnegative matrix factorization with group sparsity," in Proc. IEEE Int. Conf. Audio, Speech and Signal Processing, Prague, Czech Republic, 2011, pp. 21-24.
- (2011) Proc. IEEE Int. Conf. Audio, Speech and Signal Processing , pp. 21-24
- Lefèvre, A.¹ Bach, F.² Févotte, C.³

41
- 67650927380
- Bayesian inference for nonnegative matrix factorisation models
- A. T. Cemgil, "Bayesian inference for nonnegative matrix factorisation models," Computat. Intell. Neurosci., vol. 2009, 2009.
- (2009) Computat. Intell. Neurosci. , vol.2009
- Cemgil, A.T.¹

42
- 84863798093
- Infinite nonnegative matrix factorizations
- Aalborg, Denmark
- M. N. Schmidt and M. Mørup, "Infinite nonnegative matrix factorizations," in Proc. European Signal Processing Conf., Aalborg, Denmark, 2010.
- (2010) Proc. European Signal Processing Conf.
- Schmidt, M.N.¹ Mørup, M.²

43
- 67149090611
- Bayesian nonnegative matrix factorization
- Paraty, Brazil
- M. N. Schmidt, O. Winther, and L. K. Hansen, "Bayesian nonnegative matrix factorization," in Proc. 8th Int. Conf. Independent Component Analysis and Blind Signal Separation, Paraty, Brazil, 2009, pp. 540-547.
- (2009) Proc. 8th Int. Conf. Independent Component Analysis and Blind Signal Separation , pp. 540-547
- Schmidt, M.N.¹ Winther, O.² Hansen, L.K.³

44
- 84878609401
- Group sparsity for speaker identity discrimination in factorisation-based speech recognition
- Portland, OR
- A. Hurmalainen, R. Saeidi, and T. Virtanen, "Group sparsity for speaker identity discrimination in factorisation-based speech recognition," in Proc. Interspeech 2012, Portland, OR.
- (2012) Proc. Interspeech
- Hurmalainen, A.¹ Saeidi, R.² Virtanen, T.³

45
- 78049392891
- Bayesian compressive sensing for phonetic classification
- Dallas, TX
- T. N. Sainath, A. Carmi, D. Kanevsky, and B. Ramabhadran, "Bayesian compressive sensing for phonetic classification," in Proc. IEEE Int. Conf. Audio, Speech and Signal Processing, Dallas, TX, 2010, pp. 4370-4373.
- (2010) Proc. IEEE Int. Conf. Audio, Speech and Signal Processing , pp. 4370-4373
- Sainath, T.N.¹ Carmi, A.² Kanevsky, D.³ Ramabhadran, B.⁴

46
- 79959843124
- Using sparse representations for exemplar based continuous digit recognition
- Glasgow, Scotland
- J. Gemmeke, L. ten Bosch, L. Boves, and B. Cranen, "Using sparse representations for exemplar based continuous digit recognition," in Proc. European Signal Processing Conf., Glasgow, Scotland, 2009, pp. 24-28.
- (2009) Proc. European Signal Processing Conf. , pp. 24-28
- Gemmeke, J.¹ Ten Bosch, L.² Boves, L.³ Cranen, B.⁴

47
- 84865759533
- Mapping sparse representation to state likelihoods in noise-robust automatic speech recognition
- Florence, Italy
- K. Mahkonen, A. Hurmalainen, T. Virtanen, and J. F. Gemmeke, "Mapping sparse representation to state likelihoods in noise-robust automatic speech recognition," in Proc. Interspeech 2011, Florence, Italy, pp. 465-468.
- (2011) Proc. Interspeech , pp. 465-468
- Mahkonen, K.¹ Hurmalainen, A.² Virtanen, T.³ Gemmeke, J.F.⁴

48
- 84878576404
- Using sparse classification outputs as feature observations for noise-robust ASR
- Portland, OR
- Y. Sun, B. Cranen, J. F. Gemmeke, L. Boves, L. ten Bosch, and M. M. Doss, "Using sparse classification outputs as feature observations for noise-robust ASR," in Proc. Interspeech 2012, Portland, OR.
- (2012) Proc. Interspeech
- Sun, Y.¹ Cranen, B.² Gemmeke, J.F.³ Boves, L.⁴ Ten Bosch, L.⁵ Doss, M.M.⁶

49
- 84878572738
- Enhancing exemplar-based posteriors for speech recognition tasks
- Portland, OR
- T. N. Sainath, D. Nahamoo, D. Kanevsky, and B. Ramabhadran, "Enhancing exemplar-based posteriors for speech recognition tasks," in Proc. Interspeech 2012, Portland, OR.
- (2012) Proc. Interspeech
- Sainath, T.N.¹ Nahamoo, D.² Kanevsky, D.³ Ramabhadran, B.⁴

50
- 34547524604
- Bandwidth expansion with a Polya Urn model
- Honolulu, HI
- B. Raj, R. Singh, M. Shashanka, and P. Smaragdis, "Bandwidth expansion with a Polya Urn model," in Proc. IEEE Int. Conf. Audio, Speech and Signal Processing, Honolulu, HI, 2007, pp. IV-597-IV-600.
- (2007) Proc. IEEE Int. Conf. Audio, Speech and Signal Processing , pp. IV597-IV600
- Raj, B.¹ Singh, R.² Shashanka, M.³ Smaragdis, P.⁴

51
- 84874248255
- Exemplar-based voice conversion in noisy environment
- R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. IEEE Spoken Language Technology Workshop, 2012, pp. 313-317.
- (2012) Proc. IEEE Spoken Language Technology Workshop , pp. 313-317
- Takashima, R.¹ Takiguchi, T.² Ariki, Y.³

52
- 84901803470
- Exemplar-based voice conversion using nonnegative spectrogram deconvolution
- Barcelona, Spain
- Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar-based voice conversion using nonnegative spectrogram deconvolution," in Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013, pp. 201-206.
- (2013) Proc. 8th ISCA Speech Synthesis Workshop , pp. 201-206
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

53
- 77949695902
- Compressive sensing for missing data imputation in noise robust speech recognition
- J. F. Gemmeke, H. Vanhamme, B. Cranen, and L. Boves, "Compressive sensing for missing data imputation in noise robust speech recognition," IEEE J. Sel. Top. Signal Processing, vol. 4, no. 2, pp. 272-287, 2010.
- (2010) IEEE J. Sel. Top. Signal Processing , vol.4 , Issue.2 , pp. 272-287
- Gemmeke, J.F.¹ Vanhamme, H.² Cranen, B.³ Boves, L.⁴

54
- 79953665879
- Computational auditory induction as a missing-data model-fitting problem with Bregman divergence
- J. Le Roux, H. Kameoka, N. Ono, A. de Cheveigné, and S. Sagayama, "Computational auditory induction as a missing-data model-fitting problem with Bregman divergence," SIAM J. Sci. Comput., vol. 54, no. 5, pp. 658-676, 2011.
- (2011) SIAM J. Sci. Comput. , vol.54 , Issue.5 , pp. 658-676
- Le Roux, J.¹ Kameoka, H.² Ono, N.³ De Cheveigné, A.⁴ Sagayama, S.⁵

55
- 80052984197
- A musically motivated mid-level representation for pitch estimation and musical audio source separation
- J.-L. Durrieu, B. David, and G. Richard, "A musically motivated mid-level representation for pitch estimation and musical audio source separation," IEEE J. Sel. Top. Signal Processing, vol. 5, no. 6, pp. 1180-1191, 2011.
- (2011) IEEE J. Sel. Top. Signal Processing , vol.5 , Issue.6 , pp. 1180-1191
- Durrieu, J.-L.¹ David, B.² Richard, G.³

56
- 80053034549
- Musical instrument sound multi-excitation model for nonnegative spectrogram factorization
- J. Carabias-Orti, T. Virtanen, P. Vera-Candeas, N. Ruiz-Reyes, and F. Canadas-Quesada, "Musical instrument sound multi-excitation model for nonnegative spectrogram factorization," IEEE J. Sel. Top. Signal Processing, vol. 5, no. 6, pp. 1144-1158, 2011.
- (2011) IEEE J. Sel. Top. Signal Processing , vol.5 , Issue.6 , pp. 1144-1158
- Carabias-Orti, J.¹ Virtanen, T.² Vera-Candeas, P.³ Ruiz-Reyes, N.⁴ Canadas-Quesada, F.⁵

57
- 85162323716
- Generalized coupled tensor factorization
- Granada, Spain
- Y. K. Yilmaz, A. T. Cemgil, and U. Simsekli, "Generalized coupled tensor factorization," in Proc. Neural Information Processing Systems, Granada, Spain, 2011, pp. 2151-2159.
- (2011) Proc. Neural Information Processing Systems , pp. 2151-2159
- Yilmaz, Y.K.¹ Cemgil, A.T.² Simsekli, U.³

58
- 84897584695
- A general flexible framework for the handling of prior information in audio source separation
- A. Ozerov, E. Vincent, and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1118-1133, 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.4 , pp. 1118-1133
- Ozerov, A.¹ Vincent, E.² Bimbot, F.³

59
- 80051616429
- I-divergence-based dereverberation method with auxiliary function approach
- Prague, Czech Republic
- N. Yasuraoka, H. Kameoka, T. Yoshioka, and H. G. Okuno, "I-divergence-based dereverberation method with auxiliary function approach," in Proc. IEEE Int. Conf. Audio, Speech and Signal Processing, Prague, Czech Republic, 2011, pp. 369-372.
- (2011) Proc. IEEE Int. Conf. Audio, Speech and Signal Processing , pp. 369-372
- Yasuraoka, N.¹ Kameoka, H.² Yoshioka, T.³ Okuno, H.G.⁴

60
- 78049380787
- Latent-variable decomposition based dereverberation of monaural and multi-channel signals
- Dallas, TX
- R. Singh, B. Raj, and P. Smaragdis, "Latent-variable decomposition based dereverberation of monaural and multi-channel signals," in Proc. IEEE Int. Conf. Audio, Speech and Signal Processing, Dallas, TX, 2010, pp. 1914-1917.
- (2010) Proc. IEEE Int. Conf. Audio, Speech and Signal Processing , pp. 1914-1917
- Singh, R.¹ Raj, B.² Smaragdis, P.³

61
- 84857258863
- The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments
- Florence, Italy
- F. Weninger, J. Geiger, M. Wöllmer, B. Schuller, and G. Rigoll, "The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments," in Proc. Int. Workshop on Machine Listening in Multisource Environments, Florence, Italy, 2011, pp. 24-29.
- (2011) Proc. Int. Workshop on Machine Listening in Multisource Environments , pp. 24-29
- Weninger, F.¹ Geiger, J.² Wöllmer, M.³ Schuller, B.⁴ Rigoll, G.⁵

62
- 47649088496
- Extended nonnegative tensor factorisation models for musical source separation
- D. FitzGerald, M. Cranitch, and E. Coyle, "Extended nonnegative tensor factorisation models for musical source separation," Computat. Intell. Neurosci., vol. 2008, 2008.
- (2008) Computat. Intell. Neurosci. , vol.2008
- FitzGerald, D.¹ Cranitch, M.² Coyle, E.³

63
- 0002740437
- Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multimodal factor analysis
- R. A. Harshman, "Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multimodal factor analysis," in UCLA Working Papers in Phonetics, vol. 16, pp. 1-84, 1970.
- (1970) UCLA Working Papers in Phonetics , vol.16 , pp. 1-84
- Harshman, R.A.¹

64
- 80051614386
- Formulations and algorithms for multichannel complex NMF
- Prague, Czech Republic
- H. Sawada, H. Kameoka, S. Araki, and N. Ueda, "Formulations and algorithms for multichannel complex NMF," in Proc. IEEE Int. Conf. Audio, Speech and Signal Processing, Prague, Czech Republic, 2011, pp. 229-232.
- (2011) Proc. IEEE Int. Conf. Audio, Speech and Signal Processing , pp. 229-232
- Sawada, H.¹ Kameoka, H.² Araki, S.³ Ueda, N.⁴

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.