메뉴 건너뛰기




Volumn 69, Issue 3, 2012, Pages 267-277

Optimization and parallelization of monaural source separation algorithms in the openBliSSART toolkit

Author keywords

Audio source separation; Parallel computing; Speech enhancement

Indexed keywords

APPLICATION FRAMEWORKS; AUDIO EFFECTS; AUDIO SOURCE SEPARATION; AUDIO-RECOGNITION; COMPUTATION TIME; COMPUTE UNIFIED DEVICE ARCHITECTURES; END-USER APPLICATIONS; GRAPHICS PROCESSING UNITS; MATRIX FACTORIZATIONS; MEMORY USAGE; MONAURAL SOURCE; NUMERICAL OPTIMIZATIONS; PARALLEL PROCESSING; PARALLELIZATIONS; REALTIME PROCESSING; SEPARATION ALGORITHMS;

EID: 84866042020     PISSN: 19398018     EISSN: 19398115     Source Type: Journal    
DOI: 10.1007/s11265-012-0673-7     Document Type: Article
Times cited : (27)

References (38)
  • 5
    • 63249085556 scopus 로고    scopus 로고
    • Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
    • Févotte, C., Bertin, N., & Durrieu, J. L. (2009). Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis. Neural Computation, 21(3), 793-830.
    • (2009) Neural Computation , vol.21 , Issue.3 , pp. 793-830
    • Févotte, C.1    Bertin, N.2    Durrieu, J.L.3
  • 6
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • DOI 10.1109/JPROC.2004.840301, Program Generation, Optimization and Platform Adaptation
    • Frigo, M., &Johnson, S. G. (2005). The design and implementation of FFTW3. Proceedings of the IEEE, 93(2), 216-231. (Pubitemid 40851223)
    • (2005) Proceedings of the IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 8
    • 84863740422 scopus 로고    scopus 로고
    • Toward a practical implementation of exemplarbased noise robust ASR
    • Gemmeke, J. F., Hurmalainen, A., Virtanen, T., & Sun, Y. (2011). Toward a practical implementation of exemplarbased noise robust ASR. In Proc. of EUSIPCO (pp. 1490-1494).
    • (2011) Proc. of EUSIPCO , pp. 1490-1494
    • Gemmeke, J.F.1    Hurmalainen, A.2    Virtanen, T.3    Sun, Y.4
  • 10
    • 84876152720 scopus 로고    scopus 로고
    • Sound event detection in multisource environments using source separation
    • Florence, Italy
    • Heittola, T., Mesaros, A., Virtanen, T., & Eronen, A. (2011). Sound event detection in multisource environments using source separation. In Proc. of CHiME workshop (pp. 86-90). Florence, Italy.
    • (2011) Proc. of CHiME Workshop , pp. 86-90
    • Heittola, T.1    Mesaros, A.2    Virtanen, T.3    Eronen, A.4
  • 11
    • 84863690059 scopus 로고    scopus 로고
    • Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine
    • Antalya, Turkey
    • Helen, M., & Virtanen, T. (2005). Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine. In Proc. of EUSIPCO. Antalya, Turkey.
    • (2005) Proc. of EUSIPCO.
    • Helen, M.1    Virtanen, T.2
  • 12
    • 80051620372 scopus 로고    scopus 로고
    • Nonnegative matrix deconvolution in noise robust speech recognition
    • Prague, Czech Republic
    • Hurmalainen, A., Gemmeke, J., & Virtanen, T. (2011). Nonnegative matrix deconvolution in noise robust speech recognition. In Proc. of ICASSP (pp. 4588-4591). Prague, Czech Republic.
    • (2011) Proc. of ICASSP , pp. 4588-4591
    • Hurmalainen, A.1    Gemmeke, J.2    Virtanen, T.3
  • 14
    • 84898964201 scopus 로고    scopus 로고
    • Algorithms for nonnegative matrix factorization
    • Vancouver, Canada
    • Lee, D. D., & Seung, H. S. (2001). Algorithms for nonnegative matrix factorization. In Proc. of NIPS (pp. 556-562). Vancouver, Canada.
    • (2001) Proc. of NIPS , pp. 556-562
    • Lee, D.D.1    Seung, H.S.2
  • 16
    • 70350232529 scopus 로고    scopus 로고
    • Discovering convolutive speech phones using sparseness and nonnegativity constraints
    • London, UK
    • O'Grady, P. D., & Pearlmutter, B. A. (2007). Discovering convolutive speech phones using sparseness and nonnegativity constraints. In Proc. of ICA. London, UK.
    • (2007) Proc. of ICA.
    • O'Grady, P.D.1    Pearlmutter, B.A.2
  • 17
    • 77950116181 scopus 로고    scopus 로고
    • Factorial scaled hidden Markov model for polyphonic audio representation and source separation
    • Mohonk, NY, United States
    • Ozerov, A., Févotte, C., & Charbit, M. (2009). Factorial scaled hidden Markov model for polyphonic audio representation and source separation. In Proc. of WASPAA (pp. 121-124). Mohonk, NY, United States.
    • (2009) Proc. of WASPAA , pp. 121-124
    • Ozerov, A.1    Févotte, C.2    Charbit, M.3
  • 18
    • 84866037355 scopus 로고    scopus 로고
    • Using the FASST source separation toolbox for noise robust speech recognition
    • Florence, Italy
    • Ozerov, A., & Vincent, E. (2011). Using the FASST source separation toolbox for noise robust speech recognition. In Proc. of CHiME workshop (pp. 86-87). Florence, Italy.
    • (2011) Proc. of CHiME Workshop , pp. 86-87
    • Ozerov, A.1    Vincent, E.2
  • 19
    • 79959818117 scopus 로고    scopus 로고
    • Nonnegative matrix factorization based compensation of music for automatic speech recognition
    • Makuhari, Japan
    • Raj, B., Virtanen, T., Chaudhuri, S., & Singh, R. (2010). Nonnegative matrix factorization based compensation of music for automatic speech recognition. In Proc. of Interspeech. Makuhari, Japan.
    • (2010) Proc. of Interspeech.
    • Raj, B.1    Virtanen, T.2    Chaudhuri, S.3    Singh, R.4
  • 20
    • 44949110218 scopus 로고    scopus 로고
    • Single-channel speech separation using sparse non-negative matrix factorization
    • Pittsburgh, PA, USA
    • Schmidt, M. N., & Olsson, R. K. (2006). Single-channel speech separation using sparse non-negative matrix factorization. In Proc. of Interspeech. Pittsburgh, PA, USA.
    • (2006) Proc. of Interspeech.
    • Schmidt, M.N.1    Olsson, R.K.2
  • 22
    • 78049361438 scopus 로고    scopus 로고
    • Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization
    • Dallas, TX, USA
    • Schuller, B., & Weninger, F. (2010). Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization. In Proc. of ICASSP (pp. 5054-5057). Dallas, TX, USA.
    • (2010) Proc. of ICASSP , pp. 5054-5057
    • Schuller, B.1    Weninger, F.2
  • 23
    • 78049362257 scopus 로고    scopus 로고
    • Non-negative matrix factorization as noise-robust feature extractor for speech recognition
    • Dallas, TX, USA
    • Schuller, B., Weninger, F., Wöllmer, M., Sun, Y., & Rigoll, G. (2010). Non-negative matrix factorization as noise-robust feature extractor for speech recognition. In Proc. of ICASSP (pp. 4562-4565). Dallas, TX, USA.
    • (2010) Proc. of ICASSP , pp. 4562-4565
    • Schuller, B.1    Weninger, F.2    Wöllmer, M.3    Sun, Y.4    Rigoll, G.5
  • 24
    • 38049021850 scopus 로고    scopus 로고
    • Convolutive speech bases and their application to supervised speech separation
    • Smaragdis, P. (2007). Convolutive speech bases and their application to supervised speech separation. IEEE Transactions on Audio, Speech and Language Processing, 15(1), 1-14.
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.1 , pp. 1-14
    • Smaragdis, P.1
  • 26
    • 38149078937 scopus 로고    scopus 로고
    • Supervised and semi-supervised separation of sounds from singlechannel mixtures
    • Berlin: Springer
    • Smaragdis, P., Raj, B., & Shashanka, M. (2007). Supervised and semi-supervised separation of sounds from singlechannel mixtures. In Proc. of ICA (pp. 414-421). Berlin: Springer.
    • (2007) Proc. of ICA , pp. 414-421
    • Smaragdis, P.1    Raj, B.2    Shashanka, M.3
  • 27
    • 32844468881 scopus 로고    scopus 로고
    • Extraction of drum tracks from polyphonic music using independent subspace analysis
    • Nara, Japan
    • Uhle, C., Dittmar, C., &Sporer, T. (2003). Extraction of drum tracks from polyphonic music using independent subspace analysis. In Proc. of ICA. Nara, Japan.
    • (2003) Proc. of ICA.
    • Uhle, C.1    Dittmar, C.2    Sporer, T.3
  • 29
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
    • Virtanen, T. (2007). Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria. IEEE Transactions on Audio, Speech and Language Processing, 15(3), 1066-1074.
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 30
    • 67650142420 scopus 로고    scopus 로고
    • A multiplicative algorithm for convolutive non-negative matrix factorization based on squared Euclidean distance
    • Wang, W., Cichocki, A., & Chambers, J. A. (2009). A multiplicative algorithm for convolutive non-negative matrix factorization based on squared Euclidean distance. IEEE Transactions on Signal Processing, 57(7), 2858-2864.
    • (2009) IEEE Transactions on Signal Processing , vol.57 , Issue.7 , pp. 2858-2864
    • Wang, W.1    Cichocki, A.2    Chambers, J.A.3
  • 31
    • 84857258863 scopus 로고    scopus 로고
    • The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments
    • Florence, Italy
    • Weninger, F., Geiger, J., Wöllmer, M., Schuller, B., & Rigoll, G. (2011). The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments. In Proc. of CHiME workshop (pp. 24-29). Florence, Italy.
    • (2011) Proc. of CHiME Workshop , pp. 24-29
    • Weninger, F.1    Geiger, J.2    Wöllmer, M.3    Schuller, B.4    Rigoll, G.5
  • 32
    • 80051618211 scopus 로고    scopus 로고
    • Open-BliSSART: Design and evaluation of a research toolkit for blind source separation in audio recognition tasks
    • Prague, Czech Republic
    • Weninger, F., Lehmann, A., & Schuller, B. (2011). open-BliSSART: Design and evaluation of a research toolkit for blind source separation in audio recognition tasks. In Proc. of ICASSP (pp. 1625-1628). Prague, Czech Republic.
    • (2011) Proc. of ICASSP , pp. 1625-1628
    • Weninger, F.1    Lehmann, A.2    Schuller, B.3
  • 34
    • 80051621128 scopus 로고    scopus 로고
    • Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory
    • Prague, Czech Republic
    • Weninger, F., Schuller, B., Wöllmer, M., & Rigoll, G. (2011). Localization of non-linguistic events in spontaneous speech by non-negative matrix factorization and long short-term memory. In Proc. of ICASSP (pp. 5840-5843). Prague, Czech Republic.
    • (2011) Proc. of ICASSP , pp. 5840-5843
    • Weninger, F.1    Schuller, B.2    Wöllmer, M.3    Rigoll, G.4
  • 35
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • DOI 10.1016/S0167-8191(00)00087-9
    • Whaley, R. C., Petitet, A., & Dongarra, J. (2001). Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2), 3-35. (Pubitemid 32264775)
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Clint Whaley, R.1    Petitet, A.2    Dongarra, J.J.3
  • 36
    • 84867198451 scopus 로고    scopus 로고
    • Regularized non-negative matrix factorization with temporal dependencies for speech denoising
    • Brisbane, Australia
    • Wilson, K. W., Raj, B., & Smaragdis, P. (2008). Regularized non-negative matrix factorization with temporal dependencies for speech denoising. In Proc. of Interspeech. Brisbane, Australia.
    • (2008) Proc. of Interspeech.
    • Wilson, K.W.1    Raj, B.2    Smaragdis, P.3
  • 37
    • 58149151095 scopus 로고    scopus 로고
    • Accelerating density functional calculations with graphics processing unit
    • Yasuda, K. (2008). Accelerating density functional calculations with graphics processing unit. Journal of Chemical Theory and Computation, 4, 1230-1236.
    • (2008) Journal of Chemical Theory and Computation , vol.4 , pp. 1230-1236
    • Yasuda, K.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.