



2011, Pages 551-588

Sound Source Separation

Author keywords

Actual source separation techniques and quality assessment; Binaural source separation, source segregation spatial cues, extracted from signals at both ears; Computational auditory scene analysis (CASA) artificial systems, mimicking localization separation process; Delay and sum beamformer and null beamformer fixed designs, and source DOAs; Distribution, power normalized magnitude STFT coefficients of speech source; Frequency domain independent component analysis; Separation of sources from single channel (monophonic) mixtures challenging; Sound recording and processing need for specific digital audio effects for sounds; Sound source separation; Time frequency representations decreasing overlap between sources

EID: 84886063988     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9781119991298.ch14     Document Type: Chapter
Times cited : (3)

References (38)
  • 3
    • 0004106092
    • Spatial Hearing
    • MIT Press, Cambridge, MA
    • J. Blauert. Spatial Hearing. MIT Press, Cambridge, MA, 2001.
    • (2001)
    • Blauert, J.1
  • 4
    • 0017939936
    • Auditory streaming and the building of timbre
    • A. S. Bregman and S. Pinker. Auditory streaming and the building of timbre. Canadian Journal of Psychology, 32(1): 19-31, 1978.
    • (1978) Canadian Journal of Psychology , vol.32 , Issue.1 , pp. 19-31
    • Bregman, A.S.1    Pinker, S.2
  • 5
    • 0003684441
    • Auditory Scene Analysis: The Perceptual Organization of Sound
    • MIT Press, Cambridge, MA
    • A. S. Bregman. Auditory Scene Analysis: The Perceptual Organization of Sound. MIT Press, Cambridge, MA, 1990.
    • (1990)
    • Bregman, A.S.1
  • 6
    • 0003980102
    • Microphone Arrays: Signal Processing Techniques and Applications
    • Springer, New York, NY
    • M. S. Brandstein and D. B. Ward (eds). Microphone Arrays: Signal Processing Techniques and Applications. Springer, New York, NY, 2001.
    • (2001)
    • Brandstein, M.S.1    Ward, D.B.2
  • 7
    • 9644281074
    • Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
    • November
    • C. Faller and J. Merimaa. Source localization in complex listening situations: Selection of binaural cues based on interaural coherence. Journal of the Acoustical Society of America, 116(5): 3075-3089, November 2004.
    • (2004) Journal of the Acoustical Society of America , vol.116 , Issue.5 , pp. 3075-3089
    • Faller, C.1    Merimaa, J.2
  • 8
    • 0027292061
    • Combined evaluation of interaural time and intensity differences: Psychoacoustic results and computer modeling
    • W. Gaik. Combined evaluation of interaural time and intensity differences: Psychoacoustic results and computer modeling. Journal of the Acoustical Society of America, 94(1): 98-110, 1993.
    • (1993) Journal of the Acoustical Society of America , vol.94 , Issue.1 , pp. 98-110
    • Gaik, W.1
  • 9
    • 64149103500
    • Perception of Attack Transients in Musical Tones
    • PhD thesis, Department of Music, Stanford University, California, USA
    • J. W. Gordon. Perception of Attack Transients in Musical Tones. PhD thesis, Department of Music, Stanford University, California, USA, 1984.
    • (1984)
    • Gordon, J.W.1
  • 10
    • 0008316428
    • Pitch-based streaming in auditory perception
    • In N. Griffith and P. Todd (eds.), Musical networks, MIT Press, Cambridge, MA
    • S. Grossberg. Pitch-based streaming in auditory perception. In N. Griffith and P. Todd (eds.), Musical networks, pp 117-140, MIT Press, Cambridge, MA, 1996.
    • (1996) , pp. 117-140
    • Grossberg, S.1
  • 11
    • 84874127157
    • Sparse component analysis
    • In P. Comon and C. Jutten (eds), Handbook of Blind Source Separation, Academic Press, Oxford, UK
    • R. Gribonval and M. Zibulevsky. Sparse component analysis. In P. Comon and C. Jutten (eds), Handbook of Blind Source Separation, pp. 367-420. Academic Press, Oxford, UK, 2010.
    • (2010) , pp. 367-420
    • Gribonval, R.1    Zibulevsky, M.2
  • 12
    • 0001654548
    • Pitch perception and the segregation and integration of auditory entities
    • In Gerald M. Edelman, W. Einar Gall and W. Maxwell Cowan (eds.), Auditory Function: Neurobiological Bases of Hearing, Wiley, New York, USA
    • W. M. Hartmann. Pitch perception and the segregation and integration of auditory entities. In Gerald M. Edelman, W. Einar Gall and W. Maxwell Cowan (eds.), Auditory Function: Neurobiological Bases of Hearing, pp. 623-645. Wiley, New York, USA, 1988.
    • (1988) , pp. 623-645
    • Hartmann, W.M.1
  • 13
    • 34047253222
    • A method for separation of overlapping partials based on similarity of temporal envelopes in multi-channel mixtures
    • May
    • H. Viste and G. Evangelista. A method for separation of overlapping partials based on similarity of temporal envelopes in multi-channel mixtures. IEEE Trans. on Audio, Speech, and Language Processing, 14(3): 1051-1061, May 2006.
    • (2006) IEEE Trans. on Audio, Speech, and Language Processing , vol.14 , Issue.3 , pp. 1051-1061
    • Viste, H.1    Evangelista, G.2
  • 15
    • 0032442773
    • Coincidence detection in the auditory system: 50 years after Jeffress
    • P. X. Joris, P. H. Smith, and T. C. T. Yin. Coincidence detection in the auditory system: 50 years after Jeffress. Neuron, 21: 1235-1238, 1998.
    • (1998) Neuron , vol.21 , pp. 1235-1238
    • Joris, P.X.1    Smith, P.H.2    Yin, T.C.T.3
  • 16
    • 0347337997
    • Multiple fundamental frequency estimation based on harmonicity and spectral smoothness
    • November
    • A. P. Klapuri. Multiple fundamental frequency estimation based on harmonicity and spectral smoothness. IEEE Transactions on Speech and Audio Processing, 11(6): 804-816, November 2003.
    • (2003) IEEE Transactions on Speech and Audio Processing , vol.11 , Issue.6 , pp. 804-816
    • Klapuri, A.P.1
  • 17
    • 84866518463
    • Modélisation Sinusoïdale des Sons Polyphoniques
    • PhD thesis, LaBRI, University of Bordeaux 1, Talence, France, December, In French
    • M. Lagrange. Modélisation Sinusoïdale des Sons Polyphoniques. PhD thesis, LaBRI, University of Bordeaux 1, Talence, France, December 2004. In French.
    • (2004)
    • Lagrange, M.1
  • 18
    • 84923923639
    • A new dissimilarity metric for the clustering of partials using the common variation cue
    • In Proceedings of the International Computer Music Conference (ICMC), Barcelona, Spain, September
    • M. Lagrange. A new dissimilarity metric for the clustering of partials using the common variation cue. In Proceedings of the International Computer Music Conference (ICMC), Barcelona, Spain, September 2005.
    • (2005)
    • Lagrange, M.1
  • 19
    • 0022976531
    • Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals
    • W. Lindemann. Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. Journal of the Acoustical Society of America, 80(6): 1608-1622, 1986.
    • (1986) Journal of the Acoustical Society of America , vol.80 , Issue.6 , pp. 1608-1622
    • Lindemann, W.1
  • 20
    • 0033592606
    • Learning the parts of objects by non-negative matrix factorization
    • 21 October
    • D. D. Lee and H. S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401: 788-791, 21 October 1999.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 21
    • 0024814752
    • Segregation of concurrent sounds: effects of frequency modulation coherence
    • S. McAdams. Segregation of concurrent sounds: effects of frequency modulation coherence. Journal of the Acoustical Society of America, 86(6): 2148-2159, 1989.
    • (1989) Journal of the Acoustical Society of America , vol.86 , Issue.6 , pp. 2148-2159
    • McAdams, S.1
  • 22
    • 79953823160
    • Binaural Audio Signal Processing Using Interaural Coherence Matching
    • PhD thesis, EPFL, Lausanne, Switzerland
    • F. Menzer. Binaural Audio Signal Processing Using Interaural Coherence Matching. PhD thesis, EPFL, Lausanne, Switzerland, 2010.
    • (2010)
    • Menzer, F.1
  • 23
    • 67149085959
    • Cumulative state coherence transform for a robust two-channel multiple source localization
    • In Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation
    • F. Nesta, P. Svaizer and M. Omologo. Cumulative state coherence transform for a robust two-channel multiple source localization. In Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation, pp. 290-297, 2009.
    • (2009) , pp. 290-297
    • Nesta, F.1    Svaizer, P.2    Omologo, M.3
  • 25
    • 85132855875
    • Frequency domain blind source separation
    • In S. Makino, T.-W. Lee, and H. Sawada (eds), Blind Speech Separation, Springer, Dordrecht, The Netherlands
    • H. Sawada, S. Araki and S. Makino. Frequency domain blind source separation. In S. Makino, T.-W. Lee, and H. Sawada (eds), Blind Speech Separation, pp. 47-78. Springer, Dordrecht, The Netherlands, 2007.
    • (2007) , pp. 47-78
    • Sawada, H.1    Araki, S.2    Makino, S.3
  • 26
    • 84945116938
    • Non-negative matrix factorization for polyphonic music transcription
    • In Proceedings of the 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, 19-22 October
    • P. Smaragdis and J. C. Brown. Non-negative matrix factorization for polyphonic music transcription. In Proceedings of the 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 177-180, New Paltz, New York, 19-22 October 2003.
    • (2003) , pp. 177-180
    • Smaragdis, P.1    Brown, J.C.2
  • 27
    • 33745711487
    • Nonnegative matrix factor 2-D deconvolution for blind single channel source separation
    • In Independent Component Analysis and Signal Separation, International Conference on, volume 3889 of Lecture Notes in Computer Science (LNCS), Springer, New York, NY, April
    • M. N. Schmidt and M. Mørup. Nonnegative matrix factor 2-D deconvolution for blind single channel source separation. In Independent Component Analysis and Signal Separation, International Conference on, volume 3889 of Lecture Notes in Computer Science (LNCS), pp. 700-707. Springer, New York, NY, April 2006.
    • (2006) , pp. 700-707
    • Schmidt, M.N.1    Mørup, M.2
  • 28
    • 35048843291
    • Non-negative matrix factor deconvolution: Extraction of multiple sound sources from monophonic inputs
    • In Independent Component Analysis and Blind Signal Separation: Proceedings of the Fifth International Conference (ICA 2004), Granada, Spain, September 22-24
    • P. Smaragdis. Non-negative matrix factor deconvolution: Extraction of multiple sound sources from monophonic inputs. In Independent Component Analysis and Blind Signal Separation: Proceedings of the Fifth International Conference (ICA 2004), pp. 494-499, Granada, Spain, September 22-24 2004.
    • (2004) , pp. 494-499
    • Smaragdis, P.1
  • 29
    • 38049021850
    • Convolutive speech bases and their application to supervised speech separation
    • Jan.
    • P. Smaragdis. Convolutive speech bases and their application to supervised speech separation. IEEE Transactions on Audio, Speech, and Language Processing, 15(1): 1-12, Jan. 2007.
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.1 , pp. 1-12
    • Smaragdis, P.1
  • 30
    • 0000966913
    • Models of binaural perception
    • In R. H. Gilkey and T. R. Anderson (eds.), Binaural and Spatial Hearing in Real and Virtual Environments, Lawrence Erlbaum Associates
    • R. M. Stern and C. Trahiotis. Models of binaural perception. In R. H. Gilkey and T. R. Anderson (eds.), Binaural and Spatial Hearing in Real and Virtual Environments, pp. 499-531. Lawrence Erlbaum Associates, 1997.
    • (1997) , pp. 499-531
    • Stern, R.M.1    Trahiotis, C.2
  • 32
    • 32844461077
    • Separation of sound sources by convolutive sparse coding
    • In Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing (SAPA 2004), Jeju, Korea, 3 October
    • T. Virtanen. Separation of sound sources by convolutive sparse coding. In Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing (SAPA 2004), Jeju, Korea, 3 October 2004.
    • (2004)
    • Virtanen, T.1
  • 33
    • 67650563190
    • Binaural Localization and Separation Techniques
    • PhD thesis, EPFL, Lausanne, Switzerland
    • H. Viste. Binaural Localization and Separation Techniques. PhD thesis, EPFL, Lausanne, Switzerland, 2004.
    • (2004)
    • Viste, H.1
  • 34
    • 77955698250
    • Probabilistic modeling paradigms for audio source separation
    • In W. Wang (ed), Machine Audition: Principles, Algorithms and Systems. IGI Global, Hershey, PA
    • E. Vincent, M. G. Jafari, S. A. Abdallah, M. D. Plumbley and M. E. Davies. Probabilistic modeling paradigms for audio source separation. In W. Wang (ed), Machine Audition: Principles, Algorithms and Systems. IGI Global, Hershey, PA, 2010.
    • (2010)
    • Vincent, E.1    Jafari, M.G.2    Abdallah, S.A.3    Plumbley, M.D.4    Davies, M.E.5
  • 35
    • 82255178542
    • Computational Auditory Scene Analysis: Principles, Algorithms and Applications
    • Wiley-IEEE Press, Hoboken, NJ
    • D. L. Wang and G. J. Brown (eds). Computational Auditory Scene Analysis: Principles, Algorithms and Applications. Wiley-IEEE Press, Hoboken, NJ, 2006.
    • (2006)
    • Wang, D.L.1    Brown, G.J.2
  • 36
    • 34547519511
    • Investigating single-channel audio source separation methods based on non-negative matrix factorization
    • In A. K. Nandi and X. Zhu (eds), Proceedings of the ICA Research Network International Workshop, 18-19 Sept 2006
    • B. Wang and M. D. Plumbley. Investigating single-channel audio source separation methods based on non-negative matrix factorization. In A. K. Nandi and X. Zhu (eds), Proceedings of the ICA Research Network International Workshop, 18-19 Sept 2006, pp. 17-20, 2006.
    • (2006) , pp. 17-20
    • Wang, B.1    Plumbley, M.D.2
  • 37
    • 0004166168
    • Experimental Psychology
    • Holt, New York, NY
    • R. S. Woodworth and H. Schlosberg. Experimental Psychology. Holt, New York, NY, 1954.
    • (1954)
    • Woodworth, R.S.1    Schlosberg, H.2
  • 38
    • 3142694930
    • Blind separation of speech mixtures via time-frequency masking
    • Ö. Yılmaz and S. T. Rickard. Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 52(7): 1830-1847, 2004.
    • (2004) IEEE Transactions on Signal Processing , vol.52 , Issue.7 , pp. 1830-1847
    • Yilmaz, Ö.1    Rickard, S.T.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.