메뉴 건너뛰기




Volumn 71, Issue 1-3, 2007, Pages 109-120

Monophonic sound source separation with an unsupervised network of spiking neurones

Author keywords

Amplitude modulation; Auditory maps; Auditory scene analysis; Neurones; Source separation; Speech enhancement; Spikes

Indexed keywords

AMPLITUDE MODULATION; BLIND SOURCE SEPARATION; CLASSIFICATION (OF INFORMATION); NEURONS; SPEECH ENHANCEMENT; UNSUPERVISED LEARNING;

EID: 35648992055     PISSN: 09252312     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.neucom.2007.08.001     Document Type: Article
Times cited : (11)

References (65)
  • 1
    • 0031049135 scopus 로고    scopus 로고
    • Multimodal representation of space in the posterior parietal cortex and its use in planning movements
    • Andersen R., Snyder L., Bradley D., and Xing J. Multimodal representation of space in the posterior parietal cortex and its use in planning movements. Ann. Rev. Neurosci. 20 (1997) 303
    • (1997) Ann. Rev. Neurosci. , vol.20 , pp. 303
    • Andersen, R.1    Snyder, L.2    Bradley, D.3    Xing, J.4
  • 3
    • 0035125193 scopus 로고    scopus 로고
    • Wavelet speech enhancement based on the teager energy operator
    • Bahoura M., and Rouat J. Wavelet speech enhancement based on the teager energy operator. IEEE Signal Process. Lett. 8 (2001) 10-12
    • (2001) IEEE Signal Process. Lett. , vol.8 , pp. 10-12
    • Bahoura, M.1    Rouat, J.2
  • 4
    • 85009063003 scopus 로고    scopus 로고
    • M. Bahoura, J. Rouat, New approach for wavelet speech enhancement, Eurospeech 2001, Danemark, 2001, pp. 1937-1940.
  • 5
    • 35648952317 scopus 로고    scopus 로고
    • F. Berthommier, G. Meyer, Improving of amplitude modulation maps for f0-dependent segregation of harmonic sounds, in: Eurospeech'97, 1997.
  • 6
    • 0036505590 scopus 로고    scopus 로고
    • Unsupervised clustering with spiking neurons by sparse temporal coding and multilayer RBF networks
    • Bohte S.M., Poutré H.L., and Kok J.N. Unsupervised clustering with spiking neurons by sparse temporal coding and multilayer RBF networks. IEEE Trans. Neural Networks 13 2 (2002) 426-435
    • (2002) IEEE Trans. Neural Networks , vol.13 , Issue.2 , pp. 426-435
    • Bohte, S.M.1    Poutré, H.L.2    Kok, J.N.3
  • 8
    • 0028531926 scopus 로고
    • Computational auditory scene analaysis
    • Brown G., and Cooke M. Computational auditory scene analaysis. Comput. Speech Language (1994) 297-336
    • (1994) Comput. Speech Language , pp. 297-336
    • Brown, G.1    Cooke, M.2
  • 9
    • 35648931564 scopus 로고    scopus 로고
    • M. Cooke, Modelling auditory processing and organisation, Ph.D. Thesis, University of Sheffield (published in the Distinguished Dissertations in Computer Science Series, University of Cambridge Press, paper back, 2005).
  • 10
    • 35648937098 scopus 로고    scopus 로고
    • M. Cooke, 〈http://www.dcs.shef.ac.uk/∼martin/〉, 2004.
  • 11
    • 0035478859 scopus 로고    scopus 로고
    • The auditory organization of speech and other sources in listeners and computational models
    • Cooke M., and Ellis D. The auditory organization of speech and other sources in listeners and computational models. Speech Commun. (2001) 141-177
    • (2001) Speech Commun. , pp. 141-177
    • Cooke, M.1    Ellis, D.2
  • 12
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • Cooke M., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Commun. 34 (2001) 267-285
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 13
    • 35648948990 scopus 로고    scopus 로고
    • D. Ellis, Prediction-driven computational auditory scene analysis, Ph.D. Thesis, MIT, 1996.
  • 14
    • 0037153152 scopus 로고    scopus 로고
    • Multiplicative computation in a visual neuron sensitive to looming
    • Gabbiani F., Krapp H., Koch C., and Laurent G. Multiplicative computation in a visual neuron sensitive to looming. Nature 420 (2002) 320-324
    • (2002) Nature , vol.420 , pp. 320-324
    • Gabbiani, F.1    Krapp, H.2    Koch, C.3    Laurent, G.4
  • 15
    • 35648986749 scopus 로고    scopus 로고
    • F. Gaillard, Analyse de scènes auditives computationnelle (CASA): Un nouvel outil de marquage du plan temps-fréquence par détection d'harmonicité exploitant une statistique de passage par zéro, Ph.D. Thesis, INPG, 1999.
  • 17
    • 0028053082 scopus 로고
    • A computational model of the auditory periphery for speech and hearing research
    • Giguere C., and Woodland P.C. A computational model of the auditory periphery for speech and hearing research. J. Am. Statist. Assoc. (1994) 331-349
    • (1994) J. Am. Statist. Assoc. , pp. 331-349
    • Giguere, C.1    Woodland, P.C.2
  • 18
    • 35649024921 scopus 로고    scopus 로고
    • S. Grossberg, K. Govindarajan, L. Wyse, M. Cohen, M. ARTSTREAM: a neural network model of auditory scene analysis and source segregation, Neural Networks, 2003.
  • 19
    • 85009205023 scopus 로고    scopus 로고
    • S. Harding, G. Meyer, Multi-resolution auditory scene analysis: robust speech recognition using pattern-matching from a noisy signal, In: EUROSPEECH, September 2003, pp. 2109-2112.
  • 20
    • 35648997087 scopus 로고    scopus 로고
    • C.K. Henkel, The auditory system, in: D.E. Haines (Ed.), Fundamental Neuroscience, Churchill Livingstone, 1997.
  • 21
    • 0029637779 scopus 로고
    • Pattern recognition computation using action potential timing for stimulus representation
    • Hopfield J. Pattern recognition computation using action potential timing for stimulus representation. Nature 376 (1995) 33-36
    • (1995) Nature , vol.376 , pp. 33-36
    • Hopfield, J.1
  • 22
    • 35648993113 scopus 로고    scopus 로고
    • G. Hu, D. Wang, Monaural speech segregation based on pitch tracking and amplitude modulation, Technical Report, Ohio State University, 2002.
  • 23
    • 85143189623 scopus 로고    scopus 로고
    • G. Hu, D. Wang, Separation of stop consonants, in: ICASSP Hong Kong, 2003.
  • 24
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • Hu G., and Wang D. Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Trans. Neural Networks 15 (2004) 1135-1150
    • (2004) IEEE Trans. Neural Networks , vol.15 , pp. 1135-1150
    • Hu, G.1    Wang, D.2
  • 25
    • 85143191452 scopus 로고    scopus 로고
    • T. Irino, R. Patterson, Speech segregation using event synchronous auditory vocoder, in: ICASSP, 2003, vol. V, pp. 525-528.
  • 26
    • 0031642430 scopus 로고    scopus 로고
    • T. Irino, M. Unoki, A time-varying, analysis/synthesis auditory filterbank using the gammachirp, in: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 98, May 1998, Seattle, Washington, vol. 6, pp. 3653-3656.
  • 27
    • 0038630563 scopus 로고    scopus 로고
    • Single-channel signal separation using time-domain basis functions
    • Jang G., and Lee T. Single-channel signal separation using time-domain basis functions. IEEE-SPL (2003) 168-171
    • (2003) IEEE-SPL , pp. 168-171
    • Jang, G.1    Lee, T.2
  • 28
    • 8344232372 scopus 로고    scopus 로고
    • A maximum likelihood approach to single channel source separation
    • Jang G., and Lee T. A maximum likelihood approach to single channel source separation. J. Mach. Learn. Res. 4 (2003) 1365-1392
    • (2003) J. Mach. Learn. Res. , vol.4 , pp. 1365-1392
    • Jang, G.1    Lee, T.2
  • 29
    • 0036972191 scopus 로고    scopus 로고
    • Effects of age on contralateral suppression of distorsion product otoacoustic emissions in human listeners with normal hearing
    • Kim S., Frisina D.R., and Frisina R.D. Effects of age on contralateral suppression of distorsion product otoacoustic emissions in human listeners with normal hearing. Audiol. Neuro Otol. 7 (2002) 348-357
    • (2002) Audiol. Neuro Otol. , vol.7 , pp. 348-357
    • Kim, S.1    Frisina, D.R.2    Frisina, R.D.3
  • 30
    • 0035280043 scopus 로고    scopus 로고
    • A comparison of auditory and blind separation techniques for speech segregation
    • Kouwe A.J.W., Wang D.L., and Brown G.J. A comparison of auditory and blind separation techniques for speech segregation. IEEE Trans. Speech Audio Process. 9 (2001) 189-195
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 189-195
    • Kouwe, A.J.W.1    Wang, D.L.2    Brown, G.J.3
  • 31
    • 0032675721 scopus 로고    scopus 로고
    • G. Kubin, W.B. Kleijn, On speech coding in a perceptual domain, in: ICASSP, March 1999, Phoenix, Arizona, vol. 1, pp. 205-208.
  • 32
    • 0034761668 scopus 로고    scopus 로고
    • Distributed synchrony in a cell assembly of piking neurons
    • Levy N., Horn D., Meilijson I., and Ruppin E. Distributed synchrony in a cell assembly of piking neurons. Neural Networks 14 6-7 (2001) 815-824
    • (2001) Neural Networks , vol.14 , Issue.6-7 , pp. 815-824
    • Levy, N.1    Horn, D.2    Meilijson, I.3    Ruppin, E.4
  • 33
    • 0029973032 scopus 로고    scopus 로고
    • The ipsilaterally evoked olivocochlearreflex causes rapid adaptation of the 2f1-f2 distortion product otoacoustic emission
    • Liberman M., Puria S., and Guinan J.J. The ipsilaterally evoked olivocochlearreflex causes rapid adaptation of the 2f1-f2 distortion product otoacoustic emission. J. Acoust. Soc. Am. 99 (1996) 2572-3584
    • (1996) J. Acoust. Soc. Am. , vol.99 , pp. 2572-3584
    • Liberman, M.1    Puria, S.2    Guinan, J.J.3
  • 34
    • 35648952316 scopus 로고    scopus 로고
    • C. von der Malsburg, 1981. The correlation theory of brain function, Technical Report Internal Report 81-2, Max-Planck Institute for Biophysical Chemistry.
  • 35
    • 0032857224 scopus 로고    scopus 로고
    • The what and why of binding: the modeler's perspective
    • von der Malsburg C. The what and why of binding: the modeler's perspective. Neuron (1999) 95-104
    • (1999) Neuron , pp. 95-104
    • von der Malsburg, C.1
  • 37
    • 0002938154 scopus 로고    scopus 로고
    • Scene analysis
    • Hawkins H., McMullen T., Popper A., and Fay R. (Eds), Springer, New York
    • Mellinger D.K., and Mont-Reynaud B.M. Scene analysis. In: Hawkins H., McMullen T., Popper A., and Fay R. (Eds). Auditory Computation (1996), Springer, New York 271-331
    • (1996) Auditory Computation , pp. 271-331
    • Mellinger, D.K.1    Mont-Reynaud, B.M.2
  • 38
    • 35648974336 scopus 로고    scopus 로고
    • G. Meyer, D. Yang, W. Ainsworth, in: Applying a model of concurrent vowel segregation to real speech, Computational models of auditory function, S. Greenberg, M. Slaney (Eds.), 2001, pp. 297-310.
  • 39
    • 0016130757 scopus 로고
    • A model for visual shape recognition
    • Milner P. A model for visual shape recognition. Psychol. Rev. 81 (1974) 521-535
    • (1974) Psychol. Rev. , vol.81 , pp. 521-535
    • Milner, P.1
  • 40
    • 84862627060 scopus 로고    scopus 로고
    • J. Nix, M. Kleinschmidt, V. Hohmann, Computational auditory scene analysis by using statistics of high-dimensional speech dynamics and sound source direction, in: EUROSPEECH, September 2003, pp. 1441-1444.
  • 42
    • 0035853534 scopus 로고    scopus 로고
    • Auditory spatial receptive fields created by multiplication
    • Pena J., and Konishi M. Auditory spatial receptive fields created by multiplication. Science 292 (2001) 249-252
    • (2001) Science , vol.292 , pp. 249-252
    • Pena, J.1    Konishi, M.2
  • 43
    • 35648987257 scopus 로고    scopus 로고
    • R. Pichevar, 〈http://www-edu.gel.usherbrooke.ca/picr1601/〉 2007.
  • 44
    • 84945190617 scopus 로고    scopus 로고
    • R. Pichevar, J. Rouat, Cochleotopic/AMtopic (CAM) and Cochleotopic/Spectrotopic (CSM) map based sound source separation using relaxation oscillatory neurons, in: IEEE Neural Networks for Signal Processing Workshop, Toulouse, France, 2003.
  • 45
    • 35648997542 scopus 로고    scopus 로고
    • R. Pichevar, J. Rouat, A quantitative evaluation of a bio-inspired sound segregation technique for two- and three-source mixtures sounds, in: Lecture Notes in Computer Science, Springer, Berlin, 2004, vol. 3445, pp. 430-435.
  • 46
    • 35649000122 scopus 로고    scopus 로고
    • R. Pichevar, J. Rouat, C. Feldbauer, G. Kubin, A bio-inspired sound source separation technique in combination with an enhanced FIR gammatone analysis/synthesis filterbank, in: EUSIPCO Vienna, 2004.
  • 47
    • 0032069218 scopus 로고    scopus 로고
    • Improvement of speech spectrogram accuracy by the method of reassignment
    • Plante F., Meyer G., and Ainsworth W. Improvement of speech spectrogram accuracy by the method of reassignment. IEEE Trans. Speech Audio Process. (1998) 282-287
    • (1998) IEEE Trans. Speech Audio Process. , pp. 282-287
    • Plante, F.1    Meyer, G.2    Ainsworth, W.3
  • 48
    • 0141813716 scopus 로고    scopus 로고
    • M.J. Reyes-Gomez, B. Raj, D. Ellis, Multi-channel source separation by factorial HMMs, in: ICASSP, 2003.
  • 49
    • 0032833663 scopus 로고    scopus 로고
    • Are cortical models really bound by the binding problem?
    • Riesenhuber M., and Poggio T. Are cortical models really bound by the binding problem?. Neuron 84 (1999) 87-93
    • (1999) Neuron , vol.84 , pp. 87-93
    • Riesenhuber, M.1    Poggio, T.2
  • 51
    • 35648985207 scopus 로고    scopus 로고
    • D.F. Rosenthal, H.G. Okuno (Eds.), 1998. Computational Auditory Scene Analysis, L. Erlbaum.
  • 52
    • 35648988072 scopus 로고    scopus 로고
    • J. Rouat, 〈http://www.gel.usherbrooke.ca/rouat/〉 2005.
  • 53
    • 0031124228 scopus 로고    scopus 로고
    • A pitch determination and voiced/unvoiced decision algorithm for noisy speech
    • Rouat J., Liu Y.C., and Morissette D. A pitch determination and voiced/unvoiced decision algorithm for noisy speech. Speech Commun. 21 (1997) 191-207
    • (1997) Speech Commun. , vol.21 , pp. 191-207
    • Rouat, J.1    Liu, Y.C.2    Morissette, D.3
  • 54
    • 27844480473 scopus 로고    scopus 로고
    • J. Rouat, R. Pichevar, Source separation with one ear: proposition for an anthropomorphic approach, EURASIP J. Appl. Signal Process. (9) (2005) 1365-1374.
  • 55
    • 85009230793 scopus 로고    scopus 로고
    • S. Roweis, Factorial models and refiltering for speech separation and denoising, in: Eurospeech 2003.
  • 56
    • 35648960395 scopus 로고    scopus 로고
    • S.T. Roweis, One microphone source seperation, in: NIPS, Denver, USA, 2000.
  • 57
    • 0032166087 scopus 로고    scopus 로고
    • HMM based strategies for enhancement of speech signals embedded in nonstationary noise
    • Sameti H., Sheikhzadeh H., Deng L., and Brennan R. HMM based strategies for enhancement of speech signals embedded in nonstationary noise. IEEE Trans. Speech Audio Process. (1998) 445-455
    • (1998) IEEE Trans. Speech Audio Process. , pp. 445-455
    • Sameti, H.1    Sheikhzadeh, H.2    Deng, L.3    Brennan, R.4
  • 58
    • 35649015463 scopus 로고    scopus 로고
    • J.L. Schwartz, P. Escudier, Auditory processing in a post-cochlear neural network: vowel spectrum processing based on spike synchrony, in: EUROSPEECH, 1989, pp. 247-253.
  • 59
    • 0030365519 scopus 로고    scopus 로고
    • P. Tang, J. Rouat, Modeling neurons in the anteroventral cochlear nucleus for amplitude modulation (AM) processing: application to speech sound, in: Proceedings of the International Conference on Spoken Language Processing, October 1996, p. Th.P.2S2.2.
  • 60
    • 0003390528 scopus 로고    scopus 로고
    • An auditory cortical theory of auditory stream segregation
    • Todd N. An auditory cortical theory of auditory stream segregation. Network Comput. Neural Syst. 7 (1996) 349-356
    • (1996) Network Comput. Neural Syst. , vol.7 , pp. 349-356
    • Todd, N.1
  • 61
    • 0347409529 scopus 로고    scopus 로고
    • J.-M. Valin, F. Michaud, J. Rouat, D. Ltourneau, Robust sound source localization using a microphone array on a mobile robot, in: IEEE/RSJ-International Conference on Intelligent Robots and Systems, October 2003.
  • 62
    • 4544237508 scopus 로고    scopus 로고
    • J.-M. Valin, J. Rouat, F. Michaud, Microphone array post-filter for separation of simultaneous non-stationary sources, in: IEEE International Conference on Acoustics Speech Signal Processing, 2004.
  • 63
    • 0032682770 scopus 로고    scopus 로고
    • Separation of speech from interfering sounds based on oscillatory correlation
    • Wang D., and Brown G.J. Separation of speech from interfering sounds based on oscillatory correlation. IEEE Trans. Neural Networks 10 3 (1999) 684-697
    • (1999) IEEE Trans. Neural Networks , vol.10 , Issue.3 , pp. 684-697
    • Wang, D.1    Brown, G.J.2
  • 64
    • 0031570072 scopus 로고    scopus 로고
    • Image segmentation based on oscillatory correlation
    • Wang D., and Terman D. Image segmentation based on oscillatory correlation. Neural Comput. 9 (1997) 805-836
    • (1997) Neural Comput. , vol.9 , pp. 805-836
    • Wang, D.1    Terman, D.2
  • 65
    • 35649011021 scopus 로고    scopus 로고
    • M. Weintraub, A theory and computational model of auditory monaural sound separation, Ph.D. Thesis, Stanford, 1985.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.