SCOPUS 정보 검색 플랫폼

Neurocomputing

Volumn 71, Issue 1-3, 2007, Pages 109-120

Monophonic sound source separation with an unsupervised network of spiking neurones

(2) Pichevar, Ramin a Rouat, Jean a

a UNIVERSITÉ DE SHERBROOKE (Canada)

Author keywords

Amplitude modulation; Auditory maps; Auditory scene analysis; Neurones; Source separation; Speech enhancement; Spikes

Indexed keywords

AMPLITUDE MODULATION; BLIND SOURCE SEPARATION; CLASSIFICATION (OF INFORMATION); NEURONS; SPEECH ENHANCEMENT; UNSUPERVISED LEARNING;

LOG SPECTRAL DISTORTION; MONOPHONIC SOUND SOURCE SEPARATION; SPIKING NEURONES;

PATTERN RECOGNITION SYSTEMS;

ACOUSTICS; ALGORITHM; ARTICLE; ARTIFICIAL NEURAL NETWORK; AUDITORY SYSTEM; COCHLEAR NUCLEUS; CONTROLLED STUDY; CORRELATION ANALYSIS; FEEDBACK SYSTEM; FEMALE; FILTER; HUMAN; HUMAN EXPERIMENT; INFORMATION PROCESSING; INTERMETHOD COMPARISON; MALE; MATHEMATICAL COMPUTING; NOISE; NORMAL HUMAN; PRIORITY JOURNAL; SOUND DETECTION; SPEECH; SYNAPSE; TRAINING;

EID: 35648992055 PISSN: 09252312 EISSN: None Source Type: Journal
DOI: 10.1016/j.neucom.2007.08.001 Document Type: Article

Times cited : (11)

References (65)

1
- 0031049135
- Multimodal representation of space in the posterior parietal cortex and its use in planning movements
- Andersen R., Snyder L., Bradley D., and Xing J. Multimodal representation of space in the posterior parietal cortex and its use in planning movements. Ann. Rev. Neurosci. 20 (1997) 303
- (1997) Ann. Rev. Neurosci. , vol.20 , pp. 303
- Andersen, R.¹ Snyder, L.² Bradley, D.³ Xing, J.⁴

2
- 0038443474
- Joint acoustic and modulation frequency
- Atlas L., and Shamma S.A. Joint acoustic and modulation frequency. EURASIP J. Appl. Signal Process. 7 (2003) 668-675
- (2003) EURASIP J. Appl. Signal Process. , vol.7 , pp. 668-675
- Atlas, L.¹ Shamma, S.A.²

3
- 0035125193
- Wavelet speech enhancement based on the teager energy operator
- Bahoura M., and Rouat J. Wavelet speech enhancement based on the teager energy operator. IEEE Signal Process. Lett. 8 (2001) 10-12
- (2001) IEEE Signal Process. Lett. , vol.8 , pp. 10-12
- Bahoura, M.¹ Rouat, J.²

4
- 85009063003
- M. Bahoura, J. Rouat, New approach for wavelet speech enhancement, Eurospeech 2001, Danemark, 2001, pp. 1937-1940.

5
- 35648952317
- F. Berthommier, G. Meyer, Improving of amplitude modulation maps for f0-dependent segregation of harmonic sounds, in: Eurospeech'97, 1997.

6
- 0036505590
- Unsupervised clustering with spiking neurons by sparse temporal coding and multilayer RBF networks
- Bohte S.M., Poutré H.L., and Kok J.N. Unsupervised clustering with spiking neurons by sparse temporal coding and multilayer RBF networks. IEEE Trans. Neural Networks 13 2 (2002) 426-435
- (2002) IEEE Trans. Neural Networks , vol.13 , Issue.2 , pp. 426-435
- Bohte, S.M.¹ Poutré, H.L.² Kok, J.N.³

7
- 0003684441
- MIT Press, Cambridge, MA
- Bregman A. Auditory Scene Analysis (1990), MIT Press, Cambridge, MA
- (1990) Auditory Scene Analysis
- Bregman, A.¹

8
- 0028531926
- Computational auditory scene analaysis
- Brown G., and Cooke M. Computational auditory scene analaysis. Comput. Speech Language (1994) 297-336
- (1994) Comput. Speech Language , pp. 297-336
- Brown, G.¹ Cooke, M.²

9
- 35648931564
- M. Cooke, Modelling auditory processing and organisation, Ph.D. Thesis, University of Sheffield (published in the Distinguished Dissertations in Computer Science Series, University of Cambridge Press, paper back, 2005).

10
- 35648937098
- M. Cooke, 〈http://www.dcs.shef.ac.uk/∼martin/〉, 2004.

11
- 0035478859
- The auditory organization of speech and other sources in listeners and computational models
- Cooke M., and Ellis D. The auditory organization of speech and other sources in listeners and computational models. Speech Commun. (2001) 141-177
- (2001) Speech Commun. , pp. 141-177
- Cooke, M.¹ Ellis, D.²

12
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- Cooke M., Green P., Josifovski L., and Vizinho A. Robust automatic speech recognition with missing and unreliable acoustic data. Speech Commun. 34 (2001) 267-285
- (2001) Speech Commun. , vol.34 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

13
- 35648948990
- D. Ellis, Prediction-driven computational auditory scene analysis, Ph.D. Thesis, MIT, 1996.

14
- 0037153152
- Multiplicative computation in a visual neuron sensitive to looming
- Gabbiani F., Krapp H., Koch C., and Laurent G. Multiplicative computation in a visual neuron sensitive to looming. Nature 420 (2002) 320-324
- (2002) Nature , vol.420 , pp. 320-324
- Gabbiani, F.¹ Krapp, H.² Koch, C.³ Laurent, G.⁴

15
- 35648986749
- F. Gaillard, Analyse de scènes auditives computationnelle (CASA): Un nouvel outil de marquage du plan temps-fréquence par détection d'harmonicité exploitant une statistique de passage par zéro, Ph.D. Thesis, INPG, 1999.

16
- 0004017463
- Cambridge University Press, Cambridge
- Gerstner W. Spiking neuron models: single neurons, populations, plasticity (2002), Cambridge University Press, Cambridge
- (2002) Spiking neuron models: single neurons, populations, plasticity
- Gerstner, W.¹

17
- 0028053082
- A computational model of the auditory periphery for speech and hearing research
- Giguere C., and Woodland P.C. A computational model of the auditory periphery for speech and hearing research. J. Am. Statist. Assoc. (1994) 331-349
- (1994) J. Am. Statist. Assoc. , pp. 331-349
- Giguere, C.¹ Woodland, P.C.²

18
- 35649024921
- S. Grossberg, K. Govindarajan, L. Wyse, M. Cohen, M. ARTSTREAM: a neural network model of auditory scene analysis and source segregation, Neural Networks, 2003.

19
- 85009205023
- S. Harding, G. Meyer, Multi-resolution auditory scene analysis: robust speech recognition using pattern-matching from a noisy signal, In: EUROSPEECH, September 2003, pp. 2109-2112.

20
- 35648997087
- C.K. Henkel, The auditory system, in: D.E. Haines (Ed.), Fundamental Neuroscience, Churchill Livingstone, 1997.

21
- 0029637779
- Pattern recognition computation using action potential timing for stimulus representation
- Hopfield J. Pattern recognition computation using action potential timing for stimulus representation. Nature 376 (1995) 33-36
- (1995) Nature , vol.376 , pp. 33-36
- Hopfield, J.¹

22
- 35648993113
- G. Hu, D. Wang, Monaural speech segregation based on pitch tracking and amplitude modulation, Technical Report, Ohio State University, 2002.

23
- 85143189623
- G. Hu, D. Wang, Separation of stop consonants, in: ICASSP Hong Kong, 2003.

24
- 4644265990
- Monaural speech segregation based on pitch tracking and amplitude modulation
- Hu G., and Wang D. Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Trans. Neural Networks 15 (2004) 1135-1150
- (2004) IEEE Trans. Neural Networks , vol.15 , pp. 1135-1150
- Hu, G.¹ Wang, D.²

25
- 85143191452
- T. Irino, R. Patterson, Speech segregation using event synchronous auditory vocoder, in: ICASSP, 2003, vol. V, pp. 525-528.

26
- 0031642430
- T. Irino, M. Unoki, A time-varying, analysis/synthesis auditory filterbank using the gammachirp, in: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 98, May 1998, Seattle, Washington, vol. 6, pp. 3653-3656.

27
- 0038630563
- Single-channel signal separation using time-domain basis functions
- Jang G., and Lee T. Single-channel signal separation using time-domain basis functions. IEEE-SPL (2003) 168-171
- (2003) IEEE-SPL , pp. 168-171
- Jang, G.¹ Lee, T.²

28
- 8344232372
- A maximum likelihood approach to single channel source separation
- Jang G., and Lee T. A maximum likelihood approach to single channel source separation. J. Mach. Learn. Res. 4 (2003) 1365-1392
- (2003) J. Mach. Learn. Res. , vol.4 , pp. 1365-1392
- Jang, G.¹ Lee, T.²

29
- 0036972191
- Effects of age on contralateral suppression of distorsion product otoacoustic emissions in human listeners with normal hearing
- Kim S., Frisina D.R., and Frisina R.D. Effects of age on contralateral suppression of distorsion product otoacoustic emissions in human listeners with normal hearing. Audiol. Neuro Otol. 7 (2002) 348-357
- (2002) Audiol. Neuro Otol. , vol.7 , pp. 348-357
- Kim, S.¹ Frisina, D.R.² Frisina, R.D.³

30
- 0035280043
- A comparison of auditory and blind separation techniques for speech segregation
- Kouwe A.J.W., Wang D.L., and Brown G.J. A comparison of auditory and blind separation techniques for speech segregation. IEEE Trans. Speech Audio Process. 9 (2001) 189-195
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 189-195
- Kouwe, A.J.W.¹ Wang, D.L.² Brown, G.J.³

31
- 0032675721
- G. Kubin, W.B. Kleijn, On speech coding in a perceptual domain, in: ICASSP, March 1999, Phoenix, Arizona, vol. 1, pp. 205-208.

32
- 0034761668
- Distributed synchrony in a cell assembly of piking neurons
- Levy N., Horn D., Meilijson I., and Ruppin E. Distributed synchrony in a cell assembly of piking neurons. Neural Networks 14 6-7 (2001) 815-824
- (2001) Neural Networks , vol.14 , Issue.6-7 , pp. 815-824
- Levy, N.¹ Horn, D.² Meilijson, I.³ Ruppin, E.⁴

33
- 0029973032
- The ipsilaterally evoked olivocochlearreflex causes rapid adaptation of the 2f1-f2 distortion product otoacoustic emission
- Liberman M., Puria S., and Guinan J.J. The ipsilaterally evoked olivocochlearreflex causes rapid adaptation of the 2f1-f2 distortion product otoacoustic emission. J. Acoust. Soc. Am. 99 (1996) 2572-3584
- (1996) J. Acoust. Soc. Am. , vol.99 , pp. 2572-3584
- Liberman, M.¹ Puria, S.² Guinan, J.J.³

34
- 35648952316
- C. von der Malsburg, 1981. The correlation theory of brain function, Technical Report Internal Report 81-2, Max-Planck Institute for Biophysical Chemistry.

35
- 0032857224
- The what and why of binding: the modeler's perspective
- von der Malsburg C. The what and why of binding: the modeler's perspective. Neuron (1999) 95-104
- (1999) Neuron , pp. 95-104
- von der Malsburg, C.¹

36
- 0022633118
- A neural cocktail-party processor
- von der Malsburg C., and Schneider W. A neural cocktail-party processor. Biol. Cybern. (1986) 29-40
- (1986) Biol. Cybern. , pp. 29-40
- von der Malsburg, C.¹ Schneider, W.²

37
- 0002938154
- Scene analysis
- Hawkins H., McMullen T., Popper A., and Fay R. (Eds), Springer, New York
- Mellinger D.K., and Mont-Reynaud B.M. Scene analysis. In: Hawkins H., McMullen T., Popper A., and Fay R. (Eds). Auditory Computation (1996), Springer, New York 271-331
- (1996) Auditory Computation , pp. 271-331
- Mellinger, D.K.¹ Mont-Reynaud, B.M.²

38
- 35648974336
- G. Meyer, D. Yang, W. Ainsworth, in: Applying a model of concurrent vowel segregation to real speech, Computational models of auditory function, S. Greenberg, M. Slaney (Eds.), 2001, pp. 297-310.

39
- 0016130757
- A model for visual shape recognition
- Milner P. A model for visual shape recognition. Psychol. Rev. 81 (1974) 521-535
- (1974) Psychol. Rev. , vol.81 , pp. 521-535
- Milner, P.¹

40
- 84862627060
- J. Nix, M. Kleinschmidt, V. Hohmann, Computational auditory scene analysis by using statistics of high-dimensional speech dynamics and sound source direction, in: EUROSPEECH, September 2003, pp. 1441-1444.

41
- 0000460671
- Complex sounds and auditory images
- Cazals Y., Demany L., and Horner K. (Eds), Pergamon Press, Oxford
- Patterson R., Robinson K., Holdsworth J., McKeown D., Zhang C., and Allerhand M. Complex sounds and auditory images. In: Cazals Y., Demany L., and Horner K. (Eds). Auditory Physiology and Perception (1992), Pergamon Press, Oxford 429-446
- (1992) Auditory Physiology and Perception , pp. 429-446
- Patterson, R.¹ Robinson, K.² Holdsworth, J.³ McKeown, D.⁴ Zhang, C.⁵ Allerhand, M.⁶

42
- 0035853534
- Auditory spatial receptive fields created by multiplication
- Pena J., and Konishi M. Auditory spatial receptive fields created by multiplication. Science 292 (2001) 249-252
- (2001) Science , vol.292 , pp. 249-252
- Pena, J.¹ Konishi, M.²

43
- 35648987257
- R. Pichevar, 〈http://www-edu.gel.usherbrooke.ca/picr1601/〉 2007.

44
- 84945190617
- R. Pichevar, J. Rouat, Cochleotopic/AMtopic (CAM) and Cochleotopic/Spectrotopic (CSM) map based sound source separation using relaxation oscillatory neurons, in: IEEE Neural Networks for Signal Processing Workshop, Toulouse, France, 2003.

45
- 35648997542
- R. Pichevar, J. Rouat, A quantitative evaluation of a bio-inspired sound segregation technique for two- and three-source mixtures sounds, in: Lecture Notes in Computer Science, Springer, Berlin, 2004, vol. 3445, pp. 430-435.

46
- 35649000122
- R. Pichevar, J. Rouat, C. Feldbauer, G. Kubin, A bio-inspired sound source separation technique in combination with an enhanced FIR gammatone analysis/synthesis filterbank, in: EUSIPCO Vienna, 2004.

47
- 0032069218
- Improvement of speech spectrogram accuracy by the method of reassignment
- Plante F., Meyer G., and Ainsworth W. Improvement of speech spectrogram accuracy by the method of reassignment. IEEE Trans. Speech Audio Process. (1998) 282-287
- (1998) IEEE Trans. Speech Audio Process. , pp. 282-287
- Plante, F.¹ Meyer, G.² Ainsworth, W.³

48
- 0141813716
- M.J. Reyes-Gomez, B. Raj, D. Ellis, Multi-channel source separation by factorial HMMs, in: ICASSP, 2003.

49
- 0032833663
- Are cortical models really bound by the binding problem?
- Riesenhuber M., and Poggio T. Are cortical models really bound by the binding problem?. Neuron 84 (1999) 87-93
- (1999) Neuron , vol.84 , pp. 87-93
- Riesenhuber, M.¹ Poggio, T.²

50
- 0142026377
- Speech segregation based on sound localization
- Roman N., Wang D., and Brown G. Speech segregation based on sound localization. J. Acoust. Soc. Am. (2003)
- (2003) J. Acoust. Soc. Am.
- Roman, N.¹ Wang, D.² Brown, G.³

51
- 35648985207
- D.F. Rosenthal, H.G. Okuno (Eds.), 1998. Computational Auditory Scene Analysis, L. Erlbaum.

52
- 35648988072
- J. Rouat, 〈http://www.gel.usherbrooke.ca/rouat/〉 2005.

53
- 0031124228
- A pitch determination and voiced/unvoiced decision algorithm for noisy speech
- Rouat J., Liu Y.C., and Morissette D. A pitch determination and voiced/unvoiced decision algorithm for noisy speech. Speech Commun. 21 (1997) 191-207
- (1997) Speech Commun. , vol.21 , pp. 191-207
- Rouat, J.¹ Liu, Y.C.² Morissette, D.³

54
- 27844480473
- J. Rouat, R. Pichevar, Source separation with one ear: proposition for an anthropomorphic approach, EURASIP J. Appl. Signal Process. (9) (2005) 1365-1374.

55
- 85009230793
- S. Roweis, Factorial models and refiltering for speech separation and denoising, in: Eurospeech 2003.

56
- 35648960395
- S.T. Roweis, One microphone source seperation, in: NIPS, Denver, USA, 2000.

57
- 0032166087
- HMM based strategies for enhancement of speech signals embedded in nonstationary noise
- Sameti H., Sheikhzadeh H., Deng L., and Brennan R. HMM based strategies for enhancement of speech signals embedded in nonstationary noise. IEEE Trans. Speech Audio Process. (1998) 445-455
- (1998) IEEE Trans. Speech Audio Process. , pp. 445-455
- Sameti, H.¹ Sheikhzadeh, H.² Deng, L.³ Brennan, R.⁴

58
- 35649015463
- J.L. Schwartz, P. Escudier, Auditory processing in a post-cochlear neural network: vowel spectrum processing based on spike synchrony, in: EUROSPEECH, 1989, pp. 247-253.

59
- 0030365519
- P. Tang, J. Rouat, Modeling neurons in the anteroventral cochlear nucleus for amplitude modulation (AM) processing: application to speech sound, in: Proceedings of the International Conference on Spoken Language Processing, October 1996, p. Th.P.2S2.2.

60
- 0003390528
- An auditory cortical theory of auditory stream segregation
- Todd N. An auditory cortical theory of auditory stream segregation. Network Comput. Neural Syst. 7 (1996) 349-356
- (1996) Network Comput. Neural Syst. , vol.7 , pp. 349-356
- Todd, N.¹

61
- 0347409529
- J.-M. Valin, F. Michaud, J. Rouat, D. Ltourneau, Robust sound source localization using a microphone array on a mobile robot, in: IEEE/RSJ-International Conference on Intelligent Robots and Systems, October 2003.

62
- 4544237508
- J.-M. Valin, J. Rouat, F. Michaud, Microphone array post-filter for separation of simultaneous non-stationary sources, in: IEEE International Conference on Acoustics Speech Signal Processing, 2004.

63
- 0032682770
- Separation of speech from interfering sounds based on oscillatory correlation
- Wang D., and Brown G.J. Separation of speech from interfering sounds based on oscillatory correlation. IEEE Trans. Neural Networks 10 3 (1999) 684-697
- (1999) IEEE Trans. Neural Networks , vol.10 , Issue.3 , pp. 684-697
- Wang, D.¹ Brown, G.J.²

64
- 0031570072
- Image segmentation based on oscillatory correlation
- Wang D., and Terman D. Image segmentation based on oscillatory correlation. Neural Comput. 9 (1997) 805-836
- (1997) Neural Comput. , vol.9 , pp. 805-836
- Wang, D.¹ Terman, D.²

65
- 35649011021
- M. Weintraub, A theory and computational model of auditory monaural sound separation, Ph.D. Thesis, Stanford, 1985.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.