SCOPUS 정보 검색 플랫폼

International Journal of Neural Systems

Volumn 25, Issue 1, 2015, Pages

Acoustic space learning for sound-source separation and localization on binaural manifolds

(3) Deleforge, Antoine a Forbes, Florence a Horaud, Radu a

a INRIA RHÔNE ALPES (France)

Author keywords

Binaural hearing; EM inference; manifold learning; mixture of regressors; sound localization; sound source separation

Indexed keywords

ACOUSTIC GENERATORS; AUDITION; BAYESIAN NETWORKS; CLUSTERING ALGORITHMS; DIMENSIONALITY REDUCTION; FREQUENCY ESTIMATION; INFERENCE ENGINES; MAXIMUM PRINCIPLE; PIECEWISE LINEAR TECHNIQUES; SEPARATION;

BINAURAL HEARING; EM INFERENCE; MANIFOLD LEARNING; SOUND LOCALIZATION; SOUND SOURCE SEPARATION;

SOURCE SEPARATION;

ACOUSTICS; ASSOCIATION; BAYES THEOREM; HUMAN; LEARNING; PHYSIOLOGY; PRINCIPAL COMPONENT ANALYSIS; SIGNAL PROCESSING; SOUND DETECTION; SPECTROSCOPY; THEORETICAL MODEL;

ACOUSTICS; BAYES THEOREM; CUES; HUMANS; LEARNING; MODELS, THEORETICAL; PRINCIPAL COMPONENT ANALYSIS; SIGNAL PROCESSING, COMPUTER-ASSISTED; SOUND LOCALIZATION; SPECTRUM ANALYSIS;

EID: 84924985938 PISSN: 01290657 EISSN: 17936462 Source Type: Journal
DOI: 10.1142/S0129065714400036 Document Type: Article

Times cited : (86)

References (52)

1
- 0003742220
- (MIT Press)
- J. Blauert, Spatial Hearing: The Psychophysics of Human Sound Localization (MIT Press, 1997).
- (1997) Spatial Hearing: The Psychophysics of Human Sound Localization
- Blauert, J.¹

2
- 82255178542
- (IEEE Press)
- D. Wang and G. J. Brown, Computational Auditory Scene Analysis : Principles, Algorithms and Applications (IEEE Press, 2006).
- (2006) Computational Auditory Scene Analysis : Principles, Algorithms and Applications
- Wang, D.¹ Brown, G.J.²

3
- 84859997410
- The cocktail party robot: Sound source separation and localisation with an active binaural head
- A. Deleforge and R. P. Horaud, The cocktail party robot: Sound source separation and localisation with an active binaural head, In Proc. 7th ACM/IEEE Int. Conf. Human Robot Interaction (HRI) (2012), pp. 431-438.
- (2012) Proc. 7th ACM/IEEE Int. Conf. Human Robot Interaction (HRI) , pp. 431-438
- Deleforge, A.¹ Horaud, R.P.²

4
- 80052339383
- Some experiment on the recognition of speech, with one and with two ears
- E. C. Cherry, Some experiment on the recognition of speech, with one and with two ears, J. Acoust. Soc. Am. 25(5) (1953) 975-979.
- (1953) J. Acoust. Soc. Am. , vol.25 , Issue.5 , pp. 975-979
- Cherry, E.C.¹

5
- 22944480530
- The cocktail party problem
- S. Haykin and Z. Chen, The cocktail party problem, Neural Comput. 17 (2005) 1875-1902.
- (2005) Neural Comput. , vol.17 , pp. 1875-1902
- Haykin, S.¹ Chen, Z.²

6
- 0000705358
- On our perception of sound direction
- L. Rayleigh, On our perception of sound direction, Philos. Mag. 13 (1907) 214-232.
- (1907) Philos. Mag. , vol.13 , pp. 214-232
- Rayleigh, L.¹

7
- 0026044290
- Sound localization by human listeners
- J. C. Middlebrooks and D. M. Green, Sound localization by human listeners, Annu. Rev. Psychol. 42 (1991) 135-159.
- (1991) Annu. Rev. Psychol. , vol.42 , pp. 135-159
- Middlebrooks, J.C.¹ Green, D.M.²

8
- 0004166168
- (Holt)
- R. S. Woodworth and H. Schlosberg, Experimental Psychology (Holt, 1965).
- (1965) Experimental Psychology
- Woodworth, R.S.¹ Schlosberg, H.²

9
- 0031813079
- Spectrotemporal factors in two-dimensional human sound localization
- P. M. Hofman and A. J. Van Opstal, Spectrotemporal factors in two-dimensional human sound localization, J. Acoust. Soc. Am. 103(5) (1998) 2634-2648.
- (1998) J. Acoust. Soc. Am. , vol.103 , Issue.5 , pp. 2634-2648
- Hofman, P.M.¹ Van Opstal, A.J.²

10
- 73949127259
- Azimuthal source localization using interaural coherence in a robotic dog: Modeling and application
- R. Liu and Y. Wang, Azimuthal source localization using interaural coherence in a robotic dog: Modeling and application, Robotica 28(7) (2010) 1013-1020.
- (2010) Robotica , vol.28 , Issue.7 , pp. 1013-1020
- Liu, R.¹ Wang, Y.²

11
- 84869795684
- Geometrically constrained robust time delay estimation using non-coplanar microphone arrays
- X. Alameda-Pineda and R. P. Horaud, Geometrically constrained robust time delay estimation using non-coplanar microphone arrays, in Proc. 20th Eur. Signal Processing Conf. (EUSIPCO) (2012), pp. 1309-1313.
- (2012) Proc. 20th Eur. Signal Processing Conf. (EUSIPCO) , pp. 1309-1313
- Alameda-Pineda, X.¹ Horaud, R.P.²

12
- 70349204584
- An em algorithm for localizing multiple sound sources in reverberant environments
- M. I. Mandel, D. P. W. Ellis and T. Jebara, An EM algorithm for localizing multiple sound sources in reverberant environments, in Proc. Neural Information Processing Systems (NIPS) Conf. (2007), pp. 953-960.
- (2007) Proc. Neural Information Processing Systems (NIPS) Conf. , pp. 953-960
- Mandel, M.I.¹ Ellis, D.P.W.² Jebara, T.³

13
- 84857334222
- A latently constrained mixture model for audio source separation and localization
- A. Deleforge and R. P. Horaud, A latently constrained mixture model for audio source separation and localization, in Proc. 10th Int. Conf., LVA/ICA (2012), pp. 372-379.
- (2012) Proc. 10th Int. Conf., LVA/ICA , pp. 372-379
- Deleforge, A.¹ Horaud, R.P.²

14
- 84872299752
- Binaural localization of multiple sources in reverberant and noisy environments
- J. Woodruff and D. Wang, Binaural localization of multiple sources in reverberant and noisy environments, IEEE Trans. Acoust., Speech, Signal Process. 20(5) (2012) 1503-1512.
- (2012) IEEE Trans. Acoust., Speech, Signal Process. , vol.20 , Issue.5 , pp. 1503-1512
- Woodruff, J.¹ Wang, D.²

15
- 3142694930
- Blind separation of speech mixtures via time-frequency masking
- O. Yilmaz and S. Rickard, Blind separation of speech mixtures via time-frequency masking, IEEE Trans. Signal Process. 52 (2004) 1830-1847.
- (2004) IEEE Trans. Signal Process. , vol.52 , pp. 1830-1847
- Yilmaz, O.¹ Rickard, S.²

16
- 33846803485
- On the use of spatial cues to improve binaural source separation
- H. Viste and G. Evangelista, On the use of spatial cues to improve binaural source separation, in Proc. Int. Conf. Digital Audio Effects (DAFX) (2003), pp. 209-213.
- (2003) Proc. Int. Conf. Digital Audio Effects (DAFX) , pp. 209-213
- Viste, H.¹ Evangelista, G.²

17
- 84885582292
- 2D binaural sound localization: For urban search and rescue robotics
- A. R. Kullaib, M. Al-Mualla and D. Vernon, 2D binaural sound localization: For urban search and rescue robotics, in Proc. Mobile Robotics (2009), pp. 423-435.
- (2009) Proc. Mobile Robotics , pp. 423-435
- Kullaib, A.R.¹ Al-Mualla, M.² Vernon, D.³

18
- 34548740335
- Robotic localization and separation of concurrent sound sources using self-splitting competitive learning
- F. Keyrouz, W. Maier and K. Diepold, Robotic localization and separation of concurrent sound sources using self-splitting competitive learning, in Proc. IEEE Symp. Computational Intelligence in Image and Signal Processing (CIISP) (2007), pp. 340-345.
- (2007) Proc. IEEE Symp. Computational Intelligence in Image and Signal Processing (CIISP) , pp. 340-345
- Keyrouz, F.¹ Maier, W.² Diepold, K.³

19
- 34250663912
- Sound localization for humanoid robots - Building audio-motor maps based on the HRTF
- J. Hörnstein, M. Lopes, J. Santos-Victor and F. Lacerda, Sound localization for humanoid robots - Building audio-motor maps based on the HRTF, in Proc. IEEE/RSJ IROS Int. Conf. Intelligent Robots and Systems (2006), pp. 1170-1176.
- (2006) Proc. IEEE/RSJ IROS Int. Conf. Intelligent Robots and Systems , pp. 1170-1176
- Hörnstein, J.¹ Lopes, M.² Santos-Victor, J.³ Lacerda, F.⁴

20
- 33644671582
- Relearning sound localization with new ears
- P. M. Hofman, J. G. Van Riswick, A. J. Van Opstal et al., Relearning sound localization with new ears, Nature Neurosci. 1(5) (1998) 417-421.
- (1998) Nature Neurosci. , vol.1 , Issue.5 , pp. 417-421
- Hofman, P.M.¹ Van Riswick, J.G.² Van Opstal, A.J.³

21
- 33748640807
- A review of learning with normal and altered sound-localization cues in human adults
- B. A. Wright and Y. Zhang, A review of learning with normal and altered sound-localization cues in human adults, Int. J. Audiol. 45(S1) (2006) 92-98.
- (2006) Int. J. Audiol. , vol.45 , Issue.S1 , pp. 92-98
- Wright, B.A.¹ Zhang, Y.²

22
- 41549142403
- A sensorimotor approach to sound localization
- M. Aytekin, C. F. Moss and J. Z. Simon, A sensorimotor approach to sound localization, Neural Comput. 20(3) (2008) 603-635.
- (2008) Neural Comput. , vol.20 , Issue.3 , pp. 603-635
- Aytekin, M.¹ Moss, C.F.² Simon, J.Z.³

23
- 0010938132
- (Science Press, New York), by G. B. Halsted, Trans. of La valeur de la science (1905)
- H. Poincaré, The Foundations of Science; Science and Hypothesis, the Value of Science, Science and Method (Science Press, New York, 1929), by G. B. Halsted, Trans. of La valeur de la science (1905).
- (1929) The Foundations of Science; Science and Hypothesis, the Value of Science, Science and Method
- Poincaré, H.¹

24
- 0035495009
- A sensorimotor account of vision and visual consciousness
- J. K. O'Regan and A. Noe, A sensorimotor account of vision and visual consciousness, Behav. Brain Sci. 24 (2001) 939-1031.
- (2001) Behav. Brain Sci. , vol.24 , pp. 939-1031
- O'Regan, J.K.¹ Noe, A.²

25
- 0141629808
- Movement-produced stimulation in the development of visually guided behavior
- R. Held and A. Hein, Movement-produced stimulation in the development of visually guided behavior, J. Comp. Physiol. Psychol. 56(5) (1963) 872-876.
- (1963) J. Comp. Physiol. Psychol. , vol.56 , Issue.5 , pp. 872-876
- Held, R.¹ Hein, A.²

26
- 84870718978
- 2D sound-source localization on the binaural manifold
- (IEEE, Santander, Spain)
- A. Deleforge and R. P. Horaud, 2D sound-source localization on the binaural manifold, in IEEE Workshop on Machine Learning for Signal Processing (IEEE, Santander, Spain, 2012), pp. 1-6.
- (2012) IEEE Workshop on Machine Learning for Signal Processing , pp. 1-6
- Deleforge, A.¹ Horaud, R.P.²

27
- 84890515034
- Variational em for binaural sound-source separation and localization
- (IEEE, Vancouver, Canada)
- A. Deleforge, F. Forbes and R. P. Horaud, Variational EM for binaural sound-source separation and localization, in ICASSP 2013-38th Int. Conf. Acoustics, Speech, and Signal Processing (IEEE, Vancouver, Canada, 2013), pp. 76-80.
- (2013) ICASSP 2013-38th Int. Conf. Acoustics, Speech, and Signal Processing , pp. 76-80
- Deleforge, A.¹ Forbes, F.² Horaud, R.P.³

28
- 0038705102
- One microphone source separation
- (MIT Press)
- S. T. Roweis, One microphone source separation, in Advances in Neural Information Processing Systems, Vol. 13 (MIT Press, 2000), pp. 793-799.
- (2000) Advances in Neural Information Processing Systems , vol.13 , pp. 793-799
- Roweis, S.T.¹

29
- 78349276635
- Single microphone blind audio source separation using EMKalman filter and short+long term AR modeling
- (Springer)
- S. Bensaid, A. Schutz and D. T. M. Slock, Single microphone blind audio source separation using EMKalman filter and short+long term AR modeling, in Latent Variable Analysis and Signal Separation (Springer, 2010), pp. 106-113.
- (2010) Latent Variable Analysis and Signal Separation , pp. 106-113
- Bensaid, S.¹ Schutz, A.² Slock, D.T.M.³

30
- 77956012298
- Academic Press (Elsevier))
- P. Comon and C. Jutten, Handbook of Blind Source Separation, Independent Component Analysis and Applications (Academic Press (Elsevier), 2010).
- (2010) Handbook of Blind Source Separation, Independent Component Analysis and Applications
- Comon, P.¹ Jutten, C.²

31
- 77955675017
- Under-determined reverberant audio source separation using a full-rank spatial covariance model
- N. Q. K. Duong, E. Vincent and R. Gribonval, Under-determined reverberant audio source separation using a full-rank spatial covariance model, IEEE Trans. Audio Signal Lang. Process. 18(7) (2010) 1830-1840.
- (2010) IEEE Trans. Audio Signal Lang. Process. , vol.18 , Issue.7 , pp. 1830-1840
- Duong, N.Q.K.¹ Vincent, E.² Gribonval, R.³

32
- 85008544097
- Modelbased expectation-maximization source separation and localization
- M. I. Mandel, R. J. Weiss and D. P. W. Ellis, Modelbased expectation-maximization source separation and localization, IEEE Trans. Audio, Speech Lang. Process. 18(2) (2010) 382-394.
- (2010) IEEE Trans. Audio, Speech Lang. Process. , vol.18 , Issue.2 , pp. 382-394
- Mandel, M.I.¹ Weiss, R.J.² Ellis, D.P.W.³

33
- 33745819990
- Overview and recent advances in partial least squares
- (Springer)
- R. Rosipal and N. Krämer, Overview and recent advances in partial least squares, in Subspace, Latent Structure and Feature Selection, Vol. 3940 (Springer, 2006), pp. 34-51.
- (2006) Subspace, Latent Structure and Feature Selection , vol.3940 , pp. 34-51
- Rosipal, R.¹ Krämer, N.²

34
- 84945116550
- Sliced inverse regression for dimension reduction
- K. C. Li, Sliced inverse regression for dimension reduction, J. Am. Stat. Assoc. 86(414) (1991) 316-327.
- (1991) J. Am. Stat. Assoc. , vol.86 , Issue.414 , pp. 316-327
- Li, K.C.¹

35
- 53549120920
- Kernel sliced inverse regression with applications to classification
- H. M. Wu, Kernel sliced inverse regression with applications to classification, J. Comput. Graph. Stat. 17(3) (2008) 590-610.
- (2008) J. Comput. Graph. Stat. , vol.17 , Issue.3 , pp. 590-610
- Wu, H.M.¹

36
- 38249004888
- Mixtures of linear regressions
- R. D. de Veaux, Mixtures of linear regressions, Comput. Stat. Data Anal. 8(3) (1989) 227-245.
- (1989) Comput. Stat. Data Anal. , vol.8 , Issue.3 , pp. 227-245
- De Veaux, R.D.¹

37
- 85140116568
- An alternative model for mixtures of experts
- L. Xu, M. I. Jordan and G. E. Hinton, An alternative model for mixtures of experts, in Proc. Neural Information Processing Systems (NIPS) Conf., Vol. 7 (1995), pp. 633-640.
- (1995) Proc. Neural Information Processing Systems (NIPS) Conf. , vol.7 , pp. 633-640
- Xu, L.¹ Jordan, M.I.² Hinton, G.E.³

38
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain and M. W. Macon, Spectral voice conversion for text-to-speech synthesis, in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), Vol. 1 (1998), pp. 285-288.
- (1998) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , vol.1 , pp. 285-288
- Kain, A.¹ MacOn, M.W.²

39
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé and E. Moulines, Continuous probabilistic transform for voice conversion, IEEE Trans. Acoust., Speech, Signal Process. 6 (1998) 131-142.
- (1998) IEEE Trans. Acoust., Speech, Signal Process. , vol.6 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

40
- 38649140222
- Statistical mapping between articulatory movements and acoustic spectrum using a gaussian mixture model
- T. Toda, A. Black and K. Tokuda, Statistical mapping between articulatory movements and acoustic spectrum using a gaussian mixture model, Speech Commun. 50(3) (2008) 215-227.
- (2008) Speech Commun. , vol.50 , Issue.3 , pp. 215-227
- Toda, T.¹ Black, A.² Tokuda, K.³

41
- 70349218133
- Mixture of probabilistic linear regressions: A unified view of GMM-based mapping techniques
- Y. Qiao and N. Minematsu, Mixture of probabilistic linear regressions: A unified view of GMM-based mapping techniques, in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) (2009) pp. 3913-3916.
- (2009) Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , pp. 3913-3916
- Qiao, Y.¹ Minematsu, N.²

42
- 84925011012
- arXiv.1308.2302
- A. Deleforge, F. Forbes and R. Horaud, Highdimensional regression with gaussian mixtures and partially-latent response variables, arXiv.1308.2302 (2013).
- (2013) Highdimensional Regression with Gaussian Mixtures and Partially-latent Response Variables
- Deleforge, A.¹ Forbes, F.² Horaud, R.³

43
- 65549154412
- Numerical study on source-distance dependency of head-related transfer functions
- M. Otani, T. Hirahara and S. Ise, Numerical study on source-distance dependency of head-related transfer functions, J. Acoust. Soc. Am. 125(5) (2009) 3253-3261.
- (2009) J. Acoust. Soc. Am. , vol.125 , Issue.5 , pp. 3253-3261
- Otani, M.¹ Hirahara, T.² Ise, S.³

44
- 0003548585
- Linguistic Data Consortium, Philadelphia
- J. S. Garofolo et al., TIMIT Acoustic-Phonetic Continuous Speech Corpus, Linguistic Data Consortium, Philadelphia (1993).
- (1993) TIMIT Acoustic-Phonetic Continuous Speech Corpus
- Garofolo, J.S.¹

45
- 83455194596
- Supervised source localization using diffusion kernels
- R. Talmon, I. Cohen and S. Gannot, Supervised source localization using diffusion kernels, in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (2011), pp. 245-248.
- (2011) Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , pp. 245-248
- Talmon, R.¹ Cohen, I.² Gannot, S.³

46
- 84867920399
- Principal manifolds and nonlinear dimensionality reduction via tangent space alignment
- (English Edition)
- Z. Zhang and H. Zha, Principal manifolds and nonlinear dimensionality reduction via tangent space alignment, Journal of Shanghai University (English Edition) 8(4) (2004) 406-424.
- (2004) Journal of Shanghai University , vol.8 , Issue.4 , pp. 406-424
- Zhang, Z.¹ Zha, H.²

47
- 13844295342
- The variational Bayesian em algorithm for incomplete data: With application to scoring graphical model structures
- M. Beal and Z. Ghahramani, The variational Bayesian EM algorithm for incomplete data: With application to scoring graphical model structures, Bayesian Stat. 7 (2003) 453-464.
- (2003) Bayesian Stat. , vol.7 , pp. 453-464
- Beal, M.¹ Ghahramani, Z.²

48
- 0036881034
- Self-localizing dynamic microphone arrays
- P. Aarabi, Self-localizing dynamic microphone arrays, IEEE Trans. Syst. Man, Cybern. C, Appl. Rev. 32(4) (2002) 474-484.
- (2002) IEEE Trans. Syst. Man, Cybern. C, Appl. Rev. , vol.32 , Issue.4 , pp. 474-484
- Aarabi, P.¹

49
- 84872736510
- A source localization/separation/respatialization system based on unsupervised classification of interaural cues
- (Montreal, Canada)
- J. Mouba and S. Marchand, A source localization/separation/respatialization system based on unsupervised classification of interaural cues, in Proc. Int. Conf. Digital Audio Effects (Montreal, Canada, 2006), pp. 233-238.
- (2006) Proc. Int. Conf. Digital Audio Effects , pp. 233-238
- Mouba, J.¹ Marchand, S.²

50
- 11144223199
- A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics
- H. Buchner, R. Aichner and W. Kellermann, A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics, IEEE Trans. Audio, Speech Lang. Process. 13(1) (2005) 120-134.
- (2005) IEEE Trans. Audio, Speech Lang. Process. , vol.13 , Issue.1 , pp. 120-134
- Buchner, H.¹ Aichner, R.² Kellermann, W.³

51
- 50249118229
- A Two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures
- (New Paltz, NY)
- H. Sawada, S. Araki and S. Makino, A Two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures, in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (New Paltz, NY, 2007), pp. 139-142.
- (2007) Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) , pp. 139-142
- Sawada, H.¹ Araki, S.² Makino, S.³

52
- 33744975847
- Performance measurement in blind audio source separation
- E. Vincent, R. Gribonval and C. Févotte, Performance measurement in blind audio source separation, IEEE Trans. Audio, Speech Lang. Process. 14(4) (2006) 1462-1469.
- (2006) IEEE Trans. Audio, Speech Lang. Process. , vol.14 , Issue.4 , pp. 1462-1469
- Vincent, E.¹ Gribonval, R.² Févotte, C.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.