



Volume 25, Issue 1, 2015, Pages

Acoustic space learning for sound-source separation and localization on binaural manifolds

Author keywords

Binaural hearing; EM inference; manifold learning; mixture of regressors; sound localization; sound source separation

Indexed keywords

ACOUSTIC GENERATORS; AUDITION; BAYESIAN NETWORKS; CLUSTERING ALGORITHMS; DIMENSIONALITY REDUCTION; FREQUENCY ESTIMATION; INFERENCE ENGINES; MAXIMUM PRINCIPLE; PIECEWISE LINEAR TECHNIQUES; SEPARATION;

EID: 84924985938     PISSN: 01290657     EISSN: 17936462     Source Type: Journal    
DOI: 10.1142/S0129065714400036     Document Type: Article
Times cited: 86

References (52)
  • 3
    • 84859997410
    • The cocktail party robot: Sound source separation and localisation with an active binaural head
    • A. Deleforge and R. P. Horaud, The cocktail party robot: Sound source separation and localisation with an active binaural head, In Proc. 7th ACM/IEEE Int. Conf. Human Robot Interaction (HRI) (2012), pp. 431-438.
    • (2012) Proc. 7th ACM/IEEE Int. Conf. Human Robot Interaction (HRI) , pp. 431-438
    • Deleforge, A.1    Horaud, R.P.2
  • 4
    • 80052339383
    • Some experiments on the recognition of speech, with one and with two ears
    • E. C. Cherry, Some experiments on the recognition of speech, with one and with two ears, J. Acoust. Soc. Am. 25(5) (1953) 975-979.
    • (1953) J. Acoust. Soc. Am. , vol.25 , Issue.5 , pp. 975-979
    • Cherry, E.C.1
  • 5
    • 22944480530
    • The cocktail party problem
    • S. Haykin and Z. Chen, The cocktail party problem, Neural Comput. 17 (2005) 1875-1902.
    • (2005) Neural Comput. , vol.17 , pp. 1875-1902
    • Haykin, S.1    Chen, Z.2
  • 6
    • 0000705358
    • On our perception of sound direction
    • L. Rayleigh, On our perception of sound direction, Philos. Mag. 13 (1907) 214-232.
    • (1907) Philos. Mag. , vol.13 , pp. 214-232
    • Rayleigh, L.1
  • 9
    • 0031813079
    • Spectrotemporal factors in two-dimensional human sound localization
    • P. M. Hofman and A. J. Van Opstal, Spectrotemporal factors in two-dimensional human sound localization, J. Acoust. Soc. Am. 103(5) (1998) 2634-2648.
    • (1998) J. Acoust. Soc. Am. , vol.103 , Issue.5 , pp. 2634-2648
    • Hofman, P.M.1    Van Opstal, A.J.2
  • 10
    • 73949127259
    • Azimuthal source localization using interaural coherence in a robotic dog: Modeling and application
    • R. Liu and Y. Wang, Azimuthal source localization using interaural coherence in a robotic dog: Modeling and application, Robotica 28(7) (2010) 1013-1020.
    • (2010) Robotica , vol.28 , Issue.7 , pp. 1013-1020
    • Liu, R.1    Wang, Y.2
  • 11
    • 84869795684
    • Geometrically constrained robust time delay estimation using non-coplanar microphone arrays
    • X. Alameda-Pineda and R. P. Horaud, Geometrically constrained robust time delay estimation using non-coplanar microphone arrays, in Proc. 20th Eur. Signal Processing Conf. (EUSIPCO) (2012), pp. 1309-1313.
    • (2012) Proc. 20th Eur. Signal Processing Conf. (EUSIPCO) , pp. 1309-1313
    • Alameda-Pineda, X.1    Horaud, R.P.2
  • 13
    • 84857334222
    • A latently constrained mixture model for audio source separation and localization
    • A. Deleforge and R. P. Horaud, A latently constrained mixture model for audio source separation and localization, in Proc. 10th Int. Conf., LVA/ICA (2012), pp. 372-379.
    • (2012) Proc. 10th Int. Conf., LVA/ICA , pp. 372-379
    • Deleforge, A.1    Horaud, R.P.2
  • 14
    • 84872299752
    • Binaural localization of multiple sources in reverberant and noisy environments
    • J. Woodruff and D. Wang, Binaural localization of multiple sources in reverberant and noisy environments, IEEE Trans. Audio, Speech, Lang. Process. 20(5) (2012) 1503-1512.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.5 , pp. 1503-1512
    • Woodruff, J.1    Wang, D.2
  • 15
    • 3142694930
    • Blind separation of speech mixtures via time-frequency masking
    • O. Yilmaz and S. Rickard, Blind separation of speech mixtures via time-frequency masking, IEEE Trans. Signal Process. 52 (2004) 1830-1847.
    • (2004) IEEE Trans. Signal Process. , vol.52 , pp. 1830-1847
    • Yilmaz, O.1    Rickard, S.2
  • 17
    • 84885582292
    • 2D binaural sound localization: For urban search and rescue robotics
    • A. R. Kullaib, M. Al-Mualla and D. Vernon, 2D binaural sound localization: For urban search and rescue robotics, in Proc. Mobile Robotics (2009), pp. 423-435.
    • (2009) Proc. Mobile Robotics , pp. 423-435
    • Kullaib, A.R.1    Al-Mualla, M.2    Vernon, D.3
  • 21
    • 33748640807
    • A review of learning with normal and altered sound-localization cues in human adults
    • B. A. Wright and Y. Zhang, A review of learning with normal and altered sound-localization cues in human adults, Int. J. Audiol. 45(S1) (2006) 92-98.
    • (2006) Int. J. Audiol. , vol.45 , Issue.S1 , pp. 92-98
    • Wright, B.A.1    Zhang, Y.2
  • 22
    • 41549142403
    • A sensorimotor approach to sound localization
    • M. Aytekin, C. F. Moss and J. Z. Simon, A sensorimotor approach to sound localization, Neural Comput. 20(3) (2008) 603-635.
    • (2008) Neural Comput. , vol.20 , Issue.3 , pp. 603-635
    • Aytekin, M.1    Moss, C.F.2    Simon, J.Z.3
  • 24
    • 0035495009
    • A sensorimotor account of vision and visual consciousness
    • J. K. O'Regan and A. Noe, A sensorimotor account of vision and visual consciousness, Behav. Brain Sci. 24 (2001) 939-1031.
    • (2001) Behav. Brain Sci. , vol.24 , pp. 939-1031
    • O'Regan, J.K.1    Noe, A.2
  • 25
    • 0141629808
    • Movement-produced stimulation in the development of visually guided behavior
    • R. Held and A. Hein, Movement-produced stimulation in the development of visually guided behavior, J. Comp. Physiol. Psychol. 56(5) (1963) 872-876.
    • (1963) J. Comp. Physiol. Psychol. , vol.56 , Issue.5 , pp. 872-876
    • Held, R.1    Hein, A.2
  • 29
    • 78349276635
    • Single microphone blind audio source separation using EM-Kalman filter and short+long term AR modeling
    • (Springer)
    • S. Bensaid, A. Schutz and D. T. M. Slock, Single microphone blind audio source separation using EM-Kalman filter and short+long term AR modeling, in Latent Variable Analysis and Signal Separation (Springer, 2010), pp. 106-113.
    • (2010) Latent Variable Analysis and Signal Separation , pp. 106-113
    • Bensaid, S.1    Schutz, A.2    Slock, D.T.M.3
  • 31
    • 77955675017
    • Under-determined reverberant audio source separation using a full-rank spatial covariance model
    • N. Q. K. Duong, E. Vincent and R. Gribonval, Under-determined reverberant audio source separation using a full-rank spatial covariance model, IEEE Trans. Audio, Speech, Lang. Process. 18(7) (2010) 1830-1840.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1830-1840
    • Duong, N.Q.K.1    Vincent, E.2    Gribonval, R.3
  • 32
  • 33
  • 34
    • 84945116550
    • Sliced inverse regression for dimension reduction
    • K. C. Li, Sliced inverse regression for dimension reduction, J. Am. Stat. Assoc. 86(414) (1991) 316-327.
    • (1991) J. Am. Stat. Assoc. , vol.86 , Issue.414 , pp. 316-327
    • Li, K.C.1
  • 35
    • 53549120920
    • Kernel sliced inverse regression with applications to classification
    • H. M. Wu, Kernel sliced inverse regression with applications to classification, J. Comput. Graph. Stat. 17(3) (2008) 590-610.
    • (2008) J. Comput. Graph. Stat. , vol.17 , Issue.3 , pp. 590-610
    • Wu, H.M.1
  • 36
    • 38249004888
    • Mixtures of linear regressions
    • R. D. de Veaux, Mixtures of linear regressions, Comput. Stat. Data Anal. 8(3) (1989) 227-245.
    • (1989) Comput. Stat. Data Anal. , vol.8 , Issue.3 , pp. 227-245
    • De Veaux, R.D.1
  • 40
    • 38649140222
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • T. Toda, A. Black and K. Tokuda, Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model, Speech Commun. 50(3) (2008) 215-227.
    • (2008) Speech Commun. , vol.50 , Issue.3 , pp. 215-227
    • Toda, T.1    Black, A.2    Tokuda, K.3
  • 43
    • 65549154412
    • Numerical study on source-distance dependency of head-related transfer functions
    • M. Otani, T. Hirahara and S. Ise, Numerical study on source-distance dependency of head-related transfer functions, J. Acoust. Soc. Am. 125(5) (2009) 3253-3261.
    • (2009) J. Acoust. Soc. Am. , vol.125 , Issue.5 , pp. 3253-3261
    • Otani, M.1    Hirahara, T.2    Ise, S.3
  • 46
    • 84867920399
    • Principal manifolds and nonlinear dimensionality reduction via tangent space alignment
    • (English Edition)
    • Z. Zhang and H. Zha, Principal manifolds and nonlinear dimensionality reduction via tangent space alignment, Journal of Shanghai University (English Edition) 8(4) (2004) 406-424.
    • (2004) Journal of Shanghai University , vol.8 , Issue.4 , pp. 406-424
    • Zhang, Z.1    Zha, H.2
  • 47
    • 13844295342
    • The variational Bayesian EM algorithm for incomplete data: With application to scoring graphical model structures
    • M. Beal and Z. Ghahramani, The variational Bayesian EM algorithm for incomplete data: With application to scoring graphical model structures, Bayesian Stat. 7 (2003) 453-464.
    • (2003) Bayesian Stat. , vol.7 , pp. 453-464
    • Beal, M.1    Ghahramani, Z.2
  • 49
    • 84872736510
    • A source localization/separation/respatialization system based on unsupervised classification of interaural cues
    • (Montreal, Canada)
    • J. Mouba and S. Marchand, A source localization/separation/respatialization system based on unsupervised classification of interaural cues, in Proc. Int. Conf. Digital Audio Effects (Montreal, Canada, 2006), pp. 233-238.
    • (2006) Proc. Int. Conf. Digital Audio Effects , pp. 233-238
    • Mouba, J.1    Marchand, S.2
  • 50
    • 11144223199
    • A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics
    • H. Buchner, R. Aichner and W. Kellermann, A generalization of blind source separation algorithms for convolutive mixtures based on second-order statistics, IEEE Trans. Speech Audio Process. 13(1) (2005) 120-134.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.1 , pp. 120-134
    • Buchner, H.1    Aichner, R.2    Kellermann, W.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.