메뉴 건너뛰기




Volumn , Issue , 2012, Pages 431-438

The cocktail party robot: Sound source separation and localisation with an active binaural head

Author keywords

blind source separation; computational auditory scene analysis; em algorithm; learning

Indexed keywords

ACOUSTIC SIGNALS; ARTIFICIAL HEAD; BINAURAL HEARING; COCKTAIL PARTY; COMPUTATIONAL AUDITORY SCENE ANALYSIS; ELECTRONIC DEVICE; EM ALGORITHMS; HUMAN-ROBOT COMMUNICATION; LEARNING; LOCALISATION; MIXTURE MODEL; PROBABILISTIC MODELS; SOUND SOURCE SEPARATION; SPEED OF CONVERGENCE; TWO MICROPHONES;

EID: 84859997410     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2157689.2157834     Document Type: Conference Paper
Times cited : (40)

References (26)
  • 2
    • 0017626192 scopus 로고
    • Short term spectral analysis, synthesis, and modification by discrete Fourier transform
    • DOI 10.1109/TASSP.1977.1162950
    • J. Allen. Short-term spectral analysis, synthesis, and modification by discrete fourier transform. IEEE Trans. Acous., Speech and Signal Process., 25(3):235-238, 1977. (Pubitemid 8196363)
    • (1977) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.25 , Issue.3 , pp. 235-238
    • Allen, J.B.1
  • 3
    • 41549142403 scopus 로고    scopus 로고
    • A sensorimotor approach to sound localization
    • M. Aytekin, C. F. Moss, and J. Z. Simon. A sensorimotor approach to sound localization. Neural Computation, 20(3):603-635, 2008.
    • (2008) Neural Computation , vol.20 , Issue.3 , pp. 603-635
    • Aytekin, M.1    Moss, C.F.2    Simon, J.Z.3
  • 4
    • 78349276635 scopus 로고    scopus 로고
    • Single microphone blind audio source separation using EM-Kalman filter and short+long term AR modeling
    • S. Bensaid, A. Schutz, and D. T. M. Slock. Single microphone blind audio source separation using EM-Kalman filter and short+long term AR modeling. In Latent Variable Analysis and Signal Separation, pages 106-113, 2010.
    • (2010) Latent Variable Analysis and Signal Separation , pp. 106-113
    • Bensaid, S.1    Schutz, A.2    Slock, D.T.M.3
  • 5
    • 0001626339 scopus 로고
    • A classification EM algorithm for clustering and two stochastic versions
    • G. Celeux and G. Govaert. A classification EM algorithm for clustering and two stochastic versions. Computational Statistics and Data Analysis, 14(3):315-332, 1992.
    • (1992) Computational Statistics and Data Analysis , vol.14 , Issue.3 , pp. 315-332
    • Celeux, G.1    Govaert, G.2
  • 9
    • 22944480530 scopus 로고    scopus 로고
    • The cocktail party problem
    • DOI 10.1162/0899766054322964
    • S. Haykin and Z. Chen. The cocktail party problem. Neural Computation, 17:1875-1902, 2005. (Pubitemid 41053627)
    • (2005) Neural Computation , vol.17 , Issue.9 , pp. 1875-1902
    • Haykin, S.1    Chen, Z.2
  • 10
    • 34250663912 scopus 로고    scopus 로고
    • Sound localization for humanoid robots - Building audiomotor maps based on the HRTF
    • J. Hörnstein, M. Lopes, J. Santos-Victor, and F. Lacerda. Sound localization for humanoid robots - building audiomotor maps based on the HRTF. In Proc. of IEEE/RSJ IROS, pages 1170-1176, 2006.
    • (2006) Proc. of IEEE/RSJ IROS , pp. 1170-1176
    • Hörnstein, J.1    Lopes, M.2    Santos-Victor, J.3    Lacerda, F.4
  • 11
    • 34548740335 scopus 로고    scopus 로고
    • Robotic localization and separation of concurrent sound sources using self-splitting competitive learning
    • Hawaii, Apr.
    • F. Keyrouz, W. Maier, and K. Diepold. Robotic localization and separation of concurrent sound sources using self-splitting competitive learning. In Proc. of IEEE CIISP, pages 340-345, Hawaii, Apr. 2007.
    • (2007) Proc. of IEEE CIISP , pp. 340-345
    • Keyrouz, F.1    Maier, W.2    Diepold, K.3
  • 12
    • 33947655001 scopus 로고    scopus 로고
    • A new method for binaural 3D localization based on HRTFs
    • May
    • F. Keyrouz, Y. Naous, and K. Diepold. A new method for binaural 3D localization based on HRTFs. In Proc. of IEEE ICASSP, volume 5, May 2006.
    • (2006) Proc. of IEEE ICASSP , vol.5
    • Keyrouz, F.1    Naous, Y.2    Diepold, K.3
  • 13
    • 78651517803 scopus 로고    scopus 로고
    • Conjugate mixture models for clustering multimodal data
    • Feb.
    • V. Khalidov, F. Forbes, and R. P. Horaud. Conjugate mixture models for clustering multimodal data. Neural Computation, 23(2):517-557, Feb. 2011.
    • (2011) Neural Computation , vol.23 , Issue.2 , pp. 517-557
    • Khalidov, V.1    Forbes, F.2    Horaud, R.P.3
  • 16
    • 30844435714 scopus 로고    scopus 로고
    • Sound source localization in real sound fields based on empirical statistics of interaural parameters
    • DOI 10.1121/1.2139619
    • J. Nix and V. Hohmann. Sound source localization in real sound fields based on empirical statistics of interaural parameters. Journal of the Acoustical Society of America, 119(1):463-479, 2006. (Pubitemid 43104728)
    • (2006) Journal of the Acoustical Society of America , vol.119 , Issue.1 , pp. 463-479
    • Nix, J.1    Hohmann, V.2
  • 17
    • 0035495009 scopus 로고    scopus 로고
    • A sensorimotor account of vision and visual consciousness
    • J. K. O'Regan and A. Noe. A sensorimotor account of vision and visual consciousness. Behavioral and Brain Sciences, 24:939-1031, 2001.
    • (2001) Behavioral and Brain Sciences , vol.24 , pp. 939-1031
    • O'Regan, J.K.1    Noe, A.2
  • 18
    • 65549154412 scopus 로고    scopus 로고
    • Numerical study on source-distance dependency of head-related transfer functions
    • M. Otani, T. Hirahara, and S. Ise. Numerical study on source-distance dependency of head-related transfer functions. Journal of the Acoustical Society of America, 125(5):3253-61, 2009.
    • (2009) Journal of the Acoustical Society of America , vol.125 , Issue.5 , pp. 3253-3261
    • Otani, M.1    Hirahara, T.2    Ise, S.3
  • 21
    • 18744392833 scopus 로고    scopus 로고
    • Localizing nearby sound sources in a classroom: Binaural room impulse responses
    • DOI 10.1121/1.1872572
    • B. Shinn-Cunningham, N. Kopco, and T. J. Martin. Localizing nearby sound sources in a classroom: Binaural room impulse responses. Journal of the Acoustical Society of America, 117(5):3100-3115, 2005. (Pubitemid 40675172)
    • (2005) Journal of the Acoustical Society of America , vol.117 , Issue.5 , pp. 3100-3115
    • Shinn-Cunningham, B.G.1    Kopco, N.2    Martin, T.J.3
  • 23
  • 25
    • 3142694930 scopus 로고    scopus 로고
    • Blind separation of speech mixtures via time-frequency masking
    • O. Yilmaz and S. Rickard. Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 52:1830-1847, 2004.
    • (2004) IEEE Transactions on Signal Processing , vol.52 , pp. 1830-1847
    • Yilmaz, O.1    Rickard, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.