메뉴 건너뛰기




Volumn 16, Issue 4, 2008, Pages 728-739

Binaural tracking of multiple moving sources

Author keywords

Binaural processing; Hidden Markov model (HMM); Moving source tracking; Multisource tracking

Indexed keywords

ACROSS TIME; AUDITORY SCENE ANALYSIS; BINAURAL CUES; BINAURAL PROCESSING; FREQUENCY CHANNELS; HIDDEN MARKOV MODEL (HMM); HMM MODELS; LIKELIHOOD FUNCTIONS; MOVING SOUND SOURCES; MOVING SOURCE TRACKING; MULTISOURCE TRACKING; SOURCE LOCATIONS; TARGET SPACES; TIME FRAMES; TIME FREQUENCIES; TRACKING ALGORITHMS;

EID: 64849095806     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.918978     Document Type: Article
Times cited : (78)

References (48)
  • 1
    • 0032142014 scopus 로고    scopus 로고
    • Environmental conditions and acoustic transduction in hands-free speech recognition
    • M. Omologo, P. Svaizer, and M. Matasoni, "Environmental conditions and acoustic transduction in hands-free speech recognition," Speech Commun., vol. 25, no. 1-3, pp. 75-95, 1998.
    • (1998) Speech Commun , vol.25 , Issue.1-3 , pp. 75-95
    • Omologo, M.1    Svaizer, P.2    Matasoni, M.3
  • 2
    • 4544339441 scopus 로고    scopus 로고
    • Clustering and segmenting speakers and dieir locations in meetings
    • J. Ajmera, G. Lathoud, and L. McCowan, "Clustering and segmenting speakers and dieir locations in meetings," Proc. ICASSP, vol. 1, pp. 605-608, 2004.
    • (2004) Proc. ICASSP , vol.1 , pp. 605-608
    • Ajmera, J.1    Lathoud, G.2    McCowan, L.3
  • 3
    • 85079282164 scopus 로고    scopus 로고
    • A Bayesian approach to multiple-target tracking
    • D. L. Hall and J. LLinas, Eds. Boca Raton, FL: CRC
    • L. D. Stone, "A Bayesian approach to multiple-target tracking," in Handbook of Multisensor Fusion, D. L. Hall and J. LLinas, Eds. Boca Raton, FL: CRC, 2001.
    • (2001) Handbook of Multisensor Fusion
    • Stone, L.D.1
  • 4
    • 33847141944 scopus 로고    scopus 로고
    • Target tracking
    • S. Stergiopoulos, Ed. Boca Raton, FL: CRC
    • W. Koch, "Target tracking," in Advanced Signal Processing Handbook, S. Stergiopoulos, Ed. Boca Raton, FL: CRC, 2001.
    • (2001) Advanced Signal Processing Handbook
    • Koch, W.1
  • 5
    • 0018689571 scopus 로고
    • An algorithm for tracking multiple targets
    • Dec
    • D. Reid, "An algorithm for tracking multiple targets," IEEE Trans. Autom. Control, vol. AC-24, no. 6, pp. 84-90, Dec. 1979.
    • (1979) IEEE Trans. Autom. Control , vol.AC-24 , Issue.6 , pp. 84-90
    • Reid, D.1
  • 6
    • 0016552254 scopus 로고
    • Tracking in a cluttered environment with probabilistic data association
    • Y. Bar-Shalom and E. Tse, "Tracking in a cluttered environment with probabilistic data association," Automatica, vol. 11, pp. 451-460, 1975.
    • (1975) Automatica , vol.11 , pp. 451-460
    • Bar-Shalom, Y.1    Tse, E.2
  • 7
    • 0033715157 scopus 로고    scopus 로고
    • A new pruning/merging algorithm for MHT multitarget tracking
    • K. Buckley, A. Vaddiraju, and R. Perry, "A new pruning/merging algorithm for MHT multitarget tracking," in Proc. IEEE Int. Radar Conf., 2000, pp. 71-75.
    • (2000) Proc. IEEE Int. Radar Conf , pp. 71-75
    • Buckley, K.1    Vaddiraju, A.2    Perry, R.3
  • 8
    • 0030782871 scopus 로고    scopus 로고
    • A hybrid bootstrap filter for target tracking clutter
    • N. Gordon, "A hybrid bootstrap filter for target tracking clutter," IEEE Trans. Aerosp. Electron. Syst., vol. 33, no. 1, pp. 353-358, 1997.
    • (1997) IEEE Trans. Aerosp. Electron. Syst , vol.33 , Issue.1 , pp. 353-358
    • Gordon, N.1
  • 9
    • 0032136153 scopus 로고    scopus 로고
    • Condensation-Conditional density propagation for visual tracking
    • Jan
    • M. Isard and A. Blake, "Condensation-Conditional density propagation for visual tracking," Int. J. Comput. Vision, vol. 29, no. 1, pp. 5-28, Jan. 1998.
    • (1998) Int. J. Comput. Vision , vol.29 , Issue.1 , pp. 5-28
    • Isard, M.1    Blake, A.2
  • 10
    • 27544460134 scopus 로고    scopus 로고
    • Sequential Monte Carlo methods for Bayesian multi-target filtering with random finite sets
    • Oct
    • B.-N. Vo, S. Singh, and A. Doucet, "Sequential Monte Carlo methods for Bayesian multi-target filtering with random finite sets." IEEE Trans. Aerosp. Electron. Syst., vol. 41, no. 4, pp. 1224-1245, Oct. 2005.
    • (2005) IEEE Trans. Aerosp. Electron. Syst , vol.41 , Issue.4 , pp. 1224-1245
    • Vo, B.-N.1    Singh, S.2    Doucet, A.3
  • 11
    • 0026373025 scopus 로고
    • Multiple target tracking and multiple frequency line tracking using hidden Markov models
    • Dec
    • X. Xie and R. J. Evans, "Multiple target tracking and multiple frequency line tracking using hidden Markov models," IEEE Trans. Signal Process., vol. 39, no. 12, pp. 2659-2676, Dec. 1991.
    • (1991) IEEE Trans. Signal Process , vol.39 , Issue.12 , pp. 2659-2676
    • Xie, X.1    Evans, R.J.2
  • 12
    • 0030782875 scopus 로고    scopus 로고
    • Data fusion and tracking using HMMs in a distributed sensor network
    • F. Martinerie, "Data fusion and tracking using HMMs in a distributed sensor network," IEEE Trans. Aerosp. Electron. Syst., vol. 33, no. 1, pp. 11-28, 1997.
    • (1997) IEEE Trans. Aerosp. Electron. Syst , vol.33 , Issue.1 , pp. 11-28
    • Martinerie, F.1
  • 13
    • 0016990291 scopus 로고
    • The generalized correlation method for estimation of time delay
    • Aug
    • C. Knapp and G. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-24, no. 4, pp. 320-327, Aug. 1976.
    • (1976) IEEE Trans. Acoust. Speech Signal Process , vol.ASSP-24 , Issue.4 , pp. 320-327
    • Knapp, C.1    Carter, G.2
  • 14
    • 0030193445 scopus 로고    scopus 로고
    • Two decades of array signal processing research: The parametric approach
    • Jul
    • H. Krim and M. Viberg, "Two decades of array signal processing research: The parametric approach," IEEE Signal Process. Mag., vol. 13, no. 4, pp. 67-94, Jul. 1996.
    • (1996) IEEE Signal Process. Mag , vol.13 , Issue.4 , pp. 67-94
    • Krim, H.1    Viberg, M.2
  • 15
    • 0030681710 scopus 로고    scopus 로고
    • Tracking multiple talkers using microphone-array measurements
    • D. E. Sturim, M. S. Brandstein, and H. F. Silverman, "Tracking multiple talkers using microphone-array measurements," Proc. ICASSP, vol. 1, pp. 371-374, 1997.
    • (1997) Proc. ICASSP , vol.1 , pp. 371-374
    • Sturim, D.E.1    Brandstein, M.S.2    Silverman, H.F.3
  • 16
    • 64849102959 scopus 로고    scopus 로고
    • Multi-array fusion for beamforming and localization of moving speakers
    • I. Potamitis, G. Tremoulis, and N. Fakotakis, "Multi-array fusion for beamforming and localization of moving speakers," Proc. Eurospeech, vol. 2, pp. 1721-1724, 2003.
    • (2003) Proc. Eurospeech , vol.2 , pp. 1721-1724
    • Potamitis, I.1    Tremoulis, G.2    Fakotakis, N.3
  • 17
    • 0034842138 scopus 로고    scopus 로고
    • Nonlinear filtering for speaker tracking in noisy and reverberant environments
    • J. Vermaak and A. Blake, "Nonlinear filtering for speaker tracking in noisy and reverberant environments," Proc. ICASSP, vol. 5, pp. 3021-3024, 2001.
    • (2001) Proc. ICASSP , vol.5 , pp. 3021-3024
    • Vermaak, J.1    Blake, A.2
  • 18
    • 0347337998 scopus 로고    scopus 로고
    • Particle filtering algorithms for hacking an acoustic source in a reverberant environment
    • Nov
    • D. B. Ward, E. A. Lehmann, and R. C. Williamson, "Particle filtering algorithms for hacking an acoustic source in a reverberant environment," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 826-836, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.6 , pp. 826-836
    • Ward, D.B.1    Lehmann, E.A.2    Williamson, R.C.3
  • 19
    • 0036905669 scopus 로고    scopus 로고
    • Particle filters for tracking an unknown number of sources
    • Dec
    • J.-R. Larocque, J. P. Reilly, and W. Ng, "Particle filters for tracking an unknown number of sources," IEEE Trans. Signal Process., vol. 50, no. 12, pp. 2926-2937, Dec. 2002.
    • (2002) IEEE Trans. Signal Process , vol.50 , Issue.12 , pp. 2926-2937
    • Larocque, J.-R.1    Reilly, J.P.2    Ng, W.3
  • 20
    • 33750390953 scopus 로고    scopus 로고
    • Tracking an unknown time-varying number of speakers using TDOA measurements: A random finite set approach
    • Sep
    • W.-K. Ma, B.-N. Vo, S. S. Singh, and A. Baddeley, "Tracking an unknown time-varying number of speakers using TDOA measurements: A random finite set approach," IEEE Trans. Signal Process., vol. 54, no. 9, pp. 3291-3304, Sep. 2006.
    • (2006) IEEE Trans. Signal Process , vol.54 , Issue.9 , pp. 3291-3304
    • Ma, W.-K.1    Vo, B.-N.2    Singh, S.S.3    Baddeley, A.4
  • 21
    • 52149108294 scopus 로고    scopus 로고
    • Combined estimation of spectral envelopes and sound source direction of concurrent voices by multidimensional statistical filtering
    • Mar
    • J. Nix and V. Hohmann, "Combined estimation of spectral envelopes and sound source direction of concurrent voices by multidimensional statistical filtering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 995-1008, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 995-1008
    • Nix, J.1    Hohmann, V.2
  • 22
    • 84880877816 scopus 로고    scopus 로고
    • K. Nakadai, K. Hidai, H. Mizoguchi, H. G. Okuno, and H. Ki - tano, Real-time auditory and visual multiple-object tracking for humanoids, Proc. 17th IJCAI, pp. 1425-1432, 2001.
    • K. Nakadai, K. Hidai, H. Mizoguchi, H. G. Okuno, and H. Ki - tano, "Real-time auditory and visual multiple-object tracking for humanoids," Proc. 17th IJCAI, pp. 1425-1432, 2001.
  • 23
    • 0142026377 scopus 로고    scopus 로고
    • Speech segregation based on sound localization
    • N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Amer., vol. 114, pp. 2236-2252, 2003.
    • (2003) J. Acoust. Soc. Amer , vol.114 , pp. 2236-2252
    • Roman, N.1    Wang, D.L.2    Brown, G.J.3
  • 24
    • 84872895918 scopus 로고    scopus 로고
    • Localization-based grouping
    • A. S. Feng and D. L. Jones, D. L. Wang and G. J. Brown, Eds, New York: Wiley/IEEE Press
    • A. S. Feng and D. L. Jones., D. L. Wang and G. J. Brown, Eds., "Localization-based grouping," in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications. New York: Wiley/IEEE Press, 2006, pp. 187-207.
    • (2006) Computational Auditory Scene Analysis: Principles, Algorithms, and Applications , pp. 187-207
  • 25
    • 0016522748 scopus 로고
    • Anthropometric manikin for acoustic research
    • M. D. Burkhard and R. M. Sachs, "Anthropometric manikin for acoustic research," J. Acoust. Soc. Amer., vol. 58, pp. 214-222, 1975.
    • (1975) J. Acoust. Soc. Amer , vol.58 , pp. 214-222
    • Burkhard, M.D.1    Sachs, R.M.2
  • 26
    • 0037767686 scopus 로고    scopus 로고
    • A multipitch tracking algorithm for noisy speech
    • May
    • M. Wu, D. L. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 229-241, May 2003.
    • (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.3 , pp. 229-241
    • Wu, M.1    Wang, D.L.2    Brown, G.J.3
  • 27
    • 4644333729 scopus 로고    scopus 로고
    • Pitch tracking and speech enhancement in noisy and reverberant environments,
    • Ph.D. dissertation, Comput. Inf. Sci, The Ohio State Univ, Columbus
    • M. Wu, "Pitch tracking and speech enhancement in noisy and reverberant environments," Ph.D. dissertation, Comput. Inf. Sci., The Ohio State Univ., Columbus, 2003.
    • (2003)
    • Wu, M.1
  • 29
    • 30844435714 scopus 로고    scopus 로고
    • Sound source localization in real sound fields based on empirical statistics of interaural parameters
    • J. Nix and V. Hohmann, "Sound source localization in real sound fields based on empirical statistics of interaural parameters," J. Acoust. Soc. Amer., vol. 119, pp. 463-479, 2006.
    • (2006) J. Acoust. Soc. Amer , vol.119 , pp. 463-479
    • Nix, J.1    Hohmann, V.2
  • 30
    • 0004089083 scopus 로고
    • HRTF measurements of a KEMAR dummy-head microphone
    • MIT Media Lab Perceptual Computing Tech. Rep, 280
    • W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR dummy-head microphone," MIT Media Lab Perceptual Computing Tech. Rep. #280, 1994.
    • (1994)
    • Gardner, W.G.1    Martin, K.D.2
  • 31
  • 32
    • 0017713218 scopus 로고
    • Transformation characteristics of the external human ear
    • S. Mehrgardt and V. Mellert, "Transformation characteristics of the external human ear,"J. Acoust. Soc. Amer., vol. 61, pp. 1567-1576, 1977.
    • (1977) J. Acoust. Soc. Amer , vol.61 , pp. 1567-1576
    • Mehrgardt, S.1    Mellert, V.2
  • 34
    • 0026628307 scopus 로고
    • A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction
    • D. J. Kistler and F. L. Wightman, "A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction," J. Acoust. Soc. Amer., vol. 91, pp. 1637-1647, 1992.
    • (1992) J. Acoust. Soc. Amer , vol.91 , pp. 1637-1647
    • Kistler, D.J.1    Wightman, F.L.2
  • 38
    • 0031119324 scopus 로고    scopus 로고
    • A model for prediction of thresholds, loudness and partial loudness
    • B. C. J. Moore, B. R. Glasberg, and T. Baer, "A model for prediction of thresholds, loudness and partial loudness," J. Audio Eng. Soc, vol. 45, pp. 224-240, 1997.
    • (1997) J. Audio Eng. Soc , vol.45 , pp. 224-240
    • Moore, B.C.J.1    Glasberg, B.R.2    Baer, T.3
  • 41
    • 0026220146 scopus 로고
    • A computer model of binaural localization for stereo imaging measurement
    • E. A. MacPherson, "A computer model of binaural localization for stereo imaging measurement," J. Audio Eng. Soc., vol. 39. pp. 604-622. 1991.
    • (1991) J. Audio Eng. Soc , vol.39 , pp. 604-622
    • MacPherson, E.A.1
  • 42
    • 0035528674 scopus 로고    scopus 로고
    • Idiot's Bayes-Not so stupid after all?
    • D. J. Hand and K. Yu, "Idiot's Bayes-Not so stupid after all?," Int. Stat. Review, vol. 69, pp. 385-398, 2001.
    • (2001) Int. Stat. Review , vol.69 , pp. 385-398
    • Hand, D.J.1    Yu, K.2
  • 46
    • 4644304197 scopus 로고    scopus 로고
    • A binaural processor for missing data speech recognition in the presence of noise and small - room reverberation
    • K. J. Palomäki, G. J. Brown, and D. L. Wang, "A binaural processor for missing data speech recognition in the presence of noise and small - room reverberation,"Speech Comm., vol. 43, pp. 361-378. 2004.
    • (2004) Speech Comm , vol.43 , pp. 361-378
    • Palomäki, K.J.1    Brown, G.J.2    Wang, D.L.3
  • 47
    • 9644281074 scopus 로고    scopus 로고
    • Source localization in complex listening situations: Selection of binaural cues based on interaural coherence
    • C. Faller and J. Merimaa, "Source localization in complex listening situations: Selection of binaural cues based on interaural coherence," J. Acoust. Soc. Amer., vol. 116, pp. 3075-3089, 2004.
    • (2004) J. Acoust. Soc. Amer , vol.116 , pp. 3075-3089
    • Faller, C.1    Merimaa, J.2
  • 48
    • 34047275587 scopus 로고    scopus 로고
    • An auditory onset detection algorithm for improved automatic source localization
    • May
    • B. Supper, T. Brookes, and F. Rumsey, "An auditory onset detection algorithm for improved automatic source localization," IEEE Trans. Speech Audio Process., vol. 14, no. 3, pp. 1008-1017, May 2006.
    • (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.3 , pp. 1008-1017
    • Supper, B.1    Brookes, T.2    Rumsey, F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.