SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 8, 2007, Pages 2257-2269

Speech enhancement and recognition in meetings with an audio-visual sensor array

(3) Maganti, Hari Krishna a Gatica Perez, Daniel b McCowan, Iain c

a UNIVERSITY OF ULM (Germany)

b EPFL (Switzerland)

c QUEENSLAND UNIVERSITY OF TECHNOLOGY (Australia)

Author keywords

Audio x2013; visual fusion; Microphone array processing; Multiobject tracking; Speech enhancement; Speech recognition

Indexed keywords

AUDIO VISUALS; AUDIO-VISUAL SENSORS; BEAM-FORMING TECHNIQUES; INTEGRATED APPROACHES; MICROPHONE ARRAY PROCESSING; MICROPHONE ARRAYS; MULTIOBJECT TRACKING; POST-FILTERING; RECOGNITION PERFORMANCE; SPATIAL FILTERING; SPEAKER TRACKING; SPEECH ACQUISITIONS; SPEECH RECOGNITION SYSTEMS; SPEECH SIGNALS; TABLE TOPS;

ARRAY PROCESSING; BEAMFORMING; COMMUNICATION CHANNELS (INFORMATION THEORY); IMAGE SENSORS; MICROPHONES; QUALITY CONTROL; SPEECH ANALYSIS; SPEECH ENHANCEMENT;

SPEECH RECOGNITION;

EID: 40249089621 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.906197 Document Type: Article

Times cited : (52)

References (59)

1
- 26844474912
- Living laboratories: The future computing environments group at the Georgia Institute of Technology
- Hague, Apr
- G. Abowd et al., "Living laboratories: The future computing environments group at the Georgia Institute of Technology," in Proc. Conf. Human Factors in Comput. Syst. (CHI), Hague, Apr. 2000, pp. 215-216.
- (2000) Proc. Conf. Human Factors in Comput. Syst. (CHI) , pp. 215-216
- Abowd, G.¹

2
- 10244242647
- Detection and separation of speech event using audio and video information fusion
- F. Asano et al., "Detection and separation of speech event using audio and video information fusion," J. Appl. Signal Process., vol. 11, pp. 1727-1738, 2004.
- (2004) J. Appl. Signal Process , vol.11 , pp. 1727-1738
- Asano, F.¹

3
- 0005540823
- New York: ACM
- R. Baeza-Yates and B. Ribeiro-Neto,Modern Information Retrieval. New York: ACM, 1999.
- (1999) Modern Information Retrieval
- Baeza-Yates, R.¹ Ribeiro-Neto, B.²

4
- 0344044776
- Audio-video sensor fusion with probabilistic graphical models
- Copenhagen, May
- M. Beal, H. Attias, and N. Jojic, "Audio-video sensor fusion with probabilistic graphical models," in Proc. Eur. Conf. Comput. Vision (ECCV), Copenhagen, May 2002.
- (2002) Proc. Eur. Conf. Comput. Vision (ECCV)
- Beal, M.¹ Attias, H.² Jojic, N.³

5
- 0032665455
- Theoretical noise reduction limits of the generalized sidelobe canceller (GSC) for speech enhancement
- J. Bitzer, K. S. Uwe, and K. Kammeyer, "Theoretical noise reduction limits of the generalized sidelobe canceller (GSC) for speech enhancement," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1999, vol. 5, pp. 2965-2968.
- (1999) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , vol.5 , pp. 2965-2968
- Bitzer, J.¹ Uwe, K.S.² Kammeyer, K.³

6
- 38049107298
- A generative approach to audio-visual person tracking
- Southampton, U.K, Apr
- R. Brunelli et al., "A generative approach to audio-visual person tracking," in Proc. CLEAR Evaluation Workshop, Southampton, U.K., Apr. 2006, pp. 55-68.
- (2006) Proc. CLEAR Evaluation Workshop , pp. 55-68
- Brunelli, R.¹

7
- 4544347587
- Multiple person and speaker activity tracking with a particle filter
- Montreal, QC, Canada, May
- N. Checka, K. Wilson, M. Siracusa, and T. Darrell, "Multiple person and speaker activity tracking with a particle filter," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Montreal, QC, Canada, May 2004, pp. V-881-V-884.
- (2004) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
- Checka, N.¹ Wilson, K.² Siracusa, M.³ Darrell, T.⁴

8
- 0029304865
- Human and machine recognition of faces: A survey
- May
- R. Chellapa, C.Wilson, and A. Sirohey, "Human and machine recognition of faces: A survey," Proc. IEEE, vol. 83, no. 5, pp. 705-740, May 1995.
- (1995) Proc. IEEE , vol.83 , Issue.5 , pp. 705-740
- Chellapa, R.¹ Wilson, C.² Sirohey, A.³

9
- 21244492850
- Real-time speaker tracking using particle filter sensor fusion
- Mar
- Y. Chen and Y. Rui, "Real-time speaker tracking using particle filter sensor fusion," Proc. IEEE, vol. 92, no. 3, pp. 485-494, Mar. 2004.
- (2004) Proc. IEEE , vol.92 , Issue.3 , pp. 485-494
- Chen, Y.¹ Rui, Y.²

10
- 33745564272
- Automatic speech recognition and speech activity detection in the chil smart room
- Edinburgh, U.K, Jul
- S. M. Chu, E. Marcheret, and G. Potamianos, "Automatic speech recognition and speech activity detection in the chil smart room," in Proc. JointWorkshop Multimodal Interaction and Related Machine Learning Algorithms (MLMI), Edinburgh, U.K., Jul. 2005, pp. 332-343.
- (2005) Proc. JointWorkshop Multimodal Interaction and Related Machine Learning Algorithms (MLMI) , pp. 332-343
- Chu, S.M.¹ Marcheret, E.² Potamianos, G.³

11
- 84896473177
- Measurement of correlation coefficients in reverberant sound fields
- R. K. Cook, R. V. Waterhouse, R. D. Berendt, S. Edelman, and M. C. Thompson, Jr, "Measurement of correlation coefficients in reverberant sound fields," J. Acoust. Soc. Amer., vol. 27, pp. 1072-1077, 1955.
- (1955) J. Acoust. Soc. Amer , vol.27 , pp. 1072-1077
- Cook, R.K.¹ Waterhouse, R.V.² Berendt, R.D.³ Edelman, S.⁴ Thompson Jr, M.C.⁵

12
- 84943735747
- Robust adaptive beamforming
- Oct
- H. Cox, R. Zeskind, and M. Owen, "Robust adaptive beamforming," IEEE Trans. Acoust., Speech. Signal Process., vol. ASSP-35, no. 10, pp. 1365-1376, Oct. 1987.
- (1987) IEEE Trans. Acoust., Speech. Signal Process , vol.ASSP-35 , Issue.10 , pp. 1365-1376
- Cox, H.¹ Zeskind, R.² Owen, M.³

13
- 0022738930
- Practical supergain
- Jun
- H. Cox, R. Zeskind, and I. Kooij, "Practical supergain," IEEE Trans. Acoust., Speech. Signal Process., vol. ASSP-34, no. 3, pp. 393-397, Jun. 1986.
- (1986) IEEE Trans. Acoust., Speech. Signal Process , vol.ASSP-34 , Issue.3 , pp. 393-397
- Cox, H.¹ Zeskind, R.² Kooij, I.³

14
- 0030715160
- Multi-modal tracking of faces for video communications
- San Juan, Puerto Rico, Jun
- J. Crowley and P. Berard, "Multi-modal tracking of faces for video communications," in Proc. Conf. Comput. Vision Pattern Recognition (CVPR), San Juan, Puerto Rico, Jun. 1997, pp. 640-645.
- (1997) Proc. Conf. Comput. Vision Pattern Recognition (CVPR) , pp. 640-645
- Crowley, J.¹ Berard, P.²

15
- 4344692646
- A high-accuracy, low-latency technique for talker localization in reverberant environments,
- Ph.D. dissertation, Brown Univ, Providence, RI
- J. DiBiase, "A high-accuracy, low-latency technique for talker localization in reverberant environments," Ph.D. dissertation, Brown Univ., Providence, RI, 2000.
- (2000)
- DiBiase, J.¹

16
- 0003343412
- Robust localization in reverberant rooms
- New York: Springer
- J. DiBiase, H. Silverman, and M. Brandstein, "Robust localization in reverberant rooms," in Microphone Arrays. New York: Springer, 2001, vol. 8, pp. 157-180.
- (2001) Microphone Arrays , vol.8 , pp. 157-180
- DiBiase, J.¹ Silverman, H.² Brandstein, M.³

17
- 0003363117
- Superdirectional microphone arrays
- S. Gay and J. Benesty, Eds. Norwell, MA: Kluwer, ch. 10, pp
- G. W. Elko, "Superdirectional microphone arrays," in Acoustic Signal Processing for Telecommunication, S. Gay and J. Benesty, Eds. Norwell, MA: Kluwer, 2000, ch. 10, pp. 181-237.
- (2000) Acoustic Signal Processing for Telecommunication , pp. 181-237
- Elko, G.W.¹

18
- 0003665481
- New York: Springer-Verlag
- A. Doucet, N. de Freitas, and N. Gordon, Sequential Monte Carlo Methods in Practice. New York: Springer-Verlag, 2001.
- (2001) Sequential Monte Carlo Methods in Practice
- Doucet, A.¹ de Freitas, N.² Gordon, N.³

19
- 0009622481
- Learning joint statistical models for audio-visual fusion and segregation
- Denver, CO, Dec
- J. Fisher, T. Darrell, W. T. Freeman, and P. Viola, "Learning joint statistical models for audio-visual fusion and segregation," in Proc. Neural Inf. Process. Syst. (NIPS), Denver, CO, Dec. 2000, pp. 772-778.
- (2000) Proc. Neural Inf. Process. Syst. (NIPS) , pp. 772-778
- Fisher, J.¹ Darrell, T.² Freeman, W.T.³ Viola, P.⁴

20
- 33749440990
- A mixed-state i-Particle filter for multi-camera speaker tracking
- Nice, France, Oct
- D. Gatica-Perez, G. Lathoud, I. McCowan, and J.-M. Odobez, "A mixed-state i-Particle filter for multi-camera speaker tracking," in Proc. IEEE Conf. Comput. Vision, Workshop on Multimedia Technologies for E-learning and Collaboration(ICCV-WOMTEC), Nice, France, Oct. 2003.
- (2003) Proc. IEEE Conf. Comput. Vision, Workshop on Multimedia Technologies for E-learning and Collaboration(ICCV-WOMTEC)
- Gatica-Perez, D.¹ Lathoud, G.² McCowan, I.³ Odobez, J.-M.⁴

21
- 32344434893
- Multimodal multispeaker probabilistic tracking in meetings
- Trento, Italy, Oct
- D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Multimodal multispeaker probabilistic tracking in meetings," in Proc. IEEE Conf. Multimedia Interfaces (ICMI), Trento, Italy, Oct. 2005, pp. 183-190.
- (2005) Proc. IEEE Conf. Multimedia Interfaces (ICMI) , pp. 183-190
- Gatica-Perez, D.¹ Lathoud, G.² Odobez, J.-M.³ McCowan, I.⁴

22
- 64149093817
- Audiovisual probabilistic tracking of multiple speakers in meetings
- Feb
- D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Audiovisual probabilistic tracking of multiple speakers in meetings," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 601-616, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.2 , pp. 601-616
- Gatica-Perez, D.¹ Lathoud, G.² Odobez, J.-M.³ McCowan, I.⁴

23
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Acoust., Speech. Signal Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Acoust., Speech. Signal Process , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

24
- 0035681891
- Microphone array source localization using realizable delay vectors
- New York, Oct
- S. M. Griebel and M. S. Brandstein, "Microphone array source localization using realizable delay vectors," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA), New York, Oct. 2001, pp. 71-74.
- (2001) Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust. (WASPAA) , pp. 71-74
- Griebel, S.M.¹ Brandstein, M.S.²

25
- 33745533302
- The development of the AMI system for the transcription of speech in meetings
- Edinburgh, U.K, Jul
- T. Hain et al., "The development of the AMI system for the transcription of speech in meetings," in Proc. Joint Workshop Multimodal Interaction and Related Mach. Learn. Algorithms (MLMI), Edinburgh, U.K., Jul. 2005, pp. 344-356.
- (2005) Proc. Joint Workshop Multimodal Interaction and Related Mach. Learn. Algorithms (MLMI) , pp. 344-356
- Hain, T.¹

26
- 0003440081
- 2nd ed. Cambridge, U.K, Cambridge Univ. Press
- R. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, 2nd ed. Cambridge, U.K.: Cambridge Univ. Press, 2001.
- (2001) Multiple View Geometry in Computer Vision
- Hartley, R.¹ Zisserman, A.²

27
- 0032136153
- CONDENSATION: Conditional density propagation for visual tracking
- M. Isard and A. Blake, "CONDENSATION: Conditional density propagation for visual tracking," Proc. Int. J. Comput. Vision, vol. 29, no. 1, pp. 5-28, 1998.
- (1998) Proc. Int. J. Comput. Vision , vol.29 , Issue.1 , pp. 5-28
- Isard, M.¹ Blake, A.²

28
- 0037774471
- Audio-visual localization of multiple speakers in a video teleconferencing setting
- B. Kapralos, M. Jenkin, and E. Milios, "Audio-visual localization of multiple speakers in a video teleconferencing setting," Int. J. Imaging Syst. Technol., vol. 13, pp. 95-105, 2003.
- (2003) Int. J. Imaging Syst. Technol , vol.13 , pp. 95-105
- Kapralos, B.¹ Jenkin, M.² Milios, E.³

29
- 64149097954
- 3D audiovisual person tracking using Kalman filtering and information theory
- Southampton, U.K, Apr
- N. Katsarakis et al., "3D audiovisual person tracking using Kalman filtering and information theory," in Proc. CLEAR Evaluation Workshop, Southampton, U.K., Apr. 2006, pp. 45-54.
- (2006) Proc. CLEAR Evaluation Workshop , pp. 45-54
- Katsarakis, N.¹

30
- 35048868406
- An MCMC-based particle filter for tracking multiple interacting targets
- Prague, May
- Z. Khan, T. Balch, and F. Dellaert, "An MCMC-based particle filter for tracking multiple interacting targets," in Proc. Eur. Conf. Comput. Vision (ECCV), Prague, May 2004, pp. 279-290.
- (2004) Proc. Eur. Conf. Comput. Vision (ECCV) , pp. 279-290
- Khan, Z.¹ Balch, T.² Dellaert, F.³

31
- 0033707896
- HMM adaptation and microphone array processing for distant speech recognition
- Istanbul, Turkey, Jun
- J. Kleban and Y. Gong, "HMM adaptation and microphone array processing for distant speech recognition," in Proc. Int. Conf. Acoust. , Speech, Signal Process. (ICASSP), Istanbul, Turkey, Jun. 2000, pp. 1411-1414.
- (2000) Proc. Int. Conf. Acoust. , Speech, Signal Process. (ICASSP) , pp. 1411-1414
- Kleban, J.¹ Gong, Y.²

32
- 0016990291
- The generalized correlation method for estimation of time delay
- Aug
- C. Knapp and G. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech. Signal Process., vol. ASSP-24, no. 4, pp. 320-327, Aug. 1976.
- (1976) IEEE Trans. Acoust., Speech. Signal Process , vol.ASSP-24 , Issue.4 , pp. 320-327
- Knapp, C.¹ Carter, G.²

33
- 0030193445
- Two decades of array signal processing research: The parametric approach
- Jul
- H. Krim and M. Viberg, "Two decades of array signal processing research: The parametric approach," IEEE Signal Process. Mag., vol. 13, no. 4, pp. 67-94, Jul. 1996.
- (1996) IEEE Signal Process. Mag , vol.13 , Issue.4 , pp. 67-94
- Krim, H.¹ Viberg, M.²

34
- 84890538086
- A sector-based approach for localization of multiple speakers with microphone arrays
- Jeju, Korea, Oct
- G. Lathoud and I. McCowan, "A sector-based approach for localization of multiple speakers with microphone arrays," in Proc. ISCA Workshop Statistical and Perceptual Audio Process. (SAPA), Jeju, Korea, Oct. 2004.
- (2004) Proc. ISCA Workshop Statistical and Perceptual Audio Process. (SAPA)
- Lathoud, G.¹ McCowan, I.²

35
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, 1995.
- (1995) Comput. Speech Lang , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

36
- 0004182828
- New York: Springer-Verlag
- J. S. Liu, Monte Carlo Strategies in Scientific Computing. New York: Springer-Verlag, 2001.
- (2001) Monte Carlo Strategies in Scientific Computing
- Liu, J.S.¹

37
- 33846217002
- The multichannelWall Street Journal audio-visual corpus (MC-WSJ-AV): Specification and initial experiments
- San Juan, Puerto Rico, Dec
- M. Lincoln, I. McCowan, J. Vepa, and H. K. Maganti, "The multichannelWall Street Journal audio-visual corpus (MC-WSJ-AV): Specification and initial experiments," in IEEE Autom. Speech Recognition Understanding Workshop (ASRU), San Juan, Puerto Rico, Dec. 2005, pp. 357-362.
- (2005) IEEE Autom. Speech Recognition Understanding Workshop (ASRU) , pp. 357-362
- Lincoln, M.¹ McCowan, I.² Vepa, J.³ Maganti, H.K.⁴

38
- 0009653561
- Post-filtering techniques
- New York: Springer
- K. S. Uwe, J. Bitzer, and C. Marro, "Post-filtering techniques," in Microphone Arrays. New York: Springer, 2001, vol. 3, pp. 36-60.
- (2001) Microphone Arrays , vol.3 , pp. 36-60
- Uwe, K.S.¹ Bitzer, J.² Marro, C.³

39
- 33745529486
- Microphone array driven speech recognition: Influence of localization on the word error rate
- Edinburgh, U.K, Jul
- M. Wolfel, K. Nickel, and J. McDonough, "Microphone array driven speech recognition: Influence of localization on the word error rate," in Proc. Joint Workshop Multimodal Interaction and Related Mach. Learn. Algorithms (MLMI), Edinburgh, U.K., Jul. 2005, pp. 320-331.
- (2005) Proc. Joint Workshop Multimodal Interaction and Related Mach. Learn. Algorithms (MLMI) , pp. 320-331
- Wolfel, M.¹ Nickel, K.² McDonough, J.³

40
- 0032072917
- Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering
- May
- C. Marro, Y. Mahieux, and K. U. Simmer, "Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering," IEEE Trans. Speech Audio Process., vol. 6, no. 3, pp. 240-259, May 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.3 , pp. 240-259
- Marro, C.¹ Mahieux, Y.² Simmer, K.U.³

41
- 0346707504
- Microphone array post-filter based on noise field coherence
- Nov
- I. McCowan and H. Bourlard, "Microphone array post-filter based on noise field coherence," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 709-716, Nov. 2003.
- (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.6 , pp. 709-716
- McCowan, I.¹ Bourlard, H.²

42
- 33750570839
- Speech acquisition in meetings with an audio-visual sensor array
- Amsterdam, The Netherlands, Jul
- I. McCowan, M. Hari-Krishna, D. Gatica-Perez, D. Moore, and S. Ba, "Speech acquisition in meetings with an audio-visual sensor array," in Proc. IEEE Int. Conf. Multimedia (ICME), Amsterdam, The Netherlands, Jul. 2005, pp. 1382-1385.
- (2005) Proc. IEEE Int. Conf. Multimedia (ICME) , pp. 1382-1385
- McCowan, I.¹ Hari-Krishna, M.² Gatica-Perez, D.³ Moore, D.⁴ Ba, S.⁵

43
- 0030677479
- Multi-channel speech enhancment in a car environment using Wiener filtering and spectral subtraction
- Munich, Germany, Apr
- J.Meyer and K. U. Simmer, "Multi-channel speech enhancment in a car environment using Wiener filtering and spectral subtraction," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Munich, Germany, Apr. 1997, pp. 1167-1170.
- (1997) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 1167-1170
- Meyer, J.¹ Simmer, K.U.²

44
- 0011990786
- The meeting project at ICSI
- San Diego, CA, Mar
- N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, and A. Stolcke, "The meeting project at ICSI," in Proc. Human Lang. Technol. Conf., San Diego, CA, Mar. 2001, pp. 1-7.
- (2001) Proc. Human Lang. Technol. Conf , pp. 1-7
- Morgan, N.¹ Baron, D.² Edwards, J.³ Ellis, D.⁴ Gelbart, D.⁵ Janin, A.⁶ Pfau, T.⁷ Shriberg, E.⁸ Stolcke, A.⁹

45
- 0141631692
- Microphone array speech recognition: Experiments on overlapping speech in meetings
- Hong Kong, Apr
- D. Moore and I. McCowan, "Microphone array speech recognition: Experiments on overlapping speech in meetings," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Hong Kong, Apr. 2003, pp. V-497-V-500.
- (2003) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
- Moore, D.¹ McCowan, I.²

46
- 34547165111
- An audio-visual particle filter for speaker tracking on the CLEAR'06 evaluation dataset
- Southampton, U.K, Apr
- K. Nickel, T. Gehrig, H. K. Ekenel, J. McDonough, and R. Stiefelhagen, "An audio-visual particle filter for speaker tracking on the CLEAR'06 evaluation dataset," in Proc. CLEAR Evaluation Workshop, Southampton, U.K., Apr. 2006, pp. 69-80.
- (2006) Proc. CLEAR Evaluation Workshop , pp. 69-80
- Nickel, K.¹ Gehrig, T.² Ekenel, H.K.³ McDonough, J.⁴ Stiefelhagen, R.⁵

47
- 33745577702
- The rich transcription 2005 spring meeting recognition evaluation
- Edinburgh, U.K, Jul
- J. G. Fiscus, N. Radde, J. S. Garofolo, A. Le, J. Ajot, and C. Laprun, "The rich transcription 2005 spring meeting recognition evaluation," in Proc. NIST MLMI Meeting Recognition Workshop, Edinburgh, U.K., Jul. 2005, pp. 369-389.
- (2005) Proc. NIST MLMI Meeting Recognition Workshop , pp. 369-389
- Fiscus, J.G.¹ Radde, N.² Garofolo, J.S.³ Le, A.⁴ Ajot, J.⁵ Laprun, C.⁶

48
- 0030676367
- Microphone array based speech recognition with different talker-array positions
- Munich, Germany, Apr
- M. Omologo, M. Matassoni, P. Svaizer, and D. Giuliani, "Microphone array based speech recognition with different talker-array positions," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP),Munich, Germany, Apr. 1997, pp. 227-230.
- (1997) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 227-230
- Omologo, M.¹ Matassoni, M.² Svaizer, P.³ Giuliani, D.⁴

49
- 0028996854
- WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition
- Detroit, MI, Apr
- T. R. al, "WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Detroit, MI, Apr. 1995, pp. 81-84.
- (1995) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 81-84
- al, T.R.¹

50
- 85009230793
- Factorial models and refiltering for speech separation and denoising
- Geneva, Switzerland, Sep
- S. Roweis, "Factorial models and refiltering for speech separation and denoising," in Proc. Eurospeech Conf. Speech Commun. Technol. (Eurospeech- 2003), Geneva, Switzerland, Sep. 2003, pp. 1009-1012.
- (2003) Proc. Eurospeech Conf. Speech Commun. Technol. (Eurospeech- 2003) , pp. 1009-1012
- Roweis, S.¹

51
- 0030681710
- Tracking multiple talkers using microphone-array measurements
- Munich, Germany, Apr
- D. Sturim, M. Brandstein, and H. Silverman, "Tracking multiple talkers using microphone-array measurements," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Munich, Germany, Apr. 1997, pp. 371-374.
- (1997) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 371-374
- Sturim, D.¹ Brandstein, M.² Silverman, H.³

52
- 0023985457
- Beamforming:A versatile approach to spatial filtering, IEEE Acoust., Speech
- Apr
- B. D. V.Veen and K. M. Buckley, "Beamforming:A versatile approach to spatial filtering," IEEE Acoust., Speech, Signal Process. Mag., vol. 5, no. 2, pp. 4-24, Apr. 1988.
- (1988) Signal Process. Mag , vol.5 , Issue.2 , pp. 4-24
- Veen, B.D.V.¹ Buckley, K.M.²

53
- 0034844366
- Sequential Monte Carlo fusion of sound and vision for speaker tracking
- Vancouver, BC, Canada, Jul
- J. Vermaak, M. Gagnet, A. Blake, and P. Perez, "Sequential Monte Carlo fusion of sound and vision for speaker tracking," in Proc. Int. Conf. Comput. Vision (ICCV), Vancouver, BC, Canada, Jul. 2001, pp. 741-746.
- (2001) Proc. Int. Conf. Comput. Vision (ICCV) , pp. 741-746
- Vermaak, J.¹ Gagnet, M.² Blake, A.³ Perez, P.⁴

54
- 85143190952
- A. Waibel, T. Schultz, M. Bett, R. Malkin, I. Rogina, R. Stiefelhagen, and J. Yang, Smart: The smart meeting room task at ISL, in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Hong Kong, Apr. 2003, pp. IV-752-IV-754.
- A. Waibel, T. Schultz, M. Bett, R. Malkin, I. Rogina, R. Stiefelhagen, and J. Yang, "Smart: The smart meeting room task at ISL," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Hong Kong, Apr. 2003, pp. IV-752-IV-754.

55
- 0036298833
- Particle filter beamforming for acoustic source localization in a reverberant environment
- Orlando, FL, May
- D. Ward and R. Williamson, "Particle filter beamforming for acoustic source localization in a reverberant environment," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Orlando, FL, May 2002, pp. 1777-1780.
- (2002) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 1777-1780
- Ward, D.¹ Williamson, R.²

56
- 0030718943
- Multilingual large vocabulary speech recognition: The European SQUALE project
- S. J. Young et al., "Multilingual large vocabulary speech recognition: The European SQUALE project," Comput. Speech Lang., vol. 11, no. 1, pp. 73-89, 1997.
- (1997) Comput. Speech Lang , vol.11 , Issue.1 , pp. 73-89
- Young, S.J.¹

57
- 0023773764
- A microphone array with adaptive post-filtering for noise reduction in reverberant rooms
- New York, Apr
- R. Zelinski, "A microphone array with adaptive post-filtering for noise reduction in reverberant rooms," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), New York, Apr. 1988, pp. 2578-2581.
- (1988) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 2578-2581
- Zelinski, R.¹

58
- 0033284445
- Flexible camera calibration by viewing a plane from unknown orientations
- Kerkyra, Greece, Sep
- Z. Zhang, "Flexible camera calibration by viewing a plane from unknown orientations," in Proc. Int. Conf. Computer Vision (ICCV), Kerkyra, Greece, Sep. 1999, pp. 666-673.
- (1999) Proc. Int. Conf. Computer Vision (ICCV) , pp. 666-673
- Zhang, Z.¹

59
- 84962674645
- Multimodal 3-D tracking and event detection via the particle filter
- Vancouver, BC, Canada, Jul
- D. Zotkin, R. Duraiswami, and L. Davis, "Multimodal 3-D tracking and event detection via the particle filter," in Proc. Int. Conf. Comput. Vision, Workshop on Detection and Recognition of Events in Video (ICCV-EVENT), Vancouver, BC, Canada, Jul. 2001, pp. 20-27.
- (2001) Proc. Int. Conf. Comput. Vision, Workshop on Detection and Recognition of Events in Video (ICCV-EVENT) , pp. 20-27
- Zotkin, D.¹ Duraiswami, R.² Davis, L.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.