2006, Pages 35-38

Speaker localization for microphone array-based ASR: The effects of accuracy on overlapping speech

Author keywords

Audio visual speaker tracking; Microphone array ASR

Indexed keywords

CAMERAS; COMPUTER MUSIC; ERROR ANALYSIS; INFORMATION THEORY; MICROPHONES; OPTIMAL CONTROL SYSTEMS

EID: 34547166917     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1180995.1181004     Document Type: Conference Paper
Times cited: 11

References (16)
  • 1
    • 34547156362
    • Detection and Separation of Speech Event using audio and video information fusion
    • F. Asano et al. "Detection and Separation of Speech Event using audio and video information fusion," Journal of Applied Signal Processing, 2004.
    • (2004) Journal of Applied Signal Processing
    • Asano, F.1
  • 3
    • 0030715160
    • Multi-modal tracking of faces for video communications
    • June
    • J. Crowley and P. Berard. "Multi-modal tracking of faces for video communications," CVPR, June, 1997.
    • (1997) CVPR
    • Crowley, J.1    Berard, P.2
  • 5
    • 32344434893
    • Multimodal Multispeaker Probabilistic Tracking in Meetings
    • Oct
    • D. Gatica-Perez et al. "Multimodal Multispeaker Probabilistic Tracking in Meetings," ICMI, Oct., 2005.
    • (2005) ICMI
    • Gatica-Perez, D.1
  • 7
    • 50949118216
    • Multiple View Geometry in Computer Vision
    • R. Hartley and A. Zisserman. "Multiple View Geometry in Computer Vision," CU Press, 2001.
    • (2001) CU Press
    • Hartley, R.1    Zisserman, A.2
  • 8
    • 33745533302
    • The Development of the AMI System for the Transcriptions of Speech in Meetings
    • July
    • T. Hain et al. "The Development of the AMI System for the Transcriptions of Speech in Meetings," Proc. MLMI, July, 2005.
    • (2005) Proc. MLMI
    • Hain, T.1
  • 9
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland. "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, 9(2):171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 10
    • 84890538086
    • A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays
    • Oct
    • G. Lathoud and I. McCowan. "A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays," Proc. SAPA, Oct., 2004.
    • (2004) Proc. SAPA
    • Lathoud, G.1    McCowan, I.2
  • 11
    • 33750570839
    • Speech Acquisition in Meetings with an Audio-Visual Sensor Array
    • July
    • I. McCowan et al. "Speech Acquisition in Meetings with an Audio-Visual Sensor Array," ICME, July, 2005.
    • (2005) ICME
    • McCowan, I.1
  • 12
    • 0141631692
    • Microphone array speech recognition: Experiments on overlapping speech in meetings
    • Apr
    • D. Moore and I. McCowan. "Microphone array speech recognition: Experiments on overlapping speech in meetings," Proc. ICASSP, Apr., 2003.
    • (2003) Proc. ICASSP
    • Moore, D.1    McCowan, I.2
  • 13
    • 34547169210
    • A joint particle filter for audio-visual speaker tracking
    • Oct
    • K. Nickel et al. "A joint particle filter for audio-visual speaker tracking," Proc. ICMI, Oct., 2005.
    • (2005) Proc. ICMI
    • Nickel, K.1
  • 14
    • 0028996854
    • WSJCAM0: A British English Speech Corpus for Large Vocabulary Continuous Speech Recognition
    • April
    • T. Robinson et al. "WSJCAM0: A British English Speech Corpus for Large Vocabulary Continuous Speech Recognition," Proc. ICASSP, April, 1995.
    • (1995) Proc. ICASSP
    • Robinson, T.1
  • 15
    • 85009145345
    • Observations on overlap: Findings and implications for automatic processing of multi-party conversation
    • Sep
    • E. Shriberg et al. "Observations on overlap: findings and implications for automatic processing of multi-party conversation," Eurospeech, Sep., 2001.
    • (2001) Eurospeech
    • Shriberg, E.1
  • 16
    • 34250174176
    • Microphone Array Driven Speech Recognition: Influence of Localization on the Word Error Rate
    • July
    • M. Wolfel et al. "Microphone Array Driven Speech Recognition: Influence of Localization on the Word Error Rate," Proc. MLMI, July, 2005.
    • (2005) Proc. MLMI
    • Wolfel, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.