Volume 4, 2007, Pages 2512-2515

Detection, diarization, and transcription of far-field lecture speech

Author keywords

Lectures; Smart rooms; Speaker diarization; Speech activity detection; Speech processing; Speech recognition

Indexed keywords

SPEECH; SPEECH PROCESSING; SPEECH RECOGNITION; TRANSCRIPTION

EID: 56149089096     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 4

References (13)
  • 3
    • 56149084482
    • "The NIST SmartSpace Laboratory." [Online]. Available: http://www.nist.gov/smartspace
  • 4
    • 77249114287
    • J.G. Fiscus, J. Ajot, M. Michel, and J.S. Garofolo, "The Rich Transcription 2006 Spring meeting recognition evaluation," in Machine Learning for Multimodal Interaction, S. Renals, S. Bengio, and J.G. Fiscus (Eds.), LNCS vol. 4299, pp. 309-322, 2006.
  • 5
    • 50449101617
    • E. Marcheret, G. Potamianos, K. Visweswariah, and J. Huang, "The IBM RT06s evaluation system for speech activity detection in CHIL seminars," in Machine Learning for Multimodal Interaction, S. Renals, S. Bengio, and J.G. Fiscus (Eds.), LNCS vol. 4299, pp. 323-335, 2006.
  • 6
    • 47949095692
    • J. Huang, M. Westphal, S. Chen, et al., "The IBM Rich Transcription Spring 2006 speech-to-text system for lecture meetings," in Machine Learning for Multimodal Interaction, S. Renals, S. Bengio, and J.G. Fiscus (Eds.), LNCS vol. 4299, pp. 432-443, 2006.
  • 7
    • 0030638031
    • J.G. Fiscus, "A post-processing system to yield reduced word error rates: Recogniser output voting error reduction (ROVER)," in Proc. ASRU Workshop, Santa Barbara, CA, pp. 347-352, 1997.
  • 8
    • 33646818291
    • O. Siohan, B. Ramabhadran, and B. Kingsbury, "Constructing ensembles of ASR systems using randomized decision trees," in Proc. Int. Conf. Acoustics Speech Signal Process., Philadelphia, PA, vol. 1, pp. 197-200, 2005.
  • 9
    • 50449107696
    • "The LDC Corpus Catalog," Linguistic Data Consortium, University of Pennsylvania, Philadelphia, PA. [Online]. Available: http://www.ldc.upenn.edu
  • 12
    • 0036296863
    • D. Povey and P.C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. Int. Conf. Acoustics Speech Signal Process., Orlando, FL, pp. 105-108, 2002.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.