메뉴 건너뛰기




Volumn , Issue , 2013, Pages 162-167

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes

Author keywords

'CHiME' Challenge; Noise robust ASR

Indexed keywords

'CHIME' CHALLENGE; AUTOMATIC SPEECH RECOGNITION; BASELINE SYSTEMS; DOMESTIC ENVIRONMENTS; FUTURE CHALLENGES; NOISE-ROBUST ASR; SPEECH SEPARATION; SYSTEM COMBINATION;

EID: 84893704157     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2013.6707723     Document Type: Conference Paper
Times cited : (73)

References (24)
  • 7
    • 84893675434 scopus 로고    scopus 로고
    • The TUM+TUT+KUL approach to the 2nd chime challenge: Multi-stream ASR exploiting BLSTM networks and sparse NMF
    • Vancouver, Canada, June
    • J. T. Geiger, F. Weninger, A. Hurmalainen, J. F. Gemmeke, M. Wollmer, B. Schuller, G. Rigoll, and T. Virtanen, The TUM+TUT+KUL approach to the 2nd CHiME challenge: Multi-stream ASR exploiting BLSTM networks and sparse NMF, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 25-30.
    • (2013) Proc. CHiME-2013 , pp. 25-30
    • Geiger, J.T.1    Weninger, F.2    Hurmalainen, A.3    Gemmeke, J.F.4    Wollmer, M.5    Schuller, B.6    Rigoll, G.7    Virtanen, T.8
  • 8
    • 84893694758 scopus 로고    scopus 로고
    • HMMregularization for NMF-based noise robust ASR
    • Vancouver, Canada, June
    • J. F. Gemmeke, A. Hurmalainen, and T. Virtanen, HMMregularization for NMF-based noise robust ASR, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 47- 52.
    • (2013) Proc. CHiME-2013 , pp. 47-52
    • Gemmeke, J.F.1    Hurmalainen, A.2    Virtanen, T.3
  • 9
    • 84893652593 scopus 로고    scopus 로고
    • Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech
    • Vancouver, Canada, June
    • A. Hurmalainen, J. F. Gemmeke, and T. Virtanen, Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech, in Proc. CHiME- 2013, Vancouver, Canada, June 2013, pp. 13-18.
    • (2013) Proc. CHiME- 2013 , pp. 13-18
    • Hurmalainen, A.1    Gemmeke, J.F.2    Virtanen, T.3
  • 10
    • 84893698854 scopus 로고    scopus 로고
    • A fragment-decoding plus missing-data imputation ASR system evaluated on the 2nd chime challenge
    • Vancouver, Canada, June
    • N. Ma and J. Barker, A fragment-decoding plus missing-data imputation ASR system evaluated on the 2nd CHiME challenge, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 53-58.
    • (2013) Proc. CHiME-2013 , pp. 53-58
    • Ma, N.1    Barker, J.2
  • 11
    • 84893705681 scopus 로고    scopus 로고
    • Binaural signal processing for enhanced speech recognition robustness in complex listening environments
    • Vancouver, Canada, June
    • H. Meutzner, A. Schlesinger, S. Zeiler, and D. Kolossa, Binaural signal processing for enhanced speech recognition robustness in complex listening environments, in Proc. CHiME- 2013, Vancouver, Canada, June 2013, pp. 7-12.
    • (2013) Proc. CHiME- 2013 , pp. 7-12
    • Meutzner, H.1    Schlesinger, A.2    Zeiler, S.3    Kolossa, D.4
  • 12
    • 84893670015 scopus 로고    scopus 로고
    • Noise robust distant automatic speech recognition utilizing nmf based source separation and auditory feature extraction
    • Vancouver, Canada, June
    • N. Moritz, M. R. Schadler, K. Adiloglu, B. T. Meyer, T. Jurgens, T. Gerkmann, B. Kollmeier, S. Doclo, and S. Goetze, Noise robust distant automatic speech recognition utilizing NMF based source separation and auditory feature extraction, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 1-6.
    • (2013) Proc. CHiME-2013 , pp. 1-6
    • Moritz, N.1    Schadler, M.R.2    Adiloglu, K.3    Meyer, B.T.4    Jurgens, T.5    Gerkmann, T.6    Kollmeier, B.7    Doclo, S.8    Goetze, S.9
  • 13
    • 84976225941 scopus 로고    scopus 로고
    • The 2nd 'CHIME' speech separation and recognition challenge: Approaches on single-channel source separation and model-driven speech enhancement
    • Vancouver, Canada, June
    • P. Mowlaee, J. A. Morales-Cordovilla, F. Pernkopf, H. Pessentheiner, M. Hagmuller, and G. Kubin, The 2nd 'CHIME' speech separation and recognition challenge: Approaches on single-channel source separation and model-driven speech enhancement, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 59-64.
    • (2013) Proc. CHiME-2013 , pp. 59-64
    • Mowlaee, P.1    Morales-Cordovilla, J.A.2    Pernkopf, F.3    Pessentheiner, H.4    Hagmuller, M.5    Kubin, G.6
  • 14
    • 84893685019 scopus 로고    scopus 로고
    • A flexible spatial blind source extraction framework for robust speech recognition in noisy environments
    • Vancouver, Canada, June
    • F. Nesta, M. Matassoni, and R. F. Astudillo, A flexible spatial blind source extraction framework for robust speech recognition in noisy environments, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 33-38.
    • (2013) Proc. CHiME-2013 , pp. 33-38
    • Nesta, F.1    Matassoni, M.2    Astudillo, R.F.3
  • 15
    • 84893696094 scopus 로고    scopus 로고
    • Fusion of acoustic, perceptual and production features for robust speech recognition in highly non-stationary noise
    • Vancouver, Canada, June
    • G. Sivaraman, V. Mitra, and C. Y. Espy-Wilson, Fusion of acoustic, perceptual and production features for robust speech recognition in highly non-stationary noise, in Proc. CHiME- 2013, Vancouver, Canada, June 2013, pp. 65-70.
    • (2013) Proc. CHiME- 2013 , pp. 65-70
    • Sivaraman, G.1    Mitra, V.2    Espy-Wilson, C.Y.3
  • 16
    • 84893674217 scopus 로고    scopus 로고
    • Employing stochastic constrained LMS algorithm for ASR frontend processing
    • Vancouver, Canada, June
    • M. Stadtschnitzer, D. Stein, and R. Bardeli, Employing stochastic constrained LMS algorithm for ASR frontend processing, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 71-72.
    • (2013) Proc. CHiME-2013 , pp. 71-72
    • Stadtschnitzer, M.1    Stein, D.2    Bardeli, R.3
  • 17
    • 84893671946 scopus 로고    scopus 로고
    • Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark
    • Vancouver, Canada, June
    • Y. Tachioka, S.Watanabe, J. L. Roux, and J. R. Hershey, Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 19-24.
    • (2013) Proc. CHiME-2013 , pp. 19-24
    • Tachioka, Y.1    Watanabe, S.2    Roux, J.L.3    Hershey, J.R.4
  • 18
    • 84893667550 scopus 로고    scopus 로고
    • Using full-rank spatial covariance models for noise-robust ASR
    • Vancouver, Canada, June
    • D. T. Tran, E. Vincent, D. Jouvet, and K. Adiloglu, Using full-rank spatial covariance models for noise-robust ASR, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 31- 32.
    • (2013) Proc. CHiME-2013 , pp. 31-32
    • Tran, D.T.1    Vincent, E.2    Jouvet, D.3    Adiloglu, K.4
  • 19
    • 84893638382 scopus 로고    scopus 로고
    • Noise-robust automatic speech recognition with exemplar-based sparse representations using multiple length adaptive dictionaries
    • Vancouver, Canada, June
    • E. Yilmaz, J. F. Gemmeke, and H. Van hamme, Noise-robust automatic speech recognition with exemplar-based sparse representations using multiple length adaptive dictionaries, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 39- 43.
    • (2013) Proc. CHiME-2013 , pp. 39-43
    • Yilmaz, E.1    Gemmeke, J.F.2    Van Hamme, H.3
  • 20
    • 84890541701 scopus 로고    scopus 로고
    • The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines
    • Vancouver, Canada, May, IEEE
    • E. Vincent, J. Barker, S. Watanabe, J. L. Roux, F. Nesta, and M. Matassoni, The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines, in Proc. ICASSP 2013, Vancouver, Canada, May 2013, IEEE.
    • (2013) Proc. ICASSP 2013
    • Vincent, E.1    Barker, J.2    Watanabe, S.3    Roux, J.L.4    Nesta, F.5    Matassoni, M.6
  • 21
    • 33750368310 scopus 로고    scopus 로고
    • An audio-visual corpus for speech perception and automatic speech recognition
    • DOI 10.1121/1.2229005
    • M. P. Cooke, J. Barker, S. P. Cunningham, and X. Shao, An audio-visual corpus for speech perception and automatic speech recognition, Journal of the Acoustical Society of America, vol. 120, pp. 2421-2424, 2006. (Pubitemid 44631681)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.5 , pp. 2421-2424
    • Cooke, M.1    Barker, J.2    Cunningham, S.3    Shao, X.4
  • 23
    • 51449115975 scopus 로고    scopus 로고
    • Baseline WSJ acoustic models for HTK and sphinx: Training recipes and recognition experiments
    • University of Cambridge
    • K. Vertanen, Baseline WSJ acoustic models for HTK and Sphinx: Training recipes and recognition experiments, Tech. Rep., Cavendish Laboratory, University of Cambridge, 2006.
    • (2006) Tech. Rep., Cavendish Laboratory
    • Vertanen, K.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.