메뉴 건너뛰기




Volumn , Issue , 2013, Pages 126-130

The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines

Author keywords

'CHiME' Challenge; Noise robust ASR

Indexed keywords

'CHIME' CHALLENGE; AUTOMATIC SPEECH RECOGNITION; BASE-LINE PERFORMANCE; DOMESTIC ENVIRONMENTS; NOISE-ROBUST ASR; REAL-WORLD; SPEECH SEPARATION;

EID: 84890541701     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6637622     Document Type: Conference Paper
Times cited : (201)

References (29)
  • 5
    • 84987702417 scopus 로고    scopus 로고
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • D. Pearce and H.-G. Hirsch, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ICSLP, 2000, vol. 4, pp. 29-32.
    • (2000) Proc. ICSLP , vol.4 , pp. 29-32
    • Pearce, D.1    Hirsch, H.-G.2
  • 6
    • 40249114843 scopus 로고    scopus 로고
    • To separate speech!: A system for recognizing simultaneous speech
    • J. McDonough, K. Kumatani, T. Gehrig, E. Stoimenov, et al., "To separate speech!: A system for recognizing simultaneous speech," in Proc. MLMI, 2007, pp. 283-294.
    • (2007) Proc. MLMI , pp. 283-294
    • McDonough, J.1    Kumatani, K.2    Gehrig, T.3    Stoimenov, E.4
  • 7
    • 40249098657 scopus 로고    scopus 로고
    • Microphone array beamforming approach to blind speech separation
    • I. Himawan, I. McCowan, and M. Lincoln, "Microphone array beamforming approach to blind speech separation," in Proc. MLMI, 2007, pp. 295-305.
    • (2007) Proc. MLMI , pp. 295-305
    • Himawan, I.1    McCowan, I.2    Lincoln, M.3
  • 8
    • 69249202377 scopus 로고    scopus 로고
    • Monaural speech separation and recognition challenge
    • M. Cooke, J. R. Hershey, and S. J. Rennie, "Monaural Speech Separation and Recognition Challenge," Computer Speech and Language, vol. 24, pp. 94-111, 2010.
    • (2010) Computer Speech and Language , vol.24 , pp. 94-111
    • Cooke, M.1    Hershey, J.R.2    Rennie, S.J.3
  • 9
    • 84858069855 scopus 로고    scopus 로고
    • The signal separation evaluation campaign (2007-2010): Achievements and remaining challenges
    • E. Vincent, S. Araki, F. J. Theis, G. Nolte, et al., "The Signal Separation Evaluation Campaign (2007-2010): Achievements and remaining challenges," Signal Processing, vol. 92, pp. 1928-1936, 2012.
    • (2012) Signal Processing , vol.92 , pp. 1928-1936
    • Vincent, E.1    Araki, S.2    Theis, F.J.3    Nolte, G.4
  • 10
    • 84873898784 scopus 로고    scopus 로고
    • Speech recognition in the presence of highly nonstationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation
    • M. Delcroix, K. Kinoshita, T. Nakatani, S. Araki, et al., "Speech recognition in the presence of highly nonstationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation," in Proc. CHiME, 2011, pp. 12-17.
    • (2011) Proc. CHiME , pp. 12-17
    • Delcroix, M.1    Kinoshita, K.2    Nakatani, T.3    Araki, S.4
  • 11
    • 84890521030 scopus 로고    scopus 로고
    • Exemplar-based speech enhancement and its application to noise-robust automatic speech recognition
    • J. F. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based speech enhancement and its application to noise-robust automatic speech recognition," in Proc. CHiME, 2011, pp. 53-57.
    • (2011) Proc. CHiME , pp. 53-57
    • Gemmeke, J.F.1    Virtanen, T.2    Hurmalainen, A.3
  • 12
  • 13
    • 84890541336 scopus 로고    scopus 로고
    • Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments
    • H. Kallasjoki, S. Keronen, G. J. Brown, J. F. Gemmeke, et al., "Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments," in Proc. CHiME, 2011, pp. 58-63.
    • (2011) Proc. CHiME , pp. 58-63
    • Kallasjoki, H.1    Keronen, S.2    Brown, G.J.3    Gemmeke, J.F.4
  • 14
    • 84865727479 scopus 로고    scopus 로고
    • Zero-crossing based channel attentive weighting of cepstral features for robust speech recognition: The ETRI 2011 CHiME challenge system
    • Y.-I. Kim, H.-Y. Cho, and S.-H. Kim, "Zero-crossingbased channel attentive weighting of cepstral features for robust speech recognition: The ETRI 2011 CHiME challenge system," in Proc. Interspeech, 2011, pp. 1649-1652.
    • (2011) Proc. Interspeech , pp. 1649-1652
    • Kim, Y.-I.1    Cho, H.-Y.2    Kim, S.-H.3
  • 15
    • 84881045722 scopus 로고    scopus 로고
    • CHiME data separation based on target signal cancellation and noise masking
    • Z. Koldovský, J. Málek, J. Nouza, and M. Balík, "CHiME data separation based on target signal cancellation and noise masking," in Proc. CHiME, 2011, pp. 47-50.
    • (2011) Proc. CHiME , pp. 47-50
    • Koldovský, Z.1    Málek, J.2    Nouza, J.3    Balík, M.4
  • 16
    • 84890527807 scopus 로고    scopus 로고
    • CHiME Challenge: Approaches to robustness using beamforming and uncertainty-of-observation techniques
    • D. Kolossa, R. F. Astudillo, A. Abad, S. Zeiler, et al., "CHiME Challenge: Approaches to robustness using beamforming and uncertainty-of- observation techniques," in Proc. CHiME, 2011, pp. 6-11.
    • (2011) Proc. CHiME , pp. 6-11
    • Kolossa, D.1    Astudillo, R.F.2    Abad, A.3    Zeiler, S.4
  • 17
    • 84890522526 scopus 로고    scopus 로고
    • Recent advances in fragment-based speech recognition in reverberant multisource environments
    • N. Ma, J. Barker, H. Christensen, and P. Green, "Recent advances in fragment-based speech recognition in reverberant multisource environments," in Proc. CHiME, 2011, pp. 68-73.
    • (2011) Proc. CHiME , pp. 68-73
    • Ma, N.1    Barker, J.2    Christensen, H.3    Green, P.4
  • 18
    • 84869432703 scopus 로고    scopus 로고
    • A two-channel acoustic front-end for robust automatic speech recognition in noisy and reverberant environments
    • R. Maas, A. Schwarz, Y. Zheng, K. Reindl, et al., "A two-channel acoustic front-end for robust automatic speech recognition in noisy and reverberant environments," in Proc. CHiME, 2011, pp. 41-46.
    • (2011) Proc. CHiME , pp. 41-46
    • Maas, R.1    Schwarz, A.2    Zheng, Y.3    Reindl, K.4
  • 19
    • 84873926851 scopus 로고    scopus 로고
    • Robust automatic speech recognition through on-line semi blind source extraction
    • F. Nesta and M. Matassoni, "Robust automatic speech recognition through on-line semi blind source extraction," in Proc. CHiME, 2011, pp. 18-23.
    • (2011) Proc. CHiME , pp. 18-23
    • Nesta, F.1    Matassoni, M.2
  • 20
    • 84866037355 scopus 로고    scopus 로고
    • Using the FASST source separation toolbox for noise robust speech recognition
    • A. Ozerov and E. Vincent, "Using the FASST source separation toolbox for noise robust speech recognition," in Proc. CHiME, 2011, pp. 86-87.
    • (2011) Proc. CHiME , pp. 86-87
    • Ozerov, A.1    Vincent, E.2
  • 21
    • 84869834487 scopus 로고    scopus 로고
    • Robust speech recognition in multi-source noise environments using convolutive non-negative matrix factorization
    • R. Vipperla, S. Bozonnet, D. Wang, and N. Evans, "Robust speech recognition in multi-source noise environments using convolutive non-negative matrix factorization," in Proc. CHiME, 2011, pp. 74-79.
    • (2011) Proc. CHiME , pp. 74-79
    • Vipperla, R.1    Bozonnet, S.2    Wang, D.3    Evans, N.4
  • 22
    • 84857258863 scopus 로고    scopus 로고
    • The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments
    • F. Weninger, J. Geiger, M. Wöllmer, B. Schuller, and G. Rigoll, "The Munich 2011 CHiME challenge contribution: NMF-BLSTM speech enhancement and recognition for reverberated multisource environments," in Proc. CHiME, 2011, pp. 24-29.
    • (2011) Proc. CHiME , pp. 24-29
    • Weninger, F.1    Geiger, J.2    Wöllmer, M.3    Schuller, B.4    Rigoll, G.5
  • 24
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," Journal of the Acoustical Society of America, vol. 65, no. 4, pp. 943-950, 1979.
    • (1979) Journal of the Acoustical Society of America , vol.65 , Issue.4 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 25
    • 33749048958 scopus 로고    scopus 로고
    • Headrelated transfer function filter interpolation by root displacement
    • H. Hacihabiboǧlu, B. Günel, and A.M. Kondoz, "Headrelated transfer function filter interpolation by root displacement," in Proc. WASPAA, 2005, pp. 134-137.
    • (2005) Proc. WASPAA , pp. 134-137
    • Hacihabiboǧlu, H.1    Günel, B.2    Kondoz, A.M.3
  • 28
    • 84890503970 scopus 로고    scopus 로고
    • Effectiveness of discriminative training and feature transformation for reverberated and noisy speech
    • Y. Tachioka, S. Watanabe, and J. R. Hershey, "Effectiveness of discriminative training and feature transformation for reverberated and noisy speech," in Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Tachioka, Y.1    Watanabe, S.2    Hershey, J.R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.