SCOPUS 정보 검색 플랫폼

2013 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2013 - Proceedings

Volumn , Issue , 2013, Pages 162-167

The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes

(6) Vincent, Emmanuel a Barker, Jon b Watanabe, Shinji c Le Roux, Jonathan c Nesta, Francesco d Matassoni, Marco e

a INRIA (France)

b UNIVERSITY OF SHEFFIELD (United Kingdom)

c MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

d Rockwell Semiconductor Systems (United States)

e ITC IRST (Italy)

Author keywords

'CHiME' Challenge; Noise robust ASR

Indexed keywords

'CHIME' CHALLENGE; AUTOMATIC SPEECH RECOGNITION; BASELINE SYSTEMS; DOMESTIC ENVIRONMENTS; FUTURE CHALLENGES; NOISE-ROBUST ASR; SPEECH SEPARATION; SYSTEM COMBINATION;

EID: 84893704157 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2013.6707723 Document Type: Conference Paper

Times cited : (73)

References (24)

1
- 50949092983
- Eds., Springer
- S. Makino, T.-W. Lee, and H. Sawada, Eds., Blind speech separation, Springer, 2007.
- (2007) Blind Speech Separation
- Makino, S.¹ Lee, T.-W.² Sawada, H.³

2
- 50449083999
- Wiley
- M. Wolfel and J. McDonough, Distant Speech Recognition, Wiley, 2009.
- (2009) Distant Speech Recognition
- Wolfel, M.¹ McDonough, J.²

3
- 84891583985
- Eds., Wiley
- T. Virtanen, R. Singh, and B. Raj, Eds., Techniques for Noise Robustness in Automatic Speech Recognition, Wiley, 2012.
- (2012) Techniques for Noise Robustness in Automatic Speech Recognition
- Virtanen, T.¹ Singh, R.² Raj, B.³

4
- 84893690446
- Eds
- J. Barker and E. Vincent, Eds., Computer Speech and Language, vol. 27, 2013.
- (2013) Computer Speech and Language , vol.27
- Barker, J.¹ Vincent, E.²

5
- 84873898784
- Speech recognition in the presence of highly non-stationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation
- Florence, Italy, Sept
- M. Delcroix, K. Kinoshita, T. Nakatani, S. Araki, A. Ogawa, T. Hori, S. Watanabe, M. Fujimoto, T. Yoshioka, T. Oba, Y. Kubo, M. Souden, S.-J. Hahm, and A. Nakamura, Speech recognition in the presence of highly non-stationary noise based on spatial, spectral and temporal speech/noise modeling combined with dynamic variance adaptation, in Proc. CHiME-2011, Florence, Italy, Sept. 2011, pp. 12-17.
- (2011) Proc. CHiME-2011 , pp. 12-17
- Delcroix, M.¹ Kinoshita, K.² Nakatani, T.³ Araki, S.⁴ Ogawa, A.⁵ Hori, T.⁶ Watanabe, S.⁷ Fujimoto, M.⁸ Yoshioka, T.⁹ Oba, T.¹⁰ Kubo, Y.¹¹ Souden, M.¹² Hahm, S.-J.¹³ Nakamura, A.¹⁴

6
- 84878543263
- The PASCAL CHIME speech separation and recognition challenge
- J. Barker, E. Vincent, N. Ma, H. Christensen, and P. Green, The PASCAL CHiME speech separation and recognition challenge, Computer Speech and Language, vol. 27, no. 3, pp. 621-633, 2013.
- (2013) Computer Speech and Language , vol.27 , Issue.3 , pp. 621-633
- Barker, J.¹ Vincent, E.² Ma, N.³ Christensen, H.⁴ Green, P.⁵

7
- 84893675434
- The TUM+TUT+KUL approach to the 2nd chime challenge: Multi-stream ASR exploiting BLSTM networks and sparse NMF
- Vancouver, Canada, June
- J. T. Geiger, F. Weninger, A. Hurmalainen, J. F. Gemmeke, M. Wollmer, B. Schuller, G. Rigoll, and T. Virtanen, The TUM+TUT+KUL approach to the 2nd CHiME challenge: Multi-stream ASR exploiting BLSTM networks and sparse NMF, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 25-30.
- (2013) Proc. CHiME-2013 , pp. 25-30
- Geiger, J.T.¹ Weninger, F.² Hurmalainen, A.³ Gemmeke, J.F.⁴ Wollmer, M.⁵ Schuller, B.⁶ Rigoll, G.⁷ Virtanen, T.⁸

8
- 84893694758
- HMMregularization for NMF-based noise robust ASR
- Vancouver, Canada, June
- J. F. Gemmeke, A. Hurmalainen, and T. Virtanen, HMMregularization for NMF-based noise robust ASR, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 47- 52.
- (2013) Proc. CHiME-2013 , pp. 47-52
- Gemmeke, J.F.¹ Hurmalainen, A.² Virtanen, T.³

9
- 84893652593
- Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech
- Vancouver, Canada, June
- A. Hurmalainen, J. F. Gemmeke, and T. Virtanen, Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech, in Proc. CHiME- 2013, Vancouver, Canada, June 2013, pp. 13-18.
- (2013) Proc. CHiME- 2013 , pp. 13-18
- Hurmalainen, A.¹ Gemmeke, J.F.² Virtanen, T.³

10
- 84893698854
- A fragment-decoding plus missing-data imputation ASR system evaluated on the 2nd chime challenge
- Vancouver, Canada, June
- N. Ma and J. Barker, A fragment-decoding plus missing-data imputation ASR system evaluated on the 2nd CHiME challenge, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 53-58.
- (2013) Proc. CHiME-2013 , pp. 53-58
- Ma, N.¹ Barker, J.²

11
- 84893705681
- Binaural signal processing for enhanced speech recognition robustness in complex listening environments
- Vancouver, Canada, June
- H. Meutzner, A. Schlesinger, S. Zeiler, and D. Kolossa, Binaural signal processing for enhanced speech recognition robustness in complex listening environments, in Proc. CHiME- 2013, Vancouver, Canada, June 2013, pp. 7-12.
- (2013) Proc. CHiME- 2013 , pp. 7-12
- Meutzner, H.¹ Schlesinger, A.² Zeiler, S.³ Kolossa, D.⁴

12
- 84893670015
- Noise robust distant automatic speech recognition utilizing nmf based source separation and auditory feature extraction
- Vancouver, Canada, June
- N. Moritz, M. R. Schadler, K. Adiloglu, B. T. Meyer, T. Jurgens, T. Gerkmann, B. Kollmeier, S. Doclo, and S. Goetze, Noise robust distant automatic speech recognition utilizing NMF based source separation and auditory feature extraction, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 1-6.
- (2013) Proc. CHiME-2013 , pp. 1-6
- Moritz, N.¹ Schadler, M.R.² Adiloglu, K.³ Meyer, B.T.⁴ Jurgens, T.⁵ Gerkmann, T.⁶ Kollmeier, B.⁷ Doclo, S.⁸ Goetze, S.⁹

13
- 84976225941
- The 2nd 'CHIME' speech separation and recognition challenge: Approaches on single-channel source separation and model-driven speech enhancement
- Vancouver, Canada, June
- P. Mowlaee, J. A. Morales-Cordovilla, F. Pernkopf, H. Pessentheiner, M. Hagmuller, and G. Kubin, The 2nd 'CHIME' speech separation and recognition challenge: Approaches on single-channel source separation and model-driven speech enhancement, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 59-64.
- (2013) Proc. CHiME-2013 , pp. 59-64
- Mowlaee, P.¹ Morales-Cordovilla, J.A.² Pernkopf, F.³ Pessentheiner, H.⁴ Hagmuller, M.⁵ Kubin, G.⁶

14
- 84893685019
- A flexible spatial blind source extraction framework for robust speech recognition in noisy environments
- Vancouver, Canada, June
- F. Nesta, M. Matassoni, and R. F. Astudillo, A flexible spatial blind source extraction framework for robust speech recognition in noisy environments, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 33-38.
- (2013) Proc. CHiME-2013 , pp. 33-38
- Nesta, F.¹ Matassoni, M.² Astudillo, R.F.³

15
- 84893696094
- Fusion of acoustic, perceptual and production features for robust speech recognition in highly non-stationary noise
- Vancouver, Canada, June
- G. Sivaraman, V. Mitra, and C. Y. Espy-Wilson, Fusion of acoustic, perceptual and production features for robust speech recognition in highly non-stationary noise, in Proc. CHiME- 2013, Vancouver, Canada, June 2013, pp. 65-70.
- (2013) Proc. CHiME- 2013 , pp. 65-70
- Sivaraman, G.¹ Mitra, V.² Espy-Wilson, C.Y.³

16
- 84893674217
- Employing stochastic constrained LMS algorithm for ASR frontend processing
- Vancouver, Canada, June
- M. Stadtschnitzer, D. Stein, and R. Bardeli, Employing stochastic constrained LMS algorithm for ASR frontend processing, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 71-72.
- (2013) Proc. CHiME-2013 , pp. 71-72
- Stadtschnitzer, M.¹ Stein, D.² Bardeli, R.³

17
- 84893671946
- Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark
- Vancouver, Canada, June
- Y. Tachioka, S.Watanabe, J. L. Roux, and J. R. Hershey, Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 19-24.
- (2013) Proc. CHiME-2013 , pp. 19-24
- Tachioka, Y.¹ Watanabe, S.² Roux, J.L.³ Hershey, J.R.⁴

18
- 84893667550
- Using full-rank spatial covariance models for noise-robust ASR
- Vancouver, Canada, June
- D. T. Tran, E. Vincent, D. Jouvet, and K. Adiloglu, Using full-rank spatial covariance models for noise-robust ASR, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 31- 32.
- (2013) Proc. CHiME-2013 , pp. 31-32
- Tran, D.T.¹ Vincent, E.² Jouvet, D.³ Adiloglu, K.⁴

19
- 84893638382
- Noise-robust automatic speech recognition with exemplar-based sparse representations using multiple length adaptive dictionaries
- Vancouver, Canada, June
- E. Yilmaz, J. F. Gemmeke, and H. Van hamme, Noise-robust automatic speech recognition with exemplar-based sparse representations using multiple length adaptive dictionaries, in Proc. CHiME-2013, Vancouver, Canada, June 2013, pp. 39- 43.
- (2013) Proc. CHiME-2013 , pp. 39-43
- Yilmaz, E.¹ Gemmeke, J.F.² Van Hamme, H.³

20
- 84890541701
- The second 'chime' speech separation and recognition challenge: Datasets, tasks and baselines
- Vancouver, Canada, May, IEEE
- E. Vincent, J. Barker, S. Watanabe, J. L. Roux, F. Nesta, and M. Matassoni, The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines, in Proc. ICASSP 2013, Vancouver, Canada, May 2013, IEEE.
- (2013) Proc. ICASSP 2013
- Vincent, E.¹ Barker, J.² Watanabe, S.³ Roux, J.L.⁴ Nesta, F.⁵ Matassoni, M.⁶

21
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- DOI 10.1121/1.2229005
- M. P. Cooke, J. Barker, S. P. Cunningham, and X. Shao, An audio-visual corpus for speech perception and automatic speech recognition, Journal of the Acoustical Society of America, vol. 120, pp. 2421-2424, 2006. (Pubitemid 44631681)
- (2006) Journal of the Acoustical Society of America , vol.120 , Issue.5 , pp. 2421-2424
- Cooke, M.¹ Barker, J.² Cunningham, S.³ Shao, X.⁴

22
- 84893564226
- CSR-I (WSJ0) complete
- Philadelphia
- J. Garofalo, D. Graff, D. Paul, and D. Pallett, CSR-I (WSJ0) Complete, Linguistic Data Consortium, Philadelphia, 2007.
- (2007) Linguistic Data Consortium
- Garofalo, J.¹ Graff, D.² Paul, D.³ Pallett, D.⁴

23
- 51449115975
- Baseline WSJ acoustic models for HTK and sphinx: Training recipes and recognition experiments
- University of Cambridge
- K. Vertanen, Baseline WSJ acoustic models for HTK and Sphinx: Training recipes and recognition experiments, Tech. Rep., Cavendish Laboratory, University of Cambridge, 2006.
- (2006) Tech. Rep., Cavendish Laboratory
- Vertanen, K.¹

24
- 70349227800
- Analysis of confusion matrix to combine evidence for phoneme recognition
- S. R. M. Prasanna, B. Yegnanarayana, J. P. Pinto, and H. Hermansky, Analysis of confusion matrix to combine evidence for phoneme recognition, Tech. Rep. RR 07-27, IDIAP, 2007.
- (2007) Tech. Rep. RR 07-27, IDIAP
- Prasanna, S.R.M.¹ Yegnanarayana, B.² Pinto, J.P.³ Hermansky, H.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.