SCOPUS Information Retrieval Platform

2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Volume , Issue , 2016, Pages 547-554

The Automatic Speech recognition in Reverberant Environments (ASpIRE) challenge

(1) Harper, Mary a

a IARPA ^* (United States)

Author keywords

mismatch; reverberation; robustness; speech recognition

Indexed keywords

REVERBERATION; ROBUSTNESS (CONTROL SYSTEMS); SPEECH;

ACOUSTIC ENVIRONMENT; AUTOMATIC SPEECH; AUTOMATIC SPEECH RECOGNITION; AUTOMATIC SPEECH RECOGNITION SYSTEM; MATCHED TRAININGS; MISMATCH; PERFORMANCE LEVEL; REVERBERANT ENVIRONMENT;

SPEECH RECOGNITION;

EID: 84964515036 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2015.7404843 Document Type: Conference Paper

Times cited : (87)

References (35)

1
- 84964501966
- [Accessed: September 19, 2014]
- NIST, "Automatic Speech Recognition Evaluations at NIST," 2009. Available: http://itl.nist.gov/iad/mig/publications/ASRhistory/index.html [Accessed: September 19, 2014]
- (2009) Automatic Speech Recognition Evaluations at NIST

2
- 85032751613
- Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition
- T. Yoshioka, A. Sehr, M. Delcroix, K. Kinoshita, R. Maas, T. Nakatani, and W. Kellermann, "Making Machines Understand Us in Reverberant Rooms: Robustness against Reverberation for Automatic Speech Recognition," IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 114-126, 2012
- (2012) IEEE Signal Processing Magazine , vol.29 , Issue.6 , pp. 114-126
- Yoshioka, T.¹ Sehr, A.² Delcroix, M.³ Kinoshita, K.⁴ Maas, R.⁵ Nakatani, T.⁶ Kellermann, W.⁷

3
- 0141814662
- The ICSI meeting corpus
- A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters, "The ICSI Meeting Corpus," in Proceedings of ICASSP, 2003
- (2003) Proceedings of ICASSP
- Janin, A.¹ Baron, D.² Edwards, J.³ Ellis, D.⁴ Gelbart, D.⁵ Morgan, N.⁶ Peskin, B.⁷ Pfau, T.⁸ Shriberg, E.⁹ Stolcke, A.¹⁰ Wooters, C.¹¹

4
- 84890465724
- The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions
- S. H. K. Parthasarathi, S. Y. Chang, J. Cohen, N. Morgan, and S. Wegmann, "The Blame Game in Meeting Room ASR: An Analysis of Feature Versus Model Errors in Noisy and Mismatched Conditions," in Proceedings of ICASSP, 2013
- (2013) Proceedings of ICASSP
- Parthasarathi, S.H.K.¹ Chang, S.Y.² Cohen, J.³ Morgan, N.⁴ Wegmann, S.⁵

5
- 0002344794
- Bootstrap methods: Another look at the jackknife
- B. Efron, "Bootstrap Methods: Another Look at the Jackknife," Annals of Statistics, vol. 7, no. 1, pp. 1-26, 1979
- (1979) Annals of Statistics , vol.7 , Issue.1 , pp. 1-26
- Efron, B.¹

6
- 34547548247
- The AMI system for the transcription of speech in meetings
- T. Hain, V. Wan, L. Burget, M. Karafiat, J. Dines, J. Vepa, G. Garau, and M. Lincoln, "The AMI System for the Transcription of Speech in Meetings," in Proceedings of ICASSP, 2007
- (2007) Proceedings of ICASSP
- Hain, T.¹ Wan, V.² Burget, L.³ Karafiat, M.⁴ Dines, J.⁵ Vepa, J.⁶ Garau, G.⁷ Lincoln, M.⁸

7
- 33846217002
- The multi-channel Wall Street Journal audio visual corpus (MC-WSJ-AV): Specification and initial experiments
- M. Lincoln, I. McCowan, J. Vepa, and H. K. Maganti, "The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments," in Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, 2005
- (2005) Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding
- Lincoln, M.¹ McCowan, I.² Vepa, J.³ Maganti, H.K.⁴

8
- 84964509667
- Multi-channel WSJ audio
- Philadelphia: Linguistic Data Consortium
- M. Lincoln, E. Zwyssig, and I. McCowan, "Multi-Channel WSJ Audio," LDC2014S03, Philadelphia: Linguistic Data Consortium, 2014
- (2014) LDC2014S03
- Lincoln, M.¹ Zwyssig, E.² McCowan, I.³

9
- 84878543263
- The PASCAL CHiME speech separation and recognition challenge
- J. Barker, E. Vincent, N. Ma, H. Christensen, and P. Green, "The PASCAL CHiME Speech Separation and Recognition Challenge," Computer Speech and Language, vol. 27, no. 3, pp. 621-633, 2013
- (2013) Computer Speech and Language , vol.27 , Issue.3 , pp. 621-633
- Barker, J.¹ Vincent, E.² Ma, N.³ Christensen, H.⁴ Green, P.⁵

10
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- M. P. Cooke, J. Barker, S. P. Cunningham, and X. Shao, "An Audio-Visual Corpus for Speech Perception and Automatic Speech Recognition," Journal of the Acoustical Society of America, vol. 120, pp. 2421-2424, 2006
- (2006) Journal of the Acoustical Society of America , vol.120 , pp. 2421-2424
- Cooke, M.P.¹ Barker, J.² Cunningham, S.P.³ Shao, X.⁴

11
- 84890541701
- The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines
- E. Vincent, J. Barker, S. Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The Second 'CHiME' Speech Separation and Recognition Challenge: Datasets, Tasks and Baselines," in Proceedings of ICASSP, 2013
- (2013) Proceedings of ICASSP
- Vincent, E.¹ Barker, J.² Watanabe, S.³ Le Roux, J.⁴ Nesta, F.⁵ Matassoni, M.⁶

12
- 84893704157
- The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes
- E. Vincent, J. Barker, S. Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The Second 'CHiME' Speech Separation and Recognition Challenge: An Overview of Challenge Systems and Outcomes," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2013
- (2013) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Vincent, E.¹ Barker, J.² Watanabe, S.³ Le Roux, J.⁴ Nesta, F.⁵ Matassoni, M.⁶

13
- 84893564226
- CSR-I (WSJ0) complete
- Garofalo, J., Graff, D., Paul, D., and Pallett, D., "CSR-I (WSJ0) Complete," Linguistic Data Consortium, 2007
- (2007) Linguistic Data Consortium
- Garofalo, J.¹ Graff, D.² Paul, D.³ Pallett, D.⁴

14
- 84964452021
- The third 'CHiME' speech separation and recognition challenge: Dataset, task and baselines
- J. Barker, R. Marxer, E. Vincent, and S. Watanabe, "The Third 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2015
- (2015) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Barker, J.¹ Marxer, R.² Vincent, E.³ Watanabe, S.⁴

15
- 84893622444
- The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech
- K. Kinoshita, M. Delcroix, T. Yoshioka, T. Nakatani, A. Sehr, W. Kellermann, and R. Maas, "The Reverb Challenge: A Common Evaluation Framework for Dereverberation and Recognition of Reverberant Speech," in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013
- (2013) Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
- Kinoshita, K.¹ Delcroix, M.² Yoshioka, T.³ Nakatani, T.⁴ Sehr, A.⁵ Kellermann, W.⁶ Maas, R.⁷

16
- 0028996854
- WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition
- T. Robinson, J. Fransen, D. Pye, J. Foote, and S. Renals, "WSJCAM0: A British English Speech Corpus for Large Vocabulary Continuous Speech Recognition," in Proceedings of ICASSP, 1995
- (1995) Proceedings of ICASSP
- Robinson, T.¹ Fransen, J.² Pye, D.³ Foote, J.⁴ Renals, S.⁵

17
- 84959118000
- The Fisher corpus, a resource for the next generations of speech-to-text
- C. Cieri, D. Miller, and K. Walker, "The Fisher Corpus, A Resource for the Next Generations of Speech-to-Text," in Proceedings 4th International Conference on Language Resources and Evaluation, 2004
- (2004) Proceedings 4th International Conference on Language Resources and Evaluation
- Cieri, C.¹ Miller, D.² Walker, K.³

18
- 84906273235
- Mixer 6
- L. Brandschain, D. Graff, C. Cieri, K. Walker, C. Caruso, and A. Neely, "Mixer 6," in Proceedings of the Seventh International Conference on Language Resources and Evaluation, 2010
- (2010) Proceedings of the Seventh International Conference on Language Resources and Evaluation
- Brandschain, L.¹ Graff, D.² Cieri, C.³ Walker, K.⁴ Caruso, C.⁵ Neely, A.⁶

19
- 84964509684
- [Accessed: September 19, 2014]
- Linguistic Data Consortium, "Linguistic Data Consortium Transcription Guidelines (NQTR)," 2006 [online]. Available: https://catalog.ldc.upenn.edu/docs/LDC2010S01/trans-guide-nqrt-span.doc [Accessed: September 19, 2014]
- (2006) Linguistic Data Consortium Transcription Guidelines (NQTR)

20
- 84964471999
- IARPA [Accessed: September 10, 2015]
- IARPA, "IARPA Announces Winners of its ASpIRE Challenge," 2015 [online]. Available: http://www.dni.gov/index.php/newsroom/press-releases/210-press-releases-2015/1252-iarpa-announces-winners-of-its-aspire-challenge [Accessed: September 10, 2015]
- (2015) IARPA Announces Winners of Its ASpIRE Challenge

21
- 84964470918
- Robust speech recognition in unknown reverberant and noisy conditions
- R. Hsiao, J. Ma, W. Hartmann, M. Karafiat, F. Grezl, L. Burget, I. Szoke, J. H. Cernocky, S. Watanabe, Z. Chen, S. H. Mallidi, H. Hermansky, S. Tsakalidis, and R. Schwartz, "Robust Speech Recognition in Unknown Reverberant and Noisy Conditions," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2015
- (2015) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Hsiao, R.¹ Ma, J.² Hartmann, W.³ Karafiat, M.⁴ Grezl, F.⁵ Burget, L.⁶ Szoke, I.⁷ Cernocky, J.H.⁸ Watanabe, S.⁹ Chen, Z.¹⁰ Mallidi, S.H.¹¹ Hermansky, H.¹² Tsakalidis, S.¹³ Schwartz, R.¹⁴

22
- 0030638031
- A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
- J. G. Fiscus, "A Post-processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction (ROVER)," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 1997
- (1997) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Fiscus, J.G.¹

23
- 84964483822
- JHU ASpIRE system: Robust LVCSR with TDNNs, i-vector adaptation, and RNN-LMs
- V. Peddinti, G. Chen, V. Manohar, T. Ko, D. Povey, and S. Khudanpur, "JHU ASpIRE system: Robust LVCSR with TDNNs, i-vector Adaptation, and RNN-LMs," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2015
- (2015) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Peddinti, V.¹ Chen, G.² Manohar, V.³ Ko, T.⁴ Povey, D.⁵ Khudanpur, S.⁶

24
- 84959115289
- A time delay neural network architecture for efficient modeling of long temporal contexts
- V. Peddinti, D. Povey, and S. Khudanpur, "A Time Delay Neural Network Architecture for Efficient Modeling of Long Temporal Contexts," in Proceedings of Interspeech, 2015
- (2015) Proceedings of Interspeech
- Peddinti, V.¹ Povey, D.² Khudanpur, S.³

25
- 84905259145
- I-vector based speaker adaptation of deep neural networks for French broadcast audio transcription
- V. Gupta, P. Kenny, P. Ouellet, and T. Stafylakis, "I-vector Based Speaker Adaptation of Deep Neural Networks for French Broadcast Audio Transcription," in Proceedings of ICASSP, 2014
- (2014) Proceedings of ICASSP
- Gupta, V.¹ Kenny, P.² Ouellet, P.³ Stafylakis, T.⁴

26
- 84964456696
- Single and multi-channel approaches for distant speech recognition under noisy reverberant conditions: I2R's system description for the ASpIRE challenge
- J. Dennis and H. D. Tran, "Single and Multi-channel Approaches for Distant Speech Recognition under Noisy Reverberant Conditions: I2R's System Description for the ASpIRE Challenge," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2015
- (2015) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Dennis, J.¹ Tran, H.D.²

27
- 0036299273
- Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio
- X. Sun, "Pitch Determination and Voice Quality Analysis Using Subharmonic-to-Harmonic Ratio," in Proceedings of ICASSP, 2002
- (2002) Proceedings of ICASSP
- Sun, X.¹

28
- 0034857681
- Speech dereverberation via maximum-kurtosis subband adaptive filtering
- B. W. Gillespie, H. S. Malvar, and D. Florêncio, "Speech Dereverberation via Maximum-kurtosis Subband Adaptive Filtering," in Proceedings of ICASSP, 2001
- (2001) Proceedings of ICASSP
- Gillespie, B.W.¹ Malvar, H.S.² Florêncio, D.³

29
- 84890521103
- Speaker adaptation of context dependent deep neural networks
- H. Liao, "Speaker Adaptation of Context Dependent Deep Neural Networks," in Proceedings of ICASSP, 2013
- (2013) Proceedings of ICASSP
- Liao, H.¹

30
- 84964466667
- Improving robustness against reverberation for automatic speech recognition
- V. Mitra, J. Van Hout, W. Wang, M. Graciarena, M. McLaren, H. Franco, and D. Vergyri, "Improving Robustness Against Reverberation For Automatic Speech Recognition," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2015
- (2015) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Mitra, V.¹ Van Hout, J.² Wang, W.³ Graciarena, M.⁴ McLaren, M.⁵ Franco, H.⁶ Vergyri, D.⁷

31
- 84946693063
- Deep convolutional nets and robust features for reverberation-robust speech recognition
- V. Mitra, W. Wang, and H. Franco, "Deep Convolutional Nets and Robust Features for Reverberation-robust Speech Recognition," in Proceedings of SLT, 2014
- (2014) Proceedings of SLT
- Mitra, V.¹ Wang, W.² Franco, H.³

32
- 84910075252
- Evaluating robust features on deep neural networks for speech recognition in noisy and channel mismatched conditions
- V. Mitra, W. Wang, H. Franco, Y. Lei, C. Bartels, and M. Graciarena, "Evaluating Robust Features on Deep Neural Networks for Speech Recognition in Noisy and Channel Mismatched Conditions," in Proceedings of Interspeech, 2014
- (2014) Proceedings of Interspeech
- Mitra, V.¹ Wang, W.² Franco, H.³ Lei, Y.⁴ Bartels, C.⁵ Graciarena, M.⁶

33
- 80051639873
- Gammatone subband magnitude-domain dereverberation for ASR
- K. Kumar, R. Singh, B. Raj, and R. Stern, "Gammatone Subband Magnitude-domain Dereverberation for ASR," in Proceedings of ICASSP, 2011
- (2011) Proceedings of ICASSP
- Kumar, K.¹ Singh, R.² Raj, B.³ Stern, R.⁴

34
- 84964476018
- Analysis of factors affecting system performance in the ASpIRE challenge
- J. Melot, N. Malyska, J. Ray, and W. Shen, "Analysis of Factors Affecting System Performance in the ASpIRE Challenge," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2015
- (2015) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop
- Melot, J.¹ Malyska, N.² Ray, J.³ Shen, W.⁴

35
- 50449086237
- Acoustic beamforming for speaker diarization of meetings
- X. Anguera, C. Wooters, and J. Hernando, "Acoustic Beamforming for Speaker Diarization of Meetings," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, pp. 2011-2023, 2007
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 2011-2023
- Anguera, X.¹ Wooters, C.² Hernando, J.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.