SCOPUS 정보 검색 플랫폼

2015 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015 - Proceedings

Volumn , Issue , 2016, Pages 518-524

Single and multi-channel approaches for distant speech recognition under noisy reverberant conditions: I2R'S system description for the ASpIRE challenge

(2) Dennis, Jonathan a Dat, Tran Huy a

a INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

ASpIRE Challenge; beamforming; distant speech recognition; mismatched conditions; reverberation

Indexed keywords

BEAMFORMING; DECODING; MICROPHONES; REVERBERATION; SPEECH;

ASPIRE CHALLENGE; AUTOMATIC SPEECH RECOGNITION; DISTANT SPEECH RECOGNITION; DISTRIBUTED BEAM-FORMING; MISMATCHED CONDITIONS; REVERBERANT ENVIRONMENT; SPEECH DEREVERBERATION; VOICE ACTIVITY DETECTION;

SPEECH RECOGNITION;

EID: 84964456696 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2015.7404839 Document Type: Conference Paper

Times cited : (7)

References (19)

1
- 84964515036
- The automatic speech recognition in reverberant environments (ASpIRE) challenge
- M Harper, "The Automatic Speech recognition In Reverberant Environments (ASpIRE) Challenge," in Automatic Speech Recognition and Understanding (ASRU), 2015, pp. 1-6
- (2015) Automatic Speech Recognition and Understanding (ASRU) , pp. 1-6
- Harper, M.¹

2
- 0036299273
- Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio
- Xuejing Sun, "Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio," in Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on. IEEE, 2002, vol. 1, pp. I-333
- (2002) Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference On. IEEE , vol.1 , pp. I-333
- Sun, X.¹

3
- 84865734075
- Joint robust voicing detection and pitch estimation based on residual harmonics
- Thomas Drugman and Abeer Alwan, "Joint robust voicing detection and pitch estimation based on residual harmonics.," in Interspeech, 2011, pp. 1973-1976
- (2011) Interspeech , pp. 1973-1976
- Drugman, T.¹ Alwan, A.²

4
- 0034857681
- Speech dereverberation via maximum-kurtosis subband adaptive filtering
- IEEE
- B.W. Gillespie, H.S. Malvar, and D.A.F. Florencio, "Speech dereverberation via maximum-kurtosis subband adaptive filtering," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2001, vol. 6, pp. 3701-3704, IEEE
- (2001) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , vol.6 , pp. 3701-3704
- Gillespie, B.W.¹ Malvar, H.S.² Florencio, D.A.F.³

5
- 84890521103
- Speaker adaptation of context dependent deep neural networks
- Hank Liao, "Speaker adaptation of context dependent deep neural networks," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7947-7951
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE , pp. 7947-7951
- Liao, H.¹

6
- 84893650076
- Semi-supervised training of deep neural networks
- Karel Vesely, Mirko Hannemann, and Lukas Burget, "Semi-supervised training of deep neural networks," in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 267-272
- (2013) Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop On. IEEE , pp. 267-272
- Vesely, K.¹ Hannemann, M.² Burget, L.³

7
- 78049409757
- Discriminative training based on an integrated view of mpe and MMI in margin and error space
- Erik McDermott, Shinji Watanabe, and Atsushi Nakamura, "Discriminative training based on an integrated view of mpe and MMI in margin and error space," in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 4894-4897
- (2010) Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference On. IEEE , pp. 4894-4897
- McDermott, E.¹ Watanabe, S.² Nakamura, A.³

8
- 84906274730
- Sequence-discriminative training of deep neural networks
- Karel Vesely, Arnab Ghoshal, Lukás Burget, and Daniel Povey, "Sequence-discriminative training of deep neural networks.," in INTERSPEECH, 2013, pp. 2345-2349
- (2013) INTERSPEECH , pp. 2345-2349
- Vesely, K.¹ Ghoshal, A.² Burget, L.³ Povey, D.⁴

9
- 84874250121
- Tomáš Mikolov, "Statistical language models based on neural networks," 2012
- (2012) Statistical Language Models Based on Neural Networks
- Mikolov, T.¹

10
- 84959118000
- The fisher corpus: A resource for the next generations of speech-to-text
- Christopher Cieri, David Miller, and Kevin Walker, "The fisher corpus a resource for the next generations of speech-to-text.," in LREC, 2004, vol. 4, pp. 69-71
- (2004) LREC , vol.4 , pp. 69-71
- Cieri, C.¹ Miller, D.² Walker, K.³

11
- 84894407243
- Vikrant Tomar, "Blind dereverberation using maximum kurtosis of the speech residual," 2010
- (2010) Blind Dereverberation Using Maximum Kurtosis of the Speech Residual
- Tomar, V.¹

12
- 33745185408
- Extended advanced front end algorithm description, Version 1.1
- Tech. Rep. ES
- A Sorin and T Ramabadran, "Extended advanced front end algorithm description, Version 1.1," ETSI STQ Aurora DSR Working Group, Tech. Rep. ES, vol. 202, pp. 212, 2003
- (2003) ETSI STQ Aurora DSR Working Group , vol.202 , pp. 212
- Sorin, A.¹ Ramabadran, T.²

13
- 0016990291
- The generalized correlation method for estimation of time delay
- Charles H Knapp and G Clifford Carter, "The generalized correlation method for estimation of time delay," Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 24, no. 4, pp. 320-327, 1976
- (1976) Acoustics, Speech and Signal Processing, IEEE Transactions on , vol.24 , Issue.4 , pp. 320-327
- Knapp, C.H.¹ Clifford Carter, G.²

14
- 4644306223
- Time delay estimation in the presence of correlated noise and reverberation
- Yong Rui and Dinei Florencio, "Time delay estimation in the presence of correlated noise and reverberation," in Acoustics, Speech, and Signal Processing, 2004. Proceedings.( ICASSP'04). IEEE International Conference on. IEEE, 2004, vol. 2, pp. ii-133
- (2004) Acoustics, Speech, and Signal Processing, 2004. Proceedings.( ICASSP'04). IEEE International Conference On. IEEE , vol.2 , pp. ii-133
- Rui, Y.¹ Florencio, D.²

15
- 51449085960
- Why does phat work well in lownoise, reverberative environments?
- Cha Zhang, Dinei Florencio, and Zhengyou Zhang, "Why does phat work well in lownoise, reverberative environments?," in Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on. IEEE, 2008, pp. 2565-2568
- (2008) Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference On. IEEE , pp. 2565-2568
- Zhang, C.¹ Florencio, D.² Zhang, Z.³

16
- 50449086237
- Acoustic beamforming for speaker diarization of meetings
- Xavier Anguera, Chuck Wooters, and Javier Hernando, "Acoustic beamforming for speaker diarization of meetings," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, no. 7, pp. 2011-2022, 2007
- (2007) Audio, Speech, and Language Processing, IEEE Transactions on , vol.15 , Issue.7 , pp. 2011-2022
- Anguera, X.¹ Wooters, C.² Hernando, J.³

17
- 84906235016
- The kaldi speech recognition toolkit
- IEEE
- Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukáš Burget, Ondej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlíček, Yanmin Qian, Petr Schwarz, Jan Silovsḱy, Georg Stemmer, and Karel Veseĺy, "The kaldi speech recognition toolkit," in IEEE workshop on automatic speech recognition and understanding (ASRU). 2011, IEEE
- (2011) IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlíček, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsḱy, J.¹¹ Stemmer, G.¹² Veseĺy, K.¹³

18
- 84964489434
- Carnegie Mellon University, "The carnegie mellon university pronouncing dictionary v07a," in [Online] http://www.speech.cs.cmu.edu/cgi-bin/cmudict, 2015
- (2015) The Carnegie Mellon University Pronouncing Dictionary v07a

19
- 84964489448
- Robust snr estimation of noisy speech based on Gaussian mixture modeling on log-power domain
- Tran Huy Dat, Kazuya Takeda, and Fumitada Itakura, "Robust snr estimation of noisy speech based on Gaussian mixture modeling on log-power domain," in ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction, 2004
- (2004) ISCA Tutorial and Research Workshop (ITRW) on Robustness Issues in Conversational Interaction
- Huy Dat, T.¹ Takeda, K.² Itakura, F.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.