SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 22, Issue 6, 2014, Pages 1037-1046

Memory-enhanced neural networks and NMF for robust ASR

(6) Geiger, Jürgen T. (a); Weninger, Felix (a); Gemmeke, Jort F. (d); Wöllmer, Martin (c); Schuller, Björn (a,b); Rigoll, Gerhard (a)

a TECHNICAL UNIVERSITY OF MUNICH (Germany)

b IMPERIAL COLLEGE LONDON (United Kingdom)

c BMW GROUP (Germany)

d UNIVERSITY OF LEUVEN (Belgium)

Author keywords

Long short term memory; Multi stream recognition; Noise robust speech recognition; Non negative matrix factorization

Indexed keywords

ACOUSTIC NOISE; ARTS COMPUTING; BRAIN; FACTORIZATION; MATRIX ALGEBRA; RECURRENT NEURAL NETWORKS; REVERBERATION; SPEECH; SPEECH ENHANCEMENT;

DISCRIMINATIVE TRAINING; DISTANT SPEECH RECOGNITION; LONG SHORT-TERM MEMORY; MULTI-STREAM; NOISE ROBUST SPEECH RECOGNITION; NONNEGATIVE MATRIX FACTORIZATION; ROBUSTNESS AGAINST NOISE; SPEECH ENHANCEMENT METHODS;

SPEECH RECOGNITION;

EID: 84910095643 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2014.2318514 Document Type: Article

Times cited : (30)

References (44)

1
- 84891583985
- New York, NY, USA: Wiley
- T. Virtanen, R. Singh, and B. Raj, Techniques for noise robustness in automatic speech recognition. New York, NY, USA: Wiley, 2012.
- (2012) Techniques for Noise Robustness in Automatic Speech Recognition
- Virtanen, T.¹ Singh, R.² Raj, B.³

2
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- Jul.
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio Process., vol. 9, no. 5, pp. 504-512, Jul. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.5 , pp. 504-512
- Martin, R.¹

3
- 51449100115
- Efficient model-based speech separation and denoising using non-negative subspace analysis
- S. J. Rennie, J. R. Hershey, and P. A. Olsen, "Efficient model-based speech separation and denoising using non-negative subspace analysis," in Proc. ICASSP, Las Vegas, NV, USA, 2008, pp. 1833-1836.
- Proc. ICASSP, Las Vegas, NV, USA, 2008 , pp. 1833-1836
- Rennie, S.J.¹ Hershey, J.R.² Olsen, P.A.³

4
- 38049021850
- Convolutive speech bases and their application to supervised speech separation
- Jan.
- P. Smaragdis, "Convolutive speech bases and their application to supervised speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 1-14, Jan. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 1-14
- Smaragdis, P.¹

5
- 85016663198
- RASTA-PLP speech analysis technique
- H. Hermansky, N. Morgan, A. Bayya, and P. Kohn, "RASTA-PLP speech analysis technique," in Proc. ICASSP, San Francisco, CA, USA, 1992, vol. 1, pp. 121-124.
- Proc. ICASSP, San Francisco, CA, USA, 1992 , vol.1 , pp. 121-124
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

6
- 77955673019
- Model-based feature enhancement for reverberant speech recognition
- Sep.
- A. Krueger and R. Haeb-Umbach, "Model-based feature enhancement for reverberant speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1692-1707, Sep. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.7 , pp. 1692-1707
- Krueger, A.¹ Haeb-Umbach, R.²

7
- 85017287487
- Linear discriminant analysis for improved large vocabulary continuous speech recognition
- R. Haeb-Umbach and H. Ney, "Linear discriminant analysis for improved large vocabulary continuous speech recognition," in Proc. ICASSP, San Francisco, CA, USA, 1992, pp. 13-16.
- Proc. ICASSP, San Francisco, CA, USA, 1992 , pp. 13-16
- Haeb-Umbach, R.¹ Ney, H.²

8
- 51449120120
- BoostedMMI for model and feature-space discriminative training
- D. Povey, D. Kanevsky, B. Kingsbury, B. Ramabhadran, G. Saon, and K. Visweswariah, "BoostedMMI for model and feature-space discriminative training," in Proc. ICASSP, Las Vegas, NV, USA, 2008, pp. 4057-4060.
- Proc. ICASSP, Las Vegas, NV, USA, 2008 , pp. 4057-4060
- Povey, D.¹ Kanevsky, D.² Kingsbury, B.³ Ramabhadran, B.⁴ Saon, G.⁵ Visweswariah, K.⁶

9
- 0032048385
- Speech recognition in noisy environments using first-order vector Taylor series
- D. Y. Kim, C. Kwan Un, and N. S. Kim, "Speech recognition in noisy environments using first-order vector Taylor series," Speech Commun., vol. 24, no. 1, pp. 39-49, 1998.
- (1998) Speech Commun. , vol.24 , Issue.1 , pp. 39-49
- Kim, D.Y.¹ Kwan Un, C.² Kim, N.S.³

10
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- Nov.
- G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.E.⁴ Mohamed, A.-R.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.N.¹⁰ Kingsbury, B.¹¹

11
- 84890492030
- An investigation of deep neural networks for noise robust speech recognition
- M. Seltzer, D. Yu, and Y. Wang, "An investigation of deep neural networks for noise robust speech recognition," in Proc. ICASSP, Vancouver, BC, Canada, 2013, pp. 7398-7402.
- Proc. ICASSP, Vancouver, BC, Canada, 2013 , pp. 7398-7402
- Seltzer, M.¹ Yu, D.² Wang, Y.³

12
- 84867626068
- Revisiting recurrent neural networks for robust ASR
- O. Vinyals, S. V. Ravuri, and D. Povey, "Revisiting recurrent neural networks for robust ASR," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4085-4088.
- Proc. ICASSP, Kyoto, Japan, 2012 , pp. 4085-4088
- Vinyals, O.¹ Ravuri, S.V.² Povey, D.³

13
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A.-R. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks," in Proc. ICASSP, 2013, pp. 6645-6649.
- Proc. ICASSP, 2013 , pp. 6645-6649
- Graves, A.¹ Mohamed, A.-R.² Hinton, G.³

14
- 0141741840
- Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
- S. C. Kremer and J. F. Kolen, Eds. Piscataway, NJ, USA: IEEE Press
- S. Hochreiter, Y. Bengio, P. Frasconi, and J. Schmidhuber, "Gradient flow in recurrent nets: The difficulty of learning long-term dependencies," in Field Guide to Dynamical Recurrent Networks, S. C. Kremer and J. F. Kolen, Eds. Piscataway, NJ, USA: IEEE Press, 2001.
- (2001) Field Guide to Dynamical Recurrent Networks
- Hochreiter, S.¹ Bengio, Y.² Frasconi, P.³ Schmidhuber, J.⁴

15
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Comput., vol. 9, no. 8, pp. 1735-1780, 1997.
- (1997) Neural Comput. , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

16
- 70450180507
- Robust in-car spelling recognition-a tandem BLSTM-HMM approach
- M. Wöllmer, F. Eyben, B. Schuller, Y. Sun, T. Moosmayr, and N. Nguyen-Thien, "Robust in-car spelling recognition-a tandem BLSTM-HMM approach," in Proc. Interspeech, Brighton, U.K., 2009, pp. 2507-2510.
- Proc. Interspeech, Brighton, U.K., 2009 , pp. 2507-2510
- Wöllmer, M.¹ Eyben, F.² Schuller, B.³ Sun, Y.⁴ Moosmayr, T.⁵ Nguyen-Thien, N.⁶

17
- 80051637579
- A multi-stream ASR framework for BLSTM modeling of conversational speech
- M. Wöllmer, F. Eyben, B. Schuller, and G. Rigoll, "A multi-stream ASR framework for BLSTM modeling of conversational speech," in Proc. ICASSP, Prague, Czech Republic, 2011, pp. 4860-4863.
- (2011) Proc. ICASSP, Prague, Czech Republic , pp. 4860-4863
- Wöllmer, M.¹ Eyben, F.² Schuller, B.³ Rigoll, G.⁴

18
- 85032752364
- Graphical model architectures for speech recognition
- Sep.
- J. A. Bilmes and C. Bartels, "Graphical model architectures for speech recognition," IEEE Signal Process. Mag., vol. 22, no. 5, pp. 89-100, Sep. 2005.
- (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 89-100
- Bilmes, J.A.¹ Bartels, C.²

19
- 9644308136
- Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR
- A. Hagen and A. Morris, "Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR," Comput. Speech Lang., vol. 19, no. 1, pp. 3-30, 2005.
- (2005) Comput. Speech Lang. , vol.19 , Issue.1 , pp. 3-30
- Hagen, A.¹ Morris, A.²

20
- 79959825120
- Using a DBN to integrate Sparse Classification and GMM-based ASR
- Y. Sun, J. F. Gemmeke, B. Cranen, L. ten Bosch, and L. Boves, "Using a DBN to integrate Sparse Classification and GMM-based ASR," in Proc. Interspeech, Makuhari, Japan, 2010, pp. 2098-2101.
- Proc. Interspeech, Makuhari, Japan, 2010 , pp. 2098-2101
- Sun, Y.¹ Gemmeke, J.F.² Cranen, B.³ ten Bosch, L.⁴ Boves, L.⁵

21
- 84878543263
- The PASCAL CHiME speech separation and recognition challenge
- J. P. Barker, E. Vincent, N. Ma, H. Christensen, and P. D. Green, "The PASCAL CHiME speech separation and recognition challenge," Comput. Speech Lang., vol. 27, no. 3, pp. 621-633, 2013.
- (2013) Comput. Speech Lang. , vol.27 , Issue.3 , pp. 621-633
- Barker, J.P.¹ Vincent, E.² Ma, N.³ Christensen, H.⁴ Green, P.D.⁵

22
- 84890541701
- The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines
- E. Vincent, J. Barker, S. Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The second 'CHiME' speech separation and recognition challenge: Datasets, tasks and baselines," in Proc. ICASSP, Vancouver, BC, Canada, 2013, pp. 126-130.
- Proc. ICASSP, Vancouver, BC, Canada, 2013 , pp. 126-130
- Vincent, E.¹ Barker, J.² Watanabe, S.³ Le Roux, J.⁴ Nesta, F.⁵ Matassoni, M.⁶

23
- 84883396653
- Noise Robust ASR in Reverberated Multisource Environments Applying Convolutive NMF and Long Short-Term Memory
- M. Wöllmer, F. Weninger, J. Geiger, B. Schuller, and G. Rigoll, "Noise Robust ASR in Reverberated Multisource Environments Applying Convolutive NMF and Long Short-Term Memory," Comput. Speech Lang., Special Issue Speech Separat. Recogn. Multisource Environ., vol. 27, pp. 780-797, 2013.
- (2013) Comput. Speech Lang., Special Issue Speech Separat. Recogn. Multisource Environ. , vol.27 , pp. 780-797
- Wöllmer, M.¹ Weninger, F.² Geiger, J.³ Schuller, B.⁴ Rigoll, G.⁵

24
- 84893675434
- The TUM+TUT+KUL Approach to the 2nd CHiME Challenge: Multi-Stream ASR Exploiting BLSTM Networks and Sparse NMF
- J. T. Geiger, F. Weninger, A. Hurmalainen, J. F. Gemmeke, M. Wöllmer, B. Schuller, G. Rigoll, and T. Virtanen, "The TUM+TUT+KUL Approach to the 2nd CHiME Challenge: Multi-Stream ASR Exploiting BLSTM Networks and Sparse NMF," in Proc. CHiME Workshop, Vancouver, BC, Canada, 2013, pp. 25-30.
- Proc. CHiME Workshop, Vancouver, BC, Canada, 2013 , pp. 25-30
- Geiger, J.T.¹ Weninger, F.² Hurmalainen, A.³ Gemmeke, J.F.⁴ Wöllmer, M.⁵ Schuller, B.⁶ Rigoll, G.⁷ Virtanen, T.⁸

25
- 79960657803
- Exemplar-based sparse representations for noise robust automatic speech recognition
- Sep.
- J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, Sep. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.7 , pp. 2067-2080
- Gemmeke, J.¹ Virtanen, T.² Hurmalainen, A.³

26
- 84906222220
- Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling?
- M. Delcroix, Y. Kubo, T. Nakatani, and A. Nakamura, "Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling?," in Proc. Interspeech, Lyon, France, 2013, pp. 2992-2996.
- Proc. Interspeech, Lyon, France, 2013 , pp. 2992-2996
- Delcroix, M.¹ Kubo, Y.² Nakatani, T.³ Nakamura, A.⁴

27
- 84890503970
- Effectiveness of discriminative training and feature transformation for reverberated and noisy speech
- Y. Tachioka, S. Watanabe, and J. R. Hershey, "Effectiveness of discriminative training and feature transformation for reverberated and noisy speech," in Proc. ICASSP, Vancouver, BC, Canada, 2013, pp. 6935-6939.
- Proc. ICASSP, Vancouver, BC, Canada, 2013 , pp. 6935-6939
- Tachioka, Y.¹ Watanabe, S.² Hershey, J.R.³

28
- 84911377545
- The Kaldi speech recognition toolkit
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlícek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi speech recognition toolkit," in Proc. ASRU, Honolulu, HI, USA, 2011.
- Proc. ASRU, Honolulu, HI, USA, 2011
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlícek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsky, J.¹¹ Stemmer, G.¹² Vesely, K.¹³

29
- 0033677121
- Maximum likelihood discriminant feature spaces
- G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. ICASSP, Istanbul, Turkey, 2000, pp. 1129-1132.
- Proc. ICASSP, Istanbul, Turkey, 2000 , pp. 1129-1132
- Saon, G.¹ Padmanabhan, M.² Gopinath, R.³ Chen, S.⁴

30
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.¹

31
- 84865766789
- Uncertainty measures for improving exemplar-based source separation
- H. Kallasjoki, U. Remes, J. F. Gemmeke, T. Virtanen, and K. J. Palomäki, "Uncertainty measures for improving exemplar-based source separation," in Proc. INTERSPEECH, Florence, Italy, 2011, pp. 469-472.
- Proc. INTERSPEECH, Florence, Italy, 2011 , pp. 469-472
- Kallasjoki, H.¹ Remes, U.² Gemmeke, J.F.³ Virtanen, T.⁴ Palomäki, K.J.⁵

32
- 85032752215
- Exemplar-based processing for speech recognition: An overview
- Nov.
- T. Sainath, B. Ramabhadran, D. Nahamoo, D. Kanevsky, D. Van Compernolle, K. Demuynck, J. Gemmeke, J. Bellegarda, and S. Sundaram, "Exemplar-based processing for speech recognition: An overview," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 98-113, Nov. 2012.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 98-113
- Sainath, T.¹ Ramabhadran, B.² Nahamoo, D.³ Kanevsky, D.⁴ Van Compernolle, D.⁵ Demuynck, K.⁶ Gemmeke, J.⁷ Bellegarda, J.⁸ Sundaram, S.⁹

33
- 84893652593
- Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech
- A. Hurmalainen, J. F. Gemmeke, and T. Virtanen, "Compact long context spectral factorisation models for noise robust recognition of medium vocabulary speech," in Proc. CHiME Workshop, Vancouver, BC, Canada, 2013, pp. 13-18.
- Proc. CHiME Workshop, Vancouver, BC, Canada, 2013 , pp. 13-18
- Hurmalainen, A.¹ Gemmeke, J.F.² Virtanen, T.³

34
- 84878390904
- Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise
- F. Weninger, M. Wöllmer, and B. Schuller, "Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise," in Proc. Interspeech, Portland, OR, USA, 2012, pp. 302-305.
- Proc. Interspeech, Portland, OR, USA, 2012 , pp. 302-305
- Weninger, F.¹ Wöllmer, M.² Schuller, B.³

35
- 0031268931
- Bidirectional recurrent neural networks
- Nov.
- M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," IEEE Trans. Signal Process., vol. 45, no. 11, pp. 2673-2681, Nov. 1997.
- (1997) IEEE Trans. Signal Process. , vol.45 , Issue.11 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

36
- 27744588611
- Framewise phoneme classification with bidirectional LSTM and other neural network architectures
- A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Netw., vol. 18, no. 5-6, pp. 602-610, 2005.
- (2005) Neural Netw. , vol.18 , Issue.5-6 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

37
- 70349284484
- Ph.D. dissertation, Technische Univ. München, Munich, Germany
- A. Graves, "Supervised sequence labelling with recurrent neural networks," Ph.D. dissertation, Technische Univ. München, Munich, Germany, 2008.
- (2008) Supervised Sequence Labelling with Recurrent Neural Networks
- Graves, A.¹

38
- 84055211743
- Acoustic modeling using deep belief networks
- Jan.
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 14-22, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

39
- 84893671946
- Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark
- Y. Tachioka, S. Watanabe, J. Le Roux, and J. R. Hershey, "Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark," in Proc. CHiME Workshop, Vancouver, BC, Canada, 2013, pp. 19-24.
- Proc. CHiME Workshop, Vancouver, BC, Canada, 2013 , pp. 19-24
- Tachioka, Y.¹ Watanabe, S.² Le Roux, J.³ Hershey, J.R.⁴

40
- 84867614588
- Analyzing the memory of BLSTM neural networks for enhanced emotion classification in dyadic spoken interactions
- M. Wöllmer, A. Metallinou, N. Katsamanis, B. Schuller, and S. Narayanan, "Analyzing the memory of BLSTM neural networks for enhanced emotion classification in dyadic spoken interactions," in Proc. ICASSP, Kyoto, Japan, 2012, pp. 4157-4160.
- Proc. ICASSP, Kyoto, Japan, 2012 , pp. 4157-4160
- Wöllmer, M.¹ Metallinou, A.² Katsamanis, N.³ Schuller, B.⁴ Narayanan, S.⁵

41
- 84863740422
- Toward a practical implementation of exemplar-based noise robust ASR
- J. F. Gemmeke, A. Hurmalainen, T. Virtanen, and Y. Sun, "Toward a practical implementation of exemplar-based noise robust ASR," in Proc. EUSIPCO, Barcelona, Spain, 2011, pp. 1490-1494.
- (2011) Proc. EUSIPCO, Barcelona, Spain , pp. 1490-1494
- Gemmeke, J.F.¹ Hurmalainen, A.² Virtanen, T.³ Sun, Y.⁴

42
- 84886818613
- Active-set Newton algorithm for overcomplete non-negative representations of audio
- Nov.
- T. Virtanen, J. Gemmeke, and B. Raj, "Active-set Newton algorithm for overcomplete non-negative representations of audio," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 11, pp. 2277-2289, Nov. 2013.
- (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.11 , pp. 2277-2289
- Virtanen, T.¹ Gemmeke, J.² Raj, B.³

43
- 84893685019
- A flexible spatial blind source extraction framework for robust speech recognition in noisy environments
- F. Nesta, M. Matassoni, and R. F. Astudillo, "A flexible spatial blind source extraction framework for robust speech recognition in noisy environments," in Proc. CHiME Workshop, Vancouver, BC, Canada, 2013, pp. 33-38.
- Proc. CHiME Workshop, Vancouver, BC, Canada, 2013 , pp. 33-38
- Nesta, F.¹ Matassoni, M.² Astudillo, R.F.³

44
- 84905240834
- Recurrent deep neural networks for robust speech recognition
- to be published
- C. Weng, D. Yu, S. Watanabe, and B.-H. Juang, "Recurrent deep neural networks for robust speech recognition," in Proc. ICASSP, Florence, Italy, 2014, to be published.
- Proc. ICASSP, Florence, Italy, 2014
- Weng, C.¹ Yu, D.² Watanabe, S.³ Juang, B.-H.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.