SCOPUS 정보 검색 플랫폼

2012 IEEE Workshop on Spoken Language Technology, SLT 2012 - Proceedings

Volumn , Issue , 2012, Pages 107-112

Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling

(3) Triefenbach, Fabian a Demuynck, Kris a Martens, Jean Pierre a

a GHENT UNIVERSITY (Belgium)

Author keywords

continuous speech recognition; reservoir computing; tandem acoustic modeling

Indexed keywords

ACOUSTIC MODELING; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; PHONEME RECOGNITION; RECOGNITION ACCURACY; RESERVOIR COMPUTING; WORD ERROR RATE REDUCTIONS;

RECURRENT NEURAL NETWORKS; VOCABULARY CONTROL;

CONTINUOUS SPEECH RECOGNITION;

EID: 84874271357 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SLT.2012.6424206 Document Type: Conference Paper

Times cited : (3)

References (19)

1
- 0003573244
- Kluwer Academic Publishers, Norwell, MA, USA
- Herve A. Bourlard and Nelson Morgan, Connectionist Speech Recognition: A Hybrid Approach, Kluwer Academic Publishers, Norwell, MA, USA, 1993.
- (1993) Connectionist Speech Recognition: A Hybrid Approach
- Bourlard, H.A.¹ Morgan, N.²

2
- 0033709098
- Tandem connectionist feature extraction for conventional hmm systems
- H. Hermansky, D.P.W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional hmm systems," in Proc. of ICASSP, 2000, pp. 1635-1638.
- (2000) Proc. of ICASSP , pp. 1635-1638
- Hermansky, H.¹ Ellis, D.P.W.² Sharma, S.³

3
- 84865704330
- A bottom-up stepwise knowledgeintegration approach to large vocabulary continuous speech recognition using weighted finite state machines
- Sabato Marco Siniscalchi, Torbjørn Svendsen, and Chin-Hui Lee, "A bottom-up stepwise knowledgeintegration approach to large vocabulary continuous speech recognition using weighted finite state machines," in Proc. of INTERSPEECH, 2011, pp. 901-904.
- (2011) Proc. of INTERSPEECH , pp. 901-904
- Siniscalchi, S.M.¹ Svendsen, T.² Lee, C.³

4
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- Frank Seide, Gang Li, and Dong Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. of INTERSPEECH, 2011, pp. 437-440.
- (2011) Proc. of INTERSPEECH , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

5
- 84055222005
- Contextdependent pre-trained deep neural networks for largevocabulary speech recognition
- G.E. Dahl, Dong Yu, Li Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for largevocabulary speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

6
- 84865768819
- Deep convex net: A scalable architecture for speech pattern classification
- Li Deng and Dong Yu, "Deep convex net: A scalable architecture for speech pattern classification," in Proc. of INTERSPEECH, 2011, pp. 2285-2288.
- (2011) Proc. of INTERSPEECH , pp. 2285-2288
- Deng, L.¹ Yu, D.²

7
- 84867614591
- Scalable stacking and learning for building deep architectures
- Li Deng, Dong Yu, and John Platt, "Scalable stacking and learning for building deep architectures," in Proc. of ICASSP, 2012, pp. 2133-2136.
- (2012) Proc. of ICASSP , pp. 2133-2136
- Deng, L.¹ Yu, D.² Platt, J.³

8
- 84867606917
- A deep architecture with bilinear modeling of hidden representations: Applications to phonetic recognition
- Brian Hutchinson, Li Deng, and Dong Yu, "A deep architecture with bilinear modeling of hidden representations: applications to phonetic recognition," in Proc. of ICASSP, 2012, pp. 4805-4808.
- (2012) Proc. of ICASSP , pp. 4805-4808
- Hutchinson, B.¹ Deng, L.² Yu, D.³

9
- 33749833931
- Tech. Rep., German National Research Center for Information Technology
- Herbert Jaeger, "Tutorial on training recurrent neural networks, covering BPTT, RTRL, EKF and the echo state network approach," Tech. Rep., German National Research Center for Information Technology, 2002.
- (2002) Tutorial on Training Recurrent Neural Networks, Covering BPTT, RTRL, EKF and the Echo State Network Approach
- Jaeger, H.¹

10
- 85161977591
- Phoneme recognition with large hierarchical reservoirs
- Fabian Triefenbach, Azarakhsh Jalalvand, Benjamin Schrauwen, and Jean-Pierre Martens, "Phoneme recognition with large hierarchical reservoirs," in Proc. Advances in Neural Information Processing Systems (NIPS), 2010, pp. 2307-2315.
- (2010) Proc. Advances in Neural Information Processing Systems (NIPS) , pp. 2307-2315
- Triefenbach, F.¹ Jalalvand, A.² Schrauwen, B.³ Martens, J.⁴

11
- 0025503558
- Backpropagation through time: What it does and how to do it
- oct
- P.J. Werbos, "Backpropagation through time: what it does and how to do it," Proceedings of the IEEE, vol. 78, no. 10, pp. 1550-1560, oct 1990.
- (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
- Werbos, P.J.¹

12
- 84857152342
- Can non-linear readout nodes enhance the performance of reservoir-based speech recognizers?
- Fabian Triefenbach and Jean-Pierre Martens, "Can non-linear readout nodes enhance the performance of reservoir-based speech recognizers?," in Proc. of IEEE Conference on Informatics & Computational Intelligence, 2011, pp. 262-267.
- (2011) Proc. of IEEE Conference on Informatics & Computational Intelligence , pp. 262-267
- Triefenbach, F.¹ Martens, J.²

13
- 84878591993
- Continuous digit recognition in noise: Reservoirs can do an excellent job
- Azarakhsh Jalalvand, Fabian Triefenbach, and Jean-Pierre Martens, "Continuous digit recognition in noise: Reservoirs can do an excellent job," in Proc. of INTERSPEECH, 2012.
- (2012) Proc. of INTERSPEECH
- Jalalvand, A.¹ Triefenbach, F.² Martens, J.³

14
- 85009097225
- On using mlp features in lvcsr
- Qifeng Zhu, Barry Chen, Nelson Morgan, and Andreas Stolcke, "On using mlp features in lvcsr," in Proc. of INTERSPEECH, 2004, pp. 921-924.
- (2004) Proc. of INTERSPEECH , pp. 921-924
- Zhu, Q.¹ Chen, B.² Morgan, N.³ Stolcke, A.⁴

15
- 84871612455
- Optimal feature sub-space selection based on discriminant analysis
- Kris Demuynck, Jacques Duchateau, and Dirk Van Compernolle, "Optimal feature sub-space selection based on discriminant analysis," in Proc. of EUROSPEECH, 1999, pp. 1311-1314.
- (1999) Proc. of EUROSPEECH , pp. 1311-1314
- Demuynck, K.¹ Duchateau, J.² Van Compernolle, D.³

16
- 84865734256
- Analysis and comparison of recent mlp features for lvcsr systems
- Fabio Valente, Mathew Magimai-Doss, and Wen Wang, "Analysis and comparison of recent mlp features for lvcsr systems," in Proc. of INTERSPEECH, 2011, pp. 1245-1248.
- (2011) Proc. of INTERSPEECH , pp. 1245-1248
- Valente, F.¹ Magimai-Doss, M.² Wang, W.³

17
- 0003548585
- Tech. Rep., NIST
- J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett, and N. Dahlgren, "The DARPA TIMIT acousticphonetic continuous speech corpus cd-rom," Tech. Rep., NIST, 1993.
- (1993) The DARPA TIMIT Acousticphonetic Continuous Speech Corpus Cd-rom
- Garofolo, J.¹ Lamel, L.² Fisher, W.³ Fiscus, J.⁴ Pallett, D.⁵ Dahlgren, N.⁶

18
- 0012330750
- The design for the wall street journal-based csr corpus
- Douglas B. Paul and Janet M. Baker, "The design for the wall street journal-based csr corpus," in Proc. of the workshop on Speech and Natural Language, 1992, pp. 357-362.
- (1992) Proc. of the Workshop on Speech and Natural Language , pp. 357-362
- Paul, D.B.¹ Baker, J.M.²

19
- 33646759445
- Pronunciation variation modeling for ASR: Large improvements are possible but small ones are likely
- Cheng Yang, Jean-Pierre Martens, Pol Ghesquiere, and Dirk Van Compernolle, "Pronunciation Variation Modeling for ASR: Large Improvements are possible but small ones are likely," in Proc. of ITRWon PMLA, 2002, pp. 123-128.
- (2002) Proc. of ITRWon PMLA , pp. 123-128
- Yang, C.¹ Martens, J.² Ghesquiere, P.³ Van Compernolle, D.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.