메뉴 건너뛰기




Volumn , Issue , 2010, Pages 1946-1949

Recognition of spontaneous conversational speech using long Short-Term Memory phoneme predictions

Author keywords

Context modeling; Large vocabulary continuous speech recognition; Long Short Term Memory; Recurrent neural networks

Indexed keywords

BRAIN; CONTINUOUS SPEECH RECOGNITION; FORECASTING; NETWORK ARCHITECTURE; RECURRENT NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION; VOCABULARY CONTROL;

EID: 79959821052     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (19)

References (23)
  • 1
    • 0031222490 scopus 로고    scopus 로고
    • MMIE training of large vocabulary recognition systems
    • PII S0167639397000290
    • V. Valtchev, J. J. Odell, P. C. Woodland, and S. J. Young, "MMIE training of large vocabulary recognition systems," Speech Communication, vol. 22, no. 4, pp. 303-314, 1997. (Pubitemid 127433601)
    • (1997) Speech Communication , vol.22 , Issue.4 , pp. 303-314
    • Valtchev, V.1    Odell, J.J.2    Woodland, P.C.3    Young, S.J.4
  • 2
    • 0141591620 scopus 로고    scopus 로고
    • Recent improvements in the CU SONIC ASR system for noisy speech: The spine task
    • Hong Kong
    • B. Pellom and K. Hacioglu, "Recent improvements in the CU SONIC ASR system for noisy speech: the spine task," in Proc. of ICASSP, Hong Kong, 2003.
    • (2003) Proc. of ICASSP
    • Pellom, B.1    Hacioglu, K.2
  • 3
    • 48249106592 scopus 로고    scopus 로고
    • Static and dynamic modelling for the recognition of non-verbal vocalisations in conversational speech
    • Kloster Irsee, Germany
    • B. Schuller, F. Eyben, and G. Rigoll, "Static and dynamic modelling for the recognition of non-verbal vocalisations in conversational speech," in Proc. of PIT, Kloster Irsee, Germany, 2008, pp. 99-110.
    • (2008) Proc. of PIT , pp. 99-110
    • Schuller, B.1    Eyben, F.2    Rigoll, G.3
  • 4
    • 54349106040 scopus 로고    scopus 로고
    • Switching linear dynamic systems for noise robust speech recognition
    • B. Mesot and D. Barber, "Switching linear dynamic systems for noise robust speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 6, pp. 1850-1858, 2007.
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.6 , pp. 1850-1858
    • Mesot, B.1    Barber, D.2
  • 7
    • 0033709098 scopus 로고    scopus 로고
    • Tandem con-nectionist feature extraction for conventional HMM systems
    • Istanbul, Turkey
    • H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem con-nectionist feature extraction for conventional HMM systems," in Proc. of ICASSP, vol. 3, Istanbul, Turkey, 2000, pp. 1635-1638.
    • (2000) Proc. of ICASSP , vol.3 , pp. 1635-1638
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 8
    • 70450166492 scopus 로고    scopus 로고
    • Enhanced phone posteriors for improving speech recognition systems
    • H. Ketabdar and H. Bourlard, "Enhanced phone posteriors for improving speech recognition systems," in IDIAP-RR, no. 39, 2008.
    • (2008) IDIAP-RR , Issue.39
    • Ketabdar, H.1    Bourlard, H.2
  • 11
    • 33745213373 scopus 로고    scopus 로고
    • Multi-resolution RASTA filtering for TANDEM-based ASR
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • H. Hermansky and P. Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR," in Proc. of European Conf. on Speech Communication and Technology, Lisbon, Portugal, 2008, pp. 361-364. (Pubitemid 43908074)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 361-364
    • Hermansky, H.1    Fousek, P.2
  • 12
    • 56449109755 scopus 로고    scopus 로고
    • Learning long-term dependencies with recurrent neural networks
    • A. M. Schaefer, S. Udluft, and H. G. Zimmermann, "Learning long-term dependencies with recurrent neural networks," Neuro-computing, vol. 71, no. 13-15, pp. 2481-2488, 2008.
    • (2008) Neuro-computing , vol.71 , Issue.13-15 , pp. 2481-2488
    • Schaefer, A.M.1    Udluft, S.2    Zimmermann, H.G.3
  • 14
    • 0031573117 scopus 로고    scopus 로고
    • Long Short-Term Memory
    • S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997. (Pubitemid 127462305)
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 15
    • 27744588611 scopus 로고    scopus 로고
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • DOI 10.1016/j.neunet.2005.06.042, PII S0893608005001206
    • A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5-6, pp. 602-610, 2005. (Pubitemid 43186580)
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 16
    • 27744588611 scopus 로고    scopus 로고
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • DOI 10.1016/j.neunet.2005.06.042, PII S0893608005001206
    • A. Graves, S. Fernandez, and J. Schmidhuber, "Bidirectional LSTM networks for improved phoneme classification and recognition," in Proc. of ICANN, Warsaw, Poland, 2005, pp. 602-610. (Pubitemid 43186580)
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 17
    • 70349203870 scopus 로고    scopus 로고
    • Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
    • Taipei, Taiwan
    • M. Wöllmer, F. Eyben, J. Keshet, A. Graves, B. Schuller, and G. Rigoll, "Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks," in Proc. of ICASSP, Taipei, Taiwan, 2009.
    • (2009) Proc. of ICASSP
    • Wöllmer, M.1    Eyben, F.2    Keshet, J.3    Graves, A.4    Schuller, B.5    Rigoll, G.6
  • 18
    • 77949372271 scopus 로고    scopus 로고
    • A tandem BLSTM-DBN architecture for keyword spotting with enhanced context modeling
    • Vic, Spain
    • M. Wöllmer, F. Eyben, A. Graves, B. Schuller, and G. Rigoll, "A Tandem BLSTM-DBN architecture for keyword spotting with enhanced context modeling," in Proc. of NOLISP 2009, Vic, Spain, 2009.
    • (2009) Proc. of NOLISP 2009
    • Wöllmer, M.1    Eyben, F.2    Graves, A.3    Schuller, B.4    Rigoll, G.5
  • 19
    • 38149014113 scopus 로고    scopus 로고
    • An application of recurrent neural networks to discriminative keyword spotting
    • Porto, Portugal
    • S. Fernandez, A. Graves, and J. Schmidhuber, "An application of recurrent neural networks to discriminative keyword spotting," in Proc. of ICANN, Porto, Portugal, 2007, pp. 220-229.
    • (2007) Proc. of ICANN , pp. 220-229
    • Fernandez, S.1    Graves, A.2    Schmidhuber, J.3
  • 21
    • 70349199112 scopus 로고    scopus 로고
    • COSINE - A corpus of multi-party conversational speech in noisy environments
    • Taipei, Taiwan
    • A. Stupakov, E. Hanusa, J. Bilmes, and D. Fox, "COSINE - a corpus of multi-party conversational speech in noisy environments," in Proc. of ICASSP, Taipei, Taiwan, 2009.
    • (2009) Proc. of ICASSP
    • Stupakov, A.1    Hanusa, E.2    Bilmes, J.3    Fox, D.4
  • 22
    • 0031268931 scopus 로고    scopus 로고
    • Bidirectional recurrent neural networks
    • PII S1053587X97080550
    • M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," IEEE Transactions on Signal Processing, vol. 45, pp. 2673-2681, November 1997. (Pubitemid 127766336)
    • (1997) IEEE Transactions on Signal Processing , vol.45 , Issue.11 , pp. 2673-2681
    • Schuster, M.1    Paliwal, K.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.