Volume , Issue , 2011, Pages 4860-4863

A multi-stream ASR framework for BLSTM modeling of conversational speech

Author keywords

Context Modeling; Conversational Speech Recognition; Long Short Term Memory; Recurrent Neural Networks

Indexed keywords

CONTEXT MODELING; CONVERSATIONAL SPEECH RECOGNITION; DATA STREAM; MULTI-STREAM; SHORT TERM MEMORY; TANDEM SYSTEM; TRIPHONES

EID: 80051637579     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2011.5947444     Document Type: Conference Paper
Times cited: 36

References (17)
  • 1
    • 78651563436
    • Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework
    • M. Wöllmer, F. Eyben, A. Graves, B. Schuller, and G. Rigoll, "Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework," Cognitive Computation, vol. 2, no. 3, pp. 180-190, 2010.
    • (2010) Cognitive Computation , vol.2 , Issue.3 , pp. 180-190
    • Wöllmer, M.1    Eyben, F.2    Graves, A.3    Schuller, B.4    Rigoll, G.5
  • 2
    • 54349106040
    • Switching linear dynamic systems for noise robust speech recognition
    • B. Mesot and D. Barber, "Switching linear dynamic systems for noise robust speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 6, pp. 1850-1858, 2007.
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.6 , pp. 1850-1858
    • Mesot, B.1    Barber, D.2
  • 4
    • 34547522358
    • An acoustic model based on Kullback-Leibler divergence for posterior features
    • G. Aradilla, J. Vepa, and H. Bourlard, "An acoustic model based on Kullback-Leibler divergence for posterior features," in Proc. of ICASSP, Honolulu, HI, 2007, pp. 657-660.
    • Proc. of ICASSP, Honolulu, HI, 2007 , pp. 657-660
    • Aradilla, G.1    Vepa, J.2    Bourlard, H.3
  • 6
    • 78049359820
    • Spoken term detection with connectionist temporal classification - A novel hybrid CTC-DBN approach
    • M. Wöllmer, F. Eyben, B. Schuller, and G. Rigoll, "Spoken term detection with connectionist temporal classification - a novel hybrid CTC-DBN approach," in Proc. of ICASSP, Dallas, Texas, 2010, pp. 5274-5277.
    • Proc. of ICASSP, Dallas, Texas, 2010 , pp. 5274-5277
    • Wöllmer, M.1    Eyben, F.2    Schuller, B.3    Rigoll, G.4
  • 9
    • 0031573117
    • Long short-term memory
    • S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 10
    • 27744588611
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Networks, vol. 18, no. 5-6, pp. 602-610, 2005.
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 14
  • 17
    • 77956721304
    • Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening
    • M. Wöllmer, B. Schuller, F. Eyben, and G. Rigoll, "Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening," IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 867-881, 2010.
    • (2010) IEEE Journal of Selected Topics in Signal Processing , vol.4 , Issue.5 , pp. 867-881
    • Wöllmer, M.1    Schuller, B.2    Eyben, F.3    Rigoll, G.4


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.